

When are household load profiles similar? Comparing distance measures for Smart Meter Data Analytics
Marcus Voss
This event took place on 22nd July 2019 at 11:30am (10:30 GMT)
Knowledge Media Institute, Berrill Building, The Open University, Milton Keynes, United Kingdom, MK7 6AA
The smart meter rollout in many countries leads to energy providers, energy valueadded service providers, and grid operators having access to increasing amounts of highresolution load profiles (e.g. 15minute resolution) compared to only having one measurement per year as before. These load profiles will be analyzed within diverse data mining tasks such as classification, clustering, and forecasting. However, such lowly aggregated highresolution load profiles are generally quite intermittent and have less structure to be exploited by standard data mining algorithms. If for instance pointwise distances, such as the Euclidean distance, are used to compare household load profiles, they may inflict a doublepenalty if a spike has about the correct height, but is shifted slightly in time. To compare household load profiles, a local permutation invariant (LPI) distance measure was introduced as the adjusted pnorm error to assess household shortterm load forecasts and forecasting models minimizing it have since been introduced. This talk will first introduce the characteristics of load profiles at low aggregation levels and introduce the LPI distance as well as the related Dynamic Time Warping (DTW) distance popular in the time series literature. It will discuss the problem of finding a sample mean under the DTW and LPI distances, and introduce approximate optimization methods based on subgradient descent. It will then show how the choice of the distance measure (and its sample mean) affect the results within typical data analytics use cases, namely shortterm load forecasting, load profile clustering, and classification. A novel distance measure combining properties of the LPI and DTW, the local nearest neighbor alignment (LNNA) distance is introduced and discussed. 
The webcast was open to 300 users
