Smart Meter Data Analytics: Practical Use-Cases and Best Practices of Machine Learning Applications for Energy Data in the Residential Sector
- Tobias Brudermüller (Brudermueller), Bits to Energy Lab, ETH Zurich, tbrudermuell@ethz.ch
- Markus Kreft, Bits to Energy Lab, ETH Zurich, mkreft@ethz.ch
- Prof. Dr. Elgar Fleisch, Bits to Energy Lab, ETH Zurich & University of St. Gallen
- Prof. Dr. Thorsten Staake, Bits to Energy Lab, ETH Zurich & University of Bamberg
- Practical tutorial for the 11th International Conference on Learning Representations (ICLR 2023)
- Part of the ICLR 2023 Workshop: Tackling Climate Change with Machine Learning
- Hosted by Climate Change AI
- Execute the tutorial via Google Colab by using this link!
In this tutorial, we provide a practical guide to current trends in smart meter data analytics. In particular, we focus on feature engineering and machine learning scenarios for energy data. During this tutorial:
- You will gain insights into current trends and use cases in the energy space.
- You will learn best practices for energy data visualization.
- You will apply both machine learning approaches and data mining techniques to real-world energy data.
- You will get a sense of typical and atypical energy use in a residential building.
In addition, some concepts offer the possibility of being applied to other time series data outside of the energy domain.
Note: We would also like to point out that some utilities allow their customers access to their own smart meter data either through customer portals or upon request. If you have an active smart meter at home, you can use this guide as a starting point for analyzing your own energy usage.
If you want to use this tutorial, please cite us in you work:
@inproceedings{brudermueller2023smart,
title={Smart Meter Data Analytics: Practical Use-Cases and Best Practices of Machine Learning Applications for Energy Data in the Residential Sector},
author={Brudermueller, Tobias and Kreft, Markus},
booktitle={ICLR 2023 Workshop on Tackling Climate Change with Machine Learning},
url={https://www.climatechange.ai/papers/iclr2023/3},
year={2023}
}
To cope with climate change, the energy system is undergoing a massive transformation. Due to the electrification of all sectors, the power grid is facing high additional demand. As a result, the digitization of the grid is increasingly gaining attention. The smart grid relies heavily on the increasing deployment of smart meters around the world. The corresponding smart meter data is typically a time series of power or energy measurements with a resolution between 1 s and 60 min. This data provides valuable insights and opportunities for monitoring and controlling activities on the power grid.
In this tutorial, we therefore provide an overview of best practices for analyzing smart meter data. We focus on machine learning applications and low resolution (15 min) energy data in a residential setting. We use only real-world data and cover use cases that are highly relevant to practical applications. Although this tutorial is specifically tailored for an energy audience, we believe that anyone in the data analytics and machine learning field can benefit from it, as many techniques are applicable to all time series data. Through our tutorial, we hope to foster new ideas, contribute to interdisciplinary exchange between different research fields, and educate people about energy use.
In the best case, you have already worked with Python and have some basic knowledge of machine learning. Nevertheless, this tutorial can be completed and understood by anyone with basic programming skills.
- If you are a student, this tutorial will provide you with a practical introduction to applying machine learning to time series data.
- If you are a researcher in machine learning, you can use this tutorial to find ideas for your research and potential data sets for benchmarking.
- If you work for a utility, this tutorial will provide you with an overview of practical use cases and the current state of the art in smart meter data analytics.
- If you are an individual interested in energy-related topics, you can learn more about smart grid applications and energy consumption in this tutorial.
1. Introduction to smart meter data
- What is smart meter data?
- Relevance for tackling climate change
- Chances and limitations
- Power vs. energy measurements
- Data resolution
- Data availability
- Contextual information and domain knowledge
- Making use of additional data sources
2. Preparations for this tutorial
- Importing packages
- Importing the data
3. Best practices for visualizing smart meter data
- Time-series visualizations of energy data
- Visualizing distributions of energy consumption
- Multi-dimensional visualizations
- Annotating visualizations with additional context information
- Displaying aggregated demand
4. Pre-processing smart meter data
- Combining smart meter data with temperature data
- Add additional information for filtering timestamps
- Normalization methods
- Interpolation methods and downsampling
- Outlier detection with Hampel filter
- Simple baseload estimation
- Enhancing small activities
- Detecting switching activities
- Sliding window approaches
- Feature extraction
- Detecting peaks in distribution
- Frequency-based methods for low-resolution data
5. Non-Intrusive Load Monitoring (NILM) / Load Disaggregation
- Using classification algorithms to detect appliance installations
- Applying simple deep learning models for NILM
- Applying Hidden Markov Models for NILM
- Correctly evaluating NILM approaches
- Rule-based heuristics for pattern isolations
6. Flexibility Estimation
- Estimating load shifting potential of disaggregated appliances
7. Anomaly Detection
- Introduction to online vs. offline change point detection
- Finding state changes with offline change point detection
- Finding anomalies with sliding window
- Applying Symbolic Aggregate ApproXimation
- Finding discords and motifs
8. Customer segmentation
- Extraction of energy signatures through regression
- Extraction of average day profiles
- Applying clustering algorithms
9. Load Forecasting
- Brief and short introduction (as the topic has already been covered in previous CCAI-tutorial)
10. Conclusion
11. Additional Resources
- Data Sets
- Software Packages
- Other
12. References