A Transformer-Based Framework for Appliance Detection Using Smart Meter Consumption Series (VLDB 2024)
Millions of smart meters have been installed by electricity suppliers worldwide during the last decade, allowing them to collect a large amount of electricity consumption data, albeit sampled at a low frequency (one point every 30min). One of the important challenges these suppliers face is how to utilize these data to detect the presence/absence of different appliances in the customers' households. This valuable information can help them provide personalized offers and recommendations to help customers towards the energy transition. Appliance detection can be cast as a time series classification problem. However, the large amount of data combined with the long and variable length of the consumption series pose challenges when training a classifier. We propose ADF, a framework that uses subsequences of a client consumption series to detect the presence/absence of appliances. We also introduce TransApp, a Transformer-based time series classifier that is first pretrained in a self-supervised way to enhance its performance on appliance detection tasks.
If you use this work in your project or research, please cite the following paper:
Adrien Petralia, Philippe Charpentier, and Themis Palpanas. ADF & TransApp: A Transformer-Based Framework for Appliance Detection Using Smart Meter Consumption Series. Proceedings of the VLDB Endowment (PVLDB), 17(3): 553 - 562, 2023.
@article{petraliavldb24,
author = {Petralia, Adrien and Charpentier, Philippe and Palpanas, Themis},
title = {ADF \& TransApp: A Transformer-Based Framework for Appliance Detection Using Smart Meter Consumption Series},
year = {2023},
publisher = {VLDB Endowment},
volume = {17},
number = {3},
issn = {2150-8097},
journal={Proceedings of the VLDB Endowment},
month = nov,
pages = {553–562},
}
We propose the Appliance Detection Framework (ADF) to detect the presence of appliances in households, using real-world consumption series, which are sampled at a very low frequency, and are long and variable-length. ADF addresses these challenges by operating at individual subsequences of each consumption series, instead of each series in its entirety. The framework can be used with any time series classifier designed to predict probabilities.
We propose TransApp, a Transformer-based time series classifier, which can first be pretrained in a self-supervised manner to enhance its ability on appliances detection tasks. This way, TransApp can significantly improve its accuracy.
The proposed architecture lies in combination of a strong embedding block made of dilated convolutional layers followed by a Transformer encoder using Diagonally Masked Self-Attention (DMSA).
Self-supervised pretraining. The use of a self-supervised pretraining of a Transformer architecture on an auxiliary task has been used in the past to boost the model performance on downstream tasks. This process is inspired by the mask-based pretraining of vision transformer and requires only the input consumption series without any appliance information label. It results in a reconstruction objective of a corrupted (masked) time series fed to the model input.
Supervised pretraining. The supervised training results in a simple binary classification process using labeled time series.
We provide a jupyter-notebook example to use our Appliance Detection Framework combined with our TransApp classifier on the CER data : experiments/TransAppExample.ipynb.
In addition, to reproduce papers experiments, use the following guidelines.
Pretraining TransApp in a self-supervised way using non labeled data :
sh LaunchTransAppPretraining.sh
Use our Appliance Detection Framework combined with TransApp to detect appliance in consumption time series :
sh LaunchTransAppClassif.sh
Use our Appliance Detection Framework combined with ConvNet, ResNet or InceptionTime to detect appliance in consumption time series :
sh LaunchModelsClassif.sh
Please refer to this Github ApplianceDetectionBenchmark to reproduce the experiments, where an extensive evaluation of different time series classifiers have been conducted, inluding on the datasets used in this study.
Python version : >= Python 3.7
Overall, the required python packages are listed as follows:
Use pip to install all the required libraries listed in the requirements.txt file.
pip install -r requirements.txt
The data used in this project comes from two sources:
- CER smart meter dataset from the ISSDA archive.
- Private smart meter dataset provide by EDF (Electricité De France).
You may find more information on how to access the datasets in the data folder.
- Adrien Petralia, EDF Research, Université Paris Cité
- Philippe Charpentier, EDF Research
- Themis Palpanas, Université Paris Cité, IUF
We would like to thanks Paul Boniol for the valuable discussions on this project. Work supported by EDF R&D and ANRT French program.