Highlights
- •Smart meter becomes the most researched area in the energy sector due to its benefits for consumers and utilities.
- •The need for powerful algorithms in smart meter data analytics increases as data grows.
- •In smart meter data analytics, performance of algorithms is improved by testing them with real-time data.
- •To test the effectiveness of developed algorithms, a large amount of real-time data is required.
- •Obtaining smart meter data sets is difficult due to privacy and security policies in many countries.
- •Synthetic data are widely utilized to mitigate the above problem without compromising privacy and security issues.
Abstract
Keywords
Current code version | V1.1 |
Permanent link to code/repository used for this code version | https://github.com/SoftwareImpacts/SIMPAC-2022-132 |
Permanent link to reproducible capsule | https://codeocean.com/capsule/0796106/tree/v1 |
Legal code license | GNU General Public License (GPL) |
Code versioning system used | git |
Software code languages, tools and services used | Python |
Compilation requirements, operating environments and dependencies | Python 3.9.12 Pandas Tkinter Prophet |
If available, link to developer documentation/manual | |
Support email for questions | [email protected] |
1. Introduction
Paper | Analytic framework used | Processing scheme | Seasonality and trend removal |
---|---|---|---|
[5] | Spark | AR | Centered moving average |
[2] | Spark | AR | Moving average |
[6] | Generative Adversarial Network (GAN) | ARIMA | – |
2. Software description
2.1 Methodology

2.2 Implementation
2.2.1 Pre-processing algorithm
2.2.2 Data generation

3. Impact overview
3.1 Real time applications
Papers | Application |
---|---|
[11] | Generating synthetic load patterns |
[12] | Clustering approach |
[13] | Synthetic data generation in smart home |
[14] | Consumer privacy mitigation |
[15] | Non-intrusive load monitoring in households |
[16] | Demand Data Generation for ML |
[17] | Synthetic Electric Power Systems |
[18] | Smart meter data analytics |



3.2 Outcomes
Declaration of Competing Interest
References
S. Jain, K.N. Vinoth, A. Paventhan, V. Kumar Chinnaiyan, V. Arnachalam, M. Pradish, Survey on smart grid technologies-smart metering, IoT and EMS, in: 2014 IEEE Students’ Conf. Electr. Electron. Comput. Sci. SCEECS 2014, 2014.
- Evaluation of big data frameworks for analysis of smart grids.J. Big Data. 2019; 6
T. Sirojan, S. Lu, B.T. Phung, E. Ambikairajah, Embedded Edge Computing for Real-time Smart Meter Data Analytics, in: 2019 International Conference on Smart Energy Systems and Technologies, SEST, 2019, pp. 1–5.
- Data privacy and residential smart meters: Comparative analysis and harmonization potential.Util. Policy. 2021; 70101188
- A scalable smart meter data generator using spark.in: Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), 10573 LNCS. 2017: 21-36
C. Zhang, S.R. Kuppannagari, R. Kannan, V.K. Prasanna, Generative Adversarial Network for Synthetic Time Series Data Generation in Smart Grids, in: 2018 IEEE Int. Conf. Commun. Control. Comput. Technol. Smart Grids, SmartGridComm 2018, 2018, pp. 1–6.
- A novel deseasonalized time series model with an improved seasonal estimate for groundwater level predictions.H2Open J. 2019; 2: 25-44
- Smartmeter energy consumption data in London households.2015 ([Online]. Available: https://data.london.gov.uk/dataset/smartmeter-energy-use-data-in-london-households)
C. Zhang, S.R. Kuppannagari, R. Kannan, V.K. Prasanna, Generative Adversarial Network for synthetic time series data generation in smart grids, in: 2018 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids, SmartGridComm, 2018.
- Synthetic energy data generation using time variant generative Adversarial Network.Electronics. 2022; 11: 355
- A data-driven approach for generating synthetic load patterns and usage habits.IEEE Trans. Smart Grid. 2020; 11: 4984-4995
K. Mason, S. Vejdan, S. Grijalva, An ‘on the fly’ framework for efficiently generating synthetic big data sets, in: 2019 IEEE International Conference on Big Data, Big Data, 2019.
M. Razghandi, et al., Variational autoencoder generative adversarial network for Synthetic Data Generation in smart home, in: ICC 2022 - IEEE International Conference on Communications, 2022.
- Mitigating consumer privacy breach in smart grid using obfuscation-based generative Adversarial Network.Math. Biosci. Eng. 2022; 19: 3350-3368
- A synthetic energy dataset for non-intrusive load monitoring in households.Sci. Data. 2020; 7
- Synthetic demand data generation for individual electricity consumers: Generative Adversarial Networks (GANs).Energy AI. 2022; 9100161
- The creation and validation of load time series for synthetic electric power systems.IEEE Trans. Power Syst. 2021; 36: 961-969
- Synthetic Data – A Solution to Train Diagnostic Systems for High-Voltage Equipment without Fault-Condition Measurements.CIGRE Sci. Eng. 2022; 2022: 1-28
N. Patki, R. Wedge, K. Veeramachaneni, The synthetic data vault, in: 2016 IEEE International Conference on Data Science and Advanced Analytics, DSAA, 2016.
N. Iftikhar, X. Liu, F.E. Nordbjerg, S. Danalachi, A Prediction-Based Smart Meter Data Generator, in: 2016 19th International Conference on Network-Based Information Systems, NBiS, 2016, pp. 173–180.
Article info
Publication history
Footnotes
The code (and data) in this article has been certified as Reproducible by Code Ocean: (https://codeocean.com/). More information on the Reproducibility Badge Initiative is available at https://www.elsevier.com/physical-sciences-and-engineering/computer-science/journals.
Identification
Copyright
User license
Creative Commons Attribution (CC BY 4.0) |
Permitted
- Read, print & download
- Redistribute or republish the final article
- Text & data mine
- Translate the article
- Reuse portions or extracts from the article in other works
- Sell or re-use for commercial purposes
Elsevier's open access license policy