Electricity Load Forecasting using Hybrid Datasets with Linear Interpolation and Synthetic Data
Received: 31 July 2024 | Revised: 23 September 2024 | Accepted: 28 September 2024 | Online: 7 October 2024
Corresponding author: Chakkrit Termritthikun
Abstract
Electricity load forecasting is an important aspect of power system management. Improving forecasting accuracy helps ensure reliable electricity supply and grid operations, as well as cost savings. Collected data often contain Missing Values (MVs), anomalies, outliers, or other inconsistencies caused by power failures, metering errors, data collection errors, hardware failures, network failures, or other unexpected events. This study uses real-world data to investigate whether synthetically generated data can serve as an alternative way of filling in MVs. Three datasets were created from a single original dataset, each using a different imputation method: linear interpolation, imputation using synthetic data, and a proposed hybrid method that combines linear interpolation and synthetic data. The three datasets were compared using deep learning, machine learning, and statistical models, and evaluated by the resulting forecasting accuracy. The findings demonstrate that the hybrid dataset outperformed the datasets produced by the other two imputation methods in terms of the models' forecasting accuracy.
Keywords:
bad data, missing values, deep learning, synthetic data, electricity load forecasting, generative adversarial network
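To make the idea of a hybrid imputation concrete, the following is a minimal sketch of one possible way to combine linear interpolation with synthetic data. The abstract does not state how the two are combined, so the gap-length rule used here (short gaps are interpolated, long gaps are copied from a synthetic series) is an assumption for illustration only, as are the names `find_gaps`, `linear_fill`, `hybrid_impute`, and the `max_linear_gap` threshold. The synthetic series is assumed to be already generated (for example by a generative model) and aligned index by index with the load series.

```python
from typing import List, Optional, Sequence, Tuple


def find_gaps(series: Sequence[Optional[float]]) -> List[Tuple[int, int]]:
    """Return (start, end) pairs, end exclusive, for each run of missing (None) values."""
    gaps = []
    i, n = 0, len(series)
    while i < n:
        if series[i] is None:
            j = i
            while j < n and series[j] is None:
                j += 1
            gaps.append((i, j))
            i = j
        else:
            i += 1
    return gaps


def linear_fill(series: List[Optional[float]], start: int, end: int) -> None:
    """Fill series[start:end] in place on a straight line between its neighbours.

    A gap touching the start or end of the series copies the single
    available neighbour (an assumption, not a rule from the paper).
    """
    left = series[start - 1] if start > 0 else None
    right = series[end] if end < len(series) else None
    if left is None and right is None:
        raise ValueError("series contains no observed values")
    if left is None:
        left = right
    if right is None:
        right = left
    span = end - start + 1
    for k in range(start, end):
        series[k] = left + (right - left) * (k - start + 1) / span


def hybrid_impute(
    load: Sequence[Optional[float]],
    synthetic: Sequence[float],
    max_linear_gap: int = 3,  # hypothetical threshold, not taken from the paper
) -> List[float]:
    """Interpolate short gaps linearly; copy long gaps from the synthetic series."""
    if len(load) != len(synthetic):
        raise ValueError("load and synthetic must have the same length")
    filled = list(load)
    for start, end in find_gaps(load):
        if end - start <= max_linear_gap:
            linear_fill(filled, start, end)
        else:
            filled[start:end] = synthetic[start:end]
    return filled


# Hourly load with a one-step gap and a four-step gap (made-up numbers).
load = [10.0, None, 14.0, None, None, None, None, 20.0]
synthetic = [10.5, 12.5, 13.5, 15.0, 16.0, 17.0, 18.0, 19.5]
print(hybrid_impute(load, synthetic, max_linear_gap=2))
```

With these made-up values, the one-step gap is filled by interpolation and the four-step gap is taken from the synthetic series. Setting `max_linear_gap` to a very large value reduces the sketch to pure linear interpolation, and setting it to 0 reduces it to pure synthetic imputation, which mirrors the three dataset variants compared in the study.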
License
Copyright (c) 2024 Karma Dorji, Sorawut Jittanon, Prapita Thanarak, Pornthip Mensin, Chakkrit Termritthikun
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.