Conditional Tabular GAN-based Two-stage Data Generation Scheme for Short-term Load Forecasting

2020 
Load forecasting is one of the critical tasks for enhancing the energy efficiency of smart grids. Even though recent deep learning-based load forecasting models have shown excellent forecasting performance, one of the common problems they faced was that their forecasting accuracy was highly dependent on the data quality and quantity available for the model training. Collecting a sufficient amount of high-quality data is expensive and time-consuming. Recently, a generative adversarial network (GAN) has shown its potential as a solution to the data shortage problem by generating virtual data based on a small amount of real data, and several studies have used GAN to generate electric load data for training forecasting models. However, due to the noise data problem of GANs, their predictive performance also deteriorated. To solve this problem, in this study, we propose a two-stage data generation scheme that more effectively generates input and output variables for short-term load forecasting. In the first stage, we generate virtual calendar and temperature data used as input variables using a conditional tabular GAN (CTGAN). In the second stage, we generate electric load data corresponding to the input variables using a deep learning-based regression model. Lastly, we construct our forecasting model by training another regression model using a mixture of generated data and real data. To verify the effectiveness of our scheme, we conducted extensive experiments using various datasets and data generation models. We report some of the results.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    46
    References
    3
    Citations
    NaN
    KQI
    []