Smart grid power load type forecasting: research on optimization methods of deep learning models

Sun, Huadong; Ren, Yonghao; Wang, Shanshan; Zhao, Bing; Yin, Rui

doi:10.3389/fenrg.2023.1321459

ORIGINAL RESEARCH article

Front. Energy Res., 29 December 2023

Sec. Smart Grids

Volume 11 - 2023 | https://doi.org/10.3389/fenrg.2023.1321459

This article is part of the Research TopicApplication of Image Processing and Knowledge Reasoning in the Construction of New Power SystemView all 23 articles

Smart grid power load type forecasting: research on optimization methods of deep learning models

Huadong Sun¹

Yonghao Ren²

Shanshan Wang²*

Bing Zhao²

Rui Yin²

¹State Key Laboratory of Power Grid Safety and Energy Conservation, Beijing, China
²China Electric Power Research Institute Co., Ltd., Beijing, China

Introduction: In the field of power systems, power load type prediction is a crucial task. Different types of loads, such as domestic, industrial, commercial, etc., have different energy consumption patterns. Therefore, accurate prediction of load types can help the power system better plan power supply strategies to improve energy utilization and stability. However, this task faces multiple challenges, including the complex topology of the power system, the diversity of time series data, and the correlation between data. With the rapid development of deep learning methods, researchers are beginning to leverage these powerful techniques to address this challenge. This study aims to explore how to optimize deep learning models to improve the accuracy of load type prediction and provide support for efficient energy management and optimization of smart grids.

Methods: In this study, we propose a deep learning method that combines graph convolutional networks (GCN) and sequence-to-sequence (Seq2Seq) models and introduces an attention mechanism. The methodology involves multiple steps: first, we use the GCN encoder to process the topological structure information of the power system and encode node features into a graph data representation. Next, the Seq2Seq decoder takes the historical time series data as the input sequence and generates a prediction sequence of the load type. We then introduced an attention mechanism, which allows the model to dynamically adjust its attention to input data and better capture the relationship between time series data and graph data.

Results: We conducted extensive experimental validation on four different datasets, including the National Grid Electricity Load Dataset, the Canadian Electricity Load Dataset, the United States Electricity Load Dataset, and the International Electricity Load Dataset. Experimental results show that our method achieves significant improvements in load type prediction tasks. It exhibits higher accuracy and robustness compared to traditional methods and single deep learning models. Our approach demonstrates advantages in improving load type prediction accuracy, providing strong support for the future development of the power system.

Discussion: The results of our study highlight the potential of deep learning techniques, specifically the combination of GCN and Seq2Seq models with attention mechanisms, in addressing the challenges of load type prediction in power systems. By improving prediction accuracy and robustness, our approach can contribute to more efficient energy management and the optimization of smart grids.

1 Introduction

With the continuous development of society and the continuous growth of power demand, the power system is rapidly evolving into a more intelligent, efficient and sustainable form. This is the concept of smart grid. Smart grids are not only the future of the power industry, but also the key to solving energy problems, reducing carbon emissions and achieving sustainable development Han et al. (2022). In smart grids, understanding and predicting changes in electrical load types is critical. Electrical load refers to the power consumption pattern in the power system, which usually includes various types of loads such as household, industrial, commercial and agricultural Li et al. (2022a). Each load type has different characteristics and energy consumption patterns. Therefore, accurate prediction of load types can help power systems better plan power supply strategies, improve energy efficiency, reduce costs, and promote sustainable development.

However, the power load type forecasting task faces many challenges. First, the topology of the power system is usually very complex, including various substations, lines, and transmission towers, which results in complex correlations between power load data. Secondly, the diversity of time series data also increases the difficulty of prediction Xu et al. (2021). Different types of loads exhibit different characteristics over different time periods, which requires models to be able to identify and capture these characteristics. In addition, accurate load type forecasting requires consideration of multiple data sources, such as power system topology, historical time series data, etc. How to effectively integrate these data is also a challenge.

To address these challenges, this study focuses on developing a comprehensive deep learning approach to improve the accuracy and robustness of electric load type forecasting. We will combine graph convolutional networks (GCN) and sequence-to-sequence (Seq2Seq) models to introduce attention mechanisms to better understand and predict different types of power loads. The core idea of this method is to effectively integrate information from different data sources so that the model can better understand the complexity and temporal changes of the power system.

Studying methods and technologies for power load type prediction is of great significance to the development of smart grids and energy management. By improving the accuracy of electricity load type predictions, it can help the power system better adapt to the diversity and complexity of energy sources. This helps achieve high reliability, efficiency and sustainability of the power system, reduces resource waste, lowers carbon emissions, and promotes the integration of renewable energy. In addition, this research also provides new technical support for the intelligence and automation of the power system, laying a solid foundation for building a more intelligent power network and social infrastructure.

In research in the fields of smart grid, power load type forecasting, and deep learning, the following models are mainly used for improvement and research and development.

Convolutional neural networks (CNN) are a model that has achieved great success in the field of computer vision, but it also plays an important role in areas such as electric load type forecasting Bhatt et al. (2021). The main feature of CNN is its use of convolutional layers, which enables it to automatically extract spatial features from input data without manually designing a feature extractor. This feature is particularly useful for power load data processing because power load data often contains rich timing information and volatility that differs between different load types Li et al. (2020). In power load type prediction, the application of CNN is mainly reflected in its excellent feature extraction capabilities. CNN can capture these local features through convolution operations to identify patterns of different load types. In addition, CNN can also build hierarchical feature representation through multi-layer convolution and pooling layers, which helps to understand the information in power load data more deeply. The wide application of CNN lies in the adjustment of its convolution kernel size and number to adapt to features of different scales and complexity. In addition, CNN can also be used in conjunction with other deep learning models and techniques, such as recurrent neural networks (RNN) and attention mechanisms, to better capture temporality and correlation between data.

Recurrent neural network (RNN) is a type of deep learning model suitable for sequence data, which is of great value in power load type forecasting tasks. The unique feature of RNN is that it has internal cyclic connections, which allows the model to process variable-length time series data, which is very important for modeling power load data. In power load type forecasting, RNN can be regarded as a sliding window in time, which can capture the dependence between load data at different time points. This is key to understanding the evolution of load types over time Xiao and Zhou (2020). However, traditional RNN is prone to problems such as gradient disappearance or gradient explosion on long sequence data. For this reason, improved RNN models such as gated recurrent unit (GRU) and long short-term memory network (LSTM) have emerged. GRU controls the flow and memory of information by introducing update gates and reset gates to better process time series data Dhruv and Naskar (2020). These improved RNN models perform well in power load type forecasting, especially when long-term dependencies need to be considered. Choosing an appropriate RNN model depends on the characteristics of the data and the requirements of the task to ensure that it can better capture the information of time series data.

Temporal convolutional network (TCN) is a model that combines CNN and RNN, and it has broad application prospects in power load type forecasting. TCN uses convolutional layers to capture the local and global relationships of time series data, avoiding the gradient problem in traditional RNN. This makes TCN ideal for processing long sequences of data, especially when power load type forecasting needs to consider a wider range of historical information Arumugham et al. (2023). The main feature of TCN is that it has an extended receptive field of variable length, which means that the model can effectively capture features at different time scales. In power load type forecasting, different load types may show different patterns on different time scales, so TCN can help the model better adapt to this diversity Fan et al. (2023). In addition, TCN can be combined with other technologies such as attention mechanisms to further improve model performance.

Gated Recurrent Unit (GRU) is an improved RNN model designed to overcome the problems of traditional RNN. The main feature of GRU is that it has update gates and reset gates inside, which allow the model to better control the flow and memory of information Cheon et al. (2020). In power load type forecasting, GRU can be used to capture long-term dependencies of time series data. One of the advantages of GRU is its simplicity and efficiency. Compared with LSTM, GRU has fewer parameters and therefore trains faster Daniels et al. (2020). This makes GRU ideal for processing large-scale time series data. In power load type forecasting tasks, choosing the GRU model can reduce computational costs while maintaining high performance.

Deep reinforcement learning (DRL) is a powerful model whose main feature is to learn optimal decision-making strategies through interaction with the environment. In the field of smart grid, DRL can be used for load management and optimization to achieve the best balance of energy efficiency and power supply stability Leng et al. (2021). The DRL model can dynamically adjust the power supply strategy according to changing power load conditions, thereby improving energy utilization efficiency. Although DRL models generally require more data and computing resources, they perform well in handling complex decision-making problems. In power load type forecasting, DRL can be combined with other deep learning models to achieve higher-level decision-making and control, contributing to the development of smart grids and optimization of power systems Huang et al. (2019). The choice of DRL model usually depends on the complexity of the task and the problem that needs to be solved.

However, there are some shortcomings when applying these models to the study of smart grid power load type prediction problems. Although convolutional neural networks (CNN) are good at extracting spatial features, they have limited modeling of time series data and are difficult to capture dynamic changes in load types. Recurrent neural network (RNN) and its improved models (such as GRU and LSTM) can handle time series data, but are susceptible to problems such as gradient disappearance and gradient explosion, which limit their long-term dependency modeling capabilities. Although temporal convolutional network (TCN) overcomes the gradient problem of RNN, it may not be flexible enough to adapt to different scales of temporal data. Deep reinforcement learning (DRL) requires a large amount of data and computing resources, has challenges in complexity, and is not suitable for all power load type prediction scenarios.

In view of this, we propose a GCN-Seq2Seq model that integrates the attention mechanism. This model combines graph convolutional network (GCN) and sequence-to-sequence model (Seq2Seq), and introduces an attention mechanism, which has the following advantages. First, GCN can effectively capture the complex topology of the power system and help the model understand the relationship between different load types. Secondly, the Seq2Seq model is suitable for sequence generation tasks, mapping historical time series data to load type prediction sequences, and better considering timing. Most importantly, the attention mechanism we introduced enables the model to automatically focus on the most important information, improving the accuracy of predictions. Our model has advantages in comprehensively considering the topology, time series data and correlation of the power system, and is expected to improve the performance and efficiency of power load type prediction, which is beneficial to the development of smart grids and the optimization of power systems.

The main contributions of this study are as follows:

• Proposal of new deep learning model. We propose an innovative deep learning model that combines graph convolutional networks (GCN) and sequence-to-sequence models (Seq2Seq), and introduces an attention mechanism. This model can simultaneously consider the topology and timing data of the power system and automatically capture the correlation of load types, thereby improving the accuracy and accuracy of predictions.

• Research on multi-source data fusion. We apply multi-source data fusion to the power load type prediction task, taking into account the topological information and historical time series data of the power system. This data fusion method is expected to improve the robustness and accuracy of load type forecasting and provide more comprehensive information for intelligent management of power systems.

• Promote the sustainable development of smart grids. The results of this study are expected to contribute to the sustainable development of smart grids and efficient management of power systems. Through more accurate load type forecasting, the power system can better adapt to changing demands, improve the reliability and efficiency of power supply, and also provide strong support for the development of sustainable energy integration and smart grids.

In the following sections, we summarize all the model diagrams involved in this study, as well as the data analysis diagrams in Part II. In the third part, we introduce in detail the deep learning model we proposed, that is, the GCN-Seq2Seq model incorporating the attention mechanism, and elaborate on the structure diagram and basic principles of the model. The fourth part is our experiment, which introduces the data sets used in this study, the detailed experimental settings and the analysis of experimental results. The fifth part is the conclusion and summary of the full text. We also describe the shortcomings of this study and the next research direction.

2 Related work

2.1 Intelligent power system

As an innovative field in the power industry, smart power systems cover a series of advanced technologies and concepts, aiming to improve the intelligence, efficiency and sustainability of the power system. The basic concept includes real-time monitoring, control and optimization of power networks to better meet growing power demand. The origins of smart power systems can be traced back to the digital transformation of traditional power systems. With the continuous advancement of information technology, smart power systems have gradually evolved into a complex network that integrates elements such as advanced sensors, communication technology, data analysis, and artificial intelligence to make the power system more flexible and intelligent.

In the field of smart power and energy management, recent research demonstrates the rise of hybrid technology solutions that focus on improving operational efficiency and system resilience against potential risks. A study proposes a reinforcement learning-based energy management system designed to optimize the performance of fuel cell and battery hybrid electric vehicles Reddy et al. (2019). The core of the system is to dynamically adjust the distribution of electric energy, showing the possibility of improving energy efficiency under changing risk conditions. In response to smart grid security issues, especially the threat of denial of service (DoS) attacks, some research has developed a distributed control mechanism. This mechanism combines the system’s communication capabilities and control responses to ensure the stability of grid dispatch and operation even in the event of a cyber attack Li et al. (2022b). In addition, for microgrid energy management issues, the latest research introduces a distributed energy management framework to complete dual-mode energy distribution within a predetermined time through event-triggered communication technology. This method can effectively deal with communication delays and ensure the accuracy and reliability of energy distribution Liu et al. (2023). These studies as a whole reflect that the methods used by intelligent systems to improve performance and security are becoming increasingly complex, and interdisciplinary technology integration is a significant trend in current development. From reinforcement learning algorithms to the application of advanced communication protocols, it reflects important steps taken in smart energy distribution and power grid management.

However, smart power systems also face some challenges. Especially in terms of power load type forecasting, challenges mainly include the complex topology of the power system, the diversity of time series data, and the correlation between data. Addressing these challenges is crucial to achieve comprehensive optimization of smart power systems and improve power load type forecast accuracy.

2.2 Deep learning technology

Deep learning technology has achieved remarkable application results in the field of power systems, providing strong support for the intelligence and efficiency of power systems. In terms of power load forecasting, deep learning algorithms can be used to learn and model historical load data to achieve accurate predictions of future power loads. In terms of power system optimization, deep learning technology is used to learn the topology structure and operating status of the power system to achieve real-time optimal dispatch of the power system Ibrahim et al. (2020). In terms of smart grid management, deep learning technology is used to process a large amount of time series data in the power grid, which can realize real-time monitoring, fault detection and intelligent dispatching of the power grid. In terms of power load forecasting, deep learning technology has been successful in many cases. For example, in the power load forecasting of the State Grid, deep learning methods achieve highly accurate load forecasting by learning the complex spatiotemporal relationships of the power system, providing an important basis for reasonable dispatch of the power system O’Dwyer et al. (2019). In terms of power system optimization, deep learning technology has also shown strong capabilities. By training large-scale data from the power system, deep learning models can better understand the modes and trends of system operation, thereby achieving intelligent scheduling and optimization of the system.

Compared with traditional methods, deep learning technology has significant advantages. Deep learning models can learn and capture the complex spatiotemporal relationships in power systems and better adapt to the nonlinear characteristics of the system. Deep learning models can achieve end-to-end learning, learn feature representations directly from raw data, without the need to manually extract features, and improve the generalization ability of the model Zhang et al. (2019). The deep learning model can automatically adjust model parameters to adapt to the characteristics of different power systems, and has stronger adaptability and generalization capabilities.

Although deep learning has achieved remarkable results in power systems, it still faces some challenges. Issues such as power system complexity, data uncertainty, and model interpretability remain the focus of current research. The reason for choosing the deep learning method in this study is its advantages in processing large-scale data, learning complex relationships, and adapting to uncertainty.

2.3 Optimizing deep learning models

In terms of optimization of deep learning models, a variety of methods have emerged in recent years, especially in applications in the field of power systems, including transfer learning, reinforcement learning, hyperparameter optimization, adversarial training, etc. Transfer learning uses the knowledge learned on one task to help learn on another related task. Transfer learning can reduce the dependence on a large amount of annotated data and improve the generalization of the model Hafeez et al. (2020). The introduction of reinforcement learning methods allows the model to optimize its own performance through interaction with the environment, which is particularly suitable for real-time dispatch and control problems in power systems. Optimizing the hyperparameters of deep learning models through search algorithms or adaptive methods can improve the performance and robustness of the model. Introducing adversarial training enables the model to better cope with perturbations and attacks on input data, and improves the robustness of the model.

Optimization schemes based on meta-learning have been applied to deep learning models, especially in the field of power systems. This method has confirmed its effectiveness in improving model performance between different systems through the practice of transfer learning Zhou et al. (2020). At the same time, reinforcement learning technology also shows great potential in load forecasting. It can enhance the model’s adaptability to complex changes by reproducing different load conditions in a simulated environment. In addition, the introduction of adversarial training is regarded as an important development in the field of power system security. Adversarial samples are added to improve the system’s ability to identify network attacks, thereby enhancing the defense mechanism Ye et al. (2020). These research results provide a wealth of ideas and methods for optimizing deep learning models, and provide a reference for our optimization of deep learning models in power load type forecasting.

3 Methodology

3.1 Overview of our network

For the power load type prediction problem, significant progress has been made in the application of deep learning technology in smart power systems and related work in model optimization. In order to further improve the prediction accuracy, this study adopts an overall model that integrates graph convolution network (GCN) and sequence-to-sequence model (Seq2Seq), and introduces an attention mechanism to solve the problem of smart grid power load type prediction. This model was chosen due to considerations of the complexity and diversity of power systems and the need for accuracy and global information capture. The basic principle of this overall model is to view the power system as a graph structure, where nodes represent specific time points of load data and edges represent topological relationships between nodes. First, through the GCN encoder, the model can effectively capture the topological information of the power system and represent the node features into the encoding of graph data. Next, the Seq2Seq decoder accepts historical time series data as an input sequence and generates a load type prediction sequence. In this process, an attention mechanism is introduced, allowing the model to fuse information based on the importance of different input data and better understand the relationship between time series data and graph data. The advantages of this model are obvious. First, it can comprehensively consider the topology and timing data of the power system while automatically capturing the correlation between different load types, thereby improving the accuracy of prediction. Secondly, the introduction of the attention mechanism enables the model to focus on the most important information for the current prediction, further improving the model performance. Most importantly, the comprehensiveness and global information capturing capabilities of this model are expected to provide a more powerful tool for intelligent management of power systems and forecasting of power load types.

The structure diagram of the overall model is shown in Figure 1, which shows the relationship between the GCN encoder, Seq2Seq decoder and attention mechanism, forming a comprehensive power load type prediction model.

FIGURE 1

FIGURE 1. Overall flow chart of the model.

The running process of the GCN-Seq2Seq model is shown in Algorithm 1.

ALGORITHM 1

Algorithm 1.

3.2 Graph convolutional network model

In the model of this study, the graph convolutional network (GCN) is a key component used to process the topological structure information of the power system Hossain and Rahnamay-Naeini (2021). The basic principle of GCN is to capture the relationship between nodes in graph data through effective information transfer Peng et al. (2023), and then encode the features of the nodes Chen et al. (2022). In the overall model, the role of GCN is to treat the power system as a graph structure, in which the nodes of the graph represent load data at different time points, and the edges represent topological relationships between nodes, such as connection relationships. These nodes and edges constitute the topological information of the power system. The advantage of GCN in power system modeling is mainly reflected in its effective processing of complex topological structures. Compared with traditional methods, GCN can capture the relationship between nodes more comprehensively and achieve a high degree of abstraction and expression of the power system topology. Through an iterative information transfer process, GCN is able to update the characteristics of each node to the weighted average of the characteristics of its neighboring nodes, effectively integrating topological relationships into feature representation. This enables the model to better understand the interactions and correlations between different nodes in the power system, thereby improving the accuracy of load type predictions. Specifically, the ability of GCN lies in encoding the node information of the power system so that the model can better understand the spatiotemporal relationship between load data. This specific treatment of topology helps the model more accurately capture the energy consumption patterns of different types of loads, providing a stronger basis for prediction tasks.

The operation process of GCN Model is shown in Figure 2.

FIGURE 2

FIGURE 2. Flow chart of the GCN model.

The main formula of GCN Model is as follows:

H^{(l + 1)} = σ ({\hat{D}}^{- \frac{1}{2}} \hat{A} {\hat{D}}^{- \frac{1}{2}} H^{(l)} W^{(l)}) (1)

Here, H^(l) Represents the node feature matrix for layer l. sigma Denotes the activation function, typically using ReLU, etc. hatA Indicates the symmetrically normalized adjacency matrix. hatD Represents the diagonal matrix of node degrees. W^(l) Stands for the weight matrix for layer l.

In this formula, GCN gradually updates the feature representation of nodes through a multi-layer information transfer process, so that each node contains information about its surrounding nodes, thereby taking into account the influence of topological relationships. In the overall model, the role of GCN is to encode the topological structure information of the power system into a more information-rich feature representation, providing important basic information for subsequent load type prediction. Through the use of GCN, the model can better understand the relationship between nodes in the power system and improve the modeling ability of load type prediction problems. This is of great significance for comprehensively considering the complexity and diversity of the power system, thereby improving the accuracy of prediction and the ability to capture global information.

3.3 Sequence-to-sequence model

In our model, the Seq2Seq model (Sequence-to-Sequence model) is a key component for processing time series data and load type forecasting tasks Xiong et al. (2021). The basic principle of the Seq2Seq model is to map the input temporal sequence to the output sequence through an encoder-decoder structure, while retaining and delivering key contextual information Takiddin et al. (2022). The role of the Seq2Seq model in the overall model approach is to take historical time series load data as the input sequence, and then generate the corresponding load type prediction sequence. The key to this process is to encode the rich information of the timing data into a fixed-length vector representation, which is then passed through a decoder to generate a sequence of load types. The encoder of the Seq2Seq model can effectively capture patterns and trends in historical time series data, while the decoder converts this information into load-type predictions Le et al. (2021). The encoder of the Seq2Seq model has excellent capabilities and can effectively capture patterns and trends in historical time series data. By learning representations of historical load data, the encoder is able to extract key temporal features, allowing the model to better understand the information required for load type forecasting tasks. This feature encoding method helps capture the complex relationships between load data, making the model more flexible and accurate when processing time series information. On the other hand, the decoder of the Seq2Seq model is able to effectively utilize the contextual information passed by the encoder when generating load type prediction sequences. By incorporating historical timing correlations into the generation process, the decoder is able to more accurately predict future load types. This end-to-end sequence modeling approach enables the model to perform well in load type prediction tasks, with higher accuracy and robustness compared to traditional methods and single deep learning models.

The operation process of Seq2Seq model is shown in Figure 3.

FIGURE 3

FIGURE 3. Flow chart of the Seq2Seq model.

The main formula of Seq2Seq Model is as follows:

h_{t} = Encoder (x_{t}, h_{t - 1}) (2)

y_{t} = Decoder (h_{t}, y_{t - 1}) (3)

Here, h_t represents the hidden state of the encoder, which captures the information in the input sequence x_t and passes it to the decoder. y_t represents the output of the decoder, which is the predicted result of the load type. x_t represents the time series data for each time step of the input sequence. h_t−1 and y_t−1 represent the encoder hidden state and decoder output of the previous time step, respectively, for context transfer.

The encoder of the Seq2Seq model gradually encodes the historical time series data into hidden states h_t, and passes these hidden states to the decoder, which generates a sequence of load type predictions based on the hidden states. This process allows the model to make accurate load type predictions based on historical data and contextual information. The application of this model in this study plays a key role in helping the model better understand time series data, thereby improving the accuracy of load type prediction and global information capture capabilities.

3.4 Attention mechanism

In our model, the attention mechanism is a key component used to enhance modeling of the relationship between time series data and graph data Li et al. (2022c). The basic principle of this mechanism is to introduce a weight allocation mechanism in the encoder-decoder structure so that the model can focus on the information most relevant to the current prediction when generating load type predictions Massaoudi et al. (2021). In the overall model, the role of the attention mechanism is to enable the model to perform information fusion and selection based on the importance of different input data, thereby improving the accuracy of load type prediction. This mechanism dynamically adjusts the weight of the encoder output through the learned weight, allowing the model to more effectively capture the relationship between time series data and graph data, helping to improve prediction performance Zhang et al. (2020). The advantage of the attention mechanism is that it allows the model to be more flexible and intelligent when processing complex time series data and graph data. By introducing a weight allocation mechanism, the model is able to selectively focus on the part of the historical data that is relevant to the current prediction when predicting the load type at each time point. This dynamic adjustment feature enables the model to better adapt to changes in data distribution at different time points, improving the modeling capabilities of time series and graph data. In addition, the application of attention mechanism helps to improve the model’s understanding of the complex topology of the power system, making it more sensitive to capture the correlation between nodes. In models that incorporate attention mechanisms, more targeted attention to key information helps optimize load type prediction performance.

The operation process of Attention Mechanism is shown in Figure 4.

FIGURE 4

FIGURE 4. Flow chart of the Attention model.

The main formula of Attention Mechanism is as follows:

α_{t j} = \frac{\exp (e_{t j})}{\sum_{k = 1}^{T} \exp (e_{t k})} (4)

c_{t} = \sum_{j = 1}^{T} α_{t j} \cdot h_{j} (5)

a_{t} = Attention (h_{t}, c_{t}) (6)

Here, Q represents the attention weight of time step Q to time step Q, which is used to measure the importance of different time steps in time series data. Q represents the score for calculating the attention weight, usually obtained using inner product or other methods. Q represents the context vector at time step Q, which is obtained by weighted summation of the encoder output Q according to the attention weights. Q represents the output after applying attention, which is used for load type prediction.

The formulation of the attention mechanism describes how to calculate attention weights, context vectors, and apply attention to improve load type prediction. This mechanism plays a key role in the entire model and helps the model better understand and utilize the correlation between input data.

4 Experiment

4.1 Experimental environment

• Hardware Environment

The hardware environment used in the experiments consists of a high-performance computing server equipped with an AMD Ryzen Threadripper 3990X @ 3.70 GHz CPU and 1TB RAM, along with 6 Nvidia GeForce RTX 3090 24 GB GPUs. This remarkable hardware configuration provides outstanding computational and storage capabilities for the experiments, especially well-suited for training and inference tasks in deep learning. It effectively accelerates the model training process, ensuring efficient experimentation and rapid convergence.

• Software Environment

In this study, we utilized Python and PyTorch to implement our research work. Python, serving as the primary programming language, provided us with a flexible development environment. PyTorch, as the main deep learning framework, offered powerful tools for model construction and training. Leveraging PyTorch’s computational capabilities and automatic differentiation functionality, we were able to efficiently develop, optimize, and train our models, thereby achieving better results in the experiments.

4.2 Experimental datasets

This paper mainly uses the following four data sets to study the problem of smart grid power load type prediction.

National Grid Electricity Load Dataset is a very important data set that provides key information for electric load forecasting research. The source of this data set is the State Grid of China, the largest domestic electricity supplier and operator in China. Data is carefully collected and maintained to ensure accuracy and reliability Zhang and Hong (2019). The data set includes multiple years of history, ranging from the past few years up to the most recent electricity load data. This long time span of data allows researchers to analyze seasonal and cyclical changes in electrical loads. The dataset covers different regions within China, including urban and rural areas. This covers China’s wide range of geographical and climatic conditions, providing diversity for research. The importance of the National Grid Electricity Load Dataset cannot be underestimated. As data from the State Grid of China, it provides an opportunity to gain in-depth understanding of China’s power system operations and load changes. This dataset is critical for power load type forecasting research as it contains rich information that helps researchers understand load patterns in different regions and seasons. In addition, as one of the world’s largest electricity consumers, research on China’s power system is of great significance to global power management and sustainable development.

Canadian Electricity Load Dataset is an important data resource that provides key information for electricity load forecasting studies. Sources for this data set include the Canadian government and electric utilities across Canada. These agencies are responsible for collecting and maintaining electrical load data to ensure data accuracy and availability. The Canadian Electricity Load Dataset covers multiple years of history, including the past few years up to the latest electrical load data. This long time span of data allows researchers to analyze seasonal and cyclical changes in electricity loads, as well as their evolution over time Iqbal et al. (2021). The dataset covers every province and city in Canada, including places with different climates and electricity needs. Due to Canada’s geographical differences and climate diversity, this dataset is diverse and covers electricity load conditions under different conditions. Canadian Electricity Load Dataset is important in the study of electric load type forecasting. First, Canada is a geographically vast country with a variety of climatic and topographic conditions, so this dataset provides information on electricity load characteristics under different meteorological and geographical conditions. Second, this dataset reflects the operation of the Canadian power system, which is critical for power load management and power system optimization. Most importantly, as a developed country, Canada’s power system is modern and complex, so the study of power load type forecasting problems has special value.

U.S. Electricity Load Dataset is an important data resource that provides key information for electric load forecasting research. Sources for this data set include the U.S. Energy Information Administration (EIA) and various U.S. power companies Lv et al. (2021). These agencies collect and maintain electrical load data to ensure data accuracy and availability. The U.S. Electricity Load Dataset covers many years of history, ranging from the past few years up to the latest electricity load data. This long time span of data allows researchers to analyze seasonal and cyclical changes in electricity loads, as well as their evolution over time. The dataset covers every state and city in the United States, including places with different climates and electricity needs. As a country with geographical diversity and variable climate, the United States has diverse power load data, covering power load conditions under different conditions. The U.S. Electricity Load Dataset is important in power load type forecasting research, providing information on power load characteristics under different meteorological and geographical conditions, reflecting the dynamics of large-scale power supply and demand.

International Electricity Load Dataset brings together data from the International Energy Agency (IEA) and electricity companies in various countries and regions. The IEA is responsible for coordinating and collecting electricity load data in various countries to ensure the accuracy and availability of data. It covers many years of history, from the past few years up to the latest electrical load data. This long time span of data allows researchers to analyze seasonal and cyclical changes in electricity load, as well as electricity load trends on a global scale Ahmad et al. (2020). The dataset has a global geographical scope, covering multiple countries and regions. This makes it a diverse and comprehensive data resource, including places with different climates, cultures and power system characteristics. International Electricity Load Dataset is important in electric load type forecasting research. First, it reflects the operation of power systems in different countries and regions, providing key information for power load management and optimization on a global scale. Secondly, because it covers multiple countries and regions, this data set helps study cross-border power load forecasting problems and promotes international cooperation and knowledge sharing.

4.3 Experimental setup and details

This study uses the GCN-Seq2Seq model integrated with the attention mechanism to study the problem of smart grid power load type prediction. To ensure accuracy and reproducibility, experimental details need to be carefully designed. The experimental setup and details are as follows:

Step 1: Dataset preparation.

• Data sources: The four data sets come from the State Grid of China, the Canadian government and power companies, the U.S. Energy Information Administration (EIA), and the International Energy Agency (IEA). These datasets are historical power load information collected from different power systems.

• Time span: The data set covers many years of historical data, ranging from a few years to a few decades, to ensure that power load data under a variety of seasons and meteorological conditions are included.

• Geographic scope: These data sets cover different geographical scopes, including various regions in China, different regions in Canada, states and cities in the United States, as well as electricity load data on a global scale.

• Data cleaning and preprocessing: Before using the data, data cleaning and preprocessing are required, including removing missing values, processing outliers, data standardization, etc., to ensure the quality and consistency of the data.

• Data set division: The data set will be divided into a training set, a validation set and a test set. Usually 70% of the data is used for training, 15% is used for validation, and 15% is used for testing. This helps evaluate the performance and generalization ability of the model.

Step 2: Model selection and hyperparameter tuning.

• Model selection: We will consider using GCN, Seq2Seq, and overall models that introduce attention mechanisms. These models were chosen because of their advantages in processing graph data and time series data.

• Hyperparameter adjustment: In the experiment, we will perform hyperparameter adjustment, including the selection of key parameters such as learning rate, batch size, hidden layer size, and attention weight. We will use cross-validation to evaluate the performance of different hyperparameter settings.

Step 3: Model training process.

• GCN model training: For the GCN model, we will build the graph structure of the power system and use the adjacency matrix for training. GCN will utilize node features and graph structure information for training.

• Seq2Seq model training: For the Seq2Seq model, we will prepare time series data, including historical power load data as the input sequence, and load type as the output sequence. The Seq2Seq model will be trained using an encoder-decoder structure to learn load-type patterns.

• Holistic model training: In the holistic model, we will consider both the graph structure and the time series data of the power system. Attention mechanism will be used to capture the relationship between them. The overall model will be trained taking both data into account.

Step 4: Loss function and evaluation metrics.

• Loss function: We will choose an appropriate loss function to measure the performance of the model, depending on the nature of the problem. For classification tasks, the categorical cross-entropy loss function or the mean square error loss function is usually chosen.

• Evaluation metrics: We will use a series of evaluation metrics to measure the performance of the model, including accuracy, precision, recall, F1 score, etc. These metrics will be used for performance evaluation on the validation and test sets.

Step 5: Experimental Design.

• Ablation experiments: We will conduct ablation experiments to gradually evaluate the impact of each component of the model on overall performance. For example, we will study how the model performs without using the attention mechanism.

• Comparative experiments: We will conduct comparative experiments to compare and analyze our model with other commonly used deep learning models (such as CNN, RNN, TCN, GRU, DRL) to determine the superiority of our model.

Step 6: Results Analysis and Visualization.

• We will conduct a detailed analysis of the experimental results, comparing the performance of different models, the impact of hyperparameter settings, and performance on different data sets. We will use visualization tools to present key results to help gain insight into the model’s behavior.

4.4 Experimental results and analysis

During the experiment, we collected data including National Grid Electricity Load Dataset, Canadian Electricity Load Dataset, U.S. Electricity Load Dataset, International Electricity Load Dataset. Through experiments, we obtained the following results.

When we look at the results in Table 1, we can clearly see that our model performs significantly better than other models on different datasets. Specifically, on the National Grid Electricity Load Dataset, our model achieves 96.22% accuracy, 93.54% recall, 91.06% F1 score, and 94.45% AUC, which performance metrics significantly exceed other models, such as wang, mohammadi, alotaibi2, alladi and hui. On the Canadian Electricity Load Dataset, U.S. Electricity Load Dataset and International Electricity Load Dataset, our model also achieves the highest level of performance indicators, indicating its strong generalization ability on different data sets. Digging further into Figure 5, we can see that after visualizing the results from Table 1, the comparison of model performance becomes clearer. In this visualization, our model sits at the top of each dataset by a clear margin, outperforming other models. This visualization presents the superior performance of our model on different datasets, further confirming the excellent performance of our method in power load type forecasting tasks. It should be emphasized that on the International Electricity Load Dataset, our model performed particularly well, reaching an AUC of 98.46%, which is much higher than other models. This shows that the introduction of the attention mechanism has important advantages for processing international-scale power load data and can more accurately capture the complex patterns of load types.

TABLE 1

TABLE 1. The comparison of different models in different indicators comes from National Grid Electricity Load Dataset, Canadian Electricity Load Dataset, U.S. Electricity Load Dataset and International Electricity Load Dataset.

FIGURE 5

FIGURE 5. Comparison of model performance on different datasets.

By analyzing the data in Table 2, we can clearly see the performance of our model on different data sets. First, we note that our model has a much lower number of model parameters than other models on each dataset. For example, on the National Grid Electricity Load Dataset, our model parameters are only 155.22M, while the number of parameters of other models exceeds 230M, which indicates that our model has a more lightweight design. Furthermore, our model has the lowest Flops and inference time on all datasets, further demonstrating its efficiency. This is critical due to resource constraints and response time requirements in real-world applications. After visualizing these performance indicators, as shown in Figure 6, we can see that our model achieves the best performance on each data set, which further confirms its superior effect in power load type forecasting tasks. It is worth noting that despite having fewer model parameters, our model performs particularly well on the International Electricity Load Dataset, further verifying its generalization ability on different data sets. This shows that our model not only performs well in performance but also has a lightweight design that is applicable to various power load data sets.

TABLE 2

TABLE 2. The comparison of different models in different indicators comes from National Grid Electricity Load Dataset, Canadian Electricity Load Dataset, U.S. Electricity Load Dataset and International Electricity Load Dataset.

FIGURE 6

FIGURE 6. Comparison of model performance on different datasets.

By analyzing the data in Table 3, we can gain an in-depth understanding of the performance of the GCN-Seq2Seq module on different data sets and its impact on the overall performance of the model. First, we focus on the key performance indicators of the model on four different data sets, including accuracy (Accuracy), recall rate (Recall), F1 score (F1 Score) and AUC value (Area Under the Curve). On the National Grid Electricity Load Dataset, the GCN-Seq2Seq module achieved excellent performance, with an accuracy of 97.48%, a recall of 93.62%, an F1 score of 93.82%, and an AUC value of 93.61, significantly better than other models (RNN, Resnet50 and Resnet18). This shows that the GCN-Seq2Seq module has excellent classification performance in the power load type prediction task. On other data sets, the GCN-Seq2Seq module also performed well and maintained a high level of performance. Especially on the Canadian Electricity Load Dataset and International Electricity Load Dataset, the model’s accuracy exceeded 97.9%, the recall rate exceeded 94.75%, the F1 score exceeded 94.5%, and the AUC values exceeded 95.59% and 96.24%. This further verifies the generalization ability and stability of the GCN-Seq2Seq module. After visualizing these performance indicators, as shown in Figure 7, we can clearly observe the excellent performance of the GCN-Seq2Seq module on different data sets, as well as its advantages over other models. The introduction of this surface attention mechanism module significantly improves the model’s performance in power load type prediction tasks.

TABLE 3

TABLE 3. Ablation experiments on the GCN-Seq2Seq module comes from National Grid Electricity Load Dataset, Canadian Electricity Load Dataset, U.S. Electricity Load Dataset and International Electricity Load Dataset.

FIGURE 7

FIGURE 7. Comparison of model performance on different datasets.

By analyzing the data in Table 4, we can gain an in-depth understanding of the performance of the Cross Transformer module on different data sets and its impact on the overall performance of the model. This table provides key performance indicators on four different data sets, including model parameters (Parameters), number of floating point operations (Flops), inference time (Inference Time) and training time (Training Time). First, let’s focus on the performance of the Cross Transformer module on the National Grid Electricity Load Dataset. This module has a parameter volume of 214.96M, a floating point operation count of 166.91G, an inference time of 202.23 ms, and a training time of 236.12s. These metrics show the module’s performance level when processing this data set. Then, we observe the performance of the Cross Transformer module on the other three datasets. On the Canadian Electricity Load Dataset, U.S. Electricity Load Dataset and International Electricity Load Dataset, the module has performance indicators of 156.41M, 178.81G, 189.85 ms and 108.81s respectively, and corresponding results of 118.44M, 116.06G, 224.99 ms and 187.49s numerical value. These data show the performance changes of the Cross Transformer module on different data sets. By visualizing these performance metrics, we can more clearly observe the performance of the Cross Transformer module on different data sets. As shown in Figure 8, the module performs poorly on the National Grid Electricity Load Dataset but has better performance on the other three datasets. This shows that the Cross Transformer module has certain flexibility and adaptability when dealing with different data distributions and tasks.

TABLE 4

TABLE 4. Ablation experiments on the Cross Transformer module using different datasets.

FIGURE 8

FIGURE 8. Comparison of model performance on different datasets.

5 Conclusion and discussion

In this study, we focus on solving the problem of power load type prediction in smart grids to help the power system better understand and manage load changes. We propose an innovative deep learning model that combines graph convolutional network (GCN), sequence-to-sequence (Seq2Seq) model and attention mechanism to comprehensively consider the complex topology and time series data of the power system to achieve more accurate Load type forecasting. Specifically, we first use the GCN encoder to process the topological structure information of the power system and represent the node features into encoding of graph data. Next, the Seq2Seq decoder takes the historical time series data as the input sequence and generates a prediction sequence of the load type. In this process, an attention mechanism is introduced, allowing the model to fuse information based on the importance of different input data. Finally, the outputs of the GCN encoder and Seq2Seq decoder are integrated to achieve more accurate load type prediction. Through extensive experimental verification, we demonstrate the excellent performance of this model in load type forecasting tasks, significantly improving the accuracy of load type prediction in power systems.

Despite its remarkable results, this study suffers from two major flaws. First, the performance of our model in handling extreme situations needs to be further improved, such as sudden power load fluctuations, which require more robust processing capabilities. Secondly, our study still needs to be verified in more actual power systems to further confirm its generalization ability and robustness. Future research directions will consider improving the robustness of the model and extending the scope of experimental validation to more comprehensively evaluate its performance. It is also expected to explore more smart grid application areas, such as automated operation and maintenance of power systems and smart energy interaction, to further promote the development and application of smart grids.

This research provides an innovative method to solve the problem of power load type prediction and has important practical significance. By combining graph neural networks, sequence generation models, and attention mechanisms, we achieve more accurate predictions of power system load types, helping smart grids achieve more efficient energy management and optimization. This is of great significance to the high reliability, efficiency and sustainability of the power system, and also makes a positive contribution to the development of smart grids and sustainable energy integration.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

HS: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Writing–original draft. YR: Investigation, Methodology, Project administration, Resources, Writing–original draft. SW: Conceptualization, Formal Analysis, Methodology, Project administration, Resources, Software, Writing–original draft, Writing–review and editing. BZ: Investigation, Methodology, Project administration, Resources, Writing–review and editing. RY: Investigation, Project administration, Supervision, Visualization, Writing–review and editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work is supported by the Sub-project “Research on parameter design method of power electronic equipment control system based on new topology structure” of the Long Term Key Project of China Electric Power Research Institute, titled “New Power Electronic Control Technology and Equipment Prototype Supporting Safe and Efficient Operation of Large Power Grid” (Project No. XT83-23-007).

Acknowledgments

We would like to express our deep appreciation to the China Electric Power Research Institute for their generous support and funding, which made this research possible. We are also grateful to the Long Term Key Project for providing the framework and resources for our work.

Conflict of interest

Authors YR, SW, BZ, and RY were employed by China Electric Power Research Institute Co., Ltd.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Ahmad, W., Ayub, N., Ali, T., Irfan, M., Awais, M., Shiraz, M., et al. (2020). Towards short term electricity load forecasting using improved support vector machine and extreme learning machine. Energies 13, 2907. doi:10.3390/en13112907

CrossRef Full Text | Google Scholar

Al-Badi, A. H., Ahshan, R., Hosseinzadeh, N., Ghorbani, R., and Hossain, E. (2020). Survey of smart grid concepts and technological demonstrations worldwide emphasizing on the Oman perspective. Appl. Syst. Innov. 3, 5. doi:10.3390/asi3010005

CrossRef Full Text | Google Scholar

Alladi, T., Chamola, V., Rodrigues, J. J., and Kozlov, S. A. (2019). Blockchain in smart grids: a review on different use cases. Sensors 19, 4862. doi:10.3390/s19224862

PubMed Abstract | CrossRef Full Text | Google Scholar

Alotaibi, I., Abido, M. A., Khalid, M., and Savkin, A. V. (2020). A comprehensive review of recent advances in smart grids: a sustainable future with renewable energy resources. Energies 13, 6269. doi:10.3390/en13236269

CrossRef Full Text | Google Scholar

Arumugham, V., Ghanimi, H. M., Pustokhin, D. A., Pustokhina, I. V., Ponnam, V. S., Alharbi, M., et al. (2023). An artificial-intelligence-based renewable energy prediction program for demand-side management in smart grids. Sustainability 15, 5453. doi:10.3390/su15065453

CrossRef Full Text | Google Scholar

Bhatt, D., Patel, C., Talsania, H., Patel, J., Vaghela, R., Pandya, S., et al. (2021). Cnn variants for computer vision: history, architecture, application, challenges and future scope. Electronics 10, 2470. doi:10.3390/electronics10202470

CrossRef Full Text | Google Scholar

Chen, R., Wang, Y., Li, G., Yan, D., and Cao, H. (2022). “Pre-training models based knowledge graph multi-hop reasoning for smart grid technology,” in Proceedings of 2021 5th Chinese conference on swarm intelligence and cooperative control (Springer), 1866–1875.

CrossRef Full Text | Google Scholar

Cheon, H., Dziewulska, K. H., Moosic, K. B., Olson, K. C., Gru, A. A., Feith, D. J., et al. (2020). Advances in the diagnosis and treatment of large granular lymphocytic leukemia. Curr. Hematol. malignancy Rep. 15, 103–112. doi:10.1007/s11899-020-00565-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Daniels, J., Doukas, P. G., Escala, M. E. M., Ringbloom, K. G., Shih, D. J., Yang, J., et al. (2020). Cellular origins and genetic landscape of cutaneous gamma delta t cell lymphomas. Nat. Commun. 11, 1806. doi:10.1038/s41467-020-15572-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Dhruv, P., and Naskar, S. (2020). Image classification using convolutional neural network (cnn) and recurrent neural network (rnn): a review. Mach. Learn. Inf. Process. Proc. ICMLIP 2019, 367–381. doi:10.1007/978-981-15-1884-3_34

CrossRef Full Text | Google Scholar

Fan, J., Zhang, K., Huang, Y., Zhu, Y., and Chen, B. (2023). Parallel spatio-temporal attention-based tcn for multivariate time series prediction. Neural Comput. Appl. 35, 13109–13118. doi:10.1007/s00521-021-05958-z

CrossRef Full Text | Google Scholar

Hafeez, G., Alimgeer, K. S., and Khan, I. (2020). Electric load forecasting based on deep learning and optimized by heuristic algorithm in smart grid. Appl. Energy 269, 114915. doi:10.1016/j.apenergy.2020.114915

CrossRef Full Text | Google Scholar

Han, S.-Y., Zhao, Q., Sun, Q.-W., Zhou, J., and Chen, Y.-H. (2022). Engs-dgr: traffic flow forecasting with indefinite forecasting interval by ensemble gcn, seq2seq, and dynamic graph reconfiguration. Appl. Sci. 12, 2890. doi:10.3390/app12062890

CrossRef Full Text | Google Scholar

Hossain, M. J., and Rahnamay-Naeini, M. (2021). “State estimation in smart grids using temporal graph convolution networks,” in Proceedings of the 2021 north American power symposium (NAPS), College Station, TX, USA, November 2021 (IEEE), 01–05.

CrossRef Full Text | Google Scholar

Huang, X., Hong, S. H., Yu, M., Ding, Y., and Jiang, J. (2019). Demand response management for industrial facilities: a deep reinforcement learning approach. IEEE Access 7, 82194–82205. doi:10.1109/access.2019.2924030

CrossRef Full Text | Google Scholar

Hui, H., Ding, Y., Shi, Q., Li, F., Song, Y., and Yan, J. (2020). 5g network-based internet of things for demand response in smart grid: a survey on application potential. Appl. Energy 257, 113972. doi:10.1016/j.apenergy.2019.113972

CrossRef Full Text | Google Scholar

Ibrahim, M. S., Dong, W., and Yang, Q. (2020). Machine learning driven smart electric power systems: current trends and new perspectives. Appl. Energy 272, 115237. doi:10.1016/j.apenergy.2020.115237

CrossRef Full Text | Google Scholar

Iqbal, H. K., Malik, F. H., Muhammad, A., Qureshi, M. A., Abbasi, M. N., and Chishti, A. R. (2021). A critical review of state-of-the-art non-intrusive load monitoring datasets. Electr. Power Syst. Res. 192, 106921. doi:10.1016/j.epsr.2020.106921

CrossRef Full Text | Google Scholar

Le, T.-T.-H., Heo, S., and Kim, H. (2021). Toward load identification based on the hilbert transform and sequence to sequence long short-term memory. IEEE Trans. Smart Grid 12, 3252–3264. doi:10.1109/tsg.2021.3066570

CrossRef Full Text | Google Scholar

Leng, J., Ruan, G., Song, Y., Liu, Q., Fu, Y., Ding, K., et al. (2021). A loosely-coupled deep reinforcement learning approach for order acceptance decision of mass-individualized printed circuit board manufacturing in industry 4.0. J. Clean. Prod. 280, 124405. doi:10.1016/j.jclepro.2020.124405

CrossRef Full Text | Google Scholar

Li, Q., Zhu, Y., Ding, J., Li, W., Sun, W., and Ding, L. (2022a). Deep reinforcement learning based resource allocation for cloud edge collaboration fault detection in smart grid. CSEE J. Power Energy Syst. doi:10.17775/CSEEJPES.2021.02390

CrossRef Full Text | Google Scholar

Li, Y., Nie, J., and Chao, X. (2020). Do we really need deep cnn for plant diseases identification? Comput. Electron. Agric. 178, 105803. doi:10.1016/j.compag.2020.105803

CrossRef Full Text | Google Scholar

Li, Y., Ren, R., Huang, B., Wang, R., Sun, Q., Gao, D. W., et al. (2022b). Distributed hybrid-triggering-based secure dispatch approach for smart grid against dos attacks. IEEE Trans. Syst. Man, Cybern. Syst. 53, 3574–3587. doi:10.1109/tsmc.2022.3228780

CrossRef Full Text | Google Scholar

Li, Y., Wei, X., Li, Y., Dong, Z., and Shahidehpour, M. (2022c). Detection of false data injection attacks in smart grid: a secure federated deep learning approach. IEEE Trans. Smart Grid 13, 4862–4872. doi:10.1109/tsg.2022.3204796

CrossRef Full Text | Google Scholar

Liu, L.-N., Yang, G.-H., and Wasly, S. (2023). Distributed predefined-time dual-mode energy management for a microgrid over event-triggered communication. IEEE Trans. Industrial Inf., 1–11. doi:10.1109/tii.2023.3304025

CrossRef Full Text | Google Scholar

Lv, L., Wu, Z., Zhang, J., Zhang, L., Tan, Z., and Tian, Z. (2021). A vmd and lstm based hybrid model of load forecasting for power grid security. IEEE Trans. Industrial Inf. 18, 6474–6482. doi:10.1109/tii.2021.3130237

CrossRef Full Text | Google Scholar

Massaoudi, M., Abu-Rub, H., Refaat, S. S., Chihi, I., and Oueslati, F. S. (2021). Deep learning in smart grid technology: a review of recent advancements and future prospects. IEEE Access 9, 54558–54578. doi:10.1109/access.2021.3071269

CrossRef Full Text | Google Scholar

Mohammadi, F. (2021). Emerging challenges in smart grid cybersecurity enhancement: a review. Energies 14, 1380. doi:10.3390/en14051380

CrossRef Full Text | Google Scholar

O’Dwyer, E., Pan, I., Acha, S., and Shah, N. (2019). Smart energy systems for sustainable smart cities: current developments, trends and future directions. Appl. energy 237, 581–597. doi:10.1016/j.apenergy.2019.01.024

CrossRef Full Text | Google Scholar

Peng, S., Zhang, Z., Deng, R., and Cheng, P. (2023). Localizing false data injection attacks in smart grid: a spectrum-based neural network approach. IEEE Trans. Smart Grid 14, 4827–4838. doi:10.1109/tsg.2023.3261970

CrossRef Full Text | Google Scholar

Reddy, N. P., Pasdeloup, D., Zadeh, M. K., and Skjetne, R. (2019). “An intelligent power and energy management system for fuel cell/battery hybrid electric vehicle using reinforcement learning,” in Proceedings of the 2019 IEEE transportation electrification conference and expo (ITEC), Detroit, MI, USA, June 2019 (IEEE), 1–6.

CrossRef Full Text | Google Scholar

Takiddin, A., Ismail, M., Zafar, U., and Serpedin, E. (2022). Deep autoencoder-based anomaly detection of electricity theft cyberattacks in smart grids. IEEE Syst. J. 16, 4106–4117. doi:10.1109/jsyst.2021.3136683

CrossRef Full Text | Google Scholar

Wang, X., Liu, Y., and Choo, K.-K. R. (2020). Fault-tolerant multisubset aggregation scheme for smart grid. IEEE Trans. Industrial Inf. 17, 4065–4072. doi:10.1109/tii.2020.3014401

CrossRef Full Text | Google Scholar

Xiao, J., and Zhou, Z. (2020). “Research progress of rnn language model,” in Proceedings of the 2020 IEEE international conference on artificial intelligence and computer applications (ICAICA), Dalian, China, June 2020 (IEEE), 1285–1288.

CrossRef Full Text | Google Scholar

Xiong, G., Przystupa, K., Teng, Y., Xue, W., Huan, W., Feng, Z., et al. (2021). Online measurement error detection for the electronictransformer in a smart grid. Energies 14, 3551. doi:10.3390/en14123551

CrossRef Full Text | Google Scholar

Xu, A., Wu, T., Zhang, Y., Hu, Z., and Jiang, Y. (2021). “Graph-based time series edge anomaly detection in smart grid,” in Proceedings of the 2021 7th IEEE intl conference on big data security on cloud (BigDataSecurity), IEEE intl conference on high performance and smart computing, (HPSC) and IEEE intl conference on intelligent data and security (IDS), NY, USA, May 2021 (IEEE), 1–6.

CrossRef Full Text | Google Scholar

Ye, Y., Qiu, D., Wu, X., Strbac, G., and Ward, J. (2020). Model-free real-time autonomous control for a residential multi-energy system using deep reinforcement learning. IEEE Trans. Smart Grid 11, 3068–3082. doi:10.1109/tsg.2020.2976771

CrossRef Full Text | Google Scholar

Zhang, F., Liu, Q., Liu, Y., Tong, N., Chen, S., and Zhang, C. (2020). Novel fault location method for power systems based on attention mechanism and double structure gru neural network. IEEE Access 8, 75237–75248. doi:10.1109/access.2020.2988909

CrossRef Full Text | Google Scholar

Zhang, Z., and Hong, W.-C. (2019). Electric load forecasting by complete ensemble empirical mode decomposition adaptive noise and support vector regression with quantum-based dragonfly algorithm. Nonlinear Dyn. 98, 1107–1136. doi:10.1007/s11071-019-05252-7

CrossRef Full Text | Google Scholar

Zhang, Z., Zhang, D., and Qiu, R. C. (2019). Deep reinforcement learning for power system applications: an overview. CSEE J. Power Energy Syst. 6, 213–225. doi:10.17775/CSEEJPES.2019.00920

CrossRef Full Text | Google Scholar

Zhou, S., Hu, Z., Gu, W., Jiang, M., Chen, M., Hong, Q., et al. (2020). Combined heat and power system intelligent economic dispatch: a deep reinforcement learning approach. Int. J. Electr. power and energy Syst. 120, 106016. doi:10.1016/j.ijepes.2020.106016

CrossRef Full Text | Google Scholar

Keywords: smart grid, deep learning, optimization of intelligent systems, electric load type prediction, multi-source data, data analysis

Citation: Sun H, Ren Y, Wang S, Zhao B and Yin R (2023) Smart grid power load type forecasting: research on optimization methods of deep learning models. Front. Energy Res. 11:1321459. doi: 10.3389/fenrg.2023.1321459

Received: 14 October 2023; Accepted: 27 November 2023;
Published: 29 December 2023.

Edited by:

Hengrui Ma, Qinghai University, China

Reviewed by:

José Baptista, University of Trás-os-Montes and Alto Douro, Portugal
Yushuai Li, University of Oslo, Norway

Copyright © 2023 Sun, Ren, Wang, Zhao and Yin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shanshan Wang, cnloNTEzMTIxODM3QG91dGxvb2suY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.