AUTHOR=Zhao Huashi , Wu Zhichao , He Yubin , Fu Qiujia , Liang Shouyu , Ma Guang , Li Wenchao , Yang Qun 

TITLE=Combination optimization method of grid sections based on deep reinforcement learning with accelerated convergence speed

JOURNAL=Frontiers in Energy Research

VOLUME=11

YEAR=2023

URL=https://www.frontiersin.org/journals/energy-research/articles/10.3389/fenrg.2023.1269854

DOI=10.3389/fenrg.2023.1269854

ISSN=2296-598X

ABSTRACT=<p>A modern power system integrates more and more new energy and uses a large number of power electronic equipment, which makes it face more challenges in online optimization and real-time control. Deep reinforcement learning (DRL) has the ability of processing big data and high-dimensional features, as well as the ability of independently learning and optimizing decision-making in complex environments. This paper explores a DRL-based online combination optimization method of grid sections for a large complex power system. In order to improve the convergence speed of the model, it proposes to discretize the output action of the unit and simplify the action space. It also designs a reinforcement learning loss function with strong constraints to further improve the convergence speed of the model and facilitate the algorithm to obtain a stable solution. Moreover, to avoid the local optimal solution problem caused by the discretization of the output action, this paper proposes to use the annealing optimization algorithm to make the granularity of the unit output finer. The proposed method in this paper has been verified on an IEEE 118-bus system. The experimental results show that it has fast convergence speed and better performance and can obtain stable solutions.</p>