电力系统调度决策:一种示教学习辅助加速的安全强化学习方法

Translated title of the contribution: Power System Dispatch: An Accelerated Safe Reinforcement Learning Approach by Incorporating Learning From Demonstration
  • Zhongkai Yi
  • , Shouyu Liang
  • , Wei Wang
  • , Wei Jiang
  • , Cheng Yang
  • , Yan Xin

Research output: Contribution to journalArticlepeer-review

Abstract

With the growing penetration of renewable energy generation and the increasing complexity of the power system environment, it is necessary to formulate a hybrid knowledge-data-driven dispatch strategy for modern power systems. In light of this, an imitation learning approach is proposed to exploit the expert knowledge, which provides demonstrations for power system economic dispatch strategy using neural networks. Furthermore, an accelerated safe reinforcement learning approach is proposed by incorporating learning from demonstration, which can make fast decisions in real-time operation. By incorporating the learning from demonstration approach, the algorithm convergence speed is significantly accelerated, the dispatch strategy is optimized, the operation cost is reduced, and the power flow violation risk is alleviated. Numerical simulation results verify the advantages of the proposed approach in improving the convergence efficiency of the reinforcement learning algorithm and promoting the security and economy of the power systems.

Translated title of the contributionPower System Dispatch: An Accelerated Safe Reinforcement Learning Approach by Incorporating Learning From Demonstration
Original languageChinese (Traditional)
Pages (from-to)5084-5096
Number of pages13
JournalZhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering
Volume44
Issue number13
DOIs
StatePublished - 5 Jul 2024
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Fingerprint

Dive into the research topics of 'Power System Dispatch: An Accelerated Safe Reinforcement Learning Approach by Incorporating Learning From Demonstration'. Together they form a unique fingerprint.

Cite this