An Improved TD3 Framework for CHP Economic Dispatch: Synergistic Optimization of Entropy Reward Mechanisms and Adaptive Noise

  • Xianji Jin
  • , Huaiming Guan
  • , Yongxu Chen
  • , Zhongwei Li*
  • , Zhenyu Zhang
  • , Weiming Tong
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study proposes AME-TD3, an enhanced deep reinforcement learning algorithm, to address the high computational complexity and poor adaptability of traditional methods in optimizing combined heat and power (CHP) systems. AME-TD3 integrates an entropy reward mechanism to balance exploration-exploitation and adaptive noise adjustment for dynamic system response, improving stability. A CHP economic dispatch model with power, CHP, and heat only units minimizes costs while meeting supply-demand constraints. Case studies (24/48-unit systems) show AME-TD3 outperforms TD3, DDPG, and particle swarm methods in minimum, average costs and standard deviation, demonstrating superior adaptability in large-scale dynamic environments. Results confirm its effectiveness in economic optimization, stability, and multi-constraint handling for integrated energy systems.

Original languageEnglish
Title of host publication2025 7th International Conference on Information Science, Electrical and Automation Engineering, ISEAE 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages935-940
Number of pages6
ISBN (Electronic)9798331510381
DOIs
StatePublished - 2025
Externally publishedYes
Event7th International Conference on Information Science, Electrical and Automation Engineering, ISEAE 2025 - Harbin, China
Duration: 18 Apr 202520 Apr 2025

Publication series

Name2025 7th International Conference on Information Science, Electrical and Automation Engineering, ISEAE 2025

Conference

Conference7th International Conference on Information Science, Electrical and Automation Engineering, ISEAE 2025
Country/TerritoryChina
CityHarbin
Period18/04/2520/04/25

Keywords

  • Adaptive Entropy Reward
  • Combined Heat and Power Systems
  • Deep Reinforcement Learning
  • Economic Dispatch

Fingerprint

Dive into the research topics of 'An Improved TD3 Framework for CHP Economic Dispatch: Synergistic Optimization of Entropy Reward Mechanisms and Adaptive Noise'. Together they form a unique fingerprint.

Cite this