Skip to main navigation Skip to search Skip to main content

具有稳定性保证的基于模型与条件神经过程的强化学习算法

Translated title of the contribution: Conditional neural processes for model-based reinforcement learning with stability guarantees
  • School of Astronautics, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Safety is an essential property that enables the further extensive applications of reinforcement learning. This paper introduces a framework of safe model-based reinforcement learning by employing the classic Lyapunov methods (uniformly ultimate boundness) in control theory with safety guarantees during both training and deployment without the intervention mechanism. More specifically, an efficient way is presented to collect data and learn the dynamic models in a safe region defined by iterated Lyapunov functions. On this basis, this paper proposes a practical and effective algorithm capable of gradually expanding the safe region while improving the control performance. Finally, illustrative examples are given to demonstrate the necessity and the validity of the obtained policy on an inverted pendulum.

Translated title of the contributionConditional neural processes for model-based reinforcement learning with stability guarantees
Original languageChinese (Traditional)
Pages (from-to)265-274
Number of pages10
JournalScientia Sinica Technologica
Volume54
Issue number2
DOIs
StatePublished - 2024
Externally publishedYes

Fingerprint

Dive into the research topics of 'Conditional neural processes for model-based reinforcement learning with stability guarantees'. Together they form a unique fingerprint.

Cite this