Skip to main navigation Skip to search Skip to main content

Reinforcement Learning for Dual-Control Aircraft Six-Degree-of-Freedom Attitude Control with System Uncertainty

  • Yuqi Yuan
  • , Di Zhou*
  • *Corresponding author for this work
  • School of Astronautics, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

This article proposes a near-optimal control strategy based on reinforcement learning, which is applied to the six-degree-of-freedom (6-DoF) attitude control of dual-control aircraft. In order to solve the problem that the existing reinforcement learning is difficult to apply to the high-dimensional multiple-input multiple-output (MIMO) systems, the Long Short-Term Memory (LSTM) neural network is introduced to replace the polynomial network in the adaptive dynamic programming (ADP) technique. Meanwhile, based on the Lyapunov method, a novel online adaptive updating law of LSTM neural network weights is given, and the stability of the system is verified. In the simulation process, the algorithm proposed in this article is applied to the six-degree-of-freedom attitude control problem of dual-control aircraft with system uncertainty. The simulation results show that the algorithm can achieve near-optimal control.

Original languageEnglish
Article number281
JournalAerospace
Volume11
Issue number4
DOIs
StatePublished - Apr 2024
Externally publishedYes

Keywords

  • dual-control nonlinear system
  • long short-term memory neural network
  • near-optimal control
  • online training
  • reinforcement learning
  • six-degree-of-freedom aircraft attitude control

Fingerprint

Dive into the research topics of 'Reinforcement Learning for Dual-Control Aircraft Six-Degree-of-Freedom Attitude Control with System Uncertainty'. Together they form a unique fingerprint.

Cite this