TY - GEN
T1 - Reinforcement Learning Control for a 2-DOF Flight Attitude Simulator
AU - Cai, Yu
AU - Ban, Xiaojun
AU - Zhou, Chengbao
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This paper focuses on the control problem for a two-degree-of-freedom flight simulator experimental setup, proposing a reinforcement learning-based flight attitude controller. The flight simulator aims to simulate the aircraft attitude control system, requiring consideration of its nonlinearity, model uncertainty, and the impact of external disturbances when designing the controller. Proximal Policy Optimization (PPO), as a policy gradient-based deep reinforcement learning algorithm, autonomously learns an approximately optimal controller based on a given objective function without the need for a mathematical model of the controlled object. Thanks to the application of the Actor-Critic framework and neural networks, the training of the two-degree-of-freedom flight simulator controller can rapidly converge within a short period. Simulations validate the generalization capability of the trained PPO controller and its robustness to external disturbances.
AB - This paper focuses on the control problem for a two-degree-of-freedom flight simulator experimental setup, proposing a reinforcement learning-based flight attitude controller. The flight simulator aims to simulate the aircraft attitude control system, requiring consideration of its nonlinearity, model uncertainty, and the impact of external disturbances when designing the controller. Proximal Policy Optimization (PPO), as a policy gradient-based deep reinforcement learning algorithm, autonomously learns an approximately optimal controller based on a given objective function without the need for a mathematical model of the controlled object. Thanks to the application of the Actor-Critic framework and neural networks, the training of the two-degree-of-freedom flight simulator controller can rapidly converge within a short period. Simulations validate the generalization capability of the trained PPO controller and its robustness to external disturbances.
KW - Aircraft Control
KW - Attitude Stabilization
KW - Proximal Policy Optimization
KW - Reinforcement Learning
UR - https://www.scopus.com/pages/publications/85200605696
U2 - 10.1109/FASTA61401.2024.10595236
DO - 10.1109/FASTA61401.2024.10595236
M3 - 会议稿件
AN - SCOPUS:85200605696
T3 - Proceedings of the 3rd Conference on Fully Actuated System Theory and Applications, FASTA 2024
SP - 1164
EP - 1169
BT - Proceedings of the 3rd Conference on Fully Actuated System Theory and Applications, FASTA 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd Conference on Fully Actuated System Theory and Applications, FASTA 2024
Y2 - 10 May 2024 through 12 May 2024
ER -