TY - GEN
T1 - Multi-UAV Sliding Mode Formation Control Based on Reinforcement Learning
AU - Xie, Zihan
AU - Li, Yongyuan
AU - Guan, Yingzi
AU - Wei, Changzhu
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2023
Y1 - 2023
N2 - In this paper, a sliding mode controller with fixed time convergence is designed for the UAV formation control problem, and a controller parameter online tuning method is proposed as well. First, the formation control problem is described and a formation control model is developed. Second, a formation controller based on terminal sliding mode theory is designed, whose fixed-time convergence is proved theoretically. Besides, aiming to the problem that sliding mode controller parameters are difficult to determine and lack of adaptability, a controller parameter tuning method based on actor-critic framework is proposed by taking maximum acceleration and energy cost into account. Simulation results show that the control method proposed in this paper can generate the desired formation quickly, and effectively reduce the convergence time and energy cost, with good dynamic performance.
AB - In this paper, a sliding mode controller with fixed time convergence is designed for the UAV formation control problem, and a controller parameter online tuning method is proposed as well. First, the formation control problem is described and a formation control model is developed. Second, a formation controller based on terminal sliding mode theory is designed, whose fixed-time convergence is proved theoretically. Besides, aiming to the problem that sliding mode controller parameters are difficult to determine and lack of adaptability, a controller parameter tuning method based on actor-critic framework is proposed by taking maximum acceleration and energy cost into account. Simulation results show that the control method proposed in this paper can generate the desired formation quickly, and effectively reduce the convergence time and energy cost, with good dynamic performance.
KW - Formation control
KW - Reinforcement learning
KW - Sliding mode
UR - https://www.scopus.com/pages/publications/85135851078
U2 - 10.1007/978-981-19-3998-3_152
DO - 10.1007/978-981-19-3998-3_152
M3 - 会议稿件
AN - SCOPUS:85135851078
SN - 9789811939976
T3 - Lecture Notes in Electrical Engineering
SP - 1628
EP - 1639
BT - Proceedings of 2021 5th Chinese Conference on Swarm Intelligence and Cooperative Control
A2 - Ren, Zhang
A2 - Hua, Yongzhao
A2 - Wang, Mengyi
PB - Springer Science and Business Media Deutschland GmbH
T2 - 5th Chinese Conference on Swarm Intelligence and Cooperative Control, CCSICC 2021
Y2 - 19 January 2022 through 22 January 2022
ER -