TY - GEN
T1 - An approximate dynamic programming approach for model-free control of switched systems
AU - Lu, Wenjie
AU - Ferrari, Silvia
PY - 2013
Y1 - 2013
N2 - Several approximate dynamic programming (ADP) algorithms have been developed and demonstrated for the model-free control of continuous and discrete dynamical systems. However, their applicability to hybrid systems that involve both discrete and continuous state and control variables has yet to be demonstrated in the literature. This paper presents an ADP approach for hy- brid systems (hybrid-ADP) that obtains the optimal control law and discrete action sequence via online learning. New recursive relationships for hybrid-ADP are presented for switched hybrid systems that are possibly nonlinear. In order to demonstrate the ability of the proposed ADP algorithm to converge to the optimal solution, the approach is demonstrated on a switched, linear hybrid system with a quadratic cost function, for which there exists an analytical solution. The results show that the ADP algorithm is capable of converging to the optimal switched control law, by minimizing the cost-to-go online, based on an observable state vector.
AB - Several approximate dynamic programming (ADP) algorithms have been developed and demonstrated for the model-free control of continuous and discrete dynamical systems. However, their applicability to hybrid systems that involve both discrete and continuous state and control variables has yet to be demonstrated in the literature. This paper presents an ADP approach for hy- brid systems (hybrid-ADP) that obtains the optimal control law and discrete action sequence via online learning. New recursive relationships for hybrid-ADP are presented for switched hybrid systems that are possibly nonlinear. In order to demonstrate the ability of the proposed ADP algorithm to converge to the optimal solution, the approach is demonstrated on a switched, linear hybrid system with a quadratic cost function, for which there exists an analytical solution. The results show that the ADP algorithm is capable of converging to the optimal switched control law, by minimizing the cost-to-go online, based on an observable state vector.
UR - https://www.scopus.com/pages/publications/84902341657
U2 - 10.1109/CDC.2013.6760475
DO - 10.1109/CDC.2013.6760475
M3 - 会议稿件
AN - SCOPUS:84902341657
SN - 9781467357173
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 3837
EP - 3844
BT - 2013 IEEE 52nd Annual Conference on Decision and Control, CDC 2013
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 52nd IEEE Conference on Decision and Control, CDC 2013
Y2 - 10 December 2013 through 13 December 2013
ER -