Data-based optimal control of multiagent systems: A reinforcement learning design approach

  • Jilie Zhang
  • , Zhanshan Wang
  • , Hongwei Zhang*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

This paper studies an optimal consensus tracking problem of heterogeneous linear multiagent systems. By introducing tracking error dynamics, the optimal tracking problem is reformulated as finding a Nash-equilibrium solution to multiplayer games, which can be done by solving associated coupled Hamilton-Jacobi equations. A data-based error estimator is designed to obtain the data-based control for the multiagent systems. Using the quadratic functional to approximate every agent's value function, we can obtain the optimal cooperative control by the input-output (I/O) ${Q}$-learning algorithm with a value iteration technique in the least-square sense. The control law solves the optimal consensus problem for multiagent systems with measured I/O information, and does not rely on the model of multiagent systems. A numerical example is provided to illustrate the effectiveness of the proposed algorithm.

Original languageEnglish
Article number8472167
Pages (from-to)4441-4449
Number of pages9
JournalIEEE Transactions on Cybernetics
Volume49
Issue number12
DOIs
StatePublished - Dec 2019
Externally publishedYes

Keywords

  • Consensus
  • data-based control
  • optimal cooperative control
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Data-based optimal control of multiagent systems: A reinforcement learning design approach'. Together they form a unique fingerprint.

Cite this