Abstract
This paper studies an optimal consensus tracking problem of heterogeneous linear multiagent systems. By introducing tracking error dynamics, the optimal tracking problem is reformulated as finding a Nash-equilibrium solution to multiplayer games, which can be done by solving associated coupled Hamilton-Jacobi equations. A data-based error estimator is designed to obtain the data-based control for the multiagent systems. Using the quadratic functional to approximate every agent's value function, we can obtain the optimal cooperative control by the input-output (I/O) ${Q}$-learning algorithm with a value iteration technique in the least-square sense. The control law solves the optimal consensus problem for multiagent systems with measured I/O information, and does not rely on the model of multiagent systems. A numerical example is provided to illustrate the effectiveness of the proposed algorithm.
| Original language | English |
|---|---|
| Article number | 8472167 |
| Pages (from-to) | 4441-4449 |
| Number of pages | 9 |
| Journal | IEEE Transactions on Cybernetics |
| Volume | 49 |
| Issue number | 12 |
| DOIs | |
| State | Published - Dec 2019 |
| Externally published | Yes |
Keywords
- Consensus
- data-based control
- optimal cooperative control
- reinforcement learning
Fingerprint
Dive into the research topics of 'Data-based optimal control of multiagent systems: A reinforcement learning design approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver