Abstract
This paper presents a unified approach to time-aggregated Markov decision processes (MDPs) with an average cost criterion. The approach is based on a framework in which a time-aggregated MDP constitutes a semi-Markov decision process (SMDP). By analyzing the performance sensitivity formulas of this SMDP, a number of optimization algorithms for time-aggregated MDPs, including those previously reported in the literature, can be developed in a simple and intuitive way.
| Original language | English |
|---|---|
| Pages (from-to) | 77-84 |
| Number of pages | 8 |
| Journal | Automatica |
| Volume | 67 |
| State | Published - 1 May 2016 |
| Externally published | Yes |
Keywords
- Markov decision process
- Performance sensitivity
- Semi-Markov decision process
- Time aggregation
Fingerprint
Dive into the research topics of 'A unified approach to time-aggregated Markov decision processes'. Together they form a unique fingerprint.