Abstract
We discuss the problems of discounted-cost performance optimization for a class of semi-Markov decision processes (SMDPs). We define a matrix which can be used as the infinitesimal generator of a Markov process. The discounted Poisson equation is proposed for an SMDP by using this matrix, from which the α-potential is defined. The optimality equation satisfied by the optimal stationary policy is given and the relation between discounted model and average model is discussed. Two iteration algorithms to find ε-optimal policies are proposed and the proofs of convergence of these two algorithms are given. A numerical example is provided to illustrate the application of the algorithms.
| Original language | English |
|---|---|
| Pages (from-to) | 213-222 |
| Number of pages | 10 |
| Journal | European Journal of Control |
| Volume | 14 |
| Issue number | 3 |
| DOIs | |
| State | Published - 2008 |
| Externally published | Yes |
Keywords
- Discounted poisson equation
- Discounted-cost criteria
- Policy iteration
- Semi-Markov decision processes
- Value iteration
- α-potential
Fingerprint
Dive into the research topics of 'Performance optimization of semi-Markov decision processes with discounted-cost criteria'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver