Skip to main navigation Skip to search Skip to main content

Performance optimization of semi-Markov decision processes with discounted-cost criteria

  • Baoqun Yin*
  • , Yanjie Li
  • , Yaping Zhou
  • , Hongsheng Xi
  • *Corresponding author for this work
  • University of Science and Technology of China

Research output: Contribution to journalArticlepeer-review

Abstract

We discuss the problems of discounted-cost performance optimization for a class of semi-Markov decision processes (SMDPs). We define a matrix which can be used as the infinitesimal generator of a Markov process. The discounted Poisson equation is proposed for an SMDP by using this matrix, from which the α-potential is defined. The optimality equation satisfied by the optimal stationary policy is given and the relation between discounted model and average model is discussed. Two iteration algorithms to find ε-optimal policies are proposed and the proofs of convergence of these two algorithms are given. A numerical example is provided to illustrate the application of the algorithms.

Original languageEnglish
Pages (from-to)213-222
Number of pages10
JournalEuropean Journal of Control
Volume14
Issue number3
DOIs
StatePublished - 2008
Externally publishedYes

Keywords

  • Discounted poisson equation
  • Discounted-cost criteria
  • Policy iteration
  • Semi-Markov decision processes
  • Value iteration
  • α-potential

Fingerprint

Dive into the research topics of 'Performance optimization of semi-Markov decision processes with discounted-cost criteria'. Together they form a unique fingerprint.

Cite this