
A unified approach to time-aggregated Markov decision processes

  • Harbin Institute of Technology Shenzhen
  • Shenzhen Institute of Advanced Technology

Research output: Contribution to journal › Article › peer-review

Abstract

This paper presents a unified approach to time-aggregated Markov decision processes (MDPs) with an average cost criterion. The approach is based on a framework in which a time-aggregated MDP constitutes a semi-Markov decision process (SMDP). By analyzing the performance sensitivity formulas of this SMDP, a number of optimization algorithms for time-aggregated MDPs, including those previously reported in the literature, can be developed in a simple and intuitive way.
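
The sketch below is not taken from the paper. It only illustrates, for one fixed policy, the semi-Markov view the abstract refers to: a Markov chain observed only at visits to a subset of states forms an embedded chain whose segments have random length and cost, and the long-run average cost equals the expected cost per segment divided by the expected segment length. The transition matrix `P`, the costs `f`, the subset `S`, and all function names are hypothetical values chosen for illustration.

```python
# Hypothetical 4-state chain under one fixed policy; every number is made up.
P = [
    [0.10, 0.20, 0.30, 0.40],
    [0.30, 0.10, 0.40, 0.20],
    [0.20, 0.30, 0.10, 0.40],
    [0.25, 0.25, 0.25, 0.25],
]
f = [1.0, 4.0, 2.0, 3.0]  # one-step cost in each state
S = [0, 1]                # states at which the aggregated process is observed
OUT = [j for j in range(len(P)) if j not in S]


def stationary(T, iters=2000):
    """Stationary distribution of a row-stochastic matrix by power iteration."""
    n = len(T)
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * T[i][j] for i in range(n)) for j in range(n)]
    return pi


def until_return(r, iters=2000):
    """Expected total of r collected from a state outside S until S is hit.

    Solves g(j) = r[j] + sum_{k not in S} P[j][k] * g(k) by fixed-point iteration.
    """
    g = {j: 0.0 for j in OUT}
    for _ in range(iters):
        g = {j: r[j] + sum(P[j][k] * g[k] for k in OUT) for j in OUT}
    return g


def segment_total(r):
    """Expected total of r over one segment starting in each state of S."""
    g = until_return(r)
    return [r[i] + sum(P[i][k] * g[k] for k in OUT) for i in S]


def embedded_chain():
    """Transition matrix of the chain watched only at its visits to S."""
    rows = []
    for i in S:
        row = []
        for s in S:
            # hit[k]: probability that s is the first state of S reached from k
            hit = until_return([P[j][s] for j in range(len(P))])
            row.append(P[i][s] + sum(P[i][k] * hit[k] for k in OUT))
        rows.append(row)
    return rows


h_cost = segment_total(f)               # expected cost of one segment
h_len = segment_total([1.0] * len(P))   # expected length of one segment
pi_tilde = stationary(embedded_chain())

# Ratio form of the average cost for the semi-Markov process on S.
eta_aggregated = (
    sum(p * c for p, c in zip(pi_tilde, h_cost))
    / sum(p * l for p, l in zip(pi_tilde, h_len))
)
# Average cost computed directly on the original chain, for comparison.
eta_direct = sum(p * c for p, c in zip(stationary(P), f))

print(eta_aggregated, eta_direct)
```

The two printed values should agree up to numerical error, which is the sense in which the aggregated process preserves the average cost of the original chain. The sketch evaluates a single policy only; it does not implement any of the optimization algorithms discussed in the paper.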

Original language: English
Pages (from-to): 77-84
Number of pages: 8
Journal: Automatica
Volume: 67
State: Published - 1 May 2016
Externally published: Yes

Keywords

  • Markov decision process
  • Performance sensitivity
  • Semi-Markov decision process
  • Time aggregation
