A note on the convergence rate of the value iteration scheme in controlled Markov chains<span class="refPreview" id="refp_1"></span>期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

A note on the convergence rate of the value iteration scheme in controlled Markov chains

Authors:	Rolando Cavazos-Cadena

Abstract:	This work is concerned with controlled Markov chains with bounded costs. Assuming that the transition probabilities satisfy a simultaneous Doeblin condition, it is shown that Schweitzer’s transformation on the transition law yields a strong ergodicity condition that implies that the solution to the average cost optimality equation can be approximated, at a geometric rate, via the value iteration scheme.

Keywords:	Markov decision processes Long-run average cost criterion Simultaneous Doeblin condition Geometric convergence rate
本文献已被 ScienceDirect 等数据库收录！