A counterexample on the optimality equation in Markov decision chains with the average cost criterion期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

A counterexample on the optimality equation in Markov decision chains with the average cost criterion

Authors:	Rolando Cavazos-Cadena

Abstract:	We consider Markov decision processes with denumerable state space and finite control sets; the performance index of a control policy is a long-run expected average cost criterion and the cost function is bounded below. For these models, the existence of average optimal stationary policies was recently established in [11] under very general assumptions. Such a result was obtained via an optimality inequality. Here, we use a simple example to prove that the conditions in [11] do not imply the existence of a solution to the average cost optimality equation.

Keywords:	Markov decision processes average cost criterion optimal stationary policies counterexample on the average cost optimality equation
本文献已被 ScienceDirect 等数据库收录！