Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains

Authors:	Rolando Cavazos-Cadena

Abstract:	We consider average reward Markov decision processes with discrete time parameter and denumerable state space. We are concerned with the following problem: Find necessary and sufficient conditions so that, for arbitrary bounded reward function, the corresponding average reward optimality equation has a bounded solution. This problem is solved for a class of systems including the case in which, under the action of any stationary policy, the state space is an irreducible positive recurrent class.

Keywords:	Markov decision processes Average reward criterion Optimal stationary policies Necessary and sufficient conditions for the optimality equation Simultaneous Doeblin condition Bounded rewards
本文献已被 ScienceDirect 等数据库收录！