Strong 1-optimal stationary policies in denumerable Markov decision processes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Strong 1-optimal stationary policies in denumerable Markov decision processes

Authors:	R. Cavazos-Cadena

Abstract:	Usual conditions for existence of stationary average optimal policies in denumerable MDPs with general bounded rewards are shown to be also sufficient for strong 1-optimality. Moreover, we prove that all limit points of discounted optimal stationary policies when the discount factor goes to 1 are strong 1-optimal.

Keywords:	Markov decision processes Average optimality 1-optimality Average optimality equation Simultaneous Doeblin condition
本文献已被 ScienceDirect 等数据库收录！