首页 | 本学科首页   官方微博 | 高级检索  
     


Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains
Authors:Rolando Cavazos-Cadena
Abstract:We consider average reward Markov decision processes with discrete time parameter and denumerable state space. We are concerned with the following problem: Find necessary and sufficient conditions so that, for arbitrary bounded reward function, the corresponding average reward optimality equation has a bounded solution. This problem is solved for a class of systems including the case in which, under the action of any stationary policy, the state space is an irreducible positive recurrent class.
Keywords:Markov decision processes  Average reward criterion  Optimal stationary policies  Necessary and sufficient conditions for the optimality equation  Simultaneous Doeblin condition  Bounded rewards
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号