首页 | 本学科首页   官方微博 | 高级检索  
     


Strong 1-optimal stationary policies in denumerable Markov decision processes
Authors:R. Cavazos-Cadena
Abstract:Usual conditions for existence of stationary average optimal policies in denumerable MDPs with general bounded rewards are shown to be also sufficient for strong 1-optimality. Moreover, we prove that all limit points of discounted optimal stationary policies when the discount factor goes to 1 are strong 1-optimal.
Keywords:Markov decision processes   Average optimality   1-optimality   Average optimality equation   Simultaneous Doeblin condition
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号