首页 | 本学科首页   官方微博 | 高级检索  
     


A note on the convergence rate of the value iteration scheme in controlled Markov chains
Authors:Rolando Cavazos-Cadena
Abstract:This work is concerned with controlled Markov chains with bounded costs. Assuming that the transition probabilities satisfy a simultaneous Doeblin condition, it is shown that Schweitzer’s transformation on the transition law yields a strong ergodicity condition that implies that the solution to the average cost optimality equation can be approximated, at a geometric rate, via the value iteration scheme.
Keywords:Markov decision processes   Long-run average cost criterion   Simultaneous Doeblin condition   Geometric convergence rate
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号