首页 | 本学科首页   官方微博 | 高级检索  
     


A counterexample on the optimality equation in Markov decision chains with the average cost criterion
Authors:Rolando Cavazos-Cadena
Abstract:We consider Markov decision processes with denumerable state space and finite control sets; the performance index of a control policy is a long-run expected average cost criterion and the cost function is bounded below. For these models, the existence of average optimal stationary policies was recently established in [11] under very general assumptions. Such a result was obtained via an optimality inequality. Here, we use a simple example to prove that the conditions in [11] do not imply the existence of a solution to the average cost optimality equation.
Keywords:Markov decision processes   average cost criterion   optimal stationary policies   counterexample on the average cost optimality equation
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号