首页 | 本学科首页   官方微博 | 高级检索  
     


Maximizing the set of recurrent states of an MDP subject to convex constraints
Authors:Eduardo Arvelo  Nuno C. Martins
Affiliation:Department of Electrical and Computer Engineering, University of Maryland, College Park, MD, 20742, USA
Abstract:This paper focuses on the design of time-homogeneous fully observed Markov decision processes (MDPs), with finite state and action spaces. The main objective is to obtain policies that generate the maximal set of recurrent states, subject to convex constraints on the set of invariant probability mass functions. We propose a design method that relies on a finitely parametrized convex program inspired on principles of entropy maximization. A numerical example is provided to illustrate these ideas.
Keywords:Maximum entropy   Markov decision problems   Markov models   Convex optimization   Optimal control
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号