Maximizing the set of recurrent states of an MDP subject to convex constraints期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Maximizing the set of recurrent states of an MDP subject to convex constraints

Authors:	Eduardo Arvelo Nuno C. Martins

Affiliation:	Department of Electrical and Computer Engineering, University of Maryland, College Park, MD, 20742, USA

Abstract:	This paper focuses on the design of time-homogeneous fully observed Markov decision processes (MDPs), with finite state and action spaces. The main objective is to obtain policies that generate the maximal set of recurrent states, subject to convex constraints on the set of invariant probability mass functions. We propose a design method that relies on a finitely parametrized convex program inspired on principles of entropy maximization. A numerical example is provided to illustrate these ideas.

Keywords:	Maximum entropy Markov decision problems Markov models Convex optimization Optimal control
本文献已被 ScienceDirect 等数据库收录！