首页 | 本学科首页   官方微博 | 高级检索  
     


Markov decision Processes with fractional costs
Authors:Zhiyuan Ren Krogh  BH
Affiliation:Signal Electron. & Embedded Syst. Lab., Gen. Electr. Global Res. Center, Niskayuna, NY, USA;
Abstract:Certain methods for constructing embedded Markov decision processes (MDPs) lead to performance measures that are the ratio of two long-run averages. For such MDPs with finite state and action spaces and under an ergodicity assumption, this note presents algorithms for computing optimal policies based on policy iterations, linear programming, value iterations and Q-learning.
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号