首页 | 本学科首页   官方微博 | 高级检索  
     


Incremental Value Iteration for Time-Aggregated Markov-Decision Processes
Authors:Tao Sun Qianchuan Zhao Luh   P.B.
Affiliation:Tsinghua Univ., Beijing;
Abstract:A value iteration algorithm for time-aggregated Markov-decision processes (MDPs) is developed to solve problems with large state spaces. The algorithm is based on a novel approach which solves a time aggregated MDP by incrementally solving a set of standard MDPs. Therefore, the algorithm converges under the same assumption as standard value iteration. Such assumption is much weaker than that required by the existing time aggregated value iteration algorithm. The algorithms developed in this paper are also applicable to MDPs with fractional costs.
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号