首页 | 本学科首页   官方微博 | 高级检索  
     


Recursive Approaches for Single Sample Path Based Markov Reward Processes
Authors:Hai‐Tao Fang  Han‐Fu Chen  Xi‐Ren Cao
Abstract:In this paper, two single sample path‐based recursive approaches for Markov decision problems are proposed. One is based on the simultaneous perturbation approach and can be applied to the general state problem, but its convergence rate is low. In this algorithm, the small perturbation on current parameters is necessary to get another sample path for comparison, but it may worsen the system. Hence, we introduce another approach, which directly estimates the gradient of the performance for optimization by “potential” theory. This algorithm, however, is limited to finite state space systems, but its convergence speed is higher than the first one. The estimate for gradient can be obtained by using the sample path with current parameters without any perturbation. This approach is more acceptable for practical applications.
Keywords:Markov decision processes  stochastic approximation  potential  recursive approach
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号