首页 | 本学科首页   官方微博 | 高级检索  
     


Multi-objective discounted Markov decision processes with expectation and variance criteria
Authors:QIU-SHENG LIU  KATSUHISA OHNO  HIROTAKA NAKAYAMA
Affiliation:1. Department of Systems Engineering , Nagoya Institute of Technology , Gokiso-cho, Showa-ku, Nagoya, 466, Japan;2. Department of Applied Mathematics , Faculty of Science, Konan University , Okamolo, Higashinada-ku, Kobe, 658, Japan.
Abstract:A multi-objective discounted Markov decision process (MDP) with expectation and variance criteria is discussed. First, difficulties in variance minimization are discussed and it is shown that variance minimization is much more difficult than the expectation optimization. Then, the multi-objective MDP with expectation and variance criteria is formulated as a multi-objective non-linear programming problem. An algorithm for finding a stationary satisfactory Pareto policy is proposed by applying the satisficing trade-off method of Nakayama. In the proposed algorithm, a decision-maker need not have a high degree of judgment and it is easy to take the balance of expectation and variance criteria and furthermore, the number of auxiliary optimization problems to be solved is quite small. Numerical examples show the efficiency of the proposed algorithm.
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号