首页 | 本学科首页   官方微博 | 高级检索  
     


Computing rank dependent utility in graphical models for sequential decision problems
Authors:Gildas Jeantet  Olivier Spanjaard
Affiliation:Laboratoire d?Informatique de Paris 6 (LIP6-CNRS), Université Pierre et Marie Curie (UPMC), 4 Place Jussieu, F-75252 Paris Cedex 05, France;Faculty of Industrial Engineering and Management, Technion - Israel Institute of Technology, Israel;Department of Mathematics and Computer Science, Philipps-Universität Marburg, Germany;CRIL, IUT de Lens, Rue de l''Université, SP 16, 62300 Lens, France;IRIT, Université Paul Sabatier, 118 Route de Narbonne, 31062 Toulouse Cedex 09, France
Abstract:This paper is devoted to automated sequential decision in AI. More precisely, we focus here on the Rank Dependent Utility (RDU) model. This model is able to encompass rational decision behaviors that the Expected Utility model cannot accommodate. However, the non-linearity of RDU makes it difficult to compute an RDU-optimal strategy in sequential decision problems. This has considerably slowed the use of RDU in operational contexts. In this paper, we are interested in providing new algorithmic solutions to compute an RDU-optimal strategy in graphical models. Specifically, we present algorithms for solving decision tree models and influence diagram models of sequential decision problems. For decision tree models, we propose a mixed integer programming formulation that is valid for a subclass of RDU models (corresponding to risk seeking behaviors). This formulation reduces to a linear program when mixed strategies are considered. In the general case (i.e., when there is no particular assumption on the parameters of RDU), we propose a branch and bound procedure to compute an RDU-optimal strategy among the pure ones. After highlighting the difficulties induced by the use of RDU in influence diagram models, we show how this latter procedure can be extended to optimize RDU in an influence diagram. Finally, we provide empirical evaluations of all the presented algorithms.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号