首页 | 本学科首页   官方微博 | 高级检索  
     

基于实例的POMDP问题的近似求解
引用本文:修国明,张积滨,潘启树. 基于实例的POMDP问题的近似求解[J]. 计算机工程与应用, 2008, 44(29): 82-85. DOI: 10.3778/j.issn.1002-8331.2008.29.022
作者姓名:修国明  张积滨  潘启树
作者单位:哈尔滨工业大学 计算机科学与技术学院,哈尔滨 150001
摘    要:结合启发式求解和增强学习技术,深入研究了基于实例的POMDP问题的近似求解算法,包括基于最近邻算法法的NNI及它的参数化增强版本ENNI和基于局部加权回归算法的LWI,并通过实验对比,给出了相应算法在实际应用中的性能。实验证明,基于实例的方法来求解POMDP问题,能够获得性能较好的次优解。

关 键 词:基于实例的方法  部分可观察马尔可夫决策过程(POMDP)  启发式求解  增强学习  最近邻  局部加权回归  
收稿时间:2007-11-06
修稿时间:2008-2-26 

Instance based approximate solution to POMDP problem
XIU Guo-ming,ZHANG Ji-bin,PAN Qi-shu. Instance based approximate solution to POMDP problem[J]. Computer Engineering and Applications, 2008, 44(29): 82-85. DOI: 10.3778/j.issn.1002-8331.2008.29.022
Authors:XIU Guo-ming  ZHANG Ji-bin  PAN Qi-shu
Affiliation:School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001,China
Abstract:In this paper,with the idea of combining heuristic solution and reinforcement learning technique,the instance based approximate solution to POMDP problem is studied and Nearest Neighbor based algorithm NNI and its extended parameterized version ENNI and Local Weighted Regression based algorithm LWI are presented.With the performance analyzed and compared through experiments on common workbench,solving POMDP problems using instance based methods can produce good sub-optimal solutions.
Keywords:instance based method  Partially Observable Markov Decision Process(POMDP)  heuristic solution  reinforcement learning  nearest neighbor  local weighted regression
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号