
Instance-Based Approximate Solution of POMDP Problems (基于实例的POMDP问题的近似求解)

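Cite this article:	XIU Guo-ming (修国明), ZHANG Ji-bin (张积滨), PAN Qi-shu (潘启树). Instance-based approximate solution of POMDP problems[J]. Computer Engineering and Applications (计算机工程与应用), 2008, 44(29): 82-85. DOI: 10.3778/j.issn.1002-8331.2008.29.022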

Authors:	XIU Guo-ming (修国明), ZHANG Ji-bin (张积滨), PAN Qi-shu (潘启树)

Affiliation:	School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China

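Abstract:	Combining heuristic solution methods with reinforcement learning techniques, this paper studies in depth instance-based algorithms for approximately solving POMDP problems: NNI, which is based on the nearest neighbor algorithm; ENNI, its parameterized enhanced version; and LWI, which is based on locally weighted regression. Comparative experiments report the performance of each algorithm in practical applications. The experiments show that instance-based methods for solving POMDP problems can obtain suboptimal solutions with good performance.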
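Keywords:	instance-based method; Partially Observable Markov Decision Process (POMDP); heuristic solution; reinforcement learning; nearest neighbor; locally weighted regression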
Received:	2007-11-06
Revised:	2008-02-26
Instance based approximate solution to POMDP problem

XIU Guo-ming, ZHANG Ji-bin, PAN Qi-shu. Instance based approximate solution to POMDP problem[J]. Computer Engineering and Applications, 2008, 44(29): 82-85. DOI: 10.3778/j.issn.1002-8331.2008.29.022

Authors:	XIU Guo-ming, ZHANG Ji-bin, PAN Qi-shu

Affiliation:	School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China

Abstract:	Combining heuristic solution with reinforcement learning techniques, this paper studies instance-based approximate solutions to the POMDP problem. It presents the nearest-neighbor-based algorithm NNI, its extended parameterized version ENNI, and the locally weighted regression based algorithm LWI. Performance analysis and comparison through experiments on a common workbench show that instance-based methods can produce good sub-optimal solutions to POMDP problems.

Keywords:	instance based method; Partially Observable Markov Decision Process (POMDP); heuristic solution; reinforcement learning; nearest neighbor; local weighted regression
This article is indexed in CNKI, Wanfang Data, and other databases.