Research on a parallel reinforcement learning algorithm and its application

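Cite this article:	MENG Wei, HAN Xue-dong. Parallel reinforcement learning algorithm and its application[J]. Computer Engineering and Applications, 2009, 45(34): 25-28. DOI: 10.3778/j.issn.1002-8331.2009.34.008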

Authors:	MENG Wei, HAN Xue-dong

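Affiliations:	1. Information School, Beijing Forestry University, Beijing 100083; 2. 706 Institute of China Aerospace Science and Industry Corporation, Beijing 100854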

Funding:	Supported by a major project of the National Science and Technology Support Program of the 11th Five-Year Plan

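Abstract:	Reinforcement learning is an important machine learning method, but slow convergence is one of its main shortcomings in practical applications. To improve the efficiency of reinforcement learning, a parallel reinforcement learning algorithm is proposed. Multiple agents learn simultaneously. After each agent has learned for a certain period, the learning results are fused using D-S evidence theory, and each agent then carries out the next period of learning on the basis of the fused result, which improves the learning efficiency of the whole system. Experimental results demonstrate the feasibility and effectiveness of the method.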
Keywords:	parallel algorithms; reinforcement learning; Q-learning; D-S evidence theory; path planning
Received:	2009-08-11
Revised:	2009-10-09
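
The abstract names D-S (Dempster-Shafer) evidence theory as the fusion step but does not say how learning results are turned into evidence. As a rough illustration only, the standard Dempster rule of combination for two mass functions can be sketched as below. The function name `dempster_combine`, the dictionary representation, and the idea of treating candidate actions as hypotheses are assumptions made for this sketch, not details taken from the paper.

```python
from itertools import product


def dempster_combine(m1, m2):
    """Combine two mass functions with Dempster's rule of combination.

    Each mass function is a dict mapping a frozenset of hypotheses
    (a focal element) to its mass; the masses in each dict sum to 1.
    """
    combined = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        common = a & b
        if common:
            # Agreeing evidence: the product of masses supports the intersection.
            combined[common] = combined.get(common, 0.0) + mass_a * mass_b
        else:
            # Disjoint focal elements contribute to the conflict mass K.
            conflict += mass_a * mass_b
    if not combined:
        raise ValueError("total conflict: the two sources cannot be combined")
    # Normalise by 1 - K so the fused masses sum to 1 again.
    return {h: m / (1.0 - conflict) for h, m in combined.items()}


# Hypothetical example: two agents give evidence about which action is better.
both = frozenset({"left", "right"})
agent_1 = {frozenset({"left"}): 0.6, both: 0.4}
agent_2 = {frozenset({"right"}): 0.5, both: 0.5}
fused = dempster_combine(agent_1, agent_2)
```

Mass assigned to the full set `both` expresses ignorance, which is what distinguishes a mass function from an ordinary probability distribution over actions.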
Parallel reinforcement learning algorithm and its application

MENG Wei, HAN Xue-dong. Parallel reinforcement learning algorithm and its application[J]. Computer Engineering and Applications, 2009, 45(34): 25-28. DOI: 10.3778/j.issn.1002-8331.2009.34.008

Authors:	MENG Wei, HAN Xue-dong

Affiliation:	1. Information School, Beijing Forestry University, Beijing 100083, China; 2. 706 Institute of China Aerospace Science and Industry Corporation, Beijing 100854, China

Abstract:	Reinforcement learning is an important machine learning method. However, slow convergence is one of its main problems in practice. To improve the efficiency of reinforcement learning, this paper proposes a parallel reinforcement learning algorithm. The learning system contains multiple agents. Within a learning episode, each agent learns independently. After each learning episode, the results of all agents are fused using D-S evidence theory to obtain a common result, which all agents share in the next learning episode. Experiments show the feasibility and efficiency of the algorithm.

Keywords:	parallel algorithms; reinforcement learning; Q-learning; D-S evidence theory; path planning
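
The learn, fuse, and share cycle described in the abstract might look roughly like the sketch below. It is an illustration under stated assumptions, not the authors' implementation: the one-dimensional corridor task, the hyperparameters, and every function name are hypothetical, the agents are run one after another rather than in true parallel, and `fuse` is a visit-weighted average used as a simple stand-in for the D-S evidence fusion, whose details the abstract does not give.

```python
import random

N_STATES = 6          # hypothetical corridor: cells 0..5, cell 5 is the goal
GOAL = N_STATES - 1
ACTIONS = (-1, +1)    # move left or move right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2  # assumed hyperparameters


def run_episode(q, visits, rng, max_steps=50):
    """Run one tabular Q-learning episode, updating q and visits in place."""
    state = 0
    for _ in range(max_steps):
        if rng.random() < EPSILON:
            a = rng.randrange(len(ACTIONS))          # explore
        else:
            best = max(q[state])                     # exploit, random tie-break
            a = rng.choice([i for i, v in enumerate(q[state]) if v == best])
        nxt = min(max(state + ACTIONS[a], 0), GOAL)  # walls clamp the move
        done = nxt == GOAL
        target = 1.0 if done else GAMMA * max(q[nxt])
        q[state][a] += ALPHA * (target - q[state][a])
        visits[state][a] += 1
        state = nxt
        if done:
            break


def fuse(q_tables, visit_tables):
    """Stand-in fusion (NOT D-S): visit-weighted average of the agents' Q-values."""
    fused = [row[:] for row in q_tables[0]]
    for s in range(N_STATES):
        for a in range(len(ACTIONS)):
            total = sum(v[s][a] for v in visit_tables)
            if total:  # entries nobody visited keep the previous shared value
                fused[s][a] = sum(
                    q[s][a] * v[s][a] for q, v in zip(q_tables, visit_tables)
                ) / total
    return fused


def parallel_q_learning(n_agents=4, n_rounds=10, episodes_per_round=5, seed=0):
    """Alternate independent learning periods with a fusion step."""
    shared = [[0.0] * len(ACTIONS) for _ in range(N_STATES)]
    rngs = [random.Random(seed + i) for i in range(n_agents)]
    for _ in range(n_rounds):
        # Every agent starts the period from the same fused Q-table.
        q_tables = [[row[:] for row in shared] for _ in range(n_agents)]
        visit_tables = [
            [[0] * len(ACTIONS) for _ in range(N_STATES)] for _ in range(n_agents)
        ]
        for q, visits, rng in zip(q_tables, visit_tables, rngs):
            for _ in range(episodes_per_round):
                run_episode(q, visits, rng)
        shared = fuse(q_tables, visit_tables)
    return shared


q_table = parallel_q_learning()
```

Because each agent's period depends only on the shared table it starts from, the inner loop over agents could be handed to separate processes; it is kept sequential here to keep the sketch short.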
This article is indexed in VIP, Wanfang Data, and other databases.