
A Class of Reinforcement Learning Algorithm Based on Heuristic Search

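Cite this article:	TANG Zhong-yong, FU Qiang, ZHUO Jia, CHEN Huan-wen. A Class of Reinforcement Learning Algorithm Based on Heuristic Search[J]. Microcomputer Development, 2006, 16(8): 41-43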

Authors:	TANG Zhong-yong, FU Qiang, ZHUO Jia, CHEN Huan-wen

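Affiliation:	College of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha, Hunan 410076, China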

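Abstract:	Reinforcement learning has been shown to be a viable new method in the control field. Compared with other methods, it handles unknown environments well, but it is still not an effective method. Fortunately, in the real world an agent usually has some prior knowledge of its environment, and this knowledge can serve as heuristic information. Heuristic search is a widely used search method that is very fast, but it requires accurate heuristic information, which is sometimes hard to obtain. This paper analyzes and compares the characteristics of heuristic search and reinforcement learning, and proposes a new class of reinforcement learning algorithms based on heuristic search. Preliminary experimental results show good performance.
Keywords:	heuristic search; reinforcement learning; heuristic SARSA
Article ID:	1673-629X(2006)08-0041-03
Revised:	2005-11-16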
A Class of Reinforcement Learning Algorithm Based on Heuristic Search

TANG Zhong-yong, FU Qiang, ZHUO Jia, CHEN Huan-wen. A Class of Reinforcement Learning Algorithm Based on Heuristic Search[J]. Microcomputer Development, 2006, 16(8): 41-43

Authors:	TANG Zhong-yong, FU Qiang, ZHUO Jia, CHEN Huan-wen

Abstract:	Reinforcement learning has been shown to be a viable new method in the control field. It handles problems in unknown environments better than other methods, but it is not yet a very effective method. Fortunately, in the real world an agent often has some knowledge of its environment, which can be used as heuristic information. Heuristic search is a very effective search method that searches very quickly, but it needs very precise heuristic information, which may be hard to obtain in a complex environment. The characteristics of heuristic search and reinforcement learning are compared, and a class of reinforcement learning algorithms based on heuristic search is introduced. Preliminary empirical results show better performance than previous methods.

Keywords:	heuristic search; reinforcement learning; H-SARSA
This article is indexed in CNKI and other databases.
