强化学习中的混合探索方案 Hybrid exploration strategy for reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

强化学习中的混合探索方案

引用本文：	李春贵,陈波. 强化学习中的混合探索方案[J]. 计算机工程与设计, 2006, 27(9): 1595-1597

作者姓名：	李春贵陈波

作者单位：	广西工学院,计算机系,广西,柳州,545006;广西工学院,计算机系,广西,柳州,545006

基金项目：	广西自然科学基金;广西工学院校科研和教改项目

摘要：	对强化学习中的探索方案进行了研究,描述了间接探索和直接探索两种方案各自的特点.综合它们的优点,提出了一种集直接探索和间接探索为一体的混合探索方案.该方案在学习的初始阶段,由于对环境的经验知识较少,侧重于直接探索;在获得比较多的经验后,侧重于间接探索,使得行动选择渐渐趋向于最优策略.实验表明该方案比纯粹的间接探索-greedy方案有更高的学习效率.
关键词：	强化学习 Markov决策过程探索利用 Q学习
文章编号：	1000-7024（2006）09-1595-03
收稿时间：	2005-03-10
修稿时间：	2005-03-10
Hybrid exploration strategy for reinforcement learning

LI Chun-gui,CHEN Bo. Hybrid exploration strategy for reinforcement learning[J]. Computer Engineering and Design, 2006, 27(9): 1595-1597

Authors:	LI Chun-gui CHEN Bo

Affiliation:	Department of Computer, Guangxi University of Technology, Liuzhou545006, China

Abstract:	Exploration strategies in reinforcement learning were discussed.And characters of direct or indirect exploration strategies were introduced.By integrating both of the strategies' strongpoint,a hybrid exploration strategy was presented.In this strategy,direct strategy is emphasized because oflacking forenvironment knowledge at the beginning stage,and after alot of such knowledge is acquired,indirect strategy will be adopted instead of the direct one,that will ensure the policy converging to the optimal policy.Experiment result show that the new strategy is more effective than-greedy exploration strategy.

Keywords:	reinforcement learning Markov decision process exploration exploitation Q learning
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏