加强学习主要算法的比较研究 Comparative Study of the Main Reinforcement Learning Algorithms期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

加强学习主要算法的比较研究

引用本文：	郭茂祖,刘扬,黄梯云. 加强学习主要算法的比较研究[J]. 计算机工程与应用, 2001, 37(21): 16-18,48

作者姓名：	郭茂祖刘扬黄梯云

作者单位：	1. 哈尔滨工业大学计算机科学与技术学院 2. 哈尔滨工业大学管理学院

基金项目：	国家自然科学基金项目(编号:70071008)

摘要：	文章介绍了加强学习模型,分别给出了加强学习的四个主要算法:动态规划、蒙特卡罗算法、时序差分算法、Q-学习,并指出了它们之间的区别和联系。最后给出加强学习的两个应用以及今后的研究方向。
关键词：	加强学习动态规划蒙特卡罗算法时序差分算法 Q-学习
文章编号：	1002-8331-(2001)21-0016-03
Comparative Study of the Main Reinforcement Learning Algorithms

Guo Maozu Liu Yang Huang Tiyun. Comparative Study of the Main Reinforcement Learning Algorithms[J]. Computer Engineering and Applications, 2001, 37(21): 16-18,48

Authors:	Guo Maozu Liu Yang Huang Tiyun

Affiliation:	Guo Maozu 1 Liu Yang 1 Huang Tiyun 21

Abstract:	The model of reinforcement learning is first introduced in this paper.Then the four main algorithms including dynamic programming,monte carlo method,temporal-difference and Q-learning are given respectively,and their differ-ence and relation are pointed out.At last two applications and future research directions are proposed.

Keywords:	Reinforcement learning Dynamic programming monte carlo method Temporal-difference Q-learning
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏