强化学习理论、算法及应用 Reinforcement Learning Theory,Algorithms and Its Application期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

强化学习理论、算法及应用

引用本文：	张汝波,顾国昌,刘照德,王醒策.强化学习理论、算法及应用[J].控制理论与应用,2000,17(5):637-642.

作者姓名：	张汝波顾国昌刘照德王醒策

作者单位：	哈尔滨工程大学计算机系,哈尔滨,150001

基金项目：	黑龙江省自然科学基金资助项目

摘要：	强化学习（ｒｅｉｎｆｏｒｃｅｍｅｎｔｌｅａｒｎｉｎｇ）一词来自于行为心理学，这一理论把行为学习看成是反复试验的过程，从而把环境状态映射成相应的动作。首先全面地介绍了强化学习理论的主要算法，即瞬时差分法、Ｑ－学习算法及自适应启发评价算法；然后介绍了强化学习的应用情况；最后讨论了强化学习目前所要研究的问题。
关键词：	强化学习学习理论学习算法机器学习智能控制
修稿时间：	1999-02-26
Reinforcement Learning Theory,Algorithms and Its Application

ZHANG Rubo,GU Guochang,LIU Zhaode,WANG Xingce.Reinforcement Learning Theory,Algorithms and Its Application[J].Control Theory & Applications,2000,17(5):637-642.

Authors:	ZHANG Rubo GU Guochang LIU Zhaode WANG Xingce

Abstract:	The term,reinforcement learning,comes from behavior psychology that takes behavior leaming as trial and error,by which the states of environment are mapped into corresponding actions.First,the main algorithms,temporal difference, \%Q \%learning and adaptive heuristic critic,are roundly introduced.Then,the application of reinforcement leaming is presented.Finally,some present research projects of reinforcement learning are discussed.

Keywords:	reinforcement leaming temporal difference Q-learning adaptive heuristic critic intelligent control system
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《控制理论与应用》浏览原始摘要信息

设为首页 | 免责声明 | 关于勤云 | 加入收藏