强化学习在导弹制导中的应用 Application of Reinforcement Learning in Missile Guidance期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

强化学习在导弹制导中的应用

引用本文：	周　锐,陈宗基. 强化学习在导弹制导中的应用[J]. 控制理论与应用, 2001, 18(5): 748-750

作者姓名：	周　锐陈宗基

作者单位：	北京航空航天大学自动控制系,

基金项目：	国家自然科学基金(6990 40 0 2 )，国防预研基金，航天科技创新基金资助项目

摘要：	阐述了强化学习的基本原理和特点，讨论了强化学习中评价函数的神经网络近似问题，重点分析了采用多神经网络近似评价函数的学习问题，实现了状态空间或任务的自动分解，提高了评价函数的推广能力，网络的学习是离线进行，并作为反馈控制器在线应用，并以A－学习为例，将强化学习应用于导弹的制导问题，仿真结果表明了强化学习在导弹制导或控制问题中的应用前景和有效性。
关键词：	神经网络强化学习微分对策导弹制导人工智能
文章编号：	1000-8152(2001)05-0748-03
收稿时间：	2000-01-10
修稿时间：	2000-01-10
Application of Reinforcement Learning in Missile Guidance

ZHOU Rui and CHEN Zong-ji. Application of Reinforcement Learning in Missile Guidance[J]. Control Theory & Applications, 2001, 18(5): 748-750

Authors:	ZHOU Rui and CHEN Zong-ji

Affiliation:	Department of Automatic Control,Beijing University of Aeronautics and Astronautics, Beijing,100083, P. R. China;Department of Automatic Control,Beijing University of Aeronautics and Astronautics, Beijing,100083, P. R. China

Abstract:	Principle and characteristic of reinforcement learning are outlined. The value function approximation of reinforcement learning with neural networks is studied, and the learning algorithm using modular neural networks to approximate the value function is emphatically analyzed, which decomposes the state space automatically and increases the generalizing ability of the neural networks. The neural networks are trained offline, and is used online as a feedback controller. The A learning algorithm is applied in the missile guidance problem, and the simulation results show the good performance and effectiveness of the application of reinforcement learning in those problems of missile guidance and control.

Keywords:	neural networks reinforcement learning differential games missile guidance
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《控制理论与应用》浏览原始摘要信息
	点击此处可从《控制理论与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏