首页 | 本学科首页   官方微博 | 高级检索  
     

强化学习在导弹制导中的应用
引用本文:周 锐,陈宗基. 强化学习在导弹制导中的应用[J]. 控制理论与应用, 2001, 18(5): 748-750
作者姓名:周 锐  陈宗基
作者单位:北京航空航天大学自动控制系,
基金项目:国家自然科学基金(6990 40 0 2 ),国防预研基金,航天科技创新基金资助项目
摘    要:阐述了强化学习的基本原理和特点,讨论了强化学习中评价函数的神经网络近似问题,重点分析了采用多神经网络近似评价函数的学习问题,实现了状态空间或任务的自动分解,提高了评价函数的推广能力,网络的学习是离线进行,并作为反馈控制器在线应用,并以A-学习为例,将强化学习应用于导弹的制导问题,仿真结果表明了强化学习在导弹制导或控制问题中的应用前景和有效性。

关 键 词:神经网络 强化学习 微分对策 导弹制导 人工智能
文章编号:1000-8152(2001)05-0748-03
收稿时间:2000-01-10
修稿时间:2000-01-10

Application of Reinforcement Learning in Missile Guidance
ZHOU Rui and CHEN Zong-ji. Application of Reinforcement Learning in Missile Guidance[J]. Control Theory & Applications, 2001, 18(5): 748-750
Authors:ZHOU Rui and CHEN Zong-ji
Affiliation:Department of Automatic Control,Beijing University of Aeronautics and Astronautics, Beijing,100083, P. R. China;Department of Automatic Control,Beijing University of Aeronautics and Astronautics, Beijing,100083, P. R. China
Abstract:Principle and characteristic of reinforcement learning are outlined. The value function approximation of reinforcement learning with neural networks is studied, and the learning algorithm using modular neural networks to approximate the value function is emphatically analyzed, which decomposes the state space automatically and increases the generalizing ability of the neural networks. The neural networks are trained offline, and is used online as a feedback controller. The A learning algorithm is applied in the missile guidance problem, and the simulation results show the good performance and effectiveness of the application of reinforcement learning in those problems of missile guidance and control.
Keywords:neural networks  reinforcement learning  differential games  missile guidance
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《控制理论与应用》浏览原始摘要信息
点击此处可从《控制理论与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号