迈进高维连续空间:深度强化学习在机器人领域中的应用 Step into High-Dimensional and Continuous Action Space: A Survey on Applications of Deep Reinforcement Learning to Robotics期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

迈进高维连续空间:深度强化学习在机器人领域中的应用

引用本文：	多南讯,吕强,林辉灿,卫恒.迈进高维连续空间:深度强化学习在机器人领域中的应用[J].机器人,2019,41(2):276-288.

作者姓名：	多南讯吕强林辉灿卫恒

作者单位：	陆军装甲兵学院,北京,100072;陆军装甲兵学院,北京,100072;陆军装甲兵学院,北京,100072;陆军装甲兵学院,北京,100072

摘要：	首先,对深度强化学习(DRL)的兴起与发展进行了回顾.然后,将用于高维连续动作空间的深度强化学习算法分为基于值函数近似的算法、基于策略近似的算法以及基于其他结构的算法3类,详细讲解了深度强化学习中的最新代表性算法及其特点,并重点阐述了其思路、优势及不足.最后,结合深度强化学习算法的发展方向,对使用深度强化学习方法解决机器人学问题的未来发展趋势进行了展望.
关键词：	深度学习强化学习机器人学
Step into High-Dimensional and Continuous Action Space: A Survey on Applications of Deep Reinforcement Learning to Robotics

DUO Nanxun,LU Qiang,LIN Huican,WEI Heng.Step into High-Dimensional and Continuous Action Space: A Survey on Applications of Deep Reinforcement Learning to Robotics[J].Robot,2019,41(2):276-288.

Authors:	DUO Nanxun LU Qiang LIN Huican WEI Heng

Affiliation:	(Academy of Army Armored Force, Beijing 100072, China)

Abstract:	Firstly, the emergence and development of DRL(deep reinforcement learning) are reviewed. Secondly, DRL algorithms used in high-dimensional and continuous action space are classified into value function approximation based algorithms, policy approximation based algorithms and other structures based algorithms. Then, typical DRL algorithms and their characteristics are introduced, especially their ideas, advantages and disadvantages. Finally, the future trends of applying DRL to robotics are forecasted according to the development directions of DRL algorithms.

Keywords:	deep learning reinforcement learning robotics
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏