首页 | 本学科首页   官方微博 | 高级检索  
     

基于深度强化学习的机器人运动控制研究进展
引用本文:董豪,杨静,李少波,王军,段仲静.基于深度强化学习的机器人运动控制研究进展[J].控制与决策,2022,37(2):278-292.
作者姓名:董豪  杨静  李少波  王军  段仲静
作者单位:贵州大学机械工程学院,贵阳550025;贵州大学机械工程学院,贵阳550025;贵州大学省部共建公共大数据国家重点实验室(筹),贵阳550025;贵州大学机械工程学院,贵阳550025;贵州大学省部共建公共大数据国家重点实验室(筹),贵阳550025;贵州大学现代制造技术教育部重点实验室,贵阳550025;贵州大学现代制造技术教育部重点实验室,贵阳550025
基金项目:国家重点研发计划项目(2018AAA0101803);国家自然科学基金项目(51475097,91746116);工信部资助项目(工信部联装[2016]213号);贵州省科技计划项目(黔科合人才[2015]4011);贵州省重点实验室建设项目(黔科合平台人才[2016]5103));贵州大学培育项目(贵大培育[2019]22号).
摘    要:复杂未知环境下智能感知与自动控制是目前机器人在控制领域的研究热点之一,而新一代人工智能为其实现智能自动化赋予了可能.近年来,在高维连续状态-动作空间中,尝试运用深度强化学习进行机器人运动控制的新兴方法受到了相关研究人员的关注.首先,回顾了深度强化学习的兴起与发展,将用于机器人运动控制的深度强化学习算法分为基于值函数和策略梯度2类,并对各自典型算法及其特点进行了详细介绍;其次,针对仿真至现实之前的学习过程,简要介绍5种常用于深度强化学习的机器人运动控制仿真平台;然后,根据研究类型的不同,综述了目前基于深度强化学习的机器人运动控制方法在自主导航、物体抓取、步态控制、人机协作以及群体协同等5个方面的研究进展;最后,对其未来所面临的挑战以及发展趋势进行了总结与展望.

关 键 词:复杂未知环境  人工智能  高维连续空间  深度强化学习  仿真至现实  机器人运动控制

Research progress of robot motion control based on deep reinforcement learning
DONG Hao,YANG Jing,LI Shao-bo,WANG Jun,DUAN Zhong-jing.Research progress of robot motion control based on deep reinforcement learning[J].Control and Decision,2022,37(2):278-292.
Authors:DONG Hao  YANG Jing  LI Shao-bo  WANG Jun  DUAN Zhong-jing
Affiliation:School of Mechanical Engineering,Guizhou University,Guiyang 550025,China;School of Mechanical Engineering,Guizhou University,Guiyang 550025,China;State Key Laboratory of Public Big Data,Guizhou University,Guiyang 550025,China;School of Mechanical Engineering,Guizhou University,Guiyang 550025,China;State Key Laboratory of Public Big Data,Guizhou University,Guiyang 550025,China;Key Laboratory of Advanced Manufacturing Technology of Ministry of Education,Guizhou University,Guiyang 550025,China
Abstract:Intelligent perception and automatic control in a complex unknown environment is one of the current research hotspots of robots in the field of control, and a new generation of artificial intelligence makes it possible to realize intelligent automation. In recent years, the new method of robot control using deep reinforcement learning in high-dimensional continuous state-action space has attracted the attention of relevant researchers. Firstly, the rise and development of deep reinforcement learning are first reviewed. The deep reinforcement learning algorithms for robot motion control are classified into two categories: value-based functions and policy gradients, and their typical algorithms and their related features are detailly described. Then, for the learning process before simulation to reality, five kinds of simulation platforms for robot motion control are briefly introduced, which are often used for deep reinforcement learning. Moreover, according to different types of research, the research progress of the deep reinforcement learning approach of robot motion control is expounded in five aspects, including autonomous navigation, object grasping, gait control, human-robot collaborative and multi-robot cooperation. Finally, the future challenges and development trends are summarized and anticipated.
Keywords:complex unknown environment  artificial intelligence  high-dimensional continuous space  deep reinforcement learning  simulation to reality  robot motion control
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《控制与决策》浏览原始摘要信息
点击此处可从《控制与决策》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号