基于深度Q学习的移动机器人路径规划 Robot Path Planning Based on Deep Q-Learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于深度Q学习的移动机器人路径规划

引用本文：	刘志荣,姜树海,袁雯雯,史晨辉.基于深度Q学习的移动机器人路径规划[J].测控技术,2019,38(7):24-28.

作者姓名：	刘志荣姜树海袁雯雯史晨辉

作者单位：	南京林业大学机械电子工程学院,江苏南京210037;南京林业大学智能控制与机器人技术研究所,江苏南京210037;南京林业大学机械电子工程学院,江苏南京210037;南京林业大学智能控制与机器人技术研究所,江苏南京210037;南京林业大学机械电子工程学院,江苏南京210037;南京林业大学智能控制与机器人技术研究所,江苏南京210037;南京林业大学机械电子工程学院,江苏南京210037;南京林业大学智能控制与机器人技术研究所,江苏南京210037

基金项目：	国家公益性行业科研专项重大项目（201404402-03）；江苏省研究生科研创新计划项目（KYCX17_0865）

摘要：	针对传统Q-learning算法在复杂环境下移动机器人路径规划问题中容易产生维数灾难的问题,提出一种改进方法。该方法将深度学习融于Q-learming框架中,以网络输出代替Q值表,解决维数灾难问题。通过构建记忆回放矩阵和双层网络结构打断数据相关性,提高算法收敛性。最后,通过栅格法建立仿真环境建模,在不同复杂程度上的地图上进行仿真实验,对比实验验证了传统Q-learming难以在大状态空间下进行路径规划,深度强化学习能够在复杂状态环境下进行良好的路径规划。
关键词：	Q-learning 深度Q学习移动机器人路径规划
Robot Path Planning Based on Deep Q-Learning

LIU Zhi-rong,JIANG Shu-hai,YUAN Wen-wen,SHI Chen-hui.Robot Path Planning Based on Deep Q-Learning[J].Measurement & Control Technology,2019,38(7):24-28.

Authors:	LIU Zhi-rong JIANG Shu-hai YUAN Wen-wen SHI Chen-hui

Affiliation:	(College of Mechanical and Electronic Engineering,Nanjing Forestry University,Nanjing 210037,China;Institute of Intelligent Control and Roboties,Nanjing Forestry University,Nanjing 210037,China)

Abstract:	In order to solve the problem that the traditional Q-learning algorithm is prone to dimension disaster in the path planning of mobile robot in complex environment,an improved method is proposed.This method integrates deep learning into the Q-learning framework and replaces the Q-value table with network output to solve the dimensionality disaster problem.In addition,by constructing a memory playback matrix and a two-layer network structure,data correlation is interrupted to improve the convergence of the algorithm.Finally,the simulation environment modeling is established by grid method,and simulation experiments are carried out on multiple maps with different complexity levels.The comparison experiments verify that traditional Q-learning is difficult to perform good path planning in large state space,and deep Q-learning enables good path planning in complex state environments.

Keywords:	Q-learning deep Q-learning mobile robot path planning
本文献已被维普万方数据等数据库收录！
	点击此处可从《测控技术》浏览原始摘要信息
	点击此处可从《测控技术》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏