Improved DDPG Algorithm for UAV Track Planning

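Cite this article:	GAO Jingpeng, HU Xinyu, JIANG Zhiye. Improved DDPG algorithm for UAV track planning[J]. Computer Engineering and Applications, 2022, 58(8): 264-272.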

Author names:	GAO Jingpeng, HU Xinyu, JIANG Zhiye

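Author affiliations:	1. State Key Laboratory of Complex Electromagnetic Environment Effects on Electronics and Information System, Luoyang, Henan 471003; 2. College of Information and Communication Engineering, Harbin Engineering University, Harbin 150001; 3. National Key Laboratory of Science and Technology on Test Physics and Numerical Mathematics, Beijing Institute of Space Long March Vehicle, Beijing 100076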

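Abstract:	Unknown threats during UAV flight make intelligent algorithms computationally complex, which makes real-time track planning difficult, and tuning the parameters of the DDPG algorithm in deep reinforcement learning carries an excessive time cost. To address these problems, an improved DDPG track planning algorithm is proposed. For the UAV track planning problem, a flight scene model is constructed; the action space is built according to flight dynamics theory; the reward function is designed following the non-sparse idea; and the DDPG algorithm model is improved by combining it with the artificial bee colony algorithm...
Keywords:	deep deterministic policy gradient algorithm; UAV track planning; deep reinforcement learning; artificial bee colony algorithm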
Unmanned Aerial Vehicle Track Planning Algorithm Based on Improved DDPG

GAO Jingpeng, HU Xinyu, JIANG Zhiye. Unmanned Aerial Vehicle Track Planning Algorithm Based on Improved DDPG[J]. Computer Engineering and Applications, 2022, 58(8): 264-272.

Authors:	GAO Jingpeng, HU Xinyu, JIANG Zhiye

Affiliation:	1. State Key Laboratory of Complex Electromagnetic Environment Effects on Electronics and Information System (CEMEE), Luoyang, Henan 471003, China; 2. College of Information and Communication Engineering, Harbin Engineering University, Harbin 150001, China; 3. National Key Laboratory of Science and Technology on Test Physics and Numerical Mathematics, Beijing Institute of Space Long March Vehicle, Beijing 100076, China

Abstract:	An improved DDPG flight track planning algorithm is proposed to address two problems: unknown threats during UAV flight raise the processing complexity of intelligent algorithms and make real-time track planning difficult, and tuning the parameters of the DDPG algorithm in deep reinforcement learning takes excessive time. A flight scene model is established for the UAV track planning problem. The action space is built according to flight dynamics theory, and the reward function is designed on the basis of the non-sparse idea. The updating mechanism of the DDPG model parameters is improved by combining it with the artificial bee colony algorithm, and the network model is trained to make flight track decisions for the UAV. Simulation results show that the overall training time of the proposed algorithm is only 1.98 times the average training time of the prototype algorithm, so training efficiency is improved and time cost is reduced. In addition, while satisfying real-time flight requirements, the proposed algorithm meets the demand for UAV track quality and provides a new idea for promoting the practical application of deep reinforcement learning in flight track planning.

Keywords:	deep deterministic policy gradient algorithm; unmanned aerial vehicle track planning; deep reinforcement learning; artificial bee colony algorithm
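
The abstract mentions a reward function designed on the non-sparse idea but does not give its form. As a rough illustration of that idea only, and not the authors' formula, the Python sketch below contrasts a sparse reward (feedback only on arrival) with a dense one that returns feedback at every step. The function names, the circular threat model, and all weights are hypothetical choices made for this example.

```python
import math

def sparse_reward(pos, goal, goal_radius=1.0):
    """Sparse baseline: 1.0 inside the goal radius, otherwise 0.0."""
    return 1.0 if math.dist(pos, goal) <= goal_radius else 0.0

def dense_reward(prev_pos, pos, goal, threats, goal_radius=1.0,
                 progress_weight=1.0, threat_penalty=5.0, arrival_bonus=10.0):
    """Hypothetical non-sparse reward that gives feedback at every step.

    threats is a list of (center, radius) circles the UAV should avoid.
    All weights are illustrative assumptions, not values from the paper.
    """
    # Progress term: positive when this step moved the UAV closer to the goal.
    reward = progress_weight * (math.dist(prev_pos, goal) - math.dist(pos, goal))
    # Threat term: fixed penalty for each threat circle the UAV is inside.
    for center, radius in threats:
        if math.dist(pos, center) < radius:
            reward -= threat_penalty
    # Arrival term: bonus when the UAV is inside the goal region.
    if math.dist(pos, goal) <= goal_radius:
        reward += arrival_bonus
    return reward
```

With a goal at (10, 0), a step from (0, 0) to (1, 0) earns nothing under the sparse version but a positive progress reward under the dense one, so the agent receives a learning signal long before it first reaches the goal.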
This article has been indexed by Wanfang Data and other databases.