首页 | 本学科首页   官方微博 | 高级检索  
     

改进DDPG无人机航迹规划算法
引用本文:高敬鹏,胡欣瑜,江志烨.改进DDPG无人机航迹规划算法[J].计算机工程与应用,2022,58(8):264-272.
作者姓名:高敬鹏  胡欣瑜  江志烨
作者单位:1.电子信息系统复杂电磁环境效应国家重点实验室,河南 洛阳 471003 2.哈尔滨工程大学 信息与通信工程学院,哈尔滨 150001 3.北京航天长征飞行器研究所 试验物理与计算数学国家级重点实验室,北京 100076
摘    要:针对无人机飞行过程存在未知威胁使智能算法处理复杂度高,导致航迹实时规划困难,以及深度强化学习中调整DDPG算法参数,存在时间成本过高的问题,提出一种改进DDPG航迹规划算法.围绕无人机航迹规划问题,构建飞行场景模型,根据飞行动力学理论,搭建动作空间,依据非稀疏化思想,设计奖励函数,结合人工蜂群算法,改进DDPG算法模型...

关 键 词:深度确定性策略梯度算法  无人机  航迹规划  深度强化学习  人工蜂群算法

Unmanned Aerial Vehicle Track Planning Algorithm Based on Improved DDPG
GAO Jingpeng,HU Xinyu,JIANG Zhiye.Unmanned Aerial Vehicle Track Planning Algorithm Based on Improved DDPG[J].Computer Engineering and Applications,2022,58(8):264-272.
Authors:GAO Jingpeng  HU Xinyu  JIANG Zhiye
Affiliation:1.State Key Laboratory of Complex Electromagnetic Environment Effects on Electronics and Information System(CEMEE), Luoyang, Henan 471003, China 2.College of Information and Communication Engineering, Harbin Engineering University, Harbin 150001, China 3.National Key Laboratory of Science and Technology on Test Physics and Numerical Mathematics, Beijing Institute of Space Long March Vehicle, Beijing 100076, China
Abstract:An improved DDPG flight track planning algorithm is proposed, aiming at the problem of high processing complexity of intelligent algorithm due to unknown threats in UAV flight process which leads to the difficulty of real-time flight track planning, and long training time by adjusting the parameters of DDPG algorithm in deep reinforcement learning. The flight scene model is established under the background of UAV track planning. According to the flight dynamics theory, the action space is built. On the basis of the non-sparse idea, the reward function is designed. Combined with the artificial bee colony algorithm, the updating mechanism of the model parameters of DDPG algorithm is improved, and the network model is trained to achieve the flight track decision-making of UAV. Simulation results show that the overall training time of the proposed algorithm is only 1.98 times of the average training time of the prototype algorithm, the training efficiency is improved, and the cost of time is reduced. Besides, under the condition of satisfy real time flight, the proposed algorithm can meet the demand of UAV track quality, and provides a new idea for promoting the practical application of deep reinforcement learning in flight track planning.
Keywords:deep deterministic policy gradient algorithm  unmanned aerial vehicle  track planning  deep reinforcement learning  artificial bee colony algorithm  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号