首页 | 本学科首页   官方微博 | 高级检索  
     

基于强化学习的无人机全自主电力巡检
引用本文:王瑞群,欧阳权,段朝伟,王志胜. 基于强化学习的无人机全自主电力巡检[J]. 机械与电子, 2021, 39(12): 34-38. DOI: 10.3969/j.issn.1001-2257.2021.12.007
作者姓名:王瑞群  欧阳权  段朝伟  王志胜
作者单位:南京航空航天大学自动化学院,江苏南京211100
基金项目:江苏省高校自然科学研究面上项目(18KJB520023);南京航空航天大学研究生创新竞赛(016001)。
摘    要:针对无人机在电力巡检中的全自主性进行研究,提出全自主电力巡检系统,该系统由无人机智能体、充电桩和待巡检目标构成。借助无线充电技术和强化学习决策算法使无人机拥有全自主执行任务的能力,并在设计的仿真环境中进行了验证,训练后的无人机智能体可以自主路径规划进行电力巡检和自主决策到附近的无线充电桩充电,无需人工介入可完成所有巡检任务。由于传统的近端策略优化算法在本系统中奖励低的问题,因此提出一种基于动作掩码的近端策略优化算法来训练无人机智能体,仿真实验结果表明,动作掩码机制使奖励提高了39%,能耗降低了15.34%。

关 键 词:电力巡检  强化学习  近端策略优化  无线充电  能量优化

Autonomous Power Inspection of UAV Based on Reinforcement Learning
WANG Ruiqun,OUYANG Quan,DUAN Chaowei,WANG Zhisheng. Autonomous Power Inspection of UAV Based on Reinforcement Learning[J]. Machinery & Electronics, 2021, 39(12): 34-38. DOI: 10.3969/j.issn.1001-2257.2021.12.007
Authors:WANG Ruiqun  OUYANG Quan  DUAN Chaowei  WANG Zhisheng
Affiliation:( College of Automation Engineering , Nanjing University of Aeronautics and Astronautics , Nanjing 211100 , China )
Abstract:Based on the research on the full autonomy of UAV in power line inspection , an autonomous power inspection system is proposed , which is composed of UAV agent , charging pile and targets to be inspected.With the aid of wireless charging technology and reinforcement learning algorithm , UAV has achieved the independent ability to perform a task , which has been verified in the design of simulation environment.Trained UAV agent can dopath planning for electric power inspection and make decision to nearby wireless charging pile autonomously , and all the inspection tasks can be done without human intervention.Due to low reward of traditional proximal strategy optimization ( PPO ) algorithm in this system , therefore , a kind of PPO algorithm based on action mask ( PPOAM ) is proposed to train UAV agent.The simulation results show that the action mask mechanism can increase the reward by 39% and reduce the energy consumption by 15.34%.
Keywords:power line inspection  reinforcement learning  proximal policy optimization  wireless charging  energy optimization
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《机械与电子》浏览原始摘要信息
点击此处可从《机械与电子》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号