基于强化学习的无人机全自主电力巡检 Autonomous Power Inspection of UAV Based on Reinforcement Learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于强化学习的无人机全自主电力巡检

引用本文：	王瑞群,欧阳权,段朝伟,王志胜. 基于强化学习的无人机全自主电力巡检[J]. 机械与电子, 2021, 39(12): 34-38. DOI: 10.3969/j.issn.1001-2257.2021.12.007

作者姓名：	王瑞群欧阳权段朝伟王志胜

作者单位：	南京航空航天大学自动化学院,江苏南京211100

基金项目：	江苏省高校自然科学研究面上项目(18KJB520023)；南京航空航天大学研究生创新竞赛(016001)。

摘要：	针对无人机在电力巡检中的全自主性进行研究,提出全自主电力巡检系统,该系统由无人机智能体、充电桩和待巡检目标构成。借助无线充电技术和强化学习决策算法使无人机拥有全自主执行任务的能力,并在设计的仿真环境中进行了验证,训练后的无人机智能体可以自主路径规划进行电力巡检和自主决策到附近的无线充电桩充电,无需人工介入可完成所有巡检任务。由于传统的近端策略优化算法在本系统中奖励低的问题,因此提出一种基于动作掩码的近端策略优化算法来训练无人机智能体,仿真实验结果表明,动作掩码机制使奖励提高了39%,能耗降低了15.34%。
关键词：	电力巡检强化学习近端策略优化无线充电能量优化
Autonomous Power Inspection of UAV Based on Reinforcement Learning

WANG Ruiqun,OUYANG Quan,DUAN Chaowei,WANG Zhisheng. Autonomous Power Inspection of UAV Based on Reinforcement Learning[J]. Machinery & Electronics, 2021, 39(12): 34-38. DOI: 10.3969/j.issn.1001-2257.2021.12.007

Authors:	WANG Ruiqun OUYANG Quan DUAN Chaowei WANG Zhisheng

Affiliation:	( College of Automation Engineering , Nanjing University of Aeronautics and Astronautics , Nanjing 211100 , China )

Abstract:	Based on the research on the full autonomy of UAV in power line inspection , an autonomous power inspection system is proposed , which is composed of UAV agent , charging pile and targets to be inspected.With the aid of wireless charging technology and reinforcement learning algorithm , UAV has achieved the independent ability to perform a task , which has been verified in the design of simulation environment.Trained UAV agent can dopath planning for electric power inspection and make decision to nearby wireless charging pile autonomously , and all the inspection tasks can be done without human intervention.Due to low reward of traditional proximal strategy optimization ( PPO ) algorithm in this system , therefore , a kind of PPO algorithm based on action mask ( PPOAM ) is proposed to train UAV agent.The simulation results show that the action mask mechanism can increase the reward by 39% and reduce the energy consumption by 15.34%.

Keywords:	power line inspection reinforcement learning proximal policy optimization wireless charging energy optimization
本文献已被维普万方数据等数据库收录！
	点击此处可从《机械与电子》浏览原始摘要信息
	点击此处可从《机械与电子》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏