首页 | 本学科首页   官方微博 | 高级检索  
     


Transfer reinforcement learning method with multi-label learning for compound fault recognition
Affiliation:1. State Key Laboratory of Digital Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan 430074, China;2. School of Mechanical and Electrical Engineering, Central South University, Changsha 410083, China;3. National NC System Engineering Research Center, Huazhong University of Science and Technology, Wuhan 430074, China;1. School of Aerospace Engineering, Beijing Institute of Technology, Beijing 100081, China;2. Key Laboratory of Autonomous Navigation and Control for Deep Space Exploration (Beijing Institute of Technology), Ministry of Industry and Information Technology, Beijing 100081, China;3. Space Star Technology Co., Ltd, Beijing 100086, China
Abstract:In complex working site, bearings used as the important part of machine, could simultaneously have faults on several positions. Consequently, multi-label learning approach considering fully the correlation between different faulted positions of bearings becomes the popular learning pattern. Deep reinforcement learning (DRL) combining the perception ability of deep learning and the decision-making ability of reinforcement learning, could be adapted to the compound fault diagnosis while having a strong ability extracting the fault feature from the raw data. However, DRL is difficult to converge and easily falls into the unstable training problem. Therefore, this paper integrates the feature extraction ability of DRL and the knowledge transfer ability of transfer learning (TL), and proposes the multi-label transfer reinforcement learning (ML-TRL). In detail, the proposed method utilizes the improved trust region policy optimization (TRPO) as the basic DRL framework and pre-trains the fixed convolutional networks of ML-TRL using the multi-label convolutional neural network method. In compound fault experiment, the final results demonstrate powerfully that the proposed method could have the higher accuracy than other multi-label learning methods. Hence, the proposed method is a remarkable alternative when recognizing the compound fault of bearings.
Keywords:Compound fault recognition  Deep reinforcement learning  Multi-label learning  Transfer learning  Trust region policy optimization
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号