Transfer reinforcement learning method with multi-label learning for compound fault recognition |
| |
Affiliation: | 1. State Key Laboratory of Digital Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan 430074, China; 2. School of Mechanical and Electrical Engineering, Central South University, Changsha 410083, China; 3. National NC System Engineering Research Center, Huazhong University of Science and Technology, Wuhan 430074, China; 1. School of Aerospace Engineering, Beijing Institute of Technology, Beijing 100081, China; 2. Key Laboratory of Autonomous Navigation and Control for Deep Space Exploration (Beijing Institute of Technology), Ministry of Industry and Information Technology, Beijing 100081, China; 3. Space Star Technology Co., Ltd, Beijing 100086, China |
| |
Abstract: | In complex working environments, bearings, as key machine components, may develop faults at several positions simultaneously. Consequently, multi-label learning, which fully considers the correlation between different fault positions of a bearing, has become a popular learning paradigm. Deep reinforcement learning (DRL), which combines the perception ability of deep learning with the decision-making ability of reinforcement learning, can be adapted to compound fault diagnosis and has a strong ability to extract fault features from raw data. However, DRL is difficult to converge and prone to unstable training. Therefore, this paper integrates the feature extraction ability of DRL with the knowledge transfer ability of transfer learning (TL) and proposes multi-label transfer reinforcement learning (ML-TRL). Specifically, the proposed method uses an improved trust region policy optimization (TRPO) as the basic DRL framework and pre-trains the fixed convolutional networks of ML-TRL using a multi-label convolutional neural network method. In a compound fault experiment, the results show that the proposed method achieves higher accuracy than other multi-label learning methods. Hence, the proposed method is a promising alternative for recognizing compound faults of bearings. |
| |
Keywords: | Compound fault recognition; Deep reinforcement learning; Multi-label learning; Transfer learning; Trust region policy optimization |
This article is indexed in ScienceDirect and other databases. |
|