首页 | 本学科首页   官方微博 | 高级检索  
     


Dual attention interactive fine-grained classification network based on data augmentation
Affiliation:1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China;2. Jiangsu Key Laboratory of Broadband Wireless Communication and Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China;3. School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China;1. School of Information Science and Engineering, Huaqiao University, Xiamen, China;2. School of Engineering, Huaqiao University, Quanzhou, China;3. Xiamen Key Laboratory of Mobile Multimedia Communications, Xiamen, China
Abstract:The key to fine-grained image classification is to find discriminative regions. Most existing methods only use simple baseline networks or low-recognition attention modules to discover object differences, which will limit the model to finding discriminative regions hidden in images. This article proposes an effective method to solve this problem. The first is a novel layered training method, which uses a new training method to enhance the feature extraction ability of the baseline model. The second step focuses on key regions of the image based on improved long short-term memory (LSTM) and multi-head attention. In the third step, based on the feature map obtained by the dual attention network, spatial mapping is performed by a multi-layer perceptron (MLP). Then the element-by-element mutual multiplication calculation of the channel is performed to obtain a feature map with finer granularity. Finally, the CUB-200-2011, FGVC Aircraft, Stanford Cars, and MedMNIST v2 datasets achieved good performance.
Keywords:Data augmentation  Hierarchical training  Denoising autoencoder  Dual attention mechanism  Interactive attention
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号