Dual attention interactive fine-grained classification network based on data augmentation期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Dual attention interactive fine-grained classification network based on data augmentation

Affiliation:	1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China;2. Jiangsu Key Laboratory of Broadband Wireless Communication and Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China;3. School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China;1. School of Information Science and Engineering, Huaqiao University, Xiamen, China;2. School of Engineering, Huaqiao University, Quanzhou, China;3. Xiamen Key Laboratory of Mobile Multimedia Communications, Xiamen, China

Abstract:	The key to fine-grained image classification is to find discriminative regions. Most existing methods only use simple baseline networks or low-recognition attention modules to discover object differences, which will limit the model to finding discriminative regions hidden in images. This article proposes an effective method to solve this problem. The first is a novel layered training method, which uses a new training method to enhance the feature extraction ability of the baseline model. The second step focuses on key regions of the image based on improved long short-term memory (LSTM) and multi-head attention. In the third step, based on the feature map obtained by the dual attention network, spatial mapping is performed by a multi-layer perceptron (MLP). Then the element-by-element mutual multiplication calculation of the channel is performed to obtain a feature map with finer granularity. Finally, the CUB-200-2011, FGVC Aircraft, Stanford Cars, and MedMNIST v2 datasets achieved good performance.

Keywords:	Data augmentation Hierarchical training Denoising autoencoder Dual attention mechanism Interactive attention
本文献已被 ScienceDirect 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏