基于视觉信息积累的行人重识别网络 Visual information accumulation network for person re-identification期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于视觉信息积累的行人重识别网络

作者姓名：	耿圆谭红臣李敬华王立春

作者单位：	北京工业大学人工智能与自动化学院，北京 100124

基金项目：	第7批全国博士后创新人才支持计划项目(BX20220025)；第70批全国博士后面上基金项目(2021M700303)

摘要：	在以往的行人重识别方法中，绝大部分的工作集中于图像注意力区域的学习，却忽视了非注意力区域对最终特征学习的影响，如果在关注图像注意力区域的同时加强非注意力区域的特征学习，可进一步丰富最终的行人特征，有利于行人身份信息的准确识别。基于此，提出了视觉信息积累网络(VIA Net)，该网络整体采用两分支结构，一个分支倾向于学习图像的全局特征，另一个分支则拓展为多分支结构，通过结合注意力区域和非注意力区域的特征逐步加强局部特征的学习，实现视觉信息的积累，进一步丰富特征信息。实验结果表明，在Market-1501等行人重识别数据集上，所提出的VIA Net网络达到了较高的实验性能；同时，在In-Shop Clothes Retrieval数据集上的实验证明：该网络也适用于一般的图像检索任务，具有一定的通用性。
关键词：	行人重识别视觉信息注意力区域非注意力区域度量学习
Visual information accumulation network for person re-identification

Authors:	GENG Yuan TAN Hong-chen LI Jing-hua WANG Li-chun

Affiliation:	School of Artificial Intelligence and Automation, Beijing University of Technology, Beijing 100124, China

Abstract:	The preceding person re-identification methods were mostly focused on the learning of the image attention region, but ignored the impact of the non-attention region on the final feature learning. If the feature learning of image non-attention regions is enhanced while focusing on attention regions, the final person features can be further enriched, which is beneficial to the accurate identification of person identity information. Based on this, this paper proposed a visual information accumulation network (VIA Net), adopting two branches. One branch tended to learn the global features of the image, and the other branch was expanded into a multi-branch structure. By combining the features of the attention and non-attention regions, the learning of local features could be gradually strengthened, thus realizing the accumulation of visual information and further enriching the feature information. The experimental results show that the proposed VIA Net could attain high experimental performance in terms of person re-identification datasets such as Market-1501. At the same time, the experiment on the In-Shop Clothes Retrieval dataset shows that the network could also be applicable to general image retrieval tasks and possess certain universality.

Keywords:	person re-identification visual information attention region non-attention region metric learning

	点击此处可从《》浏览原始摘要信息
	点击此处可从《》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏