首页 | 本学科首页   官方微博 | 高级检索  
     

基于多尺度生成对抗网络的遮挡行人重识别方法
引用本文:杨婉香,严严,陈思,张小康,王菡子.基于多尺度生成对抗网络的遮挡行人重识别方法[J].软件学报,2020,31(7):1943-1958.
作者姓名:杨婉香  严严  陈思  张小康  王菡子
作者单位:福建省智慧城市感知与计算重点实验室(厦门大学信息学院),福建厦门361005;厦门理工学院计算机与信息工程学院,福建厦门361024
基金项目:国家自然科学基金(61571379,U1605252,61872307);福建省自然科学基金(2017J01127,2018J01576)
摘    要:行人重识别是指在多个非重叠摄像头拍摄的场景下,给定一幅查询行人图像,从大规模行人图像库中检索出具有相同身份的行人图像,是一类特殊的图像检索任务.随着深度学习的不断发展,行人重识别方法的性能得到了显著提升.但是行人重识别在实际应用中经常遭遇遮挡问题(例如背景遮挡、行人互相遮挡等).由于遮挡图像不仅丢失了部分目标信息,而且引入了额外的干扰,使得现有方法往往难以学习到鲁棒的特征表示,从而导致识别性能严重下降.最近,生成对抗网络在各类计算机视觉任务上展现出强大的图像生成能力.受到生成对抗网络的启发,提出了一种基于多尺度生成对抗网络的遮挡行人重识别方法.首先,利用成对的遮挡图像和非遮挡图像训练一个多尺度生成器和一个判别器.多尺度生成器能够对随机遮挡区域进行去遮挡操作,生成高质量的重构图;而判别器能够区分输入图像是真实图像还是生成图像.其次,利用训练好的多尺度生成器,生成去除随机遮挡的训练图像,添加到原始训练图像集,用于增加训练样本的多样性.最后,基于此扩充训练图像集,训练分类识别模型,有效地提高模型在测试图像集上的泛化性.在多个有挑战性的行人重识别数据集上的实验结果,验证了所提出方法的有效性.

关 键 词:行人重识别  生成对抗网络  遮挡  数据增强
收稿时间:2019/4/24 0:00:00
修稿时间:2019/7/11 0:00:00

Multi-scale Generative Adversarial Network for Person Re-identification under Occlusion
YANG Wan-Xiang,YAN Yan,CHEN Si,ZHANG Xiao-Kang,WANG Han-Zi.Multi-scale Generative Adversarial Network for Person Re-identification under Occlusion[J].Journal of Software,2020,31(7):1943-1958.
Authors:YANG Wan-Xiang  YAN Yan  CHEN Si  ZHANG Xiao-Kang  WANG Han-Zi
Affiliation:Fujian Key Laboratory of Sensing and Computing for Smart City, School of Informatics, Xiamen University, Xiamen 361005, Fujian, China,Fujian Key Laboratory of Sensing and Computing for Smart City, School of Informatics, Xiamen University, Xiamen 361005, Fujian, China,School of Computer and Information Engineering, Xiamen University of Technology, Xiamen 361024, Fujian, China,Fujian Key Laboratory of Sensing and Computing for Smart City, School of Informatics, Xiamen University, Xiamen 361005, Fujian, China and Fujian Key Laboratory of Sensing and Computing for Smart City, School of Informatics, Xiamen University, Xiamen 361005, Fujian, China
Abstract:Person re-identification (ReID) refers to the task of retrieving a given probe pedestrian image from a large-scale gallery collected by multiple non-overlapping cameras, which belongs to a specific task of image retrieval. With the development of deep learning, the performance of person ReID has been significantly improved. However, in practical applications, person ReID usually suffers from the problem of occlusion (such as background occlusion, pedestrian occlusion). The occluded image not only loses partial target information, but also introduces additional interference, which makes the deep neural network difficult to learn robust feature representations and seriously degrades the performance of person ReID. Recently, generative adversarial network (GAN) has shown the powerful image generation ability on various computer vision tasks. Inspried by GAN, we propose a person ReID method under occlusion based on multi-scale GAN. Firstly, we use the paired occluded images and unoccluded images to train a multi-scale generator and a discriminator. The multi-scale generator can restore the lost information for randomly occluded areas and generate high-quality reconstructed images; while the discriminator can distinguish whether the input image is a real image or a generated image. Then, we use the trained multi-scale generator to generate the de-occluded images. Adding these de-occluded images to the original training image set can increase the diversity of training samples. Finally, we train a classification network based on the augmented training image set, which effectively improves the generalization capability of the trained model on the testing image set. Experimental results on several challenging person ReID datasets demonstrate the effectiveness of our proposed method.
Keywords:person re-identification  generative adversarial network  occlusion  data augmentation
本文献已被 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号