基于视觉误差与语义属性的零样本图像分类 Zero-shot image classification based on visual error and semantic attributes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于视觉误差与语义属性的零样本图像分类

引用本文：	徐戈,肖永强,汪涛,陈开志,廖祥文,吴运兵.基于视觉误差与语义属性的零样本图像分类[J].计算机应用,2020,40(4):1016-1022.

作者姓名：	徐戈肖永强汪涛陈开志廖祥文吴运兵

作者单位：	1. 闽江学院计算机与控制工程学院, 福州 350108;2. 福州大学数学与计算机科学学院, 福州 350116;3. 福建省网络计算与智能信息处理重点实验室(福州大学), 福州 350116;4. 数字福建金融大数据研究所, 福州 350116

基金项目：	国家自然科学基金资助项目（61772135，U1605251，61703195）；中国科学院网络数据科学与技术重点实验室开放课题基金资助项目（CASNDST201708，CASNDST201606）；模式识别国家重点实验室开放课题基金资助项目（201900041）；福建省自然科学基金面上项目（2017J01755）；赛尔网络下一代互联网技术创新项目（NGII20160501）。

摘要：	在图像分类的实际应用过程中，部分类别可能完全没有带标签的训练数据。零样本学习（ZSL）的目的是将带标签类别的图像特征等知识迁移到无标签的类别上，实现无标签类别的正确分类。现有方法在测试时无法显式地区分输入图像属于已知类还是未知类，很大程度上导致未知类在传统设定下的ZSL和广义设定下的ZSL（GZSL）上的预测效果相差甚远。为此，提出一种融合视觉误差与属性语义信息的方法来缓解零样本图像分类中的预测偏置问题。首先，设计一种半监督学习方式的生成对抗网络架构来获取视觉误差信息，由此预测图像是否属于已知类；然后，提出融合属性语义信息的零样本图像分类网络来实现零样本图像分类；最后，测试融合视觉误差与属性语义的零样本图像分类方法在数据集AwA2和CUB上的效果。实验结果表明，与对比模型相比，所提方法有效缓解了预测偏置问题，其调和指标H在AwA2（Animal with Attributes）上提升了31.7个百分点，在CUB（Caltech-UCSD-Birds-200-2011）上提升了8.7个百分点。
关键词：	零样本学习图像分类生成对抗网络视觉误差属性语义
收稿时间：	2019-09-03
修稿时间：	2019-10-23
Zero-shot image classification based on visual error and semantic attributes

XU Ge,XIAO Yongqiang,WANG Tao,CHEN Kaizhi,LIAO Xiangwen,WU Yunbing.Zero-shot image classification based on visual error and semantic attributes[J].journal of Computer Applications,2020,40(4):1016-1022.

Authors:	XU Ge XIAO Yongqiang WANG Tao CHEN Kaizhi LIAO Xiangwen WU Yunbing

Affiliation:	1. College of Computer and Control Engineering, Minjiang University, Fuzhou Fujian 350108, China;2. College of Mathematics and Computer Science, Fuzhou University, Fuzhou Fujian 350116, China;3. Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing(Fuzhou University), Fuzhou Fujian 350116, China;4. Digital Fujian Financial Big Data Institute, Fuzhou Fujian 350116, China

Abstract:	In the practical applications of image classification,some categories may have no labeled training data at all. The purpose of Zero-Shot Learning(ZSL)is to transfer knowledge such as image features of labeled categories to unlabeled categories and to correctly classify the unlabeled categories. However,the existing state-of-the-art methods cannot explicitly distinguish the input image belonging to the known categories or unknown categories,which leads to a large performance gap for unlabeled categories between the traditional ZSL prediction and the Generalized ZSL(GZSL)prediction. Therefore,a method of fusing of visual error and semantic attributes was proposed to alleviate the prediction bias problem in zero-shot image classification. Firstly,a semi-supervised learning based generative adversarial network framework was designed to obtain visual error information,so as to predict whether the image belongs to the known categories. Then,a zero-shot image classification network combining semantic attributes was proposed to achieve zero-shot image classification. Finally,the performance of zero-shot image classification algorithm combining visual error and semantic attributes was tested on AwA2 (Animal with Attributes) and CUB (Caltech-UCSD-Birds-200-2011) datasets. The experimental results show that, compared to the baseline models,the proposed method can effectively alleviate the prediction bias problem,and has the harmonic index H increased by 31. 7 percentage points on AwA2 dataset and 8. 7 percentage points on CUB dataset.

Keywords:	Zero-Shot Learning (ZSL) image classification generative adversarial network visual error semantic attribute
本文献已被维普万方数据等数据库收录！
	点击此处可从《计算机应用》浏览原始摘要信息
	点击此处可从《计算机应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏