运用模态融合的半监督广义零样本学习 Semi-supervised Generalized Zero-Shot Learning Using Modal Fusion期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

运用模态融合的半监督广义零样本学习

引用本文：	林爽,王晓军.运用模态融合的半监督广义零样本学习[J].计算机工程与应用,2022,58(5):163-171.

作者姓名：	林爽王晓军

作者单位：	1.南京邮电大学计算机学院，南京 210023 2.南京邮电大学物联网学院，南京 210003

基金项目：	江苏省自然科学基金青年项目

摘要：	映射域漂移和偏见性预测问题使得现有的方案无法很好地应对广义零样本学习挑战.在CADA-VAE模型的基础上,提出了基于模态融合的半监督学习方案,就如何利用未标注样本及语义辅助模型进行模态内自学习提供了一种思路.该方案使用潜层向量空间作为视觉和语义模态融合的桥梁,提出了视觉质心和异类语义潜层向量概念,用以指导模态间互学习;...
关键词：	广义零样本学习模态融合半监督学习视觉质心
Semi-supervised Generalized Zero-Shot Learning Using Modal Fusion

LIN Shuang,WANG Xiaojun.Semi-supervised Generalized Zero-Shot Learning Using Modal Fusion[J].Computer Engineering and Applications,2022,58(5):163-171.

Authors:	LIN Shuang WANG Xiaojun

Affiliation:	1.School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China 2.School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China

Abstract:	Projection domain drift and prejudice prediction problems make the existing schemes unable to meet the challenge of generalized zero-shot learning well.Based on the CADA-VAE,this article proposes a semi-supervised learning scheme based on modal fusion which provides a way of how to use unlabeled samples and semantic help the model for intra-modal self-learning.This solution uses the latent layer vector space as a bridge for the fusion of visual and semantic modalities,and proposes the concept of visual centroid and heterogeneous semantic latent layer vectors to guide mutual learning between modalities.In the cross-reconstruction link,the semantic latent layer vector is cross-reconstructed into visual features by taking the visual centroid as the axis;in the feature coding link,the visual feature is coded as a latent layer vector along the opposite direction of the heterogeneous semantic latent layer vector.This scheme ensures the generated samples have diversity while not losing the discrimination between classes.Comparative experiments on three benchmark data sets proves that this model is superior to the current mainstream solutions in recognition accuracy,and it can cope with the scarcity of labeled samples.

Keywords:	generalized zero-shotlearning modal fusion semi-supervised learning visual centroid
本文献已被维普万方数据等数据库收录！
	点击此处可从《计算机工程与应用》浏览原始摘要信息
	点击此处可从《计算机工程与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏