
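Semi-supervised Generalized Zero-Shot Learning Using Modal Fusion
Cite this article: LIN Shuang, WANG Xiaojun. Semi-supervised Generalized Zero-Shot Learning Using Modal Fusion[J]. Computer Engineering and Applications, 2022, 58(5): 163-171.
Authors: LIN Shuang  WANG Xiaojun
Affiliation: 1. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023; 2. School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003
Funding: Youth Program of the Natural Science Foundation of Jiangsu Province
Abstract: Projection domain shift and biased prediction prevent existing approaches from handling generalized zero-shot learning well. Building on the CADA-VAE model, this paper proposes a semi-supervised learning scheme based on modal fusion, offering one way to use unlabeled samples and semantics to assist the model's intra-modal self-learning. The scheme uses the latent vector space as a bridge for fusing the visual and semantic modalities, and introduces the concepts of the visual centroid and the heterogeneous semantic latent vector to guide mutual learning between modalities;...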

Keywords: generalized zero-shot learning  modal fusion  semi-supervised learning  visual centroid

Semi-supervised Generalized Zero-Shot Learning Using Modal Fusion
LIN Shuang, WANG Xiaojun. Semi-supervised Generalized Zero-Shot Learning Using Modal Fusion[J]. Computer Engineering and Applications, 2022, 58(5): 163-171.
Authors: LIN Shuang  WANG Xiaojun
Affiliation: 1. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China; 2. School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
Abstract: Projection domain shift and biased prediction prevent existing schemes from handling generalized zero-shot learning well. Building on CADA-VAE, this article proposes a semi-supervised learning scheme based on modal fusion, which offers one way to use unlabeled samples and semantic information to assist the model's intra-modal self-learning. The scheme uses the latent vector space as a bridge for fusing the visual and semantic modalities, and introduces the concepts of the visual centroid and the heterogeneous semantic latent vector to guide mutual learning between modalities. In the cross-reconstruction stage, semantic latent vectors are cross-reconstructed into visual features with the visual centroid as the axis; in the feature-encoding stage, visual features are encoded into latent vectors along the direction opposite to the heterogeneous semantic latent vectors. This keeps the generated samples diverse without losing discrimination between classes. Comparative experiments on three benchmark datasets show that the model is superior to current mainstream solutions in recognition accuracy and can cope with the scarcity of labeled samples.
Keywords: generalized zero-shot learning  modal fusion  semi-supervised learning  visual centroid
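
The abstract names a "visual centroid" but does not define it. As an illustration only, the sketch below assumes it is the per-class mean of visual feature vectors, which is a common reading and not necessarily the authors' definition. The function name class_centroids and the toy data are hypothetical.

```python
from collections import defaultdict


def class_centroids(features, labels):
    """Return {label: mean feature vector} for labeled visual features.

    Assumption: a "visual centroid" is the per-class mean of the visual
    feature vectors; the abstract does not give the exact definition.
    """
    sums = {}
    counts = defaultdict(int)
    for vec, label in zip(features, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        # Accumulate the feature vector component by component.
        for i, value in enumerate(vec):
            sums[label][i] += value
        counts[label] += 1
    # Divide each accumulated sum by the number of samples in its class.
    return {label: [s / counts[label] for s in total]
            for label, total in sums.items()}


# Toy 2-D "visual features" for two seen classes (made-up values).
features = [[1.0, 2.0], [3.0, 4.0], [10.0, 0.0]]
labels = ["cat", "cat", "dog"]
centroids = class_centroids(features, labels)
```

In a real pipeline the features would presumably be high-dimensional image embeddings, and the centroids would serve as reference points during cross-reconstruction; how the paper uses them exactly is described only in the full text.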
This article is indexed in databases including VIP and Wanfang Data.