Label distribution for multimodal machine learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Label distribution for multimodal machine learning

Authors:	Yi REN Ning XU Miaogen LING Xin GENG

Affiliation:	Department of Computer Science and Engineering, Southeast University, Nanjing 211189, China

Abstract:	Multimodal machine learning (MML) aims to understand the world from multiple related modalities. It has attracted much attention as multimodal data has become increasingly available in real-world application. It is shown that MML can perform better than single-modal machine learning, since multi-modalities containing more information which could complement each other. However, it is a key challenge to fuse the multi-modalities in MML. Different from previous work, we further consider the side-information, which reflects the situation and influences the fusion of multi-modalities. We recover multimodal label distribution (MLD) by leveraging the side-information, representing the degree to which each modality contributes to describing the instance. Accordingly, a novel framework named multimodal label distribution learning (MLDL) is proposed to recover the MLD, and fuse the multimodalities with its guidance to learn an in-depth understanding of the jointly feature representation. Moreover, two versions of MLDL are proposed to deal with the sequential data. Experiments on multimodal sentiment analysis and disease prediction show that the proposed approaches perform favorably against state-of-the-art methods.

Keywords:	multimodal machine learning label distribution learning sentiment analysis disease prediction
本文献已被维普等数据库收录！
	点击此处可从《Frontiers of Computer Science》浏览原始摘要信息
	点击此处可从《Frontiers of Computer Science》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏