首页 | 本学科首页   官方微博 | 高级检索  
     

基于稀疏编码的多模态信息交叉检索
引用本文:刘菲,刘学亮.基于稀疏编码的多模态信息交叉检索[J].中国图象图形学报,2015,20(9):1170-1176.
作者姓名:刘菲  刘学亮
作者单位:合肥工业大学计算机与信息学院, 合肥 230009;合肥工业大学计算机与信息学院, 合肥 230009
基金项目:国家高技术研究发展计划(863)基金项目(2014AA015104);教育部留学回国基金项目(JZ2014JYLH0390);中央高校基本科研业务费专项基金(JZ2014HGQC0132)
摘    要:目的 多模态信息交叉检索的根本问题是多模态数据的特征表示。稀疏编码是一种有效的数据特征表示方法,但是当查询数据和被检索数据来自不同模态时,数据间存在分布差异,相似的特征可能被编码为差异显著的稀疏表示,此时传统稀疏编码便不再适用。为此,提出了一种基于稀疏编码的多模态信息交叉检索算法。方法 采用最大均值差异(MMD)以及图拉普拉斯,并将二者加入到稀疏编码的目标函数中来充分利用多模态信息进行编码,模型求解采用特征符号搜索和离散线搜索算法逐个更新稀疏编码系数。结果 在Wikipedia的文本图像对数据上进行实验,并与传统稀疏编码进行比较,实验结果表明,本文算法使交叉检索的平均准确率(MAP)提高了18.7%。结论 本文算法增强了稀疏表示的鲁棒性,提高了多模态交叉检索的准确率,更适用于对多模态数据进行特征提取,并进行进一步的操作,如交叉检索、分类等。

关 键 词:多模态  交叉检索  稀疏编码  最大均值差异  图拉普拉斯
收稿时间:2015/3/27 0:00:00
修稿时间:2015/5/11 0:00:00

Novel multi-modality information cross-retrieval based on sparse coding
Liu Fei and Liu Xueliang.Novel multi-modality information cross-retrieval based on sparse coding[J].Journal of Image and Graphics,2015,20(9):1170-1176.
Authors:Liu Fei and Liu Xueliang
Affiliation:School of computer and information, HeFei University of Technology, Hefei 230009, China;School of computer and information, HeFei University of Technology, Hefei 230009, China
Abstract:Objective The fundamental issue of multi-modality information cross-retrieval is feature representation of multi-modality data. Sparse coding is an effective representation method for feature modeling. However, when the query terms and the retrieval terms come from different modalities, the traditional sparse coding may never be suitable because the distribution difference between different modalities and similar features can be encoded as a significant difference of sparse representation. Therefore, in this paper, we present a multi-modality information cross-retrieval algorithm based on sparse coding. Method In the proposed method,maximum mean difference (MMD) and graph Laplacianare used to formulate the sparse coding objective function to thoroughly exploit the multimodal information in coding. Then, feature-sign search and discrete line search algorithm are used to optimize the objective function. Result We performed a cross-retrieval experiment on a Wikipedia text-image dataset and compared the proposed method with traditional sparse coding methods. The experimental result shows that the proposed method increased the average mean average precision (MAP) of cross-retrieval by 18.7%. Conclusion The proposed algorithm improves the robustness of sparse coding and the accuracy of multimodal cross-retrieval. and more suitable for extracting features of multimodality data for further operations, such as cross-retrieval, classification, etc.
Keywords:multi-modality  cross-retrieval  sparse coding  maximum mean discrepancy  graph Laplace
点击此处可从《中国图象图形学报》浏览原始摘要信息
点击此处可从《中国图象图形学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号