首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于内容相关性的跨媒体检索方法
引用本文:张鸿,吴飞,庄越挺,陈建勋. 一种基于内容相关性的跨媒体检索方法[J]. 计算机学报, 2008, 31(5): 820-826
作者姓名:张鸿  吴飞  庄越挺  陈建勋
作者单位:武汉科技大学计算机科学与技术学院,武汉,430081;浙江大学人工智能研究所,杭州,310027;浙江大学人工智能研究所,杭州,310027;武汉科技大学计算机科学与技术学院,武汉,430081
基金项目:国家自然科学基金 , 国家科技支撑计划 , 国家高技术研究发展计划(863计划) , 高校科技创新工程项目 , 教育部长江学者和创新团队发展计划
摘    要:针对传统基于内容的多媒体检索对单一模态的限制,提出一种新的跨媒体检索方法.分析了不同模态的内容特征之间在统计意义上的典型相关性,并通过子空间映射解决了特征向量的异构性问题,同时结合相关反馈中的先验知识,修正不同模态多媒体数据集在子空间中的拓扑结构,实现跨媒体相关性的准确度量.实验以图像和音频数据为例验证了基于相关性学习的跨媒体检索方法的有效性.

关 键 词:跨媒体检索  异构性  典型相关性  子空间映射  相关反馈
修稿时间:2006-06-16

Cross-Media Retrieval Method Based on Content Correlations
ZHANG Hong,WU Fei,ZHUANG Yue-Ting,CHEN Jian-Xun. Cross-Media Retrieval Method Based on Content Correlations[J]. Chinese Journal of Computers, 2008, 31(5): 820-826
Authors:ZHANG Hong  WU Fei  ZHUANG Yue-Ting  CHEN Jian-Xun
Abstract:Most traditional content-based multimedia retrieval methods are designed for multimedia data of single modality.Such methods include image retrieval,audio retrieval,video retrieval,etc.This paper proposes a novel cross-media retrieval approach,which can process multimedia data of different modalities and measure cross-media similarity,such as image-audio similarity.First statistical method is used to learn canonical correlations between low-level feature spaces of different modalities.Then,sub-space mapping is designed to build an isomorphic subspace and solve the heterogeneity problem between different low-level feature vectors.This subspace contains media objects of different modalities,and each media object is represented with isomorphic vector.Since canonical correlations among multimedia objects are furthest preserved during the mapping process,cross-media similarity can be estimated with defined distance metric.Furthermore,relevance feedback provided by users is utilized to learn prior knowledge and refine multimedia topology in the subspace.In this way cross-media similarity is more consistent with human perception with the incorporation of user interaction.Both image and audio data are selected for experiments and comparisons.Given the same visual and auditory features the new approach outperforms ICA,PCA and PLS methods both in precision and recall performance.Overall cross-media retrieval results between images and audios are very encouraging.
Keywords:cross-media retrieval  heterogeneity  canonical correlation  subspace mapping  relevance feedback
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号