首页 | 本学科首页   官方微博 | 高级检索  
     

跨模态数据实体分辨研究综述
引用本文:曹建军,聂子博,郑奇斌,吕国俊,曾志贤.跨模态数据实体分辨研究综述[J].软件学报,2023,34(12):5822-5847.
作者姓名:曹建军  聂子博  郑奇斌  吕国俊  曾志贤
作者单位:国防科技大学 第六十三研究所, 江苏 南京 210007;中国人民解放军陆军工程大学 指挥控制工程学院, 江苏 南京 210007
基金项目:国家自然科学基金(61371196); 中国博士后科学基金(2015M582832); 国家科技重大专项(2015ZX01040201-003)
摘    要:实体分辨广泛地存在于数据质量控制、信息检索、数据集成等数据任务中.传统的实体分辨主要面向关系型数据,而随着大数据技术的发展,文本、图像等模态不同的数据大量涌现催生了跨模态数据应用需求,将跨模态数据实体分辨提升为大数据处理和分析的基础问题之一.对跨模态实体分辨问题的研究进展进行回顾,首先介绍问题的定义、评价指标;然后,以模态内关系的保持和模态间关系的建立为主线,对现有研究进行总结和梳理;并且,通过在多个公开数据集上对常用方法进行测试,对出现差异的原因和进行分析;最后,总结当前研究仍然存在的问题,并依据这些问题给出未来可能的研究方向.

关 键 词:实体分辨  跨模态数据处理  深度学习  相似性度量
收稿时间:2022/5/16 0:00:00
修稿时间:2022/6/23 0:00:00

Survey on Cross-modal Data Entity Resolution
Affiliation:The Sixty-third Research Institute, National University of Defense Technology, Nanjing 210007, China;College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China
Abstract:Entity resolution widely exists in data tasks such as data quality control, information retrieval, and data integration. Traditional entity resolution methods mainly focus on relational data, while with the development of big data technology, the application requirements of cross-modal data are generated due to the proliferation of different modal data including texts and images. Hence, cross-modal data entity resolution has become a fundamental problem in big data processing and analysis. In this study, the research development of cross-modal entity resolution is reviewed, and its definition and evaluation indexes are introduced. Then, with the construction of inter-modal relationships and the maintenance of intra-modal relationships as the main line, existing research results are surveyed. In addition, widely used methods are tested on different open datasets, and their differences and reasons behind them are analyzed. Finally, the problems in the present research are concluded, on the basis of which the future research trends are given.
Keywords:entity resolution  cross-modal data processing  deep learning  similarity measure
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号