一种Deep Web查询结果的实体抽取方法 Research on entity extraction method of Deep Web data integration期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种Deep Web查询结果的实体抽取方法

引用本文：	赵海霞,李道申,刘勇,赵嘉诚.一种Deep Web查询结果的实体抽取方法[J].计算机工程与应用,2012,48(36):160-163.

作者姓名：	赵海霞李道申刘勇赵嘉诚

作者单位：	1. 河南科技大学电子信息工程学院,河南洛阳,471003 2. 长春理工大学软件学院,长春,130000

摘要：	Deep Web中蕴含着丰富的高质量的信息,通过Deep Web集成查询接口可以获取到包含这些信息的结果页面,因此,Deep Web查询结果页面的数据抽取成为Deep Web数据集成的关键。提出了将索引方法和编辑相似度相结合的方法,来完成Deep Web查询结果页面的数据抽取工作。大量实验结果表明:该方法是可行的,并且能够提高Deep Web数据实体抽取的准确性和召回率。
关键词：	深度网数据抽取文件对象模型(DOM)树索引相似度
Research on entity extraction method of Deep Web data integration

ZHAO Haixia , LI Daoshen , LIU Yong , ZHAO Jiacheng.Research on entity extraction method of Deep Web data integration[J].Computer Engineering and Applications,2012,48(36):160-163.

Authors:	ZHAO Haixia LI Daoshen LIU Yong ZHAO Jiacheng

Affiliation:	1.Electronic & Information Engineering College,Henan University of Science and Technology,Luoyang,Henan 471003,China 2.Software College,Changchun University of Science Technology,Changchun 130000,China

Abstract:	Based on the realization of Deep Web integrated query mechanism, Deep Web information can be obtained from the resulting pages, so how to extract the entity information of Deep Web from the results pages effectively becomes the key of Deep Web data integration. A method that combines the index with the edit similarity methods is proposed, which resolves the problem of data extraction of Deep Web result page. Large experimental results show that this approach is feasible, and can improve the precision and recall of Deep Web data extraction.

Keywords:	Deep Web data extraction Document Object Model（DOM） tree index similarity
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏