
Web Information Extraction Based on Sub-tree Breadth

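Cite this article:	WANG Quan, SHI Shao-ting. Web Information Extraction Based on Sub-tree Breadth[J]. Computer Engineering, 2009, 35(3): 89-90,9.

Authors:	WANG Quan, SHI Shao-ting

Affiliation:	Institute of Science & Technology Information of Gansu, Lanzhou 730000

Funding:	Gansu Province Technology Research and Development Special Program Fund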

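Abstract:	This paper proposes a new Web page information extraction method that uses the breadth of sub-trees to extract page information automatically from different scientific and technical literature Web sites, without treating any site specially. Extraction experiments were carried out on a large number of scientific and technical literature Web sites, and the method has been deployed in the Gansu Science and Technology Literature Sharing Platform. The experimental results show that the method extracts the relevant information automatically, independent of the source of the literature page, while maintaining high recall and precision.
Keywords:	sub-tree breadth; information extraction; cross-database retrieval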
Web Information Extraction Based on Sub-tree Breadth

WANG Quan,SHI Shao-ting.Web Information Extraction Based on Sub-tree Breadth[J].Computer Engineering,2009,35(3):89-90,9.

Authors:	WANG Quan, SHI Shao-ting

Affiliation:	Institute of Science & Technology Information of Gansu, Lanzhou 730000

Abstract:	This paper proposes a new method that automatically extracts useful information from different document Web sites based on the breadth of a sub-tree. An experimental evaluation has been carried out on a large number of Web pages from different document Web sites, and the method has been applied successfully to the Gansu Science & Technology Document Sharing Platform. The experimental results show that the method extracts the information automatically, regardless of which Web site the pages come from, and has high accuracy in terms ...

Keywords:	sub-tree breadth; information extraction; cross-search
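
The abstract does not define "sub-tree breadth" or give the extraction procedure, so the following is only a rough illustration of the general idea, not the authors' algorithm. It assumes that the breadth of a sub-tree is the number of direct child elements of its root, and that the widest sub-tree of a result page holds the repeated literature records. The names `Node`, `TreeBuilder`, `breadth`, `widest_subtree`, and `extract_records` are all hypothetical.

```python
from html.parser import HTMLParser

# Void elements have no closing tag, so they are never made the current node.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    def __init__(self, tag, parent=None):
        self.tag = tag
        self.parent = parent
        self.children = []  # child elements only
        self.content = []   # child elements and text chunks, in document order


class TreeBuilder(HTMLParser):
    """Builds a simple element tree from HTML using only the standard library."""

    def __init__(self):
        super().__init__()
        self.root = Node("#root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.children.append(node)
        self.current.content.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag; ignore stray closers.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        if data.strip():
            self.current.content.append(data.strip())


def iter_nodes(node):
    yield node
    for child in node.children:
        yield from iter_nodes(child)


def breadth(node):
    # ASSUMPTION: breadth = number of direct child elements of the sub-tree root.
    return len(node.children)


def widest_subtree(root):
    # ASSUMPTION: the sub-tree with the largest breadth is the record list.
    return max(iter_nodes(root), key=breadth)


def text_of(node):
    parts = []
    for item in node.content:
        parts.append(text_of(item) if isinstance(item, Node) else item)
    return " ".join(p for p in parts if p)


def extract_records(html):
    """Returns the text of each child of the widest sub-tree, one per record."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return [text_of(child) for child in widest_subtree(builder.root).children]
```

Because this heuristic looks only at tree shape and not at site-specific tag names or templates, the same function can be called on pages from different sites, which is the property the abstract emphasizes. A real system would need additional filtering, for example to keep a wide navigation bar from being mistaken for the record list.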
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.
