基于Deep Web的信息采集系统 An Information Extraction System Based on Deep Web期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于Deep Web的信息采集系统

引用本文：	王冉冉,王刚,黄青松.基于Deep Web的信息采集系统[J].微机发展,2007,17(10):171-173.

作者姓名：	王冉冉王刚黄青松

作者单位：	昆明理工大学信息工程与自动化学院云南昆明650051

基金项目：	国家教育部春晖计划(Z2005-1-53004)

摘要：	随着互联网技术的迅速发展,大量结构化的高质量信息被埋入网络,却无法被传统的搜索引擎检索到,进而难以被挖掘利用。针对这一现象,提出了基于Deep Web的信息采集系统,设计了基于Web的查询方式,并结合数据挖掘的相关技术,获取并挖掘深网信息资源,解决传统手工采集信息的弊端,提高系统的使用效率,避免人工搜集时间和费用上的开销,降低成本,便于维护。并且正在云南省大型仪器协作共用网络平台的建设中尝试实现这个子系统的设计。
关键词：	Deep Web 信息采集查询接口数据挖掘
文章编号：	1673-629X(2007)10-0171-03
修稿时间：	2006年12月2日
An Information Extraction System Based on Deep Web

WANG Ran-ran,WANG Gang,HUANG Qing-song.An Information Extraction System Based on Deep Web[J].Microcomputer Development,2007,17(10):171-173.

Authors:	WANG Ran-ran WANG Gang HUANG Qing-song

Abstract:	With the rapid development of Internet technology,a large amount of structured and high-quality information is embedded into Internet.However,the information cannot be retrieved by traditional search engine and then it is difficult to mine and make full use of it.In view of this phenomenon,presents a system based on the deep Web information extraction,designs a query schema based on the Web,and combines some relevant technology of data mining.As a result,can get and mine the information which is in the Deep Web.At the same time,it resolves the traditional drawback of collecting information artificially,enhances the efficiency of the system,avoids the expenses on collection time and the expense,reduces the cost and maintains easily.And it has been designing in the Yunnan province scientific instrument shared network platform.

Keywords:	deep Web information extraction inquiry interface data mining
本文献已被 CNKI 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏