首页 | 本学科首页   官方微博 | 高级检索  
     

基于Deep Web的信息采集系统
引用本文:王冉冉,王刚,黄青松. 基于Deep Web的信息采集系统[J]. 计算机技术与发展, 2007, 17(10): 171-173,177
作者姓名:王冉冉  王刚  黄青松
作者单位:昆明理工大学,信息工程与自动化学院,云南,昆明,650051
摘    要:随着互联网技术的迅速发展,大量结构化的高质量信息被埋入网络,却无法被传统的搜索引擎检索到,进而难以被挖掘利用。针对这一现象,提出了基于Deep Web的信息采集系统,设计了基于Web的查询方式,并结合数据挖掘的相关技术,获取并挖掘深网信息资源,解决传统手工采集信息的弊端,提高系统的使用效率,避免人工搜集时间和费用上的开销,降低成本,便于维护。并且正在云南省大型仪器协作共用网络平台的建设中尝试实现这个子系统的设计。

关 键 词:信息采集  查询接口  数据挖掘
文章编号:1673-629X(2007)10-0171-03
收稿时间:2006-12-02
修稿时间:2006-12-02

An Information Extraction System Based on Deep Web
WANG Ran-ran,WANG Gang,HUANG Qing-song. An Information Extraction System Based on Deep Web[J]. Computer Technology and Development, 2007, 17(10): 171-173,177
Authors:WANG Ran-ran  WANG Gang  HUANG Qing-song
Affiliation:School of Information Engineering and Automation, Kunming University of Science and Technolegy, Kunmlng 650051, China
Abstract:With the rapid development of Intemet technology, a large amount of structured and high - quality information is embedded into Internet. However, the information cannot be retrieved by traditional search engine and then it is difficult to mine and make full use of it. In view of this phenomenon, presents a system based on the deep Web information extraction, designs a query .schema based on the Web, and combines some relevant technology of data mining. As a result,can get and mine the information which is in the Deep Web. At the same time, it resolves the traditional drawback of collecting information artificially, enhances the efficiency of the system, avoids the expenses on collection time and the expense, reduces the cost and maintains easily. And it has betel designing in the Yunnan province scientific instrument shared network platform.
Keywords:Deep Web
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号