基于Deep Web的信息采集系统 An Information Extraction System Based on Deep Web期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于Deep Web的信息采集系统

引用本文：	王冉冉,王刚,黄青松. 基于Deep Web的信息采集系统[J]. 计算机技术与发展, 2007, 17(10): 171-173,177

作者姓名：	王冉冉王刚黄青松

作者单位：	昆明理工大学,信息工程与自动化学院,云南,昆明,650051

摘要：	随着互联网技术的迅速发展，大量结构化的高质量信息被埋入网络，却无法被传统的搜索引擎检索到，进而难以被挖掘利用。针对这一现象，提出了基于Deep Web的信息采集系统，设计了基于Web的查询方式，并结合数据挖掘的相关技术，获取并挖掘深网信息资源，解决传统手工采集信息的弊端，提高系统的使用效率，避免人工搜集时间和费用上的开销，降低成本，便于维护。并且正在云南省大型仪器协作共用网络平台的建设中尝试实现这个子系统的设计。
关键词：	信息采集查询接口数据挖掘
文章编号：	1673-629X（2007）10-0171-03
收稿时间：	2006-12-02
修稿时间：	2006-12-02
An Information Extraction System Based on Deep Web

WANG Ran-ran,WANG Gang,HUANG Qing-song. An Information Extraction System Based on Deep Web[J]. Computer Technology and Development, 2007, 17(10): 171-173,177

Authors:	WANG Ran-ran WANG Gang HUANG Qing-song

Affiliation:	School of Information Engineering and Automation, Kunming University of Science and Technolegy, Kunmlng 650051, China

Abstract:	With the rapid development of Intemet technology, a large amount of structured and high - quality information is embedded into Internet. However, the information cannot be retrieved by traditional search engine and then it is difficult to mine and make full use of it. In view of this phenomenon, presents a system based on the deep Web information extraction, designs a query .schema based on the Web, and combines some relevant technology of data mining. As a result,can get and mine the information which is in the Deep Web. At the same time, it resolves the traditional drawback of collecting information artificially, enhances the efficiency of the system, avoids the expenses on collection time and the expense, reduces the cost and maintains easily. And it has betel designing in the Yunnan province scientific instrument shared network platform.

Keywords:	Deep Web
本文献已被维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏