一种并行Web信息采集系统模型 Parallel system model of Web information retrieval期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种并行Web信息采集系统模型

引用本文：	杨天奇,周晔. 一种并行Web信息采集系统模型[J]. 计算机应用, 2007, 27(1): 225-227

作者姓名：	杨天奇周晔

作者单位：	暨南大学,计算机科学系,广东,广州,510632

摘要：	根据国内外在信息采集领域的发展以及并行采集技术的研究，提出了一个基于多线程并行的Web信息采集结构模型，该模型以线程并行的方式对Web页面同时采集，实现了全面、高效并且灵活的信息搜集。
关键词：	并行Web 信息采集搜索引擎
文章编号：	1001-9081（2007）01-0225-03
收稿时间：	2006-07-03
修稿时间：	2006-06-30
Parallel system model of Web information retrieval

YANG Tian-qi,ZHOU Ye. Parallel system model of Web information retrieval[J]. Journal of Computer Applications, 2007, 27(1): 225-227

Authors:	YANG Tian-qi ZHOU Ye

Affiliation:	Department of Computer Science, Jinan University, Guangzhou Guangdong 510632, China

Abstract:	Based on the long-time accumulation in the field of Web crawling,and combining the current developing technologies on parallel Web crawling,this article put forward a structure design model of the parallel incremental Web crawler.In order to download Web pages parallelly,we adopted means of multiple thread that can effectively improve information gathering performance.

Keywords:	parallel Web information gathering search engine
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《计算机应用》浏览原始摘要信息
	点击此处可从《计算机应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏