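Research on Just-in-Time Focused News Acquisition Technology (即时定向新闻采集技术研究)
Cite this article: WANG Xin, HUANG Sui, LONG Shun. 即时定向新闻采集技术研究 [J]. 计算机工程与科学 (Computer Engineering & Science), 2012, 34(9): 180-183.
Authors: WANG Xin (王辛), HUANG Sui (黄穗), LONG Shun (龙舜)
Affiliations: 1. Department of Computer Science, Jinan University, Guangzhou, Guangdong 510632
2. Department of Computer Science, Jinan University, Guangzhou, Guangdong 510632; Guangdong Emergency Technology Research Center of Risk Evaluation and Prewarning on Public Network Security, Guangzhou, Guangdong 510632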
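Abstract (translated from the Chinese): The rapid development of the Internet has driven an explosive increase in the amount of information. How to collect the required information faster has long been a focus of research and development in China and abroad. In recent years, growing demand for specific information (for example, news in a particular domain) has called for targeted, just-in-time acquisition of relevant information from designated websites. Such news is generally unpredictable, frequently updated, and highly time-sensitive, so acquisition must be both just-in-time and focused to match these characteristics. This paper proposes an effective method for fetching and analyzing web pages; practice shows that it achieves satisfactory results.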

Keywords (translated from the Chinese): news acquisition; crawler; just-in-time

An Efficient Approach to Just-In-Time Focused News Acquisition
WANG Xin, HUANG Sui, LONG Shun. An Efficient Approach to Just-In-Time Focused News Acquisition [J]. Computer Engineering & Science, 2012, 34(9): 180-183.
Authors: WANG Xin, HUANG Sui, LONG Shun
Affiliation: 1. Department of Computer Science, Jinan University, Guangzhou 510632, China; 2. Emergency Technology Research Center of Risk Evaluation and Prewarning on Public Network Security, Guangzhou 510632, China
Abstract: The rapid development of the Internet has led to an explosive increase in the amount of information. How to collect the required information quickly has been a hot topic in both industry and research. In recent years, the growing demand for specific information (such as news on specific topics) has required that information be acquired from specified sites in a just-in-time manner. However, such news is generally unpredictable, frequently updated, and highly time-sensitive, and is therefore difficult to acquire just in time. This paper proposes a novel approach to tackle this problem, whose efficiency has been demonstrated in practice.
Keywords: news acquisition; crawler; just-in-time
This article is indexed in CNKI, Wanfang Data, and other databases.