首页 | 本学科首页   官方微博 | 高级检索  
     

基于数据流的网页内容分析技术研究
引用本文:王佰玲,曲芸,张永铮,田志宏.基于数据流的网页内容分析技术研究[J].电子学报,2013,41(4):751-756.
作者姓名:王佰玲  曲芸  张永铮  田志宏
作者单位:1. 哈尔滨工业大学计算机科学与技术学院,黑龙江哈尔滨 150001;2. 北京大学信息科学与技术学院,北京 100871;3. 中国科学院计算技术研究所,北京 100190
摘    要:提出针对网络数据流中活跃信息进行话题相关数据采集与分析方法.首先给出面向论坛话题的定义;然后对网络数据流进行分析、对用户访问行为进行分类;并给出基于数据流的用户行为识别方法及话题相关数据抽取、存储算法;最后给出实验分析,结果表明,所提出的基于数据流的论坛话题数据采集方法能够很好地反映用户行为,并对基于数据流的网络舆情热点话题发现、突发事件检测与实时跟踪等应用提供有利的数据资源.

关 键 词:网络舆情  热点话题  突发事件  网络数据流  
收稿时间:2011-09-10

Research on Network-Traffic Based Web Traffic Computing Technology
WANG Bai-ling , QU Yun , ZHANG Yong-zheng , TIAN Zhi-hong.Research on Network-Traffic Based Web Traffic Computing Technology[J].Acta Electronica Sinica,2013,41(4):751-756.
Authors:WANG Bai-ling  QU Yun  ZHANG Yong-zheng  TIAN Zhi-hong
Affiliation:1. School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China;2. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;3. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
Abstract:In this paper,a network-traffic based topic extracting and analyzing method is introduced.The new topic definition for web2.0 and the classification of user behavior is given;the detecting method of user behavior,topic extracting method,and data storage algorithm is also proposed.At last,a prototype of topic collector based on network traffic is implemented;the testing results show that the user behavior and the hot topic can be collected and detected effectively and correctly,and the new method provides a new data channel for analyzing public opinion.
Keywords:public opinion  hot topic  emergent event  network traffic
本文献已被 万方数据 等数据库收录!
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号