首页 | 本学科首页   官方微博 | 高级检索  
     

利用组合模型生成微博热点话题事件摘要
引用本文:戴天,吴渝,雷大江.利用组合模型生成微博热点话题事件摘要[J].计算机应用研究,2016,33(7).
作者姓名:戴天  吴渝  雷大江
作者单位:重庆邮电大学网络智能研究所 重庆 400065,重庆邮电大学网络智能研究所 重庆 400065,重庆邮电大学网络智能研究所 重庆 400065
基金项目:重庆教委科学技术研究项目(KJ130527);重庆市自然科学基金(CSTC,2014jcyjA40049);国家级大学生创新创业训练计划项目(Grant No. 201310617003)
摘    要:针对微博热点话题检测使用主题模型只能提取出无序话题词组合的问题,提出一种结合词激活力模型与主题模型各自优点的微博热点话题检测方法及话题关键词的计算方法。首先,使用传统的主题模型提取出微博文本中的热点主题;其次,根据各主题下文档的概率分布提取出新的话题文档;然后引入词激活力模型计算各个词之间的词激活力,生成词激活力矩阵;最后,利用词激活力矩阵生成有序的词序列作为热点事件。实验验证了该方法的可行性,表明所提出方法能够很好地识别出热点词并生成可读性高的事件。

关 键 词:微博    话题检测    潜在狄利克雷分布  词激活力
收稿时间:3/9/2015 12:00:00 AM
修稿时间:2016/5/12 0:00:00

Hot topic summarization on microblog generated by model combination
DAI Tian,WU Yu and LEI Dajiang.Hot topic summarization on microblog generated by model combination[J].Application Research of Computers,2016,33(7).
Authors:DAI Tian  WU Yu and LEI Dajiang
Affiliation:Institute of Web Intelligence,Chongqing University of Posts and Telecommunications,Institute of Web Intelligence,Chongqing University of Posts and Telecommunications,Institute of Web Intelligence,Chongqing University of Posts and Telecommunications
Abstract:To solve the problem that microblog hot topic detection based on topic model can only extract disorderly words combinations, a hot topic detection method on microblog combined with the advantage of word active force model and topic model, as well as its calculation method of Keywords, is proposed. Firstly, this approach extracts hot topic on microblog through topic model. Secondly, new documents are extracted according to the probability distribution of documents under each topic. Then, the word active matrix is generated by word active model. Finally, an orderly sequence of words as hot topic is generated by word active matrix. The experiments prove the feasibility of the proposed method which can effectively identify topic Keywordsand generate events with high readability.
Keywords:microblog  topic detection  latent Dirichlet allocation  word active force
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号