
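Research on Regional Microblog Summarization Based on the Suffix Tree Algorithm
Cite this article: GAO Yongbing, ZHANG Guijuan, HU Wenjiang, MA Zhanfei. Research on regional microblog summarization based on the suffix tree algorithm[J]. Computer Engineering and Applications, 2018, 54(9): 126-132.
Authors: GAO Yongbing  ZHANG Guijuan  HU Wenjiang  MA Zhanfei
Affiliations: 1. School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou, Inner Mongolia 014010  2. Department of Computer, Baotou Teachers College, Baotou, Inner Mongolia 014010
Abstract: Regional official microblogs contain a large amount of information about local events, and aggregating their data makes it possible to discover the important events in a region. Drawing on the characteristics of regional microblog data, namely regional nicknames, multiple administrative levels, and prominent regional label attributes, this paper proposes a regional microblog summarization technique based on the suffix tree algorithm. A regional weight tree and HowNet are used to preprocess the regional microblog data, replacing words of similar meaning with a single unified term. The Suffix Tree Clustering (STC) algorithm and Singular Value Decomposition (SVD) are then used to cluster the regional microblogs. Finally, the microblogs are given a composite score based on their regional features, and representative microblog sentences are selected to generate the summary. Experiments verify the feasibility of the method and show that it identifies local events well and generates highly readable event summaries.

Keywords: regional microblog  regional weight tree  HowNet  suffix tree clustering  summarization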

Research of regional microblog summarization based on Suffix Tree Clustering algorithm
GAO Yongbing,ZHANG Guijuan,HU Wenjiang,MA Zhanfei.Research of regional microblog summarization based on Suffix Tree Clustering algorithm[J].Computer Engineering and Applications,2018,54(9):126-132.
Authors:GAO Yongbing  ZHANG Guijuan  HU Wenjiang  MA Zhanfei
Affiliation:1.School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou, Inner Mongolia 014010, China 2.Department of Computer, Baotou Teachers College, Baotou, Inner Mongolia 014010, China
Abstract:Regional official microblogs contain a large amount of information about local events, and aggregating their data can reveal the important events in a region. Based on the features of regional microblog data, such as regional nicknames, multiple administrative levels, and prominent regional label attributes, a regional microblog summarization method based on the Suffix Tree Clustering (STC) algorithm is proposed. The data are first preprocessed with a regional weight tree and HowNet so that words with similar meanings are replaced by a unified term. Clusters are then generated using Suffix Tree Clustering and Singular Value Decomposition (SVD). Finally, the regional microblogs are scored comprehensively according to their features, and representative microblog sentences are selected as the summary. The experiments verify the feasibility of the proposed method, which can effectively identify local events and generate highly readable event summaries.
Keywords:regional microblog  regional weight tree  HowNet  Suffix Tree Clustering(STC)  summarization
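
The clustering step named in the abstract can be illustrated with a minimal sketch of the general Suffix Tree Clustering idea: posts that share a phrase form a base cluster, and base clusters whose post sets overlap by more than half in both directions are merged. This is not the authors' implementation. It assumes whitespace-tokenized text, enumerates word n-grams in place of a real suffix tree, and omits the regional weight tree, HowNet substitution, SVD, and sentence scoring. The function names, the `max_len`, `min_docs`, and `threshold` parameters, and the sample posts are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def base_clusters(docs, max_len=3, min_docs=2):
    """Map each shared phrase (tuple of words) to the set of post indices containing it.

    Enumerating n-grams up to max_len words stands in for the suffix tree;
    a phrase is kept only if at least min_docs posts contain it.
    """
    phrase_docs = defaultdict(set)
    for idx, doc in enumerate(docs):
        words = doc.split()
        for n in range(1, max_len + 1):
            for i in range(len(words) - n + 1):
                phrase_docs[tuple(words[i:i + n])].add(idx)
    return {p: d for p, d in phrase_docs.items() if len(d) >= min_docs}


def merge_clusters(clusters, threshold=0.5):
    """Merge base clusters whose post sets overlap by more than threshold both ways."""
    phrases = list(clusters)
    parent = {p: p for p in phrases}

    def find(p):
        # Union-find lookup with path halving.
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    for a, b in combinations(phrases, 2):
        common = len(clusters[a] & clusters[b])
        if (common / len(clusters[a]) > threshold
                and common / len(clusters[b]) > threshold):
            parent[find(a)] = find(b)

    # Collect the posts of every connected group of base clusters.
    merged = defaultdict(set)
    for p in phrases:
        merged[find(p)] |= clusters[p]
    return sorted((sorted(d) for d in merged.values()),
                  key=lambda d: (-len(d), d))


# Hypothetical sample posts, invented purely for illustration.
sample_posts = [
    "baotou marathon opens sunday",
    "baotou marathon route announced",
    "heavy snow closes highway",
    "heavy snow warning issued",
]
event_clusters = merge_clusters(base_clusters(sample_posts))
print(event_clusters)
```

On the sample posts, the two marathon posts and the two snow posts each end up in their own cluster. A fuller pipeline would first normalize regional nicknames and synonyms so that more posts share phrases, then score the sentences inside each cluster to pick the summary.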