首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于局部—全局主题关系的演化式摘要系统
引用本文:吴仁守,刘凯,王红玲.一种基于局部—全局主题关系的演化式摘要系统[J].中文信息学报,2018,32(9):75-83.
作者姓名:吴仁守  刘凯  王红玲
作者单位:苏州大学 计算机科学与技术学院,江苏 苏州 215006
基金项目:国家自然科学基金(61402314)
摘    要:带有时间标志的演化式摘要是近年来提出的自然语言处理任务,其本质是多文档自动文摘,它的研究对象是互联网上连续报道的热点新闻文档。针对互联网新闻事件报道的动态演化、动态关联和信息重复等特点,该文提出了一种基于局部—全局主题关系的演化式摘要方法,该方法将新闻事件划分为多个不同的子主题,在考虑时间演化的基础上同时考虑子主题之间的主题演化,最后将新闻标题作为摘要输出。实验结果表明,该方法是有效的,并且在以新闻标题作为输入输出时,和当前主流的多文档摘要和演化摘要方法相比,在Rouge评价指标上有显著提高。

关 键 词:主题关系  PageRank  演化式摘要  多文档文摘  

An Evolutionary Summarization System Based on Local-global Topic Relationship
WU Renshou,LIU Kai,WANG Hongling.An Evolutionary Summarization System Based on Local-global Topic Relationship[J].Journal of Chinese Information Processing,2018,32(9):75-83.
Authors:WU Renshou  LIU Kai  WANG Hongling
Affiliation:Institute of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China
Abstract:Evolutionary timeline summarization (ETS) for Internet News Event is a new task in natural language processing,which is a kind of multi-document summarization (MDS) in essence. According to the features of dynamic evolution,content relevance and information redundancy of Internet news event,this paper puts forward an evolutionary summarization method basing on local and global topic relations. First,the news event is divided into a number of different sub-topics. In the meantime,the basis of time evolution and the topic evolution between sub-topics are considered. Finally,headlines are extracted as summary. The experimental results show that this method is effective. Especially using news headlines as inputs or outputs brings significant improvements in the Rouge evaluation,compared with current popular method of multi-document summarization and evolution summarization.
Keywords:topic relation  PageRank  evolutionary timeline summarization  multi-document summarization  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号