首页 | 本学科首页   官方微博 | 高级检索  
     


iSpreadRank: Ranking sentences for extraction-based summarization using feature weight propagation in the sentence similarity network
Authors:Jen-Yuan Yeh  Hao-Ren Ke  Wei-Pang Yang  
Affiliation:aDepartment of Computer Science, National Chiao Tung University, Hsinchu 300, Taiwan;bInstitution of Information Management, National Chiao Tung University, Hsinchu 300, Taiwan;cUniversity Library, National Chiao Tung University, Hsinchu 300, Taiwan;dDepartment of Information Management, National Dong Hwa University, Hualien 974, Taiwan
Abstract:Sentence extraction is a widely adopted text summarization technique where the most important sentences are extracted from document(s) and presented as a summary. The first step towards sentence extraction is to rank sentences in order of importance as in the summary. This paper proposes a novel graph-based ranking method, iSpreadRank, to perform this task. iSpreadRank models a set of topic-related documents into a sentence similarity network. Based on such a network model, iSpreadRank exploits the spreading activation theory to formulate a general concept from social network analysis: the importance of a node in a network (i.e., a sentence in this paper) is determined not only by the number of nodes to which it connects, but also by the importance of its connected nodes. The algorithm recursively re-weights the importance of sentences by spreading their sentence-specific feature scores throughout the network to adjust the importance of other sentences. Consequently, a ranking of sentences indicating the relative importance of sentences is reasoned. This paper also develops an approach to produce a generic extractive summary according to the inferred sentence ranking. The proposed summarization method is evaluated using the DUC 2004 data set, and found to perform well. Experimental results show that the proposed method obtains a ROUGE-1 score of 0.38068, which represents a slight difference of 0.00156, when compared with the best participant in the DUC 2004 evaluation.
Keywords:Sentence extraction  Multidocument summarization  Spreading activation  Sentence similarity network  Feature weigh propagation  Social network analysis
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号