首页 | 本学科首页   官方微博 | 高级检索  
     


Abstractive Text Summarization based on Improved Semantic Graph Approach
Authors:Atif Khan  Naomie Salim  Haleem Farman  Murad Khan  Bilal Jan  " target="_blank">Awais Ahmad  Imran Ahmed  Anand Paul
Affiliation:1.Department of Computer Science,Islamia College,Peshawar,Pakistan;2.Faculty of Computing,Universiti Teknologi Malaysia,Johor,Malaysia;3.Department of Computer and IT,Sarhad University of Science and IT,Peshawar,Pakistan;4.Department of Computer Science,FATA University,Dara Adam Khel,Pakistan;5.Department of Information and Communication Engineering,Yeungnam University,Gyeongsan,Republic of Korea;6.Institute of Management Science,Peshawar,Pakistan;7.School of Computer Science and Engineering,Kyugpook National University,Daegu,Republic of Korea
Abstract:The goal of abstractive summarization of multi-documents is to automatically produce a condensed version of the document text and maintain the significant information. Most of the graph-based extractive methods represent sentence as bag of words and utilize content similarity measure, which might fail to detect semantically equivalent redundant sentences. On other hand, graph based abstractive method depends on domain expert to build a semantic graph from manually created ontology, which requires time and effort. This work presents a semantic graph approach with improved ranking algorithm for abstractive summarization of multi-documents. The semantic graph is built from the source documents in a manner that the graph nodes denote the predicate argument structures (PASs)—the semantic structure of sentence, which is automatically identified by using semantic role labeling; while graph edges represent similarity weight, which is computed from PASs semantic similarity. In order to reflect the impact of both document and document set on PASs, the edge of semantic graph is further augmented with PAS-to-document and PAS-to-document set relationships. The important graph nodes (PASs) are ranked using the improved graph ranking algorithm. The redundant PASs are reduced by using maximal marginal relevance for re-ranking the PASs and finally summary sentences are generated from the top ranked PASs using language generation. Experiment of this research is accomplished using DUC-2002, a standard dataset for document summarization. Experimental findings signify that the proposed approach shows superior performance than other summarization approaches.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号