首页 | 本学科首页   官方微博 | 高级检索  
     


Text summarization using unsupervised deep learning
Affiliation:1. Department of Computer Science and Engineering, G. H. Raisoni College of Engineering, Nagpur, Maharashtra, India;2. Department of Computer Science and Engineering, Yeshwantrao Chavhan College of Engineering, Nagpur, Maharashtra, India;1. School of Automation, Northwestern Polytechnical University, Xi’an, China;2. Jiangsu Provincial Key Laboratory of E-Business, Nanjing University of Finance and Economics, Nanjing, China;1. C.U. Shah University-Wadhwan, Surendranagar, Gujarat, India;2. Adani Institute of Infrastructure Engineering, Ahmedabad, Gujarat, India
Abstract:We present methods of extractive query-oriented single-document summarization using a deep auto-encoder (AE) to compute a feature space from the term-frequency (tf) input. Our experiments explore both local and global vocabularies. We investigate the effect of adding small random noise to local tf as the input representation of AE, and propose an ensemble of such noisy AEs which we call the Ensemble Noisy Auto-Encoder (ENAE). ENAE is a stochastic version of an AE that adds noise to the input text and selects the top sentences from an ensemble of noisy runs. In each individual experiment of the ensemble, a different randomly generated noise is added to the input representation. This architecture changes the application of the AE from a deterministic feed-forward network to a stochastic runtime model. Experiments show that the AE using local vocabularies clearly provide a more discriminative feature space and improves the recall on average 11.2%. The ENAE can make further improvements, particularly in selecting informative sentences. To cover a wide range of topics and structures, we perform experiments on two different publicly available email corpora that are specifically designed for text summarization. We used ROUGE as a fully automatic metric in text summarization and we presented the average ROUGE-2 recall for all experiments.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号