首页 | 本学科首页   官方微博 | 高级检索  
     


Multi-documents Automatic Abstracting based on text clustering and semantic analysis
Authors:Qinglin Guo  Ming Zhang  
Affiliation:aDepartment of Computer Science and Technology, Peking University, Beijing 100871, China;bSchool of Computer Science and Technology, North China Electric Power University, Beijing 102206, China
Abstract:A method of realization of multi-documents Automatic Abstracting based on text clustering and semantic analysis is brought forward, aimed at overcoming shortages of some current methods about multi-documents. The method makes use of semantic analysis and can realize Automatic Abstracting of multi-documents. The algorithm of twice word segmentation based on the title and first-sentences in paragraphs is brought forward. Its precision and recall is above 95%. For a specific domain on plastics, an Automatic Abstracting system named TCAAS is implemented. The precision and recall of multi-document’s Automatic Abstracting is above 75%. And experiments do prove that it is feasible to use the method to develop a domain Automatic Abstracting system, which is valuable for further study in more depth.
Keywords:Semantic analysis   Automatic Abstracting   Multi-documents   Text clustering   Natural language understanding
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号