首页 | 本学科首页   官方微博 | 高级检索  
     

一种英文自动摘要方法
引用本文:张燕,赵广社,郭培胜.一种英文自动摘要方法[J].计算机工程与应用,2009,45(7):135-137.
作者姓名:张燕  赵广社  郭培胜
作者单位:1.西安交通大学 自动控制研究所,西安 710049 2.西安交通大学 工业自动化教研室,西安 710049
基金项目:国家自然科学基金,国家重点基础研究发展规划(973计划) 
摘    要:随着在线网页的指数型增长,自动摘要技术越来越受到人们的关注。针对抽取型摘要很少对文本进行语义分析、抽取出的句子可能偏离主题等缺陷,结合单文本摘要的特点,提出了一种英文自动摘要方法TLETS(TF-ISF and LexRank based English Text Summarization)。该方法采用WordNet对向量空间模型的特征词进行概念统计,计算每个概念词的TF-ISF值作为其权值,最后计算每个句子的LexRank权值并提取出权值最高的几个句子作为摘要。实验结果表明,TLETS方法能很好地得到摘要结果。

关 键 词:单文本  摘要  WordNet  向量空间模型  概念统计  
收稿时间:2008-8-29
修稿时间:2008-10-30  

English automatic text summarization
ZHANG Yan,ZHAO Guang-she,GUO Pei-sheng.English automatic text summarization[J].Computer Engineering and Applications,2009,45(7):135-137.
Authors:ZHANG Yan  ZHAO Guang-she  GUO Pei-sheng
Affiliation:1.Institute of Automatic Control,Xi’an Jiaotong University,Xi’an 710049,China 2.Department of Industrial Automation,Xi’an Jiaotong University,Xi’an 710049,China
Abstract:With the growing presence of large amounts of online text,more and more people are interested in automatic text summarization.Most of previous summarizing methods are based on word counting,which miss deep semantic analysis of texts and may be unrelated to the topic,so the extracted summarization is unsatisfying.According to the properties of single document summarization,this paper puts forward an English automatic text summarization method ——TLETS(TF-ISF and LexRank based English Text Summarization).It makes use of WordNet to count concept based on the Vector Space Model(VSM).Since it deals with single document,the VSM of the document is established by TF-ISF model.The LexRank value is counted and the sentences with the best values are extracted.The experiment results show that TLETS method can get better summarization.
Keywords:WordNet
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号