首页 | 本学科首页   官方微博 | 高级检索  
     

基于语料库与层次词典的自动文摘研究
引用本文:宋今,赵东岩. 基于语料库与层次词典的自动文摘研究[J]. 软件学报, 2000, 11(3): 308-314
作者姓名:宋今  赵东岩
作者单位:北京大学计算机科学与技术系,北京,100871
摘    要:自动文摘研究作为自然语言处理研究的一个重要且实用的分支,目前逐渐成为Internet信息检索等应用领域的重要研究课题之一.该文提出的基于语料库的文摘试图将传统的基地语言学分析的文摘方法和基于统计的文摘方法的优点结合在一起.基于语料库的文摘方法的实质即以系统外的分析代价换取系统内的算法效率.该文描述的算法给出了基于层次词典的关键字提取和基于语料库的自动文摘的实现.

关 键 词:自动文摘,语料库,关键字提取,层次词典.
收稿时间:1997-12-24
修稿时间:1998-10-06

Study of Automatic Abstracting Based on Corpus and Hierarchical Dictionary
SONG Jin and ZHAO Dong-yan. Study of Automatic Abstracting Based on Corpus and Hierarchical Dictionary[J]. Journal of Software, 2000, 11(3): 308-314
Authors:SONG Jin and ZHAO Dong-yan
Affiliation:Department of Computer Science and Technology Beijing University Beijing 100871
Abstract:The study of automatic abstracting is a vital and practical information processing task in natural language processing,and becomes an important problem in domains such as Internet information retrieval.An approach based on corpus proposed by this paper provides an integration of the advantages of linguistic analysis based methods and those based on statistics.In essence,the basic idea of corpus-based method is at the expense of the cost of analysis outside the system to gain the efficiency of the algorithm inside the system.The algorithm given by the paper implements both keywording and abstracting while the former is based on a hierarchical dictionary and the latter on the corpus.
Keywords:Automatic abstracting  corpus  keywording  hierarchical dictionary.
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号