
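Chinese Keyword Extraction Algorithm Based on Synonym Chains (基于同义词链的中文关键词提取算法)
Cite this article: 张颖颖,谢强,丁秋林.基于同义词链的中文关键词提取算法[J].计算机工程,2010,36(19):93-95.
Authors: 张颖颖 (ZHANG Ying-ying)  谢强 (XIE Qiang)  丁秋林 (DING Qiu-lin)
Affiliation: College of Information Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016
Abstract: Traditional Chinese keyword extraction pays little attention to semantics and synonyms, which leads to low precision and recall. To address this, a Chinese keyword extraction algorithm based on synonym chains is proposed. A context window and a disambiguation algorithm resolve the meaning of words in context, and the synonyms in the document are used to build synonym chains, which simplifies the selection of candidate words. A weight formula derived from the characteristics of the synonym chains is then used to filter the candidate words. Experimental results show that the algorithm considerably improves precision and recall on documents that contain many synonyms, and that average performance also improves noticeably.

Keywords: keyword extraction  synonym chains  semantics  disambiguation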

Chinese Keyword Extraction Algorithm Based on Synonym Chains
ZHANG Ying-ying,XIE Qiang,DING Qiu-lin.Chinese Keyword Extraction Algorithm Based on Synonym Chains[J].Computer Engineering,2010,36(19):93-95.
Authors:ZHANG Ying-ying  XIE Qiang  DING Qiu-lin
Affiliation:(College of Information Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China)
Abstract: To address the low precision and recall of traditional Chinese keyword extraction, which result from neglecting semantics and synonyms, a Chinese keyword extraction algorithm based on synonym chains is proposed. The algorithm uses a context window and a word sense disambiguation algorithm to resolve the meaning of words in context. Synonym chains are built from the synonyms in the document, which simplifies the selection of candidate words, and a weight formula derived from the characteristics of the synonym chains is used to filter the candidate words. Experimental results show that the proposed algorithm substantially improves precision and recall on documents containing many synonyms, and that average performance also improves noticeably.
Keywords: keyword extraction  synonym chains  semantics  disambiguation
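
The abstract does not give the chain construction or the weight formula, so the following is only a minimal sketch of the general idea of grouping a document's words into synonym chains and ranking the chains. Everything in it is an assumption for illustration: the toy thesaurus `TOY_SYNSETS`, the function names `build_synonym_chains` and `rank_chains`, and the placeholder weight (total occurrences of a chain's members), which is not the formula from the paper. The input tokens are assumed to be already segmented and sense-disambiguated, so the context-window step is not shown.

```python
from collections import Counter, defaultdict

# Hypothetical toy thesaurus: word -> synonym-set id (an assumption,
# standing in for a real synonym dictionary).
TOY_SYNSETS = {
    "computer": "S1",
    "PC": "S1",
    "algorithm": "S2",
    "method": "S2",
}


def build_synonym_chains(tokens, synsets):
    """Group tokens that share a synonym-set id into one chain.

    Returns a dict: chain key -> Counter of member words.
    Words missing from the thesaurus form single-word chains.
    """
    chains = defaultdict(Counter)
    for tok in tokens:
        key = synsets.get(tok, tok)  # fall back to the word itself
        chains[key][tok] += 1
    return chains


def rank_chains(chains, top_k):
    """Rank chains by a placeholder weight and return (word, weight) pairs.

    The weight here is simply the total number of occurrences of all
    chain members; the paper's actual weight formula is not reproduced.
    Each chain is represented by its most frequent member.
    """
    scored = []
    for members in chains.values():
        weight = sum(members.values())
        representative = max(members, key=members.get)
        scored.append((representative, weight))
    # Sort by descending weight, then alphabetically for stable output.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:top_k]


if __name__ == "__main__":
    tokens = ["computer", "algorithm", "PC", "computer",
              "method", "keyword", "algorithm", "machine"]
    chains = build_synonym_chains(tokens, TOY_SYNSETS)
    print(rank_chains(chains, top_k=2))
```

Pooling counts across a chain means that a concept expressed through several synonyms is not penalized relative to one that is always written the same way, which is the effect the abstract attributes to synonym chains on documents with many synonyms.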
This paper is indexed in databases including 维普 (VIP) and 万方数据 (Wanfang Data).