首页 | 本学科首页   官方微博 | 高级检索  
     

基于最大熵原理的汉语词义消歧
引用本文:陈笑蓉,秦进.基于最大熵原理的汉语词义消歧[J].计算机科学,2005,32(5):174-176.
作者姓名:陈笑蓉  秦进
作者单位:贵州大学信息与计算机科学学院,贵阳,550025;贵州大学信息与计算机科学学院,贵阳,550025
摘    要:词义消歧是自然语言处理中亟待解决的一个关键问题,本文提出一种基于最大熵模型的有监督的机器学习方法,用于汉语词义消歧。该方法综合了词标记、词性、主题等上下文特征,并用一种统一的表示方法规范化特征形式,解决了多种不同特征之间的融合和特征的知识表示。实验对20个汉语高频多义词进行了测试,平均正确率为87%,验证了该方法的有效性。

关 键 词:词义消歧  最大熵模型  有监督机器学习

Maximum Entropy-Based Chinese Word Sense Disambiguation
CHEN Xiao-rong,QIN Jin.Maximum Entropy-Based Chinese Word Sense Disambiguation[J].Computer Science,2005,32(5):174-176.
Authors:CHEN Xiao-rong  QIN Jin
Affiliation:CHEN Xiao-Rong,QIN Jin College of Information Computer Science,Guizhou University,Guiyang 550025
Abstract:Word sense disambiguation is a crucial problem to be solved in NLP. A supervised machine learning method is proposed in this paper, which is applied in word disambiguation in Chinese. The method combines various features in context to disambiguate word senses. The features include annotations of words, parts of speech and sub- jects etc. And a uniform representation formalizes the features. In this way, the problem of synthesis among various features and knowledge representation of feature will be solved. 20 Chinese polysemous words are tested in our exper- iment. The result with average precision 87% shows that the method is effective.
Keywords:Word sense disambiguation  Maximum entropy models  Supervised machine learning
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号