首页 | 本学科首页   官方微博 | 高级检索  
     

基于知网语义相关度计算的词义消歧方法
引用本文:王广正,王喜凤.基于知网语义相关度计算的词义消歧方法[J].安徽工业大学学报,2008,25(1):71-75.
作者姓名:王广正  王喜凤
作者单位:安徽工业大学计算机学院,安徵马鞍山243002
基金项目:安徽工业大学计算机学院青年教师科研资助项目
摘    要:歧义字段处理一直是中文信息处理领域中最关键也是最困难的问题之一,至今该问题仍没有得到完全而有效的解决,使得以此为基础的多个应用领域都难以取得突破性进展。传统的消歧方法--规则消歧和统计消歧都有不可避免的缺点:规则消歧存在规则的完备性与合理性问题,统计消歧则只取大概率事件而忽视小概率事件。在研究了知网表达汉语知识的基础上,改进了基于知网语义相关度的计算模型,并应用于汉语的歧义字段处理中。经大量例句作实验,以句子为单位的切分正确率可达到97.1%,验证了该消歧方法的有效性。

关 键 词:汉语自动分词  词义消歧  语义相关度  知网
文章编号:1671-7872(2008)07-0071-05
收稿时间:2007-09-04
修稿时间:2007年9月4日

Word Sense Disambiguating Method Based on HowNet Semantic Relevancy Computation
WANG Guang-zheng,WANG Xi-feng.Word Sense Disambiguating Method Based on HowNet Semantic Relevancy Computation[J].Journal of Anhui University of Technology,2008,25(1):71-75.
Authors:WANG Guang-zheng  WANG Xi-feng
Affiliation:(School of Computer Science, Anhui University of Technology, Ma'anshan 243002, China)
Abstract:As one of the most important and also the most difficult problems of Chinese information processing field, disambiguation has not been entirely solved until now. And because of this problem, some application fields based on it have not achieved breakthrough progress. As the traditional disambiguating methods, rule-based disambiguation and statistic-based disambiguation both have the.it inevitable vice: Rule-based disambiguation has the problem of maturity and rationality of rules, while statistic-based disambiguation only takes affairs with large probability and ignores them with little probability. On the basis of our research on HowNet representing Chinese knowledge, this article has ameliorated the model based on HowNet semantic relevancy computation, and utilizes it to disambiguate. After experiment on lots of examples, the segmenting precision can achieve 97.1% if taking sentence as unit; it shows that the disambiguating method is very good.
Keywords:Chinese automatic word segmentation  word sense disambiguation (WSD)  semantic relevancy  Hownet
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号