Chinese word sense disambiguation directed by syntactic information
Cite this article: ZHANG Chunxiang, LUAN Bo, GAO Xueyao, LU Zhimao. Chinese word sense disambiguation directed by syntactic information[J]. Computer Engineering and Applications, 2015, 51(5): 142-145.
Authors: ZHANG Chunxiang, LUAN Bo, GAO Xueyao, LU Zhimao
Affiliation: 1. School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China; 2. School of Software, Harbin University of Science and Technology, Harbin 150080, China; 3. College of Information and Communication Engineering, Harbin Engineering University, Harbin 150001, China
Funding: Science and Technology Research Project of the Education Department of Heilongjiang Province (No.12531106).
Abstract: Word sense disambiguation is the task of having a computer determine which sense of a polysemous word is intended in a given context. It plays an important role in natural language processing problems such as information retrieval, machine translation, text classification and automatic summarization. This paper proposes a new word sense disambiguation method that introduces syntactic information. A parse tree is built for the context of the ambiguous word, and syntactic information, part-of-speech information and word-form information are extracted from it as disambiguation features. A Bayesian model is used to build the word sense disambiguation classifier, which is then applied to a test data set. Experimental results show that disambiguation accuracy improves, reaching 65%.

Keywords: word sense disambiguation; syntactic information; disambiguation features; Bayesian model
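
The abstract names the ingredients of the method (syntactic, part-of-speech and word-form features fed to a Bayesian classifier) but not the exact feature templates or model details. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes a multinomial naive Bayes model with add-one smoothing, a window of one word on each side, and a simplified syntactic feature (the label of the lowest phrase covering the ambiguous word). The names `extract_features` and `NaiveBayesWSD`, the sense labels, and the four training sentences are all hypothetical and invented for illustration.

```python
import math
from collections import Counter, defaultdict


def extract_features(tokens, pos_tags, phrase_labels, target_index):
    """Build string features for the ambiguous word at target_index.

    phrase_labels[i] is assumed to hold the label of the lowest phrase
    node covering token i in a parse tree (a simplification for this sketch).
    """
    features = [f"syn={phrase_labels[target_index]}"]
    for offset in (-1, 1):
        i = target_index + offset
        if 0 <= i < len(tokens):
            features.append(f"word[{offset}]={tokens[i]}")  # word-form feature
            features.append(f"pos[{offset}]={pos_tags[i]}")  # part-of-speech feature
    return features


class NaiveBayesWSD:
    """Multinomial naive Bayes over string features with additive smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha
        self.sense_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocabulary = set()

    def fit(self, samples):
        for features, sense in samples:
            self.sense_counts[sense] += 1
            for f in features:
                self.feature_counts[sense][f] += 1
                self.vocabulary.add(f)
        return self

    def log_scores(self, features):
        """Return log P(sense) + sum of log P(feature | sense) for each sense."""
        total = sum(self.sense_counts.values())
        vocab_size = len(self.vocabulary)
        scores = {}
        for sense, count in self.sense_counts.items():
            denom = sum(self.feature_counts[sense].values()) + self.alpha * vocab_size
            score = math.log(count / total)
            for f in features:
                if f not in self.vocabulary:
                    continue  # features never seen in training are ignored
                score += math.log((self.feature_counts[sense][f] + self.alpha) / denom)
            scores[sense] = score
        return scores

    def predict(self, features):
        scores = self.log_scores(features)
        return max(scores, key=scores.get)


# Invented toy data for the ambiguous verb "打" ("play" vs. "hit").
# Each entry: tokens, POS tags, phrase labels, index of the target word, sense.
training_sentences = [
    (["他", "打", "篮球"], ["r", "v", "n"], ["NP", "VP", "NP"], 1, "play"),
    (["我们", "打", "排球"], ["r", "v", "n"], ["NP", "VP", "NP"], 1, "play"),
    (["他", "打", "人"], ["r", "v", "n"], ["NP", "VP", "NP"], 1, "hit"),
    (["不要", "打", "孩子"], ["d", "v", "n"], ["ADVP", "VP", "NP"], 1, "hit"),
]
samples = [
    (extract_features(tokens, tags, labels, idx), sense)
    for tokens, tags, labels, idx, sense in training_sentences
]
model = NaiveBayesWSD().fit(samples)

test_features = extract_features(
    ["她", "打", "篮球"], ["r", "v", "n"], ["NP", "VP", "NP"], 1
)
predicted = model.predict(test_features)
print(predicted)
```

On this toy data the right-hand word "篮球" has only been seen with the "play" sense, so that sense receives the higher score. In a realistic setting the phrase labels and part-of-speech tags would come from a parser and tagger run on the sentence, and richer syntactic features would presumably be needed to reproduce anything like the reported accuracy.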
This article is indexed in Wanfang Data and other databases.