首页 | 本学科首页   官方微博 | 高级检索  
     

基于语言模型的有监督词义消歧模型优化研究
引用本文:杨陟卓,黄河燕.基于语言模型的有监督词义消歧模型优化研究[J].中文信息学报,2014,28(1):19-25.
作者姓名:杨陟卓  黄河燕
作者单位:北京理工大学 北京市海量语言信息处理与云计算应用工程技术研究中心,北京 100081;北京理工大学 计算机科学与技术学院,北京 100081
基金项目:国家自然科学基金(61132009);北京理工大学科技创新计划重大项目培育专项计划基金;国防基础基金
摘    要:词义消歧是自然语言领域中重要的研究课题之一。目前,有监督词义消歧方法已经是解决该问题的有效手段。但是,由于缺乏大规模的训练语料,有监督方法还不能取得满意的效果。该文提出一种基于语言模型的词义消歧优化模型,该模型采用语言模型优化传统的有监督消歧模型,充分利用有监督和语言模型两种模型的消歧优势,共同推导歧义词的词义。该模型可以在训练语料不足的情况下,有效的提高词义消歧效果。在真实数据上表明,该方法的消歧性能超过了参加SemEval-2007:task #5评测任务的最好的有监督词义消歧系统。

关 键 词:数据稀疏  模型优化  有监督模型  语言模型  参数估计  

Supervised WSD Model Optimization Based on Language Model
YANG Zhizhuo,HUANG Heyan.Supervised WSD Model Optimization Based on Language Model[J].Journal of Chinese Information Processing,2014,28(1):19-25.
Authors:YANG Zhizhuo  HUANG Heyan
Affiliation:Beijing Engineering Applications Research Center on High Volume Language Information Processing and Cloud Computing, Beijing Institute on Technology, Beijing 100081, China; Department of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
Abstract:Word Sense Disambiguation (WSD) is one of the key issues in natural language processing. Currently, supervised WSD method is an effective way to solve the problem. However, because of the lack of large-scale training data, supervised methods cannot achieve satisfactory results. This paper presents a word sense disambiguation optimization model based on statistical language model, which exploits language model to optimize traditional supervised WSD model. The new model derives the meaning of ambiguous words by taking advantage of the knowledge contained in training data and language model. The model can significantly improve WSD performance when the training data is insufficient. Experimental results show that the optimized model outperformed the best participating system in the SemEval-2007: task #5 evaluation.
Keywords:data sparseness  model optimization  supervised model  language model  parameter estimation  
本文献已被 CNKI 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号