首页 | 本学科首页   官方微博 | 高级检索  
     

一个全文词义自动标注系统的实现
引用本文:刘挺,卢志茂,李生. 一个全文词义自动标注系统的实现[J]. 哈尔滨工业大学学报, 2005, 37(12): 1603-1605,1649
作者姓名:刘挺  卢志茂  李生
作者单位:哈尔滨工业大学,计算机科学与技术学院,黑龙江,哈尔滨,150001;哈尔滨工业大学,计算机科学与技术学院,黑龙江,哈尔滨,150001;哈尔滨工业大学,计算机科学与技术学院,黑龙江,哈尔滨,150001
基金项目:国家自然科学基金资助重点项目(60435020).
摘    要:为研究在给定上下文中如何确定多义词的词义,介绍了一种无指导的词义消歧技术和一个汉语全文词义标注系统的设计实现过程.该系统基于贝叶斯模型,使用大规模语料进行训练,较好地解决了知识获取中数据稀疏的问题.该系统具有标注正确率高和运行速度快等特点,适合大规模文本的词义标注工作.

关 键 词:词义消歧  自然语言处理  无指导学习算法  贝叶斯模型  依存文法
文章编号:0367-6234(2005)12-1603-03
收稿时间:2004-05-17
修稿时间:2004-05-17

Implement a full- text automatic system for word sense tagging
LIU Ting,LU Zhi-mao,LI Sheng. Implement a full- text automatic system for word sense tagging[J]. Journal of Harbin Institute of Technology, 2005, 37(12): 1603-1605,1649
Authors:LIU Ting  LU Zhi-mao  LI Sheng
Abstract:Word sense disambiguation has been a very active research topic in the NLP field, which studies how to determine which of the senses of an ambiguous word is invoked in a particular context using sense classifiers. This paper presents a technique for unsupervised word sense disambiguation and implements the process of a full - text word sense tagging system. This system performs word sense disambiguation based on the Nave Bayesian Model, uses largescale corpora as training data, and it is able to preferentially conquer the problem of Sparse Data in Knowledge Acquisition. In addition, this system has the characteristics of high accuracy and quick running speed. Thus, this system is competent for word sense tagging on large - scale, real - word text.
Keywords:word sense disambiguation   natural language processing   unsupervised learning algorithm   Nave-Bayesian Model   dependency grammar
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号