首页 | 本学科首页   官方微博 | 高级检索  
     

一种新颖的词性标注模型
引用本文:袁里驰,钟义信. 一种新颖的词性标注模型[J]. 微电子学与计算机, 2005, 22(9): 1-2,6
作者姓名:袁里驰  钟义信
作者单位:北京邮电大学信息工程学院,北京,100876
基金项目:国家自然科学基金,国家高技术研究发展计划(863计划)
摘    要:文章首次提出一种统计模型,即马氏族模型,该模型假定一个词出现概率既与当前词的词性标记有关,也与它前面的词有关,但其前面的词和该词词性标记关于该词条件独立.将马氏族模型适当加以简化,能成功地用于词性标记,实验结果证明:在相同的测试条件下,这种基于马氏族模型的词性标注方法标记成功率大大高于传统的基于隐马尔可夫模型的词性标注方法.马氏族模型在其它一些自然语言处理领域如分词、句法分析、语音识别、机器翻译也有广泛的应用前景.

关 键 词:马氏族模型  词性标注  隐马尔可夫模型  Viterbi算法
文章编号:1000-7180(2005)09-001-02
收稿时间:2005-01-17
修稿时间:2005-01-17

A Novel POS Tagging Model
YUAN Li-chi,ZHONG Yi-xin. A Novel POS Tagging Model[J]. Microelectronics & Computer, 2005, 22(9): 1-2,6
Authors:YUAN Li-chi  ZHONG Yi-xin
Abstract:In this paper, the Markov Family Model, a kind of statistical Models was firstly introduced. Under the assumption that the probability of a word depends both on its own tag and previous word, but its own tag and previous word are independent if the word is known, we simplify the Markov Family Model and use for part-of-speech tagging successfully. Experimental results show that this part-of-speech tagging method based on Markov Family Model has greatly improved the precision comparing the conventional POS tagging method based on Hidden Markov Model under the same testing conditions. The Markov Family Model is also very useful in other natural language processing technologies such as word segmentation, statistical parsing, text-to-speech, optical character recognition, etc.
Keywords:Markov family model   Part-of-speech tagging   Hidden markov model   Viterbi algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号