首页 | 本学科首页   官方微博 | 高级检索  
     

基于完全二阶隐马尔可夫模型的汉语词性标注
引用本文:梁以敏,黄德根.基于完全二阶隐马尔可夫模型的汉语词性标注[J].计算机工程,2005,31(10):177-179.
作者姓名:梁以敏  黄德根
作者单位:大连理工大学计算机科学与工程系,大连,116023;大连理工大学计算机科学与工程系,大连,116023
基金项目:国家自然科学基金资助项目
摘    要:该文基于隐马尔可夫理论,提出了一种三元词汇概率和词性概率相结合的汉语词性标注模型,并对传统的Viterbi算法进行了扩展。对统计模型中出现的数据稀疏问题,给出了基于线性插值法的平滑算法,实验表明,完全二阶隐马尔可夫模型比标准的二元,三元模型有更高的词性标注正确率和消歧率。

关 键 词:完全二阶隐马尔可夫模型  汉语词性标注  平滑算法  Viterbi算法
文章编号:1000-3428(2005)10-0177-02

Chinese Part-of-speech Tagging Based on Full Second-order Hidden Markov Model
LIANG Yimin,HUANG Degen.Chinese Part-of-speech Tagging Based on Full Second-order Hidden Markov Model[J].Computer Engineering,2005,31(10):177-179.
Authors:LIANG Yimin  HUANG Degen
Abstract:This paper describes an extension to the hidden Markov model for Chinese part-of-speech tagging using second-order approximations for both contextual and lexical probabilities, as well as the traditional Viterbi algorithm is extended. The model makes use of more contextual information than standard statistical models. A smoothing algorithm based on the linear interpolation algorithm is introduced to solve the sparse data problem. The new full second-order HMM is proved to improve Chinese part-of-speech tagging accuracies and disambiguation accuracies over current models.
Keywords:Full second-order hidden Markov model  Chinese part-of-speech tagging  Smoothing algorithm  Viterbi algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号