首页 | 本学科首页   官方微博 | 高级检索  
     

SSD模型及其在汉语词性标注中的应用
引用本文:邢富坤,宋柔,罗智勇. SSD模型及其在汉语词性标注中的应用[J]. 中文信息学报, 2010, 24(1): 20-25
作者姓名:邢富坤  宋柔  罗智勇
作者单位:1. 北京语言大学 语言信息处理研究所,北京 100083; 2. 解放军外国语学院,河南 洛阳 471003
基金项目:国家自然科学基金资助项目(60572159,60872121)
摘    要:该文提出了一种以符号解码与数值解码并举的SSD(Symbol-and-Statistics Decoding Model)模型,该模型被用于汉语词性标注任务,其标注正确率在封闭测试中达到97.08%,开放测试中达到95.67%,较二阶HMM的95.56%和94.70%都有较为显著提高。SSD模型的正确率虽然不及最大熵模型和CRF模型,但它的训练时间远少于后者,说明SSD模型在处理自然语言中的特定任务时是一种较强的实用模型。

关 键 词:计算机应用  中文信息处理  SSD模型  HMM  词性标注  

Symbol-and-Statistics Decoding Model and Its Application in Chinese POS Tagging
XING Fukun,SONG Rou,LUO Zhiyong. Symbol-and-Statistics Decoding Model and Its Application in Chinese POS Tagging[J]. Journal of Chinese Information Processing, 2010, 24(1): 20-25
Authors:XING Fukun  SONG Rou  LUO Zhiyong
Affiliation:1. Center of Language Information Processing, Beijing Language and Culture University, Beijing 100083, China;
2. Foreign Languages University of PLA,Luoyang,Henan 471003,China
Abstract:A statistical language model named Symbol-and-Statistics Decoding(SSD) language model is presented in this article.The 2-gram SSD model is applied to the Chinese POS tagging task with a quite good result.The precision is as high as 97.08% in the closed test and 95.67% in the open test is,which are both significantly higher than the HMM at 95.56% and 94.70%,respectively.Although the performance of SSD model is not as good as the conditional models such as Maximum Entropy Model and CRF model,the training time...
Keywords:computer application  Chinese information processing  SSD model  HMM  POS tagging  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号