
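An Effective Method for Resolving Ambiguous Segmentation in Chinese
Cite this article: ZHU Jian, ZHANG Jian, LI Miao. An effective method for resolving ambiguous segmentation in Chinese [J]. Computer Engineering and Applications, 2007, 43(11): 175-178.
Authors: ZHU Jian  ZHANG Jian  LI Miao
Affiliations: [1] Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei 230031 [2] School of Information Science and Technology, University of Science and Technology of China, Hefei 230027
Funding: Knowledge Innovation Program of the Chinese Academy of Sciences
Abstract: This paper proposes a multi-layer filtering method that combines a directed graph with statistics and rules to resolve overlapping (intersection-type) ambiguity in Chinese word segmentation; the authors report that it markedly improves segmentation accuracy. In an open test on a 65,000-character corpus, the method segmented overlapping ambiguous strings with an accuracy of 98.43%, which indicates that it is effective for this type of ambiguity.

Keywords: directed graph  statistical model  rule base  ambiguous string  Chinese character segmentation
Article ID: 1002-8331(2007)11-0175-03
Received: 2006-05-12
Revised: 2006-09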

An Effective Method on Resolve Chinese Ambiguous Segmentation
ZHU Jian, ZHANG Jian, LI Miao. An Effective Method on Resolve Chinese Ambiguous Segmentation[J]. Computer Engineering and Applications, 2007, 43(11): 175-178.
Authors:ZHU Jian  ZHANG Jian  LI Miao
Affiliation:1. Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei 230031, China; 2. School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, China
Abstract:This paper presents a method for resolving overlapping ambiguity in Chinese word segmentation. The method combines a directed graph with statistical and rule-based filtering. In an open test on a Chinese corpus of 65,000 characters, the segmentation accuracy for overlapping ambiguous phrases reaches 98.43%, which indicates that the method is effective for this type of ambiguity.
Keywords:directed graph  statistical model  rule library  ambiguous phrase  Chinese word segmentation
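
The abstract names the ingredients (a directed graph, a statistical model, a rule library) without giving details. As a rough illustration only, and not the authors' implementation, the sketch below builds a word graph over a sentence, checks whether it contains an overlapping ambiguity, and picks the path with the highest unigram score. The lexicon, its counts, and all function names are hypothetical, and the rule-based filtering layer is omitted.

```python
import math

# Toy lexicon with made-up counts (an assumption for illustration, not data from the paper).
LEXICON = {
    "结合": 50, "合成": 30, "成分": 40, "分子": 60,
    "结": 5, "合": 5, "成": 20, "分": 5, "子": 5,
}
TOTAL = sum(LEXICON.values())
MAX_WORD_LEN = max(len(w) for w in LEXICON)


def build_word_graph(text):
    """Return edges[i]: end positions j such that text[i:j] is a candidate word.

    Nodes are character boundaries. A single character is always a candidate,
    so at least one path through the graph exists.
    """
    n = len(text)
    edges = {i: [] for i in range(n)}
    for i in range(n):
        for j in range(i + 1, min(n, i + MAX_WORD_LEN) + 1):
            if j == i + 1 or text[i:j] in LEXICON:
                edges[i].append(j)
    return edges


def has_overlap_ambiguity(edges, n):
    """True if two multi-character candidates overlap without one containing the other."""
    spans = [(i, j) for i in range(n) for j in edges[i] if j - i > 1]
    return any(a < c < b < d for (a, b) in spans for (c, d) in spans)


def all_paths(edges, n, start=0):
    """Enumerate every segmentation as a list of (start, end) spans."""
    if start == n:
        yield []
        return
    for end in edges[start]:
        for rest in all_paths(edges, n, end):
            yield [(start, end)] + rest


def best_segmentation(text):
    """Pick the path with the highest sum of log unigram probabilities."""
    edges = build_word_graph(text)

    def score(path):
        # Words missing from the lexicon get a count of 1 (crude smoothing).
        return sum(math.log(LEXICON.get(text[i:j], 1) / TOTAL) for i, j in path)

    best = max(all_paths(edges, len(text)), key=score)
    return [text[i:j] for i, j in best]


if __name__ == "__main__":
    sentence = "结合成分子"
    graph = build_word_graph(sentence)
    print(has_overlap_ambiguity(graph, len(sentence)))
    print(best_segmentation(sentence))
```

Enumerating every path is exponential in sentence length; it keeps the sketch short, but a practical system would use dynamic programming over the graph and would apply rules to the cases the statistics get wrong.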
This article is indexed in CNKI, VIP (维普), Wanfang Data, and other databases.
