首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的MM中文分词算法
引用本文:石正喜,张捍东,赵黎明,陈玉燕.一种改进的MM中文分词算法[J].计算机与网络,2009,35(2):48-50.
作者姓名:石正喜  张捍东  赵黎明  陈玉燕
作者单位:[1]宁波城市职业技术学院信息学院,浙江宁波315100 [2]安徽工业大学电气信息学院,安徽马鞍山243000
摘    要:对汉语的特点和分词概念作了简单介绍,详细说明了常用的分词算法,在此基础上,提出了一种改进的Ⅲ中文分词算法。该算法兼顾了最大正向匹配法(MM)和逆向最大匹配法(RMM)的优点,克服他们的不足,使得切分准确率和分词效率均有明显的提高,是一种比较实用的分词算法。实验也进一步证明,该算法能有效地提高切分准确率和分词效率。

关 键 词:自然语言处理  中文分词  改进的最大匹配法

An Improved Maximum Matching Method for Chinese Word Segmentation
Affiliation:SHI Zheng-xi,ZHANG Han-dong,ZHAO Li-ming, CHEN Yu-yan(1 Information College, Ningho City College of Vocational Technology, Ningbo Zhejiang 315000, China;2. School of Electrical Engineering and Information, AnHui University of Technology,MaAnShan Anhui 243000, China)
Abstract:It introduces briefly the conception of word segmentation and characteristic of chinese, explains detailedly the method of ordinary word segmentation and puts forward an improved Maximum Matching Method (MM) for chinese word segmentation. This method is an applied method for chinese word segmentation, and it has the advantage Maximum Matching Method (MM) and Reverse Maximum Matcing Method (RMM) and overcomes their shortcomings. So it obtains obvious improvement for the exact probability and efficiency of Chinese word segmentation, it is proved through practices that this method can improve efficiently the exact probability and efficiency of Chinese word segmentation.
Keywords:natural language processing  Chinese word segmentation  improved maximum matching method
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号