首页 | 本学科首页   官方微博 | 高级检索  
     

基于词典的中文分词算法研究
引用本文:周程远,朱敏,杨云. 基于词典的中文分词算法研究[J]. 计算机与数字工程, 2009, 37(3): 68-71
作者姓名:周程远  朱敏  杨云
作者单位:华东师范大学计算中心,上海,200062;华东师范大学计算中心,上海,200062;华东师范大学计算中心,上海,200062
摘    要:中文分词是计算机自动处理文本的基础。通过比较常用的机械分词算法的优缺点,提出了分层逐字二分算法,综合了TRIE树和逐字二分分词的特点,以求通过较小的开销来实现较快的匹配速度。实验结果表明,该算法在综合性能上有显著提高。

关 键 词:中文分词  计算机应用  中文信息处理

Research on Chinese Word Segmentation Algorithm Based on the Dictionary
Zhou Chengyuan,Zhu Min,Yang Yun. Research on Chinese Word Segmentation Algorithm Based on the Dictionary[J]. Computer and Digital Engineering, 2009, 37(3): 68-71
Authors:Zhou Chengyuan  Zhu Min  Yang Yun
Affiliation:Dept.of Computer Center;East China Normal University;Shanghai 200062
Abstract:Chinese word segmentation is the base for Chinese information processing.By comparison commonly the advantages and disadvantages of the machinery word segmentation algorithm,then a lied verbatim binary algorithm has been presented,which integrated TRIE trees and verbatim binary search's characteristics,try to take the smaller overhead to achieve faster match speed.The results show that the algorithm in the comprehensive performance has made significant increase.
Keywords:Chinese word segmentation  computer application  Chinese information processing  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号