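Research on Chinese word segmentation using directed graphs
Cite this article: 张培颖 (ZHANG Pei-ying). 运用有向图进行中文分词研究 [J]. 计算机工程与应用 (Computer Engineering and Applications), 2009, 45(22): 123-125. DOI: 10.3778/j.issn.1002-8331.2009.22.040
Author: 张培颖 (ZHANG Pei-ying)
Affiliation: College of Computer and Communication Engineering, China University of Petroleum (East China), Dongying, Shandong 257061
Funding: Young Teachers' Innovation Fund of the College of Computer and Communication Engineering, China University of Petroleum (East China)
Abstract: The paper first explains the role of word segmentation in Chinese information processing, then introduces the key technologies used in segmentation systems. It proposes a Chinese word segmentation algorithm based on a directed graph. The algorithm first constructs a Chinese segmentation directed graph, then computes all possible segmentation paths in that graph, and finally scores each path using the least-segmentation principle (fewest words), the mutual information between Chinese characters, and word frequencies. The highest-scoring path corresponds to the correct segmentation. Open-test results show that segmentation precision can exceed 90%.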
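
The three steps named in the abstract (build the graph, enumerate paths, score paths) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the abstract does not give the scoring formula, so the linear combination in `score`, its weights, the `max_len` limit, and the toy frequency and mutual-information tables are all assumptions made here for the example.

```python
import math

def build_graph(sentence, lexicon, max_len=4):
    """Edge i -> j means sentence[i:j] is a candidate word."""
    n = len(sentence)
    edges = {i: [] for i in range(n)}
    for i in range(n):
        for j in range(i + 1, min(n, i + max_len) + 1):
            # Single characters are always allowed, so a path always exists.
            if j == i + 1 or sentence[i:j] in lexicon:
                edges[i].append(j)
    return edges

def all_paths(edges, n, start=0):
    """Yield every path from `start` to node n as a list of cut positions."""
    if start == n:
        yield []
        return
    for j in edges[start]:
        for rest in all_paths(edges, n, j):
            yield [j] + rest

def path_words(sentence, path):
    """Turn a list of cut positions into the list of words it delimits."""
    words, i = [], 0
    for j in path:
        words.append(sentence[i:j])
        i = j
    return words

def score(sentence, path, freq, mi, w_count=1.0, w_freq=1.0, w_mi=1.0):
    """Assumed scoring: fewer words, frequent words, and cuts that do not
    split strongly associated character pairs all raise the score."""
    words = path_words(sentence, path)
    total = sum(freq.values()) or 1
    # Unknown words get a count of 1 (an assumed smoothing choice).
    freq_term = sum(math.log(freq.get(w, 1) / total) for w in words)
    # Penalise each internal cut by the mutual information of the pair it splits.
    mi_penalty = sum(mi.get(sentence[j - 1:j + 1], 0.0) for j in path[:-1])
    return -w_count * len(words) + w_freq * freq_term - w_mi * mi_penalty

def segment(sentence, freq, mi):
    """Return the words on the highest-scoring path."""
    edges = build_graph(sentence, set(freq))
    best = max(all_paths(edges, len(sentence)),
               key=lambda p: score(sentence, p, freq, mi))
    return path_words(sentence, best)

# Toy data, invented for illustration only.
FREQ = {"研究": 50, "研究生": 10, "生命": 40, "命": 5, "起源": 20}
MI = {"研究": 4.0, "生命": 3.5, "起源": 3.0, "究生": 0.5}
print(segment("研究生命起源", FREQ, MI))
```

Enumerating every path is exponential in the worst case, which is acceptable for short sentences; a dynamic-programming search over the same graph would be the usual substitute when only the best path is needed.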

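Keywords: Chinese word segmentation; directed graph; Chinese segmentation directed graph; segmentation path; mutual information
Received: 2008-04-30
Revised: 2008-07-23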

Method of Chinese word segmentation based on directed graph
ZHANG Pei-ying. Method of Chinese word segmentation based on directed graph[J]. Computer Engineering and Applications, 2009, 45(22): 123-125. DOI: 10.3778/j.issn.1002-8331.2009.22.040
Authors: ZHANG Pei-ying
Affiliation: College of Computer & Communication Engineering, China University of Petroleum (East China), Dongying, Shandong 257061, China
Abstract: Chinese word segmentation is the first step in any Chinese information processing task and remains a serious bottleneck for its development. This paper introduces the critical technologies in segmentation systems. It proposes a refinement of the segmentation algorithm based on the directed graph: the algorithm first constructs the Chinese segmentation directed graph, then calculates the weight of every segmentation path, and finally evaluates every segmentation path based on the principle of least segmentation, the mutual info of char...
Keywords: Chinese segmentation; directed graph; Chinese segmentation directed graph; segmentation path; mutual information
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.