一种基于多元信息库的自适应汉语歧义切分方法 Self-Adaptive Chinese Ambiguous Word Segmentation Method Based on Multi-Gram Library期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种基于多元信息库的自适应汉语歧义切分方法

引用本文：	朱巧明,温滔,李培蜂,钱培德.一种基于多元信息库的自适应汉语歧义切分方法[J].小型微型计算机系统,2006,27(8):1597-1600.

作者姓名：	朱巧明温滔李培蜂钱培德

作者单位：	苏州大学,计算机科学与技术学院,江苏,苏州,215006

基金项目：	江苏省高技术研究发展计划项目;江苏省自然科学基金;江苏省教育厅自然科学基金

摘要：	在分析目前分词方法的基础上提出了一种通过建立多元信息库、采用改进型的粗分算法以拔出所有可能存在歧义的句子、借助于人工干预建立错误切分歧异词库等，实现汉语歧异切分的方法，通过修改、插入多元信息库中的信息量，进一步设计了一个具有自适应能力的歧义切分方法，并通过实验证明该方法能够有效改进汉语分词中错误歧义切分的结果．
关键词：	多元信息库歧义切分自适应
文章编号：	1000-1220（2006）08-1597-04
收稿时间：	05 23 2005 12:00AM
修稿时间：	2005-05-23
Self-Adaptive Chinese Ambiguous Word Segmentation Method Based on Multi-Gram Library

ZHU Qiao-ming,WEN Tao,LI Pei-feng,QIAN Pei-de.Self-Adaptive Chinese Ambiguous Word Segmentation Method Based on Multi-Gram Library[J].Mini-micro Systems,2006,27(8):1597-1600.

Authors:	ZHU Qiao-ming WEN Tao LI Pei-feng QIAN Pei-de

Affiliation:	School of Computer Science and Technology, Soochow University, Suzhou 215006, China

Abstract:	On the basis of the analysis of the existing algorithms of Chinese word segmentation, the article puts forward to realize Chinese word ambiguous segmentation by establishing mulit-gram library and improving the rough segmentation algorithm in order to find all sentences which have ambiguous word segmentation and establishing false segmentation ambiguous word library in virtual of manual interference. Moreover, the article designs a self-adaptive Chinese ambiguous word segmentation method based on modifying and inserting the values in the multi-gram library. It proves that the new method is able to improve effects in false ambiguous word segmentation.

Keywords:	multi-gram library ambiguous segmentation self-adaptive
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏