基于最大熵模型和规则的中文姓名识别 Identification of Chinese names based on maximum entropy model and rules期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于最大熵模型和规则的中文姓名识别

引用本文：	贾宁,张全.基于最大熵模型和规则的中文姓名识别[J].计算机工程与应用,2007,43(35):1-4.

作者姓名：	贾宁张全

作者单位：	[1]中国科学院研究生院,北京100039 [2]中国科学院声学研究所,北京100080

基金项目：	国家重点基础研究发展计划(973计划) , 中国科学院知识创新工程项目

摘要：	中文姓名识别是中文信息处理的一项重要技术,识别的召回率对其它需要以姓名识别为基础的中文信息处理技术有至关重要的影响。提出了一种统计模型和处理规则相结合的中文姓名识别方法:首先以最大熵模型识别潜在姓氏,而后再通过判定规则作进一步处理。真实语料的开放测试表明,该方法在召回率方面有明显的优势,可以达到94%以上的召回率,同时能保证较高的准确率。
关键词：	中文姓名识别最大熵规则
文章编号：	1002-8331(2007)35-0001-04
修稿时间：	2007年8月1日
Identification of Chinese names based on maximum entropy model and rules

JIA Ning,ZHANG Quan.Identification of Chinese names based on maximum entropy model and rules[J].Computer Engineering and Applications,2007,43(35):1-4.

Authors:	JIA Ning ZHANG Quan

Affiliation:	1.Graduate School of Chinese Academy of Sciences，Beijing 100039，China 2.Institute of Acoustics，Chinese Academy of Sciences，Beijing 100080，China

Abstract:	Identification of Chinese names is one of the important fields for the Chinese language automatic processing.The recall rate of identification will affect other processing deeply.But most methods can’t get a good recall rate which is up to 90%.This paper presents a method based on maximum entropy model and rules.The open test on real corpus shows that the recall rate of the system reaches 94%，with a precision more than 84%.The method is practicable，and benefits from its recall rate.

Keywords:	Chinese name recognition maximum entropy rule
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《计算机工程与应用》浏览原始摘要信息
	点击此处可从《计算机工程与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏