Automatic Identification of Chinese Base Phrases (汉语基本短语的自动识别)

Citation:	张昱琪, 周强. 汉语基本短语的自动识别 [J]. 中文信息学报, 2002, 16(6): 2-9.

Authors:	张昱琪 (ZHANG Yu-qi), 周强 (ZHOU Qiang)

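Affiliation:	State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University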

Funding:	National Natural Science Foundation of China (69903007); National 973 Program (G1998030507); National 863 Program (2001AA114040)

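Abstract:	This paper applies memory-based learning (MBL), an instance-based learning method, to identify the boundaries and categories of nine common types of Chinese base phrases. It then uses the internal structure of the phrases, together with lexical information, to resolve the boundary ambiguities and phrase-type ambiguities that arise in prediction. The experiments also compare the results obtained with and without lexical information in the feature vector. The results are fairly satisfactory: for these nine base phrase types, precision reaches 95.2% and recall reaches 93.7%.
Keywords:	partial parsing; base phrase; instance-based learning; phrase structure; lexical disambiguation
Revised:	May 8, 2002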

ZHANG Yu-qi, ZHOU Qiang. Automatic Identification of Chinese Base Phrases [J]. Journal of Chinese Information Processing, 2002, 16(6): 2-9.

Authors:	ZHANG Yu-qi, ZHOU Qiang

Affiliation:	State Key Laboratory of Intelligent Technology and Systems, Dept. of Computer Science and Technology, Tsinghua University

Abstract:	This paper proposes a hybrid model for identifying Chinese base phrases. In the first step, we use a memory-based learning (MBL) approach to chunk nine types of Chinese base phrases and compare the results obtained with different feature vectors. In a second series of experiments, we use grammar rules that represent the inner structures of base phrases, together with lexical information, to correct the wrong predictions from the first step. The experiments reported in this paper show competitive results: precision is 95.2% and recall is 93.7%.

Keywords:	partial parsing; base phrase; memory-based learning; phrase structure; lexical-based disambiguation
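
To make the first step of the abstract concrete, here is a minimal sketch of memory-based (nearest-neighbour) chunk tagging. It is not the authors' implementation: the B-/I- tag encoding, the one-token context window, the overlap similarity, the toy sentences, and every function name are assumptions chosen for illustration. The `use_words` flag loosely mirrors the abstract's comparison of feature vectors with and without lexical information.

```python
from collections import Counter

# Hypothetical toy training data: (word, POS tag, chunk tag) for each token.
# The tag names (B-np, I-np, B-vp) are illustrative, not the paper's label set.
TRAIN = [
    [("我", "r", "B-np"), ("喜欢", "v", "B-vp"), ("红", "a", "B-np"), ("苹果", "n", "I-np")],
    [("他", "r", "B-np"), ("买", "v", "B-vp"), ("新", "a", "B-np"), ("书", "n", "I-np")],
]

PAD = ("<pad>", "<pad>")  # placeholder for positions outside the sentence


def features(tokens, i, use_words=True):
    """Feature vector for token i: POS (and optionally word) in a +/-1 window."""
    feats = []
    for j in (i - 1, i, i + 1):
        word, pos = tokens[j] if 0 <= j < len(tokens) else PAD
        feats.append(pos)
        if use_words:
            feats.append(word)
    return tuple(feats)


def build_memory(train, use_words=True):
    """Store every training instance verbatim, as memory-based learning does."""
    memory = []
    for sent in train:
        tokens = [(w, p) for w, p, _ in sent]
        for i, (_, _, tag) in enumerate(sent):
            memory.append((features(tokens, i, use_words), tag))
    return memory


def overlap(a, b):
    """Number of feature positions on which two instances agree."""
    return sum(x == y for x, y in zip(a, b))


def classify(memory, feats, k=1):
    """Majority vote among the k stored instances most similar to feats."""
    nearest = sorted(memory, key=lambda m: overlap(m[0], feats), reverse=True)[:k]
    return Counter(tag for _, tag in nearest).most_common(1)[0][0]


def tag_sentence(memory, tokens, use_words=True):
    """Assign a chunk tag to every (word, POS) token of an unseen sentence."""
    return [classify(memory, features(tokens, i, use_words))
            for i in range(len(tokens))]


if __name__ == "__main__":
    unseen = [("她", "r"), ("读", "v"), ("旧", "a"), ("报纸", "n")]
    for use_words in (True, False):
        memory = build_memory(TRAIN, use_words)
        print(use_words, tag_sentence(memory, unseen, use_words))
```

The sketch covers only the instance-based prediction step. The rule-based correction of boundary and phrase-type errors that the abstract describes as a second step is not modelled here.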
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.