基于SVM的组块识别及其错误驱动学习方法 Chunk Parsing Based on SVM and Error-Driven Learning Methods期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于SVM的组块识别及其错误驱动学习方法

引用本文：	黄德根,王莹莹.基于SVM的组块识别及其错误驱动学习方法[J].中文信息学报,2006,20(6):19-26.

作者姓名：	黄德根王莹莹

作者单位：	大连理工大学计算机科学与工程系

基金项目：	国家自然科学基金资助项目(60373095,60373096)

摘要：	给出了一种错误驱动学习机制与SVM相结合的汉语组块识别方法。该方法在SVM组块识别的基础上,对SVM识别结果中的错误词语序列的词性、组块标注信息等进行分析,获得候选校正规则集;之后按照阈值条件对候选集进行筛选,得到最终的校正规则集;最后应用该规则集对SVM的组块识别结果进行校正。实验结果表明,与单独采用SVM模型的组块识别相比,加入错误驱动学习方法后,组块识别的精确率、召回率和F值均得到了提高。
关键词：	计算机应用中文信息处理组块分析错误驱动学习支持向量机(SVM) 规则集
文章编号：	1003-0077（2006）06-0017-08
收稿时间：	2005-09-26
修稿时间：	2005年9月26日
Chunk Parsing Based on SVM and Error-Driven Learning Methods

HUANG De-gen,WANG Ying-ying.Chunk Parsing Based on SVM and Error-Driven Learning Methods[J].Journal of Chinese Information Processing,2006,20(6):19-26.

Authors:	HUANG De-gen WANG Ying-ying

Affiliation:	Department of Computer Science and Engineering , Dalian University of Technology

Abstract:	Chunk parsing of Chinese texts can decrease the difficulty of syntactic parsing.This paper proposes a chunking approach that combines support vector machine with error-driven learning.First,the SVM model is used to chunk the training data.Then by error-driven learning,we automatically acquire the tuning rules from the chunking results of SVM.After filtration the rules are used to revise the chunk parsing results of SVM.The experimental results show that this approach is effective in Chinese chunk parsing.Compared with the pure SVM-based chunking,the performance is improved.

Keywords:	computer application Chinese information processing chunk parsing error-driven learning support vector machine(SVM) rule set
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《中文信息学报》浏览原始摘要信息
	点击此处可从《中文信息学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏