首页 | 本学科首页   官方微博 | 高级检索  
     

基于SVM的组块识别及其错误驱动学习方法
引用本文:黄德根,王莹莹.基于SVM的组块识别及其错误驱动学习方法[J].中文信息学报,2006,20(6):19-26.
作者姓名:黄德根  王莹莹
作者单位:大连理工大学计算机科学与工程系
基金项目:国家自然科学基金资助项目(60373095,60373096)
摘    要:给出了一种错误驱动学习机制与SVM相结合的汉语组块识别方法。该方法在SVM组块识别的基础上,对SVM识别结果中的错误词语序列的词性、组块标注信息等进行分析,获得候选校正规则集;之后按照阈值条件对候选集进行筛选,得到最终的校正规则集;最后应用该规则集对SVM的组块识别结果进行校正。实验结果表明,与单独采用SVM模型的组块识别相比,加入错误驱动学习方法后,组块识别的精确率、召回率和F值均得到了提高。

关 键 词:计算机应用  中文信息处理  组块分析  错误驱动学习  支持向量机(SVM)  规则集  
文章编号:1003-0077(2006)06-0017-08
收稿时间:2005-09-26
修稿时间:2005年9月26日

Chunk Parsing Based on SVM and Error-Driven Learning Methods
HUANG De-gen,WANG Ying-ying.Chunk Parsing Based on SVM and Error-Driven Learning Methods[J].Journal of Chinese Information Processing,2006,20(6):19-26.
Authors:HUANG De-gen  WANG Ying-ying
Affiliation:Department of Computer Science and Engineering , Dalian University of Technology
Abstract:Chunk parsing of Chinese texts can decrease the difficulty of syntactic parsing.This paper proposes a chunking approach that combines support vector machine with error-driven learning.First,the SVM model is used to chunk the training data.Then by error-driven learning,we automatically acquire the tuning rules from the chunking results of SVM.After filtration the rules are used to revise the chunk parsing results of SVM.The experimental results show that this approach is effective in Chinese chunk parsing.Compared with the pure SVM-based chunking,the performance is improved.
Keywords:computer application  Chinese information processing  chunk parsing  error-driven learning  support vector machine(SVM)  rule set
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号