首页 | 本学科首页   官方微博 | 高级检索  
     

一种结合SVM学习的产生式依存分析方法
引用本文:罗强,奚建清.一种结合SVM学习的产生式依存分析方法[J].中文信息学报,2007,21(4):21-26.
作者姓名:罗强  奚建清
作者单位:华南理工大学 计算机学院,广东 广州 510641
基金项目:国家科技攻关项目;广东省自然科学基金
摘    要:本文提出了一种结合SVM学习和产生式模型的依存分析方法。该方法用产生式模型的分析错误对SVM分类器进行训练。为进一步提高分析精度,采用扩大寻优范围的动态规划算法对产生式模型的分析结果进行错误估计,同时引入范围参数,使得寻优范围可以根据实际情况进行调整。本方法在不牺牲分类性能的前提下,有效减少了训练SVM分类器所依赖的支撑向量数。在对哈工大中文树库语料上的对比测试结果表明,该方法的依存分析精度达到86.4%,具有很强的依存分析能力。

关 键 词:计算机应用  中文信息处理  中文依存分析  产生式概率模型    SVM学习  SMO  动态规划算法  
文章编号:1003-0077(2007)04-0021-06
收稿时间:2006-07-07
修稿时间:2006-07-072007-04-03

An SVM-Based Generative Statistical Algorithm for Chinese Dependency Analysis
LUO Qiang,XI Jian-qing.An SVM-Based Generative Statistical Algorithm for Chinese Dependency Analysis[J].Journal of Chinese Information Processing,2007,21(4):21-26.
Authors:LUO Qiang  XI Jian-qing
Affiliation:South China University of Technology, Guangzhou, Guangdong 510641, China
Abstract:In this paper, we propose a SVM-combined generative statistical model for Chinese dependency analysis that trains SVM classifier using erroneous results generated by generative statistical model. To further improve the precision of dependency analysis, two measures were taken, first, dynamic programming algorithm that extends the range of finding the best local solution was used to estimate the error rate of generative model; second, a ranging factor was introduced to make the solutions adaptive on the practical situation. All those efforts make it possible for the new method to largely decrease the number of negative support vectors without sacrificing classification ability in training. Comparative experiments on Hit Chinese Treebank corpus show that the new method shows better performance than current Chinese dependency methods, with precision reaching to 86.4%.
Keywords:SMO
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号