首页 | 本学科首页   官方微博 | 高级检索  
     

噪声消除与SMO算法收敛性
引用本文:何建兵,何清,史忠植.噪声消除与SMO算法收敛性[J].计算机工程与应用,2006,42(24):160-163.
作者姓名:何建兵  何清  史忠植
作者单位:1. 中国科学院研究生院软件学院,北京,100049
2. 中科院计算技术研究所智能信息处理重点实验室,北京,100080
基金项目:国家自然科学基金;国家科技攻关项目;中国-澳大利亚合作项目;北京市自然科学基金
摘    要:近年来,随着序列最小优化分类算法SMO等一系列快速算法的推出,支持向量机在自动文本分类研究领域取得了很大的成功。大多数文本分类问题是线性可分的,使用线性核函数的SMO算法能够取得非常好的分类效果。但是文本向量是一种非常稀疏的向量,采用线性核函数的SMO算法对噪声样本非常敏感,容易产生发散的问题。文章分析证明了噪声如何影响SMO算法收敛性。为了解决训练样本中噪声样本影响SMO算法收敛的问题,设计了一个消除噪声样本的算法,取得了非常好的效果。

关 键 词:文本分类  支持向量机  SMO  算法  噪声样本
文章编号:1002-8331-(2006)24-0160-04
收稿时间:2006-03
修稿时间:2006-03

Eliminating Noisy and SMO Algorithm Convergence
He Jianbing,He Qing,Shi Zhongzhi.Eliminating Noisy and SMO Algorithm Convergence[J].Computer Engineering and Applications,2006,42(24):160-163.
Authors:He Jianbing  He Qing  Shi Zhongzhi
Affiliation:1.College of Software Engineering,Graduate School of the Chinese Academy of Sciences, Beijing 100049; 2.The Key Laboratory of Intelligent Information Processing,Institute of Computing Technology, Chinese Academy of Sciences,Beijing 100080
Abstract:In recent years,accompany with the appearance of a series of rapid training algorithm as Sequential Minimal Optimization(SMO),support vector machines achieved great success in text categorization.Most text categorization problems are linearly separable,and SMO algorithm using linear kernel-induced can perform well for text categorization.However,text vectors are a kind of extremely sparse vector,and SMO algorithm with linear kernel or polynomial kernel is very sensitive to the extremely sparse noisy example,which is easy to bring on the problem that algorithm can not converge.It is been proved that the noisy example how to influence the convergence of SMO algorithm in the paper.To solve the problem that noisy sample in training samples affect the convergence of SMO algorithm,this paper designs the algorithm that can eliminate noisy samples,and good results is achieved.
Keywords:text categorlzation  SVM  SMO algorithm  noisy sample
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号