首页 | 本学科首页   官方微博 | 高级检索  
     

基于主成分分析的SMO文本分类
引用本文:黎超,吴义国,魏星.基于主成分分析的SMO文本分类[J].现代计算机,2011(10):18-21.
作者姓名:黎超  吴义国  魏星
作者单位:广东工业大学计算机学院,广州,510006
摘    要:利用SMO进行文本分类的核心问题是特征的选择问题,特征选择涉及到哪些特征和选择的特征维度问题。针对以上问题,介绍一种基于主成分分析和信息增益相结合的数据集样本降维的方法,并在此基础上对序贯最小优化算法进行改进,提出降维序贯最小优化(P-SOM)算法。P-SMO算法去掉了冗余维。实验结果证明,该方法提高SMO算法的性能,缩短支持向量机的训练时间,提高支持向量机的分类精度。

关 键 词:机器学习  支持向量机  序贯最小化  主成分分析  降维

SMO Classification Text Based on PCA
Li Chao,WU Yi-guo,WEI Xing.SMO Classification Text Based on PCA[J].Modem Computer,2011(10):18-21.
Authors:Li Chao  WU Yi-guo  WEI Xing
Affiliation:(Department of Computer,Guangdong University of Technology,Guangzhou 510006)
Abstract:Using SMO for text classification is a core problem of selection,and the characteristics selection involves which characteristic to choose and dimension problem of the characteristics which chosen.According to these problems,proposes a data sets of sample dimension reduction method based on the combination of principal component analysis and information gain,and on this basis,improves the SMO algorithm and proposes the P-SOM which can remove the redundant.The experimental result shows that it can improve the performance of SMO and shorten the training time of SVM,improve the classification accuracy of SVM.
Keywords:Machine Learning  Support Vector Machine(SVM)  Sequential Minmal Optimization  PCA  Dimension Reduction
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号