首页 | 本学科首页   官方微博 | 高级检索  
     

改进粒子群与支持向量机混合的特征变换
引用本文:熊文,王枞.改进粒子群与支持向量机混合的特征变换[J].北京邮电大学学报,2009,32(6):24-27.
作者姓名:熊文  王枞
作者单位:北京邮电大学,计算机学院,北京,100876
基金项目:高等学校博士学科点专项科研基金项目(20060013007); 国家科技支撑计划项目(2007BAH05B02-04); 北京市自然科学基金项目(4092029)
摘    要:研究了数据挖掘中通过特征变换的数据预处理来提高支持向量机(SVM)分类精度的方法,提出了改进粒子群优化(PSO)和SVM混合的方法. 用推广t统计、Fisher判别式和随机森林的线性加权度量来排序特征,得到预选特征子集,再用启发式信息加速改进PSO搜索特征的线性变换因子,并用二进制PSO对特征变换子集进行特征选择,在后处理中通过格子搜索获取了高精度SVM分类器. 在NIPS 2003的madelon及10个UCI数据集上的实验表明,与有C-SVM分类精度相比,新方法在4个数据集上的精度更高.

关 键 词:粒子群  特征变换  支持向量机  特征选择  分类
收稿时间:2009-3-24
修稿时间:2009-8-31

Hybrid Feature Transformation Based on Modified Particle Swarm Optimization and Support Vector Machine
XIONG Wen,WANG Cong.Hybrid Feature Transformation Based on Modified Particle Swarm Optimization and Support Vector Machine[J].Journal of Beijing University of Posts and Telecommunications,2009,32(6):24-27.
Authors:XIONG Wen  WANG Cong
Affiliation:(School of Computer Science and Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China)
Abstract:This paper researches on a preprocessing method of feature transformation in data mining to improve the classification accuracy of Support Vector Machine (SVM), and proposes an algorithm based on hybrid modified Particle Swarm Optimization (PSO) and SVM for transforming feature set in order to obtain higher accuracy classifier. First, the method preselects top-ranked features as feature subset using linear weighted combination of extended t-statistic, Fisher Discriminant Ratio (FDR) and Random Forests (RF) feature importance scores; secondly, uses modified PSO to search feature linear transformation factor and a novel heuristic info to accelerate it; then, selects feature subset on transformed dataset using Binary Particle Swarm Optimization (BPSO) algorithm to refine the transformed feature subset; last, uses Grid searching method to attain high accuracy SVM classifier. Experiments show the method proposed provides higher accuracy than the original SVM on the UC Irvine Machine Learning Repository (UCI) and NIPS2003 datasets.
Keywords:Particle Swarm Optimization (PSO)  feature transformation  Support Vector Machine (SVM)  heuristic info  feature selection  classification  Random Forests (RF)
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京邮电大学学报》浏览原始摘要信息
点击此处可从《北京邮电大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号