
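Support vector machine via consensus centre adjustment on random features
Cite this article: LIAO Shizhong, LU Wei. Support vector machine via consensus centre adjustment on random features[J]. Computer Engineering and Applications (计算机工程与应用), 2014, 0(17): 44-48,55
Authors: LIAO Shizhong, LU Wei
Affiliation: School of Computer Science and Technology, Tianjin University
Funding: National Natural Science Foundation of China (No. 61170019); Natural Science Foundation of Tianjin (No. 11JCYBJC00700).
Abstract: Support vector machines (SVMs) are among the most popular classification tools, but on large-scale datasets they require large amounts of memory and training time, so they can usually be run only in large parallel cluster environments. This paper proposes a new parallel SVM algorithm, RF-CCASVM, that can solve large-scale SVMs with limited computing resources. Using the random Fourier map, a low-dimensional explicit feature map uniformly approximates the infinite-dimensional implicit feature map of the Gaussian kernel, so that a linear SVM uniformly approximates the Gaussian-kernel SVM. A parallelization method based on consensus centre adjustment is then proposed. Specifically, the dataset is partitioned into several subsets, and multiple processes train SVMs independently and in parallel on their own subsets. When the optimal hyperplanes on the subsets are about to be found, the current solutions are replaced by the consensus centre solution obtained from the subsets, and training continues on each subset until the consensus centre solution is optimal on every subset. Comparative experiments on benchmark datasets verify the correctness and effectiveness of RF-CCASVM.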

Keywords: parallel support vector machine; large-scale datasets; limited resources; random Fourier features; consensus centre adjustment
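
As a rough illustration of the random Fourier mapping mentioned in the abstract, the Python sketch below builds a low-dimensional explicit feature map whose inner products approximate a Gaussian kernel. It is not the authors' code. The names `make_rff_map`, `gaussian_kernel`, and `dot`, the parametrisation k(x, y) = exp(-||x - y||^2 / (2 sigma^2)), and all numeric settings are assumptions chosen for illustration.

```python
import math
import random

def make_rff_map(input_dim, num_features, sigma, seed=0):
    """Return z(x) with z(x).z(y) approximating exp(-||x-y||^2 / (2 sigma^2)).

    Hypothetical helper: the abstract does not specify this interface.
    """
    rng = random.Random(seed)
    # Frequencies sampled from the Gaussian kernel's spectral density,
    # a zero-mean normal with standard deviation 1 / sigma per coordinate.
    weights = [[rng.gauss(0.0, 1.0 / sigma) for _ in range(input_dim)]
               for _ in range(num_features)]
    # Random phases sampled uniformly from [0, 2*pi].
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(num_features)]
    scale = math.sqrt(2.0 / num_features)

    def feature_map(x):
        return [scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
                for w, b in zip(weights, phases)]

    return feature_map

def gaussian_kernel(x, y, sigma):
    """Exact Gaussian kernel value, used only as a reference."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

# Compare the explicit-feature inner product with the exact kernel value.
z = make_rff_map(input_dim=3, num_features=4000, sigma=1.0, seed=0)
x, y = [0.2, -0.1, 0.4], [0.0, 0.3, 0.1]
approx = dot(z(x), z(y))
exact = gaussian_kernel(x, y, sigma=1.0)
print(f"approx={approx:.3f} exact={exact:.3f}")
```

Once the data are mapped through `z`, any linear SVM solver can be trained on the mapped features in place of a kernel SVM; increasing `num_features` tightens the approximation at the cost of more memory per sample.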

Support vector machine via consensus centre adjustment on random features
LIAO Shizhong,LU Wei. Support vector machine via consensus centre adjustment on random features[J]. Computer Engineering and Applications, 2014, 0(17): 44-48,55
Authors: LIAO Shizhong, LU Wei
Affiliation: School of Computer Science and Technology, Tianjin University, Tianjin 300072, China
Abstract: Support Vector Machines (SVMs) have become popular classification tools, but on very large datasets they require large amounts of memory and computation time. Large-scale SVMs are therefore usually run on computer clusters or supercomputers. A novel parallel algorithm for large-scale SVMs is presented. The algorithm runs in a resource-limited computing environment and guarantees uniform convergence. The infinite-dimensional implicit feature mapping of the Gaussian kernel function is approximated by a low-dimensional explicit feature mapping, so the kernel SVM is approximated by a linear SVM trained on data mapped to low-dimensional features by the random Fourier map. The algorithm is parallelized with a consensus centre adjustment strategy. Concretely, the dataset is partitioned into several subsets, and separate SVMs are trained on the subsets in parallel, one per processor. When the optimal hyperplanes on the subsets are nearly found, the solutions obtained by the separate SVMs are replaced by the consensus centre, and training continues on the subsets until the consensus centre is optimal on all of them. Comparative experiments on benchmark datasets show that the proposed resource-limited parallel algorithm is effective and efficient.
Keywords: parallel Support Vector Machines (SVM); large-scale datasets; limited resources; random Fourier features; consensus centre adjustment
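
The consensus centre adjustment strategy outlined in the abstract can be sketched roughly as follows. This is only a toy illustration under stated assumptions, not the RF-CCASVM procedure itself: the consensus centre is assumed to be the coordinate-wise mean of the subset solutions, each local solver is a plain subgradient method for the L2-regularised hinge loss, a fixed number of rounds replaces the paper's optimality check, and the subsets are processed in a sequential loop rather than in separate processes. All function names and hyperparameters are hypothetical.

```python
import random

def local_svm_steps(w, subset, lam, lr, epochs):
    """Run a few hinge-loss subgradient passes on one subset, starting from w."""
    w = list(w)
    for _ in range(epochs):
        for x, y in subset:
            margin = y * sum(wi * xi for wi, xi in zip(w, x))
            for j in range(len(w)):
                # Regulariser gradient, plus the hinge term when the margin is violated.
                grad = lam * w[j] - (y * x[j] if margin < 1.0 else 0.0)
                w[j] -= lr * grad
    return w

def consensus_centre(solutions):
    """Assumed consensus centre: coordinate-wise mean of the subset solutions."""
    k = len(solutions)
    return [sum(ws) / k for ws in zip(*solutions)]

def train_consensus_svm(data, num_subsets=4, rounds=20, lam=0.01, lr=0.05, epochs=2):
    # Partition the dataset into disjoint subsets.
    subsets = [data[i::num_subsets] for i in range(num_subsets)]
    w = [0.0] * len(data[0][0])
    for _ in range(rounds):
        # Each local call is independent, so it could run in its own process.
        local = [local_svm_steps(w, s, lam, lr, epochs) for s in subsets]
        # Replace every local solution by the shared centre and continue training.
        w = consensus_centre(local)
    return w

# Toy linearly separable data; the last coordinate is a constant bias feature.
rng = random.Random(0)
data = []
for _ in range(200):
    x1, x2 = rng.uniform(-1, 1), rng.uniform(-1, 1)
    label = 1 if x1 + x2 > 0 else -1
    data.append(([x1, x2, 1.0], label))

w = train_consensus_svm(data)
accuracy = sum(
    1 for x, y in data if y * sum(wi * xi for wi, xi in zip(w, x)) > 0
) / len(data)
print(f"training accuracy: {accuracy:.2f}")
```

In the setting the abstract describes, the inputs would first be mapped to random Fourier features, so that each local problem is a linear SVM that fits in the memory of a single process.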
This article is indexed in CNKI, VIP (维普), and other databases.