A Support Vector Machine Acceleration Algorithm Based on Boundary Sample Selection

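Cite this article:	HU Xiaosheng, ZHONG Yong. SVM accelerated training algorithm based on border sample selection[J]. Computer Engineering and Applications, 2017, 53(3): 169-173. DOI: 10.3778/j.issn.1002-8331.1507-0245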

Authors:	HU Xiaosheng, ZHONG Yong

Affiliation:	College of Electronic and Information Engineering, Foshan University, Foshan, Guangdong 528000, China

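Abstract:	To address the long training time and reduced generalization ability of the Support Vector Machine (SVM) on large-scale datasets, an accelerated SVM algorithm based on boundary sample selection is proposed. First, unsupervised K-means clustering is performed. Then, within each cluster, the K-nearest neighbor algorithm is applied, guided by the cluster's mixing degree and support, to remove non-boundary samples; the samples that remain in the class-boundary region are used to train the SVM model. Experiments on standard datasets show that the algorithm preserves the classification generalization ability of the traditional SVM while significantly reducing model training time.
Keywords:	support vector machine; large-scale classification; boundary samples; clustering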
SVM accelerated training algorithm based on border sample selection

HU Xiaosheng, ZHONG Yong. SVM accelerated training algorithm based on border sample selection[J]. Computer Engineering and Applications, 2017, 53(3): 169-173. DOI: 10.3778/j.issn.1002-8331.1507-0245

Authors:	HU Xiaosheng, ZHONG Yong

Affiliation:	College of Electronic and Information Engineering, Foshan University, Foshan, Guangdong 528000, China

Abstract:	Support Vector Machine (SVM) is a powerful tool for pattern classification, but it is poorly suited to large-scale data because of slow training, high computational cost, and degraded generalization. An accelerated SVM training algorithm is proposed that uses only the training samples lying close to the separation boundary. First, K-means clustering is performed on the initial training data. Then, within each cluster, boundary samples are selected with the K-nearest neighbor algorithm; two cluster factors, the degree of mixing and the support, are defined to determine the boundary width. These boundary samples are then used to train the SVM classifier. Experiments on benchmark datasets show that the proposed method reduces computational complexity while preserving the classification performance of the traditional SVM.

Keywords:	Support Vector Machine (SVM); large-scale classification; boundary samples; clustering
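
The selection step outlined in the abstract (cluster first, then keep only samples whose neighbors disagree on the label) can be illustrated with a minimal, standard-library sketch. It is not the authors' implementation. The abstract does not define the mixing degree or the support, so the sketch assumes mixing degree to mean the share of non-majority labels in a cluster and leaves support unmodeled. The names `kmeans`, `select_boundary_samples`, `min_mixing`, and the fallback that keeps every sample when no mixed cluster is found are all hypothetical choices made for illustration.

```python
import math
import random
from collections import Counter


def kmeans(points, n_clusters, n_iter=20, seed=0):
    """Plain Lloyd's K-means on a list of coordinate tuples.

    Returns one cluster index per point.
    """
    rng = random.Random(seed)
    centers = rng.sample(points, n_clusters)
    assign = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: nearest center by Euclidean distance.
        assign = [
            min(range(n_clusters), key=lambda c: math.dist(p, centers[c]))
            for p in points
        ]
        # Update step: move each non-empty cluster's center to its mean.
        for c in range(n_clusters):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return assign


def select_boundary_samples(points, labels, n_clusters=4, k=5, min_mixing=0.05):
    """Return sorted indices of samples kept as boundary candidates."""
    assign = kmeans(points, n_clusters)
    keep = []
    for c in range(n_clusters):
        idx = [i for i, a in enumerate(assign) if a == c]
        if len(idx) < 2:
            continue
        counts = Counter(labels[i] for i in idx)
        # ASSUMPTION: mixing degree = share of non-majority labels in the cluster.
        mixing = 1.0 - max(counts.values()) / len(idx)
        if mixing < min_mixing:
            continue  # near-pure cluster: treated as interior and dropped
        for i in idx:
            # k nearest neighbors of sample i, searched inside its own cluster.
            neighbors = sorted(
                (j for j in idx if j != i),
                key=lambda j: math.dist(points[i], points[j]),
            )[:k]
            # Keep sample i if at least one neighbor has a different label.
            if any(labels[j] != labels[i] for j in neighbors):
                keep.append(i)
    # ASSUMPTION: if no mixed cluster exists, fall back to the full training set.
    return sorted(keep) if keep else list(range(len(points)))
```

The returned indices would then be used to subset the training set before fitting an SVM with whatever trainer is at hand (for example scikit-learn's `SVC`), a step this sketch leaves out. The brute-force neighbor search is quadratic in cluster size, so it only illustrates the idea and makes no claim about the training-time savings reported in the paper.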
