基于分层聚类及重采样的大规模数据分类 Large-scale data classification based on hierarchical clustering and re-sampling期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于分层聚类及重采样的大规模数据分类

引用本文：	张永,浮盼盼,张玉婷.基于分层聚类及重采样的大规模数据分类[J].计算机应用,2013,33(10):2801-2803.

作者姓名：	张永浮盼盼张玉婷

作者单位：	辽宁师范大学计算机与信息技术学院, 辽宁大连 116081

基金项目：	国家自然科学基金资助项目，中国博士后科学基金资助项目，辽宁省教育厅基金资助项目

摘要：	针对大规模数据的分类问题,将监督学习与无监督学习结合起来,提出了一种基于分层聚类和重采样技术的支持向量机(SVM)分类方法。该方法首先利用无监督学习算法中的k-means聚类分析技术将数据集划分成不同的子集,然后对各个子集进行逐类聚类,分别选出各类中心邻域内的样本点,构成最终的训练集,最后利用支持向量机对所选择的最具代表样本点进行训练建模。实验表明,所提方法可以大幅度降低支持向量机的学习代价,其分类精度比随机欠采样更优,而且可以达到采用完整数据集训练所得的结果
关键词：	海量数据分类聚类重采样支持向量机
收稿时间：	2013-03-13
修稿时间：	2013-04-24
Large-scale data classification based on hierarchical clustering and re-sampling

ZHANG Yong , FU Panpan , ZHANG Yuting.Large-scale data classification based on hierarchical clustering and re-sampling[J].journal of Computer Applications,2013,33(10):2801-2803.

Authors:	ZHANG Yong FU Panpan ZHANG Yuting

Affiliation:	School of Computer and Information Technology, Liaoning Normal University, Dalian Liaoning 116081, China

Abstract:	Based on hierarchical clustering and re-sampling, this paper presented a Support Vector Machine (SVM) classification method for large-scale data, which combined supervised learning with unsupervised learning. The proposed method first used k-means cluster analytical technology to partition dataset into several subsets. Then, the method clustered class by class for each subset and selected samples in each clustering center neighborhood to form candidate training datasets. Last, the method applied SVM to train and model for candidate training datasets. The experimental results show that the proposed method can substantially reduce SVM learning cost. Meanwhile, the proposed method has better classification accuracy than random re-sampling method, and can attain about the same classification accuracy of the non-sampling method.

Keywords:	large-scale data classification clustering re-sampling Support Vector Machine (SVM)
本文献已被万方数据等数据库收录！
	点击此处可从《计算机应用》浏览原始摘要信息
	点击此处可从《计算机应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏