基于Hub的高维数据初始聚类中心的选择策略 Hub-Based Initialization for K-hubs期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于Hub的高维数据初始聚类中心的选择策略

引用本文：	张巧达,何振峰.基于Hub的高维数据初始聚类中心的选择策略[J].计算机系统应用,2015,24(4):171-175.

作者姓名：	张巧达何振峰

作者单位：	福州大学数学与计算机科学学院,福州,350108

摘要：	针对基于Hub的聚类算法K-hubs算法存在对初始聚类中心敏感的问题，提出一种基于Hub的初始中心选择策略。该策略充分利用高维数据普遍存在的Hubness现象，选择相距最远的K个Hub点作为初始的聚类中心。实验表明采用该策略的K-hubs算法与原来采用随机初始中心的K-hubs算法相比，前者拥有较好的初始中心分布，能够提高聚类准确率，而且初始中心所在的位置倾向于接近最终簇中心，有利于加快算法收敛。
关键词：	Hubness 初始中心最大最小距离方法高维数据聚类
收稿时间：	2014/7/31 0:00:00
修稿时间：	2014/9/28 0:00:00
Hub-Based Initialization for K-hubs

ZHANG Qiao-Da and HE Zhen-Feng.Hub-Based Initialization for K-hubs[J].Computer Systems& Applications,2015,24(4):171-175.

Authors:	ZHANG Qiao-Da and HE Zhen-Feng

Affiliation:	School of Mathematics and Computer Science, Fuzhou University, Fujian 350108, China;School of Mathematics and Computer Science, Fuzhou University, Fujian 350108, China

Abstract:	K-hubs is a Hub-based clustering algorithm that is very sensitive to initialization. Therefore, this paper proposes an initialization method based on Hub to solve this problem. The initialization method takes full use of the feature of the Hubness phenomenon by selecting initial centers that are the most remote Hub points with each other. The experimental results show that compared with the random initialization of ordinary K-hubs algorithm, the proposed initialization method can obtain a better distribution of initial centers, which could enhance the clustering accuracy; moreover, the selected initial centers can appear near the cluster centers, which could speed up the convergence of the clustering algorithm.

Keywords:	Hubness initial center maximin method high-dimensional data clustering
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《计算机系统应用》浏览原始摘要信息
	点击此处可从《计算机系统应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏