首页 | 本学科首页   官方微博 | 高级检索  
     

基于Hub的高维数据初始聚类中心的选择策略
引用本文:张巧达,何振峰.基于Hub的高维数据初始聚类中心的选择策略[J].计算机系统应用,2015,24(4):171-175.
作者姓名:张巧达  何振峰
作者单位:福州大学数学与计算机科学学院,福州,350108
摘    要:针对基于Hub的聚类算法K-hubs算法存在对初始聚类中心敏感的问题,提出一种基于Hub的初始中心选择策略。该策略充分利用高维数据普遍存在的Hubness现象,选择相距最远的K个Hub点作为初始的聚类中心。实验表明采用该策略的K-hubs算法与原来采用随机初始中心的K-hubs算法相比,前者拥有较好的初始中心分布,能够提高聚类准确率,而且初始中心所在的位置倾向于接近最终簇中心,有利于加快算法收敛。

关 键 词:Hubness  初始中心  最大最小距离方法  高维数据  聚类
收稿时间:2014/7/31 0:00:00
修稿时间:2014/9/28 0:00:00

Hub-Based Initialization for K-hubs
ZHANG Qiao-Da and HE Zhen-Feng.Hub-Based Initialization for K-hubs[J].Computer Systems& Applications,2015,24(4):171-175.
Authors:ZHANG Qiao-Da and HE Zhen-Feng
Affiliation:School of Mathematics and Computer Science, Fuzhou University, Fujian 350108, China;School of Mathematics and Computer Science, Fuzhou University, Fujian 350108, China
Abstract:K-hubs is a Hub-based clustering algorithm that is very sensitive to initialization. Therefore, this paper proposes an initialization method based on Hub to solve this problem. The initialization method takes full use of the feature of the Hubness phenomenon by selecting initial centers that are the most remote Hub points with each other. The experimental results show that compared with the random initialization of ordinary K-hubs algorithm, the proposed initialization method can obtain a better distribution of initial centers, which could enhance the clustering accuracy; moreover, the selected initial centers can appear near the cluster centers, which could speed up the convergence of the clustering algorithm.
Keywords:Hubness  initial center  maximin method  high-dimensional data  clustering
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号