首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于K-Means局部最优性的高效聚类算法
引用本文:LEI Xiao-Feng,谢昆青,LIN Fan,夏征义.一种基于K-Means局部最优性的高效聚类算法[J].软件学报,2008,19(7).
作者姓名:LEI Xiao-Feng  谢昆青  LIN Fan  夏征义
作者单位:1. 北京大学,信息科学技术学院智能科学系/视觉与听觉国家重点实验室,北京,100871
2. 中国人民解放军总后勤部,后勤科学研究所,北京,100071
基金项目:the National High-Tech Research and Development Plan of China under Grant No.2006AA12Z217 (国家高技术研究发展计划
摘    要:K-Means聚类算法只能保证收敛到局部最优,从而导致聚类结果对初始代表点的选择非常敏感.许多研究工作都着力于降低这种敏感性.然而,K-Means的局部最优和结果敏感性却构成了K-MeanSCAN聚类算法的基础.K-MeanSCAN算法对数据集进行多次采样和K-Means预聚类以产生多组不同的聚类结果,来自不同聚类结果的子簇之间必然会存在交集.算法的核心思想是,利用这些交集构造出关于子簇的加权连通图,并根据连通性合并子簇.理论和实验证明,K-MeanScan算法可以在很大程度上提高聚类结果的质量和算法的效率.

关 键 词:K-MeanSCAN  基于密度  K-Means  聚类  连通性

An Efficient Clustering Algorithm Based on Local Optimality of K-Means
LEI Xiao-Feng,XIE Kun-Qing,LIN Fan,XIA Zheng-Yi.An Efficient Clustering Algorithm Based on Local Optimality of K-Means[J].Journal of Software,2008,19(7).
Authors:LEI Xiao-Feng  XIE Kun-Qing  LIN Fan  XIA Zheng-Yi
Affiliation:LEI Xiao-Feng~
Abstract:K-Means is the most popular clustering algorithm with the convergence to one of numerous local minima,which results in much sensitivity to initial representatives.Many researches are made to overcome the sensitivity of K-Means algorithm.However,this paper proposes a novel clustering algorithm called K-MeanSCAN by means of the local optimality and sensitivity of K-Means.The core idea is to build the connectivity between sub-clusters based on the multiple clustering results of K-Means,where these clustering results are distinct because of local optimality and sensitivity of K-Means.Then a weighted connected graph of the sub-clusters is constructed using the connectivity,and the sub-clusters are merged by the graph search algorithm.Theoretic analysis and experimental demonstrations show that K-MeanSCAN outperforms existing algorithms in clustering quality and efficiency.
Keywords:K-MeanSCAN  density-based  K-Means  clustering  connectivity
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号