Automatically determine density of cluster center of peak algorithm
Cite this article: WANG Yang, ZHANG Guizhu. Automatically determine density of cluster center of peak algorithm[J]. Computer Engineering and Applications, 2018, 54(8): 137-142.
Authors: WANG Yang, ZHANG Guizhu
Affiliation: School of Internet of Things Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China
Abstract: Density Peaks Clustering (DPC) is a density-based clustering algorithm whose advantages include not requiring clustering parameters to be specified and being able to discover non-spherical clusters. It has two shortcomings: the cutoff distance d_c is computed from experience and cannot cope effectively with every scenario, and selecting cluster centers manually makes it difficult to obtain the actual cluster centers accurately. To address both, a method based on the Gini index is proposed that adapts the cutoff distance and obtains the cluster centers automatically, overcoming the inability of the traditional DPC algorithm to handle complex data sets. The algorithm first determines the cutoff distance d_c adaptively via the Gini index, then computes a cluster center weight for each point, and then uses the change in slope to locate the critical point. This strategy avoids the errors introduced by selecting cluster centers manually from the decision graph. Experiments show that the new algorithm not only determines the cluster centers automatically but also achieves higher accuracy than the original algorithm.

Keywords: density peak; clustering; cluster center point; Gini index
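
The three steps named in the abstract (adaptive d_c, cluster center weights, slope-based critical point) can be illustrated with a minimal sketch. The abstract gives no formulas, so everything below is an assumption rather than the authors' method: a Gaussian-kernel density, a Gini impurity 1 - sum(p_i^2) over the normalized densities that is minimized over a hypothetical candidate grid for d_c, a center weight gamma_i = rho_i * delta_i, and a critical point taken at the steepest drop of the sorted weights. All function names and the toy data are hypothetical.

```python
import math

def pairwise(points):
    """Full Euclidean distance matrix as nested lists."""
    return [[math.dist(p, q) for q in points] for p in points]

def density(dist, dc):
    """Gaussian-kernel local density rho_i (an assumed DPC variant)."""
    return [sum(math.exp(-(d / dc) ** 2) for j, d in enumerate(row) if j != i)
            for i, row in enumerate(dist)]

def gini(values):
    """Gini impurity 1 - sum(p_i^2) of the normalized values (assumed form)."""
    total = sum(values)
    return 1.0 - sum((v / total) ** 2 for v in values)

def choose_dc(dist, candidates):
    """Assumed rule: keep the candidate d_c with the lowest Gini impurity."""
    return min(candidates, key=lambda dc: gini(density(dist, dc)))

def center_weights(dist, rho):
    """gamma_i = rho_i * delta_i, where delta_i is the distance to the
    nearest point of higher density (ties broken by index)."""
    order = sorted(range(len(rho)), key=lambda i: -rho[i])
    gamma = [0.0] * len(rho)
    for rank, i in enumerate(order):
        if rank == 0:
            delta = max(dist[i])  # densest point: use its farthest distance
        else:
            delta = min(dist[i][j] for j in order[:rank])
        gamma[i] = rho[i] * delta
    return gamma

def pick_centers(gamma):
    """Assumed critical-point rule: cut the descending gamma curve at the
    steepest drop between neighbouring values."""
    order = sorted(range(len(gamma)), key=lambda i: -gamma[i])
    drops = [gamma[order[k]] - gamma[order[k + 1]]
             for k in range(len(order) - 1)]
    cut = max(range(len(drops)), key=drops.__getitem__)
    return order[:cut + 1]

def cross(cx, cy, r=0.1):
    """Toy cluster: a centre point with four neighbours at distance r."""
    return [(cx, cy), (cx + r, cy), (cx - r, cy), (cx, cy + r), (cx, cy - r)]

# Two well-separated toy clusters; indices 0 and 5 are their centre points.
points = cross(0.0, 0.0) + cross(5.0, 5.0)
dist = pairwise(points)
dc = choose_dc(dist, [0.05, 0.1, 0.2, 0.5])  # hypothetical candidate grid
rho = density(dist, dc)
centers = sorted(pick_centers(center_weights(dist, rho)))
print(dc, [points[i] for i in centers])
```

On this toy data the two centre points dominate the weight curve, so the steepest drop falls right after them. Real data sets rarely separate this cleanly, and the published selection rules may differ from the ones assumed here.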