首页 | 本学科首页   官方微博 | 高级检索  
     

[K]近邻相似度优化的密度峰聚类
引用本文:朱庆峰,葛洪伟.[K]近邻相似度优化的密度峰聚类[J].计算机工程与应用,2019,55(2):148-153.
作者姓名:朱庆峰  葛洪伟
作者单位:1.轻工过程先进控制教育部重点实验室(江南大学),江苏 无锡 214122 2.江南大学 物联网工程学院,江苏 无锡 214122
摘    要:针对密度峰聚类分配时,仅考虑样本点与指向点(密度比它大的最近点)之间的距离,不适用于流形聚类(如Circleblock数据集、Lineblobs数据集等)的问题,提出了K]近邻相似度优化的密度峰聚类算法。在计算每个点的密度与指向点后,通过相似度函数,找出每个点的K]近邻,然后根据K]近邻信息判断样本点的指向点是否正确,对于指向错误的点重新寻找正确的指向点,可以有效减少错误分配。在人工数据集和UCI数据集上的实验表明,新算法具有更高的准确率。

关 键 词:聚类  密度峰  相似度  [K]近邻  

Density Peaks Clustering Optimized by K Nearest Neighbor's Similarity
ZHU Qingfeng,GE Hongwei.Density Peaks Clustering Optimized by K Nearest Neighbor's Similarity[J].Computer Engineering and Applications,2019,55(2):148-153.
Authors:ZHU Qingfeng  GE Hongwei
Affiliation:1.Ministry of Education Key Laboratory of Advanced Process Control for Light Industry(Jiangnan University), Wuxi, Jiangsu 214122, China 2.School of Internet of Things Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China
Abstract:For the clustering of density peaks, only the distance between the sample point and the point of pointing (the nearest point of density is bigger than it) is considered, and it is not applicable to the problem of manifold clustering (such as Circleblock data set, Lineblobs data set, etc.). A density peak clustering algorithm with K] similarity optimization is proposed. After calculating the density and point of each point, find the K] neighborhood of each point by the similarity function, and then judge whether the point of the sample point is correct according to the K] proximity information. For the point pointing to the wrong point, it can effectively reduce the error distribution. Experiments on artificial datasets and UCI datasets show that the new algorithm has a higher accuracy rate.
Keywords:clustering  density peaks  similarity  [K] nearest neighbor  
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号