首页 | 本学科首页   官方微博 | 高级检索  
     

优化分配策略的密度峰值聚类算法
引用本文:丁志成,葛洪伟. 优化分配策略的密度峰值聚类算法[J]. 计算机科学与探索, 2020, 14(5): 792-802
作者姓名:丁志成  葛洪伟
作者单位:江南大学 江苏省模式识别与计算智能工程实验室,江苏 无锡 214122;江南大学 物联网工程学院,江苏 无锡 214122;江南大学 江苏省模式识别与计算智能工程实验室,江苏 无锡 214122;江南大学 物联网工程学院,江苏 无锡 214122
基金项目:The Research Innovation Program for College Graduate of Jiangsu Province under Grant No. KYLX16_0781 (江苏省普通高校研究生科研创新计划项目);(江苏省高校优势学科建设工程项目)
摘    要:针对密度峰值聚类算法在面对复杂结构数据集时容易出现分配错误的问题,提出一种优化分配策略的密度峰值聚类算法(ODPC)。新算法首先引入参数积γ,扩大了聚类中心的选取范围;然后使用改进的数据点分配策略,对数据集的数据点进行基于相似度指标MS的重新分配,进一步优化了簇类中点集的分配;最后使用dc近邻法优化识别数据集的噪声点。在人工数据集及UCI真实数据集上的实验均可证明,新算法能够在优化噪声识别的同时,提高复杂流形数据集中数据点分配的正确率,并取得比DPC算法、DenPEHC算法、GDPC算法更好的聚类效果。

关 键 词:密度聚类  快速搜索与发现密度峰值聚类(DPC)  分配策略

Density Peaks Clustering with Optimized Allocation Strategy
DING Zhicheng,GE Hongwei. Density Peaks Clustering with Optimized Allocation Strategy[J]. Journal of Frontier of Computer Science and Technology, 2020, 14(5): 792-802
Authors:DING Zhicheng  GE Hongwei
Affiliation:(Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence,Jiangnan University,Wuxi,Jiangsu 214122,China;School of Internet of Things Engineering,Jiangnan University,Wuxi,Jiangsu 214122,China)
Abstract:Focused on the issue that density peaks clustering algorithm will make mistakes when facing data sets allocation with complex structures, a kind of density peaks clustering with optimized allocation strategy(ODPC) is proposed in this paper. Firstly, the parameter product γ is introduced into the new algorithm to expand the selection of cluster centers. Then, it proposes an improved allocation strategy for data points, which redistributes points of data sets with similarity index MS, and further optimizes the allocation of points. Finally, dcnearest neighbor method is used to optimally identify the noise points of data sets. The experiments on artificial and UCI real data sets show that the new algorithm can improve the accuracy of complex manifold data sets allocation while optimizing noise recognition, and achieves better clustering results than DPC(clustering by fast search and find of density peaks), DenPEHC(density peak based efficient hierarchical clustering) and GDPC(density peaks clustering algorithm with gird-division strategy) algorithms.
Keywords:density clustering  clustering by fast search and find of density peaks(DPC)  allocation strategy
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号