首页 | 本学科首页   官方微博 | 高级检索  
     

基于网格和最近邻居的聚类算法
引用本文:陈义如,孙广中,许胤龙.基于网格和最近邻居的聚类算法[J].计算机辅助工程,2008,17(1):81-86.
作者姓名:陈义如  孙广中  许胤龙
作者单位:中国科学技术大学,省部共建高性能计算与应用重点实验室,合肥,230027
摘    要:针对目前已有的聚类算法不能很好地处理包含不同密度的簇数据,或者不能很好地区分相邻的密度相差不大的簇的问题,提出1种新的基于严格最近邻居和共享最近邻居的聚类算法.通过构造共享严格最近邻图,使样本点在密度一致的区域保持连接,而在密度不同的相邻区域断开连接,并尽可能去除噪声点和孤立点.该算法可以处理包含有不同密度的簇数据,而且在处理高维数据时具有较低的时间复杂度、实验结果证明,该算法能有效找出不同大小、形状和密度的聚类.

关 键 词:聚类算法  相似度  密度  网格  最近邻居
文章编号:1006-0871(2008)01-0081-06
收稿时间:2007-02-28
修稿时间:2007-04-05

Clustering algorithm based on grid and nearest neighbors
CHEN Yiru,SUN Guangzhong,XU Yinlong.Clustering algorithm based on grid and nearest neighbors[J].Computer Aided Engineering,2008,17(1):81-86.
Authors:CHEN Yiru  SUN Guangzhong  XU Yinlong
Affiliation:CHEN Yiru,SUN Guangzhong,XU Yinlong(Anhui Province-MOST Key Co-Lab of High Performance Computing , its Application,Univ.of Sci.& Tech.of China,Hefei 230027,China)
Abstract:Due to the fact that the current clustering algorithms can not perform well while processing clustering datasets which contain clusters with different densities or distinguishing adjacent clusters with similar densities, a new clustering algorithm is proposed based on strict nearest neighbors and shared nearest neighbors. The algorithm keeps the links in regions of uniform density and breaks the links in regions of different density and removes the noises and isolated points by constructing the shared strict nearest neighbor graph. It processes datasets containing clusters with different densities and has low time complexity while dealing with high dimensional data. The experiment results prove that the algorithm can efficiently find clusters with differing shapes, sizes and densities.
Keywords:clustering algorithm  similarity  density  grid  nearest neighbor
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号