首页 | 本学科首页   官方微博 | 高级检索  
     

基于密度和聚类指数改进的K-means算法
引用本文:毛秀,冒纯丽,丁岳伟.基于密度和聚类指数改进的K-means算法[J].电子科技,2015,28(11):47.
作者姓名:毛秀  冒纯丽  丁岳伟
作者单位:(上海理工大学 光电信息与计算机工程学院,上海 200093)
基金项目:国家高科技研究发展计划基金资助项目(2014AA021502)
摘    要:传统K-means算法中,随机选择到的初始聚类中心不同会得到不一样的簇类,人工给定的k值与实际聚类数较难达到一致,针对这些问题,文中提出了基于密度和聚类指数改进的K-means聚类算法。根据密度获取高密度集HP,从此集合中选择相互之间距离最大的两对对象均值当成第一轮聚类的聚类中心,新的聚类中心可通过最大距离积法获取,并参考聚类指数确定合适的k值。通过进行实验确认了该算法有较高的准确性和有效性。

关 键 词:k均值算法  初始聚类中心  高密度集  最大距离积法  聚类指数  

Improved K-means Algorithm Based on Density and Clustering Index
MAO Xiu,MAO Chunli,DING Yuewei.Improved K-means Algorithm Based on Density and Clustering Index[J].Electronic Science and Technology,2015,28(11):47.
Authors:MAO Xiu  MAO Chunli  DING Yuewei
Affiliation:(School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China)
Abstract:The traditional K-means clustering algorithm initializing cluster centers randomly leads to the difference of clustering results.In the actual cases,it is difficult to give the exact number of clusters k to the algorithm in advance.In view of this,an improved K-means algorithm is presented based on density and clustering index.The algorithm generates the high density set named High Point(HP) according to all the objects' densities and chooses the mean of the furthest mutual distance two sample objects couples from HP as the first cluster's initial centers.The new initial center from rest objects in HP can be obtained by the maximum distances product algorithm.At the same time,the new algorithm automatically analyzes the clustering quality as the growth of the k values and determines the optimal number of clusters by selecting the index.Experiments show that the new algorithm achieves higher accuracy and validity.
Keywords:K means algorithm  initial clustering center  high density set  maximum distances product  clustering index  
本文献已被 万方数据 等数据库收录!
点击此处可从《电子科技》浏览原始摘要信息
点击此处可从《电子科技》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号