首页 | 本学科首页   官方微博 | 高级检索  
     

基于粒计算的K-medoids聚类算法
引用本文:马箐,谢娟英.基于粒计算的K-medoids聚类算法[J].计算机应用,2012,32(7):1973-1977.
作者姓名:马箐  谢娟英
作者单位:陕西师范大学 计算机科学学院,西安710062
基金项目:陕西省自然科学基金,中央高校基本科研业务费专项,陕西师范大学2011年研究生培养创新基金
摘    要:传统K-medoids聚类算法的聚类结果随初始中心点不同而波动,且计算复杂度较高不适于处理大规模数据集;快速K-medoids聚类算法通过选择合适的初始聚类中心改进了传统K-medoids聚类算法,但是快速K-medoids聚类算法的初始聚类中心有可能位于同一类簇。为克服传统K-medoids聚类算法和快速K-medoids聚类算法的缺陷,提出一种基于粒计算的K-medoids聚类算法。算法引入粒度概念,定义新的样本相似度函数,基于等价关系产生粒子,根据粒子包含样本多少定义粒子密度,选择密度较大的前K个粒子的中心样本点作为K-medoids聚类算法的初始聚类中心,实现K-medoids聚类。UCI机器学习数据库数据集以及随机生成的人工模拟数据集实验测试,证明了基于粒计算的K-medoids聚类算法能得到更好的初始聚类中心,聚类准确率和聚类误差平方和优于传统K-medoids和快速K-medoids聚类算法,具有更稳定的聚类结果,且适用于大规模数据集。

关 键 词:传统K-medoids聚类算法  快速K-medoids聚类算法  粒计算  等价关系  聚类  
收稿时间:2011-12-12
修稿时间:2012-03-20

New K-medoids clustering algorithm based on granular computing
MA Qing , XIE Juan-ying.New K-medoids clustering algorithm based on granular computing[J].journal of Computer Applications,2012,32(7):1973-1977.
Authors:MA Qing  XIE Juan-ying
Affiliation:School of Computer Science, Shaanxi Normal University, Xi'an Shaanxi 710062, China
Abstract:Traditional K-medoids clustering algorithm has some drawbacks,such as its clustering results being sensitive to initial cluster centers and its deficiency in large datasets.Although the fast K-medoids algorithm overcame the shortcomings of traditional K-medoids,it has the potential disadvantages of selecting the exemplars in the same cluster as initial seeds for different clusters.To overcome the shortcomings of the traditional K-medoids and the fast K-medoids clustering algorithms,a granular computing based K-medoids clustering algorithm was proposed in this paper.The algorithm defined a new similarity function between samples via pooling granularity,where the granules were produced via the equivalence relationship.The density of a granule was defined according to the number of samples in it,after that the K samples closest to the centers of the first K granules were selected as the initial centers for K-medoids clustering algorithm to cluster datasets.The experimental results on the datasets from UCI machine learning repository and on the synthetic datasets all demonstrate that the new granular computing based K-medoids clustering algorithm can find much better initial centers.Its clustering accuracy and its clustering error are better than those of the traditional K-medoids and the fast K-medoids clustering algorithms.It can get much more stable results and can be applied to cluster large datasets.
Keywords:traditional K-medoids clustering algorithm  fast K-medoids clustering algorithm  granular computing  equivalence relation  clustering
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号