首页 | 本学科首页   官方微博 | 高级检索  
     

基于离散量优化初始聚类中心的k-means算法
引用本文:刘美玲,黄名选,汤卫东.基于离散量优化初始聚类中心的k-means算法[J].计算机工程与科学,2017,39(6):1164-1170.
作者姓名:刘美玲  黄名选  汤卫东
作者单位:(1.广西民族大学信息科学与工程学院,广西 南宁 530006; 2.广西师范学院科学计算与智能信息处理广西高校重点实验室,广西 南宁 530023; 3.广西财经学院信息与统计学院,广西 南宁 530003)
基金项目:国家自然科学基金(61262028);广西高校科学技术研究项目(KY2015ZD039);广西民族大学科研项目(2011MDYB032);科学计算与智能信息处理广西高校重点实验室基金项目(GXSCIIP201201)
摘    要:传统k-means算法由于初始聚类中心的选择是随机的,因此会使聚类结果不稳定。针对这个问题,提出一种基于离散量改进k-means初始聚类中心选择的算法。算法首先将所有对象作为一个大类,然后不断从对象数目最多的聚类中选择离散量最大与最小的两个对象作为初始聚类中心,再根据最近距离将这个大聚类中的其他对象划分到与之最近的初始聚类中,直到聚类个数等于指定的k值。最后将这k个聚类作为初始聚类应用到k-means算法中。将提出的算法与传统k-means算法、最大最小距离聚类算法应用到多个数据集进行实验。实验结果表明,改进后的k-means算法选取的初始聚类中心唯一,聚类过程的迭代次数也减少了,聚类结果稳定且准确率较高。

关 键 词:离散量  k-means  聚类  聚类中心
收稿时间:2015-10-08
修稿时间:2017-06-25

A k-means algorithm for optimized initial clustering center based on discrete quantity
LIU Mei-ling,HUANG Ming-xuan,TANG Wei-dong.A k-means algorithm for optimized initial clustering center based on discrete quantity[J].Computer Engineering & Science,2017,39(6):1164-1170.
Authors:LIU Mei-ling  HUANG Ming-xuan  TANG Wei-dong
Affiliation:(1.College of Information Science and Engineering,Guangxi University for Nationalities,Nanning 530006; 2.Guangxi Higher Education Key Laboratory of Science Computing and Intelligent Information Processing, Guangxi Teachers Education University,Nanning 530023; 3.College of Information and Statistics,Guangxi University of Finance and Economics,Nanning 530003,China)  
Abstract:The initial clustering centers of traditional k-means are randomly selected, which results in unstable clustering results. To solve this problem, we propose an improved algorithm based on discrete quantity. In the proposed algorithm, all the objects are firstly regarded as a class and the two objects that have the maximum and the minimum discrete quantity respectively are selected from the cluster with the largest number of objects as the initial clustering centers. And then the other objects in the largest cluster are partitioned to the nearest initial clusters. The partition process is repeated until the cluster number is equal to the specified value k. Finally, as the initial clusters, the partitioned k clusters are applied to the k-means algorithm. We conduct experiments on several datasets, and compare the proposed algorithm with the traditional k-means algorithm and max-min distance clustering algorithm. Experimental results show that the improved k-means algorithm can select unique initial clustering centers, reduce the times of iteration, and has stable clustering results and higher accuracy.
Keywords:discrete quantity  k-means  clustering  clustering center  
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号