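K-medoids Algorithm Based on Optimized Initial Cluster Centers
Cite this article: DUAN Gui-qin, ZOU Chen-song, LIU Feng. K-medoids Algorithm Based on Optimized Initial Cluster Centers[J]. Computer and Modernization, 2019, 0(4): 1.
Authors: DUAN Gui-qin  ZOU Chen-song  LIU Feng
Affiliations: Department of Computer Science, Guangdong Songshan Polytechnic College, Shaoguan, Guangdong 512126; Department of Electrical Engineering, Guangdong Songshan Polytechnic College, Shaoguan, Guangdong 512126
Funding: Guangdong Provincial Major Scientific Research Project for Universities (2017GkQNCX033); Shaoguan Science and Technology Plan Project (2017CX/K055); Key Science and Technology Project of Guangdong Songshan Polytechnic College (2018KJZD001); Guangdong Special Fund for Cultivating College Students' Scientific and Technological Innovation (pdjh2015a0715)
Abstract: The initial cluster centers of the K-medoids algorithm may lie too close together, represent the data poorly, and give unstable results. To address these problems, an improved K-medoids algorithm is proposed. The ratio of the average distance over the sample set to the average distance of an individual sample is used as that sample's density parameter, which reduces the number of candidate representative points in the high-density point set. The maximum distance product method is then used to select K samples that have high density and lie far apart as the initial cluster centers, balancing the representativeness and dispersion of the centers. Experimental results on UCI data sets show that, compared with the traditional K-medoids algorithm and two other improved clustering algorithms, the proposed algorithm produces more accurate clustering results, converges faster, and is more stable.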

Keywords: density  initial cluster center  K-medoids  absolute error
Received: 2019-04-30

An Improved K-medoids Algorithm Based on Optimal Initial Cluster Center
DUAN Gui-qin,ZOU Chen-song,LIU Feng.An Improved K-medoids Algorithm Based on Optimal Initial Cluster Center[J].Computer and Modernization,2019,0(4):1.
Authors:DUAN Gui-qin  ZOU Chen-song  LIU Feng
Abstract:The initial cluster centers of the K-medoids algorithm may lie too close together, represent the data poorly, and give unstable results. To address these problems, an improved K-medoids algorithm is proposed. The ratio of the average distance over the sample set to the average distance of an individual sample is used as that sample's density parameter, which reduces the number of candidate representative points in the high-density point set. The maximum distance product method is then used to select K samples that have high density and lie far apart as the initial cluster centers, balancing the representativeness and dispersion of the centers. Experimental results on UCI data sets show that, compared with the traditional K-medoids algorithm and two other improved clustering algorithms, the proposed algorithm produces more accurate clustering results, converges faster, and is more stable.
Keywords:density  initial cluster center  K-medoids  absolute error  
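
The initialization outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact formulas, so the Euclidean metric, the density definition (set-wide mean pairwise distance divided by a sample's mean distance to the others), the candidate threshold of 1, and the function name `initial_medoids` are all assumptions made for illustration.

```python
import numpy as np


def initial_medoids(X, k):
    """Pick k initial medoid indices: dense samples that lie far apart.

    Assumptions (not taken from the paper's full text): Euclidean distance,
    density_i = (mean pairwise distance of the set) / (mean distance from i),
    and the candidate set is every sample whose density is at least 1.
    """
    X = np.asarray(X, dtype=float)
    n = len(X)
    # Pairwise Euclidean distance matrix, shape (n, n).
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    # Mean distance from each sample to the other n - 1 samples.
    sample_mean = dist.sum(axis=1) / (n - 1)
    # Mean distance over all ordered pairs of distinct samples.
    set_mean = dist.sum() / (n * (n - 1))
    density = set_mean / sample_mean

    # High-density candidates; fall back to the k densest samples if too few.
    candidates = np.flatnonzero(density >= 1.0)
    if len(candidates) < k:
        candidates = np.argsort(-density)[:k]

    # First center: the densest candidate.
    centers = [int(candidates[np.argmax(density[candidates])])]
    while len(centers) < k:
        # Product of distances from each candidate to every chosen center;
        # chosen centers score 0 because their distance to themselves is 0.
        product = dist[np.ix_(candidates, centers)].prod(axis=1)
        centers.append(int(candidates[np.argmax(product)]))
    return centers
```

The returned indices would then seed an ordinary K-medoids iteration. Restricting the search to high-density candidates keeps outliers from being chosen, while the distance product discourages picking two centers from the same dense region. The sketch assumes the samples are not all identical.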
This article is indexed in Wanfang Data and other databases.