首页 | 本学科首页   官方微博 | 高级检索  
     


K-means properties on six clustering benchmark datasets
Authors:Pasi Fränti  Sami Sieranoja
Affiliation:1.Machine Learning Group, School of Computing,University of Eastern Finland,Joensuu,Finland
Abstract:This paper has two contributions. First, we introduce a clustering basic benchmark. Second, we study the performance of k-means using this benchmark. Specifically, we measure how the performance depends on four factors: (1) overlap of clusters, (2) number of clusters, (3) dimensionality, and (4) unbalance of cluster sizes. The results show that overlap is critical, and that k-means starts to work effectively when the overlap reaches 4% level.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号