Selection of the number of clusters via the bootstrap method
Authors: Yixin Fang, Junhui Wang
Affiliation:
  • a Division of Biostatistics, Department of Environmental Medicine, New York University, 650 First Avenue, Room 551, New York, NY 10016, United States
  • b Department of Mathematics, Statistics, and Computer Science, University of Illinois at Chicago, United States
Abstract: The problem of selecting the number of clusters in cluster analysis is considered. Recently, the concept of clustering stability, which measures the robustness of any given clustering algorithm, was used in Wang (2010) to select the number of clusters through cross-validation. In this paper, an estimation scheme for clustering instability is developed based on the bootstrap, and the number of clusters is then selected so that the corresponding estimated clustering instability is minimized. The effectiveness of the proposed selection criterion is demonstrated on simulations and real examples.
Keywords: Cluster analysis; K-means; Spectral clustering; Stability
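
The abstract does not spell out the estimator, so the following is only a minimal Python sketch of one way a bootstrap instability criterion could be set up. It assumes that instability for a given k is the average fraction of point pairs on which two clusterings, each fitted to an independent bootstrap resample and then applied to the original data, disagree about co-membership. The function names (`kmeans_centers`, `instability`, `select_k`), the farthest-first initialization, and the default parameter values are illustrative assumptions, not the authors' implementation.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def nearest(point, centers):
    """Index of the center closest to `point`."""
    return min(range(len(centers)), key=lambda c: sq_dist(point, centers[c]))


def kmeans_centers(points, k, rng, n_iter=25):
    """Lloyd's k-means; farthest-first initialization is an assumption."""
    centers = [rng.choice(points)]
    while len(centers) < k:
        # Add the point that is farthest from all centers chosen so far.
        centers.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centers))
        )
    for _ in range(n_iter):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # Recompute each center as its group mean; keep it if the group is empty.
        centers = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else centers[c]
            for c, g in enumerate(groups)
        ]
    return centers


def disagreement(labels_a, labels_b):
    """Fraction of point pairs grouped together by one labeling but not the other."""
    n = len(labels_a)
    differ = 0
    for i in range(n):
        for j in range(i + 1, n):
            same_a = labels_a[i] == labels_a[j]
            same_b = labels_b[i] == labels_b[j]
            differ += same_a != same_b
    return differ / (n * (n - 1) / 2)


def instability(points, k, rng, n_boot=20):
    """Average disagreement over `n_boot` pairs of bootstrap-fitted clusterings."""
    total = 0.0
    for _ in range(n_boot):
        labelings = []
        for _ in range(2):
            # Resample with replacement, fit, then label the ORIGINAL points.
            sample = [rng.choice(points) for _ in points]
            centers = kmeans_centers(sample, k, rng)
            labelings.append([nearest(p, centers) for p in points])
        total += disagreement(labelings[0], labelings[1])
    return total / n_boot


def select_k(points, candidates, rng, n_boot=20):
    """Return the candidate k with the smallest estimated instability, plus all scores."""
    scores = {k: instability(points, k, rng, n_boot) for k in candidates}
    return min(scores, key=scores.get), scores
```

Calling `select_k(points, [2, 3, 4, 5], random.Random(0))` on a list of coordinate tuples returns the chosen k together with the per-k instability estimates. The pairwise disagreement count is quadratic in the number of points, which is acceptable for a sketch but would need a faster formulation on large data sets.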
This article is indexed in ScienceDirect and other databases.