首页 | 本学科首页   官方微博 | 高级检索  
     


A polythetic clustering process and cluster validity indexes for histogram-valued objects
Authors:Jaejik Kim  L. Billard
Affiliation:
  • a Department of Biostatistics, Georgia Health Sciences University, Augusta, GA 30912, USA
  • b Department of Statistics, University of Georgia, Athens, GA 30602, USA
  • Abstract:Clustering is an explanatory procedure which helps to understand data with complex structure and multivariate relationships, and is a very useful method to extract knowledge and information especially from large datasets. When such datasets are aggregated into categories (as driven by scientific questions underlying the analysis), the resulting observations will perforce be expressed as so-called symbolic data (though symbolic data can occur “naturally” in any sized datasets). The focus of this work is to provide a divisive polythetic algorithm to establish clusters for p-dimensional histogram-valued data. In addition, two cluster validity indexes for use in establishing the optimal number of clusters are also developed. Finally, the proposed procedure is applied to a large forestry cover type dataset.
    Keywords:Divisive clustering   Quantitative histogram data   Dunn index and Davis-Bouldin index for symbolic data
    本文献已被 ScienceDirect 等数据库收录!
    设为首页 | 免责声明 | 关于勤云 | 加入收藏

    Copyright©北京勤云科技发展有限公司  京ICP备09084417号