首页 | 本学科首页   官方微博 | 高级检索  
     


Model-based methods to identify multiple cluster structures in a data set
Authors:Giuliano Galimberti
Affiliation:Dipartimento di Scienze Statistiche, Alma Mater Studiorum - Università di Bologna, via Belle Arti 41, 40126 Bologna, Italy
Abstract:There is an interest in the problem of identifying different partitions of a given set of units obtained according to different subsets of the observed variables (multiple cluster structures). A model-based procedure has been previously developed for detecting multiple cluster structures from independent subsets of variables. The method relies on model-based clustering methods and on a comparison among mixture models using the Bayesian Information Criterion. A generalization of this method which allows the use of any model-selection criterion is considered. A new approach combining the generalized model-based procedure with variable-clustering methods is proposed. The usefulness of the new method is shown using simulated and real examples. Monte Carlo methods are employed to evaluate the performance of various approaches. Data matrices with two cluster structures are analyzed taking into account the separation of clusters, the heterogeneity within clusters and the dependence of cluster structures.
Keywords:Cluster analysis   Cluster structure   Mixture model   Model-selection   Clustering variables
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号