Clustering techniques: The user's dilemma |
| |
Authors: | Richard Dubes Anil K Jain |
| |
Affiliation: | Department of Computer Science, Michigan State University, East Lansing, MI 48824, U.S.A. |
| |
Abstract: | Numerous papers on clustering techniques and their applications in engineering, medical, and biological areas have appeared in pattern recognition literature during the past decade. This paper attempts to set some guidelines for a potential user of a clustering technique. We examine eight clustering programs which are representative of the various available techniques and compare their performances from several points of view. A formal comparative analysis is also performed with a portion of Munson's handprinted character data set. We believe that an understanding of the intrinsic characteristics of a clustering technique is essential to the intelligent application of the technique. Further, the output of a clustering program, along with whatever information a user may have about the data set, should be used together to form hypotheses about the structure of the data set. |
| |
Keywords: | Clustering technique Patterns Features Squared error Distance measures Dendrogram Similarity matrix Hierarchical clustering Minimum spanning tree Admissability criteria |
本文献已被 ScienceDirect 等数据库收录! |
|