首页 | 本学科首页   官方微博 | 高级检索  
     


Hybrid clustering solution selection strategy
Authors:Zhiwen Yu  Le Li  Yunjun Gao  Jane You  Jiming Liu  Hau-San Wong  Guoqiang Han
Affiliation:1. School of Computer Science and Engineering, South China University of Technology, China;2. College of Computer Science, Zhejiang University, China;3. Department of Computing, Hong Kong Polytechnic University, Hong Kong;4. Department of Computer Science, Hong Kong Baptist University, Hong Kong;5. Department of Computer Science, City University of Hong Kong, Hong Kong
Abstract:Cluster ensemble approaches make use of a set of clustering solutions which are derived from different data sources to gain a more comprehensive and significant clustering result over conventional single clustering approaches. Unfortunately, not all the clustering solutions in the ensemble contribute to the final result. In this paper, we focus on the clustering solution selection strategy in the cluster ensemble, and propose to view clustering solutions as features such that suitable feature selection techniques can be used to perform clustering solution selection. Furthermore, a hybrid clustering solution selection strategy (HCSS) is designed based on a proposed weighting function, which combines several feature selection techniques for the refinement of clustering solutions in the ensemble. Finally, a new measure is designed to evaluate the effectiveness of clustering solution selection strategies. The experimental results on both UCI machine learning datasets and cancer gene expression profiles demonstrate that HCSS works well on most of the datasets, obtains more desirable final results, and outperforms most of the state-of-the-art clustering solution selection strategies.
Keywords:Cluster ensemble  Clustering solution selection  Feature selection  Hybrid strategy
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号