Towards improving cluster-based feature selection with a simplified silhouette filter
Authors:Thiago F. Covões
Affiliation:Department of Computer Sciences, University of São Paulo (USP) at São Carlos, Av. Trabalhador São-carlense, 400, Centro Caixa Postal 668, CEP 13.560-970, São Carlos, SP, Brazil
Abstract:This paper proposes a filter-based algorithm for feature selection. The filter is based on partitioning the set of features into clusters. The number of clusters, and consequently the cardinality of the subset of selected features, is automatically estimated from data. The computational complexity of the proposed algorithm is also investigated. A variant of this filter that considers feature-class correlations is also proposed for classification problems. Empirical results involving ten datasets illustrate the performance of the developed algorithm, which in general obtained competitive results in terms of classification accuracy when compared to state-of-the-art algorithms that find clusters of features. We show that, if computational efficiency is an important issue, then the proposed filter may be preferred over its counterparts, thus becoming eligible to join a pool of feature selection algorithms to be used in practice. As an additional contribution of this work, a theoretical framework is used to formally analyze some properties of feature selection methods that rely on finding clusters of features.
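
The general idea in the abstract (cluster the features, estimate the number of clusters from data, keep one representative per cluster) can be illustrated with a minimal sketch. This is not the authors' algorithm: the dissimilarity (1 - |Pearson correlation|), the farthest-first initialisation, the plain k-medoids loop, and all function names and the `k_max` parameter are assumptions made for illustration. The simplified silhouette used here scores each feature from its distances to the cluster medoids only, rather than to every other feature.

```python
import numpy as np

def feature_distance(X):
    """Assumed dissimilarity between features (columns): 1 - |Pearson correlation|."""
    corr = np.clip(np.corrcoef(X, rowvar=False), -1.0, 1.0)
    D = 1.0 - np.abs(corr)
    np.fill_diagonal(D, 0.0)  # remove floating-point residue on the diagonal
    return D

def farthest_first(D, k):
    """Deterministic initial medoids: most central feature, then farthest-first."""
    medoids = [int(np.argmin(D.sum(axis=1)))]
    while len(medoids) < k:
        nearest = D[:, medoids].min(axis=1)
        medoids.append(int(np.argmax(nearest)))
    return np.array(medoids)

def k_medoids(D, k, n_iter=100):
    """Plain alternating k-medoids on a precomputed distance matrix."""
    medoids = farthest_first(D, k)
    for _ in range(n_iter):
        labels = np.argmin(D[:, medoids], axis=1)
        new_medoids = medoids.copy()
        for c in range(k):
            members = np.flatnonzero(labels == c)
            if members.size:
                # member with the smallest total distance to its own cluster
                costs = D[np.ix_(members, members)].sum(axis=1)
                new_medoids[c] = members[np.argmin(costs)]
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids
    labels = np.argmin(D[:, medoids], axis=1)
    return medoids, labels

def simplified_silhouette(D, medoids, labels):
    """Mean of (b - a) / max(a, b), with a and b measured to medoids only."""
    d = D[:, medoids].copy()
    idx = np.arange(D.shape[0])
    a = d[idx, labels]            # distance to own medoid
    d[idx, labels] = np.inf
    b = d.min(axis=1)             # distance to nearest other medoid
    denom = np.maximum(a, b)
    s = np.zeros_like(a)
    ok = denom > 0
    s[ok] = (b[ok] - a[ok]) / denom[ok]
    return float(s.mean())

def select_features(X, k_max):
    """Try k = 2..k_max, keep the medoids of the best-scoring partition."""
    D = feature_distance(X)
    best_score, best_medoids = -np.inf, None
    for k in range(2, k_max + 1):
        medoids, labels = k_medoids(D, k)
        score = simplified_silhouette(D, medoids, labels)
        if score > best_score:
            best_score, best_medoids = score, np.sort(medoids)
    return best_medoids, best_score
```

Calling `select_features(X, k_max)` on a samples-by-features array returns the indices of the retained features and the score of the chosen partition. In this sketch every medoid scores 1, so the criterion drifts towards 1 as `k` approaches the number of features; `k_max` should therefore stay well below that number.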
Keywords:Feature selection; Filters; Clustering; Classification
This article is indexed in ScienceDirect and other databases.