首页 | 本学科首页   官方微博 | 高级检索  
     


Statistical computation of feature weighting schemes through data estimation for nearest neighbor classifiers
Authors:José   A. Sá  ez,Joaquí  n Derrac,Juliá  n Luengo,Francisco Herrera
Affiliation:1. Department of Computer Science and Artificial Intelligence, University of Granada, CITIC-UGR, Granada 18071, Spain;2. School of Computer Science & Informatics, Cardiff University, Cardiff CF24 3AA, United Kingdom;3. Department of Civil Engineering, LSI, University of Burgos, Burgos 09006, Spain
Abstract:
The Nearest Neighbor rule is one of the most successful classifiers in machine learning. However, it is very sensitive to noisy, redundant and irrelevant features, which may cause its performance to deteriorate. Feature weighting methods try to overcome this problem by incorporating weights into the similarity function to increase or reduce the importance of each feature, according to how they behave in the classification task. This paper proposes a new feature weighting classifier, in which the computation of the weights is based on a novel idea combining imputation methods – used to estimate a new distribution of values for each feature based on the rest of the data – and the Kolmogorov–Smirnov nonparametric statistical test to measure the changes between the original and imputed distribution of values. This proposal is compared with classic and recent feature weighting methods. The experimental results show that our feature weighting scheme is very resilient to the choice of imputation method and is an effective way of improving the performance of the Nearest Neighbor classifier, outperforming the rest of the classifiers considered in the comparisons.
Keywords:Feature weighting   Imputation methods   Nearest neighbor   Classification
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号