Data compression and local metrics for nearest neighborclassification |
| |
Authors: | Ricci F Avesani P |
| |
Affiliation: | Ist. per la Ricerca Sci. e Tecnologica, Povo; |
| |
Abstract: | A local distance measure for the nearest neighbor classification rule is shown to achieve high compression rates and high accuracy on real data sets. In the approach proposed here, first, a set of prototypes is extracted during training and, then, a feedback learning algorithm is used to optimize the metric. Even if the prototypes are randomly selected, the proposed metric outperforms, both in compression rate and accuracy, common editing procedures like ICA, RNN, and PNN. Finally, when accuracy is the major concern, we show how compression can be traded for accuracy by exploiting voting techniques. That indicates how voting can be successfully integrated with instance-based approaches, overcoming previous negative results |
| |
Keywords: | |
|
|