A theory of cross-validation error |
| |
Authors: | PETER TURNEY |
| |
Affiliation: | Knowledge Systems Laboratory, Institute for Information Technology, National Research Council Canada , Ottawa, Ontario, K1A 0R6, Canada Phone: 613-993-8564 E-mail: peter@ai.iit.nrc.ca |
| |
Abstract: | Abstract This paper presents a theory of error in cross-validation testing of algorithms for predicting real-valued attributes. The theory justifies the claim that predicting real-valued attributes requires balancing the conflicting demands of simplicity and accuracy. Furthermore, the theory indicates precisely how these conflicting demands must be balanced, in order to minimize cross-validation error. A general theory is presented, then it is developed in detail for linear regression and instance-based learning |
| |
Keywords: | cross-validation simplicity bias variance AIC linear regression instance-based learning |
|
|