Validation and data splitting in predictive regression modeling of honing surface roughness data |
| |
Authors: | C-X J Feng Z-G S Yu J-H J Wang |
| |
Affiliation: | 1. Department of Industrial and Manufacturing Engineering and Technology , Bradley University , 1501 W. Bradley Ave., Peoria, IL 61625, USA;2. Department of Industrial Technology , Eastern Michigan University , Ypsilanti, MI 48197, USA |
| |
Abstract: | Model validation is critical in predicting the performance of manufacturing processes. In predictive regression, proper selection of variables helps minimize the model mismatch error, proper selection of models helps reduce the model estimation error, and proper validation of models helps minimize the model prediction error. In this paper, the literature is briefly reviewed and a rigorous procedure is proposed for evaluating the validation and data splitting methods in predictive regression modeling. Experimental data from a honing surface roughness study will be used to illustrate the methodology. In particular, the individual versus average data splitting methods as well as the fivefold versus threefold cross-validation methods are compared. This paper shows that statistical tests and prediction errors evaluation are important in subset selection and cross-validation of predictive regression models. No statistical differences were found between the fivefold and the threefold cross-validation methods, and between use of the individual and average data splitting methods in predictive regression modeling. |
| |
Keywords: | Predictive modeling Regression Cross-validation Data splitting Design of experiments Honing surface roughness |
|
|