首页 | 本学科首页   官方微博 | 高级检索  
     


A variable elimination method to improve the parsimony of MLR models using the successive projections algorithm
Authors:Roberto Kawakami Harrop Galvo  Mrio Csar Ugulino Araújo  Wallace Duarte Fragoso  Edvan Cirino Silva  Gledson Emidio Jos  Sfacles Figueredo Carreiro Soares  Henrique Mohallem Paiva
Affiliation:

aInstituto Tecnológico de Aeronáutica, Divisão de Engenharia Eletrônica, 12228-900, São José dos Campos, SP, Brazil

bUniversidade Federal da Paraíba, CCEN, Departamento de Química, Caixa Postal 5093, CEP 58051-970 — João Pessoa, PB, Brazil

cEmpresa Brasileira de Aeronáutica (EMBRAER), Flight Control Systems, 12227-901, São José dos Campos, SP, Brazil

Abstract:The successive projections algorithm (SPA) is a variable selection technique designed to minimize collinearity problems in multiple linear regression (MLR). This paper proposes a modification to the basic SPA formulation aimed at further improving the parsimony of the resulting MLR model. For this purpose, an elimination procedure is incorporated to the algorithm in order to remove variables that do not effectively contribute towards the prediction ability of the model as indicated by an F-test. The utility of the proposed modification is illustrated in a simulation study, as well as in two application examples involving the analysis of diesel and corn samples by near-infrared (NIR) spectroscopy. The results demonstrate that the number of variables selected by SPA can be reduced without significantly compromising prediction performance. In addition, SPA is favourably compared with classic Stepwise Regression and full-spectrum PLS. A graphical user interface for SPA is available at www.ele.ita.br/not, vert, similarkawakami/spa/.
Keywords:Multiple linear regression  Variable selection  Successive projections algorithm  Near-infrared spectrometry  Diesel analysis  Corn analysis
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号