Dimension reduction via principal variables |
| |
Authors: | JA Cumming DA Wooff |
| |
Affiliation: | Department of Mathematical Sciences, Durham University, Science Laboratories, Stockton Road, Durham, DH1 3LE, UK |
| |
Abstract: | For many large-scale datasets it is necessary to reduce dimensionality to the point where further exploration and analysis can take place. Principal variables are a subset of the original variables and preserve, to some extent, the structure and information carried by the original variables. Dimension reduction using principal variables is considered and a novel algorithm for determining such principal variables is proposed. This method is tested and compared with 11 other variable selection methods from the literature in a simulation study and is shown to be highly effective. Extensions to this procedure are also developed, including a method to determine longitudinal principal variables for repeated measures data, and a technique for incorporating utilities in order to modify the selection process. The method is further illustrated with real datasets, including some larger UK data relating to patient outcome after total knee replacement. |
| |
Keywords: | Variable selection Principal components Partial correlation Partial covariance Utility Longitudinal data Repeated measures |
本文献已被 ScienceDirect 等数据库收录! |
|