Error rates for multivariate outlier detection |
| |
Authors: | Andrea Cerioli (a), Alessio Farcomeni (b) |
| |
Affiliation: | (a) University of Parma, Via Kennedy 6, 43100 Parma, Italy; (b) Sapienza University of Rome, Piazzale Aldo Moro 5, 00186 Roma, Italy |
| |
Abstract: | Multivariate outlier identification requires the choice of reliable cut-off points for the robust distances that measure the discrepancy from the fit provided by high-breakdown estimators of location and scatter. Multiplicity issues affect the identification of the appropriate cut-off points. It is described how a careful choice of the error rate which is controlled during the outlier detection process can yield a good compromise between high power and low swamping, when alternatives to the Family Wise Error Rate are considered. Multivariate outlier detection rules based on the False Discovery Rate and the False Discovery Exceedance criteria are proposed. The properties of these rules are evaluated through simulation. The rules are then applied to real data examples. The conclusion is that the proposed approach provides a sensible strategy in many situations of practical interest. |
| |
Keywords: | False discovery rate; False discovery exceedance; Multiple outliers; Reweighted MCD; Masking and swamping |
This article is indexed in ScienceDirect and other databases. |
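
The abstract contrasts control of the Family Wise Error Rate with control of the False Discovery Rate. The following is a minimal sketch of that contrast, not the authors' exact rules, which this record does not give. It assumes each observation already has a p-value, for instance from comparing its robust distance with a reference distribution. The p-values are hypothetical, and the function names `bonferroni_flags` and `benjamini_hochberg_flags` are illustrative. The FDR rule is the standard Benjamini-Hochberg step-up procedure.

```python
def bonferroni_flags(pvalues, alpha=0.05):
    """Flag outliers while controlling the Family Wise Error Rate."""
    n = len(pvalues)
    # Every test is held to the same strict level alpha / n.
    return [p <= alpha / n for p in pvalues]


def benjamini_hochberg_flags(pvalues, alpha=0.05):
    """Flag outliers while controlling the False Discovery Rate (step-up)."""
    n = len(pvalues)
    # Indices of the observations, ordered from smallest to largest p-value.
    order = sorted(range(n), key=lambda i: pvalues[i])
    # Find the largest rank k such that p_(k) <= k * alpha / n.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank * alpha / n:
            cutoff_rank = rank
    # Flag every observation ranked at or below the cutoff.
    flags = [False] * n
    for idx in order[:cutoff_rank]:
        flags[idx] = True
    return flags


# Hypothetical p-values for ten observations (illustration only).
pvalues = [0.001, 0.004, 0.012, 0.015, 0.20, 0.35, 0.48, 0.61, 0.77, 0.93]

fwer_flags = bonferroni_flags(pvalues)
fdr_flags = benjamini_hochberg_flags(pvalues)

print("FWER (Bonferroni) flags:", sum(fwer_flags))
print("FDR (Benjamini-Hochberg) flags:", sum(fdr_flags))
```

On these illustrative inputs the FDR rule flags more observations than the Bonferroni rule. This is the kind of gain in power that the abstract weighs against the risk of swamping.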