A procedure for outlier identification in data sets from continuous distributions
Authors: N. Balakrishnan, A. J. Quiroz
Affiliation: (1) Department of Mathematics and Statistics, McMaster University, Hamilton, Canada; (2) Departamento de Cómputo Científico y Estadística, Universidad Simón Bolívar, Spain
Abstract: We propose a procedure, based on sums of reciprocals of p-values, for identifying outliers in univariate or multivariate data sets drawn from continuous distributions. Using results of Csörgő (1990), we find the limiting distribution of the relevant statistic for completely specified models. By simulation, we obtain approximate quantiles for the asymptotic distribution (which does not depend on the specific model or on the dimension of the data) and, when parameters are estimated, for the finite-sample distribution of our statistic in different dimensions, under the multivariate Gaussian model and a multivariate double exponential model with independent coordinates. Monte Carlo evaluation shows that the proposed procedure is effective in identifying outliers and that it is sensitive to sample size, a feature seldom found in outlier identification methods.
Keywords: Outlier identification; St. Petersburg paradox; continuous distributions
This article is indexed in SpringerLink and other databases.