Bootstrap estimation of the proportion of outliers in robust regression

Qiang Heng,Kenneth Lange
DOI: https://doi.org/10.1007/s11222-024-10526-1
IF: 2.3241
2024-11-23
Statistics and Computing
Abstract:This paper presents a nonparametric bootstrap method for estimating the proportions of inliers and outliers in robust regression models. Our approach is based on the concept of stability, providing robustness against distributional assumptions and eliminating the need for pre-specified confidence levels. Through numerical experiments, we demonstrate that this method yields more accurate and stable estimates than existing alternatives. Additionally, the generated instability paths offer a valuable graphical tool for understanding the inlier and outlier distributions within the data. The method naturally extends to generalized linear models, where we find that variance-stabilizing transformations produce residuals that are well-suited for outlier detection. Applications to two real-world datasets further illustrate the practical utility of our approach in identifying outliers.
statistics & probability,computer science, theory & methods
What problem does this paper attempt to address?