A Comparative analysis of multiple outlier detection procedures in the linear regression model
We evaluate several published techniques to detect multiple outliers in linear regression using an extensive Monte Carlo simulation. These procedures include both direct methods from algorithms and indirect methods from robust regression estimators. We evaluate the impact of outlier density and geom...
Saved in:
| Published in | Computational statistics & data analysis Vol. 36; no. 3; pp. 351 - 382 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
Amsterdam
Elsevier B.V
28.05.2001
Elsevier Science Elsevier |
| Series | Computational Statistics & Data Analysis |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0167-9473 1872-7352 |
| DOI | 10.1016/S0167-9473(00)00042-6 |
Cover
| Summary: | We evaluate several published techniques to detect multiple outliers in linear regression using an extensive Monte Carlo simulation. These procedures include both direct methods from algorithms and indirect methods from robust regression estimators. We evaluate the impact of outlier density and geometry, regressor variable dimension, and outlying distance in both leverage and residual on detection capability and false alarm (swamping) probability. The simulation scenarios focus on outlier configurations likely to be encountered in practice and use a designed experiment approach. The results for each scenario provide insight and limitations to performance for each technique. Finally, we summarize each procedure's performance and make recommendations. |
|---|---|
| ISSN: | 0167-9473 1872-7352 |
| DOI: | 10.1016/S0167-9473(00)00042-6 |