A Comparative analysis of multiple outlier detection procedures in the linear regression model

We evaluate several published techniques to detect multiple outliers in linear regression using an extensive Monte Carlo simulation. These procedures include both direct methods from algorithms and indirect methods from robust regression estimators. We evaluate the impact of outlier density and geom...

Full description

Saved in:
Bibliographic Details
Published inComputational statistics & data analysis Vol. 36; no. 3; pp. 351 - 382
Main Authors Wisnowski, James W, Montgomery, Douglas C, Simpson, James R
Format Journal Article
LanguageEnglish
Published Amsterdam Elsevier B.V 28.05.2001
Elsevier Science
Elsevier
SeriesComputational Statistics & Data Analysis
Subjects
Online AccessGet full text
ISSN0167-9473
1872-7352
DOI10.1016/S0167-9473(00)00042-6

Cover

More Information
Summary:We evaluate several published techniques to detect multiple outliers in linear regression using an extensive Monte Carlo simulation. These procedures include both direct methods from algorithms and indirect methods from robust regression estimators. We evaluate the impact of outlier density and geometry, regressor variable dimension, and outlying distance in both leverage and residual on detection capability and false alarm (swamping) probability. The simulation scenarios focus on outlier configurations likely to be encountered in practice and use a designed experiment approach. The results for each scenario provide insight and limitations to performance for each technique. Finally, we summarize each procedure's performance and make recommendations.
ISSN:0167-9473
1872-7352
DOI:10.1016/S0167-9473(00)00042-6