The Use of Multiple Imputation for Data Subject to Limits of Detection

Missing data due to limit of detection and limit of quantification is a common obstacle in epidemiological and biomedical research. We are interested in methodologies that provide unbiased and efficient estimates of these missing data while using popular statistical software. We describe a multiple...

Full description

Saved in:
Bibliographic Details
Published inSri Lankan Journal of Applied Statistics Vol. 5; no. 4; pp. 227 - 246
Main Authors Harel, Ofer, Perkins, Neil, Schisterman, Enrique F.
Format Journal Article
LanguageEnglish
Published Sri Lanka 01.01.2014
Subjects
Online AccessGet full text
ISSN1391-4987
2424-6271
2424-6271
DOI10.4038/sljastats.v5i4.7792

Cover

More Information
Summary:Missing data due to limit of detection and limit of quantification is a common obstacle in epidemiological and biomedical research. We are interested in methodologies that provide unbiased and efficient estimates of these missing data while using popular statistical software. We describe a multiple imputation (MI) procedure for cross-sectional and longitudinal data which examines the sources of variation of hormones levels throughout the menstrual cycle conditional on specific biomarkers. We describe the rational, procedure, advantages and disadvantages of the multiple imputation procedure. We also provide a comparison to commonly used missing data procedures (complete cases analysis and single imputation). We illustrate our approach using the BioCycle data where we are interested in the effects of Vitamin E and Beta-carotene on Progesterone levels. We also evaluate the longitudinal impact of changes in Vitamin E on Progesterone levels over time. Finaly, we demonstrate the advantages of using MI over complete case analysis or naive single replacement in both cross-sectional and longitudinal analysis where measurements below the limit of quantification (LOQ) are unreported. We also illustrate that if available, inclusion of potentially demined unreliable data below the limit of detection (LOD) improves simple estimation substantially.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1391-4987
2424-6271
2424-6271
DOI:10.4038/sljastats.v5i4.7792