Defining, Evaluating, and Removing Bias Induced by Linear Imputation in Longitudinal Clinical Trials with MNAR Missing Data

Missing not at random (MNAR) post-dropout missing data from a longitudinal clinical trial result in the collection of "biased data," which leads to biased estimators and tests of corrupted hypotheses. In a full rank linear model analysis the model equation, E[ Y ] = X β, leads to the defin...

Full description

Saved in:
Bibliographic Details
Published inJournal of biopharmaceutical statistics Vol. 21; no. 2; pp. 226 - 251
Main Authors Helms, Ronald W., Reece, Laura Helms, Helms, Russell W., Helms, Mary W.
Format Journal Article
LanguageEnglish
Published England Taylor & Francis Group 01.03.2011
Taylor & Francis Ltd
Subjects
Online AccessGet full text
ISSN1054-3406
1520-5711
1520-5711
DOI10.1080/10543406.2011.550097

Cover

More Information
Summary:Missing not at random (MNAR) post-dropout missing data from a longitudinal clinical trial result in the collection of "biased data," which leads to biased estimators and tests of corrupted hypotheses. In a full rank linear model analysis the model equation, E[ Y ] = X β, leads to the definition of the primary parameter β = (X′X) −1 X′E[ Y ], and the definition of linear secondary parameters of the form θ = L β = L(X′X) −1 X′E[ Y ], including, for example, a parameter representing a "treatment effect." These parameters depend explicitly on E[ Y ], which raises the questions: What is E[ Y ] when some elements of the incomplete random vector Y are not observed and MNAR, or when such a Y is "completed" via imputation? We develop a rigorous, readily interpretable definition of E[ Y ] in this context that leads directly to definitions of β, , , and the extent of hypothesis corruption. These definitions provide a basis for evaluating, comparing, and removing biases induced by various linear imputation methods for MNAR incomplete data from longitudinal clinical trials. Linear imputation methods use earlier data from a subject to impute values for post-dropout missing values and include "Last Observation Carried Forward" (LOCF) and "Baseline Observation Carried Forward" (BOCF), among others. We illustrate the methods of evaluating, comparing, and removing biases and the effects of testing corresponding corrupted hypotheses via a hypothetical but very realistic longitudinal analgesic clinical trial.
Bibliography:SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-2
ObjectType-Undefined-1
ObjectType-Feature-3
content type line 23
ISSN:1054-3406
1520-5711
1520-5711
DOI:10.1080/10543406.2011.550097