Maximum Likelihood Estimations and EM Algorithms With Length-Biased Data

Length-biased sampling has been well recognized in economics, industrial reliability, etiology applications, and epidemiological, genetic, and cancer screening studies. Length-biased right-censored data have a unique data structure different from traditional survival data. The nonparametric and semi...

Full description

Saved in:
Bibliographic Details
Published inJournal of the American Statistical Association Vol. 106; no. 496; pp. 1434 - 1449
Main Authors Qin, Jing, Ning, Jing, Liu, Hao, Shen, Yu
Format Journal Article
LanguageEnglish
Published Alexandria, VA Taylor & Francis 01.12.2011
American Statistical Association
Taylor & Francis Ltd
Subjects
Online AccessGet full text
ISSN0162-1459
1537-274X
DOI10.1198/jasa.2011.tm10156

Cover

More Information
Summary:Length-biased sampling has been well recognized in economics, industrial reliability, etiology applications, and epidemiological, genetic, and cancer screening studies. Length-biased right-censored data have a unique data structure different from traditional survival data. The nonparametric and semiparametric estimation and inference methods for traditional survival data are not directly applicable for length-biased right-censored data. We propose new expectation-maximization algorithms for estimations based on full likelihoods involving infinite-dimensional parameters under three settings for length-biased data: estimating nonparametric distribution function, estimating nonparametric hazard function under an increasing failure rate constraint, and jointly estimating baseline hazards function and the covariate coefficients under the Cox proportional hazards model. Extensive empirical simulation studies show that the maximum likelihood estimators perform well with moderate sample sizes and lead to more efficient estimators compared to the estimating equation approaches. The proposed estimates are also more robust to various right-censoring mechanisms. We prove the strong consistency properties of the estimators, and establish the asymptotic normality of the semiparametric maximum likelihood estimators under the Cox model using modern empirical processes theory. We apply the proposed methods to a prevalent cohort medical study. Supplemental materials are available online.
Bibliography:SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-2
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
Jing Qin is Mathematical Statistician, National Institution of Allergy and Infectious Diseases, Bethesda, MD 20817 (jingqin@niaid.nih.gov); Jing Ning is Assistant Professor, Division of Biostatistics, The University of Texas, Health Science Center at Houston, TX 77030 (jing.ning@uth.tmc.edu); Hao Liu is Assistant Professor, Division of Biostatistics, Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, Texas 77030 (haol@bcm.edu); and Yu Shen is Professor, Department of Biostatistics, M.D. Anderson Cancer Center, Houston, TX 77030 (yshen@mdanderson.org).
ISSN:0162-1459
1537-274X
DOI:10.1198/jasa.2011.tm10156