BiMM forest: A random forest method for modeling clustered and longitudinal binary outcomes

Clustered binary outcomes and datasets with many predictor variables are frequently encountered in clinical research (e.g. longitudinal studies). Generalized linear mixed models (GLMMs) typically employed for clustered endpoints have challenges for some scenarios, particularly for complex datasets w...

Full description

Saved in:
Bibliographic Details
Published inChemometrics and intelligent laboratory systems Vol. 185; pp. 122 - 134
Main Authors Speiser, Jaime Lynn, Wolf, Bethany J., Chung, Dongjun, Karvellas, Constantine J., Koch, David G., Durkalski, Valerie L.
Format Journal Article
LanguageEnglish
Published Netherlands Elsevier B.V 15.02.2019
Subjects
Online AccessGet full text
ISSN0169-7439
1873-3239
DOI10.1016/j.chemolab.2019.01.002

Cover

More Information
Summary:Clustered binary outcomes and datasets with many predictor variables are frequently encountered in clinical research (e.g. longitudinal studies). Generalized linear mixed models (GLMMs) typically employed for clustered endpoints have challenges for some scenarios, particularly for complex datasets which contain many interactions among predictors and nonlinear predictors of outcome. We propose a new method called Binary Mixed Model (BiMM) forest, which combines random forest and GLMM methodology. BiMM forest offers a flexible and stable method which naturally models interactions among predictors and can be employed in the setting of clustered data. Simulation studies show that BiMM forest achieves similar or superior prediction accuracy compared to standard random forest, GLMMs and its tree counterpart (BiMM tree) for clustered binary outcomes. The method is applied to a real dataset from the Acute Liver Failure Study Group. BiMM forest offers an alternative method for modeling clustered binary outcomes which may be applied in myriad research settings. •There is no random forest method available for clustered binary outcomes.•We propose a random forest method to accommodate clustered and longitudinal outcomes called Binary Mixed Model (BiMM) forest.•Accuracy of BiMM forest is slightly higher or comparable to existing methods for predicting outcome of new subjects.•Accuracy of BiMM forest is often higher for predicting outcome of new training data observations compared to other methods.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0169-7439
1873-3239
DOI:10.1016/j.chemolab.2019.01.002