Analysis of Machine Learning Techniques for Heart Failure Readmissions

The current ability to predict readmissions in patients with heart failure is modest at best. It is unclear whether machine learning techniques that address higher dimensional, nonlinear relationships among variables would enhance prediction. We sought to compare the effectiveness of several machine...

Full description

Saved in:

Bibliographic Details
Published in	Circulation Cardiovascular quality and outcomes Vol. 9; no. 6; pp. 629 - 640
Main Authors	Mortazavi, Bobak J., Downing, Nicholas S., Bucholz, Emily M., Dharmarajan, Kumar, Manhapra, Ajay, Li, Shu-Xia, Negahban, Sahand N., Krumholz, Harlan M.
Format	Journal Article
Language	English
Published	United States 01.11.2016
Subjects	Aged Algorithms Data Mining - methods Databases, Factual Female Heart Failure - diagnosis Heart Failure - therapy Humans Logistic Models Male Middle Aged Nonlinear Dynamics Patient Readmission Randomized Controlled Trials as Topic Reproducibility of Results Risk Assessment Risk Factors Support Vector Machine Telemedicine Time Factors meta-analysis heart failure patient readmission machine learning computers
Online Access	Get full text
ISSN	1941-7713 1941-7705 1941-7705
DOI	10.1161/CIRCOUTCOMES.116.003039

Cover

More Information
Summary:	The current ability to predict readmissions in patients with heart failure is modest at best. It is unclear whether machine learning techniques that address higher dimensional, nonlinear relationships among variables would enhance prediction. We sought to compare the effectiveness of several machine learning algorithms for predicting readmissions. Using data from the Telemonitoring to Improve Heart Failure Outcomes trial, we compared the effectiveness of random forests, boosting, random forests combined hierarchically with support vector machines or logistic regression (LR), and Poisson regression against traditional LR to predict 30- and 180-day all-cause readmissions and readmissions because of heart failure. We randomly selected 50% of patients for a derivation set, and a validation set comprised the remaining patients, validated using 100 bootstrapped iterations. We compared C statistics for discrimination and distributions of observed outcomes in risk deciles for predictive range. In 30-day all-cause readmission prediction, the best performing machine learning model, random forests, provided a 17.8% improvement over LR (mean C statistics, 0.628 and 0.533, respectively). For readmissions because of heart failure, boosting improved the C statistic by 24.9% over LR (mean C statistic 0.678 and 0.543, respectively). For 30-day all-cause readmission, the observed readmission rates in the lowest and highest deciles of predicted risk with random forests (7.8% and 26.2%, respectively) showed a much wider separation than LR (14.2% and 16.4%, respectively). Machine learning methods improved the prediction of readmission after hospitalization for heart failure compared with LR and provided the greatest predictive range in observed readmission rates.
Bibliography:	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 Authors Negahbhan and Krumholz contributed equally as senior authors. Dr. Bucholz's current affiliation: Boston Children's Hospital, Boston, MA Dr. Downing's current affiliation: Brigham and Women's Hospital, Boston, MA
ISSN:	1941-7713 1941-7705 1941-7705
DOI:	10.1161/CIRCOUTCOMES.116.003039