Longitudinal clinical score prediction in Alzheimer's disease with soft-split sparse regression based random forest

Alzheimer's disease (AD) is an irreversible neurodegenerative disease and affects a large population in the world. Cognitive scores at multiple time points can be reliably used to evaluate the progression of the disease clinically. In recent studies, machine learning techniques have shown promi...

Full description

Saved in:
Bibliographic Details
Published inNeurobiology of aging Vol. 46; pp. 180 - 191
Main Authors Huang, Lei, Jin, Yan, Gao, Yaozong, Thung, Kim-Han, Shen, Dinggang
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 01.10.2016
Subjects
Online AccessGet full text
ISSN0197-4580
1558-1497
1558-1497
DOI10.1016/j.neurobiolaging.2016.07.005

Cover

More Information
Summary:Alzheimer's disease (AD) is an irreversible neurodegenerative disease and affects a large population in the world. Cognitive scores at multiple time points can be reliably used to evaluate the progression of the disease clinically. In recent studies, machine learning techniques have shown promising results on the prediction of AD clinical scores. However, there are multiple limitations in the current models such as linearity assumption and missing data exclusion. Here, we present a nonlinear supervised sparse regression–based random forest (RF) framework to predict a variety of longitudinal AD clinical scores. Furthermore, we propose a soft-split technique to assign probabilistic paths to a test sample in RF for more accurate predictions. In order to benefit from the longitudinal scores in the study, unlike the previous studies that often removed the subjects with missing scores, we first estimate those missing scores with our proposed soft-split sparse regression–based RF and then utilize those estimated longitudinal scores at all the previous time points to predict the scores at the next time point. The experiment results demonstrate that our proposed method is superior to the traditional RF and outperforms other state-of-art regression models. Our method can also be extended to be a general regression framework to predict other disease scores.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
These authors contributed equally to the work.
ISSN:0197-4580
1558-1497
1558-1497
DOI:10.1016/j.neurobiolaging.2016.07.005