A novel approach to perform linear discriminant analyses for a 4-way alzheimer’s disease diagnosis based on an integration of pearson’s correlation coefficients and empirical cumulative distribution function
Diagnosing Alzheimer’s disease (AD) remains a significant challenge, particularly in effectively identifying individuals in the early (EMCI) and late (LMCI) stages of Mild Cognitive Impairment (MCI) within the normal control subjects (CN). Leveraging the Alzheimer’s Disease Neuroimaging Initiative (...
Saved in:
| Published in | Multimedia tools and applications Vol. 83; no. 31; pp. 76687 - 76703 |
|---|---|
| Main Authors | , , , , |
| Format | Journal Article |
| Language | English |
| Published |
New York
Springer US
01.09.2024
Springer Nature B.V |
| Subjects | |
| Online Access | Get full text |
| ISSN | 1573-7721 1380-7501 1573-7721 |
| DOI | 10.1007/s11042-024-18532-1 |
Cover
| Summary: | Diagnosing Alzheimer’s disease (AD) remains a significant challenge, particularly in effectively identifying individuals in the early (EMCI) and late (LMCI) stages of Mild Cognitive Impairment (MCI) within the normal control subjects (CN). Leveraging the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database and relevant datasets, our aim is to establish a 4-way framework for multi-class diagnosis. Linear Discriminant Analysis (LDA), often coupled with Principal Component Analysis (PCA), has conventionally served as a method for supervised classification. However, this paper introduces an alternative approach using Pearson’s correlation coefficient (PCC) instead of PCA. We integrate the optimal LDA subspace with the PCC method, primarily to address the singularity issue that arises when dealing with an underdetermined dataset. Our methodology comprises three main steps. Firstly, we engage in the preprocessing of 237 Diffusion Tensor and Magnetic Resonance brain images to map brain connectivity and extract connections within and between hemispheres. Secondly, we calculate correlation coefficients between features and classes, subsequently constructing empirical cumulative distribution functions (ECDF). Features exceeding a predetermined percentile in the ECDF, guaranteeing the non-singularity of the within-class variance matrix, are subsequently chosen and assessed using a primary classifier. The top k features, linked to the highest classification accuracy, are then mapped into the LDA space through 100 iterations of five-fold Cross-Validation. Following each trial, we assess the performance of six machine learning algorithms, selecting the Logistic Regression classifier to gauge the reliability of our proposed method. As a result, we observed a significant improvement in average accuracy, achieving a performance of 65.46% ± 1.94%, compared to the commonly used PCA+LDA approach, which achieved 50.71% ± 2.1%. Notably, our work achieved 100% accuracy in diagnosing the LMCI class, surpassing other methods. Furthermore, in a separate experiment conducted within and between hemispheres datasets, we identified connectivity between hemispheres as a pivotal biomarker for disease diagnosis in a medical context. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 1573-7721 1380-7501 1573-7721 |
| DOI: | 10.1007/s11042-024-18532-1 |