Identifying and Validating Prognostic Hyper-Inflammatory and Hypo-Inflammatory COVID-19 Clinical Phenotypes Using Machine Learning Methods
COVID-19 exhibits complex pathophysiological manifestations, characterized by significant clinical and biological heterogeneity. Identifying phenotypes may enhance our understanding of the disease's diverse trajectories, benefiting clinical practice and trials. This study included adult patient...
Saved in:
| Published in | Journal of inflammation research Vol. 18; pp. 3009 - 3024 |
|---|---|
| Main Authors | , , , |
| Format | Journal Article |
| Language | English |
| Published |
New Zealand
Dove Medical Press Limited
01.01.2025
Taylor & Francis Ltd Dove Dove Medical Press |
| Subjects | |
| Online Access | Get full text |
| ISSN | 1178-7031 1178-7031 |
| DOI | 10.2147/JIR.S504028 |
Cover
| Summary: | COVID-19 exhibits complex pathophysiological manifestations, characterized by significant clinical and biological heterogeneity. Identifying phenotypes may enhance our understanding of the disease's diverse trajectories, benefiting clinical practice and trials.
This study included adult patients with COVID-19 from Xinhua Hospital, affiliated with Shanghai Jiao Tong University School of Medicine, between December 15, 2022, and February 15, 2023. The k-prototypes clustering method was employed using 50 clinical variables to identify phenotypes. Machine learning algorithms were then applied to select key classifier variables for phenotype recognition.
A total of 1376 patients met the inclusion criteria. K-prototypes clustering revealed two distinct subphenotypes: Hypo-inflammatory subphenotype (824 [59.9%]) and Hyper-inflammatory subphenotype (552 [40.1%]). Patients in Hypo-inflammatory subphenotype were younger, predominantly female, with low mortality and shorter hospital stays. In contrast, Hyper-inflammatory subphenotype patients were older, predominantly male, exhibiting a hyperinflammatory state with higher mortality and rates of organ dysfunction. The AdaBoost model performed best for subphenotype prediction (Accuracy: 0.975, Precision: 0.968, Recall: 0.976, F1: 0.972, AUROC: 0.975). "CRP", "IL-2R", "D-dimer", "ST2", "BUN", "NT-proBNP", "neutrophil percentage", and "lymphocyte count" were identified as the top-ranked variables in the AdaBoost model.
This analysis identified two phenotypes based on COVID-19 symptoms and comorbidities. These phenotypes can be accurately recognized using machine learning models, with the AdaBoost model being optimal for predicting in-hospital mortality. The variables "CRP", "IL-2R", "D-dimer", "ST2", "BUN", "NT-proBNP", "neutrophil percentage", and "lymphocyte count" play a significant role in the prediction of subphenotypes. Use the identified subphenotypes for risk stratification in clinical practice. Hyper-inflammatory subphenotypes can be closely monitored, and preventive measures such as early admission to the intensive care unit or prophylactic anticoagulation can be taken. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 |
| ISSN: | 1178-7031 1178-7031 |
| DOI: | 10.2147/JIR.S504028 |