Identifying and Validating Prognostic Hyper-Inflammatory and Hypo-Inflammatory COVID-19 Clinical Phenotypes Using Machine Learning Methods

COVID-19 exhibits complex pathophysiological manifestations, characterized by significant clinical and biological heterogeneity. Identifying phenotypes may enhance our understanding of the disease's diverse trajectories, benefiting clinical practice and trials. This study included adult patient...

Full description

Saved in:
Bibliographic Details
Published inJournal of inflammation research Vol. 18; pp. 3009 - 3024
Main Authors Ji, Xiaojing, Guo, Yiran, Tang, Lujia, Gao, Chengjin
Format Journal Article
LanguageEnglish
Published New Zealand Dove Medical Press Limited 01.01.2025
Taylor & Francis Ltd
Dove
Dove Medical Press
Subjects
Online AccessGet full text
ISSN1178-7031
1178-7031
DOI10.2147/JIR.S504028

Cover

More Information
Summary:COVID-19 exhibits complex pathophysiological manifestations, characterized by significant clinical and biological heterogeneity. Identifying phenotypes may enhance our understanding of the disease's diverse trajectories, benefiting clinical practice and trials. This study included adult patients with COVID-19 from Xinhua Hospital, affiliated with Shanghai Jiao Tong University School of Medicine, between December 15, 2022, and February 15, 2023. The k-prototypes clustering method was employed using 50 clinical variables to identify phenotypes. Machine learning algorithms were then applied to select key classifier variables for phenotype recognition. A total of 1376 patients met the inclusion criteria. K-prototypes clustering revealed two distinct subphenotypes: Hypo-inflammatory subphenotype (824 [59.9%]) and Hyper-inflammatory subphenotype (552 [40.1%]). Patients in Hypo-inflammatory subphenotype were younger, predominantly female, with low mortality and shorter hospital stays. In contrast, Hyper-inflammatory subphenotype patients were older, predominantly male, exhibiting a hyperinflammatory state with higher mortality and rates of organ dysfunction. The AdaBoost model performed best for subphenotype prediction (Accuracy: 0.975, Precision: 0.968, Recall: 0.976, F1: 0.972, AUROC: 0.975). "CRP", "IL-2R", "D-dimer", "ST2", "BUN", "NT-proBNP", "neutrophil percentage", and "lymphocyte count" were identified as the top-ranked variables in the AdaBoost model. This analysis identified two phenotypes based on COVID-19 symptoms and comorbidities. These phenotypes can be accurately recognized using machine learning models, with the AdaBoost model being optimal for predicting in-hospital mortality. The variables "CRP", "IL-2R", "D-dimer", "ST2", "BUN", "NT-proBNP", "neutrophil percentage", and "lymphocyte count" play a significant role in the prediction of subphenotypes. Use the identified subphenotypes for risk stratification in clinical practice. Hyper-inflammatory subphenotypes can be closely monitored, and preventive measures such as early admission to the intensive care unit or prophylactic anticoagulation can be taken.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1178-7031
1178-7031
DOI:10.2147/JIR.S504028