Deep learning to detect acute respiratory distress syndrome on chest radiographs: a retrospective study with external validation
Acute respiratory distress syndrome (ARDS) is a common, but under-recognised, critical illness syndrome associated with high mortality. An important factor in its under-recognition is the variability in chest radiograph interpretation for ARDS. We sought to train a deep convolutional neural network...
Saved in:
Published in | The Lancet. Digital health Vol. 3; no. 6; pp. e340 - e348 |
---|---|
Main Authors | , , , , , , , , , , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
England
Elsevier Ltd
01.06.2021
Elsevier |
Subjects | |
Online Access | Get full text |
ISSN | 2589-7500 2589-7500 |
DOI | 10.1016/S2589-7500(21)00056-X |
Cover
Summary: | Acute respiratory distress syndrome (ARDS) is a common, but under-recognised, critical illness syndrome associated with high mortality. An important factor in its under-recognition is the variability in chest radiograph interpretation for ARDS. We sought to train a deep convolutional neural network (CNN) to detect ARDS findings on chest radiographs.
CNNs were pretrained on 595 506 radiographs from two centres to identify common chest findings (eg, opacity and effusion), and then trained on 8072 radiographs annotated for ARDS by multiple physicians using various transfer learning approaches. The best performing CNN was tested on chest radiographs in an internal and external cohort, including a subset reviewed by six physicians, including a chest radiologist and physicians trained in intensive care medicine. Chest radiograph data were acquired from four US hospitals.
In an internal test set of 1560 chest radiographs from 455 patients with acute hypoxaemic respiratory failure, a CNN could detect ARDS with an area under the receiver operator characteristics curve (AUROC) of 0·92 (95% CI 0·89–0·94). In the subgroup of 413 images reviewed by at least six physicians, its AUROC was 0·93 (95% CI 0·88–0·96), sensitivity 83·0% (95% CI 74·0–91·1), and specificity 88·3% (95% CI 83·1–92·8). Among images with zero of six ARDS annotations (n=155), the median CNN probability was 11%, with six (4%) assigned a probability above 50%. Among images with six of six ARDS annotations (n=27), the median CNN probability was 91%, with two (7%) assigned a probability below 50%. In an external cohort of 958 chest radiographs from 431 patients with sepsis, the AUROC was 0·88 (95% CI 0·85–0·91). When radiographs annotated as equivocal were excluded, the AUROC was 0·93 (0·92–0·95).
A CNN can be trained to achieve expert physician-level performance in ARDS detection on chest radiographs. Further research is needed to evaluate the use of these algorithms to support real-time identification of ARDS patients to ensure fidelity with evidence-based care or to support ongoing ARDS research.
National Institutes of Health, Department of Defense, and Department of Veterans Affairs. |
---|---|
Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Undefined-1 ObjectType-Feature-3 content type line 23 Contributors MWS and CEG had full access to and verified all of the data in the study, take responsibility for the integrity of the data and accuracy of the data analysis, and drafted the Article. MWS, DT, SA, and CEG conceived of and designed the study. All authors acquired, analysed, or interpreted data, and critically revised the Article for important intellectual content. MWS, JM, and CEG did the statistical analysis. MWS, NJM, KRW, and CEG obtained the funding for the study. MWS, NJM, and CEG supervised the study. MWS had the final responsibility for the decision to submit for publication. |
ISSN: | 2589-7500 2589-7500 |
DOI: | 10.1016/S2589-7500(21)00056-X |