Clinical actionability of triaging DNA mismatch repair deficient colorectal cancer from biopsy samples using deep learning

We aimed to develop a deep learning (DL) model to predict DNA mismatch repair (MMR) status in colorectal cancers (CRC) based on hematoxylin and eosin-stained whole-slide images (WSIs) and assess its clinical applicability. The DL model was developed and validated through three-fold cross validation...

Full description

Saved in:
Bibliographic Details
Published inEBioMedicine Vol. 81; p. 104120
Main Authors Jiang, Wu, Mei, Wei-Jian, Xu, Shuo-Yu, Ling, Yi-Hong, Li, Wei-Rong, Kuang, Jin-Bo, Li, Hao-Sen, Hui, Hui, Li, Ji-Bin, Cai, Mu-Yan, Pan, Zhi-Zhong, Zhang, Hui-Zhong, Li, Li, Ding, Pei-Rong
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.07.2022
Elsevier
Subjects
Online AccessGet full text
ISSN2352-3964
2352-3964
DOI10.1016/j.ebiom.2022.104120

Cover

More Information
Summary:We aimed to develop a deep learning (DL) model to predict DNA mismatch repair (MMR) status in colorectal cancers (CRC) based on hematoxylin and eosin-stained whole-slide images (WSIs) and assess its clinical applicability. The DL model was developed and validated through three-fold cross validation using 441 WSIs from the Cancer Genome Atlas (TCGA) and externally validated using 78 WSIs from the Pathology AI Platform (PAIP), and 355 WSIs from surgical specimens and 341 WSIs from biopsy specimens of the Sun Yet-sun University Cancer Center (SYSUCC). Domain adaption and multiple instance learning (MIL) techniques were adopted for model development. The performance of the models was evaluated using the area under the receiver operating characteristic curve (AUROC). A dual-threshold strategy was also built from the surgical cohorts and validated in the biopsy cohort. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), F1-score, and the percentage of patients avoiding IHC testing were evaluated. The MIL model achieved an AUROC of 0·8888±0·0357 in the TCGA-validation cohort, 0·8806±0·0232 in the PAIP cohort, 0·8457±0·0233 in the SYSUCC-surgical cohort, and 0·7679±0·0342 in the SYSUCC-biopsy cohort. A dual-threshold triage strategy was used to rule-in and rule-out dMMR patients with remaining uncertain patients recommended for further IHC testing, which kept sensitivity higher than 90% and specificity higher than 95% on deficient MMR patient triage from both the surgical and biopsy specimens, result in more than half of patients avoiding IHC based MMR testing. A DL-based method that could directly predict CRC MMR status from WSIs was successfully developed, and a dual-threshold triage strategy was established to minimize the number of patients for further IHC testing. The study was funded by the National Natural Science Foundation of China (82073159, 81871971 and 81700576), the Natural Science Foundation of Guangdong Province (No. 2021A1515011792 and No.2022A1515012403) and Medical Scientific Research Foundation of Guangdong Province of China (No. A2020392).
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Wu Jiang, Wei-Jian Mei, Shuo-Yu Xu and Yi-Hong Ling contributed equally to this work.
ISSN:2352-3964
2352-3964
DOI:10.1016/j.ebiom.2022.104120