Screening for extranodal extension in HPV-associated oropharyngeal carcinoma: evaluation of a CT-based deep learning algorithm in patient data from a multicentre, randomised de-escalation trial
Pretreatment identification of pathological extranodal extension (ENE) would guide therapy de-escalation strategies for in human papillomavirus (HPV)-associated oropharyngeal carcinoma but is diagnostically challenging. ECOG-ACRIN Cancer Research Group E3311 was a multicentre trial wherein patients...
Saved in:
| Published in | The Lancet. Digital health Vol. 5; no. 6; pp. e360 - e369 |
|---|---|
| Main Authors | , , , , , , , , , , , , , , , , , , , , |
| Format | Journal Article |
| Language | English |
| Published |
England
Elsevier Ltd
01.06.2023
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 2589-7500 2589-7500 |
| DOI | 10.1016/S2589-7500(23)00046-8 |
Cover
| Summary: | Pretreatment identification of pathological extranodal extension (ENE) would guide therapy de-escalation strategies for in human papillomavirus (HPV)-associated oropharyngeal carcinoma but is diagnostically challenging. ECOG-ACRIN Cancer Research Group E3311 was a multicentre trial wherein patients with HPV-associated oropharyngeal carcinoma were treated surgically and assigned to a pathological risk-based adjuvant strategy of observation, radiation, or concurrent chemoradiation. Despite protocol exclusion of patients with overt radiographic ENE, more than 30% had pathological ENE and required postoperative chemoradiation. We aimed to evaluate a CT-based deep learning algorithm for prediction of ENE in E3311, a diagnostically challenging cohort wherein algorithm use would be impactful in guiding decision-making.
For this retrospective evaluation of deep learning algorithm performance, we obtained pretreatment CTs and corresponding surgical pathology reports from the multicentre, randomised de-escalation trial E3311. All enrolled patients on E3311 required pretreatment and diagnostic head and neck imaging; patients with radiographically overt ENE were excluded per study protocol. The lymph node with largest short-axis diameter and up to two additional nodes were segmented on each scan and annotated for ENE per pathology reports. Deep learning algorithm performance for ENE prediction was compared with four board-certified head and neck radiologists. The primary endpoint was the area under the curve (AUC) of the receiver operating characteristic.
From 178 collected scans, 313 nodes were annotated: 71 (23%) with ENE in general, 39 (13%) with ENE larger than 1 mm ENE. The deep learning algorithm AUC for ENE classification was 0·86 (95% CI 0·82–0·90), outperforming all readers (p<0·0001 for each). Among radiologists, there was high variability in specificity (43–86%) and sensitivity (45–96%) with poor inter-reader agreement (κ 0·32). Matching the algorithm specificity to that of the reader with highest AUC (R2, false positive rate 22%) yielded improved sensitivity to 75% (+ 13%). Setting the algorithm false positive rate to 30% yielded 90% sensitivity. The algorithm showed improved performance compared with radiologists for ENE larger than 1 mm (p<0·0001) and in nodes with short-axis diameter 1 cm or larger.
The deep learning algorithm outperformed experts in predicting pathological ENE on a challenging cohort of patients with HPV-associated oropharyngeal carcinoma from a randomised clinical trial. Deep learning algorithms should be evaluated prospectively as a treatment selection tool.
ECOG-ACRIN Cancer Research Group and the National Cancer Institute of the US National Institutes of Health. |
|---|---|
| Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Undefined-1 ObjectType-Feature-3 content type line 23 BHK, BAB, and RLF conceptualised and designed the study. BHK, JPG, SP, HRK, and AFJ collected and curated the data. BHK, JPG, SP, HRK, AFJ, SA, JL, ZY, and DB did the investigation. BHK, JL, and ZY designed the methodology, did the formal analysis, made the figures, interpreted the data, and wrote the manuscript. BHK, JL, ZY, SA, RB, HRK, AFJ, SP, JPG, HJWLA, JPG, YF, RU, RH, RBT, DNM, JDS, RMS, BAB, RLF, and DB reviewed and edited the manuscript. BHK, BAB, and RLF were responsible for project administration. BHK, BAB, and RLF obtained resources and supervised the study. BHK and JL have directly accessed and verified the underlying data reported in the manuscript. All authors had access to the raw datasets and were responsible for the decision to submit for publication. Contributors |
| ISSN: | 2589-7500 2589-7500 |
| DOI: | 10.1016/S2589-7500(23)00046-8 |