Performance of radiomics analysis in ultrasound imaging for differentiating benign from malignant adnexal masses: A systematic review and meta‐analysis

Introduction We present the state of the art of ultrasound‐based machine learning (ML) radiomics models in the context of ovarian masses and analyze their accuracy in differentiating between benign and malignant adnexal masses. Material and Methods Web of Science, PubMed, and Scopus databases were s...

Full description

Saved in:
Bibliographic Details
Published inActa obstetricia et gynecologica Scandinavica Vol. 104; no. 8; pp. 1433 - 1442
Main Authors Moro, Francesca, Ciancia, Marianna, Sciuto, Maria, Baldassari, Giulia, Tran, Huong Elena, Carcagnì, Antonella, Fagotti, Anna, Testa, Antonia Carla
Format Journal Article
LanguageEnglish
Published United States John Wiley & Sons, Inc 01.08.2025
John Wiley and Sons Inc
Wiley
Subjects
Online AccessGet full text
ISSN0001-6349
1600-0412
1600-0412
DOI10.1111/aogs.15146

Cover

More Information
Summary:Introduction We present the state of the art of ultrasound‐based machine learning (ML) radiomics models in the context of ovarian masses and analyze their accuracy in differentiating between benign and malignant adnexal masses. Material and Methods Web of Science, PubMed, and Scopus databases were searched. All studies were imported into RAYYAN QCRI software. All studies that developed and internally or externally validated ML models using only radiomics features extracted from ultrasound images were included. The overall quality of the included studies was assessed using the QUADAS‐AI tool. Summary sensitivity and specificity analyses with corresponding 95% confidence intervals (CIs) were reported. Results 12 studies developed ML models including only radiomics features extracted from ultrasound images, and six of them were included in the meta‐analysis. The overall sensitivity and specificity for differentiating benign from malignant adnexal masses were 0.80 (95% CI 0.74–0.87) and 0.86 (95% CI 0.80–0.90), respectively, in the validation set. All studies demonstrated a high risk of bias in subject selection (e.g., lack of details on image sources or scanner models; absence of image preprocessing), and the majority also showed a high risk in the index test (e.g., models were not validated on external datasets) domain. In contrast, the risk of bias was generally low for the reference standard (i.e., most studies used a reference that accurately identified the target condition) and the testing workflow (i.e., the time interval between the index test and reference standard was appropriate) domains. Conclusions The good performance of ultrasound‐based radiomics models in the validation set supports that radiomics is worth exploring to improve the diagnosis of adnexal masses. So far, the studies have a high risk of bias due to the small sample size, single‐setting design, and no external validation included. Currently, the IOTA‐ADNEX model and the O‐RADS are the most reliable methods for calculating the risk of malignancy in adnexal masses. However, some skill is required to accurately recognize the ultrasound variables included in these methods. In recent years, several authors have demonstrated that radiomic analysis, which does not rely on expert interpretation or the recognition of ultrasound variables, performs almost as well as the methods currently in use in discriminating between benign and malignant adnexal masses. With further development and validation, it has the potential to be integrated into ultrasound imaging systems, enhancing clinical decision‐making and potentially evolving into a more autonomous diagnostic tool in the future.
Bibliography:Francesca Moro and Marianna Ciancia contributed equally to the work.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Review-3
content type line 23
ISSN:0001-6349
1600-0412
1600-0412
DOI:10.1111/aogs.15146