Unveiling AI's role in papilledema diagnosis from fundus images: A systematic review with diagnostic test accuracy meta-analysis and comparison of human expert performance

Papilledema is a condition, which is characterized by optic disc swelling due to increased intracranial pressure. Diagnostic modalities include fundus camera and other ophthalmology imaging techniques. The Frisén scale is used to grade the severity of this condition. In this paper, we investigate th...

Full description

Saved in:
Bibliographic Details
Published inComputers in biology and medicine Vol. 184; p. 109350
Main Authors Łajczak, Paweł Marek, Sirek, Sebastian, Wyględowska-Promieńska, Dorota
Format Journal Article
LanguageEnglish
Published United States Elsevier Ltd 01.01.2025
Elsevier Limited
Subjects
Online AccessGet full text
ISSN0010-4825
1879-0534
1879-0534
DOI10.1016/j.compbiomed.2024.109350

Cover

More Information
Summary:Papilledema is a condition, which is characterized by optic disc swelling due to increased intracranial pressure. Diagnostic modalities include fundus camera and other ophthalmology imaging techniques. The Frisén scale is used to grade the severity of this condition. In this paper, we investigate the application of artificial intelligence (AI) for detecting and grading papilledema from fundus images. Following the PRISMA guidelines for systematic reviews, a search of five databases (PubMed, Scopus, Web of Science, Embase, Cochrane) was conducted using MeSH terms related to AI and papilledema. The inclusion criteria were original articles that discussed AI applications for detecting or grading papilledema from fundus images. Extracted data included sensitivity, specificity, accuracy, and technical and demographic characteristics. The systematic review included 21 studies. In the meta-analysis, the pooled sensitivity and specificity were 0.97 and 0.98, respectively. High heterogeneity was observed (I2 > 96%). Deep learning models outperformed traditional machine learning algorithms, with detection models being more effective than grading models. Publication bias was observed with Deek's plot. Several publications compared AI to human experts, showing superiority or non-inferiority of computer algorithms to humans. AI models show high diagnostic accuracy in detecting papilledema, often surpassing human experts in sensitivity, though not always in specificity. Despite limitations related to patient selection, image sourcing, and heterogeneity, AI holds potential to significantly improve diagnostic accuracy and clinical workflows in ophthalmology. •AI models achieved 97 % sensitivity and 98 % specificity in papilledema detection.•Deep learning outperforms traditional machine learning in diagnostic accuracy.•Detection models surpass grading models in AI performance for papilledema.•AI systems show potential to outperform human experts in papilledema diagnosis.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
content type line 14
ObjectType-Feature-3
ObjectType-Evidence Based Healthcare-1
ObjectType-Article-1
ObjectType-Feature-2
ObjectType-Review-4
content type line 23
ObjectType-Undefined-3
ISSN:0010-4825
1879-0534
1879-0534
DOI:10.1016/j.compbiomed.2024.109350