Unveiling AI's role in papilledema diagnosis from fundus images: A systematic review with diagnostic test accuracy meta-analysis and comparison of human expert performance

Papilledema is a condition, which is characterized by optic disc swelling due to increased intracranial pressure. Diagnostic modalities include fundus camera and other ophthalmology imaging techniques. The Frisén scale is used to grade the severity of this condition. In this paper, we investigate th...

Full description

Saved in:

Bibliographic Details
Published in	Computers in biology and medicine Vol. 184; p. 109350
Main Authors	Łajczak, Paweł Marek, Sirek, Sebastian, Wyględowska-Promieńska, Dorota
Format	Journal Article
Language	English
Published	United States Elsevier Ltd 01.01.2025 Elsevier Limited
Subjects	Accuracy Algorithms Artificial Intelligence Bias Bibliographic literature Blood Blood vessels Deep Learning Diagnostic tests Fundus imaging Fundus Oculi Heterogeneity Human performance Humans Image Interpretation, Computer-Assisted - methods Imaging techniques Internal Medicine Intracranial pressure Machine learning Medical diagnosis Medical imaging Meta-analysis Observational learning Ophthalmology Optic nerve Other Papilledema Papilledema - diagnosis Papilledema - diagnostic imaging Sensitivity analysis Software Swelling Systematic review Deep learning Papilledema Artificial intelligence Fundus imaging Machine learning
Online Access	Get full text
ISSN	0010-4825 1879-0534 1879-0534
DOI	10.1016/j.compbiomed.2024.109350

Cover

More Information
Summary:	Papilledema is a condition, which is characterized by optic disc swelling due to increased intracranial pressure. Diagnostic modalities include fundus camera and other ophthalmology imaging techniques. The Frisén scale is used to grade the severity of this condition. In this paper, we investigate the application of artificial intelligence (AI) for detecting and grading papilledema from fundus images. Following the PRISMA guidelines for systematic reviews, a search of five databases (PubMed, Scopus, Web of Science, Embase, Cochrane) was conducted using MeSH terms related to AI and papilledema. The inclusion criteria were original articles that discussed AI applications for detecting or grading papilledema from fundus images. Extracted data included sensitivity, specificity, accuracy, and technical and demographic characteristics. The systematic review included 21 studies. In the meta-analysis, the pooled sensitivity and specificity were 0.97 and 0.98, respectively. High heterogeneity was observed (I2 > 96%). Deep learning models outperformed traditional machine learning algorithms, with detection models being more effective than grading models. Publication bias was observed with Deek's plot. Several publications compared AI to human experts, showing superiority or non-inferiority of computer algorithms to humans. AI models show high diagnostic accuracy in detecting papilledema, often surpassing human experts in sensitivity, though not always in specificity. Despite limitations related to patient selection, image sourcing, and heterogeneity, AI holds potential to significantly improve diagnostic accuracy and clinical workflows in ophthalmology. •AI models achieved 97 % sensitivity and 98 % specificity in papilledema detection.•Deep learning outperforms traditional machine learning in diagnostic accuracy.•Detection models surpass grading models in AI performance for papilledema.•AI systems show potential to outperform human experts in papilledema diagnosis.
Bibliography:	ObjectType-Article-2 SourceType-Scholarly Journals-1 content type line 14 ObjectType-Feature-3 ObjectType-Evidence Based Healthcare-1 ObjectType-Article-1 ObjectType-Feature-2 ObjectType-Review-4 content type line 23 ObjectType-Undefined-3
ISSN:	0010-4825 1879-0534 1879-0534
DOI:	10.1016/j.compbiomed.2024.109350