Development of a Deep Learning Algorithm for the Histopathologic Diagnosis and Gleason Grading of Prostate Cancer Biopsies: A Pilot Study

The pathologic diagnosis and Gleason grading of prostate cancer are time-consuming, error-prone, and subject to interobserver variability. Machine learning offers opportunities to improve the diagnosis, risk stratification, and prognostication of prostate cancer. To develop a state-of-the-art deep l...

Full description

Saved in:
Bibliographic Details
Published inEuropean urology focus Vol. 7; no. 2; pp. 347 - 351
Main Authors Kott, Ohad, Linsley, Drew, Amin, Ali, Karagounis, Andreas, Jeffers, Carleen, Golijanin, Dragan, Serre, Thomas, Gershman, Boris
Format Journal Article
LanguageEnglish
Published Netherlands Elsevier B.V 01.03.2021
Elsevier
Subjects
Online AccessGet full text
ISSN2405-4569
2405-4569
DOI10.1016/j.euf.2019.11.003

Cover

More Information
Summary:The pathologic diagnosis and Gleason grading of prostate cancer are time-consuming, error-prone, and subject to interobserver variability. Machine learning offers opportunities to improve the diagnosis, risk stratification, and prognostication of prostate cancer. To develop a state-of-the-art deep learning algorithm for the histopathologic diagnosis and Gleason grading of prostate biopsy specimens. A total of 85 prostate core biopsy specimens from 25 patients were digitized at 20× magnification and annotated for Gleason 3, 4, and 5 prostate adenocarcinoma by a urologic pathologist. From these virtual slides, we sampled 14803 image patches of 256×256 pixels, approximately balanced for malignancy. We trained and tested a deep residual convolutional neural network to classify each patch at two levels: (1) coarse (benign vs malignant) and (2) fine (benign vs Gleason 3 vs 4 vs 5). Model performance was evaluated using fivefold cross-validation. Randomization tests were used for hypothesis testing of model performance versus chance. The model demonstrated 91.5% accuracy (p<0.001) at coarse-level classification of image patches as benign versus malignant (0.93 sensitivity, 0.90 specificity, and 0.95 average precision). The model demonstrated 85.4% accuracy (p<0.001) at fine-level classification of image patches as benign versus Gleason 3 versus Gleason 4 versus Gleason 5 (0.83 sensitivity, 0.94 specificity, and 0.83 average precision), with the greatest number of confusions in distinguishing between Gleason 3 and 4, and between Gleason 4 and 5. Limitations include the small sample size and the need for external validation. In this study, a deep learning-based computer vision algorithm demonstrated excellent performance for the histopathologic diagnosis and Gleason grading of prostate cancer. We developed a deep learning algorithm that demonstrated excellent performance for the diagnosis and grading of prostate cancer. In this pilot study, a deep learning-based computer vision algorithm demonstrated excellent accuracy for the histopathologic diagnosis and Gleason grading of prostate cancer. These results are encouraging for the future clinical application of automated histopathologic diagnosis via deep learning.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
PMCID: PMC7242119
Author contributions: Boris Gershman had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Administrative, technical, or material support: None.
Study concept and design: Kott, Linsley, Amin, Karagounis, Jeffers, Golijanin, Serre, Gershman.
Obtaining funding: Kott, Linsley, Golijanin, Serre, Gershman.
Acquisition of data: Kott, Amin, Golijanin, Gershman.
Analysis and interpretation of data: Kott, Linsley, Amin, Karagounis, Jeffers, Golijanin, Serre, Gershman.
Drafting of the manuscript: Kott, Linsley, Amin, Karagounis, Jeffers, Golijanin, Serre, Gershman.
Other: None.
Supervision: Gershman, Serre.
Critical revision of the manuscript for important intellectual content: Kott, Linsley, Amin, Karagounis, Jeffers, Golijanin, Serre, Gershman.
Joint first authors.
Joint senior authors.
Statistical analysis: Linsley, Karagounis, Jeffers, Serre, Gershman.
ISSN:2405-4569
2405-4569
DOI:10.1016/j.euf.2019.11.003