Ensemble-based hybrid probabilistic sampling for imbalanced data learning in lung nodule CAD

Classification plays a critical role in false positive reduction (FPR) in lung nodule computer aided detection (CAD). The difficulty of FPR lies in the variation of the appearances of the nodules, and the imbalance distribution between the nodule and non-nodule class. Moreover, the presence of inher...

Full description

Saved in:
Bibliographic Details
Published inComputerized medical imaging and graphics Vol. 38; no. 3; pp. 137 - 150
Main Authors Cao, Peng, Yang, Jinzhu, Li, Wei, Zhao, Dazhe, Zaiane, Osmar
Format Journal Article
LanguageEnglish
Published New York, NY Elsevier Ltd 01.04.2014
Elsevier
Subjects
Online AccessGet full text
ISSN0895-6111
1879-0771
1879-0771
DOI10.1016/j.compmedimag.2013.12.003

Cover

More Information
Summary:Classification plays a critical role in false positive reduction (FPR) in lung nodule computer aided detection (CAD). The difficulty of FPR lies in the variation of the appearances of the nodules, and the imbalance distribution between the nodule and non-nodule class. Moreover, the presence of inherent complex structures in data distribution, such as within-class imbalance and high-dimensionality are other critical factors of decreasing classification performance. To solve these challenges, we proposed a hybrid probabilistic sampling combined with diverse random subspace ensemble. Experimental results demonstrate the effectiveness of the proposed method in terms of geometric mean (G-mean) and area under the ROC curve (AUC) compared with commonly used methods.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0895-6111
1879-0771
1879-0771
DOI:10.1016/j.compmedimag.2013.12.003