Texture feature ranking with relevance learning to classify interstitial lung disease patterns

The generalized matrix learning vector quantization (GMLVQ) is used to estimate the relevance of texture features in their ability to classify interstitial lung disease patterns in high-resolution computed tomography images. After a stochastic gradient descent, the GMLVQ algorithm provides a discrim...

Full description

Saved in:
Bibliographic Details
Published inArtificial intelligence in medicine Vol. 56; no. 2; pp. 91 - 97
Main Authors Huber, Markus B., Bunte, Kerstin, Nagarajan, Mahesh B., Biehl, Michael, Ray, Lawrence A., Wismüller, Axel
Format Journal Article
LanguageEnglish
Published Netherlands Elsevier B.V 01.10.2012
Subjects
Online AccessGet full text
ISSN0933-3657
1873-2860
1873-2860
DOI10.1016/j.artmed.2012.07.001

Cover

More Information
Summary:The generalized matrix learning vector quantization (GMLVQ) is used to estimate the relevance of texture features in their ability to classify interstitial lung disease patterns in high-resolution computed tomography images. After a stochastic gradient descent, the GMLVQ algorithm provides a discriminative distance measure of relevance factors, which can account for pairwise correlations between different texture features and their importance for the classification of healthy and diseased patterns. 65 texture features were extracted from gray-level co-occurrence matrices (GLCMs). These features were ranked and selected according to their relevance obtained by GMLVQ and, for comparison, to a mutual information (MI) criteria. The classification performance for different feature subsets was calculated for a k-nearest-neighbor (kNN) and a random forests classifier (RanForest), and support vector machines with a linear and a radial basis function kernel (SVMlin and SVMrbf). For all classifiers, feature sets selected by the relevance ranking assessed by GMLVQ had a significantly better classification performance (p<0.05) for many texture feature sets compared to the MI approach. For kNN, RanForest, and SVMrbf, some of these feature subsets had a significantly better classification performance when compared to the set consisting of all features (p<0.05). While this approach estimates the relevance of single features, future considerations of GMLVQ should include the pairwise correlation for the feature ranking, e.g. to reduce the redundancy of two equally relevant features.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0933-3657
1873-2860
1873-2860
DOI:10.1016/j.artmed.2012.07.001