Texture feature ranking with relevance learning to classify interstitial lung disease patterns

The generalized matrix learning vector quantization (GMLVQ) is used to estimate the relevance of texture features in their ability to classify interstitial lung disease patterns in high-resolution computed tomography images. After a stochastic gradient descent, the GMLVQ algorithm provides a discrim...

Full description

Saved in:

Bibliographic Details
Published in	Artificial intelligence in medicine Vol. 56; no. 2; pp. 91 - 97
Main Authors	Huber, Markus B., Bunte, Kerstin, Nagarajan, Mahesh B., Biehl, Michael, Ray, Lawrence A., Wismüller, Axel
Format	Journal Article
Language	English
Published	Netherlands Elsevier B.V 01.10.2012
Subjects	Algorithms Cluster Analysis Feature selection High-resolution computed tomography of the chest Humans Internal Medicine Interstitial lung disease patterns Lung Diseases, Interstitial - classification Lung Diseases, Interstitial - diagnosis Other Relevance learning Supervised learning Support Vector Machine Texture analysis Tomography, X-Ray Computed - methods High-resolution computed tomography of the chest Relevance learning Feature selection Texture analysis Supervised learning Interstitial lung disease patterns
Online Access	Get full text
ISSN	0933-3657 1873-2860 1873-2860
DOI	10.1016/j.artmed.2012.07.001

Cover

More Information
Summary:	The generalized matrix learning vector quantization (GMLVQ) is used to estimate the relevance of texture features in their ability to classify interstitial lung disease patterns in high-resolution computed tomography images. After a stochastic gradient descent, the GMLVQ algorithm provides a discriminative distance measure of relevance factors, which can account for pairwise correlations between different texture features and their importance for the classification of healthy and diseased patterns. 65 texture features were extracted from gray-level co-occurrence matrices (GLCMs). These features were ranked and selected according to their relevance obtained by GMLVQ and, for comparison, to a mutual information (MI) criteria. The classification performance for different feature subsets was calculated for a k-nearest-neighbor (kNN) and a random forests classifier (RanForest), and support vector machines with a linear and a radial basis function kernel (SVMlin and SVMrbf). For all classifiers, feature sets selected by the relevance ranking assessed by GMLVQ had a significantly better classification performance (p<0.05) for many texture feature sets compared to the MI approach. For kNN, RanForest, and SVMrbf, some of these feature subsets had a significantly better classification performance when compared to the set consisting of all features (p<0.05). While this approach estimates the relevance of single features, future considerations of GMLVQ should include the pairwise correlation for the feature ranking, e.g. to reduce the redundancy of two equally relevant features.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	0933-3657 1873-2860 1873-2860
DOI:	10.1016/j.artmed.2012.07.001