Hybrid feature selection algorithm using symmetrical uncertainty and a harmony search algorithm

Microarray technology can be used as an efficient diagnostic system to recognise diseases such as tumours or to discriminate between different types of cancers in normal tissues. This technology has received increasing attention from the bioinformatics community because of its potential in designing...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of systems science Vol. 47; no. 6; pp. 1312 - 1329
Main Authors Shreem, Salam Salameh, Abdullah, Salwani, Nazri, Mohd Zakree Ahmad
Format Journal Article
LanguageEnglish
Published Taylor & Francis 25.04.2016
Subjects
Online AccessGet full text
ISSN0020-7721
1464-5319
DOI10.1080/00207721.2014.924600

Cover

More Information
Summary:Microarray technology can be used as an efficient diagnostic system to recognise diseases such as tumours or to discriminate between different types of cancers in normal tissues. This technology has received increasing attention from the bioinformatics community because of its potential in designing powerful decision-making tools for cancer diagnosis. However, the presence of thousands or tens of thousands of genes affects the predictive accuracy of this technology from the perspective of classification. Thus, a key issue in microarray data is identifying or selecting the smallest possible set of genes from the input data that can achieve good predictive accuracy for classification. In this work, we propose a two-stage selection algorithm for gene selection problems in microarray data-sets called the symmetrical uncertainty filter and harmony search algorithm wrapper (SU-HSA). Experimental results show that the SU-HSA is better than HSA in isolation for all data-sets in terms of the accuracy and achieves a lower number of genes on 6 out of 10 instances. Furthermore, the comparison with state-of-the-art methods shows that our proposed approach is able to obtain 5 (out of 10) new best results in terms of the number of selected genes and competitive results in terms of the classification accuracy.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0020-7721
1464-5319
DOI:10.1080/00207721.2014.924600