Gene selection and classification of microarray data method based on mutual information and moth flame algorithm

•The first work to apply the moth flame optimization algorithm to gene selection.•Hybridization of mutual information and moth flame algorithm for gene selection.•Performance of our algorithm is evaluated on sixteen benchmark datasets.•Our proposal obtained the best subset of genes with high classif...

Full description

Saved in:
Bibliographic Details
Published inExpert systems with applications Vol. 166; p. 114012
Main Authors Dabba, Ali, Tari, Abdelkamel, Meftali, Samy, Mokhtari, Rabah
Format Journal Article
LanguageEnglish
Published New York Elsevier Ltd 15.03.2021
Elsevier BV
Subjects
Online AccessGet full text
ISSN0957-4174
1873-6793
DOI10.1016/j.eswa.2020.114012

Cover

More Information
Summary:•The first work to apply the moth flame optimization algorithm to gene selection.•Hybridization of mutual information and moth flame algorithm for gene selection.•Performance of our algorithm is evaluated on sixteen benchmark datasets.•Our proposal obtained the best subset of genes with high classification accuracy. Several techniques or methods may help in detecting diseases and cancer. Creating an effective method for extracting disease information is one of the major challenges in the classification of gene expression data as long as there is (in the presence) a massive amount of redundant data and noise. Bio-inspired algorithms are among the most effective when used for solving gene selection. Moth Flame Optimization Algorithm (MFOA) is computationally less expensive and can converge faster than other methods. In this paper, we propose a new extension of the MFOA called the modified Moth Flame Algorithm (mMFA), the mMFA is combined with Mutual Information Maximization (MIM) to solve gene selection in microarray data classification. Our approach Called Mutual Information Maximization – modified Moth Flame Algorithm (MIM-mMFA), the MIM based pre-filtering technique is used to measure the relevance and the redundancy of the genes, and the mMFA is used to evolve gene subsets and evaluated by the fitness function, which uses a Support Vector Machine (SVM) with Leave One Out Cross Validation (LOOCV) classifier and the number of selected genes. In order to test the performance of the proposed MIM-mMFA algorithm, we compared the MIM-mMFA algorithm with other recently published algorithms in the literature. The experiment results which have been conducted on sixteen benchmark datasets either binary-class or multi-class, confirm that MIM-mMFA algorithm provides a greater classification accuracy.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2020.114012