Random Global and Local Optimal Search Algorithm Based Subset Generation for Diagnosis of Cancer

Bibliographic Details
Published in Current medical imaging reviews Vol. 16; no. 3; p. 249
Main Authors Meenachi, Loganathan; Ramakrishnan, Srinivasan
Format Journal Article
Language English
Published United Arab Emirates 01.01.2020
ISSN 1573-4056
DOI 10.2174/1573405614666180720152838

Summary: Data mining algorithms are widely used to classify data, and predicting disease with minimal computation time plays a vital role. The aim of this paper is to develop a classification model from reduced features and instances. We propose four search algorithms for feature selection. The first is the Random Global Optimal (RGO) search algorithm, which searches for the continuous, global optimal subset of features from a random population. The second is the Global and Local Optimal (GLO) search algorithm, which searches for the global and local optimal subset of features from the population. The third is the Random Local Optimal (RLO) search algorithm, which generates a random, local optimal subset of features from the random population. The fourth is the Random Global and Local Optimal (RGLO) search algorithm, which searches for the continuous, global and local optimal subset of features from the random population and combines the properties of the first three algorithms. The subsets of features generated by the four proposed search algorithms are evaluated using the consistency-based subset evaluation measure. An instance-based learning algorithm is then applied to the resulting feature dataset to remove instances that are redundant or irrelevant for classification. The model built with a naïve Bayesian classifier from the reduced features and instances is validated with tenfold cross-validation. Classification accuracy based on the RGLO search algorithm using the naïve Bayesian classifier is 94.82% for the Breast, 97.4% for the DLBCL, 98.83% for the SRBCT and 98.89% for the Leukemia datasets. The reduced features from the RGLO search yield a high prediction rate with less computation time than the complete dataset and the other proposed subset generation algorithms.
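The summary says candidate feature subsets are scored with a consistency-based subset evaluation measure. The Python sketch below illustrates one common form of that measure (the share of instances that agree with the majority class of their group when only the selected features are compared), paired with a plain random search. The random search is a generic stand-in and is not the RGO, GLO, RLO or RGLO algorithm, whose details the record does not give. The function names, the toy data and the parameters `n_trials` and `seed` are assumptions made for illustration.

```python
import random
from collections import Counter, defaultdict


def consistency(rows, labels, subset):
    """Fraction of instances that are consistent under the chosen features.

    Instances sharing the same values on `subset` form a group; every
    instance outside its group's majority class counts as inconsistent.
    """
    groups = defaultdict(Counter)
    for row, label in zip(rows, labels):
        key = tuple(row[i] for i in subset)
        groups[key][label] += 1
    inconsistent = sum(sum(c.values()) - max(c.values()) for c in groups.values())
    return 1.0 - inconsistent / len(rows)


def random_subset_search(rows, labels, n_trials=200, seed=0):
    """Plain random search over feature subsets (illustrative stand-in only)."""
    rng = random.Random(seed)
    n_features = len(rows[0])
    best = tuple(range(n_features))  # start from the complete feature set
    best_score = consistency(rows, labels, best)
    for _ in range(n_trials):
        size = rng.randint(1, n_features)
        candidate = tuple(sorted(rng.sample(range(n_features), size)))
        score = consistency(rows, labels, candidate)
        # Prefer higher consistency; break ties in favour of fewer features.
        if score > best_score or (score == best_score and len(candidate) < len(best)):
            best, best_score = candidate, score
    return best, best_score


# Hypothetical toy data: feature 0 determines the class, features 1 and 2 are noise.
ROWS = [(0, 1, 0), (0, 0, 1), (1, 1, 1), (1, 0, 0), (0, 1, 1), (1, 1, 0)]
LABELS = ["benign", "benign", "malignant", "malignant", "benign", "malignant"]

if __name__ == "__main__":
    subset, score = random_subset_search(ROWS, LABELS)
    print(subset, score)
```

The measure assumes discrete feature values, so continuous data such as gene expression levels would need to be discretized first. Breaking ties in favour of smaller subsets reflects the stated goal of classifying from reduced features.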