Towards achieving optimal performance using stacked generalization algorithm : a case study of clinical diagnosis of malaria fever

The birth of data mining has been a blessing to all fields of endeavours and there are numerous data mining algorithms available today. One of the major problems of mining data is the selection of the appropriate algorithm or model for a job at hand; this has led to different comparison experiments...

Full description

Saved in:
Bibliographic Details
Published inInternational arab journal of information technology Vol. 16; no. 6; pp. 1074 - 1081
Main Authors Oguntimilehin, Abiodun, Osho, Innocent, Adetunmbi, Olusola
Format Journal Article
LanguageEnglish
Published Zarqa, Jordan Zarqa University 01.11.2019
Online AccessGet full text
ISSN1683-3198
1683-3198

Cover

More Information
Summary:The birth of data mining has been a blessing to all fields of endeavours and there are numerous data mining algorithms available today. One of the major problems of mining data is the selection of the appropriate algorithm or model for a job at hand; this has led to different comparison experiments by researchers. Stacked Generalization is one of the methods of combining multiple models to give a better accuracy. The method has been investigated to be effective by many researchers over the years. This study investigates how optimal performance could be achieved using Stacked Generalization algorithm. Six different data mining algorithms (PART, REP Tree, J48, Random Tree, RIDOR and JRIP) arranged in two different orders were used as base learners to two different Meta Learners (Random Forest and NNGE) independently and the results obtained were compared in terms of classification accuracy. The study shows that the order of arrangement of the base learners and the choice of Meta Learner could affect the accuracy of the Stacked Generalization method; NNGE outperforms Random Forest as a Meta-Learner and its performance is independent of the order of arrangement of the base learners as against Random Forest. Malaria fever datasets collected from reputable hospitals in Ado-Ekiti, Ekiti State, Nigeria were purposefully used for this study because malaria is one of the major diseases killing almost a million people yearly in the tropical region of Africa, so a more accurate malaria fever diagnosis model is as well proposed as a result of this study.
ISSN:1683-3198
1683-3198