Prediction of zero-dose children using supervised machine learning algorithm in Tanzania: evidence from the recent 2022 Tanzania Demographic and Health Survey

ObjectivesThis study aimed to employ machine learning algorithms to predict the factors contributing to zero-dose children in Tanzania, using the most recent nationally representative data.DesignCross-sectional study.SettingThis study was conducted in Tanzania and used the most recent 2022 Tanzania...

Full description

Saved in:
Bibliographic Details
Published inBMJ open Vol. 15; no. 3; p. e097395
Main Authors Asnake, Angwach Abrham, Seifu, Beminate Lemma, Gebrehana, Alemayehu Kasu
Format Journal Article
LanguageEnglish
Published England British Medical Journal Publishing Group 22.03.2025
BMJ Publishing Group LTD
BMJ Publishing Group
Subjects
Online AccessGet full text
ISSN2044-6055
2044-6055
DOI10.1136/bmjopen-2024-097395

Cover

More Information
Summary:ObjectivesThis study aimed to employ machine learning algorithms to predict the factors contributing to zero-dose children in Tanzania, using the most recent nationally representative data.DesignCross-sectional study.SettingThis study was conducted in Tanzania and used the most recent 2022 Tanzania Demographic and Health Survey, accessed from http://www.dhsprogram.com.ParticipantsA total of 2120 children aged 12–23 months were included in this study.Outcome measureSeven classification algorithms were used in this study: logistic regression, decision tree classifier, random forest classifier (RF), support vector machine, K-nearest neighbour, XGBoost (XGB) and Naive Bayes. The dataset was randomly divided into training and testing sets, with 80% allocated for training and 20% for testing. After training the models, the testing data were used to evaluate their performance. This evaluation measured the models’ ability to generalise to unseen data using performance metrics such as accuracy, precision, recall, F1-score and AUC.ResultsApproximately 7.45% of children (95% CI 6.73%, 8.65%) were categorised as zero-dose children. The RF classifier achieved the highest performance metrics among the evaluated algorithms, with accuracy=0.95, precision=0.94, recall=0.96, F1 score=0.95 and AUC=0.99, making it the most effective supervised machine learning method for predicting zero-dose children in Tanzania. Maternal unemployment had the most significant positive impact (+0.060) on predicting zero-dose children. Lack of maternal education was the second most significant positive factor (+0.048), indicating that mothers without formal education are more likely to have zero-dose children. Small family size was the third most influential factor, with a positive effect (+0.040) on predicting zero-dose children in Tanzania.ConclusionsThe RF classifier emerged as the top-performing model for predicting children in Tanzania who have not received any vaccinations. This comprehensive approach enabled the accurate identification of zero-dose children, highlighting the effectiveness of machine learning in enhancing public health initiatives and optimising vaccination strategies. Using this algorithm can enhance health outcomes and reduce the prevalence of vaccine-preventable diseases in Tanzania.
Bibliography:Original research
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.
None declared.
ISSN:2044-6055
2044-6055
DOI:10.1136/bmjopen-2024-097395