Prediction of zero-dose children using supervised machine learning algorithm in Tanzania: evidence from the recent 2022 Tanzania Demographic and Health Survey
ObjectivesThis study aimed to employ machine learning algorithms to predict the factors contributing to zero-dose children in Tanzania, using the most recent nationally representative data.DesignCross-sectional study.SettingThis study was conducted in Tanzania and used the most recent 2022 Tanzania...
Saved in:
| Published in | BMJ open Vol. 15; no. 3; p. e097395 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
England
British Medical Journal Publishing Group
22.03.2025
BMJ Publishing Group LTD BMJ Publishing Group |
| Subjects | |
| Online Access | Get full text |
| ISSN | 2044-6055 2044-6055 |
| DOI | 10.1136/bmjopen-2024-097395 |
Cover
| Summary: | ObjectivesThis study aimed to employ machine learning algorithms to predict the factors contributing to zero-dose children in Tanzania, using the most recent nationally representative data.DesignCross-sectional study.SettingThis study was conducted in Tanzania and used the most recent 2022 Tanzania Demographic and Health Survey, accessed from http://www.dhsprogram.com.ParticipantsA total of 2120 children aged 12–23 months were included in this study.Outcome measureSeven classification algorithms were used in this study: logistic regression, decision tree classifier, random forest classifier (RF), support vector machine, K-nearest neighbour, XGBoost (XGB) and Naive Bayes. The dataset was randomly divided into training and testing sets, with 80% allocated for training and 20% for testing. After training the models, the testing data were used to evaluate their performance. This evaluation measured the models’ ability to generalise to unseen data using performance metrics such as accuracy, precision, recall, F1-score and AUC.ResultsApproximately 7.45% of children (95% CI 6.73%, 8.65%) were categorised as zero-dose children. The RF classifier achieved the highest performance metrics among the evaluated algorithms, with accuracy=0.95, precision=0.94, recall=0.96, F1 score=0.95 and AUC=0.99, making it the most effective supervised machine learning method for predicting zero-dose children in Tanzania. Maternal unemployment had the most significant positive impact (+0.060) on predicting zero-dose children. Lack of maternal education was the second most significant positive factor (+0.048), indicating that mothers without formal education are more likely to have zero-dose children. Small family size was the third most influential factor, with a positive effect (+0.040) on predicting zero-dose children in Tanzania.ConclusionsThe RF classifier emerged as the top-performing model for predicting children in Tanzania who have not received any vaccinations. This comprehensive approach enabled the accurate identification of zero-dose children, highlighting the effectiveness of machine learning in enhancing public health initiatives and optimising vaccination strategies. Using this algorithm can enhance health outcomes and reduce the prevalence of vaccine-preventable diseases in Tanzania. |
|---|---|
| Bibliography: | Original research ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise. None declared. |
| ISSN: | 2044-6055 2044-6055 |
| DOI: | 10.1136/bmjopen-2024-097395 |