A comparative approach to alleviating the prevalence of diabetes mellitus using machine learning
Diabetes mellitus, a metabolic disease with elevated blood sugar levels, is a significant global public health concern. Identification of diabetes at its very early stage can reduce the prevalence of cases. This work focuses on developing a machine learning-based system that will have a significant...
Saved in:
Published in | Computer methods and programs in biomedicine update Vol. 4; p. 100113 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Elsevier B.V
2023
Elsevier |
Subjects | |
Online Access | Get full text |
ISSN | 2666-9900 2666-9900 |
DOI | 10.1016/j.cmpbup.2023.100113 |
Cover
Summary: | Diabetes mellitus, a metabolic disease with elevated blood sugar levels, is a significant global public health concern. Identification of diabetes at its very early stage can reduce the prevalence of cases. This work focuses on developing a machine learning-based system that will have a significant impact on diabetic patient identification. To develop such a system we utilized a dataset made up by acquiring direct questionnaires from Sylhet Diabetic Hospital patients. The dataset contains information about the signs and symptoms of patients who are new or likely to have diabetes. We applied 14 different machine-learning techniques where the Gradient Boosting Machine (GBM) outperformed other algorithms with the highest F1 and ROC scores of 99.37%, and 99.92% respectively. We also employed various ensemble-based approaches that show competitive performance to the individual model’s performance.
•Determination of risk factors through exploratory data analysis.•Building and validation of ML model to detect diabetes in early stage.•Comparison of 14 machine learning techniques and set a baseline for future research.•GBM model outperformed other models according to different performance metrics. |
---|---|
ISSN: | 2666-9900 2666-9900 |
DOI: | 10.1016/j.cmpbup.2023.100113 |