Implementation of five machine learning methods to predict the 52-week blood glucose level in patients with type 2 diabetes

For the patients who are suffering from type 2 diabetes, blood glucose level could be affected by multiple factors. An accurate estimation of the trajectory of blood glucose is crucial in clinical decision making. Frequent glucose measurement serves as a good source of data to train machine learning...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in endocrinology (Lausanne) Vol. 13; p. 1061507
Main Authors Fu, Xiaomin, Wang, Yuhan, Cates, Ryan S., Li, Nan, Liu, Jing, Ke, Dianshan, Liu, Jinghua, Liu, Hongzhou, Yan, Shuangtong
Format Journal Article
LanguageEnglish
Published Switzerland Frontiers Media S.A 20.01.2023
Subjects
Online AccessGet full text
ISSN1664-2392
1664-2392
DOI10.3389/fendo.2022.1061507

Cover

More Information
Summary:For the patients who are suffering from type 2 diabetes, blood glucose level could be affected by multiple factors. An accurate estimation of the trajectory of blood glucose is crucial in clinical decision making. Frequent glucose measurement serves as a good source of data to train machine learning models for prediction purposes. This study aimed at using machine learning methods to predict blood glucose for type 2 diabetic patients. We investigated various parameters influencing blood glucose, as well as determined the most effective machine learning algorithm in predicting blood glucose. 273 patients were recruited in this research. Several parameters such as age, diet, family history, BMI, alcohol intake, smoking status et al were analyzed. Patients who had glycosylated hemoglobin less than 6.5% after 52 weeks were considered as having achieved glycemic control and the rest as not achieving it. Five machine learning methods (KNN algorithm, logistic regression algorithm, random forest algorithm, support vector machine, and XGBoost algorithm) were compared to evaluate their performances in prediction accuracy. R 3.6.3 and Python 3.12 were used in data analysis. The statistical variables for which p< 0.05 was obtained were BMI, pulse, Na, Cl, AKP. Compared with the other four algorithms, XGBoost algorithm has the highest accuracy (Accuracy=99.54% in training set and 78.18% in testing set) and AUC values (1.0 in training set and 0.68 in testing set), thus it is recommended to be used for prediction in clinical practice. When it comes to future blood glucose level prediction using machine learning methods, XGBoost algorithm scores the highest in effectiveness. This algorithm could be applied to assist clinical decision making, as well as guide the lifestyle of diabetic patients, in pursuit of minimizing risks of hyperglycemic or hypoglycemic events.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Edited by: Huajin Wang, Carnegie Mellon University, United States
These authors have contributed equally to this work
Reviewed by: Tadao Ooka, Massachusetts General Hospital, Harvard Medical School, United States; Fei Wang, The Affiliated Hospital of Qingdao University, China; Guifang Yan, Johns Hopkins Medicine, United States
This article was submitted to Systems Endocrinology, a section of the journal Frontiers in Endocrinology
These authors share first authorship
ISSN:1664-2392
1664-2392
DOI:10.3389/fendo.2022.1061507