Predicting dyslipidemia incidence: unleashing machine learning algorithms on Lifestyle Promotion Project data

Background Dyslipidemia, characterized by variations in plasma lipid profiles, poses a global health threat linked to millions of deaths annually. Objectives This study focuses on predicting dyslipidemia incidence using machine learning methods, addressing the crucial need for early identification a...

Full description

Saved in:
Bibliographic Details
Published inBMC public health Vol. 24; no. 1; pp. 1777 - 16
Main Authors Naderian, Senobar, Nikniaz, Zeinab, Farhangi, Mahdieh Abbasalizad, Nikniaz, Leila, Sama-Soltani, Taha, Rostami, Parisa
Format Journal Article
LanguageEnglish
Published London BioMed Central 03.07.2024
BioMed Central Ltd
Springer Nature B.V
BMC
Subjects
Online AccessGet full text
ISSN1471-2458
1471-2458
DOI10.1186/s12889-024-19261-8

Cover

More Information
Summary:Background Dyslipidemia, characterized by variations in plasma lipid profiles, poses a global health threat linked to millions of deaths annually. Objectives This study focuses on predicting dyslipidemia incidence using machine learning methods, addressing the crucial need for early identification and intervention. Methods The dataset, derived from the Lifestyle Promotion Project (LPP) in East Azerbaijan Province, Iran, undergoes a comprehensive preprocessing, merging, and null handling process. Target selection involves five distinct dyslipidemia-related variables. Normalization techniques and three feature selection algorithms are applied to enhance predictive modeling. Result The study results underscore the potential of different machine learning algorithms, specifically multi-layer perceptron neural network (MLP), in reaching higher performance metrics such as accuracy, F1 score, sensitivity and specificity, among other machine learning methods. Among other algorithms, Random Forest also showed remarkable accuracies and outperformed K-Nearest Neighbors (KNN) in metrics like precision, recall, and F1 score. The study’s emphasis on feature selection detected meaningful patterns among five target variables related to dyslipidemia, indicating fundamental shared unities among dyslipidemia-related factors. Features such as waist circumference, serum vitamin D, blood pressure, sex, age, diabetes, and physical activity related to dyslipidemia. Conclusion These results cooperatively highlight the complex nature of dyslipidemia and its connections with numerous factors, strengthening the importance of applying machine learning methods to understand and predict its incidence precisely.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1471-2458
1471-2458
DOI:10.1186/s12889-024-19261-8