Estimation of low-density lipoprotein cholesterol by machine learning methods

•Machine learning was used for LDL estimation from cholesterol, HDL and triglycerides.•Both shallow and deep machine learning was used.•Shallow methods perform in general as well as deep ones.•If the data to train the model is good, all algorithms will give good results.•Model interpretability and m...

Full description

Saved in:

Bibliographic Details
Published in	Clinica chimica acta Vol. 517; pp. 108 - 116
Main Authors	Tsigalou, Christina, Panopoulou, Maria, Papadopoulos, Charalambos, Karvelas, Alexandros, Tsairidis, Dimitrios, Anagnostopoulos, Konstantinos
Format	Journal Article
Language	English
Published	Netherlands Elsevier B.V 01.06.2021
Subjects	Cholesterol Friedewald LDL Machine learning Triglycerides Friedewald Triglycerides LDL Cholesterol Machine learning
Online Access	Get full text
ISSN	0009-8981 1873-3492 1873-3492
DOI	10.1016/j.cca.2021.02.020

Cover

More Information
Summary:	•Machine learning was used for LDL estimation from cholesterol, HDL and triglycerides.•Both shallow and deep machine learning was used.•Shallow methods perform in general as well as deep ones.•If the data to train the model is good, all algorithms will give good results.•Model interpretability and model drift should be considered. Accurate determination of low-density lipoprotein cholesterol (LDL) is important for coronary heart disease risk assessment and atherosclerosis. Apart from direct determination of LDL values, models (or equations) are used. A more recent approach is the use of machine learning (ML) algorithms. ML algorithms were used for LDL determination (regression) from cholesterol, HDL and triglycerides. The methods used were multivariate Linear Regression (LR), Support Vector Machines (SVM), Extreme Gradient Boosting (XGB) and Deep Neural Networks (DNN), in both larger and smaller data sets. Also, LDL values were classified according to both NCEP III and European Society of Cardiology guidelines. The performance of regression was assessed by the Standard Error of the Estimate. ML methods performed better than established equations (Friedewald and Martin). The performance all ML methods was comparable for large data sets and was affected by the divergence of the train and test data sets, as measured by the Jensen-Shannon divergence. Classification accuracy was not satisfactory for any model. Direct determination of LDL is the most preferred route. When not available, ML methods can be a good substitute. Not only deep neural networks but other, less computationally expensive methods can work as well as deep learning.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	0009-8981 1873-3492 1873-3492
DOI:	10.1016/j.cca.2021.02.020