Enhanced Heart Disease Diagnosis Using Machine Learning Algorithms: A Comparison of Feature Selection

Heart disease or cardiovascular disease is one of the leading causes of death in the world. Based on WHO data, in 2019, as many as 17.9 million people died from cardiovascular disease. If early prevention is not carried out immediately, of course, the victims will increase every year. Therefore, wit...

Full description

Saved in:
Bibliographic Details
Published inJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) (Online) Vol. 9; no. 2; pp. 385 - 392
Main Authors Hirmayanti, Ema Utami
Format Journal Article
LanguageEnglish
Published Ikatan Ahli Informatika Indonesia 01.04.2025
Subjects
Online AccessGet full text
ISSN2580-0760
2580-0760
DOI10.29207/resti.v9i2.6175

Cover

More Information
Summary:Heart disease or cardiovascular disease is one of the leading causes of death in the world. Based on WHO data, in 2019, as many as 17.9 million people died from cardiovascular disease. If early prevention is not carried out immediately, of course, the victims will increase every year. Therefore, with the increasingly rapid development of technology, especially in the health sector, it is hoped that it can help medical personnel in treating patients suffering from various diseases, especially heart disease. So in this study, it will be more focused on the selection of relevant features or attributes to increase the accuracy value of the Machine Learning algorithm. The algorithms used include Random Forest and SVM. Meanwhile, for feature selection, several feature selection techniques are used, including information gain (IG), Chi-square (Chi2) and correlation feature selection (CFS). The use of these three techniques aims to obtain the main features so that they can minimize irrelevant features that can slow down the machine process. Based on the results of the experiment with a comparison of 70:30, it shows that CFS-SVM is superior by using nine features, which obtain the highest accuracy of 92.19%, while CFS-RF obtains the best value with eight features of 91.88%. By using feature selection and hyperparameter techniques, SVM obtained an increase of 10.88%, and RF obtained an increase of 9.47%. Based on the performance of the model using the selected relevant features, it shows that the proposed CFS-SVM shows good and efficient performance in diagnosing heart disease.
ISSN:2580-0760
2580-0760
DOI:10.29207/resti.v9i2.6175