A new machine learning technique for an accurate diagnosis of coronary artery disease
•Novel data mining method is proposed for CAD diagnosis.•Application of feature selection (based on GA and PSO) is proposed.•New genetic training (N2Genetic optimizer) based on fusion of 10-fold cross-validation with GA or PSO is employed.•SVM (SVC, nuSVM, LinSVM) is employed for classification.•Hig...
Saved in:
| Published in | Computer methods and programs in biomedicine Vol. 179; p. 104992 |
|---|---|
| Main Authors | , , , , , |
| Format | Journal Article |
| Language | English |
| Published |
Ireland
Elsevier B.V
01.10.2019
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 0169-2607 1872-7565 1872-7565 |
| DOI | 10.1016/j.cmpb.2019.104992 |
Cover
| Summary: | •Novel data mining method is proposed for CAD diagnosis.•Application of feature selection (based on GA and PSO) is proposed.•New genetic training (N2Genetic optimizer) based on fusion of 10-fold cross-validation with GA or PSO is employed.•SVM (SVC, nuSVM, LinSVM) is employed for classification.•High classification accuracy of 93.08% is obtained.
Coronary artery disease (CAD) is one of the commonest diseases around the world. An early and accurate diagnosis of CAD allows a timely administration of appropriate treatment and helps to reduce the mortality. Herein, we describe an innovative machine learning methodology that enables an accurate detection of CAD and apply it to data collected from Iranian patients.
We first tested ten traditional machine learning algorithms, and then the three-best performing algorithms (three types of SVM) were used in the rest of the study. To improve the performance of these algorithms, a data preprocessing with normalization was carried out. Moreover, a genetic algorithm and particle swarm optimization, coupled with stratified 10-fold cross-validation, were used twice: for optimization of classifier parameters and for parallel selection of features.
The presented approach enhanced the performance of all traditional machine learning algorithms used in this study. We also introduced a new optimization technique called N2Genetic optimizer (a new genetic training). Our experiments demonstrated that N2Genetic-nuSVM provided the accuracy of 93.08% and F1-score of 91.51% when predicting CAD outcomes among the patients included in a well-known Z-Alizadeh Sani dataset. These results are competitive and comparable to the best results in the field.
We showed that machine-learning techniques optimized by the proposed approach, can lead to highly accurate models intended for both clinical and research use. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0169-2607 1872-7565 1872-7565 |
| DOI: | 10.1016/j.cmpb.2019.104992 |