Fast clustering algorithm for large ECG data sets based on CS theory in combination with PCA and K-NN methods

Long-term recording of Electrocardiogram (ECG) signals plays an important role in health care systems for diagnostic and treatment purposes of heart diseases. Clustering and classification of collecting data are essential parts for detecting concealed information of P-QRS-T waves in the long-term EC...

Full description

Saved in:
Bibliographic Details
Published in2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society Vol. 2014; pp. 98 - 101
Main Authors Balouchestani, Mohammadreza, Krishnan, Sridhar
Format Conference Proceeding Journal Article
LanguageEnglish
Published United States IEEE 01.01.2014
Subjects
Online AccessGet full text
ISSN1094-687X
1557-170X
DOI10.1109/EMBC.2014.6943538

Cover

More Information
Summary:Long-term recording of Electrocardiogram (ECG) signals plays an important role in health care systems for diagnostic and treatment purposes of heart diseases. Clustering and classification of collecting data are essential parts for detecting concealed information of P-QRS-T waves in the long-term ECG recording. Currently used algorithms do have their share of drawbacks: 1) clustering and classification cannot be done in real time; 2) they suffer from huge energy consumption and load of sampling. These drawbacks motivated us in developing novel optimized clustering algorithm which could easily scan large ECG datasets for establishing low power long-term ECG recording. In this paper, we present an advanced K-means clustering algorithm based on Compressed Sensing (CS) theory as a random sampling procedure. Then, two dimensionality reduction methods: Principal Component Analysis (PCA) and Linear Correlation Coefficient (LCC) followed by sorting the data using the K-Nearest Neighbours (K-NN) and Probabilistic Neural Network (PNN) classifiers are applied to the proposed algorithm. We show our algorithm based on PCA features in combination with K-NN classifier shows better performance than other methods. The proposed algorithm outperforms existing algorithms by increasing 11% classification accuracy. In addition, the proposed algorithm illustrates classification accuracy for K-NN and PNN classifiers, and a Receiver Operating Characteristics (ROC) area of 99.98%, 99.83%, and 99.75% respectively.
ISSN:1094-687X
1557-170X
DOI:10.1109/EMBC.2014.6943538