Академический Документы
Профессиональный Документы
Культура Документы
Algorithm
Allam Jaya prakash,Samit Ari
Department of Electronics and Communication Engineering, National Institute of Technology, Rourkela,India
E-mail: allamjayaprakash@gmail.com
ECG plays a most important role in finding cardiac disorders of the heart. Cardiac arrhythmias occur in a short duration of time which can’t
be distinguishable by a human eye. The finding of arrhythmias is a tedious task since slight changes in the electrocardiogram signal (ECG)
may lead to life-threatening. Diagnosis and medication at an early stage of cardiac arrhythmia may facilitate to decrease the mortality rate of
the heart patients. This work presents an accurate system for the classification of five types of ECG arrhythmias namely Paced (P), Premature
ventricular contraction (V), Normal (N), Right bundle branch block (R) and Left bundle branch block (L). In the proposed technique, to
classify ECG arrhythmias random forest (RF) classifier is applied. The projected model of cardiac arrhythmia recognition system covers 3
stages they’re pre-processing, feature extraction & classification. In the initial stage, filtered the ECG signal raw data and finding the R-peak
locations of the ECG signal. Dual tree complex wavelet transform (DTCWT) is employed to extract the feature vector from the ECG signal
within the second stage. The final feature vector consists of an extracted feature set using DTCWT and four other temporal features of the
ECG signal. The random forest (RF) classifier access final feature set as input. Classifier performance is evaluated using MIT-BIH physionet
database. Finally, RF classifier achieved overall accuracy of 98.78% on an individual basis once tested over five kinds of physionet standard
arrhythmia database (MIT-BIH). The proposed arrangement of framework effectively classified five kinds of ECG arrhythmias. The proposed
RF classifier performs better than other reported techniques for the ECG arrhythmia classification.
1. Introduction: Electrocardiogram (ECG) is the recording of customized ECG beats. Linear vector quantization (LVQ) and self-
the electrical phenomenon at the heart that shows the variability of organizing maps (SOM) are two classification models based on
heart rate. Any deviation from the normal heart rate, disturbance clustering technique, LVQ is a supervised learning approach which
in rhythm, site of origin or conduction of heart electric impulse is classifies the feature vector corresponding to the class label of
considered as an arrhythmia. Most of the arrhythmias are seldom the cluster pattern (code word) into which is clustered. In SOM,
occurring and can’t be distinguished by a person’s eye. Long each centring of the cluster (prototype or code word) is denoted
duration of ECG record called as ambulatory electrocardiogram are by some weights of a neuron which is assigned to correlate in
required to identify the abnormalities present in a patient. Moreover, the feature map. The classifier is modelled using mixture of SOM
huge variations in temporal and some other morphological and LVQ approaches. The network is designed in such a way that
characteristics from one patient to the other patient make detection LVQ gives superior classification performance for classes 1 and 3
of abnormalities is a challenging task. Hence it is very difficult however for class 2 and 4 SOM gives superior performance. Overall
to analyse and recognize these arrhythmias manually by an expert classification accuracy of 94% is reported using this mixture of
cardiologist also. Hence we require an automatic computer-aided expert’s approach.
diagnostic (CAD) system that can quickly detect abnormalities A unique technique is proposed in [7] for a patient-adapted ECG
within the patient. heartbeat classification that consists of four stages namely, pre-
In literature, there are many algorithms proposed on automatic processing, feature extraction, feature selection and classification.
classification of ECG arrhythmias. Most of them have done ECG Features are extracted in temporal and morphological domain.
classification in the subsequent steps i) pre-processing ii) feature morphological domain features include coefficients obtained by
extraction iii) classification. In this feature extraction techniques applying stock well transform. 184 samples around each R peak
include both temporal (time) and morphological domain features. are taken by combining 4 temporal features (Pre R-R, Post
Some of the temporal features include R-R intervals, Q-R intervals, R-R, average R-R, local R-R) and 180 samples of frequency
QRS complex duration, R-S intervals, ST time segments. These domain features and obtained better classification results but poor
temporal features are not enough for classification since there is sensitivity as reported in [7].
a huge variation in the other morphological patterns within the To address the above limitations, a novel approach of random
same patient. Hence some of the researchers have reported the forest (RF) classifier is proposed to classify ECG arrhythmias.
mixture of both frequency and time features as reported in [1]. Random forest is a highly powerful machine learning algorithm
Many transformation techniques are used, like S transform, Fourier which is based on supervised learning.[8] Random forest algorithm
transform (FT) and discrete wavelet transform (DWT) to extract can use individually for both classification and the regression kind
the features from the pre-processed data [2]. Extracted mixture of difficulties. As name suggests that this algorithm creates the
feature set is given to the input of a classifier for classification forest with many number of random trees .
into respective classes. Some of the classifiers are artificial neural The remaining paper is organized as follows: Section 2 presents
networks (ANN) [3], support vector machine (SVM) [4] etc. MIT-BIH database that is used to evaluate the performance of
Detection of cardiac arrhythmias by applying Hidden Markov the proposed classification algorithm. Section 3 contains proposed
models is reported in [5].This algorithm has demonstrated very framework. Section 4 represents the experiment results and
good promising results in classifying ventricular arrhythmias and discussion of the proposed algorithm. The conclusion of the work
detection of low amplitude P wave detection. In [6], the author is explained in section 5.
presented a mixture of experts (MOE) methodology to classify
(i) Pre-processing
Normalization
database for detection of cardiac arrhythmias. The database
contains forty-eight files of ECG recordings, and every file consists
half an hour ECG segment selected from 24 hours recordings of Filtering
48 specific patients. The first 23 recordings related to the normal
routine clinical recordings while the remaining recordings contain R-peak detection
the complex ventricular, junctional & supraventricular arrhythmias
[9]. MIT-BIH ECG recordings are digitized at 360 Hz and band-
pass filtered at 0.1-100 Hz. Fig.1 represents the plot of record 100
from MIT-BIH ECG signal database. Annotation file contains labels
Morphological features
Temporal features
AC power
for each rhythm that are detected by using a simple slope sensitive
P V N RBBB LBBB
beat with better accuracy.The proposed method classifies N, P, L, R arrhythmia beat detection shows an accuracy of 99.70%,
R, V with sensitivity of 99.74%, 99.05%, 96.16%, 95.88%, 94.42% sensitivity of 95.88%, specificity of 99.99% and positive
respectively. The proposed technique provides best overall accuracy predictivity of 99.86%. F-score is calculated based on precision
of 98.78% compared to the all other classifiers. The proposed and recall. F-score provides the most realistic measure of classifier
technique with random classifier gives higher true positive rate performance.
(TPR) than false positive rate (FPR) [18].
In [11] M.Thomas et al. selected a window of 128 samples
from right and left of the R-peak of the ECG signal.The final
feature vector consists of an extracted feature set from DTCWT and
four other temporal features.The authors proposed multilayer back
propagation neural network is proposed to classify the five types
of ECG arrhythmias. Multilayer back propogation neural network
is very much sensitive to the hidden layer neurons. When hidden
layer neurons are less,then MLP leads to underfitting. If the hidden
layer neurons are many then it leads to over fitting so that the
fitting curve takes uncontrolled oscillations.Network dysfunction
happens once the weights are adjusted to terribly massive values.
It has the following disadvantages (i) A fixed handcrafted feature
extraction method may not be suitable for extracting features when
classifying beats of a particular person (ii) Complexity in the
structure of this technique imposes difficulty in implementation.
DTCWT is a promising technique even though because of multi Figure 5: Patterns classification plot of N, V, P, L, R arrhythmias
layer back propogation drawbacks final classification accuracy
decreased. These drawbacks can be overcome by random forest
Precision and recall values are calculated using confusion matrix
algorithm to get better accuracy [19].
represented in Table 2. Precision is defined as the fraction of
Table 3 represents the performance parameters of the classifier
instances correctly classified as positive out of all the instances
like accuracy (Acc), sensitivity (Sen), specificity (Spe), positive-
the algorithm classified as positive, whereas recall is the fraction
predictivity (Ppr) [20]. All 48 files including four files of paced
of elements correctly classified as positive out of all the positive
beats also included for the performance evaluation of the classifier.
elements [21].
For 48 files the N arrhythmia beat detection shows an accuracy of
98.93% , sensitivity of 99.74%, specificity of 96.85% and positive 2 ∗ TP
predictivity of 98.78%, V arrhythmia beat detection shows an F − score(F) = (11)
2 ∗ TP + FP + FN
accuracy of 99.39% , sensitivity of 94.42%, specificity of 99.76%
and positive predictivity of 96.63%, P arrhythmia beat detection The overall F-score of a classifier calculated as 98.24 by using
shows an accuracy of 99.90% , sensitivity of 99.05%, specificity precision and recall values.
of 99.96% and positive predictivity of 99.51%. L arrhythmia beat
detection shows an accuracy of 99.63% , sensitivity of 96.16%, Table 4 True, False positive rates & Roc area of Individual Classes
specificity of 99.93% and positive predictivity of 99.11%. Class TP Rate FP Rate ROC Area
N 0.997 0.031 0.999
Table 2 Classification results of MIT-BIH arrhythmia database V 0.944 0.002 0.999
P 0.990 0.000 1.000
Class Confusion matrix [DTCWT] L 0.962 0.001 0.999
N V P L R R 0.959 0.000 0.999
N 74788 142 9 42 4
V 357 6730 21 17 3
P 43 23 6953 1 0 ROC area is used to evaluate the performance of the classifier
L 250 54 3 7759 3 [22]. An area of 1.000, 0.500 represents the perfect test and test
R 272 16 1 10 6952 fail respectively. Table 4 shows the ROC area of individual five
arrhythmia classes. For all classes ROC area is above 0.9. The
results shows that Random forest classifier works efficiently for the
Table 3 Classification performance of RF with DTCWT features
arrhythmia classification [23].
Performance matrix
Method Class
Acc (%) Sen (%) Spe (%) Ppr (%) 5. Conclusion: In this letter, an automatic classification
N 98.93 99.74 96.85 98.78 approach is proposed to classify the ECG arrhythmia types of
V 99.39 94.42 99.76 96.63 Paced (P), Premature ventricular contraction (V), Normal (N),
Proposed
P 99.90 99.05 99.96 99.51 Right bundle branch block (R) and Left bundle branch block
Method
L 99.63 96.16 99.93 99.11 (L). The experiments are conducted on the standard MIT-BIH
R 99.70 95.88 99.99 99.86 standard physionet dataset [24]. In this work Feature extraction