Literature DB >> 35656286

Classification of cardiac arrhythmia using a convolutional neural network and bi-directional long short-term memory.

Shahab Ul Hassan¹, Mohd S Mohd Zahid¹, Talal Aa Abdullah¹, Khaleel Husain².

Abstract

Cardiac arrhythmia is a leading cause of cardiovascular disease, with a high fatality rate worldwide. The timely diagnosis of cardiac arrhythmias, determined by irregular and fast heart rate, may help lower the risk of strokes. Electrocardiogram signals have been widely used to identify arrhythmias due to their non-invasive approach. However, the manual process is error-prone and time-consuming. A better alternative is to utilize deep learning models for early automatic identification of cardiac arrhythmia, thereby enhancing diagnosis and treatment. In this article, a novel deep learning model, combining convolutional neural network and bi-directional long short-term memory, is proposed for arrhythmia classification. Specifically, the classification comprises five different classes: non-ectopic (N), supraventricular ectopic (S), ventricular ectopic (V), fusion (F), and unknown (Q) beats. The proposed model is trained, validated, and tested using MIT-BIH and St-Petersburg data sets separately. Also, the performance was measured in terms of precision, accuracy, recall, specificity, and f1-score. The results show that the proposed model achieves training, validation, and testing accuracies of 100%, 98%, and 98%, respectively with the MIT-BIH data set. Lower accuracies were shown for the St-Petersburg data set. The performance of the proposed model based on the MIT-BIH data set is also compared with the performance of existing models based on the MIT-BIH data set.

Entities: Chemical

Keywords: Arrhythmia; accuracy; bi-directional long short-term memory; classification.; convolutional neural network; electrocardiogram; precision

Year: 2022 PMID： 35656286 PMCID： PMC9152186 DOI： 10.1177/20552076221102766

Source DB: PubMed Journal: Digit Health ISSN： 2055-2076

Introduction

Cardiovascular disease (CVD) is the leading cause of death globally. As per the World Health Organization (WHO), over 17.9 million humans around the world have died as a result of CVD diseases.[1,2] Early diagnosis of CVD is critical to preventing sudden death from a heart attack or cardiac arrest. Cardiac arrhythmias refer to a group of disorders in which the heart’s electrical impulse is abnormal, resulting in a quicker or slower beat than usual. A thorough investigation of the electrocardiogram (ECG) segment offers structural instruction about cardiac patients, widely employed in clinical procedures for arrhythmia identification. Usually, the ECG signs of cardiac disease do not appear within a short ECG recording period. They require a prolonged recording and monitoring of more than one day. This lengthens and complicates cardiologists’ interpretation of ECG charts. Thus, numerous advancements in recent years have been made to ECG signals to decrease mortality and assist cardiologists in making timely, efficient, and accurate decisions. There are two processes for assessing ECG characteristics in the traditional manual method. The first stage involves extracting ECG features, while the second stage categorizes ECG based on the retrieved characteristics. The process is cumbersome and error-prone for the cardiologists, and there is a need for automated ECG classification. Therefore, early recognition of cardiac arrhythmia is critical to effective investigation and treatment. Several machine learning-based techniques for extracting heart characteristics and training models for arrhythmia identification have recently been developed. Linear predictive coding, wavelet entropy, synchro-squeezing wavelet transform, k-nearest neighbor and support vector machine models have all been utilized in predicting arrhythmia. While these non-deep learning techniques perform well, they suffer from various constraints, including poor classification performance for large data. Thus, several deep learning methods have been applied recently to address a variety of difficult problems across all disciplines of health care research, including ECG classification.[9,10] Deep learning methods transcend the limitations of traditional disease diagnosis, enhancing performance and generalization by reducing pre-processing and feature extraction.[11,12] In this context, only a few studies on convolutional neural networks (CNNs),[13,14] recurrent neural networks (RNNs) such as long short-term memory (LSTM), and bi-directional long short-term memory (Bi-LSTM) are used for heart categorization and found significant improvement. In recent years, end-to-end training of CNN has been the dominating technique for health care image analysis. Additionally, because of its capacity to record position and shift-invariant modes, CNN is used to analyze the morphology of clinical information. Even when the input signal is noisy, CNN may also be able to retrieve valuable data. These performance characteristics are mirrored in the network structure built layer by layer. As the network’s layers increase, features are learned and expressed more abstractly and concisely. Moreover, LSTM is a kind of artificial RNN,[19,20] which is suitable for classifying sequences and time-series data. LSTM only preserves the previous data because the only inputs it has received are from the past. The Bi-LSTM is a variant of the traditional LSTM capable of learning from both past and future states. It enables the network to learn representations of the characteristics and the temporal connection between the features. This work evaluates a deep learning model that combines CNN and Bi-LSTM on two data sets (MIT-BIH and St-Petersburg data sets) for autonomously detecting arrhythmia illness from ECG signals. The model is capable of detecting and classifying five different types of cardiac arrhythmias such as non-ectopic (N), supraventricular ectopic (S), ventricular ectopic (V), fusion (F), and unknown (Q) beats. The contributions of this article are as follows: An overview of the state-of-the-art work related to categorizing multiple classes from ECG signals using different data sets. Proposed a novel deep learning model for categorizing five classes of cardiac arrhythmia from ECG signals using the MIT-BIH and ST-Petersburg data sets. Evaluate the optimum hyper-parameters of conv1D in terms of Kernel size, number of filters, activation function and number of layers. We combined the Bi-LSTM technique of size 32 based on factorial cross-entropy following the Adam optimizer with an evaluated optimum conv1D. This approach is the first to detect cardiac arrhythmia in the way described above to the best of our knowledge. The results indicate that the detection accuracy, sensitivity, specificity, and precision are promising. The rest of the article‘s organization is as follows. The “Background” section covers the basic concepts of CNN, LSTM, and Bi-LSTM algorithms and their key parameters required to understand the topic. The “Methodology” section covers the methodology, explaining the workflow of the performance analysis process. The “Results” section gives the results, such as model performance, training and validation outcomes, and the “Discussions” section discusses the proposed model. Finally, the “Conclusions” section offers the conclusion.

Related work

There has been significant research on using deep learning models to classify arrhythmias. Study by Acharya et al. describes using a CNN method to identify the various ECG signals automatically. The method comprises an 11 layers of deep CNN with a four-neuron output layers that indicates typical atrial fibrillation (AFIB), Nsr, atrial flutter (AFL), and Vfib ECG classifications. The deep learning one-dimensional (1D)-CNN model was proposed in the work of Yıldırım which was capable of correctly identifying cardiac arrhythmias (17 diagnostic classifications including normal sinus rhythm, the rhythm of a pacemaker, and 15 other rhythm abnormalities) from the assessment of ECG signal. Most researchers combined CNN with LSTM and achieved satisfactory results in classifying arrhythmia from ECG signals. Work of Oh et al. proposed an automated system utilized a combination of CNN-LSTM algorithms for diagnosing normal beats, left bundle branch block, right bundle branch block, atrial premature beats and premature ventricular contraction from ECG signals. The deep CAE-LSTM method was proposed in for recognizing arrhythmic heartbeat. Specifically, the CAE technique was utilized to shorten ECG waves and gather low-dimensional digitized information from individual ECG records. The coded signals were then classified using an LSTM network model. They operated 100,022 fragmented data representing five distinct ECG pulse patterns. Work of Zheng et al. developed a technique for categorizing arrhythmias by combining a CNN with LSTM, which was then applied to eight ECG data sets, one of which had a typical normal heart rhythm. Deep learning-based technique for automatically classifying six kinds of ECG signals: AFIB, normal sinus rhythm (N) segments, pacing rhythm (P), ventricular bigeminy (B), sinus bradycardia, and atrial flutter (AFL) were presented in. This framework processed ECG recordings and related RR duration from the well-known data set MIT-BIH using a multi-input structure. In the work of Yao et al., an attention-based time-incremental CNN model was developed to categorize arrhythmia using wide-ranging ECG data. This model collected information from ECG data in two stages: spatial information fusion using CNN, temporal information fusion using LSTM cells and an attention module that classified eight classes of arrhythmias and sinus rhythms. In the work of Rai and Chatterjee, the authors suggest a hybrid CNN-LSTM model and ensemble technique for identifying myocardial infarction using ECG signals. They achieved satisfactory results by classifying all non-MI beats (N, S, V, F, Q) as regular beats and myocardial infarction beats as MI beats. An automatic diagnosis of cardiac arrhythmias from large-scale ECG data has been suggested utilizing a feature extraction approach based on the multi-layer probabilistic neural network (MPNN) classifier to classify two distinct classes of arrhythmia. Although there has been considerable work on detecting cardiac arrhythmia using deep learning techniques, an effective CNN-Bi-LSTM technique for arrhythmia classification is still lacking. Recently, Bi-LSTM has been used in different areas and has shown good performance. In this work, we combine CNN with Bi-LSTM models to evaluate and classify five different ECG segments, comprising N, S, V, F, and Q.

Background

An ECG is a non-invasive diagnostic tool that visually captures the heart’s pulse and electrical activity over time. Arrhythmia is described as the irregular electrical activity of any set of diseases affecting the heart, resulting in a fast or slow pulse. DNNs have been utilized in the categorization of ECGs recently. A DNN model’s features are more extensive than those manually retrieved with adequate training data. The CNN and LSTM architectures are commonly utilized for prediction, detection, and recognition in most artificial intelligence applications using sequential data or signals of one dimension. As a result of its local and parameter sharing capabilities, CNN is an efficient technique for extracting features. Also, LSTM is a frequently used technique for processing time-series signals. Both CNN and LSTM are more successful in detecting multi-class arrhythmias. The following section provides an in-depth discussion on the CNN, LSTM, and Bi-LSTM techniques.

Convolutional neural network

CNN is a subset of artificial neural networks that are extensively utilized for image processing, feature extraction and categorization of time series data. CNN has been one of the most often utilized artificial intelligence methods due to its superior capacity to identify characteristics automatically. CNN models are constructed using convolutional layers and a fully connected layer with related weights, and pooling layers. Moreover, 1D-CNNs are highly suitable for real-time applications because of their minimal processing requirements. The basic CNN architecture inside the network comprises a series of layers, each of which is responsible for a distinct function. Multiple convolutional filters work concurrently to extract outcomes and display them as activation. Several convolutions enhance activations, resulting in a feature map for the associated input. The following equation applies to single convolutional processing of a signal in which denotes the total number of points, is the activation function, layer index is , is the th feature map’s bias, is the feature map’s weight, kernel size is , and th is the filter index. The local characteristics of the ECG segments are extracted using CNNs. The general conv1d network design is depicted in Figure 1.

Figure 1.

General architecture of Conv1D.

Long short-term memory

LSTM technique is based on RNNs. A fundamental LSTM unit cell consists of an input gate, an output gate, and a forget gate. The LSTM enables the processing of a variety of different types of data in adjacent time steps, thereby producing an internal feedback state that helps the network to comprehend the concept of time and high variability of the exhibited data. By incorporating gate control, the LSTM network integrates resolving gradient disappearing up to a specific extent. While an LSTM framework can retain and comprehend previous data, it cannot correctly use new data to assist final prediction. As a result, a Bi-LSTM with two-way capability was built using the LSTM network. Bi-LSTM networks outperform other RNN, and LSTM structures in sequence prediction modeling, particularly in the areas of machine translation and voice or handwriting detection.

Bi-directional long-short term memory

The Bi-LSTM comprises two separate LSTMs capable of summarizing data from both the backward and forward directions and then integrating the data from both ways. It is advantageous to access both previous and future contexts while doing sequence labeling tasks. Bi-LSTM proposes forwarding and reversing each sequence to two distinct hidden states to gather future and past information and then concatenating the two hidden states to produce the outcome. For every time t, the next LSTM generates the hidden vector using the prior hidden vector and the input words encoding , while the reverse LSTM produces the hidden vector using the opposite prior hidden vector . After that, the backward hidden vector and forward hidden vector are combined to form the Bi-LSTM model’s last hidden vector. In the Bi-LSTM framework, the parameters of two opposing directions are distinct, although they share identical sentence embeddings. Figure 2 depicts the Bi-LSTM model’s fundamental structure, where represents the forward hidden vector and representing the backward hidden vector, respectively. denotes the vector formed by the intersection of and .

Figure 2.

General architecture of bi-directional long short-term memory (Bi-LSTM) network.

MIT-BIH arrhythmia database

The MIT-BIH data set has grown in popularity in recent years. This data set has been utilized at over 500 locations globally since 1980 to conduct basic heart dynamics research. The proposed work incorporates experimental data from the MIT-BIH arrhythmia database, which is publically available on PhysioNet. The signals were digitized at a sampling rate of 360 samples per second per channel and a resolution of 11 bits throughout a 10 mV range. In total, 48 half-hour snippets of 24-hour two-channel ECG recordings from 48 recordings were used for this work at a sample rate of 360 Hz. ECG signals in the MIT-BIH arrhythmia database were described by a binary file (.dat), a text header file (.hea), and a binary annotation file (.atr). The header file (.hea) is a small text file that contains information about the signals’ details (includes the file’s name, sample amount, recording’s format, and comprehensive clinical data). Annotation files have a collection of labels, each describing a property of one or more signals at a specific location in the record.

St-Petersburg arrhythmia database

St-Petersburg INCART 12 lead arrhythmia database includes 75 recordings that have been annotated. These samples were culled from 32 different Holter recordings. Each channel lasts thirty minutes and contains 12 standard leads, with a sampling rate of 257 Hz. The samples were collected from 17 men and 15 women, ranging from 18 to 80 years old, and 58 years old for men. The .hea file contains information on the patient’s age, gender, diagnosis, and an analysis of the ECG’s characteristics for each record. Additionally, each .hea file has a patient‘s number (1–32) that uniquely distinguishes the source recordings; each patient’s recording was generated from the identical Holter records.

Methodology

The flowchart for conducting the research is shown in Figure 3. Firstly, MIT-BIH and St-Petersburg data sets are prepared for model training by performing signal extraction, data normalization, and data balancing. The ECG signals have been split throughout the training, validation, and testing phases. We extract the ECG signals using the WFDB package and then normalize the data using tf.Keras.utils.normalize. The resampling method was used to balance the training data, with down-sampling for the majority class and up-sampling for the minority classes. The model development stage starts with building a CNN model with optimal number of layers, filters, kernel sizes, and activation function. The CNN model is then integrated with a Bi-LSTM model. The number of layers in Bi-LSTM is determined when the accuracy of categorizing output classes of arrythmia has been satisfied at acceptable level.

Figure 3.

Flowchart diagram of the proposed work.

Flowchart diagram of the proposed work. The training, validation and testing phases of the model is performed separately for MITH-BIH and St-Petersburg data sets. Initially it is performed with MIT-BIH, then it is performed with St-Petersburg data set. The results are then compared as another measure to obtain a credible model.

Data preparation

The proposed model is evaluated utilizing MIT-BIH and St-Petersburg database. There are 48 individuals in the MIT-BIH database, with each subject’s data including normal, abnormal, and non-beats. Moreover, the number of seconds of recording before and after the beat is set to three seconds. Finally, the data set is divided into subsets for training, validation, and testing. 60% patients’ data were chosen randomly for training, and 20% for validation, while the remaining 20% patients’ data were utilized for testing. Similarly, the St-Petersburg data set has 32 recordings, each lasting thirty minutes and consisting of 12 standard leads with normal, abnormal, and non-beats. The St-Petersburg data set has also been randomly divided into training, validation and testing phases of a time duration of 3 sec.

Data balancing

A balanced data set is one that has a similar amount of input samples for each output class or target class. The use of a resampling approach can accomplish balancing. It is critical to balance the data set, or at the very least, bring it near to being balanced. The primary purpose of this is to ensure that each class is given equal priority. The MIT-BIH and St-Petersburg arrhythmia data sets are imbalanced data sets. We used the down-sampling and up-sampling techniques in order to balance the data sets. After resampling the training data we fixed 30,000 samples for each output class. After data balancing, we implement the data into the proposed model to categorize cardiac arrhythmia.

Proposed model (CNN-Bi-LSTM)

The proposed deep learning model combined two neural networks, CNN and Bi-LSTM. The choice to integrate these two techniques was made with the understanding that their combination provides better outcomes. The network created in this study has two Conv1D layers and two Bi-LSTM layers. An activation function follows each Conv1D layer for rectified linear units (ReLU). To boost their effectiveness in feature extraction, we then add two 1D max-pooling layers of size 2. The Conv1D layer was created with a kernel size of 5 and 32 filters in total, and then the Bi-LSTM layers having a size of 32 were used to link these layers. A fully connected layer with softmax activation is utilized after enhancing the temporal information between segments using Bi-LSTM. We also used a callback class to stop the model when reaching 0.999% accuracy. We compute the output classification probabilities from the training and validation outcomes using performance matrices. We also apply kernel_ regularizer = regularizers.l2 of size 0.0003 to reduce information redundancy and prevent overfitting because most segments overlap. Furthermore, Table 1 shows the CNN-Bi-LSTM model summary, which includes information such as output shape, layer type, and the number of parameters. Moreover, we utilized categorical_ crossentropy for the loss function, and the optimizer considered was “Adam” optimizer. Figure 4 explained the proposed model layer by layer detail as well as the optimum parameters which we achieved from experiments.

Table 1.

Model summary of the convolutional neural network and bi-directional long short-term memory (CNN-Bi-LSTM) framework for MIT-BIH Arrhythmia data set highlighting the layer types, output shape and number of parameters.

Layer (type)	Output shape	Parameters
Conv1D_1	182, 32	192
Max-pooling1D	91, 32 32	0
Conv1D_2	87, 32	5152
Max-pooling1D	43, 32	0
Bidirectional_1	43, 64	16640
Bidirectional_2	43, 64	24832
Dense_ 1	16	1040
Dense_ 2	5	85
Total number of parameters:	47,941
Trainable parameters:	47,941
Non-trainable parameters:	0

Figure 4.

Proposed CNN-Bi-LSTM architecture.

Proposed CNN-Bi-LSTM architecture. Model summary of the convolutional neural network and bi-directional long short-term memory (CNN-Bi-LSTM) framework for MIT-BIH Arrhythmia data set highlighting the layer types, output shape and number of parameters.

Performance matrices

The performance of the developed model was calculated in terms of accuracy, recall, precision, and specificity. These performance metrics are described next: Accuracy: is described as “the amount to which the outcome of a measurement correlates to the proper value or a norm”, and it relates to the proximity of a measure to its accepted values. Recall: also known as sensitivity, is the ability of a test to discover patients with a condition correctly. Sensitivity measures how accurate a test produces a positive result (also known as the “true positive” rate) for people who have tested the ailment. is described as “performing precisely” and relates to the proximity of two or more measures, whether or not they are accurate. Precision measurements have a risk of being erroneous. refers to a test’s capacity to reliably produce a negative result for those who do not have the disease assessed (also known as the “true negative” rate).

Results

The model was trained and tested to recognize arrhythmia disease using CNN and Bi-LSTM algorithms. The Jupyter Notebook application is used to build and evaluate our deep learning model. The pypi package wfdb is used to import the ECG and annotation and then show all the annotation in order to visualize the distribution of the cardiac rhythm. Table 2 describes the achieved training, validation and testing overall performance parameters of the proposed technique on both data sets. The developed model comprises two layers of conv1D along with 1D max-pooling and two layers of Bi-LSTM. The number of filters is set at 32, and the kernel size is fixed at 5, which achieved higher training, validation and testing accuracies of MIT-BIH and St-Petersburg data sets. A stratified random test was conducted by splitting the MIT-BIH and St-Petersburg data set into different parts. In total, 60% of the data was utilized for model training 20% for validation, and the remaining 20% was used to test the model. The procedure was repeated until all of the data had been examined.

Table 2.

Overall performance parameters of training, validation and testing values of the proposed convolutional neural network and bi-directional long short-term memory (CNN-Bi-LSTM) model using both data sets.

Performance parameters	MIT-BIH data set			St-Petersburg data set
	Train values	Valid values	Test values	Train values	Valid values	Test values
Accuracy	100%	98.0%	98.0%	98.0%	95.0%	95.0%
Recall	100%	92.20%	91.0%	98.2%	92.4%	92.2%
Precision	100%	88.0%	88.20%	98.2%	75.0%	73.8%
Specificity	99.2%	92.02%	90.96%	97.6%	92.5%	92.3%
F1-score	100%	89.80%	89.80%	98.2%	81.0%	80.4%

Model performance

Prior to finding the optimal hyperparameter, we train our model with a variety of parameters, including the number of filters from 32 to 128, kernel size from 3 to 11, batch normalization, exponential Linear Unit (ELU), and a dropout rate of 0.25, until we find the optimal performance parameters, which are as follows: Adam optimizer, conv1d filter size 32, kernel size 5, kernel_regularizer L2, ReLU activation function, max-pooling size 2, Bi-LSTM 32, epochs 100, dense 16, and finally dense 5 with softmax function. The confusion matrix for each class of validation and testing of the MIT-BIH data set is shown in Figures 5 and 6. While the confusion matrix of the St. Petersburg data set is shown in Figures 7 and 8 which indicates the number of correctly predicted samples against incorrectly predicted samples for each class.

Figure 5.

Validation-based confusion matrix for arrhythmia categorization using the MIT-BIH data set.

Figure 6.

Testing-based confusion matrix for arrhythmia categorization using the MIT-BIH data set.

Figure 7.

Validation-based confusion matrix for arrhythmia categorization using the St-Petersburg data set.

Figure 8.

Testing-based confusion matrix for arrhythmia categorization using the St-Petersburg.

Validation-based confusion matrix for arrhythmia categorization using the MIT-BIH data set. Testing-based confusion matrix for arrhythmia categorization using the MIT-BIH data set. Validation-based confusion matrix for arrhythmia categorization using the St-Petersburg data set. Testing-based confusion matrix for arrhythmia categorization using the St-Petersburg.

Validation and training outcomes

The developed network performed well in investigations of rhythm epoch classification. After 100 epochs of training, the categorization accuracy values for the MIT-BIH data set were almost 100%, with a relatively flat curve. Whereas the valid set had an accuracy of between 96% and 98%, and the curve was stable (Figure 9). Consequently, the training set’s loss curve was almost zero and very flat. In comparison, valid loss varied between 0 and 0.2 across epochs (Figure 10), suggesting that the cross-entropy loss function has a high level of generalization. The five standard statistical measures were utilized to determine the effectiveness of each categorization algorithm: recall, accuracy, precision, specificity, and f1-score. The overall performance demonstrates that the MIT-BIH data set is always superior to the St-Petersburg data set in terms of five performance matrices. Figure 11 depicts our network’s receiver operating characteristic (ROC) curves on the validation data set. The dashed diagonal line indicates a random performance with a 0.5 ROC. Our network demonstrated a high ROC of 1.

Figure 9.

Model accuracy curve.

Figure 10.

Model loss curve.

Figure 11.

Receiver operating characteristic (ROC) curve of the proposed model.

Model accuracy curve. Model loss curve. Receiver operating characteristic (ROC) curve of the proposed model.

Discussions

Doctors are frequently required to evaluate and diagnose cardiac problems using single-lead or multiple-lead ECG readings in clinical applications. However, an efficient and low-complexity automatic CVD identification model is required due to the heavy burden associated with a clinical diagnosis and the disparity in doctors’ expertise levels. A novel technique integrating CNN with Bi-LSTM explores how well this combination performs in detecting and classifying heartbeat occurrences. There exist considerable work in the literature to detect ECG signals in the MIT-BIH arrhythmia database. Table 3 summarises some of these investigations. Rajendra et al. have achieved 92.50% accuracy performance for identifying four kinds of arrhythmia classes by applying 11 layers of CNN. Acharya et al. designed a nine-layer CNN model capable of automatically classifying five distinct types of heartbeats in ECG readings achieving the accuracy of 94.03%. Shu Lih Oh et al. reported 98.10% accuracy for the categorization of five classes by using the CNN-LSTM model. Li Guo et al. used DenseNet and gated recurrent unit networks GRU to overcome the challenge of inter-patient ECG categorization. In this work they categorize five ECG classes from MIT-BIH data set. Xue Xu et al. suggested a model for arrhythmia classification based on deep learning. Their investigation used a one-dimensional CNN and Bi-LSTM model to classify ECG data into five categories and found that this strategy achieved an accuracy of 95.90%. Our proposed CNN-Bi-LSTM technique outperforms the existing deep learning techniques for arrhythmia classification by achieving superior accuracy 98.0% and sensitivity 91.0% performance.

Table 3.

Summary of the proposed network comparing with state-of-the-art methodologies.

Work	Architecture	Preprocessing	Classes	Accuracy (%)	Specificity (%)	Recall (%)
Rajendra et al.²¹	11 layers CNNs	Yes	4	92.50	93.10	98.06
Acharya, et al.⁴⁸	9 layers CNN	Yes	5	94.03	91.54	94.03
Shu Lih et al.²³	CNN-LSTM	Yes	5	98.10	98.70	97.50
Li Guo et al.⁴⁹	CNN	No	5	93.71	94.77	91.25
Chen Chen²⁶	CNN + LSTM	Yes	6	96.62	96.80	95.40
Alqudah et al.¹¹	CNN	Yes	5	97.80	99.40	97.80
W. Jung et al.⁵⁰	WKNN	Yes	4	96.12	99.97	96.12
Shraddha et al.⁵¹	RNN-LSTM	No	2	88.10	92.40	83.35
Vandana et al.⁵²	Ensemble SVMs	Yes	4	94.05	92.96	92.84
Our work	CNN-Bi-LSTM	No	5	98.00	90.96	91.00

SVM: support vector machine; CNN-Bi-LSTM: convolutional neural network and bi-directional long short-term memory; CNN: convolutional neural network; RNN: recurrent neural network.

Summary of the proposed network comparing with state-of-the-art methodologies. SVM: support vector machine; CNN-Bi-LSTM: convolutional neural network and bi-directional long short-term memory; CNN: convolutional neural network; RNN: recurrent neural network.

Conclusion

Arrhythmia is a severe CVD that can be predicted via ECG segment processing. Arrhythmia must be accurately diagnosed and prevented early to reduce cardiac disease. Our proposed system model met the study’s primary goal of assisting doctors in swiftly determining the kind of ECG or verifying their diagnostics in a medical context while maintaining a high level of precision and cost. In this work, a CNN-Bi-LSTM model is proposed to categorize five categories of ECG fragments to construct an effective and resilient autonomous computer-aided diagnosis system. The developed network achieved maximum accuracies of 100%, 98.0%, and 98.0% of training, validation, and testing using MIT-BIH data set. In comparison, the St-Petersburg data set achieved 98.0%, 95.0%, and 95.0% accuracies of training, validation, and testing in identifying arrhythmia. This research showed many advantages, including its ability to help clinicians reliably make ECG recording-related clinical decisions. Moreover, it was intended to be as simple as possible while delivering the most significant performance. The described method is straightforward for health professionals and does not involve signal modification or feature extraction. Additionally, this research focused only on one kind of CVD, namely, arrhythmia, whereas the manifestations of cardiac disease are often complex and varied. As a result, more types of ECG data will need to be added to broaden the scope of the planned network.

17 in total

1. The impact of the MIT-BIH arrhythmia database.

Authors: G B Moody; R G Mark
Journal: IEEE Eng Med Biol Mag Date: 2001 May-Jun

2. Long short-term memory.

Authors: S Hochreiter; J Schmidhuber
Journal: Neural Comput Date: 1997-11-15 Impact factor: 2.026

3. A new approach for arrhythmia classification using deep coded features and LSTM networks.

Authors: Ozal Yildirim; Ulas Baran Baloglu; Ru-San Tan; Edward J Ciaccio; U Rajendra Acharya
Journal: Comput Methods Programs Biomed Date: 2019-05-10 Impact factor: 5.428

4. Automated diagnosis of arrhythmia using combination of CNN and LSTM techniques with variable length heart beats.

Authors: Shu Lih Oh; Eddie Y K Ng; Ru San Tan; U Rajendra Acharya
Journal: Comput Biol Med Date: 2018-06-05 Impact factor: 4.589

5. Arrhythmia detection using deep convolutional neural network with long duration ECG signals.

Authors: Özal Yıldırım; Paweł Pławiak; Ru-San Tan; U Rajendra Acharya
Journal: Comput Biol Med Date: 2018-09-15 Impact factor: 4.589

6. Heartbeat Sound Signal Classification Using Deep Learning.

Authors: Ali Raza; Arif Mehmood; Saleem Ullah; Maqsood Ahmad; Gyu Sang Choi; Byung-Won On
Journal: Sensors (Basel) Date: 2019-11-05 Impact factor: 3.576

7. Multifactorial e- and mHealth interventions for cardiovascular disease primary prevention: Protocol for a systematic review and meta-analysis of randomised controlled trials.

Authors: Artur Direito; Jonathan Rawstorn; Jacqueline Mair; Reza Daryabeygi-Khotbehsara; Ralph Maddison; E Shyong Tai
Journal: Digit Health Date: 2019-11-21

Review 8. Mechanisms of cardiac arrhythmias.

Authors: Gary Tse
Journal: J Arrhythm Date: 2015-12-17

9. Exploring feature selection and classification methods for predicting heart disease.

Authors: Robinson Spencer; Fadi Thabtah; Neda Abdelhamid; Michael Thompson
Journal: Digit Health Date: 2020-03-29