Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Diagnosing Left Bundle Branch Block in 12-lead Electrocardiogram using Self-Attention Convolutional Neural Networks

Alireza Sadeghi, Alireza Rezaee, Farshid Hajati
doi: https://doi.org/10.1101/2023.06.25.23291867
Alireza Sadeghi
1Intelligent Mobile Robot Lab (IMRL), Department of Mechatronics Engineering, Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: alrz.sadeghi{at}ut.ac.ir
Alireza Rezaee
1Intelligent Mobile Robot Lab (IMRL), Department of Mechatronics Engineering, Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Farshid Hajati
2College of Engineering and Science, Victoria University Sydney, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

The left bundle branch block is a cardiac conduction disorder that affects the heart’s electrical system. It causes the left ventricle of the heart to contract later than the right ventricle, leading to an irregular heartbeat. The diagnosis of left bundle branch block is crucial in determining the most effective treatment for heart irregularities, including cardiac resynchronization therapy. Cardiac resynchronization therapy uses a pacemaker-like device to resynchronize the heart’s contractions and improve its function. However, diagnosing left bundle branch block accurately can be challenging using traditional diagnostic methods that rely on electrocardiograms. This study introduces Self-Attention Convolutional Neural Networks for the detection of left bundle branch block from 12-lead electrocardiograms data using SE-Residual blocks and a self-attention mechanism to highlight important input data features for more accurate diagnosis of left bundle branch block. The model is trained and validated on a database of 10,344 12-lead electrocardiogram samples using a 10-fold cross-validation approach. The results demonstrate an accuracy of 98.91% ± 0.1%, specificity of 99.28% ± 0.15%, precision of 73.09% ± 3.5%, recall or sensitivity of 82.83% ± 4.34%, F1 score of 77.5% ± 1.59%, and area under the receiver operating characteristics curve of 0.91 ± 0.02. The experimental results demonstrate that the proposed deep learning model achieves high accuracy, specificity, and F1 score. These findings suggest that the proposed model can serve as an effective diagnostic tool for identifying left bundle branch block with a high level of efficiency, improving the diagnostic process, and promoting early treatment in medical settings.

1. INTRODUCTION

Cardiac resynchronization therapy (CRT) is a medical procedure that involves the placement of a pacemaker in the body to correct irregular heartbeats [1]. This therapy has been found to improve the health of heart patients and increase their lifespan [2, 3]. It is generally more effective in individuals with a QRS complex length of over 120 milliseconds on their electrocardiogram (ECG). Moreover, research suggests that a wider QRS complex or the presence of left bundle branch block (LBBB) on the ECG affect CRT procedure [3–6]. Therefore, the presence or absence of LBBB is critical in determining the efficacy of CRT in heart disease. LBBB is a heart rhythm disorder that is characterized by faulty left ventricular contraction due to a partial or complete malfunction of the electrical system on the left side of the heart. The American Heart College and the American Heart Association provide specific criteria for identifying LBBB on an ECG [7]. However, numerous studies [3, 5, 8–10] have shown that conventional methods and criteria for diagnosing LBBB are prone to errors and mistakes in over 30% of cases. Hence, an accurate and comprehensive system for ECG data analysis and LBBB diagnosis is crucial.

Computer Aided Diagnosis (CAD), used since 1966, is a field of study that uses advanced technology to assist physicians in diagnosing diseases and its objective is to improve the accuracy and efficiency of medical diagnoses [11, 12]. The use of CAD systems has many benefits, including increased accuracy and efficiency of diagnoses, reduced errors and variability between physicians, and improved patient outcomes [13]. The development of CAD systems also has the potential to reduce healthcare costs by reducing the need for unnecessary tests and procedures [13]. Recently, deep neural networks have become essential components of intelligent CAD systems as they enable computers to learn from data and improve their accuracy over time [14, 15]. Deep learning algorithms can recognize complex patterns in medical data and identify subtle changes that may be missed by human eyes.

2. RELATED WORKS

The detection of left bundle branch block (LBBB) in electrocardiogram (ECG) data has become increasingly widespread through the use of machine learning and deep learning techniques. Initially, research in this field focused on utilizing small datasets with few channels or leads. For instance, in [16], researchers aimed to classify ECG records from the MIT-BIH dataset [17] into three categories: normal, right bundle branch block (RBBB), and left bundle branch block (LBBB). The dataset included 48 long ECG signals with leads MLII and V5 from 47 patients, and this study utilized 11 noisy signals. The researchers employed the structural and morphological properties of the ECG signals, including RR intervals and PQ intervals, as features for classification. They utilized a Genetic Algorithm (GA) to reduce the number of features, and the final set of features was then classified using a neural network, which achieved a classification accuracy of 98.71%. In addition, the researchers experimented with Principal Component Analysis (PCA) for feature reduction, but the results were not as good as the GA experiment, achieving an accuracy of 77.66%.

In a different study, the goal of [18] was to classify electrocardiogram (ECG) records into three classes: LBBB, RBBB, and normal signals. The researchers utilized 11 samples from the MIT-BIH dataset [17], consisting of 5 normal, 3 LBBB, and 3 RBBB signals. To identify appropriate features for final classification, they introduced the adaptive bacterial forage optimization (ABFO) algorithm, an enhanced version of the bacterial forage optimization (BFO) algorithm [19, 20]. The ABFO algorithm was utilized to eliminate groups with low searching methods, leading to the identification of appropriate groups with effective search strategies. A total of 20 features were produced by the ABFO algorithm, which were then input into the Levenberg–Marquardt Neural Network [21] for classification. The resulting system achieved accurate classification of ECG signals with an accuracy of 98.74%.

In 2018, another study for classifying ECG records into the three classes of LBBB, RBBB, and normal signals was conducted by [22]. In this study, the same MIT-BIH dataset [17] was used. Signals were first filtered using Pan Tompkins algorithm [23] and then different methods including PCA, Magnitude Squared Coherence (MSC) [24], and wavelet transform (WT) [25] were used on them to extract proper features. According to the study, the features generated by PCA were reliable for identifying LBBB, while those derived from MSC were effective in identifying RBBB. Features extracted by WT were also useful in providing stable time and frequency resolutions in noisy environments. To reduce the total number of features, only statistical properties, such as minimum value and median value, were used for the final classification. The final classification was performed using a Random Forest model consisting of 300 decision trees, resulting in an average accuracy of 98.4%.

In spite of previous studies which used hand crafted features for ECG classification, Yildirim et al. [26] proposed an end-to-end model utilizing deep learning techniques to classify 17 different heart arrhythmias, including LBBB. They utilized the same MIT-BIH dataset [17], but only the MLII lead was considered. The researchers generated 1,000 fragments from ECG records of 45 patients and trained the model on this data for 50 epochs. Input signals were first rescaled to the range of - 1 to 1 and then fed to a 16-layer deep model, consisting of 1D convolutional, batch normalization, max-pooling, and dense layers. To address the issue of overfitting, dropout layers [27] were also incorporated. The model demonstrated 100% accuracy in detecting LBBB complications. However, the model was designed to perform effectively at a segment-level input rather than a patient-level input. Additionally, the size of filters used in the convolutional layers was large (7, 10, 15), which may result in slower performance of the system in real-time applications.

The above-mentioned studies relied on small datasets, and often required expert knowledge for feature extraction. With the advent of larger ECG datasets that include signals from multiple leads (such as 3 leads or 6 leads), research has expanded to include the analysis of multi-lead signals. These multi-lead signals can provide a more comprehensive view of the heart’s electrical activity, offering models additional information during the training process. It is worth mentioning that 12-lead ECG signals contain a significant amount of information, making models trained on these signals highly reliable for diagnosing heart arrhythmias.

Ribeiro et al. [28] conducted a study to classify 12-lead ECG records into six different classes, which includes LBBB. Their proposed model includes convolutional, batch normalization, and dense layers, while residual blocks [29] serve as the primary components. The researchers trained the model for 50 epochs on a dataset comprising over two million samples, produced by the Telehealth Network of Minas Gerais (TNMG) [30]. The labels for these samples were only available in textual format, and the researchers utilized an automated method in the first step to extract these labels for each sample in the dataset. Additionally, 98% of the samples in the dataset were utilized for training and validation, while the remaining 2% was used for testing purposes. The proposed model achieved a F1-score of 100% for detecting LBBB.

In 2022, a study was conducted by Chen et al. [31] to classify 12-lead ECG samples into nine different heart arrhythmias, including Bundle Branch Block (both RBBB and LBBB). The authors proposed a deep learning model that consisted of convolutional, batch normalization, residual blocks, dropout, LSTM, attention, and dense layers. The model was trained for 200 epochs using a dataset introduced by Kaohsiung Medical University Hospital (KMUH) [32], which contained 19,253 different samples. To prepare suitable data for the study, the researchers took images from all ECG reports, removed personal information from each image, and digitized the images to produce 12-lead signals. The final model achieved an average accuracy of 96.02% across all arrhythmias. Furthermore, the researchers evaluated their proposed model on the CPSC-2018 dataset [33, 34], which contains 6,877 12-lead ECG signals, and the model achieved an average accuracy of 94.07%.

Previous efforts to use machine learning to diagnose LBBB have several drawbacks and limitations. Firstly, most of these studies require manual extraction of features from ECG data, a process that requires the involvement of experts in the field and increases the application cost. An end-to-end model that can detect LBBB directly from raw ECG data without human intervention would alleviate this issue. Secondly, these studies often use a limited number of training samples, which makes the resulting model vulnerable to overfitting and unreliable in practical applications. Training a deep learning model on a large ECG sample dataset can overcome this limitation and result in a more trustworthy model. Moreover, proposed models in some studies are computationally expensive and may cause many challenges in real-time applications. Lastly, the performance of these models is limited, and errors in diagnosis can have negative impacts on patients’ health, so a highly accurate and reliable system is necessary.

We propose a Self-Attention Convolutional Neural Network (SACNN) that utilizes SE-Residual blocks and attention mechanism for the classification of raw 12-lead ECG data. The ECG data acquired from the 2020 PhysioNet/Computing in Cardiology Challenge [35] was inputted into the SACNN which consisted of 1D convolutional layers, 4 SE-Residual blocks, and a soft-attention block. The SACNN model was then able to accurately identify cases of LBBB in the dataset with an accuracy of 99%.

3. MATERIALS AND METHODS

3.1. Data

To develop a model for diagnosing LBBB from 12-lead ECG signals, we used the dataset provided by the 2020 PhysioNet/Computing in Cardiology Challenge [35]. Emory University collected this dataset from 10,344 patients in the southeastern United States and includes two sets of files: record files and annotation files. The record files contain 12-lead ECG signals recorded at a 500 Hz sampling frequency ranging from 5 to 10 seconds in length. The annotation files contain information about patient demographics, diagnosis, medical history, symptoms, and surgical history. We created labels for each sample in the record file using the diagnosis information in the annotation file and used them to train the proposed model.

The dataset consists of ECG signals from both male and female patients. It contains 5,551 samples of males (53.66%) and 4,793 samples of females (46.33%). The dataset also exhibits age diversity, with ECG data ranging from 14 to 89 years of age, and the average age of patients is approximately 60 years old. Out of the total number of samples, 231 (2.23%) are found to have LBBB abnormalities in their ECG recordings. Of those 231 samples, 108 (46.75%) belong to male patients, and 123 (53.25%) belong to female patients. Additionally, 81.39% of patients with LBBB are 60 years of age or older. Figure 1 shows the demographic distribution of the studied population.

Figure 1.
  • Download figure
  • Open in new tab
Figure 1.

a) LBBB existance ratio, b) Age distribution of patients

We have processed the original data in two steps. First, we normalize the data using the z-score normalization method which addresses the issue of amplitude scaling and offset effect [36]. Second, we standardize the duration of all signals to 8 seconds. Short signals were extended using zero padding, while long ones were truncated to retain just the first 8 seconds (see Figure 2). It should be noted that to enhance the robustness of our model against faulty inputs, we deliberately refrained from removing any compromised ECG signals or utilizing any noise cancellation methods.

Figure 2.
  • Download figure
  • Open in new tab
Figure 2.

a) Extending short data, b) Truncating long data

3.2. Self-Attention Convolutional Neural Network (SACNN)

Figure 3 illustrates the architecture of the proposed soft-attention convolutional neural network (SACNN). In this architecture, several weighted (convolutional) and non-weighted (batch normalization and activation function) layers are first applied to the processed 12-lead ECG signals. Following this, a convolutional layer with a filter size of 3 and stride size of 2 is applied, followed by a pack of 4 SE-Res blocks which are comprehensively discussed later in this section. To reduce the feature size, a max pooling layer with a size of 7 is deployed, after which a self-attention block is utilized. It is worth noting that the filter size of all Conv1D layers utilized in the model is set to 3 which is the smallest size that can effectively capture the information of a single point and its neighborhood. The features extracted through these layers are ultimately fed into a set of fully connected layers for the final classification. Throughout the remainder of this section, we will introduce the various blocks that have been implemented in SACNN in detail.

Figure 3.
  • Download figure
  • Open in new tab
Figure 3.

Structure of SACNN (top) which contains several SE-Res blocks (buttom-middle and left) and a self-attention block (buttom-right).

The information contained in different channels of a signal is not equally important. Some channels have a greater impact on the final results than others. To address this challenge, Squeeze and Excitation (SE) blocks have been designed [37]. These blocks magnify the importance of more crucial channels and reduce the importance of less important ones.

The SE blocks consist of two steps. The first step is called squeeze, in which the information of each channel is compressed into a scalar value. The second step is called excitation, which transforms the scalar values obtained in the first step into an array of scalars, denoted as s that indicate the importance of each channel. Here, Global Average Pooling (GAP) is chosen as the function for the transformation in the first step, and a neural network is used to determine the values of the array s.

Every input signal to a SE block is represented as X ∈ Rm×n, where n is the number of channels and m is the number of features in each channel. Embedded Image

where σ and δ are Sigmoid and ReLU [38] activation functions, respectively. During the training process, the values of Embedded Image and Embedded Image are adjusted to achieve the desired values. In the proposed model, the r parameter is set to 8. The array s contains scalar values, each indicating the importance of its corresponding channel. every channel in X is multiplied by its corresponding value in the array s and a new tensor denoted as X′is generated and used as the input for the next layer.

Figure 4.
  • Download figure
  • Open in new tab
Figure 4.

The excitation part of the SE block.

In SACNN, we also employed the residual block proposed in [39] with some modifications. Specifically, we replaced all the layers in the original residual block with their corresponding 1D counterparts, given that our inputs are unidimensional. Additionally, the second Conv1D layer in our residual architecture was assigned a dilation rate of 2 to capture a wider receptive field. The input tensor, which has been transformed by the Conv1D layers within the residual blocks, is further processed by a SE block. The SE block assigns greater influence on the more important channels of the input tensor, thereby enhancing the output. The resulting tensor is added to the original input signal, completing the residual connection within the block: Embedded Image

where X and X′ are the input and output of a residual block respectively, F(X) is the transformation function through the Conv1D layers in the residual block, and G(X) is the transformation function of the SE block.

Each standard Conv1D layer with a filter size of 3 captures the information of 3 consecutive points in each input. By stacking 2 of such layers, each output point can retrieve information from 5 consecutive points in the input. Conversely, a dilated Conv1D (dilation rate = 2) layer captures information from every other consecutive point (see Figure 5(b)). Thus, by stacking a standard Conv1D layer with a dilated Conv1D layer, each output point can access information from 7 points in the input layer. The difference between a standard Conv1D layer and a dilated Conv1D layer is illustrated in Figure 5.

Figure 4.
  • Download figure
  • Open in new tab
Figure 4.

Comparison of a convolutional layer and dilated convolutional layer.

Figure 5.
  • Download figure
  • Open in new tab
Figure 5.

The process of generating the attention weights in the self-attention block in the SACNN

In this study, a single-head self-attention model was employed to analyze the extracted features through the SE-Res Blocks, with the aim of amplifying the impact of more important features on the final output. We use a method proposed in [40] to generate the attention weights. The equations below demonstrate how attention weights are calculated. Embedded Image Embedded Image

where α contains the attention weights, X(m×n) is the input signal and to simplify the equation, Q is set to an adequate identical matrix. Embedded Image is defined during the training process by implementing a dense neural network. The SoftMax function ensures that the sum of attention weights is equal to 1. Figure 6 illustrates the process of generating attention weights in the attention block of our model.

Figure 6.
  • Download figure
  • Open in new tab
Figure 6.

The results of SACNN on the training and the validation sets.

After calculating the attention weights, the output of the attention block, X′ ∈ Rm×n, is generated by the equation below. Embedded Image

where ⊙ is element-wise multiplication.

It should be noted that each component in the output signal of the attention block in our model contains information from all the features of the input signal (via attention weights), which is not feasible using simple Conv1D or SE-Res blocks. This characteristic of the attention block enhances the performance of SACNN.

4. RESULTS

To evaluate the proposed model for diagnosing LBBB in 12-lead ECG signals, we used the dataset provided by the 2020 PhysioNet/Computing in Cardiology Challenge [35]. After preprocessing, the data was divided into a training-validation set consisting of 8,000 samples and a test set consisting of 2,344 samples. The data was split in such a way that both training-validation and test sets had an equal representation of LBBB. A stratified 10-fold cross-validation was applied to the training-validation set, with each fold using 90% of the data for training and the remaining 10% for validation. This stratified approach ensured that the proportion of LBBB samples was equal in the training and validation sets for all folds.

Due to the imbalanced data with only 2.23% of positive samples, it is crucial to choose appropriate metrics to evaluate the performance of the model. Therefore, we used multiple metrics, including accuracy, specificity, F1 score, and the area under the ROC curve, to assess the model’s performance [41]. The final classification of the model is determined by an optimal threshold, which was chosen by searching for a value that maximizes the F1 score, a more robust metric for imbalanced data. Ten different threshold values were obtained and evaluated for all ten models trained in different folds, resulting in a final optimal threshold of 0.488.

To establish a benchmark for comparison with the final results of the proposed model, several machine learning and deep learning models were implemented on the dataset. The machine learning models include Random Forest and Light Gradient Boosting Machine. These models require discrete features as inputs for classification. Therefore, statistical characteristics of two indicators from ECG signals (RR intervals and voltage values in R waves) are used as features. These statistical characteristics are first 5 percentile, first quartile, mean, third quartile, last 5 percentile, standard deviation, variance, and median. As a result, each sample was transformed into 192 discrete features (8 statistical characteristics of two indicators for 12 leads), which are used by the baseline models to classify a given sample. In addition to these machine learning models, deep learning models developed by Oh et al. [42] and Ribeiro et al. [28] are also applied to the dataset. They are popular deep learning models which perform properly on ECG signal classification task and can be considered as an appropriate benchmark. Table 1 presents the outcome of implementing the benchmark machine learning and deep learning models, which is used as a point of reference for evaluating the performance of proposed model. To ensure a fair comparison, all models were trained using a 10-fold cross-validation method. Furthermore, all deep models were trained for 50 epochs using Adam [43] (learning rate = 0.001) and Binary Cross Entropy as the optimizer and the loss function respectively. In all models, the final classification is performed by utilizing a threshold value of 0.5. Finally, we computed the same metrics for the SACNN’s on the test set. The results are tabulted in Table 1. As shown in Table 1, the SACNN outperforms the benchmarks in almost all metrics utilizing the same training hyperparameters. This observation strongly suggests that the architecture of SACNN is better suited for the task of 12-lead ECG classification.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1.

The performance of the benchmark models on detecting LBBB in the 12-lead ECG signals.

To finetune SACNN, various optimizers, including Adam [43] (learning rate = 0.001), RMSProp [46] (learning rate = 0.001), and SGD (Learning rate = 0.001, momentum = 0.9), were tested and found that Adam performed the best, and was chosen as the optimizer. Moreover, during the training process, a learning rate scheduler was implemented, which began at a value of 10-3 for the first epoch and decreased uniformly concluding at a value of 10-8 for epoch number 50. The study also employed a focal binary cross-entropy loss function [47] to assess the degree of alignment between the predicted and actual classes of each sample. This loss function, which is a variant of the conventional binary cross-entropy, is particularly beneficial in classification tasks where the data is highly imbalanced. Additionally, the batch size and the number of epochs were set to the value of 32 and 50 respectively. The performance of SACNN after finetuning is as follows: accuracy of 98.91% ± 0.1%, specificity of 99.28% ± 0.15%, precision of 73.09% ± 3.5%, recall of 82.83% ± 4.34%, F1 score of 77.5% ± 1.59%, and AUC of 0.91 ± 0.02.

We also evaluated the learning curves of the SACNN on the dataset. Figure 7 illustrates the accuracy and loss curves for both the training and validation sets across the various epochs during training in Fold 1. The accuracy curve shows that the model achieved an impressive accuracy of 99% on the validation set in less than 5 epochs. Moreover, the loss curves indicate that the model was able to converge quickly and reach a satisfactory level of loss on the training and validation sets.

Figure 7.
  • Download figure
  • Open in new tab
Figure 7.

The confusion matrix of SACNN

The confusion matrices generated by the SACNN for diagnosing LBBB samples are presented in Figure 8. The number of false positives and false negatives is relatively low, with an average of only 16 false positives and 9 false negatives made by the SACNN.

Figure 8.
  • Download figure
  • Open in new tab
Figure 8.

a) Lead aVF in 2 given samples misclassified in all 10 folds, b) Lead aVF in a random sample classified correctly.

Overall, the experimental results demonstrate that our deep learning model is effective for ECG signals on this dataset and holds great potential for other similar applications.

During the evaluation on the test set, it was found that 54 (2.1%) out of the 2,432 samples were misclassified at least once. Of these 54 misclassified samples, 13 (24.07%) were incorrectly classified in all 10 folds. To investigate the reasons of this misclassification, we took a look on these 13 samples. Figure 9 displays the lead V4 in two of these 13 samples that were misclassified in every fold and a randomly selected sample that was classified correctly.

After observing the two incorrectly classified samples and comparing them with a correctly classified sample, it was noted that there was some form of abnormality present in their ECG. This abnormality could be due to medical personnel’s mistake, patient noncompliance, or improper movements during ECG recording, or even due to the noise [48–50]. To further investigate this, the noise in the samples that were misclassified in all folds was removed using the Savitzky-Golay filter [51]. The model was then reapplied to the denoised samples. The results of this are shown in Table 2. Table 2 shows that the model’s ability to correctly classify all-fold-misclassified samples improves by over 20% through the denoising process. This shows that the model is more effective when there is less ECG defect present. However, we do not include the denoising to our training process as we aim to make the model robust to these effects and more prepared for the real-world circumstances.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2.

How all-fold-misclassified samples are classified befor and after denoising

5. DISCUSSION

The proposed study presents a new deep learning model for accurately detecting LBBB, an arrhythmia in heart patients. The model is composed of SE-Res blocks and an attention block, and was validated using the 2020 PhysioNet/Computing in Cardiology Challenge data [35], which consists of 10,344 ECG samples with 2.23% suffering from LBBB. To ensure unbiased results, a ten-fold cross-validation method was used, and multiple performance metrics were applied to evaluate the performance of the system. The results showed that the proposed model achieved a high accuracy (98.91%) in detecting LBBB.

The proposed model was trained without using any noise-cancellation techniques, as this would have altered the structure of the ECG signals. This approach makes the model more practical for use in medical centers, as it can interpret the output of ECG recorders directly. In comparison with the benchmark machine learning models including logistic regression, random forest, and light gradient boosting machine, the proposed model outperforms them in all the evaluation metrics. The comparison proves that the proposed model is able to find the patterns in ECG signals which are more useful in diagnosing LBBB than the statistical characteristics of RR intervals that are commonly utilized by physicians.

A limitation of this study is that the patient’s clinical information, such as age, sex, and medical history, was not used in building the model. This information can play a role in the probability of a patient having LBBB arrhythmia, as the disease is more likely to occur in older individuals [52]. Unfortunately, such information was not fully provided in the used dataset.

6. CONCLUSION

This study introduces a novel deep neural network architecture called SACNN, which has been specifically developed for the purpose of detecting left bundle branch block (LBBB) from raw 12-lead electrocardiogram (ECG) recordings. Through comprehensive experimental evaluation, we have demonstrated the superior performance of SACNN in accurately identifying LBBB cases, surpassing other models designed for detecting heart arrhythmias. Furthermore, by evaluating the model’s performance on a large dataset, we have showcased the robust generalization capabilities of SACNN, thus highlighting its potential for real-world applications. It is worth noting that, in order to maintain the fidelity of the ECG inputs to actual raw recordings, fraud samples were included and no noise cancellation techniques were employed.

The findings of this research strongly support the suitability of the SACNN model for automatic and precise diagnosis of LBBB using 12-lead ECG data. This has significant implications for the future of LBBB screening and early detection in primary care settings, as it has the potential to alleviate the burden on healthcare professionals. Moreover, the SACNN architecture can be adapted and expanded to detect other cardiovascular conditions, thus broadening its impact on the field of cardiology and healthcare.

In conclusion, the SACNN model represents a significant advancement in the application of deep learning techniques for the detection of LBBB from 12-lead ECG recordings. This research not only sets the stage for more accurate and efficient LBBB diagnosis, but also serves as a foundation for future investigations into similar deep learning-based approaches for detecting and diagnosing other cardiovascular abnormalities.

Data Availability

The study used ONLY openly available human data that were originally provided by the 2020 PhysioNet/Computing in Cardiology Challenge

Footnotes

  • arrezaee{at}ut.ac.ir, farshid.hajati{at}vu.edu.au

References

  1. [1].↵
    S. S. Barold, “What is cardiac resynchronization therapy?,” The American journal of medicine, vol. 111, no. 3, pp. 224–232, 2001.
    OpenUrlCrossRefPubMedWeb of Science
  2. [2].↵
    S. Cazeau et al., “Effects of multisite biventricular pacing in patients with heart failure and intraventricular conduction delay,” New England Journal of Medicine, vol. 344, no. 12, pp. 873–880, 2001.
    OpenUrlCrossRefPubMedWeb of Science
  3. [3].↵
    J. Cleland, “Cardiac Resynchronization-Heart Failure (CARE-HF) Study Investigators: The effect of cardiac resynchronization on morbidity and mortality in heart failure,” N Engl J Med, vol. 352, pp. 1539–1549, 2005.
    OpenUrlCrossRefPubMedWeb of Science
  4. [4].
    M. R. Bristow, et al., “Cardiac-resynchronization therapy with or without an implantable defibrillator in advanced chronic heart failure,” New England Journal of Medicine, vol. 350, no. 21, pp. 2140-2150, 2004.
    OpenUrlCrossRefPubMedWeb of Science
  5. [5].↵
    A. J. Moss, et al., “Cardiac-resynchronization therapy for the prevention of heart-failure events,” New England Journal of Medicine, vol. 361, no. 14, pp. 1329-1338, 2009.
    OpenUrlCrossRefPubMedWeb of Science
  6. [6].↵
    A. S. Tang et al., “Resynchronization/defibrillation for ambulatory heart failure trial: rationale and trial design,” Current Opinion in Cardiology, vol. 24, no. 1, pp. 1–8, 2009.
    OpenUrlCrossRefPubMedWeb of Science
  7. [7].↵
    D. Scherbak and G. J. Hicks, “Left Bundle Branch Block,” in StatPearls [Internet]: StatPearls Publishing, 2021.
  8. [8].↵
    C. A. Egoavil, R. T. Ho, A. J. Greenspon, and B. B. Pavri, “Cardiac resynchronization therapy in patients with right bundle branch block: analysis of pooled data from the MIRACLE and Contak CD trials,” Heart Rhythm, vol. 2, no. 6, pp. 611–615, 2005.
    OpenUrlCrossRefPubMedWeb of Science
  9. [9].
    K. C. Bilchick, S. Kamath, J. P. DiMarco, and G. J. Stukenborg, “Bundle-branch block morphology and other predictors of outcome after cardiac resynchronization therapy in Medicare patients,” Circulation, vol. 122, no. 20, pp. 2022-2030, 2010.
    OpenUrlAbstract/FREE Full Text
  10. [10].↵
    W. Zareba, et al., “Effectiveness of cardiac resynchronization therapy by QRS morphology in the multicenter automatic defibrillator implantation trial–cardiac resynchronization therapy (MADIT-CRT),” Circulation, vol. 123, no. 10, pp. 1061-1072, 2011.
    OpenUrlAbstract/FREE Full Text
  11. [11].↵
    J. Yanase and E. Triantaphyllou, “A systematic survey of computer-aided diagnosis in medicine: Past and present developments,” Expert Systems with Applications, vol. 138, p. 112821, 2019.
    OpenUrl
  12. [12].↵
    M. L. Giger, “Machine learning in medical imaging,” Journal of the American College of Radiology, vol. 15, no. 3, pp. 512–520, 2018.
    OpenUrl
  13. [13].↵
    B. Al Mohammad, P. C. Brennan, and C. Mello-Thoms, “A review of lung cancer screening and the role of computer-aided detection,” Clinical radiology, vol. 72, no. 6, pp. 433–442, 2017.
    OpenUrl
  14. [14].↵
    Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” nature, vol. 521, no. 7553, pp. 436–444, 2015.
    OpenUrlCrossRefPubMed
  15. [15].↵
    I. Goodfellow, Y. Bengio, and A. Courville, Deep learning. MIT press, 2016.
  16. [16].↵
    R. Allami, A. Stranieri, V. Balasubramanian, and H. F. Jelinek, “A genetic algorithm-neural network wrapper approach for bundle branch block detection,” in 2016 Computing in Cardiology Conference (CinC), 2016: IEEE, pp. 461–464.
  17. [17].↵
    G. B. Moody and R. G. Mark, “The impact of the MIT-BIH arrhythmia database,” IEEE engineering in medicine and biology magazine, vol. 20, no. 3, pp. 45–50, 2001.
    OpenUrlCrossRefPubMedWeb of Science
  18. [18].↵
    P. Kora and S. R. K. Kalva, “Detection of bundle branch block using adaptive bacterial foraging optimization and neural network,” Egyptian Informatics Journal, vol. 18, no. 1, pp. 67–74, 2017.
    OpenUrl
  19. [19].↵
    Y. Liu and K. Passino, “Biomimicry of social foraging bacteria for distributed optimization: models, principles, and emergent behaviors,” Journal of optimization theory and applications, vol. 115, pp. 603–628, 2002.
    OpenUrl
  20. [20].↵
    K. M. Passino, “Biomimicry of bacterial foraging for distributed optimization and control,” IEEE control systems magazine, vol. 22, no. 3, pp. 52–67, 2002.
    OpenUrlCrossRefWeb of Science
  21. [21].↵
    M. I. Ibrahimy, R. Ahsan, and O. O. Khalifa, “Design and optimization of Levenberg-Marquardt based neural network classifier for EMG signals to identify hand motions,” Measurement Science Review, vol. 13, no. 3, pp. 142–151, 2013.
    OpenUrl
  22. [22].↵
    C. Hao, S. Sivanesan, M. Majmudar, and K. S. Rajput, “Combinational feature based random forest classification for enhanced bundle branch block beat detection,” in 2018 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), 2018: IEEE, pp. 319–322.
  23. [23].↵
    J. Pan and W. J. Tompkins, “A real-time QRS detection algorithm,” IEEE transactions on biomedical engineering, no. 3, pp. 230–236, 1985.
    OpenUrl
  24. [24].↵
    P. Kora and K. S. Ramakrishna, “Classification bundle block detection using magnitude squared coherence,” ARPN Journal of Engineering and Applied Sciences, vol. 10, no. 14, pp. 5984-5989, 2015.
    OpenUrl
  25. [25].↵
    R. Ceylan and Y. Özbay, “Comparison of FCM, PCA and WT techniques for classification ECG arrhythmias using artificial neural network,” Expert Systems with Applications, vol. 33, no. 2, pp. 286–295, 2007.
    OpenUrl
  26. [26].↵
    Ö. Yıldırım, P. Pławiak, R.-S. Tan, and U. R. Acharya, “Arrhythmia detection using deep convolutional neural network with long duration ECG signals,” Computers in biology and medicine, vol. 102, pp. 411–420, 2018.
    OpenUrlCrossRef
  27. [27].↵
    N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout: a simple way to prevent neural networks from overfitting,” The journal of machine learning research, vol. 15, no. 1, pp. 1929–1958, 2014.
    OpenUrl
  28. [28].↵
    A. H. Ribeiro, et al., “Automatic diagnosis of the 12-lead ECG using a deep neural network,” Nature communications, vol. 11, no. 1, p. 1760, 2020.
    OpenUrl
  29. [29].↵
    K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778.
  30. [30].↵
    M. B. Alkmim et al., “Improving patient access to specialized health care: the Telehealth Network of Minas Gerais, Brazil,” Bulletin of the World Health Organization, vol. 90, pp. 373–378, 2012.
    OpenUrlCrossRefPubMed
  31. [31].↵
    C.-Y. Chen et al., “Automated ECG classification based on 1D deep learning network,” Methods, vol. 202, pp. 127–135, 2022.
    OpenUrl
  32. [32].↵
    “Kaohsiung Medical University Hospital Research Database.” https://www.kmuh.org.tw/english/main5-2-9.asp. (accessed.
  33. [33].↵
    “The China Physiological Signal Challenge 2018.” http://2018.icbeb.org/Challenge.html (accessed.
  34. [34].↵
    F. Liu, et al., “An open access database for evaluating the algorithms of electrocardiogram rhythm and morphology abnormality detection,” Journal of Medical Imaging and Health Informatics, vol. 8, no. 7, pp. 1368-1373, 2018.
    OpenUrlCrossRef
  35. [35].↵
    E. A. P. Alday et al., “Classification of 12-lead ecgs: the physionet/computing in cardiology challenge 2020,” Physiological measurement, vol. 41, no. 12, p. 124003, 2020.
    OpenUrl
  36. [36].↵
    D. Singh and B. Singh, “Investigating the impact of data normalization on classification performance,” Applied Soft Computing, vol. 97, p. 105524, 2020.
    OpenUrl
  37. [37].↵
    J. Hu, L. Shen, and G. Sun, “Squeeze-and-excitation networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 7132–7141.
  38. [38].↵
    V. Nair and G. E. Hinton, “Rectified linear units improve restricted boltzmann machines,” in Proceedings of the 27th international conference on machine learning (ICML-10), 2010, pp. 807–814.
  39. [39].↵
    K. He, X. Zhang, S. Ren, and J. Sun, “Identity mappings in deep residual networks,” in Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part IV 14, 2016: Springer, pp. 630–645.
  40. [40].↵
    A. Vaswani et al., “Attention is all you need,” Advances in neural information processing systems, vol. 30, 2017.
  41. [41].↵
    N. Seliya, T. M. Khoshgoftaar, and J. Van Hulse, “A study on the relationships of classifier performance metrics,” in 2009 21st IEEE international conference on tools with artificial intelligence, 2009: IEEE, pp. 59–66.
  42. [42].↵
    S. L. Oh, E. Y. Ng, R. San Tan, and U. R. Acharya, “Automated diagnosis of arrhythmia using combination of CNN and LSTM techniques with variable length heart beats,” Computers in biology and medicine, vol. 102, pp. 278–287, 2018.
    OpenUrl
  43. [43].↵
    D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.
    OpenUrl
  44. [44].
    L. Breiman, “Random forests,” Machine learning, vol. 45, pp. 5–32, 2001.
    OpenUrlCrossRef
  45. [45].
    G. Ke et al., “Lightgbm: A highly efficient gradient boosting decision tree,” Advances in neural information processing systems, vol. 30, 2017.
  46. [46].↵
    T. a. G. H. Tieleman, “Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning, 2012. 4(2): p. 26-31.”
    OpenUrl
  47. [47].↵
    T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, “Focal loss for dense object detection,” in Proceedings of the IEEE international conference on computer vision, 2017, pp. 2980–2988.
  48. [48].↵
    T. Thaler, V. Tempelmann, M. Maggiorini, and A. Rudiger, “The frequency of electrocardiographic errors due to electrode cable switches: a before and after study,” Journal of electrocardiology, vol. 43, no. 6, pp. 676–681, 2010.
    OpenUrlPubMed
  49. [49].
    A. Rudiger, J. P. Hellermann, R. Mukherjee, F. Follath, and J. Turina, “Electrocardiographic artifacts due to electrode misplacement and their frequency in different clinical settings,” The American journal of emergency medicine, vol. 25, no. 2, pp. 174–178, 2007.
    OpenUrlCrossRefPubMed
  50. [50].↵
    M. Awan, M. Omar, B. A. Omar, C. Malozzi, and G. M. Awan, “ECG Recording Errors Prevalence in a Hospital Setting,” Circulation: Cardiovascular Quality and Outcomes, vol. 11, no. suppl_1, pp. A244–A244, 2018.
    OpenUrl
  51. [51].↵
    A. Savitzky and M. J. Golay, “Smoothing and differentiation of data by simplified least squares procedures,” Analytical chemistry, vol. 36, no. 8, pp. 1627-1639, 1964.
    OpenUrlCrossRef
  52. [52].↵
    V. Kumar et al., “Implications of left bundle branch block in patient treatment,” The American journal of cardiology, vol. 111, no. 2, pp. 291–300, 2013.
    OpenUrlCrossRefPubMed
Back to top
PreviousNext
Posted June 29, 2023.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Diagnosing Left Bundle Branch Block in 12-lead Electrocardiogram using Self-Attention Convolutional Neural Networks
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Diagnosing Left Bundle Branch Block in 12-lead Electrocardiogram using Self-Attention Convolutional Neural Networks
Alireza Sadeghi, Alireza Rezaee, Farshid Hajati
medRxiv 2023.06.25.23291867; doi: https://doi.org/10.1101/2023.06.25.23291867
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Diagnosing Left Bundle Branch Block in 12-lead Electrocardiogram using Self-Attention Convolutional Neural Networks
Alireza Sadeghi, Alireza Rezaee, Farshid Hajati
medRxiv 2023.06.25.23291867; doi: https://doi.org/10.1101/2023.06.25.23291867

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Cardiovascular Medicine
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)