- 1School of Information Science and Engineering, Linyi University, Linyi, China
- 2School of Radiology, Shandong First Medical University & Shandong Academy of Medical Sciences, Tai’an, China
- 3School of Information Science and Engineering, Shandong University, Qingdao, China
Since ECG contains key characteristic information of arrhythmias, extracting this information is crucial for identifying arrhythmias. Based on this, in order to effectively extract ECG data features and realize automatic detection of arrhythmia, a multi-classification method of arrhythmia based on multi-scale residual neural network and multi-channel data fusion is proposed. First, the features of single-lead ECG signals are extracted and converted into two-dimensional images, and the feature data sets are labeled and divided according to different types of arrhythmias. The improved residual neural network is trained on the training set to obtain the classification model of the neural network. Finally, the classification model is applied to the automatic detection of arrhythmias during exercise. The accuracy of the classification model of this method is as high as 99.60%, and it has high accuracy and generalization ability. The automatic identification of arrhythmia also contributes to the research and development of future wearable devices.
1 Introduction
Data released by the American College of Cardiology in 2019 shows that cardio-vascular disease is already the most important cause of human death. It is estimated that by 2030, the number of deaths from cardiovascular disease will reach 24 million (Sannino and De Pietro, 2018; Liu et al., 2021). Arrhythmias are a type of cardiovascular disease, and arrhythmias are usually not life-threatening, but some arrhythmias are harmful and can lead to stroke or sudden death (Luo et al., 2017). With the increase of social pressure, people often maintain physical health through exercise, and arrhythmias during exercise can lead to physical discomfort, and serious ones can also lead to sudden death. Electrocardiogram (ECG) is able to reflect the condition of the heart by recording periodic changes in the heartbeat (Fan et al., 2021). Therefore, arrhythmias can be diagnosed by electrocardiogram. It can be seen that it is very necessary to detect arrhythmic diseases as soon as possible through abnormal electrocardiograms, so as to achieve early intervention and early treatment.
The ECG signal is constantly changing with the beating of the heart, and it is a periodic repetition of heartbeat activity over time. Cardiologists detect abnormal heart-beats based on changes in the frequency, rhythm, and morphology of the heart-beat. The ECG signal corresponding to arrhythmias is short-lasting and imperceptible, which makes it easy for doctors to make mistakes in the diagnosis process, and the diagnosis process is time-consuming and laborious (Yang and Shen, 2013; Atal and Singh, 2020).
In order to achieve automatic classification of ECGs, researchers have proposed many methods (Cortes and Vapnik, 1995; Kleinbaum et al., 2002; TENCON, 2013; Xu et al., 2018), including artificial neural networks, support vector machines, logistic regression, and random forests. In recent years, deep learning technology has been used more and more widely in ECG. Based on this, we have a new understanding of the classification of ECG, and the deep learning-based method does not require additional feature extraction and selection steps (Breiman, 2001; Xu and Liu, 2020). Neural networks are able to ex-tract representative features from the input data and classify them with the help of softmax. Panda R et al. (Panda et al., 2020) proposed a new method to detect shockable ventricular cardiac arrhythmias using ECG signals. A fixed frequency range empirical wavelet transform filter bank is proposed for multi-scale analysis of ECG signals. Patterns evaluated using a fixed frequency range empirical wavelet transform of ECG signals were used as input to a deep convolutional neural network for the detection of shockable ventricular cardiac arrhythmias. Zhao W et al. (Zhao et al., 2019) based on the ResNet model, using a low-pass filter to eliminate noise in the ECG signal, obtained high classification accuracy on 5 different types of ECGs. Shi Z et al. (Shi et al., 2021) improved the residual network to achieve the 5-classification task of arrhythmias, and their classification accuracy reached 99.59%. Park J et al. (Park et al., 2020) designed the residual network structure of the integrated squeeze-and-excitation module and obtained 97.05% accuracy. Li D et al. (Li et al., 2020) proposed a multi-scale residual network and realized the 4-classification task of ECG signals, and the accuracy of the method reached up to 94.29% (Guo et al., 2021). combined multiscale and squeeze-and-excitation modules for arrhythmia classification, which extracted ECG image features and applied key features to signal recognition, and they verified the effectiveness of the method in two datasets with an accuracy rate of 98.3% and 97.5%, respectively (Mangathayaru et al., 2021). evaluated the diagnostic model using a single-lead dataset, where they first extracted the characteristics of the signal using the wavelet transform, and then used the attention mechanism to distinguish different arrhythmia categories from the extracted signal according to the time series, with an accuracy rate of 98.5%. In the above work, the ResNet model was used, the improved residual network and the fusion multi-scale residual network were used to realize the multi-classification task of arrhythmia. Their recognition accuracy has reached more than 90%, but in the medical field, the recognition of diseases is accurate the rate is closely related to the patient’s condition. Therefore, it is very important to further improve the recognition accuracy of ECG signals. Moreover, on the basis of obtaining better signal recognition accuracy in existing research, it is necessary to simplify the process of signal feature extraction, and the fusion of multi-lead data should also be considered. The data of different leads has different characteristics, and the data set composed of data extracted from different leads is more complex and confusing. The model trained in this way can be used in the face of complex and diverse clinical conditions. More accurately identify the disease.
Therefore, on the basis of the above research, drawing on the excellent algorithms in the field of im-ages, we propose a method based on multi-scale residual neural network and multi-channel fusion for the diagnosis of motor arrhythmias. The main contributions of this paper are as follows.
(1) The two-channel ECG data is fused and converted into a two-dimensional ECG image.
(2) A multi-scale module is integrated in the bottom layer of the residual network to fuse different scale features of ECG images. When the image features reach the deep layer of the network, the image information of the upper layer and the bottom layer can be combined through the multi-scale module.
(3) The improved network makes full use of image information of different scales, combines more image information to learn more content, alleviates the gradient disappearance that may occur in the deep network, and achieves higher-precision ECG classification.
The rest of this paper is organized as follows. Experimental materials and methods are described in the second section, including datasets, data pre-processing, im-proved residual neural networks and experimental equipment. The experimental results are described in Section 3 and discussed in Section 4. A summary of our work and vision for the future are described in section 5.
2 Materials and methods
2.1 Dataset
We completed the experiment using the MIT-BIH arrhythmia dataset (Goldberger et al., 2000; Moody and Mark, 2001). The heartbeat data recorded in this dataset includes two sets of leads. In Figure 1, A represents the V5 lead and B represents the MLII lead. This dataset randomly selected 23 cases from 4,000 24-h Holter records as representative samples of clinical routine records.
The sudden cardiac death holter dataset is data on sudden cardiac deaths recorded at boston hospitals in the 1980s. It included 18 patients with underlying sinus rhythm, 1 patient with continuous pacing, and 4 patients with atrial fibrillation. Annotating this dataset is particularly difficult due to the complexity of the heart rhythms included, so there is limited annotated data for this data.
2.1.1 Dataset division
Deep learning algorithms tend to learn categories with a larger number of samples, and the number of ECGs for each arrhythmia in the MIT-BIH database is very different, so this will reduce the accuracy of the deep learning model. Therefore, we train the model by selecting categories with an approximate number of samples from the da-ta-base, which eliminates the impact of database imbalances. We selected 5 different categories from our database, including normal (N), left bundle branch block (LBBB), ventricular premature beats (PVC), atrial premature beats (APB), and right bundle branch block (RBBB).
The experimental dataset is divided according to the ratio of 9:1, and the number of MIT-BIH database ECG image samples used during training and testing is shown in Table 1. Table 1 shows the size of the sample 2D ECG dataset used in this paper. Among them, the number of ECGs for each arrhythmia category is also relatively close, so when training, the phenomenon of unbalanced model training can be avoided.
The number of samples in the sudden cardiac death holter dataset (Greenwald et al., 1985; Greenwald, 1986; Goldberger et al., 1988; Courtemanche et al., 1989; Goldberger et al., 1989) is shown in Table 2. The dataset is divided into three categories, namely, normal (N), ventricular premature beats (PVC) and Isolated QRS-like artifact (I).
2.2 Data pre-processing
This paper uses the matplotlib library of the python language to fuse the single-lead ECG data together, and uses the plt function to save it as a two-dimensional ECG image suitable for the CNN structure. Since each ECG signal is a continuous one-dimensional time series, we use 400 data points as a sample, data 1–400 as the first sample generated by cutting, and the starting point of the next sample is 401–800, until the end of ECG data.
The definition of each sample is shown in Equation 1, d is the current data point, P is the interval length of the selected data point, and S is the sample data after segmentation.
The segmented data points are shown in Figure 2, with a sample of every 400 data points and looped through them.
2.3 Cross-entropy loss function
The function of the loss function is to calculate the gap between the predicted value of the network model and the true label, and it is used to measure how well the network model is trained. As the network model is trained, the loss function value changes continuously. When the loss function reaches the minimum value, the prediction accuracy of the network model also reaches the highest value. The cross-entropy loss function is used for multiclassification tasks, which can calculate the degree of difference in the probability distribution of the same random variable. Dataset D = (
2.4 Improved residual network
HE K et al. (He et al., 2016) proposed the residual neural network, which achieved success in the ImageNet competition. But considering that image features at different scales are also critical for the classification of ECGs, we made further improvements to the residual network.
In this paper, an improved multi-scale residual ECG classification algorithm is introduced, the network architecture is based on the residual structure, and takes into account the extraction of feature information at different scales, in order to make effective use of this information, multi-scale modules are added to the end of the residual network.
The proposed network framework is shown in Figure 3, from which each module of the network architecture can be clearly seen. First, the 2D ECG signal is input into the network, and then it is zeroed, followed by 2D convolution and regularization operations. Before the residual connection, perform the maximum pooling operation, save the main features in the obtained feature map, and then input the main features into the residual block. We define two residual blocks, conv_block (denoted by ConvB below) and identity_block (denoted by IdenB below). First, the features obtained by the maximum pooling are input into the ConvB module, and then the output of the ConvB module is passed to the IdenB module, so that the obtained features retain the most important information as much as possible, so that the IdenB module is used more than the ConvB module. Because the IdenB module directly connects the input features with the final convolution output features, the ConvB module does not simply connect the initial input features, but performs convolution and batch regularization operations to further process the extracted features. The IdenB module receives the output of the ConvB module, and continuously retains the features of the initial acquisition to the subsequent convolutional layer, which can effectively preserve the features of the initial acquisition image and is more conducive to image classification. We add 2 IdenB modules after the ConvB module, 3 IdenB modules after the ConvB module, and 5 IdenB modules after the ConvB module, and fuse the multi-scale modules at the output position of this layer. As the network deepens, the training accuracy of the model is improved, but with continuous training, the gradient in the network will gradually disappear. When the network reaches a certain depth, a multi-scale module is added to sample the collected features at different sampling rates, so that the network can consider feature information of different scales, thereby improving the accuracy of classification.
2.4.1 Conv_block module
The contents of the ConvB module are described in this section, as shown in Figure 4A. The imported features are activated by two-dimensional convolution, batch regularization, dropout, and Relu in turn, and after four rounds of these four operations, the original input features are subjected to two-dimensional convolution and batch regularization operations, that is, shortcut. Connect the features of the output after four rounds of operation with the features of the output after the shortcut, and finally activate the output using a Relu.
FIGURE 4. The three modules used in the network structure proposed here: (A) represents the conv_block module, (B) represents the identity_block module, and (C) represents the multi-scale module.
2.4.2 Identity_block module
The contents of the IdenB module are described in this section, as shown in Figure 4B. The imported features are in turn two-dimensional convolution, batch regularization, dropout, and Relu activation, and these four operations use the characteristics of the output after three rounds to connect the original input features, and finally use a Relu to activate the output.
2.4.3 Multi-scale module
2.5 Diagnostic methods
The method flow mainly includes: data preprocessing, dataset division, training model and ECG image classification. Figure 5 shows the flow of the method presented herein.
1. First collect ECG data, which is one-dimensional time series data;
2. Preprocess the collected 1D ECG data. The single-channel data was fused by the matplotlib method to form a two-dimensional multi-channel ECG image;
3. In a 9:1 ratio, the ECG dataset is divided into training and validation sets;
4. In order to fully extract the characteristics of multi-channel ECG images, multi-scale network modules are designed;
5. Train the designed model to obtain the network model of ECG image diagnosis;
6. The trained model is verified on the validation set and the accuracy of ECG image recognition is calculated.
This paper uses the frameworks of Python and Tensorflow to complete the experiment. The program was trained and tested with the GPU NVIDIA GeForce RTX 3060. During training, in order to reduce the memory usage, each group takes 32 pieces of data, and the learning rate of the Adam optimizer is 0.0001.
3 Results
The definition of the evaluation index of the experimental results is as follows:
TP is to predict a positive class as a positive class, FN is to predict a positive class as a negative class, FP is to predict a negative class as a positive class, and TN is to predict a negative class to be a negative class.
Firstly, in order to verify the importance of multiscale modules, the experimental results of ResNet and the proposed method are compared under the same dataset. The training results are shown in Table 3 and Figure 6. Table 3 summarizes the experimental results under the same dataset, and the accuracy of ResNet and the proposed method in the verification set is 98.99% and 99.60%, respectively. The accuracy of the proposed method on the validation set is 0.61% higher than that of ResNet, and the loss function value on both the training set and the validation set is lower than that of ResNet. Therefore, it is very reasonable to add multi-scale modules on the basis of ResNet, and multi-scale modules can effectively improve classification accuracy.
FIGURE 6. Experimental results on the MIT-BIH dataset: (A) is the experimental result of ResNet, and (B) is the experimental result of the proposed method.
In Figure 6, the first line represents the loss function value and accuracy value of ResNet’s training set and validation set. The second line represents the loss function value and accuracy value of the training set and validation set of the proposed method. Through comparison, it can be found that the method proposed in this paper has faster and more stable convergence speed and higher accuracy.
Secondly, in order to verify the effectiveness of multi-channel data fusion for classification tasks, on the basis of the same model, the single-channel data (V5 and MLII) and the multi-channel fusion data were trained respectively, and the experimental results were shown in Table 4 and Figure 7. As shown from Table 4, the validation set accuracy of V5 channel, MLII channel and the proposed method is 96.84%, 98.54% and 99.60%, respectively, and the verification set accuracy of the proposed method is the highest, and the loss function value is also the lowest. Experimental results show that the dual-channel data is conducive to improving the accuracy of classification, and the proposed method can make full use of the dual-channel image information to achieve higher recognition accuracy. In Figure 7A represents the loss function value and ac-curacy value of training set and validation set under V5 channel. (b) Represents the loss function value and accuracy value of training set and validation set under MLII channel. (c) Represents the loss function value and accuracy value of training set and validation set under multi-channel fusion data. As can be seen from the figure, the value of the loss function after the fusion of two-channel data converges faster and is more stable. Its accuracy rate is also stable. Therefore, the proposed method can effectively extract the information of multi-channel ECG images, improve the accuracy of ECG classification, and the classification performance is optimal.
FIGURE 7. Single-channel and multi-channel experimental results: (A) is the experimental result of V5 channel, (B) is the experimental result of MLII, and (C) is the experimental result of multi-channel data.
By training the sudden cardiac death holter dataset, the experimental results of ResNet network architecture and fusion multi-scale modules are shown in Table 5 and Figure 8.
FIGURE 8. ResNet of the sudden cardiac death holter dataset and the experimental results of the method in this paper: (A) is the experimental results of ResNet, and (B) is the experimental results of the method in this paper.
Table 5 and Figure 8 shows that the accuracy of the method proposed in this paper on the verification set is slightly lower than the results obtained by the ResNet network, and in the training process, the ResNet method is more stable and less fluctuating, while the method proposed in this paper fluctuates more, and the results are more uncertain, so the results obtained by the ResNet network are more trustworthy. The method proposed in this paper is not as good as the ResNet method. The reason may be that the sudden cardiac death holter dataset is complex, there are fewer samples with labeled information in this data, and the experimental samples of individual patients are limited.
In addition to comparing the ResNet algorithm, we trained the Goolenet network using the same data set. The experimental results are shown in Table 6 and Table 7. Table 6 shows the comparison results under the MITBIH dataset, and the optimal results in the table have been bolded. Except for the Kappa index, other indexes are superior to the results obtained by other algorithms. Table 7 shows the comparison results of the Sudden cardiac death holter dataset, and the optimal results in the table have been bolded. It can be seen that the results obtained by the ResNet algorithm are better. The reason may be that the dynamic ECG data set of sudden cardiac death is complex, and there are fewer samples with labeled information in the data, and the experimental samples of individual patients are limited. Bold fonts in Table 6 and 7 indicate the optimal experimental results.
To further verify the effectiveness of the proposed method, the results of some other ECG classification methods in recent years were compared under the same data set, as shown in Table 8. From the experimental results and Table 8, it can be concluded that this paper has obtained better results compared with other methods. Moreover, when adding multi-scale modules to the ResNet network, the effect is superior to that of the ResNet network, thus verifying that the proposed method is effective.
TABLE 8. This is a table. Tables should be placed in the main text near to the first time they are cited.
4 Discussion
t-SEN (t-Distributed Stochastic Neighbor Embedding) (Li et al., 2020) was proposed in 2008. The t-SEN feature is a better demonstration of experimental results. When there are many classifications, t-SEN can concisely demonstrate the classification performance of the classification model. As a result, we are able to more intuitively analyze the performance and quality of classification models through t-SEN visualization. The confusion matrix is used to summarize the predicted categories and the real categories in the form of a matrix.
We use these two methods to visualize ResNet and the methods proposed in this paper, as shown in Figure 9 and Figure 10. Figure 9 compares the ECG image classification effect of ResNet and the proposed method, although both methods can achieve the 5-classification task of the ECG graph, but the proposed method in this paper has a lower error rate than ResNet on the t-SNE diagram. And in the confusion matrix, the recognition accuracy of both label 1 and label 4 is improved by 1%.
FIGURE 10. Confusion matrix and cluster analysis results for single-channel and multi-channel data fusion.
Figure 10 compares the confusion matrix and t-SNE plot of single-channel data and multi-channel data fusion. Among them A represents the V5 channel, B represents the MLII channel, and C represents multi-channel data fusion. The method proposed in this paper has clear clustering boundaries and the experimental data can be accurately divided into 5 categories. However, the classification results of single-channel data are very confusing and cannot effectively distinguish the categories of data.
The prediction effect of the network model is shown in Figure 11 and Figure 12 The prediction accuracy of the ResNet method on the three categories of I, N and V are 100%, 100% and 99%, respectively. However, the accuracy rate of the method proposed in this paper is 100%, 91% and 100% respectively on these three categories. The prediction accuracy rate of the method proposed in this paper is low for normal categories, and 9% of them are misclassified as V category. It is difficult for the proposed method to distinguish the normal category from the V category. In Figure 12, you can more intuitively see the prediction results of the ResNet method and the method proposed in this paper under the three categories. The N category and the V category are similar in some features. The method proposed in this paper is insufficient for feature learning of N and V categories. It is difficult to distinguish the N category from the V category when the two categories are similar.
FIGURE 11. ResNet of the sudden cardiac death holter dataset and the prediction results of this method: (A) is the network of ResNet, (B) is the method of this paper.
FIGURE 12. The prediction results of three categories in the sudden cardiac death holter dataset: (A) is the network of ResNet, (B) is the method of this paper.
Through the visualization of the experimental results, in the MITBIH dataset, compared with the simple residual network, the method proposed in this paper is significantly better than the comparison experiment. It can be seen that the multi-scale module has an effect on the ECG data of the MITBIH data set. The feature learning is better, the features of the data can be effectively extracted, and various ECG features can be accurately identified. In the sudden cardiac death holter dataset, the method proposed in this paper has a low classification accuracy for category N, confuses category N and category V, and classifies category N as category V. It can be seen that in the more complex sudden cardiac death holter dataset, the model proposed in this paper is not sufficient to accurately classify categories with similar characteristics. The method proposed in this paper needs to enhance the accuracy of data classification with subtle differences in ECG features.
5 Conclusion
In this paper, we propose an arrhythmia classification method based on multi-scale residual neural network and multi-channel data fusion. Considering that the information hidden in different lead ECG data is crucial for identifying arrhythmia diseases, we fused multi-channel ECG signals. In the process of ECG signal recognition, the residual neural network incorporates multi-scale modules, and uses the data after channel fusion to train the multi-scale residual neural network. After continuous optimization, the model can automatically extract image features from ECG images for arrhythmia classification with an accuracy rate of 99.60%. In addition, we also conduct experimental verification on single-channel data, and the accuracy rates reach 96.84% and 98.54%, respectively. Experimental results show that this method has better classification performance and is more conducive to the classification of arrhythmia. In future work, we will continue to develop portable wearable devices to diagnose more real-time samples.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found here: https://physionet.org/content/sddb/1.0.0/; https://physionet.org/content/sddb/1.0.0/.
Ethics statement
The studies involving human participants were reviewed and approved by Massachusetts Institute of Technology. The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this paper.
Author contributions
FZ: Writing–original draft, Writing–review & editing. ML: Writing–original draft, Writing–review & editing. LS: Conceptualization, Writing–review and editing. LW: Conceptualization, Writing–review and editing. BW: Data curation, Visualization, Writing–review and editing.
Funding
This research was funded by the Shandong Provincial Natural Science Foundation, China, grant number ZR2020MF156.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Atal D. K., Singh M. (2020). Arrhythmia classification with ECG signals based on the optimization-enabled deep convolutional neural network. Comput. Methods Programs Biomed. 196, 105607. doi:10.1016/j.cmpb.2020.105607
Cortes C., Vapnik V. (1995). Support-vector networks. Mach. Learn. 20, 273–297. doi:10.1007/bf00994018
Courtemanche M., GlassRosengarten L. M. D., Goldberger A. L. (1989). Beyond pure parasystole: promises and problems in modeling complex arrhythmias. Am. J. Physiol. 257, H693–H706. doi:10.1152/ajpheart.1989.257.2.H693
Fan W., Si Y., Yang W., Zhang G. (2021). Active broad learning system for ECG arrhythmia classification. Measurement 185, 110040. doi:10.1016/j.measurement.2021.110040
Goldberger A. L., Amaral L. A. N., Glass L., Hausdorff J. M., Ivanov P. C., Mark R. G., et al. (2000). PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. circulation 101 (23), e215–e220. doi:10.1161/01.cir.101.23.e215
Goldberger A. L., Rigney D. R., Mietus J., Antman E. W., Greenwald S. (1988). Nonlinear dynamics in sudden cardiac death syndrome: heart rate oscillations and bifurcations. Experientia 44, 983–987. doi:10.1007/BF01939894
Goldberger A. L., Rigney D. R. (1989). “On the non-linear motions of the heart: fractals, chaos and cardiac dynamics,” in Cell to cell signaling: From experiments to theoretical models. Editor A. Goldbeter (San Diego: Academic Press), 541–550.
Greenwald S. D., Albrecht P., Moody G. B., Mark R. G. (1985). “Estimating confidence limits for arrhythmia detector performance[J],” in Comput. Cardiol. (Washington, DC: IEEE Computer Society Press), 383–386. (Pubitemid 17485737).
Greenwald S. D. (1986). The development and analysis of a ventricular fibrillation detector[D]. Massachusetts Institute of Technology.
Guo X., Wang Q., Zheng J. (2021). An intelligent computer-aided diagnosis approach for atrial fibrillation detection based on multi-scale convolution kernel and Squeeze-and-Excitation network. Biomed. Signal Process. Control 68, 102778. doi:10.1016/j.bspc.2021.102778
He K., Zhang X., Ren S. (2016). “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, NV, USA, 27-30 June 2016 (IEEE), 770–778.
Izci E., Ozdemir M. A., Degirmenci M. (2019). “Cardiac arrhythmia detection from 2D ECG images by using deep learning technique,” in 2019 Medical Technologies Congress (TIPTEKNO), Izmir, Turkey, 03-05 October 2019 (IEEE), 1–4.
Kleinbaum D. G., Dietz K., Gail M., Klein M., Klein M. (2002). Logistic regression. New York, NY, USA: Springer.
Li D., Tao Y., Zhao J., Wu H. (2020a). Classification of congestive heart failure from ECG segments with a multi-scale residual network. Symmetry 12 (12), 2019. doi:10.3390/sym12122019
Li M., Zhao Z., Scheidegger C. (2020b). Visualizing Neural Networks with the Grand Tour[J]. Distill. doi:10.23915/distill.00025
Liu X., Wang H., Li Z., Qin L. (2021). Deep learning in ECG diagnosis: A review. Knowledge-Based Syst. 227, 107187. doi:10.1016/j.knosys.2021.107187
Luo K., Li J., Wang Z., Cuschieri A. (2017). Patient-specific deep architectural model for ECG classification. J. Healthc. Eng. 2017, 4108720. doi:10.1155/2017/4108720
Mangathayaru N., Rani P., Janaki V., Srinivas K., Mathura Bai B., Sai Mohan G., et al. (2021). An attention based neural architecture for arrhythmia detection and classification from ecg signals. CMC-COMPUTERS Mater. CONTINUA 69 (2), 2425–2443. doi:10.32604/cmc.2021.016534
Moody G. B., Mark R. G. (2001). The impact of the MIT-BIH arrhythmia database. IEEE Eng. Med. Biol. Mag. 20, 45–50. doi:10.1109/51.932724
Panda R. S. R., JainTripathy K., Acharya U. R. (2020). Detection of shockable ventricular cardiac arrhythmias from ECG signals using FFREWT filter-bank and deep convolutional neural network. Comput. Biol. Med. 124, 103939–9. doi:10.1016/j.compbiomed.2020.103939
Park J., Kim J., Jung S., Gil Y., Choi J. I., Son H. S. (2020). ECG-signal multi-classification model based on squeeze-and-excitation residual neural networks. Appl. Sci. 10 (18), 6495. doi:10.3390/app10186495
Sannino G., De Pietro G. (2018). A deep learning approach for ECG-based heartbeat classification for arrhythmia detection. Future Gener. comput. Syst. 86, 446–455. doi:10.1016/j.future.2018.03.057
Shi Z., Yin Z., Ren X., Liu H., Chen J., Hei X., et al. (2021). Arrhythmia classification using deep residual neural networks. J. Mech. Med. Biol. 21 (10), 2140067. doi:10.1142/s0219519421400674
Ullah A., Rehman S., Tu S., Mehmood R. M., Ehatisham-Ul-Haq M. (2021). A hybrid deep CNN model for abnormal arrhythmia detection based on cardiac ECG signal. Sensors 21 (3), 951. doi:10.3390/s21030951
Wang T., Lu C., Sun Y., Yang M., Liu C., Ou C. (2021). Automatic ECG classification using continuous wavelet transform and convolutional neural network. Entropy 23 (1), 119. doi:10.3390/e23010119
Xu S. S., Mak M. W., Cheung C. C. (2018). Towards end-to-end ECG classification with raw signal extraction and deep neural networks. IEEE J. Biomed. Health Inf. 23, 1574–1584. doi:10.1109/JBHI.2018.2871510
Xu X., Liu H. (2020). ECG heartbeat classification using convolutional neural networks. IEEE Access 8, 8614–8619. doi:10.1109/access.2020.2964749
Yang S., Shen H. (2013). “Heartbeat classification using discrete wavelet transform and kernel principal component analysis,” in IEEE 2013 Tencon - Spring, Sydney, NSW, Australia, 17-19 April 2013 (IEEE).
Keywords: deep learning, multi-scale, residual neural networks, electrocardiogram, multichannel fusion
Citation: Zhang F, Li M, Song L, Wu L and Wang B (2023) Multi-classification method of arrhythmia based on multi-scale residual neural network and multi-channel data fusion. Front. Physiol. 14:1253907. doi: 10.3389/fphys.2023.1253907
Received: 06 July 2023; Accepted: 12 September 2023;
Published: 28 September 2023.
Edited by:
Fernando Soares Schlindwein, University of Leicester, United KingdomReviewed by:
Suman Deb, Sardar Vallabhbhai National Institute of Technology Surat, IndiaPranjali Gajbhiye, Nirvesh Enterprises Private Limited, India
Copyright © 2023 Zhang, Li, Song, Wu and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Li Song, bHNvbmdAc2RmbXUuZWR1LmNu
†These authors have contributed equally to this work