Skip to main content

REVIEW article

Front. Cardiovasc. Med., 25 March 2022
Sec. Coronary Artery Disease

Deep Learning for Detecting and Locating Myocardial Infarction by Electrocardiogram: A Literature Review

\nPing XiongPing Xiong1Simon Ming-Yuen LeeSimon Ming-Yuen Lee1Ging Chan,
Ging Chan1,2*
  • 1State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Taipa, Macau SAR, China
  • 2Department of Public Health and Medicinal Administration, Faculty of Health Sciences, University of Macau, Taipa, Macau SAR, China

Myocardial infarction is a common cardiovascular disorder caused by prolonged ischemia, and early diagnosis of myocardial infarction (MI) is critical for lifesaving. ECG is a simple and non-invasive approach in MI detection, localization, diagnosis, and prognosis. Population-based screening with ECG can detect MI early and help prevent it but this method is too labor-intensive and time-consuming to carry out in practice unless artificial intelligence (AI) would be able to reduce the workload. Recent advances in using deep learning (DL) for ECG screening might rekindle this hope. This review aims to take stock of 59 major DL studies applied to the ECG for MI detection and localization published in recent 5 years, covering convolutional neural network (CNN), long short-term memory (LSTM), convolutional recurrent neural network (CRNN), gated recurrent unit (GRU), residual neural network (ResNet), and autoencoder (AE). In this period, CNN obtained the best popularity in both MI detection and localization, and the highest performance has been obtained from CNN and ResNet model. The reported maximum accuracies of the six different methods are all beyond 97%. Considering the usage of different datasets and ECG leads, the network that trained on 12 leads ECG data of PTB database has obtained higher accuracy than that on smaller number leads data of other datasets. In addition, some limitations and challenges of the DL techniques are also discussed in this review.

Introduction

According to the WHO's 2019 global health estimate, ischemic heart disease (IHD) has been the largest cause of death globally, accounting for 16% of deaths worldwide, and has increased from more than 2 million to 8.9 million in the last two decades (1). Myocardial ischemia is the first stage in the progression of myocardial infarction (MI) which is characterized pathologically as the irreversible necrolysis of cardiomyocytes caused by a disruption in coronary blood supply to the myocardium partially or completely (2). MI, also called a heart attack, can frequently occur in patients with a history of heart disease. Heart failure, angina pectoris, and arrhythmia are the main clinical symptoms of acute MI (3). Furthermore, studies reveal that ~22–64% of non-fatal MIs are silent or unrecognized MIs. Several potential factors, such as a history of cardiovascular diseases (CVD), hypertension, and diabetes could also increase the risk of silent MI, and this provides evidence that the prevalence and incidence of clinically unrecognized MI increase with patients' age (4, 5). The characteristics of asymptomatic and multifactorial may be related to a considerably increased mortality risk.

Early diagnosis of the onset of MI is a crucial step for suspected patients to receive medical intervention timely, such as percutaneous coronary intervention (PCI) which is an effective way to limit infarct size and thereby reduce the risk of post-MI complications and heart failure (6, 7). Biomarkers, cardiac image modalities, and electrocardiographic methods play important roles in MI diagnosis (8). In terms of urgent therapeutic options, a non-invasive ECG is the most cost-effective and irreplaceable one and allows continuous and remote monitoring (9). The continuous ECG can always provide valuable prognostic information and may help determine the status of reperfusion or re-occlusion (10). Therefore, it is an essential diagnostic step for suspected patients either in pre-hospital or in-hospital settings. Additionally, the 12-lead ECG can be used to better understand the pathogenesis of MI and to pinpoint the localization of cardiac damage. Specific ECG leads can reflect the electrical activity of the heart from various angles, allowing them to distinguish between different types of MI based on the location of the infarction in the myocardium (11). For instance, a combination of lead V1, V2, V3, and V4 provides suggestive information concerning anterior MI (AMI), whereas a combination of lead II, III, and aVF can indicate inferior MI. However, the 12-lead ECG has difficulty in localizing the posterior MI (12, 13). Gupta et al. (14) quantified the contributions of each of 15 leads ECG signal from the PTB database individually and observed that the five leads: V5, V6, Vx, Vz, and II contain the most useful information, then they were quantified in pairs using the five best channels and results indicated that lead V6 and lead Vz can induce the best performance of the model. Fu et al. (15) employed an attention mechanism to select the most essential leads under the intra-patient and inter-patient scheme in MI detection and localization, although it just aimed to assist proposed DL methods to effectively diagnosis not to find the most significant leads in pathology. Treating all the leads equally could inversely lead to limiting model performance and largely increasing computational complexity because of redundant and unnecessary information. Multi-lead ECGs can not only help clinicians carry out myocardial reperfusion therapy as soon as possible but also help interventional cardiologists make targeted preliminary judgments on the pathological vessels related to infarction and perform directed interventional treatment. In addition, 12-lead ECG can also indicate whether clinicians need to prepare rescue measures for those patients with MI diagnosed with large infarct sizes. An overview of coronary arteries structure and ECG 12 leads is illustrated in Figure 1A. Different MI locations with their corresponding leads and culprit coronary arteries can be found in Table 1.

FIGURE 1
www.frontiersin.org

Figure 1. Overview of coronary arteries structure and ECG 12 leads (A). The ECG characteristics of normal sinus rhythm (B) and myocardial infarction (MI) (C).

TABLE 1
www.frontiersin.org

Table 1. Different myocardial infarction (MI) localizations and corresponding leads and culprit coronary arteries.

The main manifestations of MI in ECG are ST-segment elevation, as well as the high apex and inversion of T waves, and the appearance of pathological Q waves (Figure 1C). Among these manifestations, the change in the ST segment is the most significant. The characteristics of normal sinus rhythm and MI are shown in Figures 1B,C respectively. In a feature visualization analysis (17), the weight assigned to the U-wave of ECG signals is also large. It is also indicated that the U wave may play an important role in MI detection. Furthermore, silent MI with a Q wave accounted for 9–37% of all non-fatal MI events (18). According to ECG waveform characteristics, that is, whether ECG presents a specific sign called ST-elevation, MI can be divided into two categories: ST-elevation MI (STEMI), which refers to MI with ST-segment elevation that cannot be rapidly reversed by nitrates, and up to 25% of acute coronary syndrome (ACS) patients present this severe condition. Patients with STEMI will be at significant risk of cardiac mortality and sequelae if urgent reperfusion therapy is not provided (19). Non-ST-elevation MI (NSTEMI) is defined as MI with an ECG presentation of ST-segment depression, T-wave inversion, or both. The ECG waveform deviations provide indicative information about patients with MI. Pre-clinical ECG diagnosis can predict the risk stratification of MI and shorten the time to treatment. Patients with suspected STEMI who receive pre-hospital ECG have a 20% lower risk of in-hospital mortality (20). The interval between diagnosis and treatment is critical, and 12-lead ECGs of patients with MI should be collected and examined within 10 min of the initial medical contact (10). Silent patients with MI who receive treatment within 90 min of starting MI have a better chance of survival. In addition to being inefficient, manually identifying complicated non-linear ECG features across the 12-lead is time-consuming, and human interpretation of the ECG differs considerably depending on experience and competence levels. Because of intra- and inter-individual variability, neither timeliness nor accuracy can be guaranteed. It should be emphasized that the sensitivity and specificity of manual AMI diagnosis are 91 and 51%, respectively (21). As a result, prompt identification, immediate feedback, and precise diagnosis can give a better chance for future medical therapy.

Machine Learning for MI Diagnosis

Machine Learning

Machine learning (ML) is the technique of enabling computers to mimic human learning behaviors to update their existing knowledge frame and acquire new knowledge to progressively advance their ability to complete specific tasks (22). In cardiology, ML methods have been extensively used in medical imaging [e.g., CT (23), MRI (24), chest X-ray (25), echocardiogram (26)] and ECG (27). It is especially beneficial in pre-hospital settings, where paramedics may lack the knowledge of emergency physicians or cardiologists when it comes to interpreting ECGs. ECG interpretation using ML, including traditional ML, deep learning (DL), and a combination of the two. For traditional ML methods, extra hand-crafted feature extraction and selection steps are needed. Morphological features are computed by the demarcation of major ECG characteristic points such as the QRS complex, T wave, and the J point (28). Wavelet transforms decompose ECG signals in the time-frequency domain. Principal component analysis (PCA) (29), empirical mode decomposition (EMD) (30), and the hidden Markov model (31) are all commonly employed to extract representative features. The goal of feature selection is to lower the complexity of the computational process and ensure that ML algorithms only employ the most informative features. Conventional threshold-based support vector machines (SVM), random forest (RF), naive Bayes, decision tree (DT), k-nearest neighbor (KNN), and neural network (NN) are the commonly used classifiers, and almost all these techniques have achieved good performance beyond that of cardiologists, who achieved an average accuracy of 75% for detecting ECG pathologies (32). To name a few, Kora (33) proposed a hybrid firefly (FF) and PSO (FFPSO) algorithm to optimize ECG features. Then, an ANN model named the Levenberg–Marquardt Neural Network with optimization algorithm achieved the best accuracy of 99.3% compared with the other two ML algorithms, KNN and SVM. Acharya et al. (34) investigated a four-level discrete wavelet transform to extract 12 types of non-linear features. An ANOVA analysis was used to rank these features and obtain optimal features. The performance of the proposed model was evaluated using a KNN classifier, which had an accuracy of 98.8% for MI detection and 98.73% for MI localization. Some recent conventional ML methods with good performance for MI classifications are shown in Table 2. However, these ML techniques are also confronted with some challenges, which are concluded as follows:

TABLE 2
www.frontiersin.org

Table 2. Recent conventional machine learning methods for MI detection and localization.

• First, information loss: the feature extraction and classification are two isolated modules in traditional ML approaches. In feature engineering, dimensionality reduction can remove large irrelevant features, make data analysis simpler, and lower computational costs. The feature selection aims to choose an appropriate algorithm to rank the scores of contributions of the features to the results so that the relevant characteristics can be maintained (41). However, the unselected features in the process of feature selection are directly moved out, so it is difficult to tell if the hidden information has been thoroughly unearthed or utilized redundantly. This form of two independent modules gives a negative influence on the learning ability and performance of ML models (42).

• Second, in the conventional ML framework, independent feature extraction techniques use fixed hand-crafted features. However, ECG characteristics may change with the influence of some external factors such as patients' age, gender, the devices used for acquiring ECG data, and the generalizability is compromised when fixed ECG features are used (43).

• Third, feature point detection cannot be guaranteed in ECG signals due to their faintness and noise interference. When confronted with faulty ECG tracings, traditional models can easily lose their robustness (41).

• Fourth, the ECG characteristics of MI, such as ST-segment deviation, are often inadequate to detect MI since they may be seen in other cardiac conditions, such as left ventricular hypertrophy and left bundle-branch block (44). Moreover, ECG abnormalities are also common in patients who have myocarditis or Takotsubo syndrome (TTS) (45, 46) Therefore, the low accuracy of the conventional ML methods keeps the manual-crafted feature extraction as a central task, which may be attributed to the traditional imperfect ECG classification criteria.

Deep Learning

Deep learning (DL) is a subfield of ML. DL models can be trained on huge datasets to learn the relationship between the input features and the results automatically (47) Thus, they can directly learn features from given input of raw data without a specific step of feature extraction and have the capacity to maintain good generalizability. The hidden layers of DL models are a black box, which is responsible for automatic features learning. The deep hidden layers make a deep neural network that can potentially map to any function, which allows it to solve exceedingly complicated functions. A system with 5–20 non-linear layers may implement extraordinarily complex functions of its inputs (47). DL has been rapidly evolving and having an imperative impact on the accuracy in the classification of heart diseases. Data-driven DL models rely significantly on the quality of data and can be promoted as more data is gathered. Since 2010, due to aging populations, the availability of easy-to-use ECG monitoring devices in the form of wireless, mobile, and remote technologies have greatly expanded the capture of ECG data, and DL algorithm-based interpretation software automatically interprets ECG data. Wearable technology, wireless sensors, and deep learning techniques can all collaborate to create innovative approaches to improve healthcare services (48, 49). The improvement of AI may greatly promote the development of ECG and expand the interoperability of healthcare. For the limitations of conventional ML approaches mentioned in Section Machine Learning, the advantages of DL methods compared with the ML approaches are concluded as follows:

• As opposed to the conventional ML algorithms, the DL framework integrates feature extraction and classification into a whole instead of clearly describing them as two independent modules. That is also called the “end-to-end” model. End-to-end models can use a single model to solve tasks with multiple modules or steps. When solving a complex task using multiple modules, the major drawback is the accumulation of errors, since deviations from one module can affect the next. The end-to-end DL models avoid this inherent defect and make a reduction in engineering complexity. However, just as the “no Free Lunch” theory, the interpretability of DL models is reduced.

• From the perspective of generalization, DL is easier to deal with massive data than conventional ML methods. It is believed that memory training data is an important reason for the poor generalization ability of models in conventional view. Therefore, various regularization methods are often used to make models “simple” and break this memory. In Zhang et al. (50), researchers challenged this conventional view by adopting the randomization test to compare how the learning algorithm performs on the natural data vs. the randomized data. The results indicate that deep neural networks easily fit random labels and emphasize that the effective capacity of the neural network makes it large enough to memorize data. More importantly, researchers assessed the effects of implicit regularizers on generalization performance and proved implicit regularizations are not the main cause for model generalization. Therefore, we can infer deep neural networks make good use of their memory when they work. In comparison, the traditional ML methods lack this memory when they deal with complex functions.

• Compared with the rule-based features extraction, DL approaches automate feature engineering. DL models can better express the rich underlying information of data by using vast amounts of raw data to automatically identify the representations needed for classification. As a result, the DL algorithms may “see” informative features that even the experienced expert may not notice. Their potential for greater generalization ability makes them better adapt to the dynamic changes of ECG patterns and enables them to diagnose more cardiac conditions with a greater performance than identified by conventional ML.

Based on the summary of Ansari et al. (51) we add the DL methods for MI detection and localization emerging from the recent 5 years to construct the timeline in Figure 2.

FIGURE 2
www.frontiersin.org

Figure 2. Timeline of technology advances of feature extraction and classification methods for MI diagnosis.

Objectives and Methods

Other Related Work

There exist other six related works that focus on automatic ECG analysis for the prediction of structural cardiac pathologies, including two systematic reviews (52, 53), one meta-analysis (54), and three comprehensive reviews (44, 51, 55). Al Hinai et al. (52) assess the evidence for DL-based analysis of resting ECGs to predict cardiac diseases such as left ventricular (LV) systolic dysfunction, myocardial hypertrophy, and ischemic heart disease. Joloudari et al. (53) focus on ML and DL techniques for myocardial infarction disease (MID) diagnosis but just cover 16 papers regarding DL methods. Grün et al. (54) include a total of five reports to provide an overview of the ability of AI to predict heart failure based on ECG signals. Attia et al. (55) discuss AI ECG algorithms for cardiac screening including LV dysfunction, silent atrial fibrillation, hypertrophic cardiomyopathy, and other structural and valvular diseases. Jothiramalingam et al. review papers that consider ECG signal pre-processing, feature extraction and selection, and classification techniques to diagnose heart disorders such as LV Hypertrophy, Bundle Branch Block, and MI. Ansari et al. (51) comprehensively evaluate several hundred publications that analyzed the ECG signal and electronic health records (EHR) to diagnose myocardial ischemia and infarction automatically and point out that DL methods have not specifically been used to detect MI and ischemia prior to 2017.

Study Objectives

In recent 5 years, there has been an emerging number of studies focusing on state-of-the-art DL methods for MI detection and localization. Compared to the related work mentioned before, we only focus on DL techniques for MI detection and localization to consider more related papers. Covering a wide variety of studies focused on this topic in the review, we have set the following detailed goals: First, to describe and evaluate all public databases and newly collected datasets, as well as the commonly accepted assessment measures employed in the reviewed publications. Second, to assess and compare the different DL methods based on their respective performance in detecting and locating MI to find the most popular one. Finally, to discuss how these different DL methods contribute to MI detection and localization, as well as the primary potential obstacles of using DL algorithms in clinical practice for MI detection.

Study Selection

We search technical papers that deployed DL methods [deep neural networks (DNNs)] for MI detection and localization on Google Scholar, PubMed, and Web of Science. The following general search terms are used: (“deep learning” OR “deep neural network” OR “artificial neural network” OR “convolutional neural network” OR “CNN” OR “recurrent neural network” OR “RNN” OR “long short-term memory” OR “LSTM” OR “autoencoder” OR “deep belief network” OR “DBN” OR “generative adversarial networks” OR “GAN” OR “Restricted Boltzmann machines” OR “RBM”) AND (“electrocardiogram” OR “ECG” OR “EKG” OR “electrocardiography” OR “electrocardiograph” OR “electrocardiology”) AND (“myocardial infarction” OR “MI” OR “acute myocardial infarction” OR “AMI”). To avoid missing related papers, we also include some references that have been cited by others. Figure 3 shows the flow diagram of paper selection.

FIGURE 3
www.frontiersin.org

Figure 3. Flow diagram of paper selection.

Findings From Review

Datasets

A high-quality dataset can boost the improvement of data-driven model performance and generalizability, making DL techniques possible to be used in clinical environments. A total of 13 datasets are considered in all the reviewed papers. The largest database is the ECG-ViEW II database consisting of 979,273 recordings from 461,178 patients, which was collected over a 19-year study period in South Korea. The smallest dataset is the Long-Term ST Database (LTST), only covering 86 recordings from 80 subjects, which were collected in Slovenia. Thus, these datasets differ significantly in the number of ECG recordings. Among the investigated papers, 47 (more than 79%) research trained DL models on the public Physikalisch-Technische Bundesanstalt (PTB) Diagnostic ECG database and achieved high performance for MI classification. It makes the problem of the generalizability of these existing DL models worth considering, and clinical data is needed to verify the diagnostic efficacy of such models. Other public databases such as the LTST, the PTB-XL, and the ECG-ViEW II database are alternatives for some researchers. Notably, 7 studies are using the new ECG collections from some medical institutes in recent two years. For instance, Tadesse et al. (56) collected the ECGs from 17,381 patients (11,853 MI and 5,528 Normal cases) in the Provincial Key Laboratory of Coronary Heart Disease, Guangdong Cardiovascular Institute (GCI), and three subgroups: acute, recent, and old MI were divided by cardiologists based on patients' medical history with a combination of ECGs. In 12/13 of these datasets, ECGs are collected as voltage amplitude time-series signals in one dimension, however, there is also one ECG images collection for CNN model training. Khan et al. manually collected 11,148 ECG images from Ch. Pervaiz Elahi Institute of Cardiology Multan in Pakistan and cardiologists annotated the images. All collected datasets are listed in Table 3.

TABLE 3A
www.frontiersin.org

Table 3A. Properties of the collected databases used in the research of MI detection and location.

TABLE 3B
www.frontiersin.org

Table 3B. Description of newly collected datasets used in studied research.

• Leads: Standard 12 leads ECG are included in ten datasets, but few datasets only contain two or three leads ECG recordings such as the LTST and The European ST-T database (ESCDB). Each record in the PTB database includes 12 conventional standard leads together with a vectorcardiogram (VCG) of three Frank leads, and the 15 leads ECG can provide a more comprehensive assessment about the heart abnormalities. It is also the only database whose record has 15 simultaneously measured signals in all the datasets. In the reviewed literature, a small number of them adopted ECG signals of several specific leads according to their different study objectives, for instance, Lead V1, V2, V3 are adopted to detect AMI (68), and lead II, III, and aVF are used for detection of IMI (69). The three VCG leads record electrical heart activities in three orthogonal planes including frontal, sagittal, and transverse (70). In Yadav et al. (71), the best results obtained for 3 VCG leads were concatenated. According to Einthoven's law, there are linear correlations between limb leads, and these leads are electrically equilateral (72). Therefore, 8 leads (2 limb lead and 6 precordial leads) are considered as the non-redundant number of leads in Zhang et al. (73) to generate model training. The usage of ECG leads is shown in Figure 4A.

• Length of ECG recordings: Several databases contain short-term ECG recordings which are less than several minutes, such as 10 s (59) and 2 min (57), while some databases include long-term ECG data which is more than 1 h, such as 2 h (60) and 24 h (58). Sometimes, the length of 2 h is also relatively short for MI detection and limits the use of computational methods that analyze the signals over a longer period.

FIGURE 4
www.frontiersin.org

Figure 4. The number of leads used in investigated articles (A). The proportion of the usage of single lead especially the lead II ECG signals is the highest for MI detection, even higher than the standard 12-lead ECG signals. It is because this limb lead is coaxial to the cardiac conduction, the ECG signal in lead II has the largest forward waveform amplitude and the clearest waveform amplitude, so it can provide good ECG morphological information for MI detection (74). Furthermore, for the objective of MI localization, only 12-lead ECG data can provide comprehensive information and reflect different regions of the heart, so standard 12 lead ECG signals are employed in most research of MI localization. Percentage of metrics used in reviewed papers (B). Accuracy, sensitivity, and specificity are the three major metrics. Accuracy, which is considered in 79.7% of the articles, is the most frequently used metric, followed by sensitivity and specificity, with rates of 76.3 and 66.1%, respectively. Less than 20% of authors have considered AUC. Distribution of data splitting methods in MI diagnosis (C). They contain k-fold cross-validation (CV) such as ten- and five-folds, train-test separation such as 90:10 and 80:20%, and train-validation-test separation such as 70:15:15 and 60:10:30%. It shows tenfold CV is the most popular data splitting method, which has been considered in 20 articles.

Model Performance Evaluation

Metrics

Model performance evaluation is the main step in ML-based diagnosis. There are plenty of measurements for the performance of various diagnostic algorithms, just like accuracy, sensitivity, specificity, precision, F1-score, receiver operating characteristic curve, and area under the curve. There is still a lack of consistency in the performance metrics used for MI classification through using robust evaluation strategies to make results comparable and generalizable is of great importance. Figure 4B shows the percentage of metrics used in the investigated articles. All the metrics commonly used to assess the performance of models are summarized in follows:

Accuracy (Acc) = TP+TNTP+TN+FP+FN

Sensitivity (Sen) = true positive rate (TPR) = recall (Re) = TPTP+FN

Specificity (Spec) = TNTN+FP

Precision (Pr) = positive predictive value (PPV) = TPTP +F

False-positive rate (FPR) = FPTN+FP

Negative predictive value (NPV) = TNTN+FN

F1 Score (F1) = 2TP2TP+FP+FN

Classification Error Rate (CER%) = FP+FNTP+TN+FP+FN

Receiver operating characteristic (ROC) curve and the area under the curve (AUC)

Youden's J statistic (J- Measure): J = Recall + Spec – 1 = TPTP+FN + TNTN+FP 1

For classification in imbalanced data, Balanced Accuracy (BACC) and Matthew's Correlation Coefficient (MCC) are used:

BACC = 12 * (TPTP+FP + TNTN+FN)

MCC = (TP*TN)-(FP*FN)(TP+FP)(TP+FN)(TN+FP)(TN+FN)

where TP is a true positive, TN is a true negative, FP is a false positive, and FN is a false negative.

Performance in MI diagnosis is often measured by accuracy. Nonetheless, relying on accuracy alone might be misleading in the context of imbalanced data distributions, since it is easy to obtain a high accuracy score by simply classifying all observations as the majority class (e.g., the low prevalence of a disease). The AUC measures the ranking scores between predictions and targets, and we can identify the model that offers the best trade-off between specificity and sensitivity by using AUC (67). Medically speaking, AUC is more informative than accuracy. Other than the ROC plots and the AUC, the metrics mentioned above are all single-threshold measures and none of them can give an overview of performance when thresholds are varied.

In the F1 score, precision and recall are weighted by a harmonic mean. It is commonly used in binary classification problems. F1 score accounts for precision and recall of positive observations while accuracy reflects correctly classified positive and negative observations. It will make a huge difference for imbalanced problems since the model generally predicts true negatives. When both true negatives and true positives are equally important, then accuracy should be selected. If the dataset is heavily imbalanced or the positive class is mostly cared about, the F1 score is a better choice.

Data Splitting

In DL methods, data splitting is of great importance for performance evaluation. The model will likely fit the data to a maximum extent if all the original data are used to train the model while performing terribly for new data. Training-validation-testing data splitting is commonly used to prevent the over-fitting problem of the models. The three subsets perform different functions: the training set is used to train models and adjust parameters. The validation set evaluates whether the efficiency of model training goes in a bad direction and participated in the process of parameter tuning (hyperparameter), and the test set is used to evaluate the generalization capability of models. The proportion of data splitting varies in different studies, and there may be no validation set in some studies. K-fold cross-validation (CV) is a dynamic validation technique that aids in mitigating the impacts of data partitioning. The dataset is separated into k folds in k-fold CV. The model is trained using K-1 folds, and its accuracy is evaluated using the remaining 1-fold. This procedure will be repeated k times, with the final model performance calculated by averaging the results. Figure 4C depicts the distribution of data splitting approaches in the investigated MI research.

In some research, the class-based and the patient-specific experiments are designed to explore intra-individual variability (AIV) and inter-individual variability (RIV), respectively. It comes down to two different ways of data splitting. AIV can be investigated when the same individual patient's ECG heartbeats simultaneously appeared in both the training and testing datasets, whereas a patient-independent evaluation paradigm needs to confirm no overlap of individual patient data between the two datasets. In all the pieces of literature that employed both intra- and inter-patient experiments simultaneously, we can see that the models deployed in an intra-patient experiment always have overestimated prediction, this is due to the data in the test dataset has been partly occurred in the training dataset. Training and testing with the same data results in overfitting that yield to overestimated performance of the algorithm. Therefore, this kind of excellent performance does not mean this model can classify and detect unseen things very well in the future. RIV is a significant challenge for automated diagnosis since the ECG beats of the patients showed different characteristics, so this type of data splitting which is employed in an inter-individual scheme accords with the real world more than that in an intra-individual one. The evaluation results of the model deployed in the inter-patient experiment would better represent the real performance in real-world applications since networks are trained using a dataset of recorded patient data but applied to new patients and obtain better generalizability.

Pre-processing

Instead of relying on the complex steps of feature extraction, DL methods learn the intrinsic characteristics directly from raw or low-level processed data. Relatively little preprocessing such as resample, denoise, segmentation, and data balance are also necessary for some research. Resampling ECG data to obtain a consistent sampling rate is essential to maintain consistency. In this stage, the ECG beats are changed into the same periodic length for the DL models. ECG waveforms can be subsequently deformed by two major artifacts including high-frequency noise as well as baseline wander (BW) caused by breathing and patient movement (75). Wavelet transform methods performed well in eliminating the ubiquitous ECG noises. Daubechies wavelet 6(db6) mother wavelet basis function and Savitzky-Golay (SG) smoothing filter are commonly used to filter out high-frequency noise and correct baseline wanders. In Acharya et al. (76) and Liu et al. (77), the ECG signals with noise and without noise are both used for CNN models to compare the performance, and the results illustrated that the higher quality denoised ECG signals can improve the performance of models. However, in Zhang et al. (78), heartbeats with and without noise are adopted in contrast experiments to verify the model's robustness to noise, and the research results indicated that the experiment using ECG heartbeats with noise achieved better performance. The main reason for this anomaly is that some informative features will be missed while using wavelet to denoise since the noise domain has an unescapable overlap with the information domain. Heartbeats are segmented by distinctive points, notably fiducial R-point since R-wave has an extremely high amplitude and plainly visible peak. Pan Tompkins algorithm is the most widely used R-peak and QRS-wave detection method and modified Nagatomo's method was conducted to automatically update the algorithm of R-peak threshold in Sugimoto et al. (79). According to the position of R-peak, sliding windows are generated with the same number of heartbeats to control the size of ECG records which inputted into the model due to the length of ECG signals varies in datasets. Data imbalance occurs when the number of samples for different classes varies greatly in a classification task, and it is another common issue of influencing model performance since it almost exists in all datasets. Low-degree imbalanced data does not matter much whereas there is a great deal of highly imbalanced data in medical diagnostics of the clinical setting. Large data skew can affect the prediction results. Several approaches have been discussed in reviewed literature to best address the problem of imbalance. Rai and Chatterjee (80) proposed a Synthetic Minority Over-sampling Technique (SMOTE) to create synthetic samples for minor classes instead of copies. SMOTE is a representative algorithm of the oversampling method. Hammad et al. (81) and Dai et al. (82) added a new loss function called focal loss (FL) to address data imbalance and results in Tripathy et al. (83) indicated that FL increased MI detection accuracy by 9%. Cao et al. (84) chose Balanced Cross-Entropy, which is a modified version of the cross-entropy loss function. To forcibly strengthen the robustness of DL models, data augmentation is also used to deal with the scarce class data and to supplement the trained model with more diverse and representative data. Alghamdi et al. (85) investigated MI ECG segments with and without the augmentation technique in the proposed model, and results indicated that the model with data augmentation technique achieved better performance than that without one. Besides that, in Darmawahyuni et al. (86), researchers introduced two performance metrics Balanced Accuracy (BACC) and Matthew's Correlation Coefficient (MCC) to evaluate the performance in their proposed study which the imbalanced ratio reaches 4.57. Notably, the balanced data helps to improve the model performance, nevertheless, there is a gap between balanced distribution and real condition (87).

Furthermore, to solve the problems of amplitudes scaling and eliminating offset effect, some researchers centralized and normalized each heartbeat, respectively (43, 68, 84). Rare studies converted one-dimensional (1-D) ECG data into two-dimensional (2-D) images and treated them as computer vision tasks. In this research, signal preprocessing steps can be nearly avoided.

Architectures

A plethora of studies have utilized DL algorithms to vastly improve ECG waveform-based heart diseases classification. With a focus on MI classifications, aside from binary classifiers for detection of MI and non-MI, more studies have also accomplished multiple-class classification for localizing the distinct injury parts of the myocardium. According to a content review, six main types of DL models are concluded: CNN, LSTM, CRNN, ResNet, AE, and GRU. The popularity of each type of model in MI detection and localization is illustrated in Figures 5A,B, respectively. The usage changes over the years of the six methods are illustrated in Figure 5C. The popularity of the CNN model can be reflected in terms of research purposes and annual classification models. In the following sections, each of these models will be discussed in more detail.

FIGURE 5
www.frontiersin.org

Figure 5. Percentages on different deep learning (DL)-based models for MI detection (A) and localization (B). Convolutional neural network (CNN) is the most common technique used for both MI detection (with 60% contribution) and localization (with 58% contribution). Convolutional recurrent neural network (CRNN) (for MI detection) and ResNet (for MI localization) are the next most frequently used learning techniques in articles. Distribution of articles focused on each DL method in recent 5 years (C). This figure shows the usage changes over the years of the six methods. Another four types of DL methods have been emerged for MI diagnoses from 2019 compared to that in 2018, while only CNN and CRNN were used in 2017 and 2018. CNN dominated the model categories in each year, and the proportion of CNN shows a gradual upward trend from 2019 to 2021 (33, 47, and 79 in 2019, 2020, and 2021, respectively).

CNN

Convolutional neural network (CNN) is not only well-known for its ability to complete computer vision tasks but also excellent in speech recognition, natural language processing, and signal analysis. CNN's basic structure is composed of a convolution layer followed by batch normalization layer, rectified linear activation function, pooling layer, and fully connected layer. The convolution process entails convolution of the input maps by kernels and then adding a bias to make the output maps. Multiple convolution layers and pooling layers are generally selected and set alternately, and the different convolution layers could extract different levels of features. The direct use of raw ECG data effectively alleviates the information loss which results from the process of handcraft feature extraction and selection. Table 4 lists the details of reviewed papers using the CNN model for MI detection and localization.

TABLE 4
www.frontiersin.org

Table 4. Properties of some notable convolutional neural network (CNN)-based ECG MI detection and localization.

• Single-lead and multi-lead CNN. As shown in Section Datasets, the ECG data of lead II is commonly used for MI detection. In a highly cited paper (76), a deep 11-layer CNN was implemented using ECG beats of lead II for MI detection and obtained an average accuracy of 95.22 and 93.53% without noise and with noise, respectively. The data between leads is independent and the combination of ECG signals from multi-lead reflects heart features on multiple scales, so several methods are employed to ensure the data independence among different leads, such as the multi-channel technology (68, 84), the lead asymmetric pooling (LAP) and sub-2-D convolutional layers (103), independent feature branch (43), and inception blocks (69). In these multi-lead CNN models, correlations containing intra-beat, inter-beat, and inter-lead are captured. Each lead is in line with an independent channel or feature branch, and the feature map of each lead will be concatenated and integrated into fully connected layers for detection and localization. Through training samples, each lead can find which 1-D kernel is most suitable for them. In Cao et al. (84), the results show that the comparative single-channel CNN model forcibly used the same kernel in different lead data and performed worse than multi-channel CNN which can obtain better feature representation. It is well-demonstrated that the proposed multi-channel CNN model meets the independence of data. Instead of multi-lead ECG data, a multi-branch CNN was proposed by Hao et al. (92) by dividing ECG images evenly into 12 branches based on 12 leads and contributing to 12 separate networks.

• 1-D and 2-D CNN. In CNN-based MI detection methods, the two types 1-D CNN and 2-D CNN are commonly used. 1-D voltage amplitude ECG data which is represented as a time-series signal is input to the 1-D CNN model. Because of that using ECG images as inputs of the CNN model is more corresponds to the way that cardiologists diagnose and analyze abnormalities, an emerging number of researchers adopted 2-D CNN when ECG signals are treated as an image. The 2-D CNN models used for detection are always trained through a transfer-learning scheme including the pre-trained Google Inception V3, GoogLeNet, MnasNet, and VGG-Net rather than trained from the ground up, and that is the idea of retraining the existing models. The 2-D grayscale image models use the snapshots of 10 s or two consecutive R-peaks ECG images which are transformed from continuous ECG temporal dynamics. The feature encoding of time-frequency spectrograms is also performed as a computer vision task, and it contains multiscale waveform features and the spatial correlations between these features. Using the Gramian Angular Summation/Difference Fields (GASF/GADF) method to transform 1-D ECG signal into a 2-D image, Zhang et al. (78) developed a PCANet for detecting prominent features. In terms of this transformation, the loss of information that results from preprocessing such as noise removal can be prevented. Alghamdi et al. achieved the highest accuracy of 99.02 and 99.22% using VGG-MI1 and VGG-MI2 models, respectively, which are fine-tuned from pre-trained VGG-Net so that their proposed automatic diagnosis system obtains the ability to deploy in urban healthcare (85). Notably, the aforementioned sub-2-D convolutional layers employed in multi-lead feature analysis differ from conventional 2-D CNNs since, in traditional 2-D CNN, the convolutional calculations are performed in both row and column directions, which make it suitable for image recognition. However, this convolution operation undermines the independence of different leads because the ECG data of different leads (column direction) within the same time (row direction) will be convolved. Some intra-lead local changes in multi-lead ECG cannot be captured in conventional 2-D convolutions, and normal pooling based on a single pooling factor cannot efficiently utilize the multiscale features. These characteristics make the conventional 2-D CNN very unreasonable for multi-lead ECG classification.

• CNN for further detection. In the investigated studies based on CNN, MI detection and localization are two main objectives. However, there are also several extended research such as detection of myocardial scar (MS) (67) and prediction of occurrence-time in MI (56). MI results in myocyte necrosis, which is replaced by fibrous scars due to the myocardium being very weak at regeneration, leading to arrhythmias, heart failure, and even sudden death. MS is the indicator of IHD, and cardiac magnetic resonance imaging (CMI) with late Gd enhancement (LGE) is the standard method for diagnosing MS and assessing structural conditions of the myocardium (104) Gumpfer et al. (67) first try to adopt a deep learning model to predict MS based on ECG and additional clinical parameters and achieved better performance with an AUC score, sensitivity, specificity, and accuracy of 0.89, 70, 84.3, and 78%, respectively, on combine model (ECG data together with clinical parameters) than that on ECG model. The results suggest that further information on patients' clinical characteristics can improve the prediction outcomes. Compared to the DL method applied for ECG analysis, there is also large research focusing on automatic approaches for MS detection from MRI. In Moccia et al. (105), a 2D CNN model toward automatically segments MS for MI quantification based on ENet is proposed and the best-performing achieved 97% median accuracy and ~71% Dice coefficient over a 30-subject cohort, however, the computational training time is too long. The ECG interpretation for the detection of MS is insufficient and complex for healthcare professionals to diagnose MS so that ECG cannot be applied as an alternative for MRI currently, but the disadvantages of MRI are also obvious, such as expensiveness and limited usability for patients with severe renal impairment. Further exploration of automatic techniques for the detection of MS is needed.

LSTM

Based on recurrent neural networks, Hochreiter and Schmidhuber (106) first proposed the concept of LSTM. This network gained great popularity in speech recognition and machine translation in recent years. It is an expert in dealing with sequential data and could handle the vanishing or exploding gradient problem in the backward pass by entering a gate mechanism, which makes it more applicable than a recurrent neural network. The sequential characteristic of ECG data is in line with the dependency relationship between input and output data. The vanilla version of the LSTM unit is composed of four parts: input gate, memory block, forget gate, and output gate. Input and output gates are responsible for updating information across time-steps, while gate mechanisms control the input and output flow of information to the memory cells so that the LSTM can store or discard data selectively. Notably, researchers prefer to employ an LSTM-based method to detect MI not to localize myocardial infarct regions. Table 5 lists the details of papers using the LSTM model for MI diagnosis. Darmawahyuni et al. (86) compared the performance of LSTM with that of other two sequential model classifier standard RNN and GRU within different data splitting and results indicated that a simple LSTM network with 90:10% for training and testing set presented the best specificity of 97.97% for MI detection than RNN and GRU. The bidirectional LSTM (Bi-LSTM) network is a variant of LSTM. In Zhang and Li (107), the proposed Bi-LSTM model provided the input sequence in both forward and backward ways, which simultaneously capture past and future information, and the heartbeat-attention mechanism was incorporated to improve the 2% accuracy of MI detection and reached 94.77%. In Zhang et al. (108), researchers used 8 leads ECG data to detect AMI and IMI and achieved the highest accuracy of 99.91%.

TABLE 5
www.frontiersin.org

Table 5. Properties of some notable long short-term memory (LSTM)-based ECG MI detection.

CRNN

A CRNN is the combination of two parts: CNN and RNN. As GRU, LSTM, Bi-LSTM, and Bi-GRU are all variants of RNN, CRNN comes in various forms such as CNN-LSTM (90, 100, 111115), CNN-BiLSTM (116, 117), and CNN-BiGRU (15). Heart rate variability (HRV) is a measure of the amount of variation in the R-R interval from beat to beat (118). Some CNN-based methods just classified single heartbeat or ECG segments so that the beat-to-beat HRV cannot be utilized by them, whereas LSTM is well-suited to process heartbeat sequences and can analyze the beat-to-beat variations of the ECG morphology after the convolutional layers. Convolution networks extract spatial features and temporal properties are acquired through LSTM. According to the characteristics of the two networks, some researchers prefer to combine CNN with LSTM to create hybrid models. Theoretically, the combination of the two-deep learning techniques can capture any subtle morphological changes as well as beat-beat HRV and makes it excellent applicability in MI detection and localization. It is adopted by replacing one fully connected layer of the CNN classifier with an LSTM layer to create the CNN-LSTM classifier in Lui and Chow (119), and the technique of stack encoding is also added into the hybrid model to exhibit improved performance. Based on MFB-CNN in Liu et al. (43, 116) added a Bi-LSTM into a single model to abstract all 12 feature branches in total, and the model with the Bi-LSTM module can achieve better performance than the one without Bi-LSTM, notably, this is possible because Bi-LSTM is efficient in logical dependencies. Similarly, a shallow 1D CNN and a Bi-LSTM layer (117) were used to classify MI subjects from 21 temporal features which were collected from the 12-lead data. In Tadesse et al. (97), a 2D spectral-based CNN which enables cross-domain transfer learning from pre-trained GoogLeNet was used to encode frequency-time characteristics, and the corresponding spectral dense features are then fed into LSTM-based longitudinal modeling. Just one researcher hybrid the CNN with BiGRU (15), a novel MLA-CNN-BiGRU framework with multi-lead attention mechanism integrated for automatic MI detection and localization is employed, satisfactory performance was obtained in MI detection under intra- and inter-patient experiments but the MI location under inter-patient scheme needs further improvement. Table 6 lists the specifications of investigated papers using the CRNN model for MI diagnosis.

TABLE 6
www.frontiersin.org

Table 6. Properties of some notable convolutional recurrent neural network (CRNN)-based ECG MI detection and localization.

AE

Autoencoder (AE) was introduced as early as 1994, and it is another form of artificial neural network (ANN) for reducing dimensionality. The extended capability of this model is strong, and it can be applied to data dimension reduction, feature extraction, and data visualization analysis (79). It contains an encoder and decoder. The coding process involves converting the input vector to a hidden representation and then returning it to its original form in the decoding process. The efficiency of the learning process can be improved because the input vector is transformed into a lower-dimensional representation in the coding process. Staked Sparse Autoencoders (SAEs) (74) and convolutional autoencoder (CAE) (78) are both variants of an autoencoder and are used for feature extraction. Table 7 lists the properties of some AE models for MI classification. Zhang et al. (74) structured shallow SAEs which were trained in unsupervised learning to extract discriminative features, nevertheless, the classification was completed by the shallow classifier TreeBagger. Sugimoto et al. (79) constructed a CAE-based model to learn the temporal features of normal beats and calculate the deviation from the normal waveform for the input signals. Finally, the KNN classifier was employed to categorize the error vectors into one of 11 classes. In Zhang et al. (78) and Cho et al. (65) proposed a variational autoencoder (VAE) to reconstruct a precordial 6-lead ECG using a limb 6-lead ECG. The encoder and decoder comprised 6 CNN layers each and were connected by a 1-D dense layer. The DL-based algorithm with VAE is one novel point of this retrospective research, and it outperformed conventional DL methods for interpreting MI or STEMI using 6-lead ECG.

TABLE 7
www.frontiersin.org

Table 7. Properties of some notable autoencoder (AE)-based ECG MI detection and localization.

ResNet

Deep residual networks were originally conceived for visual recognition tasks to overcome the degradation phenomenon that occurs in deeper networks with the network depth increasing. In 2015, ResNet was introduced by He et al. on the ILSVRC & COCO 2015 competitions and achieved excellent performance in image detection and classification tasks which greatly exceeded the level of other approaches in the previous years, consequently winning 1st place on ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation in ILSVRC (124). The obvious characteristic of ResNet is the considerable depth which enables the network to have an extremely strong ability of representation. Recently, the generic ResNet framework has been applicable in time series classification tasks. Table 8 lists the details of notable papers using the ResNet model for MI classifications. The existing residual skip CNN was trained to detect MI and improved 4% in the accuracy compared with benchmark performance in Gopika et al. (125). In Strodthoff's approach, the two variants of CNN including fully convolutional architectures and ResNet-inspired architectures with skip-connections were adopted to distinguish AMI and IMI, and both architectures showed similar performance when applied to ECG data with multiple leads. In this research, the ECG signal matrix was the form of input and directly fed into the first 2-D convolutional layer of the ResNet (126). Differently, in Jafarian et al. (39), a single 1-D convolutional layer of each ECG lead was treated as a front-end to extract a pseudo-time-frequency representation and summarized as multi-lead discriminative features which are then input the first 2-D convolutional layer, and then three residual blocks were followed. The performance results showed that there is just one miss-classified MI in the five-folds cross-validation so that the average accuracy, sensitivity, and specificity of MI detection and localization reached almost 100%, respectively. Similarly, Han and Shi (127) developed a unique multi-lead ResNet with three residual blocks and feature fusion using ECG recordings from 12 leads to detect and localize 5 types of MI. Following three residual blocks, the global average pooling (GAP) layer and dropout are applied to improve generalizability. Furthermore, experiments under intra- and inter-patient schemes were both implemented and achieved an accuracy of 99.92 and 95.49% for the two schemes, respectively. Attractively, Wang et al. (128) proposed a multi-lead ensemble neural network (MENN) with three sub-networks and generated 15 and 12 sub-networks for AMI and IMI detection, respectively. The architecture of Net1 includes two residual blocks with reference in Xie et al. (129) to extract features, and using ResNeXt, Net3 realized a multi-branch residual network by adding group operations to the convolutional operation of the residual blocks. Furthermore, Net2 used two inception blocks. The three sub-networks extracted ECG features in different levels and the ensemble of three nets had superior performance on detection of AMI and IMI.

TABLE 8
www.frontiersin.org

Table 8. Properties of some notable ResNet-based ECG MI detection and localization.

GRU

A Gated recurrent unit (GRU) is likewise intended to solve the vanishing gradient problem of RNN. GRU has fewer trainable parameters and lowers computational complexity than LSTM, making it an enhanced form of LSTM with a faster training process. The reset gate and the update gate are the two most significant gates. The update gate is a merge component of input and output gates in standard LSTM, and it focuses on the past information of the previous moment which should be kept. The reset gate controls the previous information that should be disregarded in the current hidden state. In Prabhakararao and Dandapat (130), the GRU is chosen as the basic unit of RNN, and two multi-lead diagnostic attention-based RNN (MLDA-RNN) models were proposed for the classification of three MI severity stages including early, acute, and chronic MI. When RNN encoding size and latent-space size are adjusted to 32 and 64, respectively, the proposed model achieved the best accuracy of 97.79%. Based on (130), three GRU models were designed with no attention, only intra-lead attention, and intra- and inter-lead attention (IIL) modules respectively in Prabhakararao and Dandapat (131). The intra-lead attention module concentrates on the most discriminative ECG characteristics of each lead to generate a lead-specific attentive representation (LSAR), whereas the inter-lead attention module integrates all 12 representations. The two modules are used for reducing intra- and inter-lead redundancies. An ablation model with IIL modules as well as patients' clinical features was proposed to detect MI and outperformed other existing models, showing an accuracy of 98.3%. Be analogous to Bi-LSTM, a basic Bi-GRU consists of backward- and forward-propagating GRU units, and it can utilize information of past and future time step. The Bi-GRU model was adopted on eight ECG leads to locate five classes of MI and obtain an overall accuracy of 99.84% in Zhang et al. (73), and such excellent performance enables it to be applied to the computer-aided diagnostic platform as a MI localization algorithm. Table 9 summarizes the properties of the GRU model for MI detection and localization.

TABLE 9
www.frontiersin.org

Table 9. Properties of some notable GRU-based ECG MI detection and localization method.

Model Performance

Each original research covered in this review yielded encouraging diagnostic performances regarding the utility of DL in ECG interpretation. The advantages and disadvantages of the six different DL methods are reviewed in Table 10. To compare the most reported performance concerning the six different DL methods, Figures 6A,B display the maximum, average, and minimum accuracies of each DL technique for MI detection and localization, respectively. The maximum accuracies of DL techniques that were trained on reviewed ECG databases and used the different number of leads are summarized in Figure 6C. It provides us reference about selecting the appropriate data leads and datasets according to the purpose of the research since some smart bands and smartwatches record single-lead ECG and multi-lead Holter monitors. It is a little difficult and arbitrary to make a comparison of model performance among studies according to a single measurement due to variability that exists between different datasets and various metrics for measurements. However, some authors have conducted studies that compare different classification algorithms in their research, and using the same dataset makes the model performance more comparable. Table 11 summarizes the only six pairs of DL models reported in the investigated paper. This is also a flaw in MI studies. Studies comparing multiple DL techniques are expected to be conducted in future works.

TABLE 10
www.frontiersin.org

Table 10. Advantages and disadvantages of the six different deep learning (DL) methods.

FIGURE 6
www.frontiersin.org

Figure 6. Maximum, average, and minimum accuracies of each DL technique for MI detection (A). The reported highest accuracy for MI detection is 99.99% in the ResNet model, followed by CNN with an accuracy of 99.95%. The investigated papers with the reported highest accuracy for each model are all beyond 97%, and the average accuracy for each model is all beyond 93%. The reported minimum accuracy for MI detection appears in the CNN model with 78%. Maximum, average, and minimum accuracies of each DL technique for MI localization (B). The reported highest accuracy for MI localization is 99.87% in the CNN model. There is no research focusing on applying the LSTM model to MI localization, and only one research for AE and GRU model, respectively, applied to MI localization. Therefore, the maximum and average accuracies of autoencoder (AE) and gated recurrent unit (GRU) models are the same, but the minimum is vacant. The minimum accuracies of the remaining three models are all below 65% in MI localization. Maximum accuracies of DL techniques which were trained on investigated ECG datasets and used the different number of leads (C). Generally, the network that trained on 12 leads ECG data of PTB database has gained higher performance than that on smaller number leads of ECG data. ResNet model with 12 lead ECG data of PTB database achieved the highest accuracy 99.99% in MI localization. The usage of only lead II ECG data has also achieved good results in CNN, CRNN, and AE methods on the PTB database. The main reason is probably the three research with high performance that used only lead II ECG data are just for MI detection, and they cannot obtain matching results in MI localization.

TABLE 11
www.frontiersin.org

Table 11. Comparison of the methods that were reported in the articles.

Discussion, Opportunities, and Open Issues

The number of articles about DL methods that were specifically used for the detection of MI has been largely increasing in recent 5 years. It also suggests that more and more researchers have explored the possibility of improving the performance of the diagnostic methods in DL. According to the detailed findings from the review, we also find some limitations about data quality, robustness, and interpretability of models. In addition to discussion about these limitations and challenges, new research opportunity regarding DL methods for detecting and localizing MI related to other current technologies such as the Internet of Health Things (IoHT) is also emerging. Besides that, open issues about privacy in healthcare data, the lack of annotated data of admissible quality, and the explainability of DL models are also being discussed.

Limitations

Data Quality

First, the most mentioned limitation is the need for good quality and a large number of datasets. The sample size is smaller than 300 in five datasets investigated in this review, thus, sizable data is needed. PTB database has been the first choice in nearly 80% of research for MI detection and localization, although that includes using multiple different datasets simultaneously containing PTB. The usage of a singular dataset result leads to DL algorithms being too over-fitted to this dataset, hence the robustness of DL models cannot be guaranteed. Second, the demographic diversity of subjects which could affect the generalizability of DL models to ECG interpretation remains a challenge. From the used datasets in the reviewed papers, all the 13 datasets were collected from just a few continents, such as Europe (German, Slovenia, and European community), Asia (China, South Korea, and Pakistan), and North America (USA). When researchers in distant countries train their models using a uniform data set, the racial difference perhaps makes biased model predictions. It is important to focus on the problem of racial bias when considering AI into disease diagnosis since the diagnosis results to be biased by race are not expected. Several examples confirm the existence of this problem. For instance, Google's algorithm for diagnosis of diabetic retinopathy performed poorly in populations in India outside where the model had been developed (132) while Amazon recently created a facial recognition system, “Rekognition,” that was proficient at detecting lighter-skinned men but had trouble identifying darker-skinned women and men (133). Noseworthy et al. conducted a retrospective cohort analysis to assess the impact of race on the performance of their CNN model to detect low left ventricular ejection fraction (LVEF), but the prediction results did not correspond with the ethnic disparities. That suggests that the ECG features associated with LVEF rather than ECG are race-invariant. However, the effect of racial disparity on DL algorithms for other disease diagnoses should be further explored (134).

Performance

With the success of CNN architectures in these diverse domains such as image recognition and natural language process, researchers have applied them to time series analysis (111). In this review, CNN obtains the most popularity and gets as good performance as ResNet models in both MI detection and localization according to the previous statistical analysis. The analysis indicates that most researchers prefer to choose such a relatively old but proven reliable model and gradually improve this model to make the performance better and better in MI detection and localization. Then, it can be shown that the difference between the maximum accuracies of each DL technique for MI detection and localization is relatively small in Figures 6A,B. For instance, the difference between the maximum accuracies of CNN, CRNN, AE, and ResNet in MI detection, and the maximum accuracies of CNN and LSTM in MI localization are all <0.1%. Even the maximum accuracy of ResNet in MI detection is 0.04% higher than that of CNN, and the maximum accuracy of AE in MI localization is 0.01% higher than that of CNN. When these two tasks of detection and localization are considered simultaneously, CNN just wins a narrow victory. Finally, simply comparing the accuracy of different models in different experimental settings (data, model training, etc.) is a bit arbitrary, the lack of a unified standard for measuring and comparing is the main limitation. Another important problem is related to data imbalance. In the investigated studies, the ratio of different classes of data is generally below 1:4, it shows a slight imbalance and just a few researchers add techniques to deal with the problem. The AUC is an important statistical metric in the medical domain and is robust to imbalance distribution of positive and negative samples, and it can visualize this problem in plots, however, most researchers use accuracy to evaluate the model performance instead of AUC. To merely analyze the experimental data in the investigated papers, we found the size of the data seems to have an impact on the model performance. For instance, the accuracies of models trained on larger datasets (17, 43, 85, 90, 100) are higher than those on smaller datasets (67, 69). However, model performance is emphatically not influenced by a single factor. In most situations, more data is usually better, but more data does not always equivalent to a more accurate model since high bias models will not benefit from more training examples. Overfitting should also be considered.

Robustness

The robustness of a model refers to its ability to withstand perturbations or accurately classify the input data without adjusting its initial parameters. In brief, robustness can be understood as the tolerance of the model to data changes. In the reviewed studies, researchers adopted specific techniques to denoise, and the noise-free ECG signals were then input to networks. In real-world applications, the clinical setting is complicated and changeable, and ECG waveforms can be deformed by many external forces. The original ECG signal with noise will be the input for automatic devices. Therefore, adding some noise to the training samples is necessary and can improve the robustness of the models. Zhang et al. (78) has proved it. Zhang et al. adopt heartbeats with and without noise in contrast experiments to verify the model's robustness to noise, and the results indicate that the experiment using ECG heartbeats with noise indeed achieves 1% higher accuracy than that without noise. Notably, too much noise will make the training error larger. However, in Han et al. (135), show that even a model with very high accuracy could misclassify a normal sinus rhythm (NSR) recording as atrial fibrillation (AF) with high certainty by adding small perturbations to an ECG.

Explainability

The effect of “black box.” This limitation is also mentioned in Section Machine Learning for MI Diagnosis. The characteristic of learning features automatically from raw data rather than done manually makes the interpretability of DL approaches remains a general challenge, which is especially important for medical applications (112), because the mysterious processes may not be acceptable for medical professionals. There have been most research put forward exploratory studies for the application of interpretability methods in computer vision, whereas few applications covered time series ECG data. For example, attribution methods were investigated in Strodthoff and Strodthoff (126), which allows qualitative study if the DL algorithm uses similar features as human cardiologists. In some studies (67), explainable artificial intelligence (XAI) is still regarded as the future work to elucidate the mechanisms of DL models.

Opportunities

Wearable devices have shown their potential for risk assessment, screening, early diagnosis, and patient management in some cardiovascular diseases such as hypertension, atrial fibrillation, heart failure, peripheral vascular disease, coronary artery disease, and so on (113). The MiCORE study in Marvel et al. (114) proved wearables' value of ECG monitoring in MI prevention. Two hundred patients with type I MI are enrolled, and the wearables include an Apple Watch and a Bluetooth BP cuff. Preliminary results showed a 43% lower likelihood of the first readmission within 30 days among patients in the Corrie Health Digital Platform (Corrie) than among participants in a historical comparison group. In the investigated studies, most researchers just build the DL models for MI detection and localization by using a fixed ECG dataset and few have deployed their developed models with great performance in wearable devices. Rashid et al. (95) designed a Binary Convolutional Neural Network (BCNN) for low-power and low-memory wearable devices with a single lead sensor, the microcontroller is similar to that of the SmartCardia device. The energy consumption evaluation shows that their model achieves 8 × and 12 × energy efficiency compared to other related works, so it is energy efficient and memory-saving for wearables. The current power consumption constraints of wearables might make researchers try to compress the DL models since the memory of digital devices is limited, and models using different lead data can be used in different devices.

Cloud computing offers some help to healthcare providers, such as hospitals and clinics, who need quick access to large storage facilities and computing. With cloud computing, healthcare providers can easily share healthcare data across regions, eliminating delays in patient care to realize smart medical monitoring. Cloud computing can be integrated into a platform for interaction and data sharing, especially for remote and personalized patient care in the medical (115). In medical diagnosis, various algorithms with promising performance have been proposed in large amounts of research, but they are rarely applied to an interactive platform and non-related professional and technical personnel have trouble using these models. Recently, an easy-to-operate platform LINDA which is designed for online medical image recognition has been proposed for common users without specific knowledge in the diagnostic area (136). The contribution of the LINDA system reflects in three aspects: firstly, it uses computational intelligence to recognize abnormalities instead of knowledge-based feature extraction. Second and most importantly, an application of image processing can be shared with other professionals to obtain more shared data for the improvement of model performance. Data privacy must be considered when involves data sharing. In the LINDA system, the hash code is used to prevent parameters from passing, and the number of requests to the prediction API is limited to reduce service attacks. In addition, the traffic assessment tools are also conducted. The third contribution is that it supports creating a large image database to self-improve. Similarly, in MI diagnosis, DL techniques can also be integrated into such a cloud platform to achieve data sharing and remote diagnosis, which further works are expected to explore.

Open Issues

We are in the era of data explosion, the usage of smartphones, wearables, and IoT devices makes new data generate all the time. There is a lot of information that can be mined by DL algorithms in the massive data, but it also raises challenges about privacy. DL methods are data-driven methods. In the medical domain, the ethics related to patients' data that are being used for DL model prediction is of utmost importance. It makes sense to develop a solution that tries to find a balance between data utilization and privacy protection. The four stages of the life cycle model of big data in healthcare involve data collection, storage, sharing, and analysis process (137). Healthcare data subject involves individual patients, healthcare institutions, government, research institutions, and industry. Privacy leakage concerns can exist in every aspect and need to take the corresponding technical approaches to deal with. In data collection, data anonymity and differential privacy are the main technologies. Due to the size of sensors and wearables are getting smaller, lightweight, and less complex differential privacy algorithms are needed to adapt to such devices. Cloud computing is largely used to store big data in healthcare, and it is common to use encryption and auditing to ensure data confidentiality and integrity. The risks from data sharing could be effectively controlled through access control technology. In this way, novel technologies such as blockchain have been considered in encryption. In Khalid et al. (138), a decentralized authentication and access control mechanism is proposed and can apply to many scenarios. In Parah et al. (139), a novel high payload and reversible EHR embedding framework based on left data mapping (LDM), pixel repetition method (PRM), RC4 encryption, and checksum computation is proposed. In addition to technical support, policies, laws, and regulations are also necessary such as the European General Data Protection Regulation (GDPR). Hedlund et al. draw up a data sharing policy to provide guidelines for activities in the Analytic Imaging Diagnostics Arena (AIDA) (140). Privacy protection techniques in data analysis are especially important since lots of hidden information can be dug for only after data is analyzed. A research hotspot of model privacy is machine unlearning (141), and it is expected to realize model forgetting with less computational. To achieve smooth interoperability between the different parties, making data management is important.

In the medical domain, the input samples are always accompanied by various artifacts which lead to the lack of annotated data of admissible quality. However, the limited available input data in real-world practice make ML models may capture specifically these artifacts which are nothing to do with the diagnostic task (142). In terms of the problem of imperfect data quality, solutions including data augmentation techniques, transfer learning, and domain adaptation provide help. How to generate high-quality input samples instead of simply producing more training data through data augmentation has become the key issue. Recently, there are quite a few related studies, such as the classic Synthetic Minority Oversampling Technique (SMOTE), MixUp (ReMix), Hard Negative Mixing (143), and good-enough example extrapolation (144) all aim to improve the quality of resampling of small samples. The transfer learning emerges for solving the problem of insufficient annotation data, so it can be carried out under fast modeling, annotation deficiency, and small data modeling. However, the parametric transfer learning methods are time- and cost-consuming to tune, hence restricting the wide applicability in computational constraints of wearables devices. In Wang et al. (145), propose a first easy non-parametric transfer learning approach, the result of the extensibility experiment shows this easy approach can efficiently achieve good performance without requiring feature learning algorithms. Domain adaptation (DA) is an important part of transfer learning to solve the problem that data distribution between the source domain and target domain does not correspond with the independent and identically distributed (IID) condition. Gradient reversal layer (146) and generative adversarial network (GAN) (147) have been successfully applied in unsupervised domain adaptation.

In terms of input samples with artifacts mentioned before, another helpful solution is the interpretability of the models. The interpretability could enable it to notice abnormal predictions before being widely used as a diagnostic tool. The methods to achieve interpretability enable people with specific experience to have the maximum understanding of a specific model when given specific data and tasks. Based on the existing methods, we can divide the methods to achieve interpretability into three stages, that is, before, during, and after model training. Before model training, the objective of interpretability lies in data analysis, and what we're trying to do is learn about the data to the maximum extent. The methods used generally include data visualization and statistical analysis such as maximum mean discrepancy (MMD). In modeling, directly building interpretable models is a vital method. Linear regression models and specific neural networks with an explicit pooling structure are all self-explanatory models. Linear models and some of their variants have good explainability due to a very solid statistical foundation. Attention mechanisms are always adopted to provide insight into model prediction. In Samek et al. (148), four post-hoc explanation techniques have been summarized as follows: interpretable local surrogates, occlusion analysis, gradient-based techniques, and layerwise relevance propagation (LRP). A popular approach of local surrogates is the Locally Interpretable Model-agnostic Explanations (LIME) (149), which proposes to explain single predictions of any ML classifiers, and attribution method based on Shapley values which is a framework firstly proposed in game theory is a kind of occlusion analysis (150).

Conclusion

Ischemic heart disease has been a major killer endangering human health. Myocardial ischemia leads to myocardial damage and necrosis, resulting in MI. Mild symptoms such as arrhythmias, ventricular premature, and ventricular block can be treated early but severe cases like cardiac arrest or even sudden death are catastrophic. Looking back on the news reported by the media in recent years, more and more young people, such as students, white-collar workers, and so on, are in a state of physical and mental fatigue for a long time due to the pressure of study and work, and unfortunately die suddenly. It suggests that this heart disease, which mainly occurs in the elderly, gradually tends to be younger. Therefore, achieving the early prevention of MI with long-term detection through ECG is vital to lifesaving. Based on the development of AI, big data, wearable devices, cloud healthcare, etc., automatic real-time monitoring of ECG data has become possible. In recent years, an emerging number of researchers engaged in the DL field, and DL methods have generally achieved outstanding performance in ECG interpretations. This review systematically summarizes the progress of DL methods for MI detection and localization, recaps some general limitations from the aspects of data, models, performance, and prospects the application of these DL technologies in clinical scenarios. It is hoped to provide some suggestions and references for researchers in related fields in model selection, the dataset used, and the construction of a cloud platform for real-time monitoring and diagnosis of ECG data. With the problem of data privacy always challenging, in the future, this monitoring paradigm could revolutionize cardiovascular care as soon as data security and other concerns are addressed. It is believed that more patients and sub-healthy patients can benefit from it and the mortality rate of cardiovascular diseases could decrease significantly.

Author Contributions

PX and GC conceived of this work. PX collected and analyzed the data and wrote the draft of the manuscript. PX, SL, and GC reviewed and revised the manuscript.

Funding

This work was supported by the Science and Technology Development Fund, Macau SAR (SKL-QRCM(UM)-2020-2022).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We acknowledge support from the State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau. We would like to thanks Servier Medical Art images (https://smart.servier.com) for the design of Figure 1A.

References

1. World Health Organization. Global Health Estimates: Life Expectancy and Leading Causes of Death and Disability. WHO (2020). Available online at: https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates (accessed December 27, 2020).

2. Manfroi WC, Peukert C, Berti CB, Noer C, Gutierres Dde A, Silva FT. Acute myocardial infarction: the first manifestation of ischemic heart disease and relation to risk factors. Arq Bras Cardiol. (2002) 78:392–5. doi: 10.1590/S0066-782X2002000400006

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Strong AT, Sharma G, Tu C, Aminian A, Young JB, Rodriguez J, et al. A population-based study of early postoperative outcomes in patients with heart failure undergoing bariatric surgery. Obes Surg. (2018) 28:2281–8. doi: 10.1007/s11695-018-3174-3

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Sheifer SE, Manolio TA, Gersh BJ. Unrecognized myocardial infarction. Ann Intern Med. (2001) 135:801–11. doi: 10.7326/0003-4819-135-9-200111060-00010

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Valensi P, Lorgis L, Cottin Y. Prevalence, incidence, predictive factors and prognosis of silent myocardial infarction: a review of the literature. Arch Cardiovasc Dis. (2011) 104:178–88. doi: 10.1016/j.acvd.2010.11.013

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Mehta RH, Lopes RD, Ballotta A, Frigiola A, Sketch MH Jr, Bossone E, et al. Percutaneous coronary intervention or coronary artery bypass surgery for cardiogenic shock and multivessel coronary artery disease? Am Heart J. (2010) 159:141–7. doi: 10.1016/j.ahj.2009.10.035

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Bulluck H, Yellon DM, Hausenloy DJ. Reducing myocardial infarct size: challenges and future opportunities. Heart. (2016) 102:341–8. doi: 10.1136/heartjnl-2015-307855

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Thygesen K, Alpert JS, Jaffe AS, Simoons ML, Chaitman BR, White HD, et al. Third universal definition of myocardial infarction. J Am Coll Cardiol. (2012) 60:1581–98. doi: 10.1016/j.gheart.2012.08.001

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Siontis KC, Noseworthy PA, Attia ZI, Friedman PA. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management. Nat Rev Cardiol. (2021) 18:465–78. doi: 10.1038/s41569-020-00503-2

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Thygesen K, Alpert JS, Jaffe AS, Chaitman BR, Bax JJ, Morrow DA, et al. Fourth universal definition of myocardial infarction. J Am Coll Cardiol. (2018) 72:2231–64. doi: 10.1016/j.jacc.2018.08.1038

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Meek S, Morris F. ABC of clinical electrocardiography. Introduction. I-Leads, rate, rhythm, and cardiac axis. BMJ. (2002) 324:415–8. doi: 10.1136/bmj.324.7334.415

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Tranchesi J, Teixeira V, Ebaid M, Boccalandro I, Bocanegra J, Pileggi F. The vectorcardiogram in dorsal or posterior myocardial infarction. Am J Cardiol. (1961) 7:505–10. doi: 10.1016/0002-9149(61)90507-0

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Hähnle L, Viljoen C, Hoevelmann J, Gill R, Chin A. Posterior infarction: a STEMI easily missed. Cardiovasc J Afr. (2020) 31:331–4. doi: 10.5830/CVJA-2020-059

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Gupta A, Huerta E, Zhao Z, Moussa I. Deep learning for cardiologist-level myocardial infarction detection in electrocardiograms. In: EMBEC 2020: 8th European Medical and Biological Engineering Conference. Cham (2021). p. 341–55. doi: 10.1007/978-3-030-64610-3_40

CrossRef Full Text | Google Scholar

15. Fu L, Lu B, Nie B, Peng Z, Liu H, Pi X. Hybrid network with attention mechanism for detection and location of myocardial infarction based on 12-lead electrocardiogram signals. Sensors. (2020) 20:1020. doi: 10.3390/s20041020

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Sharma LN, Tripathy RK, Dandapat S. Multiscale energy and eigenspace approach to detection and localization of myocardial infarction. IEEE Trans Biomed Eng. (2015) 62:1827–37. doi: 10.1109/TBME.2015.2405134

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Liu J, Wang R, Wen B, Liu Z, Miao F, Li Y. Myocardial infarction detection and localization with electrocardiogram based on convolutional neural network. Chin J Electron. (2021) 30:833–842. doi: 10.1049/cje.2021.06.005

CrossRef Full Text | Google Scholar

18. Burgess DC, Hunt D, Li L, Zannino D, Williamson E, Davis TM, et al. Incidence and predictors of silent myocardial infarction in type 2 diabetes and the effect of fenofibrate: an analysis from the fenofibrate intervention and event lowering in diabetes (FIELD) study. Eur Heart J. (2010) 31:92–9. doi: 10.1093/eurheartj/ehp377

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Choudhury T, West NE, El-Omar M. ST elevation myocardial infarction. Clin Med. (2016) 16:277–82. doi: 10.7861/clinmedicine.16-3-277

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Diercks DB, Kontos MC, Chen AY, Pollack CV Jr, Wiviott SD, Rumsfeld JS, et al. Utilization and impact of pre-hospital electrocardiograms for patients with acute ST-segment elevation myocardial infarction: data from the NCDR (national cardiovascular data registry) ACTION (acute coronary treatment and intervention outcomes network) registry. J Am Coll Cardiol. (2009) 53:161–6. doi: 10.1016/j.jacc.2008.09.030

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Salerno SM, Alguire PC, Waxman HS. Competency in interpretation of 12-lead electrocardiograms: a summary and appraisal of published evidence. Ann Intern Med. (2003) 138:751–60. doi: 10.7326/0003-4819-138-9-200305060-00013

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Anantrasirichai N, Bull D. Artificial intelligence in the creative industries: a review. Artif Intell Rev. (2021) 55:589–656. doi: 10.1007/s10462-021-10039-7

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Singh G, Al'Aref SJ, Van Assen M, Kim TS, van Rosendael A, Kolli KK, et al. Machine learning in cardiac CT: basic concepts and contemporary data. J Cardiovasc Comput Tomogr. (2018) 12:192–201. doi: 10.1016/j.jcct.2018.04.010

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Zacharaki EI, Wang S, Chawla S, Soo Yoo D, Wolf R, Melhem ER, et al. Classification of brain tumor type and grade using MRI texture and shape in a machine learning scheme. Magn Reason Med. (2009) 62:1609–18. doi: 10.1002/mrm.22147

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Unberath M, Zaech JN, Gao C, Bier B, Goldmann F, Lee SC, et al. Enabling machine learning in X-ray-based procedures via realistic simulation of image formation. Int J Comput Assist Radiol Surg. (2019) 14:1517–28. doi: 10.1007/s11548-019-02011-2

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Zamzmi G, Hsu LY, Li W, Sachdev V, Antani S. Harnessing machine intelligence in automatic echocardiogram analysis: current status, limitations, and future directions. IEEE Rev Biomed Eng. (2021) 14:181–203. doi: 10.1109/RBME.2020.2988295

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Oh SL, Ng EYK, Tan RS, Acharya UR. Automated diagnosis of arrhythmia using combination of CNN and LSTM techniques with variable length heart beats. Comput Biol Med. (2018) 102:278–87. doi: 10.1016/j.compbiomed.2018.06.002

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Sampath A, Sumithira TR. ECG morphological marking using discrete wavelet transform. Intell Dec Technol. (2016) 10:373–83. doi: 10.3233/IDT-160264

CrossRef Full Text | Google Scholar

29. Karhunen J. Principal component neural networks — theory and applications. Patt Anal Applic. (1998) 1:74–5. doi: 10.1007/BF01238029

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Acharya UR, Fujita H, Sudarshan VK, Oh SL, Adam M, Tan JH, et al. Automated characterization and classification of coronary artery disease and myocardial infarction by decomposition of ECG signals: a comparative study. Knowl Based Syst. (2017) 132:156–66. doi: 10.1016/j.knosys.2017.06.026

CrossRef Full Text | Google Scholar

31. Chang P, Lin J, Hsieh JC, Weng J. Myocardial infarction classification with multi-lead ECG using hidden Markov models and Gaussian mixture models. Appl Soft Comput J. (2012) 12:3165–75. doi: 10.1016/j.asoc.2012.06.004

CrossRef Full Text | Google Scholar

32. Padhy S, Dandapat S. Third-order tensor based analysis of multilead ECG for classification of myocardial infarction. Biomed Signal Process Control. (2017) 31:71–8. doi: 10.1016/j.bspc.2016.07.007

CrossRef Full Text | Google Scholar

33. Kora P. ECG based myocardial infarction detection using hybrid firefly algorithm. Comput Methods Prog Biomed. (2017) 152:141–8. doi: 10.1016/j.cmpb.2017.09.015

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Acharya UR, Fujita H, Sudarshan VK, Oh SL, Adam M, Koh JEW, et al. Automated detection and localization of myocardial infarction using electrocardiogram: a comparative study of different leads. Knowl Based Syst. (2016) 99:146–56. doi: 10.1016/j.knosys.2016.01.040

CrossRef Full Text | Google Scholar

35. Sharma LD, Sunkaria RK. Inferior myocardial infarction detection using stationary wavelet transform and machine learning approach. Signal Image Video Process. (2018) 12:199–206. doi: 10.1007/s11760-017-1146-z

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Dohare AK, Kumar V, Kumar R. Detection of myocardial infarction in 12 lead ECG using support vector machine. Appl Soft Comput J. (2018) 64:138–47. doi: 10.1016/j.asoc.2017.12.001

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Tripathy RK, Bhattacharyya A, Pachori RB. A novel approach for detection of myocardial infarction from ECG signals of multiple electrodes. IEEE Sens J. (2019) 19:4509–17. doi: 10.1109/JSEN.2019.2896308

CrossRef Full Text | Google Scholar

38. Arenas WJ, Zequera ML, Altuve M, Sotelo SA. Linear and nonlinear features for myocardial infarction detection using support vector machine on 12-lead ECG recordings. IFMBE Proc. (2021) 80:758–66. doi: 10.1007/978-3-030-64610-3_85

CrossRef Full Text | Google Scholar

39. Jafarian K, Vahdat V, Salehi S, Mobin M. Automating detection and localization of myocardial infarction using shallow and end-to-end deep neural networks. Appl Soft Comput J. (2020) 93:106383. doi: 10.1016/j.asoc.2020.106383

CrossRef Full Text | Google Scholar

40. Arif M, Malagore IA, Afsar FA. Detection and localization of myocardial infarction using K-nearest neighbor classifier. J Med Syst. (2012) 36:279–89. doi: 10.1007/s10916-010-9474-3

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Xie L, Li Z, Zhou Y, He Y, Zhu J. Computational diagnostic techniques for electrocardiogram signal analysis. Sensors. (2020) 20:6318. doi: 10.3390/s20216318

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Bharti R, Khamparia A, Shabaz M, Dhiman G, Pande S, Singh P. Prediction of heart disease using a combination of machine learning and deep learning. Comput Intell Neurosci. (2021) 2021:8387680. doi: 10.1155/2021/8387680

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Liu W, Huang Q, Chang S, Wang H, He J. Multiple-feature-branch convolutional neural network for myocardial infarction diagnosis using electrocardiogram. Biomed Signal Process Control. (2018) 45:22–32. doi: 10.1016/j.bspc.2018.05.013

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Jothiramalingam R, Jude A, Hemanth DJ. Review of computational techniques for the analysis of abnormal patterns of ECG signal provoked by cardiac disease. Comput Model Eng Sci. (2021) 128:875–906. doi: 10.32604/cmes.2021.016485

CrossRef Full Text | Google Scholar

45. Deluigi CC, Ong P, Hill S, Wagner A, Kispert E, Klingel K, et al. ECG findings in comparison to cardiovascular MR imaging in viral myocarditis. Int J Cardiol. (2013) 165:100–6. doi: 10.1016/j.ijcard.2011.07.090

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Guerra F, Rrapaj E, Pongetti G, Fabbrizioli A, Pelizzoni V, Giannini I, et al. Differences and similarities of repolarization patterns during hospitalization for takotsubo cardiomyopathy and acute coronary syndrome. Am J Cardiol. (2013) 112:1720–4. doi: 10.1016/j.amjcard.2013.07.036

PubMed Abstract | CrossRef Full Text | Google Scholar

47. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. (2015) 521:436–44. doi: 10.1038/nature14539

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Baig MM, Gholamhosseini H, Connolly MJ. A comprehensive survey of wearable and wireless ECG monitoring systems for older adults. Med Biol Eng Comput. (2013) 51:485–95. doi: 10.1007/s11517-012-1021-6

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Serhani MA, T El Kassabi H, Ismail H, Nujum Navaz A. ECG monitoring systems: review, architecture, processes, and key challenges. Sensors. (2020) 20:1796. doi: 10.3390/s20061796

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Zhang C, Bengio S, Hardt M, Recht B, Vinyals O. Understanding deep learning (still) requires rethinking generalization. Commun ACM. (2021) 64:107–15. doi: 10.1145/3446776

CrossRef Full Text | Google Scholar

51. Ansari S, Farzaneh N, Duda M, Horan K, Andersson HB, Goldberger ZD, et al. A review of automated methods for detection of myocardial ischemia and infarction using electrocardiogram and electronic health records. IEEE Rev Biomed Eng. (2017) 10:264–98. doi: 10.1109/RBME.2017.2757953

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Al Hinai G, Jammoul S, Vajihi Z, Afilalo J. Deep learning analysis of resting electrocardiograms for the detection of myocardial dysfunction, hypertrophy, and ischaemia: a systematic review. Eur Heart J Digit Health. (2021) 2:416–23. doi: 10.1093/ehjdh/ztab048

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Joloudari JH, Mojrian S, Nodehi I, Mashmool A, Zadegan ZK, Shirkharkolaie SK, et al. A survey of applications of artificial intelligence for myocardial infarction disease diagnosis. arXiv preprint. (2021) arXiv:2107.06179.

Google Scholar

54. Grün D, Rudolph F, Gumpfer N, Hannig J, Elsner L K, von Jeinsen B, et al. Identifying heart failure in ECG data with artificial intelligence-a meta-analysis. Front Digit Health. (2021) 2:584555. doi: 10.3389/fdgth.2020.584555

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Attia ZI, Harmon DM, Behr ER, Friedman PA. Application of artificial intelligence to the electrocardiogram. Eur Heart J. (2021) 42:4717–30. doi: 10.1093/eurheartj/ehab649

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Tadesse GA, Javed H, Weldemariam K, Liu Y, Liu J, Chen J, et al. DeepMI: Deep multi-lead ECG fusion for identifying myocardial infarction and its occurrence-time. Artif Intell Med. (2021) 121:102192. doi: 10.1016/j.artmed.2021.102192

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Bousseljot R, Kreiseler D, Schnabel A. Nutzung der EKG- signaldatenbank CARDIODAT der PTBuber das internet. Biocybern Biomed Eng. (1995) 40:317–8. doi: 10.1515/bmte.1995.40.s1.317

CrossRef Full Text | Google Scholar

58. Jager F, Taddei A, Moody GB, Emdin M, Antolic G, Dorn R, et al. Long-term ST database: a reference for the development and evaluation of automated ischaemia detectors and for the study of the dynamics of myocardial ischaemia. Med Biol Eng Comput. (2003) 41:172–82. doi: 10.1007/BF02344885

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Wagner P, Strodthoff N, Bousseljot RD, Kreiseler D, Lunze FI, Samek W, et al. PTB-XL, a large publicly available electrocardiography dataset. Sci Data. (2020) 7:154. doi: 10.1038/s41597-020-0495-6

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Taddei A, Distante G, Emdin M, Pisani P, Moody GB, Zeelenberg C, et al. The European ST-T database: standard for evaluating systems for the analysis of ST-T changes in ambulatory electrocardiography. Eur Heart J. (1992) 13:1164–72. doi: 10.1093/oxfordjournals.eurheartj.a060332

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Kim YG, Shin D, Park MY, Lee S, Jeon MS, Yoon D, et al. ECG-ViEW II, a freely accessible electrocardiogram database. PLoS ONE. (2017) 12:e0176222. doi: 10.1371/journal.pone.0176222

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Martínez JP, Pahlm O, Ringborn M, Warren S, Laguna P, Sörnmo L. The STAFF III database: ECGs recorded during acutely induced myocardial ischemia. Comput Cardiol. (2017) 44:266–133. doi: 10.22489/CinC.2017.266-133

CrossRef Full Text | Google Scholar

63. Tadesse GA, Zhu TT, Liu Y, Zhou Y, Chen J, Tian M, et al. Cardiovascular disease diagnosis using cross-domain transfer learning. In: Annual International Conference IEEE Engineering Medical Biology Society. Berlin (2019). p. 4262–5. doi: 10.1109/EMBC.2019.8857737

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Chen X, Guo W, Zhao L, Huang W, Wang L, Sun A, et al. Acute myocardial infarction detection using deep learning-enabled electrocardiograms. Front Cardiovasc Med. (2021) 8:654515. doi: 10.3389/fcvm.2021.654515

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Cho Y, Kwon JM, Kim KH, Medina-Inojosa JR, Jeon KH, Cho S, et al. Artificial intelligence algorithm for detecting myocardial infarction using six-lead electrocardiography. Sci Rep. (2020) 10:20495. doi: 10.1038/s41598-020-77599-6

PubMed Abstract | CrossRef Full Text | Google Scholar

66. Khan AH, Hussain M, Malik MK. Cardiac disorder classification by electrocardiogram sensing using deep neural network. Complexity. (2021) 2021:1–8. doi: 10.1155/2021/5512243

CrossRef Full Text | Google Scholar

67. Gumpfer N, Grün D, Hannig J, Keller T, Guckert M. Detecting myocardial scar using electrocardiogram data and deep neural networks. Biol Chem. (2021) 402:911–23. doi: 10.1515/hsz-2020-0169

PubMed Abstract | CrossRef Full Text | Google Scholar

68. Chen Y, Chen H, He Z, Yang C, Cao Y. Multi-Channel lightweight convolution neural network for anterior myocardial infarction detection. In: Proceeding 2018 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI). Guangzhou (2018). p. 572–8. doi: 10.1109/SmartWorld.2018.00119

CrossRef Full Text | Google Scholar

69. Reasat T, Shahnaz C. Detection of inferior myocardial infarction using shallow convolutional neural networks. In: 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC). Dhaka (2017). p. 718–21. doi: 10.1109/R10-HTC.2017.8289058

CrossRef Full Text | Google Scholar

70. Tripathy RK, Dandapat S. Detection of myocardial infarction from vectorcardiogram using relevance vector machine. Signal Image Video Process. (2017) 11:1139–46. doi: 10.1007/s11760-017-1068-9

CrossRef Full Text | Google Scholar

71. Yadav SS, More SB, Jadhav SM, Sutar SR. Convolutional neural networks based diagnosis of myocardial infarction in electrocardiograms. In: 2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS). Greater Noida (2021). p. 581–6. doi: 10.1109/ICCCIS51004.2021.9397193

PubMed Abstract | CrossRef Full Text | Google Scholar

72. Luthra A. ECG Made Easy. Delhi: Jaypee Brothers Medical Pub Press (2019).

Google Scholar

73. Zhang X, Li R, Dai H, Liu Y, Zhou B, Wang Z. Localization of myocardial infarction with multi-lead bidirectional gated recurrent unit neural network. IEEE Access. (2019) 7:161152–66. doi: 10.1109/ACCESS.2019.2946932

CrossRef Full Text | Google Scholar

74. Zhang J, Lin F, Xiong P, Du H, Zhang H, Liu M, et al. Automated detection and localization of myocardial infarction with staked sparse autoencoder and treebagger. IEEE Access. (2019) 7:70634–42. doi: 10.1109/ACCESS.2019.2919068

CrossRef Full Text | Google Scholar

75. Blanco-Velasco M, Weng B, Barner KE. ECG signal denoising and baseline wander correction based on the empirical mode decomposition. Comput Biol Med. (2008) 38:1–13. doi: 10.1016/j.compbiomed.2007.06.003

PubMed Abstract | CrossRef Full Text | Google Scholar

76. Acharya UR, Fujita H, Oh SL, Hagiwara Y, Tan JH, Adam M. Application of deep convolutional neural network for automated detection of myocardial infarction using ECG signals. Inf Sci. (2017) 415:190–8. doi: 10.1016/j.ins.2017.06.027

CrossRef Full Text | Google Scholar

77. Liu N, Wang L, Chang Q, Xing Y, Zhou X. A simple and effective method for detecting myocardial infarction based on deep convolutional neural network. J Med Imaging Heal Informatics. (2018) 8:1508–12. doi: 10.1166/jmihi.2018.2463

CrossRef Full Text | Google Scholar

78. Zhang G, Si Y, Wang D, Yang W, Sun Y. Automated detection of myocardial infarction using a gramian angular field and principal component analysis network. IEEE Access. (2019) 7:171570–83. doi: 10.1109/ACCESS.2019.2955555

CrossRef Full Text | Google Scholar

79. Sugimoto K, Kon Y, Lee S, Okada Y. Detection and localization of myocardial infarction based on a convolutional autoencoder. Knowl Based Syst. (2019) 178:123–31. doi: 10.1016/j.knosys.2019.04.023

CrossRef Full Text | Google Scholar

80. Rai HM, Chatterjee K. Hybrid CNN-LSTM deep learning model and ensemble technique for automatic detection of myocardial infarction using big ECG data. Appl Intell. (2021) 52:5366–84. doi: 10.1007/s10489-021-02696-6

CrossRef Full Text | Google Scholar

81. Hammad M, Alkinani MH, Gupta BB, Abd EI-Latif AA. Myocardial infarction detection based on deep neural network on imbalanced data. Multimed Syst. (2021) 1–13. doi: 10.1007/s00530-020-00728-8

CrossRef Full Text | Google Scholar

82. Dai H, Hwang HG, Tseng VS. Convolutional neural network based automatic screening tool for cardiovascular diseases using different intervals of ECG signals. Comput Methods Programs Biomed. (2021) 203:106035. doi: 10.1016/j.cmpb.2021.106035

PubMed Abstract | CrossRef Full Text | Google Scholar

83. Tripathy RK, Bhattacharyya A, Pachori RB. Localization of myocardial infarction from multi-lead ECG signals using multiscale analysis and convolutional neural network. IEEE Sens J. (2019) 19:11437–48. doi: 10.1109/JSEN.2019.2935552

CrossRef Full Text | Google Scholar

84. Cao Y, Wei T, Lin N, Zhang D, Rodrigues JJPC. Multi-Channel lightweight convolutional neural network for remote myocardial infarction monitoring. In: 2020 IEEE Wireless Communications and Networking Conference Workshops. Seoul (2020). p. 1–6. doi: 10.1109/WCNCW48565.2020.9124860

PubMed Abstract | CrossRef Full Text | Google Scholar

85. Alghamdi A, Hammad M, Ugail H, Abdel-Raheem A, Muhammad K, Khalifa HS, et al. Detection of myocardial infarction based on novel deep transfer learning methods for urban healthcare in smart cities. Multimed Tools Appl. (2020) 1–22. doi: 10.1007/s11042-020-08769-x

CrossRef Full Text | Google Scholar

86. Darmawahyuni A, Nurmaini S, Sukemi, Caesarendra W, Bhayyu V, Rachmatullah MN, et al. Deep learning with a recurrent network structure in the sequence modeling of imbalanced data for ECG-rhythm classifier. Algorithms. (2019) 12:118. doi: 10.3390/a12060118

CrossRef Full Text | Google Scholar

87. Miotto R, Wang F, Wang S, Jiang X, Dudley JT. Deep learning for healthcare: review, opportunities and challenges. Brief Bioinform. (2018) 19:1236–46. doi: 10.1093/bib/bbx044

PubMed Abstract | CrossRef Full Text | Google Scholar

88. Xiao R, Xu Y, Pelter MM, Fidler R, Badilini F, Mortara DW, et al. Monitoring significant ST changes through deep learning. J Electrocardiol. (2018) 51:S78–S82. doi: 10.1016/j.jelectrocard.2018.07.026

PubMed Abstract | CrossRef Full Text | Google Scholar

89. Xiao R, Xu Y, Pelter MM, Mortara DW, Hu X. A deep learning approach to examine ischemic ST changes in ambulatory ECG recordings. AMIA Jt Summits Transl Sci Proc. (2018) 2017:256–62.

PubMed Abstract | Google Scholar

90. Baloglu UB, Talo M, Yildirim O, Tan RS, Acharya UR. Classification of myocardial infarction with multi-lead ECG signals and deep CNN. Pattern Recognit Lett. (2019) 122:23–30. doi: 10.1016/j.patrec.2019.02.016

PubMed Abstract | CrossRef Full Text | Google Scholar

91. Lodhi AM, Qureshi AN, Sharif U, Ashiq Z. A novel approach using voting from ECG leads to detect myocardial infarction. IntelliSys 2018 Adv Intell Syst Comput. (2019) 869:337–52. doi: 10.1007/978-3-030-01057-7_27

CrossRef Full Text | Google Scholar

92. Hao P, Gao X, Li Z, Zhang J, Wu F, Bai C. Multi-branch fusion network for myocardial infarction screening from 12-lead ECG images. Comput Methods Programs Biomed. (2020) 184:105286. doi: 10.1016/j.cmpb.2019.105286

PubMed Abstract | CrossRef Full Text | Google Scholar

93. Makimoto H, Höckmann M, Lin T, Glöckner D, Gerguri S, Clasen L, et al. Performance of a convolutional neural network derived from an ECG database in recognizing myocardial infarction. Sci Rep. (2020) 10:8445. doi: 10.1038/s41598-020-65105-x

PubMed Abstract | CrossRef Full Text | Google Scholar

94. Zhang G, Si Y, Yang W, Wang D. A robust multilevel dwt densely network for cardiovascular disease classification. Sensors. (2020) 20:4777. doi: 10.3390/s20174777

PubMed Abstract | CrossRef Full Text | Google Scholar

95. Rashid N, Al Faruque MA. Energy-efficient real-time myocardial infarction detection on wearable devices. Proc Annu Int Conf IEEE Eng Med Biol Soc. (2020) 2020:4648–51. doi: 10.1109/EMBC44109.2020.9175232

PubMed Abstract | CrossRef Full Text | Google Scholar

96. Nasimov R, Muminov B, Mirzahalilov S, Nasimova N. A new approach to classifying myocardial infarction and cardiomyopathy using deep learning. In: 2020 International Conference Inference Science Communication Technology. Tashkent (2020). p. 1–5. doi: 10.1109/ICISCT50599.2020.9351386

PubMed Abstract | CrossRef Full Text | Google Scholar

97. Tadesse GA, Javed H, Weldemariam K, Zhu T. A spectral-longitudinal model for detection of heart attack from 12-lead electrocardiogram waveforms. Annu Int Conf IEEE Eng Med Biol Soc. (2020) 2020:6009–12. doi: 10.1109/EMBC44109.2020.9176253

PubMed Abstract | CrossRef Full Text | Google Scholar

98. Jahmunah V, Ng EYK, San TR, Acharya UR. Automated detection of coronary artery disease, myocardial infarction and congestive heart failure using GaborCNN model with ECG signals. Comput Biol Med. (2021) 134:104457. doi: 10.1016/j.compbiomed.2021.104457

PubMed Abstract | CrossRef Full Text | Google Scholar

99. Jian JZ, Ger TR, Lai HH, Ku CM, Chen CA, Abu P, et al. Detection of myocardial infarction using ECG and multi-scale feature concatenate. Sensors. (2021) 21:1906. doi: 10.3390/s21051906

PubMed Abstract | CrossRef Full Text | Google Scholar

100. Xiong P, Xue Y, Zhang J, Liu M, Du H, Zhang H, et al. Localization of myocardial infarction with multi-lead ECG based on DenseNet. Comput Methods Prog Biomed. (2021) 203:106024. doi: 10.1016/j.cmpb.2021.106024

PubMed Abstract | CrossRef Full Text | Google Scholar

101. Wasimuddin M, Elleithy K, Abuzneid A, Faezipour M, Abuzaghleh O. Multiclass ECG signal analysis using global average-based 2-D convolutional neural network modeling. Electron. (2021) 10:170. doi: 10.3390/electronics10020170

CrossRef Full Text | Google Scholar

102. Karhade J, Ghosh SK, Gajbhiye P, Tripathy RK, Acharya UR. Multichannel multiscale two-stage convolutional neural network for the detection and localization of myocardial infarction using vectorcardiogram signal. Appl Sci. (2021) 11:7920. doi: 10.3390/app11177920

CrossRef Full Text | Google Scholar

103. Liu W, Zhang M, Zhang Y, Liao Y, Huang Q, Chang S, et al. Real-Time multilead convolutional neural network for myocardial infarction detection. IEEE J Biomed Health Inform. (2018) 22:1434–44. doi: 10.1109/JBHI.2017.2771768

PubMed Abstract | CrossRef Full Text | Google Scholar

104. Winau L, Nagel E, Herrmann E, Puntmann VO. Towards the clinical management of cardiac involvement in systemic inflammatory conditions–a central role for CMR. Curr Cardiovasc Imaging Rep. (2018) 11:11. doi: 10.1007/s12410-018-9451-7

CrossRef Full Text | Google Scholar

105. Moccia S, Banali R, Martini C, Moscogiuri G, Pontone G, Pepi M, et al. Automated scar segmentation from cmr-lge images using a deep learning approach. In: Proceedings of the 2018 Computing in Cardiology Conference (CinC). Maastricht (2018). p. 1–4.

Google Scholar

106. Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. (1997) 9:1735–80. doi: 10.1162/neco.1997.9.8.1735

PubMed Abstract | CrossRef Full Text | Google Scholar

107. Zhang Y, Li J. Application of heartbeat-attention mechanism for detection of myocardial infarction using 12-lead ECG records. Appl Sci. (2019) 9:3328. doi: 10.3390/app9163328

CrossRef Full Text | Google Scholar

108. Zhang X, Li R, Hu Q, Zhou B, Wang Z. A new automatic approach to distinguish myocardial infarction based on LSTM. In: 2019 8th International Symposium on Next Generation Electronics. (2019). p. 1–3. doi: 10.1109/ISNE.2019.8896550

CrossRef Full Text | Google Scholar

109. Darmawahyuni A, Nurmaini S. Deep learning with long short-term memory for enhancement myocardial infarction classification. In: Proceeding 2019 6th International Conference Instrumentation, Control Automation (ICA). Bandung (2019). p. 19–23. doi: 10.1109/ICA.2019.8916683

CrossRef Full Text | Google Scholar

110. Martin H, Izquierdo W, Cabrerizo M, Cabrerab A, Adjouadi M. Near real-time single-beat myocardial infarction detection from single-lead electrocardiogram using long short-term memory neural network. Biomed Signal Process Control. (2021) 68:102683. doi: 10.1016/j.bspc.2021.102683

CrossRef Full Text | Google Scholar

111. Borges C, Gamboa J. Deep learning for time-series analysis. arXiv Preprint. (2017) arXiv:1701.01887.

Google Scholar

112. Cabitza F, Rasoini R, Gensini GF. Unintended consequences of machine learning in medicine. JAMA. (2017) 318:517–8. doi: 10.1001/jama.2017.7797

PubMed Abstract | CrossRef Full Text | Google Scholar

113. Bayoumy K, Gaber M, Elshafeey A, Mhaimeed O, Dineen EH, Marvel FA, et al. Smart wearable devices in cardiovascular care: where we are and how to move forward. Nat Rev Cardiol. (2021) 18:581–99. doi: 10.1038/s41569-021-00522-7

PubMed Abstract | CrossRef Full Text | Google Scholar

114. Marvel FA, Spaulding EM, Lee M, Yang W, Martin SS. Corrie health digital platform for self-management in secondary prevention after acute myocardial infarction. Circ Cardiovasc Qual Outcomes. (2019) 12:e005509. doi: 10.1161/CIRCOUTCOMES.119.005509

PubMed Abstract | CrossRef Full Text | Google Scholar

115. Linthicum DS. Cloud computing changes data integration forever: what's needed right now. IEEE Cloud Comput. (2017) 4:50–3. doi: 10.1109/MCC.2017.47

CrossRef Full Text | Google Scholar

116. Liu W, Wang F, Huang Q, Chang S, Wang H, He J. MFB-CBRNN: a hybrid network for MI detection using 12-lead ECGs. IEEE J Biomed Health Inform. (2020) 24:503–14. doi: 10.1109/JBHI.2019.2910082

PubMed Abstract | CrossRef Full Text | Google Scholar

117. Dey M, Omar N, Ullah MA. Temporal feature-based classification into myocardial infarction and other CVDs merging CNN and Bi-LSTM from ECG signal. IEEE Sens J. (2021) 21:21688–95. doi: 10.1109/JSEN.2021.3079241

CrossRef Full Text | Google Scholar

118. Billman GE. Heart rate variability - a historical perspective. Front Physiol. (2011) 2:86. doi: 10.3389/fphys.2011.00086

PubMed Abstract | CrossRef Full Text | Google Scholar

119. Lui HW, Chow KL. Multiclass classification of myocardial infarction with convolutional and recurrent neural networks for portable ECG devices. Informatics Med Unlocked. (2018) 13:26–33. doi: 10.1016/j.imu.2018.08.002

CrossRef Full Text | Google Scholar

120. Feng K, Pi X, Liu H, Sun K. Myocardial infarction classification based on convolutional neural network and recurrent neural network. Appl Sci. (2019) 9:1879. doi: 10.3390/app9091879

PubMed Abstract | CrossRef Full Text | Google Scholar

121. Lih OS, Jahmunah V, San TR, Ciaccio EJ, Yamakawa T, Tanabe M, et al. Comprehensive electrocardiographic diagnosis based on deep learning. Artif Intell Med. (2020) 103:101789. doi: 10.1016/j.artmed.2019.101789

PubMed Abstract | CrossRef Full Text | Google Scholar

122. Goto S, Kimura M, Katsumata Y, Goto S, Kamatani T, Ichihara G, et al. Artificial intelligence to predict needs for urgent revascularization from 12-leads electrocardiography in emergency patients. PLoS ONE. (2019) 14:e0210103. doi: 10.1371/journal.pone.0210103

PubMed Abstract | CrossRef Full Text | Google Scholar

123. Manimekalai K, Kavitha DA. Deep learning methods in classification of myocardial infarction by employing ECG signals. Indian J Sci Technol. (2020) 13:2823–32. doi: 10.17485/IJST/v13i28.445

CrossRef Full Text | Google Scholar

124. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. (2016). p. 770–8. doi: 10.1109/CVPR.2016.90

PubMed Abstract | CrossRef Full Text | Google Scholar

125. Gopika P, Sowmya V, Gopalakrishnan EA, Soman KP. Performance improvement of residual skip convolutional neural network for myocardial disease classification. In: ICICCT 2019—System Reliability, Quality Control, Safety, Maintenance and Management. Singapore (2019). doi: 10.1007/978-981-13-8461-5_25

CrossRef Full Text | Google Scholar

126. Strodthoff N, Strodthoff C. Detecting and interpreting myocardial infarction using fully convolutional neural networks. Physiol Meas. (2019) 40:015001. doi: 10.1088/1361-6579/aaf34d

PubMed Abstract | CrossRef Full Text | Google Scholar

127. Han C, Shi L. ML-ResNet: a novel network to detect and locate myocardial infarction using 12 leads ECG. Comput Methods Prog Biomed. (2020) 185:105138. doi: 10.1016/j.cmpb.2019.105138

PubMed Abstract | CrossRef Full Text | Google Scholar

128. Wang H, Zhao W, Jia D, Hu J, Li Z, Yan C, et al. Myocardial infarction detection based on multi-lead ensemble neural network. Proceeding Annual International Conference IEEE Engineering Medical Biological Society. Berlin (2019). p. 2614–7. doi: 10.1109/EMBC.2019.8856392

PubMed Abstract | CrossRef Full Text | Google Scholar

129. Xie S, Girshick R, Dollár P, Tu Z, He K. Aggregated residual transformations for deep neural networks. Proceeding IEEE Computer Society Conference Computer Vision Pattern Recognition. (2017). p. 5987–95. doi: 10.1109/CVPR.2017.634

PubMed Abstract | CrossRef Full Text | Google Scholar

130. Prabhakararao E, Dandapat S. Myocardial infarction severity stages classification from ECG signals using attentional recurrent neural network. IEEE Sens J. (2020) 20:8711–20. doi: 10.1109/JSEN.2020.2984493

CrossRef Full Text | Google Scholar

131. Prabhakararao E, Dandapat S. Attentive RNN-based network to fuse 12-lead ECG and clinical features for improved myocardial infarction diagnosis. IEEE Signal Process Lett. (2020) 27:2029–33. doi: 10.1109/LSP.2020.3036314

CrossRef Full Text | Google Scholar

132. Abrams C. Google's effort to prevent blindness shows AI challenges. Wall Street J. (2019) 1:26.

Google Scholar

133. West SM, Whittaker M, Crawford K. Discriminating Systems: Gender, Race, Power in AI. AI Now Institute, New York University (2019). Available online at: https://ainowinstitute.org/discriminatingsystems.pdf

Google Scholar

134. Noseworthy PA, Attia ZI, Brewer LC, Hayes SN, Yao X, Kapa S, et al. Assessing and mitigating bias in medical artificial intelligence: the effects of race and ethnicity on a deep learning model for ECG analysis. Circ Arrhythm Electrophysiol. (2020) 13:e007988. doi: 10.1161/CIRCEP.119.007988

PubMed Abstract | CrossRef Full Text | Google Scholar

135. Han X, Hu Y, Foschini L, Chinitz L, Jankelson L, Ranganath R. Deep learning models for electrocardiograms are susceptible to adversarial attack. Nat Med. (2020) 26:360–3. doi: 10.1038/s41591-020-0791-x

PubMed Abstract | CrossRef Full Text | Google Scholar

136. Dourado C, da Silva S, da Nobrega R, Rebouças F, Muhammad K, Albuquerque V. An open IoHT-based deep learning framework for online medical image recognition. IEEE J Select Areas Commun. (2021) 39:541–8. doi: 10.1109/JSAC.2020.3020598

CrossRef Full Text | Google Scholar

137. Guo Z, Luo Y, Cai Z, Zheng T. Overview of privacy protection technology of big data in healthcare. J Front Comput Sci Technol. (2021) 15:389–402.

Google Scholar

138. Khalid U, Asim M, Baker T, Hung PC, Tariq MA, Rafferty L. A decentralized lightweight blockchain-based authentication mechanism for IoT systems. Cluster Comput. (2020) 23:2067–87. doi: 10.1007/s10586-020-03058-6

CrossRef Full Text | Google Scholar

139. Parah SA, Kaw JA, Bellavista P, Loan NA, Bhat GM, Muhammad K, et al. Efficient security and authentication for edge-based internet of medical things. IEEE Internet Things J. (2021) 8:15652–62. doi: 10.1109/JIOT.2020.3038009

CrossRef Full Text | Google Scholar

140. Hedlund J, Eklund A, Lundström C. Key insights in the AIDA community policy on sharing of clinical imaging data for research in Sweden. Sci Data. (2020) 7:331. doi: 10.1038/s41597-020-00674-0

PubMed Abstract | CrossRef Full Text | Google Scholar

141. Liu X, Tsaftaris SA. Have you forgotten? A method to assess if machine learning models have forgotten data. In: Proceedings of the 23rd International Conference on Medical Image Computing and Computer Assisted Intervention. (2020). p. 95–105. doi: 10.1007/978-3-030-59710-8_10

CrossRef Full Text | Google Scholar

142. Hagele M, Seegerer P, Lapuschkin S, Bockmayr M, Samek W, Klauschen F, et al. Resolving challenges in deep learning-based analyses of histopathological images using explanation methods. Sci Rep. (2020) 10:6423. doi: 10.1038/s41598-020-62724-2

PubMed Abstract | CrossRef Full Text | Google Scholar

143. Zhong Z, Fini E, Roy S, Luo Z, Ricci E, Sebe N. Neighborhood contrastive learning for novel class discovery. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). (2021). p. 10862–70. doi: 10.1109/CVPR46437.2021.01072

CrossRef Full Text | Google Scholar

144. Andreas J. Good-enough compositional data augmentation. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. (2020). p. 7556–66. doi: 10.18653/v1/2020.acl-main.676

CrossRef Full Text | Google Scholar

145. Wang Y, Chen H, Yu M, Huang, Yang Q. Easy transfer learning by exploiting intra-domain structures. In: 2019 IEEE International Conference on Multimedia and Expo (ICME). (2019). p. 1210–5. doi: 10.1109/ICME.2019.00211

CrossRef Full Text | Google Scholar

146. Ganin Y, Lempitsky V. Unsupervised domain adaptation by backpropagation. arXiv Preprint. (2014) arXiv:1409.7495

Google Scholar

147. Volpi R, Morerio P, Savarese S, Murino V. Adversarial feature augmentation for unsupervised domain adaptation. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, UT (2018). p. 5495–504. doi: 10.1109/CVPR.2018.00576

PubMed Abstract | CrossRef Full Text | Google Scholar

148. Samek W, Montavon G, Lapuschkin S, Anders CJ, Müller KR. Explaining deep neural networks and beyond: a review of methods and applications. Proc IEEE. (2021) 109:247–78. doi: 10.1109/JPROC.2021.3060483

CrossRef Full Text | Google Scholar

149. Nóbrega C, Marinho L. Towards explaining recommendations through local surrogate models. In: SAC '19: Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing. (2019). p. 1671–8. doi: 10.1145/3297280.3297443

CrossRef Full Text | Google Scholar

150. Strumbelj E, Kononenko I. An efficient explanation of individual classifications using game theory. J Mach Learn Res. (2010) 11:1–18.

Google Scholar

Keywords: deep learning, neural networks, electrocardiogram (ECG), myocardial infarction detection, myocardial infarction localization

Citation: Xiong P, Lee SM-Y and Chan G (2022) Deep Learning for Detecting and Locating Myocardial Infarction by Electrocardiogram: A Literature Review. Front. Cardiovasc. Med. 9:860032. doi: 10.3389/fcvm.2022.860032

Received: 22 January 2022; Accepted: 18 February 2022;
Published: 25 March 2022.

Edited by:

Victor Hugo C. de Albuquerque, University of Fortaleza, Brazil

Reviewed by:

Jennifer Hannig, Technische Hochschule Mittelhessen, Germany
Auzuir Ripardo De Alexandria, Instituto Federal de Educação, Ciência e Tecnologia do Ceará (IFCE), Brazil
Tomofumi Nakamura, Tokyo Medical and Dental University, Japan
Tetsuo Sasano, Tokyo Medical and Dental University, Japan

Copyright © 2022 Xiong, Lee and Chan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ging Chan, gchan@um.edu.mo

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.