Skip to main content

SYSTEMATIC REVIEW article

Front. Cardiovasc. Med., 07 June 2023
Sec. Cardiovascular Genetics and Systems Medicine

The diagnostic value of electrocardiogram-based machine learning in long QT syndrome: a systematic review and meta-analysis

\r\nMin-Juan Wu,Min-Juan Wu1,2Wen-Qin WangWen-Qin Wang3Wei ZhangWei Zhang2Jun-Hua LiJun-Hua Li3Xing-Wei Zhang
\r\nXing-Wei Zhang4*
  • 1School of Nursing, Hangzhou Medical College, Hangzhou, China
  • 2School of Public Health, Hangzhou Normal University, Hangzhou, China
  • 3School of Nursing, Hangzhou Normal University, Hangzhou, China
  • 4School of Clinical Medicine, Hangzhou Normal University, Hangzhou, China

Introduction: To perform a meta-analysis to discover the performance of ML algorithms in identifying Congenital long QT syndrome (LQTS).

Methods: The searched databases included Cochrane, EMBASE, Web of Science, and PubMed. Our study considered all English-language studies that reported the detection of LQTS using ML algorithms. Quality was assessed using QUADAS-2 and QUADAS-AI tools. The bivariate mixed effects models were used in our study. Based on genotype data for LQTS, we performed a subgroup analysis.

Results: Out of 536 studies, 8 met all inclusion criteria. The pooled area under the receiving operating curve (SAUROC) for detecting LQTS was 0.95 (95% CI: 0.31–1.00); sensitivity was 0.87 (95% CI: 0.83–0.90), and specificity was 0.91 (95% CI: 0.88–0.93). Additionally, diagnostic odd ratio (DOR) was 65 (95% CI: 39–109). The positive likelihood ratio (PLR) was 9.3 (95% CI: 7.0–12.3) and the negative likelihood ratio (NLR) was 0.14 (95% CI: 0.11–0.20), with very low heterogeneity (I2 = 16%).

Discussion: We found that machine learning can be used to detect features of rare cardiovascular disease like LQTS, thus increasing our understanding of intelligent interpretation of ECG. To improve ML performance in the classification of LQTS subtypes, further research is required.

Systematic Review Registration: identifier PROSPERO CRD42022360122.

Introduction

Congenital long QT syndrome (LQTS) is characterized by a prolonged QT interval and abnormal T wave morphology on the electrocardiogram (ECG) (1, 2). Clinically, patients are often asymptomatic, but can suffer from syncope, seizures, or sudden cardiac death (3, 4).

A prevalence of 1:2,000–3,000 is estimated to exist for LQTS, which represents a very rare condition (2, 5, 6). A main method for diagnosing LQTS is to measure the heart rate-corrected QT interval (QTc) (2, 4). Specific ECG features, such as T-wave morphologies, QT interval changes on treadmill QT-stress or epinephrine tests, that may help to diagnose LQTS, but may not be reliable enough to provide accurate diagnostic value (711). Even though LQTS is characterized by QT prolongation, many patients appear normal on the ECG, despite they were clinically and/or genetically diagnosed (12, 13). Consequently, diagnosing LQTS is particularly challenging for general cardiologists, resulting in this disease being significantly underdiagnosed (4, 14).

Many studies have demonstrated that physicians have difficulty reading ECGs in clinical situations (15, 16). A number of researchers have used machine learning (ML) to investigate whether artificial intelligence (AI) can enhance ECG interpretation and clinical decision-making. ML can potentially take into account subtleties in ECG waves that humans do not consider in routine interpretations of ECG signals. It has been shown that ML improves ECG interpretation and clinical decision-making for several common cardiovascular diseases like atrial fibrillation or heart failure (17). Several high-quality studies have been conducted on ML algorithms for LQTS, but their diagnostic value remains unclear (1821). Detecting and identifying LQTS properly is vital to preventing syncope, seizures, and sudden cardiac death in LQTS people. Consequently, a meta-analysis was conducted in order to discover whether ML algorithms could be used to identify LQTS.

Materials and methods

This study was performed according to the PRISMA statement (22) and was registered in PROSPERO (CRD42022360122).

Search strategy

The Cochrane, EMBASE, Web of Science, and PubMed databases were searched from inception through 16 August 2022. Supplementary Table S1 provided a detailed description of the search strategy. The references of all the studies included in this study were also manually searched.

Criteria for inclusion and exclusion

Abstracts and titles of the retrieved studies were reviewed by two authors. Study eligibility criteria were as follows: (1) clearly described ML models and ECG regions used in the LQTS detection, (2) presented the results of the ML algorithms and LQTS detection, (3) be written in English, (4) patients with complete ECG records, (5) provided sensitivity and specificity information, (6) provided the number of LQTS patients, (7) explicitly described data sources and datasets used. These were the criteria for exclusion: (1) only reported risk factors and incidence of LQTS, (2) studies with incomplete data, (3) studies not in English. There was no restriction on the region or year of publication.

Definition of LQTS

According to HRS/EHRA/APHRS Expert Consensus Statement Recommendations (23), a diagnosis of LQTS is made in the presence of QTc ≥ 500 milliseconds (ms) in repeated 12-lead ECGs, or in the presence of a QTc between 480 and 499 ms in a patient with unexplained syncope, or in the presence of the Schwartz score ≥ 3.5 points. A secondary cause for QT prolongation should be ruled out in all of these cases. Additionally, LQTS can also be diagnosed if one of the LQTS genes carries an unequivocally pathogenic mutation. It is estimated that 90 percent of genotype-positive patients and 75 percent of all clinically diagnosed LQTS cases are caused by three main genes. LQT1 and 2 are derived from functional loss-of-function variants in the potassium channel genes KCNQ1 and KCNH2, respectively. LQT3 is based on gain-of-function variants in SCN5A (2). These genes respectively encode for the slow and rapid delayed rectifier current and the fast inward cardiac sodium current, which will lead to prolonged QT interval. Three subtypes of ST-T segments exhibit distinct morphologies, and these features can be used to accurately predict genotype (24, 25).

Literature screening

Traditional methods, editorials, and short reports for detecting LQTS were excluded. The final analysis was conducted by the same two authors (M-JW and W-QW) who reviewed all included studies. The chief investigator (X-WZ) resolved all disagreements between two authors about the selection of potential studies. Algorithms and ECG features from the studies that provided the most detailed information were retained. In order to compile and detect duplicates, EndNote X9.1 (Thomson Reuters) was used.

Data extraction and quality assessment

We used a data extraction sheet to extract variables in our study. Literature searches and study selection were conducted prior to defining the data sheets. Publication year, author, target disease, task, data size, ECG features, algorithm type, and model performance metrics were extracted from each study.

Quality was assessed using the QUADAS-2 tool (26). To address the specifics of AI-related studies, QUADAS-AI tool was also used in our study (27). Each relevant article was independently assessed by two researchers (WZ and X-WZ) for bias risk and applicability concerns according to QUADAS-2 guidelines. Four domains (patient selection, index test, reference standard, flow and timing) of articles were assessed for bias using a range of questions. For each of the four domains, bias risk was estimated using a three-tailed scale: high, low, or unclear. A three-tailed scale was also used to assess applicability, with the exception of “flow and timing”. Consensus was used during scoring to resolve disagreements. All decisions were reached by consensus.

Primary and subgroup analysis

We considered all studies in the full analysis dataset that tested ML models for detecting LQTS patients. A two-part analysis of the datasets was conducted: (a) training dataset; (b) test/validation dataset. LQTS types 1, 2, and 3 were also analyzed as subgroups (28).

Statistical analysis

In this study, we analyzed the metrics (C-index and accuracy) for evaluating ML models. When the 95%CI and standard error (SE) of the C-index were missing, the SE was estimated according to Debray TP et al.'s research (29). ROC, sensitivity, specificity, PLR, NLR, DOR, and their 95%CI were computed (30, 31). Moderate accuracy is considered to be DOR > 25, and high accuracy is considered to be DOR > 100 (32, 33). For each test, sensitivity and specificity estimates were plotted on forest plots and in the ROC space as part of preliminary exploratory analyses. An AUC was determined, along with a 95% CI and a P-value. Heterogeneity was assessed using a forest plot, and significance was determined using Chi-squared and I2 statistics. Random effect models were used to estimate the effect size for all included studies, which helps reduce heterogeneity between them. Consequently, Bivariate mixed effects models were used to calculate pooled estimates. Subgroup analyses were conducted, as well as sensitivity analyses if necessary. In order to assess publication bias, logarithms of DOR were plotted against square roots of effective sample sizes; P < 0.05 for the slope coefficient indicates significant publication bias. Statistical analyses were conducted using Stata 15.0 (Stata Corp LP, College Station, TX) and the midas command.

Results

Study selection

In the initial search, 536 unique records were identified. In our study, 525 articles were excluded due to inclusion and exclusion criteria. The full-text review was performed on 11 studies, and 8 of these met all criteria for inclusion (Figure 1) (19, 21, 3439).

FIGURE 1
www.frontiersin.org

Figure 1. Flowchart of literature search.

Study characteristics

Publication dates ranged from 2011 to 2022. There are eight studies to be reviewed across four regions: (1) Europe (n = 4), (2) North America (n = 2), (3) Oceania (n = 1), and (4) Asia (n = 1). The eight ML models included convolutional neural networks (CNNs) (n = 5), support vector machines (SVMs) (n = 2), and artificial neural networks (ANNs) (n = 1). In the majority of studies (n = 7) the ECG signals were used as inputs to the ML algorithm (19, 21, 3437, 39), while in the remaining study (n = 1) the ECG signals were combined with demographic characteristics (age and gender) (38). Two studies analyzed data from multiple centers, while six studies examined data from a single center. Eight studies developed ML models in order to detect LQTS, and three of them differentiated genotypes further (19, 21, 34). The algorithm was evaluated using k-fold cross-validation in most studies (n = 6). The full analysis dataset included 16,584 participants. Cohort size ranged from 81 to 12,200. Study participants ranged in age from 21.60 ± 15.00 to 60.00 ± 18.00 years; 29.0% to 58.0% were males. LQTS patients were all over 16 years old. Finally, we included 2,692 LQTS patients and 14,417 controls: 42% were LQT1, 46% were LQT2, and 12% were LQT3. A total of 7,493,577 ECGs obtained from 12-lead ECGs were included in this study. The length of resting QTc ranged from 400 to 467 ms. There were 2,128, 362, and 202 LQTS patients in the training, test, and validation part, respectively (Table 1).

TABLE 1
www.frontiersin.org

Table 1. Characteristics of included studies.

Risk of bias assessment

Detailed quality assessment results are presented in Supplementary Table S2. Providing that the source, size, and input data quality were accurately characterized, along with clear criteria for patient eligibility, a low risk of bias was observed in the “patient selection” domain. The index test was judged primarily on the validity of models on external datasets. Diagnostic performance may be overestimated if the model is tested exclusively on internal data, the “index test” domain lacked external validation, resulting in a high risk of bias. Additionally, the absence of external validation adversely affected the level of concern regarding applicability, as the results of diagnostic effectiveness cannot be extrapolated to the general population based on an in-house data set alone. The same reference standard was used in all studies. The review question matched the target conditions defined by a reference standard. All included studies described genetic testing as the reference standard. The timing between the index test and the reference standard was often inaccurate. Most studies, however, provided positive information concerning these indicators.

Machine learning model and LQTS detection

In eight studies, the performance of ML approaches was examined for detecting LQTS patients. ML was found to be an effective method for diagnosing LQTS in this meta-analysis. According to the full analysis dataset, the ML models performed well in identifying LQTS, and the overall pooled AUROC for ML to identify LQTS was 0.95 (95% CI: 0.31–1.00) (Figure 2).

FIGURE 2
www.frontiersin.org

Figure 2. SAUROC of ML algorithms for detecting LQTS.

Additionally, sensitivity, specificity, and DOR were 0.87 (95% CI: 0.83–0.90), 0.91 (95% CI: 0.88–0.93), and 65 (95% CI: 39–109), respectively (Figures 3, 4). Fagan nomogram showed that a positive test increases significantly the pretest probability of LQTS, from 20% to 70%, while a negative test decreases significantly the pretest probability, from 20% to 3% (Supplementary Figure S3). PLR was 9.3 (95% CI: 7.0–12.3) and NLR was 0.14 (95% CI: 0.11–0.20), with very low heterogeneity (I2 = 16%) (Supplementary Figures S4, S5). Asymmetry was not found in Deeks' funnel plots for publication bias. The publication bias coefficient was 17.60, which was not significant (P = 0.23). All studies did not demonstrate publication bias.

FIGURE 3
www.frontiersin.org

Figure 3. Sensitivity and specificity of ML algorithms for detecting LQTS.

FIGURE 4
www.frontiersin.org

Figure 4. DOR of ML algorithms for detecting LQTS.

Training dataset

The overall pooled SAUROC for ML to detect LQTS was 0.96 (95% CI: 0.32–1.00) (Supplementary Figure S6). Sensitivity, specificity, and DOR were 0.90 (95% CI: 0.86–0.93), 0.93 (95% CI: 0.89–0.96), and 125 (95% CI: 55–285), respectively (Supplementary Figures S7, S8). Fagan nomogram showed that a positive test increases significantly the pretest probability of LQTS, from 20% to 77%, while a negative test decreases significantly the pretest probability, from 20% to 3% (Supplementary Figure S9). The overall pooled PLR was 13.4 (95% CI: 8.0–22.4) and the NLR was 0.11 (95% CI: 0.07–0.16), with medium heterogeneity (I2 = 62%) (Supplementary Figures S10, S11). There was no asymmetry in Deeks' funnel plot for publication bias as well. However, the publication bias evaluation indicated that the bias coefficient was 59.68 and significant (P = 0.01).

Test and validation dataset

The ability of ML to detect LQTS in the test/validation dataset was evaluated in three studies. The overall pooled SAUROC for diagnosing LQTS was 0.92 (95% CI: 0.29–1.00) (Supplementary Figure S12). The sensitivity, specificity, and DOR were 0.82 (95% CI: 0.75–0.87), 0.89 (95% CI: 0.85–0.91), and 34 (95% CI: 24–49), respectively (Supplementary Figures S13, S14). Fagan nomogram showed that a positive test increases significantly the pretest probability of LQTS, from 20% to 64%, while a negative test decreases significantly the pretest probability, from 20% to 5% (Supplementary Figure S15). The PLR was 7.1 (95% CI: 5.6–9.0), and the NLR was 0.21 (95% CI: 0.15–0.28), with low heterogeneity (I2 = 37%) (Supplementary Figures S16, S17). Publication bias was evaluated, and a bias coefficient of −11.04 was found to be non-significant (P = 0.26). Test/validation datasets did not demonstrate publication bias.

Subgroup analysis

In order to evaluate ML detection performance for LQTS subtypes, subgroup analysis was conducted (Table 2). LQT3 was not studied extensively, so we only examined ML's performance in identifying LQT1 and LQT2.

TABLE 2
www.frontiersin.org

Table 2. Performance of detecting LQTS subtypes.

LQT1

Three studies assessed ML's performance for identifying LQT1. The overall pooled AUROC for identifying LQT1 was 0.89 (95% CI: 0.16–1.00). Additionally, the sensitivity, specificity, DOR were 0.86 (95%CI: 0.83–0.88), 0.91 (95%CI: 0.86–0.94), and 60 (95% CI: 37–99). The PLR was 9.4 (95% CI: 6.0–14.8), and the NLR was 0.16 (95% CI: 0.13–0.19).

LQT2

Two studies assessed ML's performance for identifying LQT2. The overall pooled AUROC for identifying LQT2 was 0.92 (95% CI: 0.18–1.00). Additionally, the sensitivity, specificity, DOR were 0.88 (95%CI: 0.85–0.91), 0.91 (95%CI: 0.88–0.94), and 79 (95% CI: 45–138). The PLR was 10.2 (95% CI: 7.1–14.8), and the NLR was 0.13 (95% CI: 0.10–0.17).

Discussion

ML algorithms for the detection of LQTS were investigated through this meta-analysis. The overall pooled estimation indicated that ML performed well in recognizing LQTS; thus, we believe that pooling the diagnostic accuracy from eligible studies would be beneficial to cardiologists and researchers.

Performance and clinical relevance of models

Providing appropriate treatment for LQTS requires accurate identification (6). As of today, LQTS has been diagnosed not by genetic testing, but by the Schwartz Score, along with ECGs and clinical parameters that can be used in conjunction with genetic testing (2). Genetic testing is recommended if clinical presentation, family history, and ECG characteristics indicate LQTS, but its high cost and time delay limit its application. Exercise and QT-stand testing may lead to overdiagnosis of LQTS in children and adolescents with low likelihoods of the illness (40). For the diagnosis of LQTS, the 12-lead ECGs is still the most essential diagnostic tool. Studies have shown that the present ECG-based ML models play a significant role in diagnosis and genotyping of LQTS with excellent accuracy. This could mean that ML models could be implemented in clinical care, for example, it could potentially serve as a clinical decision tool to help general cardiologists predict which patients might need further workup.

Pooled performance and heterogeneity

LQTS patients were identified by genetic testing in all the included studies. To estimate the overall performance of ML, AUROC values were pooled across all models presented. LQTS genotypes were identified and distinguished using ML algorithms with high sensitivity and specificity; the PLR, NLR, and DOR values indicate effective test performance. All accuracy values were above 0.89, which suggested that ECG-based ML models were effective in predicting LQTS. However, the CI was very broad and the AUC estimate was skewed to the upper end of the 95% CI, possibly due to a lack of available studies. One of the most common reasons for a broadening CI is an insufficient sample size. Our study only included a total of 2,692 LQTS patients, which may make the CI too broad. Another explanation may be the wide variation in the standard errors of the included models (ranging from 0.008 to 0.034). Besides, AUC values of training datasets are generally higher than those of test/validation datasets. Only two included studies were tested on the test/validation dataset, so the overall pooled AUROC of our study mainly comes from the training dataset, which may explain the higher AUC estimate.

This meta-analysis was conducted using a subgroup analysis to resolve heterogeneity between studies. However, model performance differed significantly between studies. It was found that the models tested on the full analysis dataset (AUROC of 0.95) performed less well than those tested on the training dataset (AUROC of 0.96), while they performed better than those tested on the test/validation dataset (AUROC of 0.92). Based on the fact that two studies of the test/validation dataset used external validation data, we assumed that the sources of heterogeneity were derived from different population groups.

Performance of subgroup analysis

In the eight included studies, the pooled estimation was better in the training dataset compared to the test/validation dataset after dividing them into two categories according to the different datasets, indicating ML models require more external verification in order to prevent overestimation caused by overfitting and bias (41). Only Aufero and Hermans' model was externally validated among all the included studies (34, 37). Validation datasets are used to tune hyperparameters, while test datasets are used to evaluate the final model fit unbiasedly (42). It is more common for ML models to be validated internally by using test datasets derived from the training dataset rather than externally by using independent datasets.

Also, we analyzed the performance of ML models for identifying LQTS subtypes. ML models are more accurate in detecting LQT2 than LQT1. Bos et al. used unsupervised feature extraction, in which all 12-lead ECG waveform were analyzed agnostically during training. The LQTS subtypes of this model were subsequently classified using this unsupervised training model. Considering the nature of CNNs, they could not identify which ECG features were critical for identifying LQTS subtypes (21). Aufiero et al. assumed that QRS features are the main ECG signal for classification as their ML models derive much from the initial part of the QRS complex (34).

Clinical model performance

Even though genetic testing is an effective method for identifying genetic heterogeneity, it cannot definitively exclude LQTS as a diagnosis (43). 12-lead ECGs is recommended as a basis for diagnosing LQTS (28, 44). Therefore, it is imperative to investigate whether ECG-based ML models could improve patient outcomes. We distinguished between studies comparing the performance of ML models with that of expert cardiologists. There were only two studies that validated their models and showed that they were almost as accurate as experts at providing diagnostic information (34, 37). ML models were in good agreement with experts with regard to sensitivity and specificity. Theoretically, ML models could not replace experts, but could be used to assist physicians with limited knowledge of LQTS (37).

Future directions and academic contribution

The fact that many models were developed using similar populations should be noted. Four models were developed based on patients predominantly of European ancestry, with three models developed in the Netherlands and only one model developed in Iran. It is known that LQTS genes are influenced by ancestry in terms of diversity and prevalence of genetic variants (45). Prior studies have demonstrated that arrhythmia-associated sequence variants as well as polymorphisms vary significantly by ethnicity (4648). According to a study that assessed KCNQ1 (LQT1) and KCNH2 (LQT2) variants in 744 individuals, 86% (42 of 49) of them were ethnicity-specific and were found exclusively in Asians (n = 2), Hispanics (n = 2), African Americans (n = 26), and Europeans (n = 12) (46). European ancestry showed higher frequencies of LQT1 and 2 compared to other ethnicities. It is most prevalent among African Americans (4.5%) who carry the rare variant of the SCN5A (LQT3) gene (47). SCN5A variants were found in 829 individuals, with 49% of them being African Americans. In particular, among African Americans, 13% have a variant of SCN5A related to increased arrhythmia risks, while neither Asians nor Europeans do (46). To ensure generalization, we encourage the development of models based on data from different centers and countries.

Further research is needed to determine the most effective strategies for integrating ML models into clinical workflows and to assess the effectiveness of these strategies on relevant clinical outcomes. Several factors, including sex, age, and comorbidities, may contribute to clinically relevant heterogeneity in the detection of LQTS (24). With the use of ML models, it can be possible to investigate whether or not it is possible to identify the precise population using pre-specified parameters in order to administer treatment more effectively to groups of patients that are more homogeneous.

Strengths and limitations

A number of strengths can be found in our study. In this study, we attempted to investigate the potential of ML models in the diagnosis and genotyping of LQTS. A comprehensive search strategy and an in-depth analysis were employed. Any model used to identify LQTS was included, which permitted us to include models that were not originally intended for detecting LQTS but may have merit. ML models for the two most common LQTS subtypes were also analyzed by subgroup meta-analysis, enhancing the relevance of our study.

According to the studies we included, we were limited by the data they reported. Our meta-analysis has the following limitations: (1) LQTS patients were identified using various types of ML models in all the included studies. As a result, we are unable to suggest which model was most suitable for detecting LQTS based on the results of our study. Nevertheless, all of the models examined in the included studies demonstrated that ML algorithms can assist cardiologists in detecting LQTS. (2) Only two studies were externally validated, which inherently limited the quality of evidence. The outcome of this study would have been improved if all studies had been externally validated. (3) As LQTS is relatively rare, we enrolled only 2,692 patients with LQTS in our analysis, mostly with LQT1 or LQT2, and only 12% with LQT3, so further evaluation is necessary to determine the performance of ML models for LQTS subtypes. (4) As β-blocker will shorten the QT interval, ML algorithms performance may differ on ECGs where individuals are treated with β-blocker. In our study, only Aufero and Hermans reported the number of LQTS patients with β-blocker therapy (19, 34). However, like another six studies, they also did not separate groups based on β-blocker treatment due to the lack of sample size. Thus, it will be important to investigate whether the discriminatory power of ML algorithms can be replicated in an independent cohort of LQTS patients. (5) For diagnosing LQTS, T-wave morphology is most commonly used. Most researchers in our study also implemented ML algorithms to identify the differentiation of LQTS and controls based on T-wave morphology features after calculating the length of QTc (19, 21, 34, 35, 3739). They extracted features from T-wave, such as the Q to T-peak, the T-peak to T-end interval, T-wave magnitude, and T-loop slopes to differentiate LQTS from other cardiovascular diseases with similar T-wave. However, only 3 included studies explicitly reported that the patients in control group had other possible cardiac disease, restricting analysis of ML algorithms’ diagnostic value in individuals with any underlying cardiovascular disease.

Conclusion

ML algorithms for the diagnosis of LQTS are currently being developed, but they are gradually evolving into a form that can be applied to clinical practice. Clinically, however, the development of algorithms for identifying LQTS subtypes is critical. In addition, external validation of ML algorithms in LQTS is crucial. ML algorithms could be popularized and applied to clinical practice by cardiologists once they are validated in this manner.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

M-JW, W-QW, WZ, J-HL, and X-WZ prepared the material, collected the data, and conducted the analysis. M-JW wrote the initial draft of the manuscript, and X-WZ critically reviewed it. All authors contributed to the article and approved the submitted version.

Funding

Hangzhou City Science and Technology Development Plan Project (20190101A03).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2023.1172451/full#supplementary-material

References

1. Schwartz PJ, Ackerman MJ. The long QT syndrome: a transatlantic clinical approach to diagnosis and therapy. Eur Heart J. (2013) 34(40):3109–16. doi: 10.1093/eurheartj/eht089

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Shah SR, Park K, Alweis R. Long QT syndrome: a comprehensive review of the literature and current evidence. Curr Probl Cardiol. (2019) 44(3):92–106. doi: 10.1016/j.cpcardiol.2018.04.002.29784533

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Vink AS, Neumann B, Lieve KVV, Sinner MF, Hofman N, El Kadi S, et al. Determination and interpretation of the QT interval. Circulation. (2018) 138(21):2345–58. doi: 10.1161/CIRCULATIONAHA.118.033943

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Hofman N, Wilde AA, Kääb S, van Langen IM, Tanck MW, Mannens MM, et al. Diagnostic criteria for congenital long QT syndrome in the era of molecular genetics: do we need a scoring system? Eur Heart J. (2007) 28(5):575–80. doi: 10.1093/eurheartj/ehl355

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Steinberg C. Diagnosis and clinical management of long-QT syndrome. Curr Opin Cardiol. (2018) 33(1):31–41. doi: 10.1097/HCO.0000000000000465

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Schwartz PJ, Stramba-Badiale M, Crotti L, Pedrazzini M, Besana A, Bosi G, et al. Prevalence of the congenital long-QT syndrome. Circulation. (2009) 120(18):1761–7. doi: 10.1161/CIRCULATIONAHA.109.863209

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Ackerman MJ, Khositseth A, Tester DJ, Hejlik JB, Shen WK, Porter CB. Epinephrine-induced QT interval prolongation: a gene-specific paradoxical response in congenital long QT syndrome. Mayo Clin Proc. (2002) 77(5):413–21. doi: 10.1016/S0025-6196(11)62209-X

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Viskin S, Postema PG, Bhuiyan ZA, Rosso R, Kalman JM, Vohra JK, et al. The response of the QT interval to the brief tachycardia provoked by standing: a bedside test for diagnosing long QT syndrome. J Am Coll Cardiol. (2010) 55(18):1955–61. doi: 10.1016/j.jacc.2009.12.015

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Sugrue A, Rohatgi RK, Noseworthy PA, Kremen V, Bos JM, Qiang B, et al. Architectural T-wave analysis and identification of on-therapy breakthrough arrhythmic risk in type 1 and type 2 long-QT syndrome. Circ Arrhythm Electrophysiol. (2017) 10(11):1–10. doi: 10.1161/CIRCEP.117.005648

CrossRef Full Text | Google Scholar

10. Sugrue A, Noseworthy PA, Kremen V, Bos JM, Qiang B, Rohatgi RK, et al. Automated T-wave analysis can differentiate acquired QT prolongation from congenital long QT syndrome. Ann Noninvasive Electrocardiol. (2017) 22(6):1–7. doi: 10.1111/anec.12455

PubMed Abstract | CrossRef Full Text | Google Scholar

11. ter Bekke RMA, Haugaa KH, van den Wijngaard A, Bos JM, Ackerman MJ, Edvardsen T, et al. Electromechanical window negativity in genotyped long-QT syndrome patients: relation to arrhythmia risk. Eur Heart J. (2015) 36(3):179–86. doi: 10.1093/eurheartj/ehu370

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Lane CM, Bos JM, Rohatgi RK, Ackerman MJ. Beyond the length and look of repolarization: defining the non-QTc electrocardiographic profiles of patients with congenital long QT syndrome. Heart Rhythm. (2018) 15(9):1413–9. doi: 10.1016/j.hrthm.2018.04.033

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Shimizu W. Clinical impact of genetic studies in lethal inherited cardiac arrhythmias. Circ J. (2008) 72(12):1926–36. doi: 10.1253/circj.CJ-08-0947

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Goldenberg I, Horr S, Moss AJ, Lopes CM, Barsheshet A, McNitt S, et al. Risk for life-threatening cardiac events in patients with genotype-confirmed long-QT syndrome and normal-range corrected QT intervals. J Am Coll Cardiol. (2011) 57(1):51–9. doi: 10.1016/j.jacc.2010.07.038

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Bond RR, Finlay DD, Nugent CD, Breen C, Guldenring D, Daly MJ. The effects of electrode misplacement on clinicians’ interpretation of the standard 12-lead electrocardiogram. Eur J Intern Med. (2012) 23(7):610–5. doi: 10.1016/j.ejim.2012.03.011

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Maršánová L, Ronzhina M, Smíšek R, Vítek M, Němcová A, Smital L, et al. ECG Features and methods for automatic classification of ventricular premature and ischemic heartbeats: a comprehensive experimental study. Sci Rep. (2017) 7(1):11239. doi: 10.1038/s41598-017-10942-6

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Miyazawa AA. Artificial intelligence: the future for cardiology. Heart. (2019) 105(15):1214. doi: 10.1136/heartjnl-2018-314464

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Prifti E, Fall A, Davogustto G, Pulini A, Denjoy I, Funck-Brentano C, et al. Deep learning analysis of electrocardiogram for risk prediction of drug-induced arrhythmias and diagnosis of long QT syndrome. Eur Heart J. (2021) 42(38):3948–61. doi: 10.1093/eurheartj/ehab588

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Immanuel SA, Sadrieh A, Baumert M, Couderc JP, Zareba W, Hill AP, et al. T-wave morphology can distinguish healthy controls from LQTS patients. Physiol Meas. (2016) 37(9):1456–73. doi: 10.1088/0967-3334/37/9/1456

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Giudicessi JR, Schram M, Bos JM, Galloway CD, Shreibati JB, Johnson PW, et al. Artificial intelligence–enabled assessment of the heart rate corrected QT interval using a Mobile electrocardiogram device. Circulation. (2021) 143(13):1274–86. doi: 10.1161/CIRCULATIONAHA.120.050231

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Bos JM, Attia ZI, Albert DE, Noseworthy PA, Friedman PA, Ackerman MJ. Use of artificial intelligence and deep neural networks in evaluation of patients with electrocardiographically concealed long QT syndrome from the surface 12-lead electrocardiogram. JAMA Cardiol. (2021) 6(5):532–8. doi: 10.1001/jamacardio.2020.7422

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Moher D, Liberati A, Tetzlaff J, Altman DG. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. (2009) 6(7):e1000097. doi: 10.1371/journal.pmed.1000097

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Priori SG, Wilde AA, Horie M, Cho Y, Behr ER, Berul C, et al. HRS/EHRA/APHRS expert consensus statement on the diagnosis and management of patients with inherited primary arrhythmia syndromes: document endorsed by HRS, EHRA, and APHRS in May 2013 and by ACCF, AHA, PACES, and AEPC in June 2013. Heart Rhythm. (2013) 10(12):1932–63. doi: 10.1016/j.hrthm.2013.05.014

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Wilde AAM, Amin AS, Postema PG. Diagnosis, management and therapeutic strategies for congenital long QT syndrome. Heart. (2022) 108(5):332–8. doi: 10.1136/heartjnl-2020-318259

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Van Langen IM, Birnie E, Alders M, Jongbloed RJ, Le Marec H, Wilde AAM. The use of genotype-phenotype correlations in mutation analysis for the long QT syndrome. J Med Genet. (2003) 40(2):141–5. doi: 10.1136/jmg.40.2.141

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Whiting PF, Rutjes AWS, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. (2011) 155(8):529–36. doi: 10.7326/0003-4819-155-8-201110180-00009

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Sounderajah V, Ashrafian H, Rose S, Shah NH, Ghassemi M, Golub R, et al. A quality assessment tool for artificial intelligence-centered diagnostic test accuracy studies: QUADAS-AI. Nat Med. (2021) 27(10):1663–5. doi: 10.1038/s41591-021-01517-0

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Barsheshet A, Dotsenko O, Goldenberg I. Genotype-specific risk stratification and management of patients with long QT syndrome. Ann Noninvasive Electrocardiol. (2013) 18(6):499–509. doi: 10.1111/anec.12117

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Debray TP, Damen JA, Riley RD, Snell K, Reitsma JB, Hooft L, et al. A framework for meta-analysis of prediction model studies with binary and time-to-event outcomes. Stat Methods Med Res. (2019) 28(9):2768–86. doi: 10.1177/0962280218785504

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Higgins JPT, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analyses. Br Med J (Clin Res Ed). (2003) 327(7414):557–60. doi: 10.1136/bmj.327.7414.557

CrossRef Full Text | Google Scholar

31. Moses LE, Shapiro D, Littenberg B. Combining independent studies of a diagnostic test into a summary ROC curve: data-analytic approaches and some additional considerations. Stat Med. (1993) 12(14):1293–316. doi: 10.1002/sim.4780121403

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Deeks JJ. Systematic reviews in health care: systematic reviews of evaluations of diagnostic and screening tests. Br Med J (Clin Res Ed). (2001) 323(7305):157–62. doi: 10.1136/bmj.323.7305.157

CrossRef Full Text | Google Scholar

33. Islam MM, Nasrin T, Walther BA, Wu C-C, Yang H-C, Li Y-C. Prediction of sepsis patients using machine learning approach: a meta-analysis. Comput Methods Programs Biomed. (2019) 170:1–9. doi: 10.1016/j.cmpb.2018.12.027

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Aufiero S, Bleijendaal H, Robyns T, Vandenberk B, Krijger C, Bezzina C, et al. A deep learning approach identifies new ECG features in congenital long QT syndrome. BMC Med. (2022) 20(1):162. doi: 10.1186/s12916-022-02350-z

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Doldi F, Plagwitz L, Hoffmann LP, Rath B, Frommeyer G, Reinke F, et al. Detection of patients with congenital and often concealed long-QT syndrome by novel deep learning models. J Pers Med. (2022) 12(7):1135. doi: 10.3390/jpm12071135

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Hajimolahoseini H, Redfearn D, Krahn A. A deep learning approach for diagnosing long QT syndrome without measuring QT interval. 32nd Canadian Conference on Artificial Intelligence (Canadian AI); Kingston, Canada: Queens Univ Kingston (2019).

37. Hermans BJM, Bennis FC, Vink AS, Koopsen T, Lyon A, Wilde AAM, et al. Improving long QT syndrome diagnosis by a polynomial-based T-wave morphology characterization. Heart Rhythm. (2020) 17(5):752–8. doi: 10.1016/j.hrthm.2019.12.020

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Hermans BM, Stoks J, Bennis FC, Vink AS, Garde A, Wilde AAM, et al. Support vector machine-based assessment of the T-wave morphology improves long QT syndrome diagnosis. Europace. (2018) 20:113–9. doi: 10.1093/europace/euy243

CrossRef Full Text | Google Scholar

39. Zeraatkar E, Kermani S, Mehridehnavi A, Aminzadeh A, Zeraatkar E, Sanei H. Arrhythmia detection based on morphological and time-frequency features of T-wave in electrocardiogram. J Med Signals Sens. (2011) 1(2):99–106. doi: 10.4103/2228-7477.95293

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Taggart NW, Haglund CM, Tester DJ, Ackerman MJ. Diagnostic miscues in congenital long-QT syndrome. Circulation. (2007) 115(20):2613–20. doi: 10.1161/CIRCULATIONAHA.106.661082

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Mandrekar JN. Receiver operating characteristic curve in diagnostic test assessment. J Thorac Oncol. (2010) 5(9):1315–6. doi: 10.1097/JTO.0b013e3181ec173d

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Zhang J, Liu Y, Mitsuhashi T, Matsuo T. Accuracy of deep learning algorithms for the diagnosis of retinopathy of prematurity by Fundus images: a systematic review and meta-analysis. J Ophthalmol. (2021) 2021:8883946. doi: 10.1155/2021/8883946

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Tester DJ, Will ML, Haglund CM, Ackerman MJ. Effect of clinical phenotype on yield of long QT syndrome genetic testing. J Am Coll Cardiol. (2006) 47(4):764–8. doi: 10.1016/j.jacc.2005.09.056

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Neira V, Enriquez A, Simpson C, Baranchuk A. Update on long QT syndrome. J Cardiovasc Electrophysiol. (2019) 30(12):3068–78. doi: 10.1111/jce.14227

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Roston TM, De Souza AM, Romans HV, Franciosi S, Armstrong KR, Sanatani S. Potential overdiagnosis of long QT syndrome using exercise stress and QT stand testing in children and adolescents with a low probability of disease. J Cardiovasc Electrophysiol. (2021) 32(2):500–6. doi: 10.1111/jce.14865

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Splawski I, Timothy KW, Tateyama M, Clancy CE, Malhotra A, Beggs AH, et al. Variant of SCN5A sodium channel implicated in risk of cardiac arrhythmia. Science. (2002) 297(5585):1333–6. doi: 10.1126/science.1073569

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Ackerman MJ, Splawski I, Makielski JC, Tester DJ, Will ML, Timothy KW, et al. Spectrum and prevalence of cardiac sodium channel variants among black, white, Asian, and hispanic individuals: implications for arrhythmogenic susceptibility and brugada/long QT syndrome genetic testing. Heart Rhythm. (2004) 1(5):600–7. doi: 10.1016/j.hrthm.2004.07.013

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Ackerman MJ, Tester DJ, Jones GS, Will ML, Burrow CR, Curran ME. Ethnic differences in cardiac potassium channel variants: implications for genetic susceptibility to sudden cardiac death and genetic testing for congenital long QT syndrome. Mayo Clin Proc. (2003) 78(12):1479–87. doi: 10.4065/78.12.1479

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: machine learning, ECG, LQTS, systematic review, meta-analysis

Citation: Wu M-J, Wang W-Q, Zhang W, Li J-H and Zhang X-W (2023) The diagnostic value of electrocardiogram-based machine learning in long QT syndrome: a systematic review and meta-analysis. Front. Cardiovasc. Med. 10:1172451. doi: 10.3389/fcvm.2023.1172451

Received: 23 February 2023; Accepted: 16 May 2023;
Published: 7 June 2023.

Edited by:

Oliver Faust, Sheffield Hallam University, United Kingdom

Reviewed by:

Josef Halamek, Institute of Scientific Instruments (ASCR), Czechia
William J. Young, Queen Mary University of London, United Kingdom

© 2023 Wu, Wang, Zhang, Li and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xing-Wei Zhang aHNkenh3QDEyNi5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.