Skip to main content

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 25 April 2024
Sec. Cardiovascular Genetics and Systems Medicine

Genome-wide association studies reveal differences in genetic susceptibility between single events vs. recurrent events of atrial fibrillation and myocardial infarction: the HUNT study

\r\nMartina Hall,Martina Hall1,2Anne Heidi SkogholtAnne Heidi Skogholt2Ida SurakkaIda Surakka3Haavard Dalen,,Haavard Dalen4,5,6Eivind Almaas,
\r\nEivind Almaas1,2*
  • 1Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim, Norway
  • 2K.G. Jebsen Center for Genetic Epidemiology, Norwegian University of Science and Technology, Trondheim, Norway
  • 3Department of Internal Medicine, University of Michigan, Ann Arbor, MI, United States
  • 4Clinic of Cardiology, St. Olavs University Hospital, Trondheim, Norway
  • 5Department of Circulation and Medical Imaging, Norwegian University of Science and Technology, Trondheim, Norway
  • 6Department of Medicine, Levanger Hospital, Nord-Trøndelag Hospital Trust, Levanger, Norway

Genetic research into atrial fibrillation (AF) and myocardial infarction (MI) has predominantly focused on comparing afflicted individuals with their healthy counterparts. However, this approach lacks granularity, thus overlooking subtleties within patient populations. In this study, we explore the distinction between AF and MI patients who experience only a single disease event and those experiencing recurrent events. Integrating hospital records, questionnaire data, clinical measurements, and genetic data from more than 500,000 HUNT and United Kingdom Biobank participants, we compare both clinical and genetic characteristics between the two groups using genome-wide association studies (GWAS) meta-analyses, phenome-wide association studies (PheWAS) analyses, and gene co-expression networks. We found that the two groups of patients differ in both clinical characteristics and genetic risks. More specifically, recurrent AF patients are significantly younger and have better baseline health, in terms of reduced cholesterol and blood pressure, than single AF patients. Also, the results of the GWAS meta-analysis indicate that recurrent AF patients seem to be at greater genetic risk for recurrent events. The PheWAS and gene co-expression network analyses highlight differences in the functions associated with the sets of single nucleotide polymorphisms (SNPs) and genes for the two groups. However, for MI patients, we found that those experiencing single events are significantly younger and have better baseline health than those with recurrent MI, yet they exhibit higher genetic risk. The GWAS meta-analysis mostly identifies genetic regions uniquely associated with single MI, and the PheWAS analysis and gene co-expression networks support the genetic differences between the single MI and recurrent MI groups. In conclusion, this work has identified novel genetic regions uniquely associated with single MI and related PheWAS analyses, as well as gene co-expression networks that support the genetic differences between the patient subgroups of single and recurrent occurrence for both MI and AF.

1 Introduction

Myocardial infarction (MI) and atrial fibrillation (AF) are two prevalent cardiovascular diseases. AF, in particular, is a well-established risk factor for several other cardiovascular conditions. The severity and mortality risk associated with AF increase significantly upon relapse. To manage and prevent new AF events, a range of medical and interventional therapies are available. These treatments aim to either normalize the rhythm or stabilize the heart rate. Among these approaches, AF ablation has emerged as a leading clinical treatment. However, its success rate varies, and approximately 20%–40% of patients may require additional treatment (1). Similarly, MI is a severe heart diagnosis associated with high mortality rates. Upon survival, the heart is most likely weakened, making the patient vulnerable to other diseases. In fact, 33% of patients experiencing MI die within a year (all causes of death) (2). However, some patients experience only a single event of MI and lead a normal and healthy life afterward. Most MI patients undergo cardiac catheterization and percutaneous coronary interventions in the acute phase, while medical therapies targeting clotting of blood and lipids, as well as lifestyle interventions, are provided to reduce the chance of recurrent events. Still, a significant proportion of MI patients suffer from relapse.

Many studies have been conducted to identify genetic variants that likely affect the risk of AF (3, 4) and MI (5, 6). While multiple variants have been identified and later replicated in other studies, these variants were identified when comparing all included cases of AF or MI with healthy controls. Some studies have been conducted to understand the genetics of patients experiencing recurrent AF (1, 710) and MI events (1113). These are, however, mostly focused on either the patients’ response after treatment or the genetic effects on recurrence from known AF/MI variants or genes. Little effort has been made regarding the comparison of genetics between patients experiencing a single event vs. patients experiencing recurrent events.

To date, no comprehensive genome-wide association studies (GWAS) analysis has directly compared the genetic profiles of patients with recurrent events to those experiencing single events in either AF or MI. In this study, we explore whether statistically significant genetic differences exist between patients who encounter recurrent events (defined as two or more occurrences) of AF or MI and patients who only experience a single event. Notably, we do not differentiate cases based on the specific treatment received after the initial AF or MI event. By adopting this approach, we aim for a broad comparison to uncover potential genetic distinctions between patients with single events and those with recurrent events.

2 Methods

2.1 Cohorts

2.1.1 The HUNT study

The Trøndelag Health study (HUNT) is a health-related population-based longitudinal study based on four rounds of data collection: HUNT1 (1984–1986), HUNT2 (1995–1997), HUNT3 (2006–2008), and HUNT4 (2017–2019). With a unique database covering clinical measurements, questionnaire data, and biological samples from roughly 230,000 inhabitants of the Trøndelag county from 1984 onward, it is one of the largest health study ever performed (14). A great benefit of the HUNT study is the connection to other health-related registries by use of the Norwegian unique personal identification number. These health registries include hospital and general practitioner registries, cancer registries, cause of death registries, and the prescription database.

In the current study, genotype data for 69,621 participants from HUNT2 and HUNT3 were used, and these were linked to questionnaire data and clinical measurements from HUNT1, HUNT2, and HUNT3, regional hospital records, the Nord-Trøndelag Hospital Trust (HNT), and the Norwegian Cause of Death Registry (COD). The HNT registry contains all ICD9 and ICD10 codes for hospital visits of these HUNT participants from August 1987 to April 2017. The COD registry spans the same period, with registered ICD9 and ICD10 codes for the primary and secondary causes of death.

2.1.2 The United Kingdom Biobank

The United Kingdom Biobank (UKBB) is a health-related population-based study consisting of approximately 500,000 middle-aged UK inhabitants. Sampling of the participants took place from 2006 to 2010, when questionnaires, clinical measurements, and biological samples were collected. Similar to the HUNT study, it is also linked to electronic health records that contain information about the participants’ hospital, general practitioner, and death records with ICD9 and ICD10 diagnose codes (15). Genotyped data are available for more than 480,000 of the participants, and in the current study, we use these data together with relevant questionnaire data, clinical measurements, and hospital and death records for the genotyped participants (of European ancestry). The hospital records span from December 1992 to September 2021. The death registry spans from 2006 until September 2021.

2.2 Genotyping and imputation

Genotyping and imputation of the HUNT and UKBB participants have been described elsewhere (16, 17). Briefly, genotyping was performed using one of three Illumina HumanCoreExome arrays: 12 v.1.0, 12 v.1.1 with custom content (UM HUNT Biobank v1.0) according to standard protocols for the HUNT participants, and standard protocols for Affymetrix Applied Biosystems UK BiLEVE Axiom or Applied Biosystems UK Biobank Axiom array for the UKBB participants. Standard quality control was performed for the HUNT genotyping, as well as a UKBB-specific quality control for the UKBB genotyping. Imputation in HUNT was performed using 2,202 whole-genome sequenced samples from HUNT together with the Haplotype Reference Consortium (HRC) reference panel (18, 19), resulting in 25 million genetic markers. For UKBB, the HRC and UK10K+1000 Genomes reference panel were used, resulting in 90 million variants.

2.3 Definitions of traits and outcomes

Hospital records of HUNT and UKBB participants were used to determine cases of MI and AF as well as the number of events for each participant. An MI event is defined as the patient having a registered diagnosis of ICD10:I21-I24 or ICD9:410. An AF event is defined as a diagnosis of ICD10:I48 or ICD9:427.3.

In both UKBB and HUNT records, each diagnosis is registered as a main or bi-diagnosis (denoted as first and second diagnosis in UKBB), and we take both of them into consideration when determining the number of events for each participant. Limiting our analysis to only the main diagnosis while excluding bi-diagnoses introduces potential errors and significantly reduces the available data. Bi-diagnoses can be interpreted in various ways. For instance, consider a scenario where a patient is admitted to the hospital with a primary diagnosis, and additional diagnoses are identified and documented during that initial visit. Despite being new diagnoses, these are categorized as bi-diagnoses. Had they been the sole disease and reason for the hospital visit on that day, they might have been recorded as the main diagnoses. Alternatively, a physician might infer from historical records that certain other conditions (such as AF or MI) are relevant to the primary diagnosis and include them as bi-diagnoses, even if they are not recent events. Given the diverse reporting practices related to bi-diagnoses, we employ selective filtering to distinguish single events from recurrent events of MI and AF.

A first event is defined as the initial visit during which a specific diagnosis appears in the medical records, either as a primary (main) or as a secondary (bi-diagnosis) diagnosis. Subsequently, a second event is established if there exists a time gap of more than 1 month between the initial event and any subsequent occurrences. The second event must meet one of the following criteria: (i) it is recorded as a main diagnosis or (ii) it is a bi-diagnosis and the sole diagnosis documented on that particular day (indicating a genuinely new event reported at that time). Subsequent events are similarly categorized as second events, with a minimum interval of 1 month from the previously defined event. This event definition ensures the selection of new occurrences, except in cases where only main diagnoses are exclusively considered.

In our investigation of comparing patients with recurrent events to those who remain relapse-free, it is crucial to address the potential misclassification of patients with only a single recorded event. Specifically, we need to ensure that such patients are not erroneously categorized due to premature mortality before experiencing subsequent events. Analyzing the HUNT dataset, we observe that approximately 80% of secondary events occur within a 5-year window for AF and a 7-year window for MI. To mitigate this potential bias, we apply the following filtering criteria: First, we exclude single-event participants who have passed away either due to the phenotype itself or within the specified time frames after the initial AF or MI event. These time frames align with the observed secondary event patterns in the HUNT and UKBB datasets. Second, we remove participants registered with only a single AF or MI event if it occurred less than 5 (AF) or 7 (MI) years before the censoring dates (6 April 2017 for HUNT and 12 November 2021 for UKBB). The three trait groups are denoted as single AF/MI: participants that experience only one event of AF/MI, and recurrent AF/MI: participants that experience more than one event of AF/MI, while satisfying the conditions specified above.

Baseline and clinical characteristics, as well as information about other relevant diseases identified with the participants, were taken from the HUNT and UKBB hospital records, questionnaires, and clinical measurements. Participants were defined to have diabetes and/or hypertension if they have ever been registered with the ICD codes ICD10:E10-E14 or ICD9:250 for diabetes and/or ICD10:I10-I15 (excluding I11.0), and ICD9:401-405 for hypertension. The smoking variable was derived from the HUNT questionnaire response to the question: “Have you ever smoked?” (with options for “Yes” or “No”). For each patient, we utilized the most recent HUNT participation data available prior to the disease event. The corresponding variable in UKBB was “Ever smoked (Yes/No),” which was constructed upon sampling. To assess statistical differences in characteristics (age, diabetes, hypertension, BMI, smoking, cholesterol, systolic, and diastolic blood pressure) between groups with single vs. recurrent events, we employed the Student’s t-test for continuous variables and Fishers’ exact test for binary variables. Test statistics with Bonferroni-adjusted p-values (p<0.05/8=6.25×103) were considered significant findings.

2.4 GWAS meta-analysis

To identify genetic factors associated with single or recurrent events of AF and MI, we conducted three GWAS analyses for each trait in both cohorts separately: (i) patients with single events vs. healthy controls, (ii) patients with recurrent events vs. healthy controls, and (iii) patients with recurrent events vs. patients with single events. As a control, we also conducted a GWAS analysis in each cohort with all cases of each disease against healthy controls. Healthy controls were defined as participants with no registered events of AF and MI. Variants with minor allele count (MAC) <3 and an imputation score <0.3 were excluded from all GWAS results. Participants with non-European recent ancestry were excluded from the analyses in UKBB (note that all genotyped HUNT participants are of European ancestry). Association analyses were performed with SAIGE, using a generalized linear mixed model adjusted for relatedness and unbalanced case–control ratios (20). Birth year, gender, batch/chip, and the first four principal components were added as covariates in the models. Here, birth year is chosen instead of age at the time the event was recorded to facilitate the building of phenotypes based on a heterogenic set of data sources collected at different time points using multiple diagnostic codes. Genomic variants with minor allele frequency (MAF) >1% in one or both studies were included in the meta-analysis.

From the eight GWAS analyses (three for AF, three for MI, and one control for each disease) performed for both the HUNT population and the UKBB population, we performed eight fixed-effect inverse variance weighted (IVW) meta-analyses using METAL (21). In METAL, each variant is assigned a new effect size as the sum of each study’s effect size weighted by the corresponding study variance. The p-values in the meta-analysis are calculated based on the Z statistic given by the new effect sizes and standard errors. Variants reaching genome-wide significance (p-values <5×108) from the Z statistic were considered significant findings. Annotations of significant single nucleotide polymorphisms (SNPs), identification of nearest genes, and a search for nearby SNPs associated with relevant traits were performed with the FUMA platform and the GWAS catalog (22, 23). Variants were considered to be in the same genetic region if they were less than 500 kb apart, and genetic regions denoted as shared for both the single and the recurrent events meta-analysis were either consisting of the same SNPs or SNPs within the same genetic region. Observed scale genetic heritability of the traits were found using the LD Score Regression software (24), with precomputed LD Scores for Europeans from the 1000 Genomes reference panel (25) and summary statistics from the meta-analysis.

2.5 Phenome-wide association studies

Phenome-wide association studies (PheWAS) were performed on all SNPs from the meta-analyses reaching genome-wide significance. From the comprehensive Pan UKBB resource (26), we collected results from GWAS conducted on 1,326 phenocodes, and we identified the effect of each of our SNPs of interest on each phenocode. All GWAS results from the Pan UKBB are based on UKBB participants, and we selected results for European ancestry exclusively. For each set of SNPs (identified in common or specifically for either single or recurrent AF/MI), phenotypes with a p-value <0.05/(1326×nset), where nset is the number of SNPs in the set, were considered significant associations. For simplicity, only the SNP with the lowest p-value for each phenotype was selected from each set of SNPs.

2.6 Gene function and network analyses

The sets of nearest genes to the SNPs identified through the GWAS analyses as common or unique to either recurrent or single AF/MI events (in total six sets) were analyzed for tissue specificity (differentially expressed gene sets in each tissue). We employed both FUMA (22) and gene ontology enrichment using Fisher’s exact over-representation test in PANTHER (protein annotation through evolutionary relationship) (27). Here, biological processes with a false discovery rate (FDR) adjusted for multiple testing <0.05 were considered functionally enriched for the gene set. To further investigate the processes connected to these genes, we performed gene co-expression network analysis (2830), where the hypothesis is that highly correlated genes have a regulatory relationship or similar response in a condition (31). Using the identified gene sets as target genes in an egocentric gene co-expression network analysis, we generated a network from the shared neighborhoods among the closest neighbor genes of each target gene in the gene set, and we inspected the gene functions in the network.

Creating these egocentric networks involves several steps. First, using gene expression data from GTEx v.8 (32) (https://www.gtexportal.org) gene co-expression networks for seven tissue sub-types from the heart, muscle, skeletal, artery, and kidney (GTEx_Analysis_v8_eQTL_expression_matrices.tar: Heart Atrial Appendage, Heart Left Ventricle, Muscle Skeletal, Artery Aorta, Artery Coronary, Artery Tibial, and Kidney Cortex) were created. Since co-expression patterns may vary in different tissues (31), a separate network was created for each tissue. Following the WGCNA approach (33), the link weight (strength of co-expression) between each pair of genes (i and j) were defined by the weighted topological overlap (wTO) in Equation 1:

wTOij=Aij+ki,jAikAkjmin(uAiu,uAju)+1Aij,(1)

where Aij=|cor(i,j)|6 is the absolute Pearson correlation of the gene expressions raised to a power 6 to emphasize the strongest correlations. The resulting gene co-expression network is then an all-to-all network where pairs of genes with high wTO-link weights represent strong connections between the genes and their topological neighborhood. Only the 15% strongest links from each tissue were included in the following analysis (still leaving about 30 million links) to avoid the inclusion of genes based on weak (and likely spurious) connections.

Next, for each of the seven tissues, egocentric networks for each target gene were extracted from the co-expression networks. The egocentric networks were filtered to include only the 25 genes with the strongest wTO-link weights with each target gene. By merging and further reducing the complexity of the networks, the 25 strongest linked genes to each target gene across all tissues were selected in the final network. Here, we weighted the link strengths using wTOijweighted=w=1WwTOij,w2, where W is the number of tissues in which the linked gene is among the 25 strongest linked genes to the target gene and wTOij,w is the corresponding wTO-link weight in tissue w.

The final six sets of egocentric networks for target genes identified as common or as unique for the single or recurrent AF/MI events were analyzed with the igraph R-package (34, 35). Shared neighboring genes were defined as genes linked to two or more of the target genes. The set of shared neighborhood genes for each network was plotted separately with Cytoscape (v. 3.8.1) (36) and gene ontology enrichment of these gene sets were obtained through the PANTHER over-representation test (27).

3 Results

3.1 Characteristics of trait groups

Among the genotyped participants with European ancestry included in this study, there are 7,127 and 29,330 hospital patients registered with AF in HUNT and UKBB, respectively. Employing the filtering approach described in the Methods section, we identified 1,425 HUNT and 9,561 UKBB patients with single AF events and 2,267 HUNT and 7,267 UKBB patients with recurrent AF events. Correspondingly, 5,805 HUNT and 14,592 UKBB participants are registered with MI events. Of these, 1,651 HUNT and 6,584 UKBB patients are identified with single MI events and 1,615 HUNT and 1,615 UKBB patients are identified with recurrent MI events.

Baseline and clinical characteristics of these patients are presented in Table 1. In the HUNT study, a comparison between the two AF groups reveals a discernible pattern. The group experiencing a single AF episode tends to be older (adjusted p-value 2×1010) and displays elevated levels of cholesterol and systolic blood pressure (adjusted p-values 6.1×106 and 5×104, respectively). Similarly, an examination of the AF groups within the UKBB reinforces this trend, with the single AF event group exhibiting higher age (adjusted p-value <1016), along with marginally higher levels of BMI and systolic blood pressure compared to the recurrent AF group.

Table 1
www.frontiersin.org

Table 1. Characteristics of sample groups of single and recurrent events of AF and MI in the HUNT and UKBB population.

However, the reverse trends emerge when analyzing the MI groups in the HUNT study. Here, patients experiencing recurrent MI events are older (adjusted p-value <1016) and demonstrate higher rates of diabetes and hypertension (adjusted p-values 8.8×105 and 7.6×103, respectively), alongside elevated levels of cholesterol and systolic blood pressure (adjusted p-values 5.4×106 and 1.4×105, respectively). In addition, there is a tendency toward higher BMI and diastolic blood pressure within this group. These trends persist within the UKBB MI cohorts, where the recurrent MI event group exhibits higher age, BMI, and prevalence of diabetes and hypertension (adjusted p-values 6×1014, 3.9×105, <1016, and <1016, respectively) compared to the single MI event group. Moreover, there is a tendency toward higher systolic blood pressure levels within the UKBB single MI event group.

In summary, our observations reveal distinct patterns between patients experiencing single AF events and those with recurrent AF events. Notably, the single AF event group tends to be older at their initial event and exhibits worse health conditions and lifestyle factors compared to the recurrent AF group. Based on these findings, we hypothesize that single AF events may be primarily influenced by age and lifestyle factors, whereas recurrent AF events may be driven by genetic factors. The characteristics related to MI point in the opposite direction, since patients experiencing recurrent MI events are older and generally exhibit worse health conditions and lifestyle factors compared to those with only one MI event (and survive it). For MI, we therefore consider two alternative hypotheses: (i) recurrent MI events are associated with the age at the first event and worsened health conditions and single MI events are driven by genetic factors, or (ii) both single and recurrent MI events share common genetic factors, but recurring MI events are influenced by higher age and other lifestyle factors, affecting the risk of subsequent MI occurrences.

3.2 Genetic differences

In the following sections, we explore our hypotheses (as defined above) for AF and MI by investigating genetic differences between the groups identified through the GWAS meta-analyses.

3.2.1 Genetic differences in AF

To test our hypothesis that patients experiencing recurrent AF events are more genetically susceptible than patients experiencing single AF events, we perform three GWAS meta-analyses (see Methods). The GWAS meta-analysis comparing single to recurrent AF events found no regions with significantly different effects. Some SNPs were identified to be of genome-wide significance in the HUNT population, but these were rare variants (MAF 0.2%), and we removed them through filtering prior to the meta-analysis. Comparing the GWAS meta-analyses of each group against healthy controls (Table 2 and Figure 1), we find that 18 regions are specifically associated with recurrent AF events: 2 are specifically associated with single AF events and 16 are identified in both GWAS investigations. Many regions comprise multiple SNPs that exhibit significant effects in only one of the study groups. Five regions identified in the recurrent AF GWAS study consist of only one SNP, yet these are identified with similarly strong effects in both the HUNT and the UKBB studies, indicating a genuine association. Regional plots of the single SNP hits uniquely associated with recurrent AF are shown in Supplementary File 1, Figures S15–S19. The presented results show that more than half of the identified regions are specifically linked to single or recurrent AF, supporting the hypothesis that patients who have experienced recurrent AF events are genetically more susceptible than those who have only experienced one event and survived it.

Table 2
www.frontiersin.org

Table 2. AF variants found to be significant in the GWAS meta-analysis.

Figure 1
www.frontiersin.org

Figure 1. GWAS meta-analysis results for AF. Top: Comparing recurrent AF patients to AF-free controls. Bottom: Comparing single AF patients to AF-free controls. Blue spikes represent regions of SNPs found to be statistically significant in both GWAS studies (common), while magenta spikes represent statistically significant regions of SNPs specifically associated with the given AF group.

All regions had previously been associated with AF, and all regions except one (chromosome 7, in the KCNH2 gene) were identified in the full AF GWAS meta-analysis by comparing all AF cases against healthy controls. This indicates that all the regions identified as unique for single or recurrent AF (excluding the KCNH2 gene region) have an effect when compared to healthy controls, with the true effect being mainly or solely for patients experiencing single or recurrent AF. The five SNPs in the KCNH2 gene, however, are not detected in our full GWAS meta-analysis and therefore only show an effect for patients experiencing recurrent AF.

To our knowledge, only seven genes have previously been found to be associated with AF recurrence: SOX5, CAV1, EPHX2, ITGA9, SLC8A1, TBX5, and PITX2 (1, 8, 10, 37). Our findings show that regions proximate to the SOX5, CAV1, TBX5, and PITX2 genes are identified in both the single and the recurrent GWAS, yielding comparable effects. Thus, there is no evidence of differences in the impact of these regions between the two groups. Furthermore, no regions were identified near the EPHX2, SCL8A1, and ITGA9 genes. Variants near the NAV2 and SCN10A genes have previously been tested for their effect in recurrent AF events without any significant findings (37, 38). In this study, we discovered 26 and 16 SNPs located within and nearby the NAV2 and SCN10A genes, respectively, that are exclusively associated with recurrent AF, suggesting that these SNPs have a distinct effect on recurrent AF patients compared to single AF cases.

Several of the genes listed in Table 2 code for functions related to AF. Two of the genes listed as “Common” (KCNN3 and HCN4) and three genes identified uniquely for recurrent AF (SCN10A, KCNH2, and KCNJ5) are related to electrophysiological activity, coding for potassium and sodium channels. Other genes listed as “Common” in Table 2 code for functions directly linked to heart activity and AF (TTN, TBX5, SYNE2, and RPL3L) , or they are indirectly linked to AF through comorbidities (ATXN1, CAV1, SH3PXD2A, and ZFHX3). Two of the recurrent AF genes also code for functions directly or indirectly linked to AF (CASQ2 and GOSR2), and some genes indicate a possible indirect link related to comorbidities, e.g., hypertension or malignancy (PPFIA4, USP34, WIPF1, SPATS2L, CAND2, and AOPEP). The two genes uniquely identified for single AF events have been shown important for myocardial diseases and cardiac abnormalities, coding for functions found to be central in malformation in heart (NKX2-5) and myosin (MYH7).

The genetic observed scale heritability was found to be 0.0139 (SE 0.0024) for recurrent AF and 0.0086 (SE 0.0018) for single AF events.

3.2.2 Genetic differences in MI

Based on the characteristics of the two MI groups, we formulated two hypotheses: (i) Recurrent MI events are associated with the age at the first event and worsened health conditions and single MI events are driven by genetic factors, or (ii) Both single and recurrent MI events share common genetic factors, but recurring MI events are influenced by higher age and other lifestyle factors, affecting the risk of subsequent MI occurrences. Testing for direct genetic differences between the two MI groups, the GWAS meta-analysis (comparing single to recurrent events) did not detect any regions with significant effects. When testing for genetic effects in each group as compared to MI-free controls, the GWAS meta-analyses shown in Figure 2 and Table 3 identified four regions that are in common for both groups, 24 regions that are specifically identified for the single event group, and two regions that are unique for the recurrent events group. Hence, some genetic factors are common for both groups, but most identified genetic effects are unique to patients experiencing only one event of MI and surviving it. These results are in support of our first hypothesis.

Figure 2
www.frontiersin.org

Figure 2. GWAS meta-analysis results for MI. Top: Comparing recurrent MI patients to MI-free controls. Bottom: Comparing single MI patients to MI-free controls. Blue spikes represent regions of SNPs found to be statistically significant in both GWAS studies (common), while magenta spikes represent statistically significant regions of SNPs that are specifically associated with the given MI group.

Table 3
www.frontiersin.org

Table 3. MI variants that are significant in the GWAS meta-analysis.

Some distinct regions, including the SNPs in the NBEAL1 and ATXN2 genes for single MI events and SNPs in the MIA3 gene for recurrent events, exhibit substantial effects for multiple SNPs in the region, with comparable effects in both the HUNT and UKBB populations. Several regions represent suggestive findings comprising only single SNPs and are only identified in the HUNT population (regional plots of the single SNP hits uniquely associated with single or recurrent MI are shown in Supplementary File 1, Figures S20–S39). However, as shown in Supplementary File 4, these variants are not HUNT-specific since they are reported with relatively high frequencies in the general European population. Hence, although they are rare in the UKBB population and thereby not included in the meta-analysis, including a different European study population could validate or dispute the effect identified here. Also, many of these regions are well-known for MI, further suggesting that these findings might be valid.

Comparing the GWAS meta-analysis of all MI cases to MI-free controls, we find that all four regions that were identified as common for single and recurrent MI (regions in or close to the genes HPCAL1, LPA, CDKN2A, and CXADR) were also found in the full MI GWAS meta-analysis. The two regions that were specifically associated with recurrent MI events (regions close to the MIA3 and NOVA1 genes) were also identified in the full MI GWAS, but nine of the regions specifically associated with single MI were not detected in the full MI GWAS (regions in or close to the genes OVAAL, TAF1B, GLI2, BBS9, MCPH1, GLIS3, HECTD4, INSR, and SYNDIG1). Hence, certain regions identified in the full MI GWAS are exclusively linked to either single or recurrent MI, and some regions are only observed when patients with single MI events are filtered out, emphasizing the need for sub-dividing the MI groups.

We notice that 24 regions are specifically associated with a single MI. Among these, 10 regions, proximal to or within the genes OVAAL, BMP3, RIOK1, AC096553.5, MCPH1, KCNU1, GLIS3, TUT7, TRIB3, and SYNDIG1, represent novel associations with MI and have not been previously linked to Cardiovascular disease (CVD)-related traits. These regions are predominantly characterized by a single SNP, with the exception of five SNPs in proximity to the RIOK1 gene. These SNPs are only identified within the HUNT population, barring the SNP near the KCNU1 gene. Interestingly, some of these regions encode for functions similar to those of genes previously associated with MI. Three of these genes, namely, OVAAL, RIOK1, and TUT7, are commonly associated with malignancy, akin to NBEAL1, where we identified a known MI region comprising 254 SNPs uniquely associated with single MI. Other genes encode proteins involved in calcium handling (BMP3 and SYNDIG1) or are associated with diabetes mellitus (GLIS3 and TRIB3), suggesting a potential link to accelerated atherosclerosis development. Similarly, both ATXN2 and HECTD4 are associated with diabetes mellitus, and we identified known MI regions uniquely associated with single MI in these genes. Two regions exclusively linked to recurrent MI events were identified, both exhibiting negative effects in the HUNT and the UKBB population. The region near the MIA3 gene has been previously associated with MI, while the single SNP near the NOVA1 gene, which may also be related to malignancy, represents a novel finding. Collectively, these findings underscore the potential relevance of these genes to MI. Further investigations are warranted to ascertain if these effects are replicable in other European and non-European populations and to determine the specific links of these SNPs/genes to MI, particularly in relation to single or recurrent MI events.

The observed genetic scale heritability was found to be 0.0051 (SE 0.0011) for single MI and 0.0039 (SE 0.0011) for recurrent MI.

3.3 Identification of additional phenotypes affected by SNPs through PheWAS

To delve deeper into the genetic distinctions observed between single and recurrent AF and MI, we conducted a PheWAS analysis. This enabled us to pinpoint other phenotypes associated with the same set of SNPs designated as either common or unique for single and recurrent AF and MI.

Our PheWAS investigation of the SNPs identified as common for both single and recurrent AF revealed a total of 1,903 SNPs linked with 236 phenocodes (shown in Figure 3 and Supplementary File 2). Not surprisingly, the strongest associations were found for Atrial fibrillation and flutter and Cardiac dysrhythmia (p-value 10400 and 10220). Furthermore, we identified robust associations with phenocodes related to Appendiceal conditions and Coagulation defects. Notably, the circulatory system category emerged as the predominant category, encompassing 54 phenocodes. This includes, but is not limited to, conditions such as Phlebitis and thrombophlebitis, Sinoatrial node dysfunction (Bradycardia), Heart failure, and Hypertension.

Figure 3
www.frontiersin.org

Figure 3. Phenocodes associated with each set of SNPs found for both single and recurrent AF/MI or uniquely for one of them. The x-axis shows each of the 1,326 phenocodes sorted by phenocode category, and the y-axis shows the lowest p-value for the association between the phenocode and the SNPs in the set. The dotted line shows the threshold for significant associates, which vary according to the number of SNPs in each set. (A) A set of 1,903 SNPs found in common for both single and recurrent AF. (B) A set of 245 SNPs found in common for both single and recurrent MI. (C) A set of 33 SNPs found uniquely for single AF events. (D) A set of 299 SNPs found uniquely for single MI events. (E) A set of 286 SNPs found uniquely for recurrent AF events. (F) A set of six SNPs found uniquely for recurrent MI events.

Intriguingly, the two identified regions specifically associated with single AF consist of 33 SNPs that exhibit significant association with 44 phenocodes (see Supplementary File 2). Not surprisingly, the strongest associations for these SNPs pertain to the phenocodes Atrial fibrillation and flutter and Cardiac dysrhythmias, the remaining 42 phenocodes span a diverse array of phenocode categories. These include not only Migraine and Large cell lymphoma but also conditions such as Arrhythmia (cardiac) NOS, Paroxysmal supraventricular tachycardia, and Cerebral atherosclerosis.

In contrast, the 18 regions specifically associated with recurrent AF events consisting of 286 SNPs show a significant association with 91 phenocodes (see Supplementary File 2), and a majority of the strong associations pertain to phenocodes of the circulatory system category. Again, the phenocode with the most potent associations are Atrial fibrillation and flutter and Cardiac dysrhythmias. In addition, these SNPs also display significant associations with Asthma and 27 phenocodes from the circulatory system, including conditions such as Hypertension, Atrioventricular block, Cardiomyopathy, Heart failure, Ischemic heart disease, Cardiac arrest, and Palpitations. Collectively, these results underscore genetic susceptibility disparities between patients experiencing single vs. recurrent AF events. In particular, SNPs specifically tied to recurrent AF are linked to a broad range of phenocodes related to the heart and circulatory system, in contrast to SNPs exclusively linked to single AF events.

Regarding MI, we identified four regions associated with both single and recurrent MI, comprising 245 SNPs that exhibit significant associations to 144 phenocodes (see Supplementary File 2). The most prominent associations are observed with Ischemic heart disease and Hyperlipidemia disorders. In addition, numerous diseases within the circulatory system category, such as Non-rheumatic aortic valve disorders, Peripheral vascular disease, Stricture of artery, Hypertension, and Heart valve disorders, are also strongly associated.

The 24 regions specifically identified for single MI events consist of 299 SNPs that are associated with 128 phenocodes (see Supplementary File 2). These include Ischemic heart disease, Hypertension, and diseases of Hyperlipidemia. In addition, there are strong associations with neurodegenerative disorders, such as Dementia, Alzheimer’s, and Delirium. These SNPs are furthermore linked with 33 phenocodes from the circulatory system category, highlighting conditions such as Cerebral ischemia, Cardiac conduction disorders, Heart failure, Aortic valve disease, and Pulmonary heart disease.

Notably, the two regions consisting of six SNPs specifically identified for recurrent MI were associated with a mere 16 phenocodes (see Supplementary File 2). While these included Ischemic heart disease, Heart failure, Cardiac conduction disorders, and diseases of Hyperlipidemia, they lacked the other 27 circulatory system disorders identified for the single MI SNPs. Once again, these findings emphasize the genetic differences between patients experiencing single and recurrent MI. SNPs specifically associated with single MI events appear to be associated with a broader and more diverse range of cardiovascular disorders compared to those solely linked to recurrent MI.

3.4 Gene sets and co-expression network neighborhood

In our final analysis, we leverage multiple sets of gene expression data from the GTEx consortium (32) measured in tissue sub-types taken from the heart, muscle, skeletal, artery, and kidney to generate gene co-expression networks (see Methods for details). Here, our expectation is that highly correlated genes have a regulatory relationship or similar response in a condition (31). Thus, this approach should uncover genes that display an expression profile that most closely links to the set of target genes found through our GWAS analyses, and we investigate their functions.

3.4.1 AF-associated genes in co-expression networks

Differential gene expression analysis of the 18 genes identified in recurrent AF (listed as Recurrent in Table 2) reveals a significant upregulation of these genes in atrial appendage tissues from the heart. Furthermore, elevated expression levels are discerned in left ventricular heart, artery tibial, and skeletal muscle tissues (see Supplementary File 1, Figure S9). Gene ontology analysis indicates that this set of genes is significantly enriched for cell–cell signaling involved in cardiac conduction (fold enrichment (FE) >100, FDR =1.17×102), cardiac muscle cell action potential (FE =70.03, FDR =3.15×102), and regulation of heart rate (FE =43.99, FDR =1.62×102).

Following the co-expression analysis approach detailed in the Methods section, we find that 16 of the 18 recurrent AF genes show strong co-expression with other genes in heart, artery, kidney, and skeletal muscle tissues. Selecting the top 25 genes with the strongest connection to each of the 16 target genes, Figure 4A shows that all of the 16 target genes are connected through 82 shared neighboring genes (see Supplementary File 3), i.e., the 82 shared genes are among the top 25 strongest connections for two or more of the target genes. These 82 neighboring genes are significantly enriched for a variety of biological processes, including acetyl-CoA biosynthetic process from pyruvate (FE >100, FDR =4.41×103), tricarboxylic acid cycle (FE =54.70, FDR =1.38×106), NLS-bearing protein import into nucleus (FE =50.99, FDR =1.93×103), inner mitochondrial membrane organization (FE =32.73, FDR =9.49×104), respiratory electron transport chain (FE =10.72, FDR =4.32×102), regulation of proteasomal protein catabolic process (FE =7.49, FDR =4.94×102), proteasome-mediated ubiquitin-dependent protein catabolic process (FE =5.58, FDR =3.79×102), and regulation of cellular catabolic process (FE =4.01, FDR =9.85×103).

Figure 4
www.frontiersin.org

Figure 4. Networks showing the strongest shared neighborhood of co-expressed genes for the GWAS (target) genes associated with (A) recurrent AF uniquely and (B) both single and recurrent AF. Pink diamond nodes represent the target genes and blue circular nodes represent the neighboring genes. The sizes of the blue nodes are scaled according to their number of nearest neighbors in the network.

Focusing on the two genes specific to single AF events (listed as Single in Table 2), our analysis reveals that these genes show significant upregulation in left ventricle tissues of the heart and also high expression levels for atrial appendage tissues of the heart (see Supplementary File 1, Figure S10). Gene ontology analysis confirms that these genes are closely linked to adult heart development (FE >100, FDR =7.77×103), ventricular cardiac muscle tissue morphogenesis (FE >100, FDR =4.17×102), myofibril assembly (FE >100, FDR =2.98×102), cardiac muscle contraction (FE >100, FDR =2.47×102), and regulation of striated muscle contraction (FE >100, FDR = 2.87×102). Thus, although both target genes exhibit the specified enriched functions, an egocentric network analysis reveals that they do not share mutual genes with strong co-expression across the heart, artery, kidney, and skeletal muscle tissues. Therefore, while they may have functional overlap, the co-expressing gene partners diverge for each target gene.

In our comparative analysis of gene sets uniquely associated with either single or recurrent AF, we also evaluated genes that were consistent across both AF categories. Among the 16 genes identified for both AF groups (labeled as Common in Table 2), there was a significant upregulation in tissues of the heart’s left ventricle, atrial appendage, and skeletal muscles (see Supplementary File 1, Figure S11). These genes did not show any enrichment in specific gene ontology categories. Upon conducting an egocentric network analysis, we found that 14 out of these 16 genes displayed strong co-expression patterns with genes differentially expressed in the heart, arteries, kidneys, and skeletal muscles. Moreover, 13 of these genes were interconnected via 61 shared neighbor genes (see Figure 4B and Supplementary File 3). The 61 genes found in common for the 13 target genes show significant enrichment for fumarate metabolic process (FE >100, FDR =1.83×102), regulation of atrial cardiac muscle cell action potential (FE >100, FDR =2.51×102), regulation of mitochondrial RNA catabolic process (FE >100, FDR =3.36×102), mitochondrial acetyl-CoA biosynthetic process from pyruvate (FE >100, FDR = 3.32×102), succinyl-CoA catabolic process (FE >100, FDR =4.10×102), tricarboxylic acid cycle (FE =73.80, FDR =5.34×108), branched-chain amino acid catabolic process (FE =46.69, FDR =1.11×102), inner mitochondrial membrane organization (FE =35.33, FDR =2.16×103), aerobic electron transport chain (FE =15.20, FDR =2.98×102), mitochondrial ATP synthesis coupled electron transport (FE =14.37, FDR =3.50×102), and alpha-amino acid catabolic process (FE =14.21, FDR =3.53×102).

Thus, the gene sets identified as common and unique for single and recurrent AF events are upregulated in atrial appendage tissues of the heart, left ventricular tissues of the heart, and skeletal muscle tissues, where the gene set identified for recurrent AF and single AF events show significant upregulation in atrial appendage and left ventricular tissues of the heart, respectively, while the gene set identified for both single and recurrent AF shows significant upregulation of all three tissues. While both gene sets specifically identified for single and recurrent AF events show enrichment for functions related to heart and cardiac muscle, the recurrent AF gene set shows interesting enrichment of functions related to regulation of heart rate and cardiac muscle cell action potential. Collectively, these findings underscore the distinct genetic underpinnings between patients experiencing single vs. recurrent AF events, with the recurrent AF genes revealing more intricate processes.

3.4.2 Network MI

The 24 nearest genes to the regions specifically identified for single MI events (listed as Single in Table 3) show no significant enrichment for any gene ontology terms. Differential gene expression analysis shows no significant up- or downregulation of these genes in any tissue, but significant expression levels in the tibial nerve and high expression levels in the tibial artery and aorta artery tissues (see Supplementary File 1, Figure S12).

When inspecting gene co-expression in heart, artery, kidney, and skeletal muscle tissues, we find that 21 of these genes show high co-expression with other genes in these tissues. Again, creating egocentric networks for each of these 21 target genes, we find that all 21 target genes cluster in a giant component, where 110 genes are connected to two or more of the target genes. The network depicted in Figure 5A illustrates genes that are interconnected with multiple single MI target genes. Notably, there is a dense cluster in the upper left portion of the figure, dominated by GLI2, GLIS3, TRIB3, and SFRP1, all of which are interconnected and share a majority of their neighborhood genes. The known associations of both GLI2 and SFRP1 with CVD-related traits, coupled with the detection of TRIB3 and SFRP1 in the comprehensive MI GWAS, suggest potentially shared functional roles of these four genes. Furthermore, they also exhibit significant interconnections with BBS9, HECTD4, ATXN2, and SYNDIG1. It is worth noting that HECTD4 and ATXN2 have recognized associations with MI.

Figure 5
www.frontiersin.org

Figure 5. Networks showing the strongest shared neighborhood of co-expressed genes for the GWAS (target) genes associated with (A) single MI uniquely, (B) recurrent MI uniquely and (C) both single and recurrent MI. Pink diamond nodes represent the target genes and blue circular nodes represent the neighboring genes. The sizes of the blue nodes are scaled according to their number of nearest neighbors in the network.

Gene ontology analysis of the 110 genes shared between two or more target genes (see Supplementary File 3) show significant enrichment for positive regulation of several functional categories, more specifically establishment of protein localization to telomere (FE =54.18, FDR =1.95×102), positive regulation of protein localization to the Cajal body (FE =49.26, FDR =2.28×102), NLS-bearing protein import into nucleus (FE =38.02, FDR =3.85×103), positive regulation of telomerase RNA localization to the Cajal body (FE =36.12, FDR =4.17×102), cristae formation (FE =36.12,4.00×102), tricarboxylic acid cycle (FE =29.13, FDR =1.59×103), regulation of proteasomal protein catabolic process (FE =8.38, FDR =1.63×103), regulation of ubiquitin-dependent protein catabolic process (FE =7.35, FDR =2.27×102), proteasome-mediated ubiquitin-dependent protein catabolic process (FE =4.68, FDR =4.76×102), purine-containing compound metabolic process (FE =4.21, FDR =4.69×102), nucleotide metabolic process (FE =3.84, FDR =4.69×102), and cellular component biogenesis (FE =2.06, FDR =3.57×102).

Upon analyzing the two genes uniquely associated with recurrent MI (listed as Recurrent in Table 3), we observe neither significant enrichment in gene ontology terms nor any distinctive expression patterns across tissue types (see Supplementary File 1, Figure 13). In constructing egocentric networks for these genes, we identify seven overlapping genes (see Supplementary File 3) among the top 25 strongest correlations for each gene, as depicted in Figure 5B. While four of these seven neighboring genes (TOMM70, FASTKD2, MMADHC, and OPA1) are related to mitochondrial function and energy production, gene ontology analysis yielded no significant enrichment.

Investigating the genes that are common for both MI groups (listed as Common in Table 3), we find no significant differential co-expression in any of the tissues (see Supplementary File 1, Figure S14). No gene ontology terms were found to be significantly enriched for these four genes. The egocentric networks (Figure 5C) revealed that 3 of them are linked through 18 shared neighborhood genes (see Supplementary File 3). The 18 genes connecting the 3 target genes show significant enrichment for mitochondrial acetyl-CoA biosynthetic process from pyruvate (FE >100, FDR =5.92×103), tricarboxylic acid cycle (FE >100, FDR =5.37×107), negative regulation of release of cytochrome c from mitochondria (FE >100, FDR =4.88×102), cell redox homeostasis (FE =81.70, FDR =3.38×103), aerobic electron transport chain (FE =39.90, FDR =2.07×102), mitochondrial ATP synthesis coupled electron transport (FE =37.71, FDR =2.38×102), reactive oxygen species metabolic process (FE =32.68, FDR =3.25×102), and mitochondrion organization (FE =12.85, FDR =1.15×102).

In summary, MI gene sets do not exhibit notable tissue specificity or functional enrichment. Yet, the gene sets linked to both common and single MI incidents share multiple genes within their co-expression network neighborhoods. These shared genes exhibit functional enrichment for several pertinent processes. At the same time, the shared gene neighborhood specifically related to recurrent MI does not present any functional enrichment. However, some of the genes are related to mitochondrial function and energy production, similar to several of the enriched functions for the shared neighboring genes for common MI genes. Notably, the neighboring genes of the single MI genes show significant enrichment for several functions not identified for the recurrent of common neighboring gene sets. Only one of the biological processes identified in the neighborhood of single MI genes is also seen in the neighborhood of common genes. This suggests unique functions associated with genes specific to single MI incidents, shedding light on potential reasons why certain patients experience only one MI event.

4 Discussion

This study demonstrates distinct clinical characteristics and genetic predispositions between patients who experience a single AF/MI event and those with recurrent events. To ensure that the single AF and single MI patient groups represent relapse-free patients, we have applied a data filtering procedure to ensure that the ones who only have recorded single events were alive for at least 5 years for AF and 7 years for MI after the episode and before the study either ended or the patient died.

Single AF incidents appear more influenced by lifestyle factors and age, with only two unique genetic regions identified. In contrast, patients with recurrent AF are typically younger at their first event and exhibit a stronger genetic basis, as evidenced by 18 unique genetic regions linked to recurrent AF. Among these, two regions are near the NAV2 and SCN10A genes, previously hypothesized, but unconfirmed (37, 38) (N=42,585 East Asian population, N=660 German population), to affect recurrent AF. The region with the lowest p-values and also the largest amount of significant SNPs was found near the PITX2 gene for both the single and the recurrent AF groups. Though PITX2 is the most known gene associated with recurrent AF (1, 7, 10) (N=295 Turkish population, N=195 Caucasian population, N=991 German and American population), our results do not indicate such an effect. This observation extends to six other genes (SOX5, CAV1, EPHX2, ITGA9, SLC8A1, and TBX5) (8, 10, 37) (N=660 German population, N=295 Turkish population, N=42,585 East Asian population, N=486 Caucasian population), where variants near the SOX5, CAV1, and TBX5 genes are identified in both the single and recurrent AF groups, and no variants near the EPHX2, ITGA9, and SLC8A1 genes are identified in any of the groups. PheWAS analysis of these unique single and recurrent AF SNPs further highlight distinct susceptibilities: SNPs associated with single AF correlate with 44 phenocodes across various categories, whereas recurrent AF SNPs are linked to 91 phenocodes, predominantly in the circulatory system category. Network analysis of gene sets revealed that 16 of the 18 genes associated with recurrent AF are connected through 82 shared, highly co-expressed neighboring genes. These recurrent AF genes, along with their neighboring genes, are involved in complex processes related to heart rate regulation and cardiac muscle cell action potential. In contrast, the two genes associated with single AF events are linked to heart and cardiac muscle processes but do not share highly co-expressed genes.

We also find distinct clinical and genetic differences between patients with single and recurrent MI. Unlike AF, recurrent MI is more associated with older age at the first MI event, lifestyle factors, and age-related issues. This is despite the fact that the single MI group is adjusted for early death due to MI and/or related comorbidities, and thus, this should not affect the results. The genetic predisposition seems stronger in single MI cases, with a total of 24 uniquely associated regions. In contrast, recurrent MI is only associated with two regions that did not share the association with single MI cases. Of the 24 genetic regions uniquely identified for single MI, 14 are previously known for MI or other CVD-related traits. The remaining 10 are novel for MI and have previously not been reported for other CVD-related traits. While most of these novel regions consist of single SNPs primarily identified in the HUNT population, their nearest genes code for functions similar to known MI regions. Looking into the allele frequencies in each population (Supplementary File 4), we see that these variants are rather common in HUNT and in the general European population [based on reports from gnomAD (39)], while they are rare variants in the UKBB population and thereby not included in the meta-analysis. Further studies are needed to investigate if these variants show similar effects in other European and non-European populations.

PheWAS analysis reinforces the genetic distinction between single and recurrent MI groups. The single MI group’s SNPs are linked to 128 phenotypes, predominantly in the circulatory system category, with additional associations in the endocrine/metabolic category and notable links to neurodegenerative disorders. In contrast, the SNPs related to recurrent MI correlate with 16 phenocodes, involving both circulatory and endocrine/metabolic categories, but the associations are not as pronounced as those in the single MI group. Our network analysis reveals distinct gene interactions for each group. Of the 24 genes uniquely associated with single MI, 21 share connections with 110 genes in the co-expression network. However, the two genes associated with recurrent MI have seven highly co-expressed genes. This indicates more extensive genetic interconnections in single MI cases. Interestingly, the shared neighboring genes for single MI, and those common to both single and recurrent MI, show functional enrichment in several biological processes. In contrast, the shared neighboring genes for recurrent MI do not show significant functional enrichment. This disparity suggests distinct biological pathways involved in single vs. recurrent MI events. Moreover, only one function is common between the shared genes for both single and recurrent MI, suggesting unique biological mechanisms specific to single MI events.

The results suggest a greater genetic influence in AF compared to MI, but several factors could affect this perception. Clinically, it is expected that more genes would increase the risk of recurrent AF, a pattern observed in this study. In contrast, the findings for MI are the opposite, potentially influenced by their higher age: MI becomes less common in younger individuals, and in older populations, comorbidities often overshadow genetic risk factors. There could also be physiological reasons behind these observations. For instance, MI in younger individuals might more frequently result from genetic factors related to platelet aggregation or atherosclerosis, conditions that are generally more responsive to treatment. In older individuals, MI might be more associated with broader age-related issues, reducing the relative impact of genetics. The reporting of these conditions could also influence the results, with AF potentially being under-reported compared to MI, which itself is possibly over-reported. This discrepancy could explain why phenocodes related to neurodegenerative disorders and cerebral ischemia emerge as significant only in single MI cases in the PheWAS results. These findings might be influenced by diagnostic practices where MI is often recorded as a cause of death, even when other diseases are the actual cause, or in cases where individuals die before a recurrent event occurs. The latter has been adjusted for by excluding individuals with single events with less than 7 years between the event and the censoring date, being either death or the end of the study. Still, even with this adjustment, some single MI event cases might be censored out before a second event, possibly influencing the case groups and thereby the results.

The observed differences in clinical characteristics for both diseases might be even more pronounced if the HUNT and UKBB studies had used similar questionnaire formats. In the HUNT study, participants involved in both HUNT2 and HUNT3 provided cholesterol and blood pressure measurements taken 11 years apart. We opted to use the measurements recorded closest to the first AF or MI event, as we believe these offer the most relevant information. However, for the UKBB participants, most were measured only once, eliminating the option to choose measurements closest to the event. This discrepancy in data collection methodology could explain why significant differences in cholesterol and blood pressure measurements are observed between single and recurrent event groups in the HUNT population but not in the UKBB group. The lack of longitudinal data in the UKBB may obscure potential differences that are more apparent in the HUNT study due to its repeated measurements.

In any GWAS study, larger sample sizes, diverse ancestry representation, and result replication are crucial. The HUNT study, comprised solely of individuals of European ancestry, led us to select only European ancestry participants from the UKBB for consistency. However, for global applicability, conducting similar analyses across all ancestries is essential. Regarding sample sizes, while the combined participant pool of HUNT and UKBB exceeds 500,000, the specific filtering and subgrouping in our study result in some case/control groups having fewer than 2,000 individuals. This reduced size may limit our ability to achieve robust significant findings. The small sample sizes might explain the prevalence of significant singleton SNP hits (one SNP per region) found only in the HUNT study, with the same SNPs with too rare allele frequencies in the UKBB population. Increasing sample sizes or including an additional population would likely enhance the reliability of these findings, particularly since these variants are reported as common in the general European population. While increasing the sample sizes could verify/dispute or even identify additional regions, some genetic differences observed between recurrent and single cases might diminish. Although many spikes in the Manhattan plots (Figures 1 and 2) indicate clear differences between single and recurrent cases, certain regions almost reach significance for the opposite group, such as the hit on chromosome 8 for recurrent AF. This suggests that some observed genetic distinctions might be less pronounced with a more substantial and diverse sample.

While our meta-analysis did not yield any GWAS significant results when directly comparing recurrent to single AF or MI cases, the data still indicate genetic differences between these groups. Notably, some genetic regions were identified in both single and recurrent groups (termed “common”), while a substantial number were unique to each group. This could suggest that the common regions may play a role in the general susceptibility to AF and MI, whereas the unique regions could confer specific genetic risks for the disease’s form (as a single or recurrent events). Further studies are needed to test the hypotheses generated from this study: Investigate the effect of the novel SNPs and genes identified and their involvement in either single or recurrent AF/MI in particular. While many of the identified genes and their related function might not have a direct confirmed effect on AF/MI, we do see similar functions in the novel genes as in the known AF/MI genes, thus showing the potential for identification and generating an understanding of new genetic functions of the diseases.

There has been considerable research aimed at uncovering genetic causes for AF and MI. Our findings underscore the importance of genetic studies focused on disease subgroups, as conducted here. Both AF and MI are widespread diseases with varied impacts on individuals’ lives. The progression and outcomes of these diseases are not uniform across all patients. By examining subgroups within these diseases, we could gain new insights into their mechanisms, potentially leading to more effective prevention and treatment strategies tailored to different patient profiles.

A limitation of using the LD Score Regression software to calculate the observed scale heritability is that it is not well suited for mixed models. The GWAS analysis was performed using SAIGE, where we assume a logistic mixed model to account for imbalanced case–control ratios and relatedness. Hence, the precision of the heritability calculated here is limited due to this fact.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by the Norwegian Data Inspectorate and The Regional Ethics Committee for Medical Research in Norway (REK: 2014/144). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

MH: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. AS: Conceptualization, Investigation, Methodology, Supervision, Writing – original draft, Writing – review & editing. IS: Conceptualization, Writing – review & editing. HD: Writing – review & editing. EA: Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing.

Funding

The authors declare financial support was received for the research, authorship, and/or publication of this article.

MH and EA acknowledge the K.G. Jebsen Foundation for Grant SKGJ-MED-015.

Acknowledgments

The Trøndelag Health Study (HUNT) is a collaboration between HUNT Research Centre (Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology NTNU), Trøndelag County Council, Central Norway Regional Health Authority, and the Norwegian Institute of Public Health. The genetic investigations of the HUNT Study are a collaboration between researchers from the K.G. Jebsen Center for Genetic Epidemiology and University of Michigan Medical School and the University of Michigan School of Public Health. The K.G. Jebsen Center for Genetic Epidemiology is supported by Stiftelsen Kristian Gerhard Jebsen, Faculty of Medicine and Health Sciences, NTNU, Norway. The authors would like to thank clinicians and other employees at Nord-Trøndelag Hospital Trust for their support and for contributing to data collection in this research project. The authors would also like to thank all participants who have contributed in the UKBB study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2024.1372107/full#supplementary-material

Supplementary File 2

Excel file containing all phenocodes found to be significantly associated with the set of SNPs associated with both single and recurrent AF/MI or uniquely for one of them.

Supplementary File 3

Excel file containing genes shared between the GWAS associated genes (target genes) in the network analysis for genes associated with both single and recurrent AF/MI or uniquely for one of them.

Supplementary File 4

Excel file containing allele frequency of SNPs identified in only one of the populations (listed as for instance −? in Tables 2, 3). The columns named ‘AF HUNT’, ‘AF UKBB’ and ‘AF European’ shows the allele frequency in the HUNT population, the UKBB population and as reported in gnomAD (39)(https://gnomad.broadinstitute.org/), respectively. The ‘HUNT/UKBB’ columns shows if the SNP is identified in HUNT or UKBB, and the ‘imputation info’ columns shows the imputation info given in the genotyped UKBB files.

Supplementary File 5

Excel file containing the data from Tables 2, 3.

References

1. Shoemaker MB, Bollmann A, Lubitz SA, Ueberham L, Saini H, Montgomery J, et al. Common genetic variants and response to atrial fibrillation ablation. Circ Arrhythm Electrophysiol. (2015) 8:296–302. doi: 10.1161/CIRCEP.114.001909

PubMed Abstract | Crossref Full Text | Google Scholar

2. Official Statistics of Sweden Health and Medical Care. Statistics on myocardial infarctions 2020. Technical Report (2021).

Google Scholar

3. Nielsen JB, Thorolfsdottir RB, Fritsche LG, Zhou W, Skov MW, Graham SE, et al. Biobank-driven genomic discovery yields new insight into atrial fibrillation biology. Nat Genet. (2018) 50:1234–9. doi: 10.1038/s41588-018-0171-3

PubMed Abstract | Crossref Full Text | Google Scholar

4. Christophersen IE, Rienstra M, Roselli C, Yin X, Geelhoed B, Barnard J, et al. Large-scale analyses of common and rare variants identify 12 new loci associated with atrial fibrillation. Nat Genet. (2017) 49:946–52. doi: 10.1038/ng.3843

PubMed Abstract | Crossref Full Text | Google Scholar

5. Nikpay M, Goel A, Won HH, Hall LM, Willenborg C, Kanoni S, et al. A comprehensive 1,000 genomes-based genome-wide association meta-analysis of coronary artery disease. Nat Genet. (2015) 47:1121–30. doi: 10.1038/ng.3396

PubMed Abstract | Crossref Full Text | Google Scholar

6. Hartiala JA, Han Y, Jia Q, Hilser JR, Huang P, Gukasyan J, et al. Genome-wide analysis identifies novel susceptibility loci for myocardial infarction. Eur Heart J. (2021) 42:919–33. doi: 10.1093/eurheartj/ehaa1040

PubMed Abstract | Crossref Full Text | Google Scholar

7. Husser D, Adams V, Piorkowski C, Hindricks G, Bollmann A. Chromosome 4q25 variants and atrial fibrillation recurrence after catheter ablation. J Am Coll Cardiol. (2010) 55:747–53. doi: 10.1016/j.jacc.2009.11.041

PubMed Abstract | Crossref Full Text | Google Scholar

8. Wutzler A, Kestler C, Perrot A, Loehr L, Huemer M, Parwani AS, et al. Variations in the human soluble epoxide hydrolase gene and recurrence of atrial fibrillation after catheter ablation. Int J Cardiol. (2013) 168:3647–51. doi: 10.1016/j.ijcard.2013.05.010

PubMed Abstract | Crossref Full Text | Google Scholar

9. Zhang X, Huang J, Li J, Lu Q, Huang Y, Lu D, et al. Association between TCF21 gene polymorphism with the incidence of paroxysmal atrial fibrillation and the efficacy of radiofrequency ablation for patients with paroxysmal atrial fibrillation. Int J Gen Med. (2022) 15:4975–83. doi: 10.2147/ijgm.s366956

PubMed Abstract | Crossref Full Text | Google Scholar

10. Ulus T, Dural M, Mese P, Yetmis F, Mert KU, Gorenek B, et al. Genetic variants associated with atrial fibrillation and long-term recurrence after catheter ablation for atrial fibrillation in Turkish patients. Anatol J Cardiol. (2021) 25:129–38. doi: 10.14744/AnatolJCardiol.2020.44082

PubMed Abstract | Crossref Full Text | Google Scholar

11. Verweij N, Eppinga RN, Hagemeijer Y, van der Harst P. Identification of 15 novel risk loci for coronary artery disease and genetic risk of recurrent events, atrial fibrillation and heart failure. Sci Rep. (2017) 7:2761. doi: 10.1038/s41598-017-03062-8

PubMed Abstract | Crossref Full Text | Google Scholar

12. Tragante V, Doevendans PA, Nathoe HM, van der Graaf Y, Spiering W, Algra A, et al. The impact of susceptibility loci for coronary artery disease on other vascular domains and recurrence risk. Eur Heart J. (2013) 34:2896–904. doi: 10.1093/eurheartj/eht222

PubMed Abstract | Crossref Full Text | Google Scholar

13. Wauters E, Carruthers KF, Buysschaert I, Dunbar DR, Peuteman G, Belmans A, et al. Influence of 23 coronary artery disease variants on recurrent myocardial infarction or cardiac death: the GRACE genetics study. Eur Heart J. (2013) 34:993–1001. doi: 10.1093/eurheartj/ehs389

PubMed Abstract | Crossref Full Text | Google Scholar

14. Åsvold BO, Langhammer A, Rehn TA, Kjelvik G, Grøntvedt TV, Sørgjerd EP, et al. Cohort profile update: the HUNT study, Norway. Int J Epidemiol. (2022) 52:e80–91. doi: 10.1093/ije/dyac095

Crossref Full Text | Google Scholar

15. Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. (2015) 12:e1001779. doi: 10.1371/journal.pmed.1001779

PubMed Abstract | Crossref Full Text | Google Scholar

16. Brumpton BM, Graham S, Surakka I, Skogholt AH, Løset M, Fritsche LG, et al. The HUNT study: a population-based cohort for genetic research. Cell Genom. (2022) 2:100193. doi: 10.1016/j.xgen.2022.100193

PubMed Abstract | Crossref Full Text | Google Scholar

17. Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK biobank resource with deep phenotyping and genomic data. Nature. (2018) 562:203–9. doi: 10.1038/s41586-018-0579-z

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zhao L, Zhang G, Wen Z, Huang C, Wu H, Xu J, et al. Common variants predict recurrence after nonfamilial atrial fibrillation ablation in Chinese Han population. Int J Cardiol. (2017) 227:360–6. doi: 10.1016/j.ijcard.2016.11.057

PubMed Abstract | Crossref Full Text | Google Scholar

19. McCarthy S, Das S, Kretzschmar W, Delaneau O, Wood AR, Teumer A, et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet. (2016) 48:1279–83. doi: 10.1038/ng.3643

PubMed Abstract | Crossref Full Text | Google Scholar

20. Zhou W, Nielsen JB, Fritsche LG, Dey R, Gabrielsen ME, Wolford BN, et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat Genet. (2018) 50:1335–41. doi: 10.1038/s41588-018-0184-y

PubMed Abstract | Crossref Full Text | Google Scholar

21. Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. (2010) 26:2190–1. doi: 10.1093/bioinformatics/btq340

PubMed Abstract | Crossref Full Text | Google Scholar

22. Watanabe K, Taskesen E, Van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. (2017) 8:1826. doi: 10.1038/s41467-017-01261-5

PubMed Abstract | Crossref Full Text | Google Scholar

23. Sollis E, Mosaku A, Abid A, Buniello A, Cerezo M, Gil L, et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. (2022) 51:D977–85. doi: 10.1093/nar/gkac1010

Crossref Full Text | Google Scholar

24. Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J, Patterson N, et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. (2015) 47:291–5. doi: 10.1038/ng.3211

PubMed Abstract | Crossref Full Text | Google Scholar

25. The 1000 Genomes Project Consortium. A map of human genome variation from population-scale sequencing. Nature. (2010) 467:1061–73. doi: 10.1038/nature09534

PubMed Abstract | Crossref Full Text | Google Scholar

26. PAN-UKB Team. (2020). Available from: https://pan.ukbb.broadinstitute.org (Accessed January 15, 2023).

27. Mi H, Muruganujan A, Casagrande JT, Thomas PD. Large-scale gene function analysis with the PANTHER classification system. Nat Protoc. (2013) 8:1551–66. doi: 10.1038/nprot.2013.092

PubMed Abstract | Crossref Full Text | Google Scholar

28. Stuart JM, Segal E, Koller D, Kim SK. A gene-coexpression network for global discovery of conserved genetic modules. Science. (2003) 302:249–55. doi: 10.1126/science.1087447

PubMed Abstract | Crossref Full Text | Google Scholar

29. Horvath S, Zhang B, Carlson M, Lu KV, Zhu S, Felciano RM, et al. Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target. Proc Natl Acad Sci U S A. (2006) 103:17402–7. doi: 10.1073/pnas.0608396103

PubMed Abstract | Crossref Full Text | Google Scholar

30. Nowick K, Gernat T, Almaas E, Stubbs L. Differences in human and chimpanzee gene expression patterns define an evolving network of transcription factors in brain. Proc Natl Acad Sci U S A. (2009) 106:22358–63. doi: 10.1073/pnas.0911376106

PubMed Abstract | Crossref Full Text | Google Scholar

31. van Dam S, Võsa U, van der Graaf A, Franke L, de Magalhães JP. Gene co-expression analysis for functional classification and gene-disease predictions. Brief Bioinf. (2018) 19:575–92. doi: 10.1093/bib/bbw139

Crossref Full Text | Google Scholar

32. Aguet F, Anand S, Ardlie KG, Gabriel S, Getz GA, Graubert A, et al. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. (2020) 369:1318–30. doi: 10.1126/science.aaz1776

PubMed Abstract | Crossref Full Text | Google Scholar

33. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinf. (2008) 9:559. doi: 10.1186/1471-2105-9-559

Crossref Full Text | Google Scholar

34. Csardi G, Nepusz T. The igraph software package for complex network research. Int J Compl Syst. (2006) 1695.

Google Scholar

35. Csárdi G, Nepusz T, Traag V, Horvát S, Zanini F, Noom D, et al. igraph: network analysis and visualization in R. Technical Report (2023). doi: 10.5281/zenodo.7682609

Google Scholar

36. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. (2003) 13:2498–504. doi: 10.1101/gr.1239303

PubMed Abstract | Crossref Full Text | Google Scholar

37. Husser D, Büttner P, Stübner D, Ueberham L, Platonov PG, Dinov B, et al. PR interval associated genes, atrial remodeling and rhythm outcome of catheter ablation of atrial fibrillation—a gene-based analysis of GWAS data. Front Genet. (2017) 8:224. doi: 10.3389/fgene.2017.00224

PubMed Abstract | Crossref Full Text | Google Scholar

38. Hong M, Hwang I, Yu HT, Kim TH, Uhm JS, Joung B, et al. Potential causal association of a prolonged PR interval and clinical recurrence of atrial fibrillation after catheter ablation: a Mendelian randomization analysis. J Hum Genet. (2020) 65:813–21. doi: 10.1038/s10038-020-0774-2

PubMed Abstract | Crossref Full Text | Google Scholar

39. Chen S, Francioli LC, Goodrich JK, Collins RL, Kanai M, Wang Q, et al. A genomic mutational constraint map using variation in 76,156 human genomes. Nature. (2024) 625:92–100. doi: 10.1038/s41586-023-06045-0

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: GWAS meta-analysis, recurrent events, atrial fibrillation, myocardial infarction, networks, PheWAS, HUNT

Citation: Hall M, Skogholt AH, Surakka I, Dalen H and Almaas E (2024) Genome-wide association studies reveal differences in genetic susceptibility between single events vs. recurrent events of atrial fibrillation and myocardial infarction: the HUNT study. Front. Cardiovasc. Med. 11:1372107. doi: 10.3389/fcvm.2024.1372107

Received: 17 January 2024; Accepted: 4 April 2024;
Published: 25 April 2024.

Edited by:

Qing K. Wang, Huazhong University of Science and Technology, China

Reviewed by:

Morten Salling Olesen, University of Copenhagen, Denmark
Giovanni Fiorito, University of Sassari, Italy

© 2024 Hall, Skogholt, Surakka, Dalen and Almaas. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Eivind Almaas eivind.almaas@ntnu.no

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.