- 1Department of Biostatistics, School of Public Health, Peking University, Beijing, China
- 2Center for Statistical Science, Peking University, Beijing, China
- 3Department of Epidemiology & Biostatistics, School of Public Health, Peking University, Beijing, China
- 4Department of Global Health, School of Public Health, Peking University, Beijing, China
- 5Key Laboratory of Molecular Cardiovascular Sciences (Peking University), Ministry of Education, Beijing, China
Objective: This study aimed to explore shared genetic etiology and the causality between smoking status and type 2 diabetes (T2D), cardiovascular diseases (CVDs), and related metabolic traits.
Methods: Using summary statistics from publicly available genome-wide association studies (GWASs), we estimated genetic correlations between smoking status and T2D, 6 major CVDs, and 8 related metabolic traits with linkage disequilibrium score regression (LDSC) analysis; identified shared genetic loci with large-scale genome-wide cross-trait meta-analysis; explored potential shared biological mechanisms with a series of post-GWAS analyses; and determined causality with Mendelian randomization (MR).
Results: We found significant positive genetic associations with smoking status for T2D (Rg = 0.170, p = 9.39 × 10−22), coronary artery disease (CAD) (Rg = 0.234, p = 1.96 × 10−27), myocardial infarction (MI) (Rg = 0.226, p = 1.08 × 10−17), and heart failure (HF) (Rg = 0.276, p = 8.43 × 10−20). Cross-trait meta-analysis and transcriptome-wide association analysis of smoking status identified 210 loci (32 novel loci) and 354 gene–tissue pairs jointly associated with T2D, 63 loci (12 novel loci) and 37 gene–tissue pairs with CAD, 38 loci (6 novel loci) and 17 gene–tissue pairs with MI, and 28 loci (3 novel loci) and one gene–tissue pair with HF. The shared loci were enriched in the exo-/endocrine, cardiovascular, nervous, digestive, and genital systems. Furthermore, we observed that smoking status was causally related to a higher risk of T2D (β = 0.385, p = 3.31 × 10−3), CAD (β = 0.670, p = 7.86 × 10−11), MI (β = 0.725, p = 2.32 × 10−9), and HF (β = 0.520, p = 1.53 × 10−6).
Conclusions: Our findings provide strong evidence on shared genetic etiology and causal associations between smoking status and T2D, CAD, MI, and HF, underscoring the potential shared biological mechanisms underlying the link between smoking and T2D and CVDs. This work opens up a new way of more effective and timely prevention of smoking-related T2D and CVDs.
Introduction
Despite concerted efforts to combat the global tobacco epidemic, tobacco smoking remains the leading preventable cause of morbidity and mortality (1). Smoking has multiple well-known adverse health effects (2, 3), and its association with type 2 diabetes (T2D) and cardiovascular diseases (CVDs) has been a major public health concern. Considerable studies, both prospective cohort studies among different population groups (4–6) and meta-analyses (7–10), have provided compelling evidence of the important role of smoking in increasing the risk of T2D and CVDs. Approximately 30%~40% of the increased risk of T2D (2) and 20%~30% of all CVD deaths (11, 12) compared to never smokers are attributed to smoking. In addition, previous twin or family studies have shown that smoking, T2D, and many CVDs, such as coronary artery disease (CAD), are heritable traits (13–15), and the heritability was estimated to range from 4% to 19% for smoking phenotypes (16, 17), 17%~23% for T2D (18), and 14%~21% for CAD (19, 20) in recent large-scale genome-wide association studies (GWASs). Furthermore, genetic correlations between several smoking phenotypes and T2D or CVDs have been observed (16, 21). For example, two recent large-scale GWASs on tobacco use revealed that smoking initiation was genetically positively correlated with T2D, CAD, myocardial infarction (MI), and heart failure (HF) and that cigarettes per day and smoking cessation were genetically positively correlated with CAD. More interestingly, single-nucleotide polymorphisms (SNPs) in some genes have been reported to have effects on both smoking and T2D or CVDs (22–24).
These lines of evidence suggest two possibilities to account for such associations between smoking and T2D or CVDs. One is pleiotropy. Smoking and T2D or CVDs may share common genetic variants that simultaneously influence two or more of these traits or disorders by engaging in common pathways or controlling common risk factors. An alternative possibility is that causal associations may exist between smoking and T2D or CVDs. In recent years, large publicly available GWAS datasets and multiple state-of-the-art statistical analysis methods including linkage disequilibrium score regression (LDSC) (25), cross-trait meta-analysis (26), transcriptome-wide association studies (TWAS) (27), and Mendelian randomization (MR) analysis (28–31), can be utilized to facilitate investigations of whether the comorbidity and risk interrelationship of these traits or disorders can be explained by common genetic variants or causality. Given these possibilities and methodological advances, it is now important and feasible for us to elucidate the mechanisms underlying the comorbidity between smoking and T2D and CVDs. As is apparent from the literature, the associations between smoking and T2D or CVDs varied due to the differences in the measurement of smoking in different studies (3–5, 8). In our study, we chose smoking status, an ordinal categorical variable, which is divided into current smokers, former smokers, and never smokers according to smoking intensity and recency.
To our knowledge, no genetic study has systematically explored the common genetic etiology between smoking status and T2D and CVDs. Therefore, in the present study, we conducted a comprehensive analysis using summary statistics from publicly available GWASs to explore shared genetic etiology and the causality between smoking status and T2D, CVDs, and related metabolic traits.
Materials and Methods
Study Design and Data Summary
The whole study design is shown in Figure 1. Summary statistics used in this study were extracted from publicly available GWASs. The dataset of smoking status was from Gene ATLAS, consisting of 452,264 participants (32, 33). We retrieved summary statistics from the Diabetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium for T2D (N = 898,130) (18). Generally, CVDs encompass a broad range of disorders of the heart and blood vessels including coronary heart disease, cerebrovascular disease, and other conditions. In this study, we chose six common or devastating CVDs including CAD (N = 148,715) (20) and MI (N = 163,665) (34) from the Coronary Artery Disease Genome wide Replication and Meta-analysis (CARDIoGRAM) plus the Coronary Artery Disease (C4D) Genetics (CARDIoGRAMplusC4D) consortium, HF (N = 977,323) (35) from the Heart Failure Molecular Epidemiology for Therapeutic Targets (HERMES), ischemic stroke (IS; N = 521,612) from the METASTROKE collaboration (36), intracerebral hemorrhage (ICH; N = 3,026) from the International Stroke Genetics Consortium (37), and atrial fibrillation (AF; N = 133,073) from the Atrial Fibrillation Genetics Consortium (38). In addition, several important T2D/CVD-related metabolic traits were considered in this study, including glycemic traits [fasting glucose (FG; N = 46,186), fasting insulin (FI; N = 38,238), and the surrogate estimates of β-cell function (HOMA-β; N = 36,466) and insulin resistance (HOMA-IR; N = 37,037) derived from fasting variables by homeostasis model assessment from the Meta-Analyses of Glucose and Insulin-related traits Consortium (39) and blood lipids [high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), total cholesterol (TC), and triglyceride (TG), N = 188,577] from the Global Lipids Genetics Consortium (40). The majority of the participants were of European ancestry in each GWAS (Supplementary Table 1). Detailed disease definition and baseline characteristics for each study were described in previous studies (18, 20, 32–40). For example, smoking status, an ordinal categorical variable based on several questions about smoking intensity and recency, includes the categories of current smokers (those who have smoked 100 cigarettes in their lifetime and currently smoke cigarettes), former smokers (those who have smoked at least 100 cigarettes in their lifetime but had quit smoking at the time of interview), and never smokers (those who have never smoked or who have smoked less than 100 cigarettes in their lifetime) (32, 33). T2D status was defined based on multiple sources of evidence, including a self-reported history of T2D, doctor-diagnosed T2D, antidiabetic treatment, fasting plasma glucose >7.0 mmol/L, or 2-h plasma glucose >11.1 mmol/L (18). In CARDIoGRAMplusC4D, CAD status was defined by an inclusive CAD diagnosis, including MI, percutaneous transluminal coronary angioplasty (PTCA), coronary artery bypass grafting (CABG), chronic ischemic heart disease (IHD), and angina (20). More details of these datasets can be seen in the original publications or related websites (18, 20, 32–40). In this study, our analyses were restricted to autosomal chromosomes.
Figure 1 Overall study design. Multiple genome-wide association study (GWAS) data sources were first retrieved. We first conducted a genome-wide genetic correlation analysis between smoking status and type 2 diabetes (T2D), six cardiovascular diseases (CVDs), and eight related metabolic traits. For the traits that showed significant genetic correlation with smoking status, we further conducted post-GWAS analyses to investigate the genetic overlap between them (variant/region/functional levels). Then, we also explored the causal relationship between smoking status and T2D, six CVDs, and eight related metabolic traits.
Statistical Analysis
Linkage Disequilibrium Score Regression
We used LDSC, a method requiring only GWAS summary statistics and having no bias by sample overlap, to estimate genetic correlations between smoking status and T2D, 6 major CVDs, and 8 related metabolic traits (41). This method relies on an algorithm that multiplies the Z score of the same SNP and two different phenotypes and then regresses the product of the Z scores from two phenotypes on the LD that the SNP has with all neighboring SNPs (25, 41). The Bonferroni correction was used to adjust multiple testing (two-tailed p < 0.05/15).
Partitioned Genetic Correlation
Genetic correlations within functional categories between smoking status and T2D, CAD, MI, and HF were estimated using partitioned LDSC to further describe the genetic overlap at the level of functional categories (42). Eleven functional categories were involved, including the DNase I digital genomic footprinting (DGF) region, DNase I hypersensitivity sites (DHSs), fetal DHS, intron, super-enhancer, transcription factor-binding sites (TFBS), transcribed regions, and histone marks H3K4me1, H3K27ac, H3K4me3, and H3K9ac. This method recalculated LD scores for SNPs partitioned in each particular functional category to estimate the genetic correlation within that functional group.
Cross-Trait Meta-Analysis
We applied a cross-trait GWAS meta-analysis by the R package Cross-Phenotype Association (CPASSOC) to further identify shared loci of the above four trait pairs with strong and significant genetic correlation (26). This method is robust to sample overlap and accommodates different types of phenotypic traits, correlated, independent, continuous, or binary traits. In addition, the effects of trait heterogeneity, population structure, and cryptic relatedness can be controlled by CPASSOC (26). SHet was chosen as the main statistics. SNPs with PSHet < 5 × 10−8 and trait-specific p < 0.01 were considered to have effects on both traits.
Fine-Mapping Credible Set Analysis
To identify the regions of shared loci more precisely, fine-mapping credible set analysis based on a Bayesian algorithm was performed to determine credible sets of causal variants at each of the shared loci (43–45). The identified credible sets of causal variants were 99% likely to contain causal disease-associated SNPs by extracting variants that were highly linked (r2 > 0.4) with the index SNP and within 500 kb of the index SNP (46).
Colocalization Analysis
A colocalization analysis by the R package coloc was applied to determine whether the association signals of trait pairs colocalized at the same locus (47, 48). The probability that both traits are associated and share a single causal variant (Coloc H4 Prob) was calculated with variants extracted within 500 kb of the index SNP at each of the shared loci. Loci with Coloc H4 Prob greater than 0.5 were considered to colocalize (49).
Tissue Enrichment Analysis, Overrepresentation Enrichment Analysis, and Transcriptome−Wide Association Study Analysis
To further understand the biological insights of the identified shared genes between smoking status and T2D, CAD, MI, and HF, we conducted multiple post-GWAS functional analyses. Based on RNA-Seq data from the Human Protein Atlas (HPA) across 35 human tissues (50), we used the TissueEnrich web application to calculate the tissue-specific gene enrichment and further understand whether identified shared genes of each trait pair were enriched in disease-relevant tissues (51). We applied the WebGestalt application (52) to determine overrepresentation enrichment of the identified shared gene set in Gene Ontology (GO) biological processes (53, 54). Furthermore, we conducted TWAS using the FUSION software package and 48 Genotype-Tissue Expression (GTEx) (version 7) reference weights (27) to explore the gene expression association in different tissues between smoking status and T2D, CAD, MI, and HF. The false discovery rate (FDR) Benjamini–Hochberg procedure was applied to correct for multiple testing, and FDR < 0.05 was regarded as significant.
Bidirectional Mendelian Randomization Analysis
Finally, we used the TwoSampleMR package to perform a bidirectional MR analysis to explore the causality between smoking status and T2D, 6 major CVDs, and 8 related metabolic traits (28–31). Bidirectional MR is a form of causal inference analysis that can estimate causal directions and effects by employing genetic instruments selected from large-scale GWASs (55), even in the presence of unmeasured confounders. Three basic assumptions must be fulfilled to yield unbiased causal estimates in the MR analysis: 1) the genetic instruments used must be associated with the exposure, 2) the genetic instruments should be independent of the confounders between the exposure and outcome, and 3) the genetic instruments affect the outcome only through the exposure (46, 56). In this study, we extracted genetic instruments (SNPs) with p < 5 × 10−8 from the GWAS summary statistics of the exposure of interest, conducted the horizontal pleiotropy test, and selected independent genetic instruments at r2 < 0.001 to satisfy these three assumptions. For each potential causality, the inverse variance-weighted (IVW) method was used to obtain the primary causal estimates. The FDR Benjamini–Hochberg procedure was applied to correct for multiple testing (FDR < 0.05).
Notably, the T2D, CAD, and HF GWASs contained UK Biobank participants, which may overlap to some extent with smoking status GWAS from the UK Biobank. Therefore, we additionally extracted T2D, CAD, and HF GWAS summary statistics from earlier or smaller-scale GWASs (57–59) that did not contain UK Biobank participants to further confirm the potential causal associations between smoking status and T2D, CAD, and HF. The details of these GWASs are presented in Supplementary Table 2.
Results
Genome-Wide Genetic Correlation
Understanding the genetic correlations of different complex traits or diseases can provide preliminary insights into genetic etiology. Therefore, we firstly estimated genetic correlations between smoking status and T2D, 6 major CVDs, and 8 related metabolic traits by LDSC. Among these traits, T2D (Rg = 0.170, p = 9.39 × 10−22), CAD (Rg = 0.234, p = 1.96 × 10−27), MI (Rg = 0.226, p = 1.08 × 10−17), and HF (Rg = 0.276, p = 8.43 × 10−20) showed strong and significant positive genetic correlations with smoking status (Table 1). In addition, we found nominally significant positive genetic correlations with smoking status for IS, ICH, and FG (Table 1). Genetic correlations between smoking status and HDL-C or TG reached statistical significance, but the magnitude of genetic correlation was less than 10% (Table 1). However, we did not find evidence of genetic correlations with smoking status for AF, FI, HOMA-B, HOMA-IR, LDL-C, and TC (Table 1).
Table 1 Genetic correlations between smoking status and T2D, CVDs, and related metabolic traits (α = 0.05/15).
Partitioned Genetic Correlation
We used partitioned LDSC analysis to further evaluate genetic correlations between smoking status and T2D, CAD, MI, and HF in 11 functional annotations. Almost all the partitioned genetic correlations in each trait pair were positive (Figure 2 and Supplementary Table 3). Large and statistically significant genetic correlations in many functional categories were observed, and a few categories stood out in particular. The highest magnitude of significant genetic correlation between smoking status and T2D (Rg = 0.167), MI (Rg = 0.164), and HF (Rg = 0.227) was in transcribed regions, where this region can transcribe DNA sequence to mRNA (Figure 2 and Supplementary Table 3). Smoking status and CAD (Rg = 0.162) showed the highest magnitude of significant genetic correlation in DHSs, which are regions of chromatin that are sensitive to cleavage by the DNase I enzyme (Figure 2 and Supplementary Table 3).
Figure 2 Partitioned genetic correlations of smoking status and T2D, CAD, MI, and HF. (A) Partitioned genetic correlations of smoking status and T2D. (B) Partitioned genetic correlations of smoking status and CAD, MI, and HF. The vertical axis represents the genetic correlation estimate; the horizontal axis represents 11 functional categories. The asterisk represents significance after Benjamini–Hochberg correction (FDR < 0.05); error bars represent the SE of the genetic correlation estimate. T2D, type 2 diabetes; CAD, coronary artery disease; MI, myocardial infarction; HF, heart failure; FDR, false discovery rate.
Cross-Trait Meta-Analysis
The strong genetic correlations for smoking status and T2D, CAD, MI, and HF encouraged the exploration of common genetic architecture; therefore, we performed a genome-wide cross-trait meta-analysis to identify shared genetic loci between them (meta-analysis p < 5 × 10− 8; trait-specific p < 0.01). The lists of shared loci of each trait pair are provided in Tables 2, 3 and Supplementary Tables 4–7.
Table 2 Novel shared loci in the cross-trait meta-analysis of smoking status and T2D (pmeta < 5 × 10−8; single trait p < 0.01).
Table 3 Novel shared loci in the cross-trait meta-analysis of smoking status and CAD, MI, and HF (pmeta < 5 × 10−8; single trait p < 0.01).
We found 210 loci significantly associated with both smoking status and T2D, and of these, 32 loci were novel. The most significant locus (index SNP rs9937053, pmeta = 6.72 × 10− 81) was mapped to FTO (Supplementary Table 4), the first gene contributing to common forms of human obesity (60). Previous studies have indicated that FTO is an essential regulator in the development of obesity-induced metabolic and vascular changes (61) and that adiposity-related risk alleles at FTO may predispose individuals to diabetes and cardiovascular events (62, 63). A total of 63 genome-wide significant loci were identified in the meta-analysis of smoking status and CAD, of which 12 loci were novel (Supplementary Table 5). The most significant locus (index SNP rs1412830, pmeta = 3.03 × 10−34) was mapped to the CDKN2B-AS1 region, which was also found to be significant in the cross-trait meta-analysis for smoking status and T2D (pmeta = 2.63 × 10−17) or MI (pmeta = 1.45 × 10−23) (Figure 3). CDKN2B-AS1 is a significant genetic susceptibility locus for CVDs and has also been linked to several other pathologies, such as several cancers, T2D, periodontitis, Alzheimer’s disease, and glaucoma (64, 65). A sum of 38 loci, including 6 novel loci, were found to be significantly associated with both smoking status and MI (Supplementary Table 6). The top two significant loci (index SNP rs12617922, pmeta = 4.36 × 10− 25; index SNP rs12244388, pmeta = 7.40 × 10− 24) were located at RPL6P5 and AS3MT. AS3MT encodes arsenite methyltransferase and plays a role in arsenic metabolism by catalyzing the transfer of a methyl group from S-adenosyl-L-methionine (AdoMet) to trivalent arsenical (66). Cigarette smoke contains arsenic with adverse effects and arsenic exposure has been proven to be linked with the risk of acute MI (67). The genome-wide cross-trait meta-analysis between smoking status and HF identified 28 genome-wide significant loci, of which 3 loci were novel (Supplementary Table 7). The strongest signal was observed on chromosome 3 at the CADM2 region (index SNP rs34495106, pmeta = 3.02 × 10− 19), a critical gene associated with a range of behavioral and metabolic traits, including physical activity, alcohol and cannabis use, and obesity (68).
Figure 3 The overlapping loci at the SNP level identified by the cross-trait meta-analysis across different trait pairs. The Venn diagram illustrates the overlapping loci at the SNP level identified by the cross-trait meta-analysis across different trait pairs. T2D, type 2 diabetes; CAD, coronary artery disease; MI, myocardial infarction; HF, heart failure; SNP, single-nucleotide polymorphism.
Notably, some shared loci overlapped in the cross-trait meta-analysis of smoking status–T2D and smoking status–CVDs (Figures 3, 4). In addition to the SNP rs1412830 located at the CDKN2B-AS1 region, we observed four overlapping significant loci (index SNPs: rs12453682, rs1381274, rs2867112, and rs4790874) in the genome-wide cross-trait meta-analysis of smoking status–T2D and smoking status–CAD. Of these, the SNP rs2867112 is near the protein-coding gene body TMEM18, and genetic variants in the proximity of the gene have been linked to obesity (69), insulin levels, and blood glucose levels (70). In addition, two loci (index SNPs: rs72712556 and rs10030552) mapped to MAML3 were found to be genome-wide significant in the meta-analysis of smoking status–T2D and smoking status–HF. These two loci reached genome-wide significance in the single-trait GWAS of smoking status, but their association with T2D or HF remains unknown. More importantly, genes AS3MT and SMG6 were identified in the cross-trait meta-analysis of all four trait pairs (smoking status–T2D, smoking status–CAD, smoking status–MI, and smoking status–HF). Gene AS3MT is known to act in arsenic metabolism (66), and polymorphisms in the AS3MT have been reported to be associated with CVDs (71) and T2D risks (72, 73). SMG6 is ubiquitously expressed in many tissues and cell types and has dual functions in telomere maintenance and RNA surveillance pathways (74). Multiple loci in SMG6 have been proven to be associated with smoking behavior (17) and CAD (75, 76). However, its role in T2D remains to explore.
Figure 4 The overlapping loci at the gene level identified by the cross-trait meta-analysis across different trait pairs. The Venn diagram illustrates the overlapping loci at the gene level identified by the cross-trait meta-analysis across different trait pairs. T2D, type 2 diabetes; CAD, coronary artery disease; MI, myocardial infarction; HF, heart failure.
Fine-Mapping Credible Set Analysis and Colocalization Analysis
Based on Bayesian fine-mapping, we identified the 99% credible set of causal variants at each of the shared loci. The lists of credible sets of causal variants for each shared locus are provided in Supplementary Tables 8–11. In addition, a colocalization analysis was applied to determine whether the two traits were associated and shared the same causal variant at each shared locus. The number of the shared loci considered to colocalize in each trait pair was 20 (smoking status–T2D), 7 (smoking status–CAD), 4 (smoking status–MI), and 4 (smoking status–HF) (Supplementary Tables 12–15). Among these, 3 loci (index SNPs: rs329122, rs3742305, and rs1443750) reached a great probability (>95%) of having shared causal variants of smoking status and T2D, in addition to 2 loci (index SNPs: rs11556924 and rs10774625) for smoking status–CAD, 2 loci (index SNPs: rs11556924 and rs653178) for smoking status–MI, and one locus (index SNP: rs4766578) for smoking status–HF.
Tissue Enrichment Analysis
To determine whether shared genes from cross-trait meta-analysis between smoking status and T2D, CAD, MI, and HF were enriched for expression in the disease-relevant tissues, we performed a tissue enrichment analysis using the TissueEnrich web application and tissue-specific genes from RNA-Seq data of the HPA. We found that the shared genes of smoking status with T2D, CAD, MI, and HF were all most strongly enriched in the adrenal gland (Figure 5). The stomach was another strongly enriched tissue for the shared genetic genes of smoking status–CAD and smoking status–MI, in addition to the cerebral cortex for the shared genetic genes of smoking status–HF (Figure 5).
Figure 5 Tissue enrichment analysis for the expression of cross-trait-associated genes between smoking status and T2D (A), CAD (B), MI (C), and HF (D). The vertical axis illustrates the logarithm of tissue expression enrichment fold change based on two. The horizontal axis illustrates 35 independent tissue types. T2D, type 2 diabetes; CAD, coronary artery disease; MI, myocardial infarction; HF, heart failure.
Overrepresentation Enrichment Analysis
The overrepresentation enrichment analysis of the GO biological processes highlighted several significantly enriched biological processes for the shared genes between smoking status and T2D, mainly involving regulation of insulin secretion and regulation of peptide hormone secretion (Supplementary Table 16). In addition, the shared genes between smoking status and CAD were significantly enriched in the positive regulation of leukocyte adhesion to vascular endothelial cells, axon development, cell morphogenesis involved in neuron differentiation, and neuron projection morphogenesis (Supplementary Table 17). However, no significantly enriched biological process for the shared genes of smoking status–MI and smoking status–HF was found.
Transcriptome-Wide Association Analysis
We conducted a TWAS analysis to explore the genes whose expression in different tissues was associated with smoking status, T2D, CAD, MI, and HF, and to determine if these genes were common among these traits. The lists of gene–tissue pairs significantly associated with each trait are shown in Supplementary Tables 18–22. Among these gene–tissue pairs, 354 gene–tissue pairs overlapped between smoking status and T2D, in addition to 37 gene–tissue pairs for smoking status–CAD, 17 gene–tissue pairs for smoking status–MI, and one gene–tissue pair for smoking status–HF (Supplementary Table 23). Notably, 17 gene–tissue pairs involving four genes (FAM117B, FES, ICA1L, and NBEAL1) for smoking status–MI were contained in gene–tissue pairs for smoking status–CAD, most of which were observed in the nervous, cardiovascular, exo-/endocrine, and digestive systems. C2orf69–Brain Caudate basal ganglia gene–trait pair was the only one observed overlapping gene–tissue pair between smoking status and HF. Moreover, the enrichment of smoking status and T2D genes expressed were across multiple tissues, not only including nervous, cardiovascular, exo-/endocrine, and digestive systems but also involving the genital system.
Mendelian Randomization Analysis
We performed a bidirectional MR analysis to explore the causal relationship between smoking status and T2D, 6 major CVDs, and 8 related metabolic traits. In the detection of the causal effect of smoking status on cardiometabolic traits, we found that smoking status had significant positive causal effects on T2D (β = 0.385, p = 3.31 × 10−3), CAD (β = 0.670, p = 7.86 × 10−11), MI (β = 0.725, p = 2.32 × 10−9), and HF (β = 0.520, p = 1.53 × 10−6) (Table 4). However, the causal effects of smoking status on other traits (IS, ICH, AF, FG, FI, HOMA-B, HOMA-IR, HDL-C, LDL, TC, and TG) were not identified (Table 4). In addition, we did not observe any significant causal effect of cardiometabolic traits on smoking status (Table 4). Consistent findings that smoking status had significant positive causal effects on T2D, CAD, and HF were observed using additional GWAS data (Supplementary Table 24). MR-Egger regression analysis showed that none of the results were affected by horizontal pleiotropy (Table 4). These results corroborated each other and supported the robustness of our primary findings.
Discussion
To our knowledge, this is the first study to systematically explore shared genetic etiology and the causal relationship between smoking status and T2D and CVDs. First, we found strong positive genetic correlations and further identified shared genetic loci between smoking status and T2D, CAD, MI, and HF. Second, we found that the shared genetic loci were mainly enriched in the adrenal gland and stomach tissues and the biological pathways of nervous system development and regulation of peptide hormone secretion. Third, our TWAS further provided evidence that the enrichment of shared genes expressed was across multiple tissues, including exo-/endocrine, cardiovascular, nervous, digestive, and genital systems. Finally, we identified the causal associations of smoking status with T2D, CAD, MI, and HF. In general, exploration of the shared genetic architecture and causality between smoking status and T2D or CVDs furthers the understanding of the biological mechanisms underlying this comorbidity.
The strong genetic correlations consistent with previous studies (21, 77) suggested that the phenotypic correlations between smoking status and T2D, CAD, MI, and HF were due to a common genetic predisposition base, and we further identified 210 shared genetic loci for smoking status–T2D, in addition to 63 loci for smoking status–CAD, 38 loci for smoking status–MI, and 28 loci for smoking status–HF in the genome-wide cross-trait meta-analysis. Among these shared genetic variants, 32 novel loci were found for smoking status–T2D, along with 12 novel loci for smoking status–CAD, 6 novel loci for smoking status–MI, and 3 novel loci for smoking status–HF, demonstrating the great power of cross-trait meta-analysis in identifying specific shared loci. We highlight several overlapping loci or genes in different trait pairs, which may provide more effective genetic targets for the timely prevention, diagnosis, and treatment of smoking-related T2D and CVDs. The only top locus common to the smoking status–T2D, smoking status–CAD, and smoking status–MI meta-analysis was rs1412830 mapped to CDKN2B-AS1. CDKN2B-AS1 gene is an indispensable long non-coding RNA in multiple diseases (65). In addition to T2D and CVDs (64), CDKN2B-AS1 has been shown to be aberrantly expressed in various malignancies, idiopathic pulmonary fibrosis, endometriosis, inflammatory bowel disease, and primary open-angle glaucoma and to participate in the progression of lipids, carbohydrate metabolism, and inflammation regulation (65), which is likely to serve as a promising therapeutic target or prognostic biomarker in multiple human diseases. The SNP rs2867112 near the protein-coding gene body TMEM18 was found to be significant in the meta-analysis for smoking status–T2D and smoking status–CAD. TMEM18 is an important susceptibility locus for obesity (69), which is an independent risk factor for the development and progression of T2D and CVDs. A previous study provided evidence that smoking might modify the genetic effects of TMEM18 on body mass index (BMI), a proxy for adiposity (78). In addition, two loci (index SNPs: rs72712556 and rs10030552) mapped to MAML3 were found to have genome-wide significance in the meta-analysis of smoking status–T2D and smoking status–HF, which reached genome-wide significance in the single-trait GWAS of smoking status, but its association with HF or T2D remains unknown and deserves in-depth study. AS3MT and SMG6 are two important genes that were identified in the cross-trait meta-analysis of all four trait pairs (smoking status–T2D, smoking status–CAD, smoking status–MI, and smoking status–HF). Cigarette smoke is a vital source of ingested low-level arsenic, and chronic arsenic exposure is associated with increased morbidity and mortality from CVDs (71, 79) and an increased risk of T2D (72, 73). Polymorphisms in AS3MT gene are associated with the efficiency of arsenic biotransformation (66, 72), suggesting that the mechanisms of arsenic metabolism and biotransformation may play an important role in smoking-related T2D and CVDs. Multiple loci in SMG6 have been proven to be associated with smoking behavior (17) and CAD (75, 76). Moreover, a previous study has shown that tobacco smoking is associated with the methylation of genes related to CAD, which includes SMG6 gene (75). These findings provide novel insights into the pathways that link tobacco smoking to the risk of CVDs. However, the role of SMG6 gene in smoking-related T2D remains to be explored.
In addition to the significant findings in the shared genes related to both smoking and T2D or CVDs, we identified the relevant tissues and biological processes that the shared genes enriched in which suggests the potential biological mechanisms that confer comorbid effects. Tissue enrichment analysis showed that the shared genes of smoking status with T2D, CAD, MI, and HF were all most strongly enriched in the adrenal gland. A previous study has reported that cigarette smoking is a strong activator of the hypothalamus–pituitary–adrenal (HPA) axis followed by significant elevations in the adrenal hormone cortisol (80). Cortisol plays an important role in lipid and glucose metabolism; and elevated cortisol levels, if prolonged, lead to a redistribution of body fat characterized by truncal obesity, which is a risk factor for T2D and CVDs (81). Activation of the HPA axis is also thought to contribute to drug abuse during the addictive process, which may also contribute to the abuse-related effects of cigarette smoking (82). In the overrepresentation enrichment analysis, the biological pathway of insulin secretion was found to be significant for the shared genes of smoking status and T2D, indicating that smoking can affect pancreatic islet cell function. Many studies have found neuronal nicotinic acetylcholine receptors (nAChRs) expressed on pancreatic islet cells (83), and these functional nAChRs sensitive to nicotine in pancreatic cells may be a switch to modulate pancreatic cell physiological function and involved in tobacco toxicity (84). Furthermore, several studies in animal models have shown that nicotine can increase apoptosis of islet β-cells, thus reducing insulin secretion (85–88). Mitochondrial dysfunction, oxidative stress, and inflammation are involved as underlying mechanisms for the direct toxicity induced by nicotine via nAChRs (84). The stomach was another strongly enriched tissue for the shared genetic loci of smoking status–CAD and smoking status–MI. Relevant studies have shown that smoking can increase the probability of getting heartburn and peptic ulcers (89), and gastrointestinal diseases may trigger myocardial ischemia-related chest pain probably through the afferent vagal fibers shared by the esophagus and the heart to induce a coronary spasm (90, 91). In addition, the shared genes for smoking status–HF/CAD were enriched in cerebral cortex tissue and the biological pathways of nervous system development, indicating the important role of the nervous system on the comorbidity of smoking and CVDs. Nicotine and fine particulate matter in tobacco smoke can lead to increased sympathetic nerve activity (92), which is one of the hallmarks of chronic congestive HF (93) and plays a role in the process of atherosclerosis (94).
Our TWAS further provided evidence that the shared genes were mostly from the exo-/endocrine, cardiovascular, nervous, and digestive systems. In addition, the TWAS result reported the enrichment of the shared genes between smoking status and T2D from the genital system. Smoking and T2D have a variety of adverse effects on the genital system (95, 96). More importantly, smoking and diabetes may influence the epigenetic modification during the production of germ cells, and these epigenetic dysregulations may be inherited through the germ line and passed onto more than one generation, which in turn may increase the risk of related diseases in offspring (97). A total of 58 significant genes in TWAS were also found to be genome-wide significant in cross-trait meta-analysis for smoking status–T2D, in addition to 13 genes for smoking status–CAD and 3 genes for smoking status–MI, which further indicated the fact that a significant portion of shared genetic loci we identified in the cross-trait meta-analysis were indeed functional variants of modulating gene expression on influencing both phenotypes. Among these, we highlight the importance of the gene TCF7L2, which showed significance in the cross-trait meta-analysis and TWAS of smoking status and T2D. SNPs in TCF7L2 are especially known to be associated with a higher risk of developing T2D (98). Recently, a study has suggested that TCF7L2 links nicotine addiction to diabetes in animal models. This study has revealed that TCF7L2 is densely expressed in the medial habenula and plays an important role in regulating the function of nAChRs in the habenula and in controlling nicotine intake (22). Habenular neurons provide polysynaptic input to the pancreas, and nicotine acts on this habenula–pancreas circuit, in a TCF7L2-dependent manner and via the autonomic nervous system, to increase blood glucose levels (22). Furthermore, FES, ICA1L, and NBEAL1 genes showed significance in the cross-trait meta-analysis and TWAS of smoking status–CAD and smoking status–MI and expressed in multiple tissues, including the brain, nerve, artery, adipose, pancreas, and thyroid tissues. Gene FES, which encodes the human cellular counterpart of a feline sarcoma retrovirus protein with transforming capabilities, is well known to be associated with myeloid leukemia (99), but recent studies observed the function of FES in modulating atherosclerotic plaque vulnerability (100) and the effect of tobacco smoking on DNA methylation of FES (75). Genes ICA1L and NBEAL1 were mapped by the same locus (index SNP: rs114123510), and both are related to cholesterol metabolism, in which dysregulation promotes the pathology of atherosclerosis, MI, and strokes (101). Notably, C2orf69–Brain Caudate basal ganglia gene–trait pair was the only one observed overlapping gene–tissue pair between smoking status and HF. C2orf69 is an evolutionarily conserved gene whose function needs to be further clarified, but recent studies have shown its association with a fatal autoinflammatory syndrome that disrupts the development/homeostasis of the immune and central nervous systems (102, 103), which may contribute to the link between smoking and HF.
In addition to pleiotropy, the associations between smoking status and these cardiometabolic traits may be due to causality. Consistent with previous large cohort (4, 5) and MR studies (104–106), our exploratory bidirectional MR analysis found that smoking status had significant positive causal effects on T2D, CAD, MI, and HF, which suggests that the genetic correlations of the above trait pairs are attributed to both shared genetic architecture and causality. However, we did not observe a significant causal association between smoking status and IS, which is inconsistent with two recent studies (104, 105). This may be due to the different definitions of smoking, involving different ancestry populations, and different sample sizes, which need further confirmation. Besides, we did not observe any causality in the detection of the causal effect of 15 cardiometabolic traits on smoking status, excluding the possibility of reverse causation between smoking status and T2D or CVDs. The potential mechanisms underlying the causal relationship between smoking and T2D or CVDs require further investigation, but the shared loci and related pathways could provide new insights and directions.
In addition, we explored the genetic correlations between smoking status and T2D/CVD-related metabolic traits and observed a nominal positive correlation of smoking status with FG, a weak negative correlation of smoking status with HDL-C, and a weak positive correlation of smoking status with TG. Lipid and glycemic traits, resulting from complex and interwoven physiological mechanisms, are indicators of T2D and CVD risks, and understanding their associations with smoking can provide better insight into the pathophysiological intersect of T2D and CVDs. Previous studies have proven the role of smoking in elevating plasma TG concentration, decreasing plasma HDL-C concentration (107), and increasing the risk of impaired FG (108) and insulin resistance (109), which enhance the increased risk of T2D and CVDs. Although smoking cessation can ameliorate these changes, it is worth noting that smoking cessation is frequently followed by weight gain, which can contribute to the increased short-term risk of T2D (5, 110). Therefore, for smokers at risk for T2D, smoking cessation should be coupled with strategies for T2D prevention and early detection (5).
We acknowledge the limitations of our study. Despite the large sample sizes and high power of the GWAS summary statistics coming from meta-analysis studies, the homogeneity among different summary statistics was reduced. However, each study conducted study-specific quality control to ensure data quality. In addition, simulations have confirmed that the effect of population structure and cryptic relatedness could be controlled well by our cross-trait meta-analysis method CPASSOC. Second, because of the concerns on sample size, accuracy, and availability of the GWAS data, we only analyzed smoking status in this study and did not consider quantitative smoking phenotypes such as cigarettes smoked per day or the years of smoking. Besides, smokeless tobacco products such as snuff tend to show different associations with T2D or CVDs as compared to cigarette smoking (111–113). It is important to consider these phenotypes in future investigations to shed light on the relationship between smoking and T2D or CVDs. Third, limited to the existing original GWASs, the sample sizes of some original trait-specific GWASs, especially ICH, were relatively small, which resulted in limited statistical power (Supplementary Table 25). Fourth, to yield reliable results, we used the data from the largest or latest GWASs, but there may be sample overlap between smoking status and T2D, CAD, and HF, which can influence the inference of causality in MR analysis. However, we used additional GWAS data of these traits with no sample overlap with smoking status GWAS to further confirm our primary findings and observed highly consistent results. Such consistency reinforced the robustness of our findings. Fifth, additional appropriate data were not available for us to replicate our findings. However, we used the data from the largest or latest GWASs for these traits to yield reliable results, and if possible, we will perform replication analysis in the future. Finally, our study was limited to assessing the shared genetic etiology between smoking status and T2D or CVDs. The effects of environmental factors and gene–environment interactions between smoking status and T2D or CVDs still need to be explored in further studies.
In summary, our findings provide strong evidence on shared genetic etiology and causal associations between smoking status and T2D or CVDs, underscoring the potential shared biological mechanisms underlying the link between smoking and T2D or CVDs. This work is important and opens up a new way for more effective and timely prevention, diagnosis, and treatment of smoking-related T2D or CVDs.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material. The download links for all the data relevant to the study can be found in the Supplementary Material.
Ethics Statement
The studies involving human participants were reviewed and approved by the relevant institutional review boards. The patients/participants provided their written informed consent to participate in this study.
Author Contributions
YC, XW, JJ, and TH designed the research. JJ and TH had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. YC and XW wrote the paper and performed the data analysis. All authors contributed to the statistical analysis, critically reviewed the manuscript during the writing process, and approved the final version to be published. YC, XW, JJ, and TH are the guarantors for the study.
Funding
The study was supported by grants from the National Key R&D Program of China (2019YFC2003400), the Peking University Start-up Grant (BMU2018YJ002), and the high-performance Computing Platform of Peking University. The funding organization had no role in the preparation of the manuscript.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2022.809445/full#supplementary-material
References
1. World Health Organization. WHO Report on the Global Tobacco Epidemic, 2017: Monitoring Tobacco Use and Prevention Policies. Available at: https://www.who.int/publications/i/item/9789241512824 (Accessed January 13, 2022).
2. National Center for Chronic Disease Prevention and Health Promotion (US) Office on Smoking and Health. The Health Consequences of Smoking—50 Years of Progress: A Report of the Surgeon General. Atlanta (GA: Centers for Disease Control and Prevention (US (2014).
3. Pirie K, Peto R, Reeves GK, Green J, Beral V. The 21st Century Hazards of Smoking and Benefits of Stopping: A Prospective Study of One Million Women in the UK. Lancet (2013) 381(9861):133–41. doi: 10.1016/s0140-6736(12)61720-6
4. Banks E, Joshy G, Korda RJ, Stavreski B, Soga K, Egger S, et al. Tobacco Smoking and Risk of 36 Cardiovascular Disease Subtypes: Fatal and Non-Fatal Outcomes in a Large Prospective Australian Study. BMC Med (2019) 17(1):128. doi: 10.1186/s12916-019-1351-4
5. Yeh HC, Duncan BB, Schmidt MI, Wang NY, Brancati FL. Smoking, Smoking Cessation, and Risk for Type 2 Diabetes Mellitus: A Cohort Study. Ann Intern Med (2010) 152(1):10–7. doi: 10.7326/0003-4819-152-1-201001050-00005
6. Wannamethee SG, Shaper AG, Perry IJ. Smoking as a Modifiable Risk Factor for Type 2 Diabetes in Middle-Aged Men. Diabetes Care (2001) 24(9):1590–5. doi: 10.2337/diacare.24.9.1590
7. Willi C, Bodenmann P, Ghali WA, Faris PD, Cornuz J. Active Smoking and the Risk of Type 2 Diabetes: A Systematic Review and Meta-Analysis. Jama (2007) 298(22):2654–64. doi: 10.1001/jama.298.22.2654
8. Pan A, Wang Y, Talaei M, Hu FB, Wu T. Relation of Active, Passive, and Quitting Smoking With Incident Type 2 Diabetes: A Systematic Review and Meta-Analysis. Lancet Diabetes Endocrinol (2015) 3(12):958–67. doi: 10.1016/s2213-8587(15)00316-2
9. Mons U, Müezzinler A, Gellert C, Schöttker B, Abnet CC, Bobak M, et al. Impact of Smoking and Smoking Cessation on Cardiovascular Events and Mortality Among Older Adults: Meta-Analysis of Individual Participant Data From Prospective Cohort Studies of the CHANCES Consortium. BMJ (Clin Res Ed) (2015) 350:h1551. doi: 10.1136/bmj.h1551
10. Huxley RR, Woodward M. Cigarette Smoking as a Risk Factor for Coronary Heart Disease in Women Compared With Men: A Systematic Review and Meta-Analysis of Prospective Cohort Studies. Lancet (Lond Engl) (2011) 378(9799):1297–305. doi: 10.1016/s0140-6736(11)60781-2
11. World Health Organization. Tobacco Responsible for 20% of Deaths From Coronary Heart Disease (2020). Available at: https://www.who.int/news/item/22-09-2020-tobacco-responsible-for-20-of-deaths-from-coronary-heart-disease (Accessed January 13, 2022).
12. Ockene IS, Miller NH. Cigarette Smoking, Cardiovascular Disease, and Stroke: A Statement for Healthcare Professionals From the American Heart Association. American Heart Association Task Force on Risk Reduction. Circulation (1997) 96(9):3243–7. doi: 10.1161/01.cir.96.9.3243
13. Li MD, Cheng R, Ma JZ, Swan GE. A Meta-Analysis of Estimated Genetic and Environmental Effects on Smoking Behavior in Male and Female Adult Twins. Addict (Abingdon Engl) (2003) 98(1):23–31. doi: 10.1046/j.1360-0443.2003.00295.x
14. Almgren P, Lehtovirta M, Isomaa B, Sarelin L, Taskinen MR, Lyssenko V, et al. Heritability and Familiality of Type 2 Diabetes and Related Quantitative Traits in the Botnia Study. Diabetologia (2011) 54(11):2811–9. doi: 10.1007/s00125-011-2267-5
15. Zdravkovic S, Wienke A, Pedersen NL, Marenberg ME, Yashin AI, De Faire U. Heritability of Death From Coronary Heart Disease: A 36-Year Follow-Up of 20 966 Swedish Twins. J Intern Med (2002) 252(3):247–54. doi: 10.1046/j.1365-2796.2002.01029.x
16. Liu M, Jiang Y, Wedow R, Li Y, Brazel DM, Chen F, et al. Association Studies of Up to 1.2 Million Individuals Yield New Insights Into the Genetic Etiology of Tobacco and Alcohol Use. Nat Genet (2019) 51(2):237–44. doi: 10.1038/s41588-018-0307-5
17. Xu K, Li B, McGinnis KA, Vickers-Smith R, Dao C, Sun N, et al. Genome-Wide Association Study of Smoking Trajectory and Meta-Analysis of Smoking Status in 842,000 Individuals. Nat Commun (2020) 11(1):5302. doi: 10.1038/s41467-020-18489-3
18. Mahajan A, Taliun D, Thurner M, Robertson NR, Torres JM, Rayner NW, et al. Fine-Mapping Type 2 Diabetes Loci to Single-Variant Resolution Using High-Density Imputation and Islet-Specific Epigenome Maps. Nat Genet (2018) 50(11):1505–13. doi: 10.1038/s41588-018-0241-6
19. van der Harst P, Verweij N. Identification of 64 Novel Genetic Loci Provides an Expanded View on the Genetic Architecture of Coronary Artery Disease. Circ Res (2018) 122(3):433–43. doi: 10.1161/circresaha.117.312086
20. Nelson CP, Goel A, Butterworth AS, Kanoni S, Webb TR, Marouli E, et al. Association Analyses Based on False Discovery Rate Implicate New Loci for Coronary Artery Disease. Nat Genet (2017) 49(9):1385–91. doi: 10.1038/ng.3913
21. Matoba N, Akiyama M, Ishigaki K, Kanai M, Takahashi A, Momozawa Y, et al. GWAS of Smoking Behaviour in 165,436 Japanese People Reveals Seven New Loci and Shared Genetic Architecture. Nat Hum Behav (2019) 3(5):471–7. doi: 10.1038/s41562-019-0557-y
22. Duncan A, Heyer MP, Ishikawa M, Caligiuri SPB, Liu XA, Chen Z, et al. Habenular TCF7L2 Links Nicotine Addiction to Diabetes. Nature (2019) 574(7778):372–7. doi: 10.1038/s41586-019-1653-x
23. Bentley AR, Sung YJ, Brown MR, Winkler TW, Kraja AT, Ntalla I, et al. Multi-Ancestry Genome-Wide Gene–Smoking Interaction Study of 387,272 Individuals Identifies New Loci Associated With Serum Lipids. Nat Genet (2019) 51(4):636–48. doi: 10.1038/s41588-019-0378-y
24. Wang XL, Greco M, Sim AS, Duarte N, Wang J, Wilcken DE. Effect of CYP1A1 MspI Polymorphism on Cigarette Smoking Related Coronary Artery Disease and Diabetes. Atherosclerosis (2002) 162(2):391–7. doi: 10.1016/s0021-9150(01)00723-7
25. Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J, Schizophrenia Working Group of the Psychiatric Genomics C, et al. LD Score Regression Distinguishes Confounding From Polygenicity in Genome-Wide Association Studies. Nat Genet (2015) 47(3):291–5. doi: 10.1038/ng.3211
26. Zhu X, Feng T, Tayo BO, Liang J, Young JH, Franceschini N, et al. Meta-Analysis of Correlated Traits via Summary Statistics From GWASs With an Application in Hypertension. Am J Hum Genet (2015) 96(1):21–36. doi: 10.1016/j.ajhg.2014.11.011
27. Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BW, et al. Integrative Approaches for Large-Scale Transcriptome-Wide Association Studies. Nat Genet (2016) 48(3):245–52. doi: 10.1038/ng.3506
28. Smith GD, Ebrahim S. 'Mendelian Randomization': Can Genetic Epidemiology Contribute to Understanding Environmental Determinants of Disease? Int J Epidemiol (2003) 32(1):1–22. doi: 10.1093/ije/dyg070
29. Davey Smith G, Hemani G. Mendelian Randomization: Genetic Anchors for Causal Inference in Epidemiological Studies. Hum Mol Genet (2014) 23(R1):R89–98. doi: 10.1093/hmg/ddu328
30. Pierce BL, Burgess S. Efficient Design for Mendelian Randomization Studies: Subsample and 2-Sample Instrumental Variable Estimators. Am J Epidemiol (2013) 178(7):1177–84. doi: 10.1093/aje/kwt084
31. Bowden J, Davey Smith G, Burgess S. Mendelian Randomization With Invalid Instruments: Effect Estimation and Bias Detection Through Egger Regression. Int J Epidemiol (2015) 44(2):512–25. doi: 10.1093/ije/dyv080
32. GeneATLAS. Available at: http://geneatlas.roslin.ed.ac.uk/trait/?traits=675 (Accessed January 13, 2022).
33. Canela-Xandri O, Rawlik K, Tenesa A. An Atlas of Genetic Associations in UK Biobank. Nat Genet (2018) 50(11):1593–9. doi: 10.1038/s41588-018-0248-z
34. Nikpay M, Goel A, Won HH, Hall LM, Willenborg C, Kanoni S, et al. A Comprehensive 1,000 Genomes-Based Genome-Wide Association Meta-Analysis of Coronary Artery Disease. Nat Genet (2015) 47(10):1121–30. doi: 10.1038/ng.3396
35. Shah S, Henry A, Roselli C, Lin H, Sveinbjornsson G, Fatemifar G, et al. Genome-Wide Association and Mendelian Randomisation Analysis Provide Insights Into the Pathogenesis of Heart Failure. Nat Commun (2020) 11(1):163. doi: 10.1038/s41467-019-13690-5
36. Malik R, Chauhan G, Traylor M, Sargurupremraj M, Okada Y, Mishra A, et al. Multiancestry Genome-Wide Association Study of 520,000 Subjects Identifies 32 Loci Associated With Stroke and Stroke Subtypes. Nat Genet (2018) 50(4):524–37. doi: 10.1038/s41588-018-0058-3
37. Woo D, Falcone GJ, Devan WJ, Brown WM, Biffi A, Howard TD, et al. Meta-Analysis of Genome-Wide Association Studies Identifies 1q22 as a Susceptibility Locus for Intracerebral Hemorrhage. Am J Hum Genet (2014) 94(4):511–21. doi: 10.1016/j.ajhg.2014.02.012
38. Christophersen IE, Rienstra M, Roselli C, Yin X, Geelhoed B, Barnard J, et al. Large-Scale Analyses of Common and Rare Variants Identify 12 New Loci Associated With Atrial Fibrillation. Nat Genet (2017) 49(6):946–52. doi: 10.1038/ng.3843
39. Dupuis J, Langenberg C, Prokopenko I, Saxena R, Soranzo N, Jackson AU, et al. New Genetic Loci Implicated in Fasting Glucose Homeostasis and Their Impact on Type 2 Diabetes Risk. Nat Genet (2010) 42(2):105–16. doi: 10.1038/ng.520
40. Willer CJ, Schmidt EM, Sengupta S, Peloso GM, Gustafsson S, Kanoni S, et al. Discovery and Refinement of Loci Associated With Lipid Levels. Nat Genet (2013) 45(11):1274–83. doi: 10.1038/ng.2797
41. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, et al. An Atlas of Genetic Correlations Across Human Diseases and Traits. Nat Genet (2015) 47(11):1236–41. doi: 10.1038/ng.3406
42. Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh PR, et al. Partitioning Heritability by Functional Annotation Using Genome-Wide Association Summary Statistics. Nat Genet (2015) 47(11):1228–35. doi: 10.1038/ng.3404
43. Gormley P, Anttila V, Winsvold BS, Palta P, Esko T, Pers TH, et al. Meta-Analysis of 375,000 Individuals Identifies 38 Susceptibility Loci for Migraine. Nat Genet (2016) 48(8):856–66. doi: 10.1038/ng.3598
44. Wellcome Trust Case Control C, Maller JB, McVean G, Byrnes J, Vukcevic D, Palin K, et al. Bayesian Refinement of Association Signals for 14 Loci in 3 Common Diseases. Nat Genet (2012) 44(12):1294–301. doi: 10.1038/ng.2435
45. Schizophrenia Working Group of the Psychiatric Genomics C. Biological Insights From 108 Schizophrenia-Associated Genetic Loci. Nature (2014) 511(7510):421–7. doi: 10.1038/nature13595
46. Wang X, Jia J, Huang T. Shared Genetic Architecture and Casual Relationship Between Leptin Levels and Type 2 Diabetes: Large-Scale Cross-Trait Meta-Analysis and Mendelian Randomization Analysis. BMJ Open Diabetes Res Care (2020) 8(1):e001140. doi: 10.1136/bmjdrc-2019-001140
47. Wallace C. Statistical Testing of Shared Genetic Control for Potentially Related Traits. Genet Epidemiol (2013) 37(8):802–13. doi: 10.1002/gepi.21765
48. Plagnol V, Smyth DJ, Todd JA, Clayton DG. Statistical Independence of the Colocalized Association Signals for Type 1 Diabetes and RPS26 Gene Expression on Chromosome 12q13. Biostatistics (2009) 10(2):327–34. doi: 10.1093/biostatistics/kxn039
49. Zhu Z, Lee PH, Chaffin MD, Chung W, Loh PR, Lu Q, et al. A Genome-Wide Cross-Trait Analysis From UK Biobank Highlights the Shared Genetic Architecture of Asthma and Allergic Diseases. Nat Genet (2018) 50(6):857–64. doi: 10.1038/s41588-018-0121-0
50. Uhlén M, Fagerberg L, Hallström BM, Lindskog C, Oksvold P, Mardinoglu A, et al. Proteomics. Tissue-Based Map of the Human Proteome. Science (2015) 347(6220):1260419. doi: 10.1126/science.1260419
51. Jain A, Tuteja G. TissueEnrich: Tissue-Specific Gene Enrichment Analysis. Bioinformatics (2019) 35(11):1966–7. doi: 10.1093/bioinformatics/bty890
52. Wang J, Duncan D, Shi Z, Zhang B. WEB-Based GEne SeT AnaLysis Toolkit (WebGestalt): Update 2013. Nucleic Acids Res (2013) 41(Web Server issue):W77–83. doi: 10.1093/nar/gkt439
53. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene Ontology: Tool for the Unification of Biology. The Gene Ontology Consortium. Nat Genet (2000) 25(1):25–9. doi: 10.1038/75556
54. Gene Ontology Consortium. Gene Ontology Consortium: Going Forward. Nucleic Acids Res (2015) 43(Database issue):D1049–56. doi: 10.1093/nar/gku1179
55. Choi KW, Chen CY, Stein MB, Klimentidis YC, Wang MJ, Koenen KC, et al. Assessment of Bidirectional Relationships Between Physical Activity and Depression Among Adults: A 2-Sample Mendelian Randomization Study. JAMA Psychiatry (2019) 76(4):399–408. doi: 10.1001/jamapsychiatry.2018.4175
56. Benn M, Nordestgaard BG. From Genome-Wide Association Studies to Mendelian Randomization: Novel Opportunities for Understanding Cardiovascular Disease Causality, Pathogenesis, Prevention, and Treatment. Cardiovasc Res (2018) 114(9):1192–208. doi: 10.1093/cvr/cvy045
57. FinnGen. FinnGen (Documentation of R5 Release) (2021). Available at: https://finngen.gitbook.io/documentation/ (Accessed January 13, 2022).
58. Scott RA, Scott LJ, Mägi R, Marullo L, Gaulton KJ, Kaakinen M, et al. An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans. Diabetes (2017) 66(11):2888–902. doi: 10.2337/db16-1253
59. Schunkert H, Konig IR, Kathiresan S, Reilly MP, Assimes TL, Holm H, et al. Large-Scale Association Analysis Identifies 13 New Susceptibility Loci for Coronary Artery Disease. Nat Genet (2011) 43(4):333–8. doi: 10.1038/ng.784
60. Loos RJF, Bouchard C. FTO: The First Gene Contributing to Common Forms of Human Obesity. Obes Rev (2008) 9(3):246–50. doi: 10.1111/j.1467-789X.2008.00481.x
61. Krüger N, Biwer LA, Good ME, Ruddiman CA, Wolpe AG, DeLalio LJ, et al. Loss of Endothelial FTO Antagonizes Obesity-Induced Metabolic and Vascular Dysfunction. Circ Res (2020) 126(2):232–42. doi: 10.1161/circresaha.119.315531
62. Freathy RM, Timpson NJ, Lawlor DA, Pouta A, Ben-Shlomo Y, Ruokonen A, et al. Common Variation in the FTO Gene Alters Diabetes-Related Metabolic Traits to the Extent Expected Given its Effect on BMI. Diabetes (2008) 57(5):1419–26. doi: 10.2337/db07-1466
63. Ahmad T, Chasman DI, Mora S, Pare G, Cook NR, Buring JE, et al. The Fat-Mass and Obesity-Associated (FTO) Gene, Physical Activity, and Risk of Incident Cardiovascular Events in White Women. Am Heart J (2010) 160(6):1163–9. doi: 10.1016/j.ahj.2010.08.002
64. Cunnington MS, Santibanez Koref M, Mayosi BM, Burn J, Keavney B. Chromosome 9p21 SNPs Associated With Multiple Disease Phenotypes Correlate With ANRIL Expression. PloS Genet (2010) 6(4):e1000899. doi: 10.1371/journal.pgen.1000899
65. Song C, Qi Y, Zhang J, Guo C, Yuan C. CDKN2B-AS1: An Indispensable Long Non-Coding RNA in Multiple Diseases. Curr Pharm Design (2020) 26(41):5335–46. doi: 10.2174/1381612826666200806102424
66. Lin S, Shi Q, Nix FB, Styblo M, Beck MA, Herbin-Davis KM, et al. A Novel S-Adenosyl-L-Methionine:Arsenic(III) Methyltransferase From Rat Liver Cytosol. J Biol Chem (2002) 277(13):10795–803. doi: 10.1074/jbc.M110246200
67. Yuan Y, Marshall G, Ferreccio C, Steinmaus C, Selvin S, Liaw J, et al. Acute Myocardial Infarction Mortality in Comparison With Lung and Bladder Cancer Mortality in Arsenic-Exposed Region II of Chile From 1950 to 2000. Am J Epidemiol (2007) 166(12):1381–91. doi: 10.1093/aje/kwm238
68. Morris J, Bailey MES, Baldassarre D, Cullen B, de Faire U, Ferguson A, et al. Genetic Variation in CADM2 as a Link Between Psychological Traits and Obesity. Sci Rep (2019) 9(1):7339. doi: 10.1038/s41598-019-43861-9
69. Willer CJ, Speliotes EK, Loos RJ, Li S, Lindgren CM, Heid IM, et al. Six New Loci Associated With Body Mass Index Highlight a Neuronal Influence on Body Weight Regulation. Nat Genet (2009) 41(1):25–34. doi: 10.1038/ng.287
70. Wiemerslage L, Gohel PA, Maestri G, Hilmarsson TG, Mickael M, Fredriksson R, et al. The Drosophila Ortholog of TMEM18 Regulates Insulin and Glucagon-Like Signaling. J Endocrinol (2016) 229(3):233–43. doi: 10.1530/JOE-16-0040
71. Gong G, O'Bryant SE. Low-Level Arsenic Exposure, AS3MT Gene Polymorphism and Cardiovascular Diseases in Rural Texas Counties. Environ Res (2012) 113:52–7. doi: 10.1016/j.envres.2012.01.003
72. Grau-Perez M, Kuo CC, Gribble MO, Balakrishnan P, Jones Spratlen M, Vaidya D, et al. Association of Low-Moderate Arsenic Exposure and Arsenic Metabolism With Incident Diabetes and Insulin Resistance in the Strong Heart Family Study. Environ Health Perspect (2017) 125(12):127004. doi: 10.1289/EHP2566
73. Drobná Z, Del Razo LM, García-Vargas GG, Sánchez-Peña LC, Barrera-Hernández A, Stýblo M, et al. Environmental Exposure to Arsenic, AS3MT Polymorphism and Prevalence of Diabetes in Mexico. J Exposure Sci Environ Epidemiol (2013) 23(2):151–5. doi: 10.1038/jes.2012.103
74. SMG6. Nonsense Mediated mRNA Decay Factor [Homo Sapiens (Human)] - Gene - NCBI. Available at: https://www.ncbi.nlm.nih.gov/gene/23293 (Accessed January 13, 2022).
75. Steenaard RV, Ligthart S, Stolk L, Peters MJ, van Meurs JB, Uitterlinden AG, et al. Tobacco Smoking is Associated With Methylation of Genes Related to Coronary Artery Disease. Clin Epigenet (2015) 7:54. doi: 10.1186/s13148-015-0088-y
76. Mega JL, Stitziel NO, Smith JG, Chasman DI, Caulfield M, Devlin JJ, et al. Genetic Risk, Coronary Heart Disease Events, and the Clinical Benefit of Statin Therapy: An Analysis of Primary and Secondary Prevention Trials. Lancet (Lond Engl) (2015) 385(9984):2264–71. doi: 10.1016/s0140-6736(14)61730-x
77. Carlsson S, Kuja-Halkola R, Magnusson C, Lagerros YT, Andersson T. Tobacco and Type 2 Diabetes: Is the Association Explained by Genetic Factors? Int J Epidemiol (2019) 48(3):926–33. doi: 10.1093/ije/dyz002
78. Fesinmeyer MD, North KE, Lim U, Bůžková P, Crawford DC, Haessler J, et al. Effects of Smoking on the Genetic Risk of Obesity: The Population Architecture Using Genomics and Epidemiology Study. BMC Med Genet (2013) 14:6. doi: 10.1186/1471-2350-14-6
79. States JC, Srivastava S, Chen Y, Barchowsky A. Arsenic and Cardiovascular Disease. Toxicol Sci (2009) 107(2):312–23. doi: 10.1093/toxsci/kfn236
80. Mendelson JH, Sholar MB, Goletiani N, Siegel AJ, Mello NK. Effects of Low- and High-Nicotine Cigarette Smoking on Mood States and the HPA Axis in Men. Neuropsychopharmacology (2005) 30(9):1751–63. doi: 10.1038/sj.npp.1300753
81. Rosmond R, Björntorp P. The Hypothalamic-Pituitary-Adrenal Axis Activity as a Predictor of Cardiovascular Disease, Type 2 Diabetes and Stroke. J Intern Med (2000) 247(2):188–97. doi: 10.1046/j.1365-2796.2000.00603.x
82. Mendelson JH, Goletiani N, Sholar MB, Siegel AJ, Mello NK. Effects of Smoking Successive Low- and High-Nicotine Cigarettes on Hypothalamic–Pituitary–Adrenal Axis Hormones and Mood in Men. Neuropsychopharmacology (2008) 33(4):749–60. doi: 10.1038/sj.npp.1301455
83. Yoshikawa H, Hellström-Lindahl E, Grill V. Evidence for Functional Nicotinic Receptors on Pancreatic Beta Cells. Metabolism: Clin Exp (2005) 54(2):247–54. doi: 10.1016/j.metabol.2004.08.020
84. Xie XT, Liu Q, Wu J, Wakui M. Impact of Cigarette Smoking in Type 2 Diabetes Development. Acta Pharmacol Sin (2009) 30(6):784–7. doi: 10.1038/aps.2009.49
85. Somm E, Schwitzgebel VM, Vauthay DM, Camm EJ, Chen CY, Giacobino JP, et al. Prenatal Nicotine Exposure Alters Early Pancreatic Islet and Adipose Tissue Development With Consequences on the Control of Body Weight and Glucose Metabolism Later in Life. Endocrinology (2008) 149(12):6289–99. doi: 10.1210/en.2008-0361
86. Bruin JE, Gerstein HC, Morrison KM, Holloway AC. Increased Pancreatic Beta-Cell Apoptosis Following Fetal and Neonatal Exposure to Nicotine is Mediated via Mitochondria. Toxicol Sci (2008) 103(2):362–70. doi: 10.1093/toxsci/kfn012
87. Bruin JE, Petre MA, Raha S, Morrison KM, Gerstein HC, Holloway AC. Fetal and Neonatal Nicotine Exposure in Wistar Rats Causes Progressive Pancreatic Mitochondrial Damage and Beta Cell Dysfunction. PloS One (2008) 3(10):e3371. doi: 10.1371/journal.pone.0003371
88. Bruin JE, Petre MA, Lehman MA, Raha S, Gerstein HC, Morrison KM, et al. Maternal Nicotine Exposure Increases Oxidative Stress in the Offspring. Free Radical Biol Med (2008) 44(11):1919–25. doi: 10.1016/j.freeradbiomed.2008.02.010
89. Li LF, Chan RL, Lu L, Shen J, Zhang L, Wu WK, et al. Cigarette Smoking and Gastrointestinal Diseases: The Causal Relationship and Underlying Molecular Mechanisms (Review). Int J Mol Med (2014) 34(2):372–80. doi: 10.3892/ijmm.2014.1786
90. Liuzzo JP, Ambrose JA. Chest Pain From Gastroesophageal Reflux Disease in Patients With Coronary Artery Disease. Cardiol Rev (2005) 13(4):167–73. doi: 10.1097/01.crd.0000148844.13702.ce
91. Manfrini O, Bazzocchi G, Luati A, Borghi A, Monari P, Bugiardini R. Coronary Spasm Reflects Inputs From Adjacent Esophageal System. Am J Physiol Heart Circ Physiol (2006) 290(5):H2085–91. doi: 10.1152/ajpheart.00925.2005
92. Middlekauff HR, Park J, Moheimani RS. Adverse Effects of Cigarette and Noncigarette Smoke Exposure on the Autonomic Nervous System: Mechanisms and Implications for Cardiovascular Risk. J Am Coll Cardiol (2014) 64(16):1740–50. doi: 10.1016/j.jacc.2014.06.1201
93. Zucker IH, Wang W, Brändle M, Schultz HD, Patel KP. Neural Regulation of Sympathetic Nerve Activity in Heart Failure. Prog Cardiovasc Dis (1995) 37(6):397–414. doi: 10.1016/s0033-0620(05)80020-9
94. Lichtor T, Davis HR, Johns L, Vesselinovitch D, Wissler RW, Mullan S. The Sympathetic Nervous System and Atherosclerosis. J Neurosurg (1987) 67(6):906–14. doi: 10.3171/jns.1987.67.6.0906
95. Dechanet C, Anahory T, Mathieu Daude JC, Quantin X, Reyftmann L, Hamamah S, et al. Effects of Cigarette Smoking on Reproduction. Hum Reprod Update (2011) 17(1):76–95. doi: 10.1093/humupd/dmq033
96. Maresch CC, Stute DC, Alves MG, Oliveira PF, de Kretser DM, Linn T. Diabetes-Induced Hyperglycemia Impairs Male Reproductive Function: A Systematic Review. Hum Reprod Update (2018) 24(1):86–105. doi: 10.1093/humupd/dmx033
97. Ding GL, Liu Y, Liu ME, Pan JX, Guo MX, Sheng JZ, et al. The Effects of Diabetes on Male Fertility and Epigenetic Regulation During Spermatogenesis. Asian J Andrology (2015) 17(6):948–53. doi: 10.4103/1008-682x.150844
98. Jin T, Liu L. The Wnt Signaling Pathway Effector TCF7L2 and Type 2 Diabetes Mellitus. Mol Endocrinol (2008) 22(11):2383–92. doi: 10.1210/me.2008-0135
99. Cheng HY, Schiavone AP, Smithgall TE. A Point Mutation in the N-Terminal Coiled-Coil Domain Releases c-Fes Tyrosine Kinase Activity and Survival Signaling in Myeloid Leukemia Cells. Mol Cell Biol (2001) 21(18):6170–80. doi: 10.1128/MCB.21.18.6170-6180.2001
100. Karamanavi E, Mcvey DG, van der Laan SW, Yang W, Chan K, Poston RN, et al. The FES Gene, Located at the Chromosome 15Q21.6 Coronary-Artery-Disease Locus, Modulates Atherosclerotic Plaque Vulnerability. Atherosclerosis (2020) 315:e19. doi: 10.1016/j.atherosclerosis.2020.10.070
101. Bindesboll C, Aas A, Ogmundsdottir MH, Pankiv S, Reine T, Zoncu R, et al. NBEAL1 Controls SREBP2 Processing and Cholesterol Metabolism and is a Susceptibility Locus for Coronary Artery Disease. Sci Rep (2020) 10(1):4528. doi: 10.1038/s41598-020-61352-0
102. Lausberg E, Gießelmann S, Dewulf JP, Wiame E, Holz A, Salvarinova R, et al. C2orf69 Mutations Disrupt Mitochondrial Function and Cause a Multisystem Human Disorder With Recurring Autoinflammation. J Clin Invest (2021) 131(12):e143078. doi: 10.1172/jci143078
103. Wong HH, Seet SH, Maier M, Gurel A, Traspas RM, Lee C, et al. Loss of C2orf69 Defines a Fatal Autoinflammatory Syndrome in Humans and Zebrafish That Evokes a Glycogen-Storage-Associated Mitochondriopathy. Am J Hum Genet (2021) 108(7):1301–17. doi: 10.1016/j.ajhg.2021.05.003
104. Larsson SC, Mason AM, Back M, Klarin D, Damrauer SM, Million Veteran P, et al. Genetic Predisposition to Smoking in Relation to 14 Cardiovascular Diseases. Eur Heart J (2020) 41(35):3304–10. doi: 10.1093/eurheartj/ehaa193
105. Levin MG, Klarin D, Assimes TL, Freiberg MS, Ingelsson E, Lynch J, et al. Genetics of Smoking and Risk of Atherosclerotic Cardiovascular Diseases: A Mendelian Randomization Study. JAMA Netw Open (2021) 4(1):e2034461. doi: 10.1001/jamanetworkopen.2020.34461
106. Yuan S, Larsson SC. A Causal Relationship Between Cigarette Smoking and Type 2 Diabetes Mellitus: A Mendelian Randomization Study. Sci Rep (2019) 9(1):19342. doi: 10.1038/s41598-019-56014-9
107. Chelland Campbell S, Moffatt RJ, Stamford BA. Smoking and Smoking Cessation – The Relationship Between Cardiovascular Disease and Lipoprotein Metabolism: A Review. Atherosclerosis (2008) 201(2):225–35. doi: 10.1016/j.atherosclerosis.2008.04.046
108. Nakanishi N, Nakamura K, Matsuo Y, Suzuki K, Tatara K. Cigarette Smoking and Risk for Impaired Fasting Glucose and Type 2 Diabetes in Middle-Aged Japanese Men. Ann Intern Med (2000) 133(3):183–91. doi: 10.7326/0003-4819-133-3-200008010-00009
109. Bergman BC, Perreault L, Hunerdosse D, Kerege A, Playdon M, Samek AM, et al. Novel and Reversible Mechanisms of Smoking-Induced Insulin Resistance in Humans. Diabetes (2012) 61(12):3156–66. doi: 10.2337/db12-0418
110. Hu Y, Zong G, Liu G, Wang M, Rosner B, Pan A, et al. Smoking Cessation, Weight Change, Type 2 Diabetes, and Mortality. N Engl J Med (2018) 379(7):623–32. doi: 10.1056/NEJMoa1803626
111. Titova OE, Baron JA, Michaëlsson K, Larsson SC. Swedish Snuff (Snus) and Risk of Cardiovascular Disease and Mortality: Prospective Cohort Study of Middle-Aged and Older Individuals. BMC Med (2021) 19(1):111. doi: 10.1186/s12916-021-01979-6
112. Janzon E, Hedblad B. Swedish Snuff and Incidence of Cardiovascular Disease. A Population-Based Cohort Study. BMC Cardiovasc Disord (2009) 9:21. doi: 10.1186/1471-2261-9-21
Keywords: smoking status, type 2 diabetes, cardiovascular disease, shared genetic etiology, causality
Citation: Chi Y, Wang X, Jia J and Huang T (2022) Smoking Status and Type 2 Diabetes, and Cardiovascular Disease: A Comprehensive Analysis of Shared Genetic Etiology and Causal Relationship. Front. Endocrinol. 13:809445. doi: 10.3389/fendo.2022.809445
Received: 05 November 2021; Accepted: 18 January 2022;
Published: 18 February 2022.
Edited by:
Kezhong Zhang, Wayne State University, United StatesReviewed by:
Qinghua Sun, Zhejiang Chinese Medical University, ChinaShuai Yuan, Karolinska Institutet (KI), Sweden
Wuxiang Xie, Peking University, China
Copyright © 2022 Chi, Wang, Jia and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jinzhu Jia, anpqaWFAcGt1LmVkdS5jbg==; Tao Huang, aHVhbmcudGFvQHBrdS5lZHUuY24=