- 1Department of Biostatistics, School of Public Health, Peking University, Beijing, China
- 2Center for Statistical Science, Peking University, Beijing, China
- 3Department of Epidemiology & Biostatistics, School of Public Health, Peking University, Beijing, China
- 4Department of Global Health, School of Public Health, Peking University, Beijing, China
- 5Key Laboratory of Molecular Cardiovascular Sciences (Peking University), Ministry of Education, Beijing, China
Purpose: To explore whether coffee intake is associated with the risk of type 2 diabetes mellitus (T2DM) from a genetic perspective, and whether this association remains the same among different types of coffee consumers.
Methods: We utilized the summary-level results of 12 genome-wide association studies. First, we used linkage disequilibrium score regression and cross-phenotype association analysis to estimate the genetic correlation and identify shared genes between coffee intake and T2DM in addition to some other T2DM-related phenotypes. Second, we used Mendelian randomization (MR) analysis to test whether there is a significant genetically predicted causal association between coffee intake and the risk of T2DM or other T2DM-related phenotypes. For all the analyses above, we also conducted a separate analysis for different types of coffee consumers, in addition to total coffee intake.
Results: Genetically, choice for ground coffee was significantly negatively associated with the risk of T2DM and some other related risks. While coffee intake and choice for decaffeinated/instant coffee had significant positive correlation with these risks. Between these genetically related phenotypes, there were 1571 genomic shared regions, of which 134 loci were novel. Enrichment analysis showed that these shared genes were significantly enriched in antigen processing related biological processes. MR analysis indicated that higher genetically proxied choice for ground coffee can reduce the risk of T2DM (T2DM: b: -0.2, p-value: 4.70×10-10; T2DM adjusted for body mass index (BMI): b: -0.11, p-value: 4.60×10-5), and BMI (b: -0.08, p-value: 6.50×10-5).
Conclusions: Compared with other types of coffee, ground coffee has a significant negative genetic and genetically predicated causal relationship with the risk of T2DM. And this association is likely to be mediated by immunity. The effect of different coffee types on T2DM is not equal, researchers on coffee should pay more attention to distinguishing between coffee types.
1 Introduction
Coffee is one of the most widely consumed beverages in the world, with about 500 billion cups consumed yearly (1). Coffee contains a variety of chemical substances, although some of which have clear health effects, such as chlorogenic acids (CGAs), which can protect the body from oxidative damage (2), and ochratoxin A, which would aggravate obesity (3), there are still some coffee ingredients whose functions have not been clarified yet. And the content of components in coffee is also affected by various factors such as processing (4). Therefore, research on the effects of coffee types on human health despite being complicated is important.
In recent years, a plethora of studies have been published focusing on the association between coffee intake and type 2 diabetes mellitus (T2DM). Cumulative evidence from observational studies suggests a significant relationship between coffee intake and T2DM (5–7), but observational research is susceptible to confounding factors and reverse causality (8). This problem is somewhat overcome by randomized clinical trials (RCTs), but current RCTs on the impact of coffee intake on T2DM are faced with problems of short intervention period and inconsistent results (9–11). Some researchers used genetic variations as instrumental variables (IVs) to study the effect of genetic proxied coffee intake on T2DM by Mendelian randomization (MR), but these studies hardly distinguished coffee types and the results are not consistent with observational studies, or even completely opposite (12–14). Because of the inconsistent nature of findings regarding the impact of habitual coffee intake on T2DM, more research is necessary before health care workers can make evidence-based recommendations.
It has been indicated that people’s choices for coffee types vary (15) and there are differences in the chemical content of different coffee types (16–18). Previous research typically only investigated the association between total coffee consumption and T2DM risk without distinguishing among coffee types. Therefore, the results obtained from previous studies may not be applicable to every type of coffee, which may limit the clinical application thereof. Recently, the summary statistics of genome-wide association study (GWAS) on the total coffee intake and choices for different coffee types from the UK Biobank (UKB) large cohort have been released (19). Some of the loci found in the GWAS also have been reported to be associated with T2DM (19, 20), suggesting that there may be some genetic or causal links between them.
Therefore, the main objective of the present study is to explore whether coffee intake is associated with T2DM and some other T2DM-related phenotypes from a genetic perspective, and whether this association remains the same among different types of coffee consumers. Our study will shed light on the mechanism of the association between coffee consumption and T2DM and provide more precise guidance for coffee consumption.
2 Materials and Methods
2.1 Study Design
This study aims to identify the association of coffee intake/coffee types and T2DM from a genetic perspective. In order to explore the association more comprehensively, we also included some other T2DM-related phenotypes. The flowchart of the study is shown in Supplementary Figure 1. First, we calculated the genetic correlation between coffee intake/coffee types and T2DM with the use of linkage disequilibrium (LD) score regression. For the phenotypes with significant genetic correlation, we further conducted cross-phenotype association analysis to identify the shared genes and enrichment analysis to find out the enriched biological processes and pathways of these genes. Second, we used MR to estimate the effect of genetically proxied coffee intake/coffee types on T2DM and T2DM-related phenotypes. It is hoped that this study will lead to new insights of the association between coffee intake and T2DM.
In addition to total coffee intake, our research also included an analysis of the choice for different coffee types, including decaffeinated coffee (any type), instant coffee, ground coffee (include espresso, filter etc), and other types of coffee. In addition to T2DM, we also analyzed some other T2DM-related phenotypes that consists of body mass index (BMI), fasting glucose (FG), fasting insulin (FI), insulin resistance (HOMA-IR), and beta-cell function (HOMA-β).
2.2 Data Sources
The data used in this study were obtained from 12 large-scale GWASs (Supplementary Table 1). The data of coffee intake and different coffee type choices came from GWAS of more than 320,000 UKB participants, who were asked how many cups of coffee they drank per day and what type of coffee they usually consumed (decaffeinated coffee (any type), instant coffee, ground coffee (include espresso, filter etc), and other types of coffee) (21), released by Neale lab (19).
The T2DM (adjusted and unadjusted for BMI) data came from the largest T2DM GWAS meta-analysis, with a total of 898,130 participants (22). BMI data was obtained from a meta-analysis of ∼700,000 individuals (23). And the data of FG, FI, HOMA-IR, HOMA - β were obtained from the meta-analysis of 21 GWAS by Jos é e Dupuis et al. (24). For details of these studies, please refer to Supplementary Table 1 and corresponding literature.
2.3 Statistical Analysis
2.3.1 Linkage Disequilibrium Score Regression Analysis
As a convenient and frequently used tool in the study of genetic correlation between different phenotypes, LDSC relies on that the product of z-scores from two studies of phenotypes with non-zero genetic correlation is related to the LD score of the SNP under a polygenic model (25). This method requires only GWAS summary statistics and therefore is faster than other methods. We used LDSC to calculate the genetic correlation between coffee intake, choice for different coffee types and T2DM. We have adopted a series of quality control processes to ensure the accuracy of the results. To ensure the quality of SNP genotyping and imputation, only SNPs present in Hapmap3 were included in the analysis, and a variant was removed from the analysis if it had one of the following conditions: had missing values, INFO score <= 0.9, minor allele frequency (MAF) <= 0.01, p-value<0 or p-value>1, low sample size, not SNPs (e.g., indels), strand ambiguous SNPs, had duplicated rs numbers. Besides, we also checked that the median value of the signed summary statistic column was close to the null median in order to make sure that this column is not mislabeled. Bonferroni correction was performed on the obtained p-value and tests were judged statistically significant at p-value < 1.43×10-3 (0.05/5/7).
2.3.2 Cross-Phenotype Association Analysis
Having identified the significantly genetically related phenotype-pairs, we further conducted cross-phenotype association (26) analysis to identify the shared genes of these phenotype pairs. This was implemented using the R software package ‘CPASSOC’ (26). This package uses the square root of the sample size of each phenotype as the weight and estimates the correlation matrix through summary statistics of all independent SNPs in the two phenotypes (26). This method not only considers the heterogeneity within the same phenotype or between different phenotypes, but also accounts for the potential kinship or population stratification between the participants (26).
SNPs with cross-phenotype association analysis p-value < 5×10-8 and single trait GWAS p-value < 0.05 were thought to have significant influence on both phenotypes. Then we used PLINK 1.9 software (27) to divide these SNPs into different independent clumping regions. SNPs with a distance less than 10,000kb and LD score R2 > 0.001 were divided into the same clumping region. And the SNP with the lowest p-value in each region was taken as the index SNP. A genomic region was judged as a novel shared genomic region if it meets the following four conditions at the same time: (1) was identified in the cross-phenotype association study (p-value < 5 × 10-8); (2) was not identified in the single phenotype GWAS (p-value > 5 × 10-8); (3) was not reported to be associated with either of the two phenotypes in the GWAS catalog (28); (4) was not reported to be associated with either of the two phenotypes in the Phenoscanner (29, 30).
2.3.3 KEGG and GO Enrichment Analysis
In order to have a deeper understanding of the shared mechanism between coffee intake and T2DM, we performed Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) enrichment analysis on the shared genes identified in the previous step. All analysis is done through the R package ‘clusterProfiler’ (31).
2.3.4 Mendelian Randomization Analysis
MR is currently a widely accepted method for assessing potential causal relationship for its unique advantages, compared with observational studies and RCTs (8). The core of MR is IV. In genetic epidemiology, IVs refer to genetic variations related to exposure but not directly related to outcomes and confounders. The selected IVs must satisfy three basic assumptions (8): first, it must be related to the exposure of interest; second, it must be independent of confounding factors; third, given exposure and confounders, the IVs are independent of the outcome.
We used two packages which are widely available to conduct MR: “TwoSampleMR” (32, 33) and “CAUSE” (34). “TwoSampleMR” is a convenient and frequently used tool in the study of two-sample MR. Using this package, we selected independent genetic variations with the threshold of p-value 5×10-8 and clump-kb 10000, clump-r2 0.01 as IVs, calculated causal estimates based on the association effect sizes of IVs with outcome and exposure, and then used a series of methods including inverse variance weighted method (IVW), MR-Egger regression, simple median, weighted median, penalized weighted median, simple mode, and weighted mode to obtain the overall causal estimate. We also tested whether the MR-Egger regression (33) intercept item was zero to check whether the horizontal pleiotropy was balanced.
Different from “TwoSampleMR”, as a recently published method, “CAUSE” uses all SNPs in estimating causal effects, not just variants that are strongly correlated with exposure (34). This method relies on that if exposure has a causal effect on the outcome, then for any SNP that has a non-zero effect on the exposure, its association effect size with exposure and outcome should be related. Based on this, this method is thought to be able to distinguish between causality and horizontal pleiotropy (related and unrelated). If the result shows that the causal model is better to fit the data than the shared model (p-value <0.05), then we think that exposure has a causal effect on the outcome.
3 Results
3.1 Genetic Correlation
There were extensive significant genetic correlations between the amount of coffee intake or choices for different coffee types and T2DM in addition to other T2DM-related phenotypes (Figure 1, Supplementary Tables 2–6). Without distinguishing between coffee types, we found that the amount of coffee intake was positively genetically associated with BMI (Rg = 0.2617, p-value = 1.12×10-30).
Figure 1 Genetic correlation between coffee intake or choices for different coffee types and T2DM/related phenotypes. T2DM, type 2 diabetes mellitus; T2DM(bmiadj), T2DM adjusted for body mass index (BMI); FG, fasting glucose; FI, fasting insulin; HOMA-IR, insulin resistance; HOMA-B, beta-cell function; Significant correlation, p-value < 1.43×10-3 (0.05/5/7).
When considering different coffee types, there was a considerable difference between various coffee type choices. Choosing decaffeinated coffee had a significant positive genetic correlation with T2DM (Rg = 0.1496, p-value = 1.00×10-4) and BMI (Rg = 0.1541, p-value = 2.96×10-6), compared to other types of coffee. Choice for instant coffee showed more significant associations. It was positively associated with T2DM (Rg = 0.1090, p-value = 1.00×10-3), BMI (Rg = 0.1522, p-value = 1.73×10-7), FG (Rg = 0.2084, p-value = 1.00×10-3), FI (Rg = 0.3643, p-value = 6.40×10-6), and HOMA-IR (Rg = 0.3989, p-value = 4.86×10-6).
Conversely, choice for ground coffee showed a favorable relationship with all six phenotypes except HOMA-β (T2DM: Rg = -0.1805, p-value = 2.81×10-13; T2DM adjusted for BMI: Rg = -0.0979, p-value = 3.00×10-4; BMI: Rg = -0.2104, p-value = 6.40×10-26; FG: Rg = -0.1903, p-value = 9.39×10-5; FI: Rg = -0.3584, p-value = 6.88×10-8; HOMA-IR: Rg = -0.3809, p-value = 2.72×10-8). There was no significant genetic association between choice for coffee other than the above-mentioned types and T2DM or any related phenotype.
3.2 Cross-Phenotype Association Analysis
We identified multiple common genomic regions shared between the amount of coffee intake or choice for coffee types and T2DM or other T2DM-related phenotypes through cross-phenotype association analysis of trait pairs with significant genetic correlations (Supplementary Tables 7–20), including many novel regions which have not been reported in previous studies (Tables 1A–D).
Table 1A Novel shared genomic regions shared between coffee intake and T2DM or T2DM-related phenotypes.
In summary, there were 304 shared genomic regions between total amount of coffee consumed and BMI. Choice for decaffeinated coffee had 75 shared genetic regions with T2DM, 238 shared genetic regions with BMI. Choice for instant coffee and T2DM, BMI, FG, FI, HOMA-IR, had 111, 260, 4, 1, 1 shared regions, respectively. Choice for ground coffee shared the most genomic regions with T2DM and related phenotypes, including 136 shared with T2DM, 88 shared with T2DM (adjusted for BMI), 321 shared with BMI, 15 shared with FG, 7 shared with FI, 10 shared with HOMA-IR.
One of the most important findings of this analysis was the novel genomic regions which had not been reported by previous studies. We identified 134 novel regions totally. For example, we found that rs10492872 located on chromosome 16 was the shared site of choice for instant coffee and choice for ground coffee and BMI, and the effects of this locus on choice for instant coffee and ground coffee were opposite, similar to rs11660753 on chromosome 18. Another SNP, rs4988483, also located on chromosome 16, was the shared site of choice for ground coffee and T2DM (adjusted and unadjusted for BMI), suggesting that this SNP may affect both choice of ground coffee and risk of T2DM through a non-BMI-mediated pathway. Further analysis of these sites will help to understand the association between coffee and T2DM.
The discovery of these shared sites suggested that the genetic correlations between coffee intake or choice for different coffee types and T2DM were likely to be driven by these shared regions. Further research on these regions will help to gain a deeper understanding of the pathogenesis of T2DM and its association with coffee.
3.3 KEGG and GO Enrichment Analysis
The KEGG pathways and GO terms enriched by shared genes are shown in Supplementary Figures 2–8. We performed enrichment analysis on the four shared gene sets of total coffee intake/choice for different coffee types (decaffeinated coffee, ground coffee, instant coffee) and T2DM or related phenotypes. In the KEGG analysis, in addition to total coffee intake, the remaining three gene sets all showed significant enrichment in allograft rejection (hsa05330), autoimmune thyroid disease (hsa05320), Human T-cell leukemia virus 1 infection (hsa05166), and viral myocarditis (hsa05416).
In GO analysis, all four gene sets were significantly enriched in the following five biological processes related to antigen processing: antigen processing and presentation (GO:0019882), antigen processing and presentation of endogenous peptide antigen (GO:0002483), antigen processing and presentation of endogenous peptide antigen via MHC class I (GO:0019885), antigen processing and presentation of exogenous peptide antigen (GO:0002478), antigen processing and presentation of peptide antigen (GO: 0048002).
3.4 MR
We used eight different methods to estimate the genetically predicated causal effects of coffee intake and coffee type choice on T2DM or related phenotypes. The results are given in Supplementary Tables 21–25 and shown graphically in Figure 2.
Figure 2 Forest plot for Mendelian randomization results. The vertical axis represents different coffee types and different method. The horizontal axis represents the size of causal effect estimation. T2DM, type 2 diabetes mellitus; BMI, body mass index. CAUSE, causal analysis using summary effect estimates (a mendelian randomization method); CI, confidence interval.
For total coffee intake, its harmful associations with T2DM (b CAUSE: 0.31, p-value CAUSE: 3.30×10-2) and BMI (b CAUSE: 0.36, p-value CAUSE: 1.8×10-5) were found by seven methods (except MR-Egger regression). Even after adjusting BMI, its enhancement of T2DM risk was also reflected in the results of six models (except CAUSE and MR-Egger regression).
Analysis of different coffee types indicated various results. Compared with other types of coffee, choice for decaffeinated coffee was found to increase the risk of T2DM (CAUSE; b CAUSE: 0.29, p-value CAUSE: 2.30×10-2) and BMI (IVW, simple median, weighted median, penalized weighted median) by some methods. In the research of instant coffee, the results obtained by different methods were not completely consistent. CAUSE’s result supported the view that choice for instant coffee could increase the risk of T2DM (b CAUSE: 0.17, p-value CAUSE: 2.90×10-2) and this causality was mediated by BMI, but weighted median, penalized weighted median, simple mode, and weighted mode showed the opposite result. Different from the above two types of coffee, choice for ground coffee showed a protective association with glucose homeostasis. It was indicated to decrease BMI (b CAUSE: -0.08, p-value CAUSE: 6.50×10-5) by CAUSE. Regardless of whether BMI was adjusted or not, the results of most methods showed that choice for ground coffee can reduce the risk of T2DM (except IVW and MR-Egger regression; T2DM: b CAUSE: -0.2, p-value CAUSE: 4.70×10-10; T2DM adjusted for BMI: b CAUSE: -0.11, p-value CAUSE: 4.60×10-5).
4 Discussion
In the present study, we found that genetically proxied total coffee intake, choice for decaffeinated or instant coffee were significantly associated with increased T2DM risk, whereas genetically proxied choice for ground coffee was associated with decreased risk. We have identified the genes shared by these trait pairs and further determined the biological processes and pathways that these genes were enriched in, and the results indicated the association between coffee intake and T2DM was likely mediated by immune system. This is of great significance for a better understanding of the impact of coffee on T2DM.
In the genetic association analysis, our results showed that those opting for ground coffee (include espresso, filter etc) were less genetically susceptible to T2DM, while those who tended to choose instant coffee or decaffeinated coffee (any type) were more genetically susceptible to these risks (Figure 1). Because of the heterogeneity among the consumers of different coffee types showed above, we suggested that combining the data of different coffee beverage consumers in statistical analyses should be done with caution.
We have identified many shared genomic regions between coffee intake or choice for different coffee types and T2DM or related phenotypes (Supplementary Tables 7–20). Some of these regions have been discovered in GWAS of single trait, but a considerable part of them has never been discovered till now (Tables 1A–D). These regions should be paid more attention because they suggest common unknown pathways which may affect both phenotypes and these regions may partly account for the “missing heritability” (35) in mono-phenotype GWAS.
The results of GO enrichment analysis showed that shared genes were significantly enriched in the biological processes related to antigen processing (Supplementary Figures 2, 4, 6, 8). This result suggests that the association between coffee intake and T2DM is likely to be related to immune regulation. The KEGG enrichment results also supported this inference (Supplementary Figures 3, 5, 7). Moreover, in the cross-phenotype association study, the notable identified SNP rs10492872 mapped to FTO is shared by two trait pairs of ground coffee/BMI and instant coffee/BMI, and its effects on ground coffee and instant coffee are in opposite directions, so as rs11660753 mapped to SMIM21 (Tables 1C, D). FTO gene is the first candidate gene of obesity, and it is also related to a variety of other phenotypes, such as alcohol intake (36), C-reactive protein levels (37), etc. A recent study has found that FTO can help cancer cells escape immune surveillance (38). These findings suggest that FTO is likely to participate in immune activities. Compared with FTO, there is less research on SMIM21, but epidemiological studies have also found its significant association with rheumatoid arthritis (39). The above research results suggest that the relationship between coffee intake and T2DM and the differences among various coffee type drinkers are likely to be related to the immune system.
In our MR analysis, we used eight different methods for MR research, with their own characteristics (Figure 2, Supplementary Tables 21–25). Heterogeneity test showed that heterogeneity was common (Supplementary Tables 26–30), which means that the IVs selected according to the conventional principles may be invalid or at least partially invalid. Under this circumstance, the results obtained by IVW may be problematic (40). The test of the intercept term of MR-Egger regression (33) showed that the horizontal pleiotropy did not exist or had reached equilibrium (Supplementary Table 31). However, it should be noted that MR-Egger regression can only test the unrelated pleiotropy, that is, the horizontal pleiotropy of IVs on the outcome is not related to confounding factors, but cannot be used to identify related pleiotropy. Our genetic correlation results showed that these phenotypes had extensive genetic correlations (Figure 1), which suggested that related pleiotropy was likely to exist. The median/mode method (41) has lower requirements for IVs, but considering that it uses little information, the results need to be carefully understood. As a newly proposed method, the CAUSE method (34) can deal with related and unrelated pleiotropy to identify causal models and shared models, but the validity of its results needs to be tested in more applications. In short, we recommend that MR results should be obtained from a combination of different methods such as we did here to address relationships between lifestyle factors and health optimally.
We have noticed that there have been MR studies on coffee intake and the risk of T2DM published (12, 14, 42). Among them, the results of the MR study by Shuai Yuan et al. (12) was similar to our results, they found that coffee intake could increase the risk of T2DM, but this effect disappeared after adjusting for BMI. Whereas the remaining two studies (14, 42) found no significant effect of coffee intake on the risk of T2DM. Compared with these researches, our study has several outstanding advantages. First, in addition to total coffee intake, we also analyzed the association between the choice for different coffee types and T2DM, and found significant differences between various coffee type drinkers. This is not considered by the previous MR studies on coffee intake and T2DM. Second, instead of using only a few variants related to caffeine metabolism, we used more variants as IVs to achieve a greater proportion of explanation for the variance of coffee intake. Finally, we used multiple MR methods with different advantages to overcome the limitations of a single method. Thus, we believe that our research is more reliable and meaningful.
Both the results of genetic correlation study and MR analysis showed that choice for ground coffee was associated with lower risk of T2DM, compared to instant coffee and decaffeinated coffee. This indicates that the content of coffee’s beneficial cardiovascular components is different in various coffee types. This is consistent with the results of the laboratory investigation. A study investigating the content of CGAs and caffeine in 83 commercially available coffee species found that unblended ground coffee had the highest CGAs content and lowest mean caffeine/CGAs ratio (16). Study on total phenol content has reached similar conclusions (4). In instant coffee, the lower beneficial effects of chlorogenic acid and total phenols are likely to be offset by the harmful effects of added sugar, creamer, and other ingredients, as the results of observational studies on instant coffee showed (43).
When we directly analyzed total coffee intake without distinguishing between types, we found significant positive associations. This may be because in the UKB participants, compared with ground coffee, people who drink instant coffee or decaffeinated coffee account for a larger proportion. Actual data support this inference (decaffeinated coffee: N=64,717, instant coffee: N=185,482; ground coffee: N=73,906; other type of coffee: N=5,566). Although we are very clear about the cardiovascular beneficial effects of certain components in coffee, previous studies on the association between coffee intake and cardiovascular risks have not reached a consistent result. Some studies thought that moderate coffee drinking can benefit cardiovascular health (44), but there are also research suggesting the harmful effects of coffee (12, 13). Our results suggest a possible reason for this phenomenon. We suggest that research on the health effects of coffee should distinguish between coffee types.
There are several notable strengths of our work. First, this is the first article to simultaneously study the genetic and potential causal relationship between coffee intake and T2DM. We identified many shared genomic regions and found significant causal relationship, which provided new insights into the mechanism of their associations. In addition to the amount of total coffee intake, we also studied the relationship between choice for different types of coffee and T2DM. The differences in the results suggest the importance of distinguishing different types of coffee in coffee research. In addition to T2DM, we have also investigated other T2DM-related phenotypes (BMI, FG, FI, HOMA-IR, HOMA-β), which will facilitate a more thorough understanding of the association between coffee and T2DM. The GWAS summary statistics selected for our study are all from large-scale and high-quality GWAS. We identify many shared loci and found many significant causal relationships that have not been found in previous less-powerful MR studies.
Our research still has some areas that can be further improved. First, for different types of coffee, it should be noted that we are studying the relationship between choosing this type of coffee and T2DM, but not the amount of this type of coffee consumed. In the future, if relevant data is available, the association between the consumption amount of each type of coffee and T2DM can be further studied. Second, although it is observed that there is a great difference between the results of instant coffee and ground coffee, our research results still cannot answer whether this difference is due to the composition difference caused by the processing or the usual addition of saccharin and other additives in instant coffee. Later, we will further analyze whether the preference of adding milk/cream/sugar to coffee will affect its association with T2DM. We believe that these sites will be much useful for studying the pathogenesis of T2DM and its relationship with coffee.
In summary, we have shown that choice for ground coffee (include espresso, filter etc) has extensive significant negative genetic correlation with T2DM and related phenotypes. MR analysis using all variants indicated genetically proxied choice for ground coffee can decrease BMI and the risk of T2DM, while other types of coffee may increase the risk of T2DM. This study provides new insights and evidence for the health effect of coffee. The results of different coffee types suggests that research on coffee’s health effect should pay more attention to distinguishing between coffee types.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.
Author Contributions
XW, JJ, and TH designed the study. XW performed the statistical analysis. XW wrote the manuscript. All authors helped interpret the data, reviewed, and edited the final paper, and approved the submission.
Funding
The study was supported by grants from the Peking University Start-up Grant (71013Y2114), High-performance Computing Platform of Peking University and Beijing Technology and Business University Grant(88442Y0033). The funding organization had no role in the preparation of the manuscript.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2022.818831/full#supplementary-material
References
1. Kilpatrick J. Beanie Coffee. 2020 December 9 (2020). Available at: https://www.beaniecoffee.com/top-coffee-consuming-countries-world/.
2. Williamson G. Protection Against Developing Type 2 Diabetes by Coffee Consumption: Assessment of the Role of Chlorogenic Acid and Metabolites on Glycaemic Responses. Food Funct (2020) 11(6):4826–33. doi: 10.1039/d0fo01168a
3. Chen C, Wu F. The Need to Revisit Ochratoxin A Risk in Light of Diabetes, Obesity, and Chronic Kidney Disease Prevalence. Food Chem Toxicol (2017) 103:79–85. doi: 10.1016/j.fct.2017.03.001
4. Olechno E, Puścion-Jakubik A, Markiewicz-Żukowska R, Socha K. Impact of Brewing Methods on Total Phenolic Content (TPC) in Various Types of Coffee. Molecules (2020) 25(22):5274. doi: 10.3390/molecules25225274.
5. Kokubo Y, Iso H, Saito I, Yamagishi K, Yatsuya H, Ishihara J, et al. The Impact of Green Tea and Coffee Consumption on the Reduced Risk of Stroke Incidence in Japanese Population: The Japan Public Health Center-Based Study Cohort. Stroke (2013) 44(5):1369–74. doi: 10.1161/strokeaha.111.677500
6. Choi Y, Chang Y, Ryu S, Cho J, Rampal S, Zhang Y, et al. Coffee Consumption and Coronary Artery Calcium in Young and Middle-Aged Asymptomatic Adults. Heart (2015) 101(9):686–91. doi: 10.1136/heartjnl-2014-306663
7. O'Keefe JH, DiNicolantonio JJ, Lavie CJ. Coffee for Cardioprotection and Longevity. Prog Cardiovasc Dis (2018) 61(1):38–42. doi: 10.1016/j.pcad.2018.02.002
8. Benn M, Nordestgaard BG. From Genome-Wide Association Studies to Mendelian Randomization: Novel Opportunities for Understanding Cardiovascular Disease Causality, Pathogenesis, Prevention, and Treatment. Cardiovasc Res (2018) 114(9):1192–208. doi: 10.1093/cvr/cvy045
9. Karusheva Y, Kunstein L, Bierwagen A, Nowotny B, Kabisch S, Groener JB, et al. An 8-Week Diet High in Cereal Fiber and Coffee But Free of Red Meat Does Not Improve Beta-Cell Function in Patients With Type 2 Diabetes Mellitus: A Randomized Controlled Trial. Nutr Metab (Lond) (2018) 15:90. doi: 10.1186/s12986-018-0324-5
10. Alperet DJ, Rebello SA, Khoo EY, Tay Z, Seah SS, Tai BC, et al. The Effect of Coffee Consumption on Insulin Sensitivity and Other Biological Risk Factors for Type 2 Diabetes: A Randomized Placebo-Controlled Trial. Am J Clin Nutr (2020) 111(2):448–58. doi: 10.1093/ajcn/nqz306
11. van Dijk AE, Olthof MR, Meeuse JC, Seebus E, Heine RJ, van Dam RM. Acute Effects of Decaffeinated Coffee and the Major Coffee Components Chlorogenic Acid and Trigonelline on Glucose Tolerance. Diabetes Care (2009) 32(6):1023–5. doi: 10.2337/dc09-0207
12. Yuan S, Larsson SC. An Atlas on Risk Factors for Type 2 Diabetes: A Wide-Angled Mendelian Randomisation Study. Diabetologia (2020) 63(11):2359–71. doi: 10.1007/s00125-020-05253-x
13. Nicolopoulos K, Mulugeta A, Zhou A, Hyppönen E. Association Between Habitual Coffee Consumption and Multiple Disease Outcomes: A Mendelian Randomisation Phenome-Wide Association Study in the UK Biobank. Clin Nutr (2020) 39(11):3467–76. doi: 10.1016/j.clnu.2020.03.009
14. Nordestgaard AT, Thomsen M, Nordestgaard BG. Coffee Intake and Risk of Obesity, Metabolic Syndrome and Type 2 Diabetes: A Mendelian Randomization Study. Int J Epidemiol (2015) 44(2):551–65. doi: 10.1093/ije/dyv083
15. Wong THT, Sui Z, Rangan A, Louie JCY. Discrepancy in Socioeconomic Status Does Not Fully Explain the Variation in Diet Quality Between Consumers of Different Coffee Types. Eur J Nutr (2018) 57(6):2123–31. doi: 10.1007/s00394-017-1488-x
16. Jeon JS, Kim HT, Jeong IH, Hong SR, Oh MS, Yoon MH, et al. Contents of Chlorogenic Acids and Caffeine in Various Coffee-Related Products. J Adv Res (2019) 17:85–94. doi: 10.1016/j.jare.2019.01.002
17. Sirota R, Gorelik S, Harris R, Kohen R, Kanner J. Coffee Polyphenols Protect Human Plasma From Postprandial Carbonyl Modifications. Mol Nutr Food Res (2013) 57(5):916–9. doi: 10.1002/mnfr.201200557
18. Restuccia D, Spizzirri UG, Parisi OI, Cirillo G, Picci N. Brewing Effect on Levels of Biogenic Amines in Different Coffee Samples as Determined by LC-UV. Food Chem (2015) 175:143–50. doi: 10.1016/j.foodchem.2014.11.134
19. team P-U. Pan-UKB (2020). Available at: https://pan.ukbb.broadinstitute.org.
20. Kanai M, Akiyama M, Takahashi A, Matoba N, Momozawa Y, Ikeda M, et al. Genetic Analysis of Quantitative Traits in the Japanese Population Links Cell Types to Complex Human Diseases. Nat Genet (2018) 50(3):390–400. doi: 10.1038/s41588-018-0047-6
21. Canela-Xandri O, Rawlik K, Tenesa A. An Atlas of Genetic Associations in UK Biobank. Nat Genet (2018) 50(11):1593–9. doi: 10.1038/s41588-018-0248-z
22. Mahajan A, Taliun D, Thurner M, Robertson NR, Torres JM, Rayner NW, et al. Fine-Mapping Type 2 Diabetes Loci to Single-Variant Resolution Using High-Density Imputation and Islet-Specific Epigenome Maps. Nat Genet (2018) 50(11):1505–13. doi: 10.1038/s41588-018-0241-6
23. Yengo L, Sidorenko J, Kemper KE, Zheng Z, Wood AR, Weedon MN, et al. Meta-Analysis of Genome-Wide Association Studies for Height and Body Mass Index in ∼700000 Individuals of European Ancestry. Hum Mol Genet (2018) 27(20):3641–9. doi: 10.1093/hmg/ddy271
24. Dupuis J, Langenberg C, Prokopenko I, Saxena R, Soranzo N, Jackson AU, et al. New Genetic Loci Implicated in Fasting Glucose Homeostasis and Their Impact on Type 2 Diabetes Risk. Nat Genet (2010) 42(2):105–16. doi: 10.1038/ng.520
25. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, et al. An Atlas of Genetic Correlations Across Human Diseases and Traits. Nat Genet (2015) 47(11):1236–41. doi: 10.1038/ng.3406
26. Zhu X, Feng T, Tayo BO, Liang J, Young JH, Franceschini N, et al. Meta-Analysis of Correlated Traits via Summary Statistics From GWASs With an Application in Hypertension. Am J Hum Genet (2015) 96(1):21–36. doi: 10.1016/j.ajhg.2014.11.011
27. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-Generation PLINK: Rising to the Challenge of Larger and Richer Datasets. Gigascience (2015) 4:7. doi: 10.1186/s13742-015-0047-8
28. Buniello A, MacArthur JAL, Cerezo M, Harris LW, Hayhurst J, Malangone C, et al. The NHGRI-EBI GWAS Catalog of Published Genome-Wide Association Studies, Targeted Arrays and Summary Statistics 2019. Nucleic Acids Res (2019) 47(D1):D1005–d1012. doi: 10.1093/nar/gky1120
29. Staley JR, Blackshaw J, Kamat MA, Ellis S, Surendran P, Sun BB, et al. PhenoScanner: A Database of Human Genotype-Phenotype Associations. Bioinformatics (2016) 32(20):3207–9. doi: 10.1093/bioinformatics/btw373
30. Kamat MA, Blackshaw JA, Young R, Surendran P, Burgess S, Danesh J, et al. PhenoScanner V2: An Expanded Tool for Searching Human Genotype-Phenotype Associations. Bioinformatics (2019) 35(22):4851–3. doi: 10.1093/bioinformatics/btz469
31. Yu G, Wang LG, Han Y, He QY. Clusterprofiler: An R Package for Comparing Biological Themes Among Gene Clusters. Omics (2012) 16(5):284–7. doi: 10.1089/omi.2011.0118
32. Pierce BL, Burgess S. Efficient Design for Mendelian Randomization Studies: Subsample and 2-Sample Instrumental Variable Estimators. Am J Epidemiol (2013) 178(7):1177–84. doi: 10.1093/aje/kwt084
33. Bowden J, Davey Smith G, Burgess S. Mendelian Randomization With Invalid Instruments: Effect Estimation and Bias Detection Through Egger Regression. Int J Epidemiol (2015) 44(2):512–25. doi: 10.1093/ije/dyv080
34. Morrison J, Knoblauch N, Marcus JH, Stephens M, He X. Mendelian Randomization Accounting for Correlated and Uncorrelated Pleiotropic Effects Using Genome-Wide Summary Statistics. Nat Genet (2020) 52(7):740–7. doi: 10.1038/s41588-020-0631-4
35. Génin E. Missing Heritability of Complex Diseases: Case Solved? Hum Genet (2020) 139(1):103–13. doi: 10.1007/s00439-019-02034-
36. Hubacek JA, Pikhart H, Peasey A, Malyutina S, Pajak A, Tamosiunas A, et al. The Association Between the FTO Gene Variant and Alcohol Consumption and Binge and Problem Drinking in Different Gene-Environment Background: The HAPIEE Study. Gene (2019) 707:30–5. doi: 10.1016/j.gene.2019.05.002
37. Mitra SR, Tan PY, Amini F. Effect of FTO Rs9930506 on Obesity and Interaction of the Gene Variants With Dietary Protein and Vitamin E on C-Reactive Protein Levels in Multi-Ethnic Malaysian Adults. J Hum Nutr Diet (2018) 31(6):758–72. doi: 10.1111/jhn.12593
38. Liu Y, Liang G, Xu H, Dong W, Dong Z, Qiu Z, et al. Tumors Exploit FTO-Mediated Regulation of Glycolytic Metabolism to Evade Immune Surveillance. Cell Metab (2021) 33(6):1221–33.e11. doi: 10.1016/j.cmet.2021.04.001
39. Bossini-Castillo L, de Kovel C, Kallberg H, van 't Slot R, Italiaander A, Coenen M, et al. A Genome-Wide Association Study of Rheumatoid Arthritis Without Antibodies Against Citrullinated Peptides. Ann Rheum Dis (2015) 74(3):e15. doi: 10.1136/annrheumdis-2013-204591
40. Burgess S, Dudbridge F, Thompson SG. Combining Information on Multiple Instrumental Variables in Mendelian Randomization: Comparison of Allele Score and Summarized Data Methods. Stat Med (2016) 35(11):1880–906. doi: 10.1002/sim.6835
41. Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent Estimation in Mendelian Randomization With Some Invalid Instruments Using a Weighted Median Estimator. Genet Epidemiol (2016) 40(4):304–14. doi: 10.1002/gepi.21965
42. Said MA, van de Vegte YJ, Verweij N, van der Harst P. Associations of Observational and Genetically Determined Caffeine Intake With Coronary Artery Disease and Diabetes Mellitus. J Am Heart Assoc (2020) 9(24):e016808. doi: 10.1161/JAHA.120.016808
43. Kim HJ, Cho S, Jacobs DR Jr., Park K. Instant Coffee Consumption may be Associated With Higher Risk of Metabolic Syndrome in Korean Adults. Diabetes Res Clin Pract (2014) 106(1):145–53. doi: 10.1016/j.diabres.2014.07.007
Keywords: type 2 diabetes mellitus, coffee types, coffee intake, cardiac metabolic risks, Mendelian randomization
Citation: Wang X, Jia J and Huang T (2022) Coffee Types and Type 2 Diabetes Mellitus: Large-Scale Cross-Phenotype Association Study and Mendelian Randomization Analysis. Front. Endocrinol. 13:818831. doi: 10.3389/fendo.2022.818831
Received: 20 November 2021; Accepted: 10 January 2022;
Published: 11 February 2022.
Edited by:
Changwei Li, Tulane University School of Public Health and Tropical Medicine, United StatesCopyright © 2022 Wang, Jia and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jinzhu Jia, jzjia@math.pku.edu.cn; Tao Huang, huangtaotao@pku.edu.cn