- 1School of Food Science and Nutrition, University of Leeds, Leeds, United Kingdom
- 2Public Health Science Division, Fred Hutchinson Cancer Center, Seattle, WA, United States
- 3Leeds Institute of Medical Research, University of Leeds, Leeds, United Kingdom
- 4Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom
Introduction: Gestational diabetes mellitus (GDM) is the most common pregnancy complication worldwide and is associated with short- and long-term health implications for both mother and child. Prevalence of GDM varies between ethnicities, with South Asians (SAs) experiencing up to three times the risk compared to white Europeans (WEs). Recent evidence suggests that underlying metabolic difference contribute to this disparity, but an investigation of causality is required.
Methods: To address this, we paired metabolite and genomic data to evaluate the causal effect of 146 distinct metabolic characteristics on gestational dysglycemia in SAs and WEs. First, we performed 292 GWASs to identify ethnic-specific genetic variants associated with each metabolite (P ≤ 1 x 10-5) in the Born and Bradford cohort (3688 SA and 3354 WE women). Following this, a one-sample Mendelian Randomisation (MR) approach was applied for each metabolite against fasting glucose and 2-hr post glucose at 26-28 weeks gestation. Additional GWAS and MR on 22 composite measures of metabolite classes were also conducted.
Results: This study identified 15 novel genome-wide significant (GWS) SNPs associated with tyrosine in the FOXN and SLC13A2 genes and 1 novel GWS SNP (currently in no known gene) associated with acetate in SAs. Using MR approach, 14 metabolites were found to be associated with postprandial glucose in WEs, while in SAs a distinct panel of 11 metabolites were identified. Interestingly, in WEs, cholesterols were the dominant metabolite class driving with dysglycemia, while in SAs saturated fatty acids and total fatty acids were most commonly associated with dysglycemia.
Discussion: In summary, we confirm and demonstrate the presence of ethnic-specific causal relationships between metabolites and dysglycemia in mid-pregnancy in a UK population of SA and WE pregnant women. Future work will aim to investigate their biological mechanisms on dysglycemia and translating this work towards ethnically tailored GDM prevention strategies.
1 Introduction
Pregnancy is accompanied by a period of intense maternal metabolic adaptation to meet the energy demands of the foetus (1–3). Mild maternal insulin resistance (IR) is a natural adaption to prioritise adequate glucose for the growing foetus (4). However, if gestational IR exceeds healthy levels and glycaemia is uncontrolled, moderate IR can progress to gestational diabetes mellitus (GDM) (1, 5). GDM is characterised by persistent maternal and foetal exposure to elevated levels of glucose, and places the mother and offspring at risk during pregnancy (i.e., macrosomia and haemorrhaging) and in later life (i.e. from obesity, type 2 diabetes (T2D) and cardiovascular disease) (6).
Globally, GDM is the most common pregnancy complication, affecting up to one in every seven births; however, its prevalence varies between ethnic groups, with South Asian (SA) women at 3-fold greater risk compared to white European (WE) women, irrespective of BMI and country of residence (5, 7). Furthermore, SA women are more likely to develop T2D in later life following a GDM diagnosis (8). Factors driving this disparity in prevalence are not fully understood but metabolism is thought to play a key role (6), given emerging evidence demonstrating (i) differences in metabolic profiles between GDM and non-GDM pregnancies in an ethnic-specific manner (2, 9); and (ii) that a single dietary strategy to manage GDM across all ethnic groups appears ineffective (10, 11). However, heterogeneity of reported metabolite-GDM associations between studies (due to differences in quantification methods, GDM diagnostic criteria (12, 13), ethnic and cultural groups), as well as residual confounding in observational studies have prevented complete understanding and, moreover, advancement to improved equitable care. In short, the field requires a clear and accurate understanding of ethnic-specific metabolic drivers of gestational dysglycemia to inform appropriate and effective prevention and management strategies across ethnic groups.
Mendelian Randomisation (MR) is an instrumental variable technique that can provide estimates of causal associations between an exposure (such as metabolites) and outcome (such as dysglycemia) (14–16). However, no study has yet used MR to establish the presence of casual associations between metabolites and measures of glycaemia at or before the 28 week of pregnancy in an ethnic-specific manner. The present study aimed to address this using the multi-ethnic Born in Bradford (BiB) cohort to identify ethnic-specific metabolic drivers of GDM.
2 Material and methods
2.1 Exposure data
BiB is a prospective longitudinal birth cohort that aimed to recruit all mothers receiving maternity care in the Bradford Royal Infirmary between 2007-2010 (17). Bradford, a large city in the north of England, has high levels of deprivation and a large SA population, predominantly of Pakistani ancestry. A total of 12,453 women (mean maternal age 27.8) were recruited, 45% of which were of SA ancestry (17, 18). The study was not pre-registered but (SP622) was approved by Born in Bradford. All participants provided written consent and ethnical approval was obtained from the Bradford Research Ethnics Committee (ref07/H1302/112) (17).
Fasted plasma sample collection and high-throughput metabolite quantification by automated NMR (Nightingale Health©; Helsinki, Finland) has been previously described and validated to a high accuracy (2). Briefly, samples were taken by trained phlebotomists from BiB participants (26-28 weeks’ gestation) and were processed in the absence of freeze-thaw cycles within 2.5 hours before storage at -80°C. One hundred and forty-six absolute measures of metabolites were included in the analysis following the removal of metabolites expressed as a percentage or ratio to minimise redundancy. In total, 10 overarching classes of metabolites were included in the analysis: lipoproteins (n=97), amino acids (n=9), apolipoproteins (n=2), cholesterols (n=8), fatty acids (n=8), glycerides and phospholipids (n=8), glycolysis related metabolites (n=4), ketone bodies (n=2), measures of fluid balance and inflammation (n=3) and measures of lipoprotein particle diameter (n=3). A full list of included metabolites can be found in Supplementary Table 1.
2.2 Outcome data
All participants were assesed prior to GDM diagnosis and the 28th week of pregnancy. Individuals were diagnosed with GDM if either their fasting glucose or if 2-hour post-load glucose concentration exceeded 6.1 mmol/L or 7.8 mmol/L following a 75g oral glucose tolerance test (OGTT) (19). The OGTT was performed in the morning following an overnight fast. To maximise power, MR analyses were performed using continuous metabolite values and fasting glucose and 2-hour post glucose. Fasting glucose and 2-hour post glucose values were log normalised prior to analysis.
2.3 Metabolite data
Information on metabolite data preparation has been described in full elsewhere (9). In brief, 11,480 blood samples were metabolically profiled from BiB, 54 of which were excluded due to failure of any one of five Nightingale© quality control measures leaving 11,426 samples for imputation. Missing data was imputed via multiple imputation using the missMDA package in R (20).
After combining with postprandial glucose data, 3,693 SA and 3,377 individuals whose samples were taken before the 28th week of pregnancy were retained before outlier removal (Supplementary Figure 1). Outliers were removed (those outside of 1.5 x IQR) for each metabolite in each ethnicity separately and metabolite values were normalised by taking the log, square root or normal score transformation (NST) as appropriate following the visual inspection of histograms and QQ plots. Following the removal of outliers, the number of individuals available for GWAS analysis of each metabolite varied (Supplementary Table 2) but was relatively consistent: for SAs the average sample size for each metabolite was 3622 (range 3472-3688), while for WEs it was 3301 (range 3158-3345). Information on gestational age at sample collection and parity was obtained from obstetric records. Ethnicity was self-reported or obtained from primary care records if missing. Individuals of a SA descent other than Pakistani were excluded from the analysis due to the small sample sizes of these populations. Differences in the distribution of continuous variables between ethnic groups were assessed by Mann-Whitney tests, while differences in the distribution of categorical variables were assessed by chi-squared tests. Women of SA ancestry tended to be older than WE women (27.9 ± 0.1 vs 26.7 ± 0.1 years) and were more likely to be overweight/obese (64.5% vs 53.4%), and be on their ≥2nd pregnancy (67.1% vs 48.8%), but were less likely have smoked during pregnancy (2.9% vs 30.9%) (Supplementary Table 3).
2.4 Genetic data
Imputed genetic data were obtained from BiB. All samples were genotyped using two chips: the Infinium Global Sequencing Array-24 v.1 (GSA) (~640K SNPs) and the Infinium CoreExome-24 v1.1 BeadChip (~550K SNPs) (21). Genetic data from the Illumina Global Sequencing Array (GSA) and Illumina CoreExome SNPs were combined. Where SNPs were missing in >5% of individuals, they were excluded (21). When evaluating imputed data, the R2 value can be a measure of quality control as it reflects to the estimated proportion of genetic variation maintained in the imputed data. As a result, SNPs with an R2 <0.9 were excluded prior to analysis.
2.5 GWAS analysis
Conventionally a GWAS assumes individuals are unrelated and the inclusion of related individuals can potentially lead to spurious associations (22, 23). However, the removal of individuals from the BiB sample with close ancestry would substantially reduce the sample size. In addition, high rates of consanguinity in the SA stratum of the cohort makes relatedness difficult to assess (24). As such, a GWAS mixed linear model association (MLMA) analysis was conducted in PLINK (version 1.9) that allowed for the inclusion of related individuals (23, 25–27). MLMA models include a fixed effect, adjusted covariates, and an additional random effect comprised of a variance-covariance matrix that models the correlation (here relatedness) between individuals to be accounted for (23, 25). GWAS MLMA models were implemented using the GCTA (Genome-wide Complex Trait Analysis) command line tool for each metabolite in both ethnicities (28). To increase power, MLMA-loo (leave-one-out) analysis was used, preventing a SNP from being included in both the fixed and random effects concurrently, thereby avoiding double fitting (25). MLMA models also included parity and principal components (PC) 1 and 2 to account for population stratification (Supplementary Figure 2, Supplementary Table 4). Gestational age, which showed little variation, was not included in the modelling (median gestational age SA = 184 days, IQR= 182-186.7, median gestational age WE = 184 days, IQR= 182-187). Genomic inflation factors (λ) were calculated for all models for a range of minor allele frequency (MAF) cut-offs (MAF <0.001, 0.001≤ MAF < 0.005, 0.005 ≤ MAF < 0.01, 0.01 ≤MAF < 0.05, 0.05 ≤ MAF < 0.1, and MAF ≥ 0.1) to minimise data loss while also minimising false positives. λ ≥ 1.1 was considered indicative of genomic inflation (29, 30). A MAF cut-off of <0.05 was the least stringent cut-off found to reduce λ to ~1 meaning this cut off was used in the analysis (Supplementary Table 5).
When a SNP was found to be associated with a metabolite value in only one ethnicity, a fixed effect inverse-variance weighted meta-analysis was implemented to assess the heterogeneity (via the I2 statistic) of identified associations between ethnicities and to see if the SNP retained significance in a larger sample. Meta-analyses were conducted within the command-line tool METAL (31) and supplemented with FUMA (v1.5.2) (32) to investigate SNP function based on their effect on phenotypes.
2.6 One-sample MR
2.6.1 Genetic instruments
One-Sample MR was conducted for all 146 metabolite values in both ethnic groups using SNPs identified as significant at a genome-wide suggestive level (p-value ≤ 1 x 10-5) in the GWAS. All variants were also entered into MR-base (33) to test for other reported known associations that may be in horizontal or vertical pleiotropy. Metabolites were grouped into their overall classes and SNPs in each class were thinned by linkage disequilibrium (LD) (R2<0.2) via the NIH LDlink online tool (https://ldlink.nci.nih.gov) reducing the overlap of instruments in each class (34, 35). For individuals of WE ancestry, all European (EUR) and South Asian (SAS) populations in LD link (software that utilises 1000 Genome data) were used to estimate LD due to the expected similarity in their LD structure allowing for an increase sample size and resultant improvement in the accuracy of LD estimates (36). Similarity between 1000 Genome SA samples and BiB samples was assessed using principal components analysis (PCA) in PLINK (version 1.9) because Pakistani samples from BiB originate from a different region of Pakistan (the Mirpur Region) from the 1000 Genome SA samples (26, 27). This is of particular importance in SA as even geographically close populations can have differing allele frequencies due to differing Biraderi (‘Brotherhood’) membership between population subgroups. Biraderi membership is assigned at birth, is an indicator of male lineage as well as social-occupational status which largely governs partner choice and can result in higher levels of consanguinity in the population (21). PCA plots were created using the ggplot2 package in R studio (version 4.0.2) (37, 38). No clear separation in SA BiB samples and SA 1000 Genome (1000G) samples was identified indicating that LD estimates obtained from 1000G was suitable for use in BiB (Supplementary Figures 3-5).
2.6.2 Analysis
Genetic Risk Scores (GRS) were created in PLINK (version 1.9) for each metabolite with each SNP receiving a weight based on its estimated effect size on the metabolite obtained from the GWAS (39). One-sample MR was then performed by Two-Stage Least Squares regression (TSLS; ivpack, ivreg, and AER packages in R version 4.0.2) to obtain a causal estimate for the effect of each metabolite value on the log-normalised continuous measures of fasting glucose and 2-hour post glucose following a 75g oral glucose tolerance test (OGTT) (37, 40, 41). Here, the level of a metabolite is regressed on its respective GRS and, subsequently, the outcome is regressed onto these fitted GRS-metabolite values in the second stage. All MR results have been reported according to STROBE-MR guidelines (42).
When significant associations were identified, leave-one-out analysis was performed. For this, SNPs were removed sequentially from the instrument and changes to the effect estimate and F-statistic was assessed. If the exclusion of a SNP was found to alter either the effect estimates or F-statistic (through the visualisation of forest plots) it is possible that the SNP is influencing the outcome via an alternative pathway to other SNPs, potentially highlighting a violation of the 2nd or 3rd MR assumption. To further test for violations of these assumptions, included SNPs were searched for in both the Phenoscanner and GWAS Catalogue databases to identify previously identified associations (43–45) with potential confounders in horizontal (i.e., multiple pregnancies, type-1 diabetes, deprivation index, parity) rather than vertical pleiotropy (i.e., along causal pathway, such anthropometrics). In both databases a p-value ≤ 1 x 10-5 was interpreted as indicative of an additional association. Differences between MR and linear regression results were also evaluated via the Wu-Hausman statistic to assess deviation of the instrumental variable estimate from the ordinary least squares (OLS) estimate (46). Deviations in these two measures can indicate either confounding in the OLS estimate (indicating a need for MR) or violations of the MR assumptions due to pleiotropy.
2.6.3 Post-hoc power analyses
For metabolites that associated with a measure of postprandial glucose in only one ethnic group, post-hoc power analyses were performed using the mRnd CNS genomics tool (https://shiny.cnsgenomics.com/mRnd/) to assess whether the absence of an association in the alternate ethnicity was due to limited power (47). Observational and ‘true’ associations required by the tool were obtained by performing linear regression of the outcome on the metabolite and obtaining unadjusted and adjusted estimates (adjusted for maternal age (years), BMI (continuous), smoking status, multiple pregnancy, parity, and gestational age) respectively. Due to the post-hoc nature of this analysis, additional power analyses could be conducted assuming the MR estimate to be the true causal effect in the MR calculation. This analysis was performed in the non-significant population for each metabolite associated in only one ethnicity. If power was found to be adequate (80%) at the 5% level (α = 0.05) power was also assessed at the 1% level (α = 0.01).
3 Results
3.1 GWAS of metabolite measures
A total of 6184 SNPs were associated with at least one metabolite in WEs at the suggestive level (1 x 10-5), with 2616 (42.3%) SNPs being associated with a single metabolite measure. However, no SNPs were identified below the genome-wide significant level (p-value <5 x 10-8) in WEs.
Fewer SNPs were identified at the suggestive level in SAs, with 3685 SNP-metabolite associations in total, of which 1544 (41.9%) SNPs being associated with only one metabolite measure. SNP associations were identified for 138/146 (94.5%) metabolite exposures in SAs (Supplementary Table 6). No SNP was identified as being associated at the suggestive level in both ethnicities, although shared genomic regions were identified between ethnicities (Supplemental Excel). Using FUMA to investigate SNP function based on their effect on phenotypes in both ancestries (32), of the 85 genetic variants meta-analysed that surpassed suggestive GWAS significance: 54 were intergenic (i.e., between genes), 21 were intronic (i.e., between exons of a gene), 5 were upstream (i.e., within 250 bps before transcription start site), 3 were downstream (i.e., within 500 bps after transcription start site), and 2 (2%) were exonic (i.e., within protein coding region).
To evaluate the possibility of shared genomic predictors of metabolites, a pooled meta-analysis of effect estimates in both ethnicities was performed. For 90 metabolite values, no associations were found to exceed the genome-wide suggestive level (p<10-5) following meta-analyses of both ethnicities. SNP associations were identified at the suggestive level for four metabolite measures (concentration of XL-HDL, total lipids in M-VLDL, mean density of VLDL and citrate) despite these differing in direction of effects in SAs and WEs. In addition, 4 SNPs were associated with alanine, despite these SNPs initially being associated with alanine only in the SA population. These SNPs (rs12256633, rs17121228, rs7096521, rs12240368) are all found on chromosome 10 in the receptor gene SORCS1 and have not been associated with alanine levels previously (48, 49).
3.2 MR Results
After LD thinning, genetic instruments were available for all 146 metabolites in WEs and for 136/146 (93.2%) metabolites in SAs. In WEs, 1040 SNPs were retained following LD thinning including 423 (40.67%) that were unique to an individual metabolite. Fewer SNPs were identified in SAs, where 383 SNPS remained after LD thinning, 195 (50.9%) of which were unique to a single metabolite. Only 2.7% of included genetic instruments (4 metabolites) in WEs and 12.5% (9 metabolites) of included genetic instruments in SAs had an F-statistic < 10, indicating that most instruments were at low risk of weak instrument bias (50). The average F-statistic for WEs instruments was 72.4, while in SAs it was considerably lower at 26.7 (Supplementary Figure 6). Screening of genomic predictors using Phenoscanner and GWAS databases did not raise major concerns for horizonal pleiotropy (Supplementary Table 7) but did suggest that modification of anthropometrics is a common pathways by which metabolites elicit their effect on dysglycemia − i.e., vertical pleiotropy. However, where horizontal pleiotropy was a possibility (e.g., cholesterol levels), sensitivity analyses were performed (see 3.2.1.1 and 3.2.2.1 Sensitivity Analyses). Using MR-Base, we report that [in agreement with recent GWAS (51)], almost all SNPs included in an instrument have been previously associated with dysglycemia metrices or diabetes (Supplementary Table 8). However, since most evidence of genomics-diabetes associations are sourced from non-SA populations, these results may not accurately reflect genetic associations in SAs.
3.2.1 White Europeans
Two metabolite values, leucine and mean density of HDL lipoproteins (HDL_D), were associated with both fasting glucose and 2-hour post glucose (Table 1; Supplementary Figures 7, 8). Specifically, a 1mmol/L increase in blood leucine associated with lower fasting glucose (-0.193 mmol/L, 95% CI -0.069, -0.319) and 2-hour post glucose (-0.443 mmol/L, 95% CI -0.113, -0.774). Likewise, a 1nm increase in mean diameter of HDL associated with lower fasting glucose (-0.082 mmol/L, 95% CI 0.026, 0.138) and 2-hour post glucose (-0.191, 95% CI 0.043, 0.339 mmol/L). No other metabolites were associated with both measures of glucose in WEs.
For fasting glucose, an increase of 1mmol/L total cholesterol in M-HDL (M-HDL-C) and cholesterol esters in M-HDL (M-HDL-CE) were associated with lower fasting glucose measures (-0.189 mmol/L, 95% CI -0.021, -0.358, and -0.327 mmol/L, 95% CI -0.069, -0.586 respectively). For 2-hr post-glucose, 8 metabolite values were positively associated with this (HDLC, HDL2C, HDL3C, triglycerides in XS-VLDL, cholesterol esters in XL-HDL, total concentration of L-HDL, total lipids in L-HDL and cholesterol esters in S-HDL) and one (total concentration of S-LDL) was negatively associated (Table 1). Cholesterol metabolites, measures of total cholesterols in lipoproteins and total cholesterols in lipoproteins were the most common types of metabolite class to be associated with postprandial glucose in WEs, with leucine being the only amino acid identified. Wu-Hausman p-values < 0.05 indicate deviations of the instrumental variable estimate from the OLS estimate (Table 1).
3.2.1.1 Sensitivity analyses
For 6 of 13 metabolite values, leave-out one analyses maintained significance (P≤ 0.05) indicating that no individual SNP was driving the identified associations in WEs: leucine, mean diameter of HDL, total lipids in L-HDL, cholesterol esters in S-HDL and cholesterol esters in M-HDL (Supplementary Figure 9). For the remaining 8 metabolites, β values were consistent across leave-one-out analyses although not all associations remained significant. Additionally, for 12/13 metabolites (all but M-HDL-CE), the F-statistic did not substantially differ through the exclusion of individual SNPs from the instruments, which suggests they were not substantially driven by a single SNP (Supplementary Figure 9). The exception to this was cholesterol esters in M-HDL, where the exclusion of rs2138011 or rs739018 increased the F-statistic.
Three of the metabolites (leucine, L-HDL-L, L-HDL-C) that were associated with postprandial glucose in WEs included a SNP that previous studies have associated (p ≤ 1 x 10-5) with at least one potential confounder (BMI, hypertension or waist circumference) (Supplementary Table 7). The removal of these SNPs from the instrument did not impact the significance of the associations identified for leucine or L-HDL-L (Table 2). However, for L-HDL-C instrument, the exclusion of two SNPs (rs5576825 and rs6811162) previously associated with a potential confounder (waist circumference and hypertension respectively) resulted in non-significant association between L-HDL-C and 2-hour post glucose. Importantly, for both SNPs, it is conceivable that the confounders could reside on their causal pathway (i.e., vertically pleiotropic, where L-HDL-C effects 2-hr post-prandial glucose through its effect on weight gain) rather than be in horizontal pleiotropy and may, therefore, violate the 2nd MR assumption (50).
3.2.2 South Asians
No metabolite was associated with both fasting glucose and 2-hour post glucose in SAs (Supplementary Figures 7, 8). Although, for fasting glucose, a 1 mmol/L increase in either total FAw3 or S-HDL-C was associated with an increase of fasting glucose by 0.432 mmol/L (95% CI 0.063 – 0.798) and 1 mmol/L (95% CI 0.116 – 1.882) respectively. No metabolite associated with a decrease in fasting glucose in SAs.
Nine metabolites associated with 2-hour post glucose levels in SAs. Of these, 4 metabolites, LA, FAw6, total lipids in M-VLDL (M-VLDL-L) and total phospholipids in L-HDL (L-HDL-PL), associated with an increase in with 2-hour post glucose, with the largest effect being identified for L-HDL-PL. Specifically, a 1mmol/L increase in L-HDL-PL associated with a 0.692 mmol/L increase (95% CI 0.106 - 1.280) in 2-hour post glucose. A further 5 additional metabolites were associated with a decrease in 2-hour post glucose: concentration of L-LDL (L-LDL-P), total cholesterols in IDL (IDL-C), cholesterol esters in IDL (IDL-CE) concentration, total cholesterols in IDL (IDL-C), total lipids in small S-LDL (S-LDL-L), and total lipids in small S-HDL (S-HDL-L). The largest decrease in 2-hour post glucose was observed for L-LDL-P where a 1mmol/L increase in L-LDL-P associated with a 3.86 mmol/L decrease (95% 0.467 - 7.27) in 2-hour post glucose levels (Table 3).
Fatty acids were the class of metabolites most frequently associated with postprandial glucose in SAs. All three fatty acids (LA, FAw3 and FAw6) associations identified in SAs were of similar magnitude: a 1 mmol/l increase of FAw3 associated with a +0.4 mmol/l increase in fasting glucose or and a 1 mmol/increase of FAw6 and LA associated with a +0.4 mmol/l increase of 2-hour post glucose.
No metabolite found to be associated with postprandial glucose measures in WEs was found to be associated with postprandial glucose in SAs. However, in both populations, members of the S-HDL and L-HDL class were found to be associated with increased postprandial glucose.
3.2.2.1 Sensitivity analyses
Six instruments in SAs were comprised of a single SNP meaning it was not possible to perform a leave-one-out analysis for these metabolites. For the remaining 5 metabolites, associations were consistent across each leave-one-out analyses (Supplementary Figure 9). Likewise, no large differences in F-statistics following the removal of individual SNPs were identified (Supplementary Figure 10).
Just as in WEs, 3 metabolites identified in SAs included SNPs associated with cholesterol or hypertension, which are potential confounders of the association between metabolites and dysglycemia (Supplementary Table 7). Significance was maintained following the removal of SNP rs7486176 (found within the C12orf76 gene) from the total phospholipids in L-HDL instrument. For the LA and FAw6 exposures, the removal of SNP rs12720820 (found within the APOB gene) resulted in a non-significant association indicating that this SNP was the main driver of the identified association (Table 2). In leave-one-out analyses, the removal of SNP rs58865405 from the FAw6 instrument resulted in non-significance, although the biological role of this SNP remains unknown.
3.3 Post-hoc analysis: analysis of metabolite classes
Numerous SNPs were found to be associated with more than one metabolite measure, particularly for metabolites in the same metabolite class (Supplementary Figures 11-12). This was anticipated since many metabolomic pathways are biologically intertwined. To minimise the risk of violation of the third MR assumption (that the genetic instrument must only influence the outcome via the exposure and not via an alternative biological pathway) (14), the collective effect of an entire class of metabolites on postprandial glucose measures was examined. A composite score for each metabolite class was created by placing all metabolites in a single class (e.g. all LDL metabolites), conducting a PCA and extracting PC1. This was only possible for 20 classes in WEs and 21 classes in SAs that had > 2 metabolites and ≥70% of the class variation was explained by PC1 (Supplementary Table 9). To assess the impact of outliers on PCA, outliers were defined and removed based on two cut-offs: standard (1.5 X IQR from the median) and stringent (3 x IQR from the median). For all classes, PC1 and PC2 scores were comparable after removal of both types of outliers so only 3xIQR outliers were removed prior to analyses (Supplementary Table 10).
138 SNPS remained after LD thinning in WEs, 87 (63.04%) of which were unique to a single metabolite class exposure. 19/20 (95%) of the metabolite classes examined in WEs had an F-statistic ≥10 (the only exception being the MHDL class). 54 SNPS remained after LD thinning in SAs, 42 (77.78%) of which were unique to a single metabolite class. Screening of metabolite class predictors in Phenoscanner and GWAS databases did not raise concerns for horizonal (Supplementary Table 11). Despite the lower number of SNPs identified in SAs,17/20 (85%) of the metabolite classes examined had a F-statistic ≥10 indicating that instrument strength was still sufficient, for all classes except for the non-branched amino acids, LLDL class, and all VLDL classes. On average, the mean F-statistic of metabolite class instruments was 19.83%. On average genetic instruments of composite measures of the metabolite classes were weaker than the instruments for individual metabolites in both WEs and SAs (Supplementary Figure 13).
In WEs, 4 metabolite classes were associated with a glucose measure: S-LDLs associated with fasting and 2-hour post glucose; M-LDL and all LDLS (i.e. the collective grouping of HDLs, MDLs and SDLs) were associated with fasting glucose, and XL-HDLs were associated with 2-hour post glucose (Table 4). In SAs, the fatty acid metabolite class were associated with 2-hour post glucose levels. No other associations were identified in SAs.
As with the sensitivity analyses of individual metabolites, the removal of individual SNPs was not found to greatly impact the F-statistic of most instruments (Supplementary Figures 14-15). However, for the fatty acid metabolites class, the exclusion of rs12720820 or rs7159441, and for ‘XL-HDL’, the removal of rs55768285, resulted in non-significant associations, suggesting that these SNPs are key drivers of the association. (Table 4; Supplementary Table 10).
3.4 Power analysis
R2 values were consistently lower in the ethnic group where an effect was not detected but all genetic instruments for the metabolite values had an F-statistic ≥ 10, indicating that weak instrument bias was not responsible for the absence of significant effects. Where an association was identified in one ethnic group but not another, to determine whether the absence of an association was potentially due a lack of power in the other ethnicity rather than an ethnic-specific effect, post-hoc power analyses were performed.
When using MR estimates as an estimate for the true causal effect both the analyses of FAw3 and the overall fatty acid class in WEs were adequately powered to detect the observed MR effect in both populations. Therefore, the absence of an effect of FAw3 in WEs is unlikely due to inadequate power (Supplementary Table 12). The analysis of HDL2C and HDL3C in SAs was also sufficiently powered to detect the observed MR effect in WEs.
4 Discussion
This study has identified ethnically distinct associations between a range of metabolites and postprandial glucose measures taken during pregnancy in SAs and WEs, with notably no shared associations were identified. Fourteen metabolites were found to be associated with postprandial glucose measures in WEs. Whereas, a distinct set of 11 metabolites were associated in SAs. In WEs, cholesterols and lipoproteins were the metabolite classes associated with postprandial glucose measures, while in SAs fatty acids were the most commonly associated.
Furthermore, through an extensive GWAS of metabolites, this study identified novel genome-wide significant associations in relation to acetate (1 SNP, rs10945476) and tyrosine (15 SNPs, all on chromosome 17) in SAs. No previous associations have been identified for SNP rs10945476, found within the non-coding transcript gene PRDM15 in relation to acetate or any other exposure.
Interestingly, 3 of the 15 SNPs associated with tyrosine are found in a transmembrane transporter gene, SLC13A2. Moreover, an additional 10 of the newly identified 15 SNPs associated with tyrosine were found in the FOXN gene, a transcription factor that has previously been identified to be associated with ceramide levels (a lipid metabolite) in a GWAS from a Chinese cohort (52). Moreover, ceramide has been shown to induce tyrosine phosphorylation in membrane proteins meaning it is plausible that a gene associated with ceramide is also associated with tyrosine levels in an Asian population (51). Interestingly, ceramide has been proposed as a mediator of the interaction between saturated fat and insulin resistance and has been associated with T2D and cardiovascular disease (53). To the best of our knowledge tyrosine levels have not previously been associated with either FOXN SLC13A2. The remaining 2 of the 15 SNPs identified as being associated with tyrosine in SAs, are currently not in any known genes. All 15 SNPs identified as being associated with tyrosine in SAs are in LD with each other in 1000G SA populations (all R2 ≥ 0.38). In agreement with a recent GWAS of GDM and T2D (54), almost all SNPs included in an instrument significantly associated with either outcome (fasting glucose or 2-hour post glucose) have previously been associated with a diabetes related disease outcome, providing additional evidence for their validity as instrumental variables. However, since most evidence of genomics-diabetes associations are sourced from non-SA populations, these results may not accurately reflect genetic associations in SAs.
4.1 Identified associations in white Europeans (WEs)
4.1.1 Leucine
Branched chain amino acids (BCAAs), including leucine, are predominantly metabolised in skeletal muscles where they regulate protein synthesis and mitochondrial functions (55). In addition, BCAAs are hormonal signalling regulators and are expected to module insulin resistance (IR) through increasing insulin secretion in human pancreatic β-cells (56, 57). Our study found leucine to be negatively associated with both fasting glucose and 2-hour post glucose levels during pregnancy in WEs; with 1 mmol/L of leucine associated with a decrease of 0.193 mmol/L in fasting glucose and 0.327 mmol/L 2-hour post glucose respectively. Although few studies have investigated the role of leucine in glucose regulation during pregnancy, interestingly the ratio of leucine/isoleucine was similarly found associated with reductions in fasting glucose in the HAPO study, a multi-ethnic cohort of pregnant women of Afro-Caribbean, Mexican American, Northern European, and Thai ancestry (58). Common dietary sources of leucine include meat products and cheese, with smaller amounts also being present in other dairy products (such as dairy and yoghurt), fish and in certain legumes and nuts, such as dried raw broad beans and pine nuts (55). Hence, dietary interventions aimed at increasing leucine levels during pregnancy, possibly through a dietary intervention promoting the consumption of lean animal protein, low-fat dairy and nuts, may help improve pregnancy hyperglycaemia in WEs.
4.1.2 Cholesterols
HDL cholesterol is colloquially described as ‘good cholesterol’ due to its role in the removal of cholesterols from atherosclerotic plaque, thereby reducing an individual’s risk of CVD (59). Furthermore, low HDL levels have commonly been associated with diabetes in humans, with HDL shown to increase insulin secretion and β-cell survival (60, 61). We identified four associations between HDL cholesterol and postprandial glucose measures in WEs. Herein, 1 mmol/L increase in S-HDL-CE confers a 1.78 mmol/L increase (95% CI 0.49 – 3.11) in 2-hour post glucose. This is consistent with previous evidence from a Finnish sample of overweight and obese women where cholesterol esters in S-HDL were higher in the serum samples of GDM cases at ~14 weeks gestation (62). Discrepancies in the direct effect of HDL cholesterol on dysglycemia have also been identified in the genetic literature (61), with a recent review highlighting that while a genetic study utilising linear relation analysis did find HDLs to have a protective effects against T2D (n cases = 2,447) (63), the same effect was not been replicated in an MR setting (n cases= 47,627) (64). When considering LDL cholesterols, only S-LDLs was found to be significantly associated with a postprandial glucose measure (fasting glucose) in WEs. Additionally, in our composite analysis of metabolite classes, S-LDLs were associated with fasting glucose and 2-hour post glucose in WEs, whereas the M-LDL and all LDL (a combined measure of S-LDLs, M-LDLs and HDLs) classes were associated with fasting glucose. Unfortunately, because composite scores were comprised of PC1 coordinates the direction of effect of these associations could not be evaluated. To our knowledge, no previous study has conducted an MR of metabolites on dysglycemic predictors of GDM.
4.1.3 Triglycerides
Triglycerides are an abundant class of lipid particles found in the blood, originating from either from the consumption of dietary fats or as a result of hepatic metabolism (65, 66). Once in the blood, triglycerides can be incorporated into HDL and LDL cholesterol particles. In addition to dietary triglyceride consumption, dietary fatty acids can be converted into triglycerides before they enter circulation, highlighting the complex relationship between triglyceride, cholesterol, and fatty acid levels (65).
Our results suggest triglycerides in XSVLDL (XS-VLDL-TG) associate with increased 2-hour post glucose (0.841 mmol/L) in WEs. In agreement with these findings, increased triglycerides in XSVLDL levels have also previously been associated with increased likelihood of GDM in a Finnish population (62). No other triglyceride was found to be associated with in WEs. One explanation no additional associations were detected could be due to the average BMI of the WEs in BiB. For example, an analysis of a prospective Irish cohort (~94% WE) found that triglyceride levels were only associated with GDM in obese individuals, a higher average BMI than that observed in the BiB cohort (67). Further confirmation of these findings of increased triglycerides in XS-VLDLs would suggest that this association is, at least in part, responsible for the identified associations between diets high in fats and increased prevalence of GDM in WEs (10).
4.2 Identified associations in South Asians (SAs)
4.2.1 Fatty acids
Polyunsaturated acids (PUFAs) are consumed in the diet and can be converted to long-chain PUFAs (LC-PUFAs) through a process of desaturation and elongation reactions that predominately occur in the liver (68). Changes in dietary patterns can have a large impact on fatty acid composition in the body and with-it disease risk. For example, a western dietary pattern, which has high levels of n-6 fatty acids, has been associated with GDM risk (10, 69). In a cohort of Chinese adults, total n-6 fatty acids and 18:2 n-6 levels at baseline in venous blood samples were both found to associate with an increased risk of T2D after ~8 years of follow up, while increased n-3 fatty acid levels were protective (70). However, a recent two-sample MR suggested only a negligible effect of n-6 PUFA synthesis on T2D in a predominantly WE cohort (71). Moreover, the relationship between n-3, n-6, n-9 fatty acids and GDM remains inconclusive, and in a recent (2021) systematic review none of the identified studies (n=15) was conducted in a SA population (69), highlighting the need for more studies exploring the role of fatty acids in GDM development in Asians (72).
This study provides evidence of an association between LA and total FAw6 levels and an increase in 2-hour post glucose levels during pregnancy in SAs. In addition, the fatty acid class associated with 2-hour post glucose in SAs. Through a leave-out-one sensitivity analyses for the FAw6 and LA instruments, the removal of the SNP rs12720820 (found within the APOB gene) resulted in non-significant associations for both exposures, indicating that this rs12720820 was the largest contributor to the identified associations and has previously been associated with cholesterol levels and the use of cholesterol lowering drugs (73). Interestingly, FAw3 also associated with increased 2-hour post glucose in SAs; however, with only one SNP potential pleiotropy could not be explored.
It is well established that fatty acid profiles can impact blood cholesterol levels (74–76). In addition, increased dietary cholesterol has previously been associated with an increased risk of GDM in a systematic review of observational studies (77). Taken together, our data confirm that fatty acids and cholesterol metabolites are in vertically pleiotropy and are likely impacting gestational dysglycemia via the same causal pathway. Unlike horizontal pleiotropy, vertical pleiotropy does not result in a violation of the 2nd MR assumption as cholesterol is not acting as a confounder, meaning MR estimates are still valid (Figure 1). Furthermore, it is also possible that this interaction between fatty acids and cholesterols may be ethnic-specific due to the absence of associations identified between fatty acids and postprandial glucose measures in WEs. In addition to possible variations in cholesterol metabolism, it is plausible that variations in fatty acid synthesis are also partially responsible for the increased GDM risk experienced by SAs. For example, variants within the FADS genes impact LC-PUFA conversion (78, 79). Current evidence suggests that SAs are likely to synthesise LC-PUFA more quickly than WEs, which could contribute to elevated risk of prolonged exposure to elevated LC-PUFA levels (namely, w6) and risk of dysglycemia (78, 79). If these ethnic differences in fatty acid metabolism are confirmed to be linked to disease risk, it would aid in the development of tailored GDM prevention strategies that focus on modifying fatty acid profiles in an ethnic-specific manner.
Figure 1 Schematic of potential horizontal and vertical pleiotropy in relation to fatty acid and cholesterol metabolites and postprandial glucose measures. (A) Illustration of horizontal pleiotropy. (B) Illustration of vertical pleiotropy. Vertical pleiotropy does not result in a violation of the 2nd MR assumption because the metabolites progress along a single linear causal pathway.
The analyses found no association between triglycerides and dysglycemia in SAs. This agrees with a recent meta-analysis that concluded, although triglyceride levels associated with likelihood of GDM (I2 ≥ 84%) (80), after stratification by culture/geographical location, they found no association between triglyceride levels and likelihood of GDM. The reasons for this are unclear but it has also been shown that SAs have a higher prevalence of hypertriglyceridemia than WEs and at lower BMI levels, meaning it is possible that the difference in triglyceride levels and in SA GDM cases and controls is less pronounced than in WEs (81).
4.3 Strengths and limitations
This analysis has several strengths. Firstly, this study involved a large and comprehensive panel of metabolites allowing for the relationships between metabolites and postprandial glucose to be thoroughly investigated. Secondly, this is the first MR study to investigate dysglycemia during pregnancy while also being one of the few MR studies to be conducted in a SA population. Finally, through leave-one-out analyses and the searching of both Phenoscanner and GWAS Catalog databases, violations of the 2nd and 3rd MR assumptions were thoroughly investigated meaning that it was possible to conclude that identified robust associations between genetic variants and outcomes (1st assumption of MR) may not be subjected to horizontal pleiotropy and that identified causal associations are valid due to the absence of detectable violations of the MR assumptions.
Nonetheless, this study has some limitations. Firstly, metabolites are highly correlated meaning it is not possible to confidently interpret that an individual metabolite is independently associated with a postprandial outcome measure. To account for this limitation MR analyses were performed on composite measures of each metabolite class (when PC1 explained ≥70% of the variation in the metabolite class) to assess the overall impact of each metabolite class on pregnancy dysglycemia. Secondly, MR also assumes the level of genetically conferred exposure from conception to the time of measurement is constant, which is unknown when studying metabolites − therefore, we cannot presume that these associations would be observed outside of pregnancy. Thirdly, limited sample size may have led to some underpowered analyses and combined with high consanguinity persuaded us to use statistical modelling to account for ‘relatedness’ (rather than participant pruning) to preserve analytical power and study integrity. We acknowledge that this strategy has limitations, and tested for their effect (i.e., LOO analysis) (25), and look forward to the future when larger SA prospective cohort are available, and we can validate these results with increased confidence. In addition, the limited sample size meant that further adjustment could not be made at the GWAS stage since missing data in certain variables would further reduce sample size (e.g., age) and some associations may have been underpowered to detect an effect. However, a post hoc power analyses found that for some metabolite values significant effects only identified in one ethnicity were possible to detect in the alternate ethnicity. Fourthly, some genetic instruments included only one SNP meaning it was not possible to evaluate the impact of pleiotropy for any identified associations involving these instruments. Fifthly, it was not possible to fully assess the presence of associations between SNPs included in significant instruments, potential confounders and T2D traits in SAs due to the limited number of GWAS conducted in SAs. Although it is likely that many of the associations in WEs are also present in SAs, it is possible that not all associations identified in WE are present in SAs and that some SA-specific pleiotropic associations are unknown. Lastly, due to limitations in data availability in SAs a two-sample MR could not be conducted meaning it was not possible to assess the generalisability of these findings.
5 Conclusions
The presence of causal relationships between a comprehensive set of distinct metabolites and metabolite families with postprandial glucose measures (fasting glucose and 2-hour post glucose) in mid-pregnancy has been established in a UK SA and WE population. This study has found a range of metabolite values to be associated with postprandial glucose measures in WEs and high-risk SA women, although more associations were identified in WEs despite these individuals being at lower risk of GDM. In high-risk SA women, total n-6 fatty acids and the n-6 fatty acid, LA appear to increase postprandial glucose levels suggesting that fatty acids may be responsible for a large proportion of metabolically driven risk for GDM experienced by this population. Future work in a larger sample (potentially using a two-sample MR) and a larger panel of metabolites is needed to investigate our findings and hypotheses more closely, ideally over the course of a pregnancy in order to aid in GDM prevention in this high-risk population.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found here: Born in Bradford [https://borninbradford.nhs.uk].
Author contributions
HF, MI, and MZ contributed to conception and design of the study. HF organized the database and performed the statistical analysis. JM, MI, and MZ supervised the study. HF and MZ wrote the first draft of the manuscript. All authors contributed to the article and approved the submitted version.
Funding
MZ is currently funded by the Wellcome Trust [217446/Z/19/Z]. The Born in Bradford data reported here were supported by a Wellcome Trust infrastructure grant (101597) and the National Institute for Health Research ARC Yorkshire and Humber [NIHR200166]. Funding for the metabolomics analyses has been provided by the US National Institutes of Health [R01 DK10324], the European Research Council (ERC) under the European Union’s Seventh Framework Programme [FP7/2007-2013]/ERC grant agreement no 669545 and the MRC via the MRC Integrative Epidemiology Unit Programme [MC_UU_00011/6].
Acknowledgments
Born in Bradford is only possible because of the enthusiasm and commitment of the children and parents in BiB. We are grateful to all the participants, health professionals, schools and researchers who have made Born in Bradford happen. This work was undertaken on ARC4, part of the High Performance Computing (HPC) facilities at the University of Leeds, UK.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2023.1157416/full#supplementary-material
References
1. Plows JF, Stanley JL, Baker PN, Reynolds CM, Vickers MH. The pathophysiology of gestational diabetes mellitus. Int J Mol Sci (2018) 19(11):3342. doi: 10.3390/ijms19113342
2. Taylor K, Ferreira DLS, West J, Yang T, Caputo M, Lawlor DA. Differences in pregnancy metabolic profiles and their determinants between white European and south Asian women: findings from the born in Bradford cohort. Metabolites (2019) 9(9):190. doi: 10.3390/metabo9090190
3. Mills HL, Patel N, White SL, Pasupathy D, Briley AL, Santos Ferreira DL, et al. The effect of a lifestyle intervention in obese pregnant women on gestational metabolic profiles: findings from the UK pregnancies better eating and activity trial (UPBEAT) randomised controlled trial. BMC Med (2019) 17(1):15. doi: 10.1186/s12916-018-1248-7
4. Kampmann U, Knorr S, Fuglsang J, Ovesen P. Determinants of maternal insulin resistance during pregnancy: an updated overview. J Diabetes Res (2019) 2019:5320156. doi: 10.1155/2019/5320156
5. McIntyre HD, Catalano P, Zhang C, Desoye G, Mathiesen ER, Damm P. Gestational diabetes mellitus. Nat Rev Dis Primers (2019) 5(1):47. doi: 10.1038/s41572-019-0098-8
6. Rodrigo N, Glastras SJ. The emerging role of biomarkers in the diagnosis of gestational diabetes mellitus. J Clin Med (2018) 7(6):120. doi: 10.3390/jcm7060120
7. Menard V, Sotunde OF, Weiler HA. Ethnicity and immigration status as risk factors for gestational diabetes mellitus, anemia and pregnancy outcomes among food insecure women attending the Montreal diet dispensary program. Can J Diabetes. (2020) 44(2):139–45.e1. doi: 10.1016/j.jcjd.2019.05.004
8. Gadve SS, Chavanda S, Mukherjee AD, Aziz S, Joshi A, Patwardhan M. Risk of developing type 2 diabetes mellitus in south Asian women with history of gestational diabetes mellitus: a systematic review and meta-analysis. Indian J Endocrinol Metab (2021) 25(3):176–81. doi: 10.4103/ijem.IJEM_57_21
9. Fuller H, Iles M, Moore JB, Zulyniak MA. Unique metabolic profiles associate with gestational diabetes and ethnicity in low and high-risk women living in the UK. J Nutr 152(10):2186–97. doi: 10.1093/jn/nxac163
10. Fuller H, Moore JB, Iles MM, Zulyniak MA. Ethnic-specific associations between dietary consumption and gestational diabetes mellitus incidence: a meta-analysis. PloS Global Public Health (2022) 2(5):e0000250. doi: 10.1371/journal.pgph.0000250
11. Griffith RJ, Alsweiler J, Moore AE, Brown S, Middleton P, Shepherd E, et al. Interventions to prevent women from developing gestational diabetes mellitus: an overview of cochrane reviews. Cochrane Database Syst Rev (2020) 6:CD012394. doi: 10.1002/14651858.CD012394.pub3
12. de Souza RJ, Shanmuganathan M, Lamri A, Atkinson SA, Becker A, Desai D, et al. Maternal diet and the serum metabolome in pregnancy: robust dietary biomarkers generalizable to a multiethnic birth cohort. Curr Dev Nutr (2020) 4(10):nzaa144–nzaa. doi: 10.1093/cdn/nzaa144
13. Wang QY, You LH, Xiang LL, Zhu YT, Zeng Y. Current progress in metabolomics of gestational diabetes mellitus. World J Diabetes (2021) 12(8):1164–86. doi: 10.4239/wjd.v12.i8.1164
14. Sheehan NA, Meng S, Didelez V. Mendelian randomisation: a tool for assessing causality in observational epidemiology. In: Teare MD, editor. Genetic epidemiology Totowa, NJ: Humana Press (2011). p. 153–66.
15. Liu J, van Klinken JB, Semiz S, van Dijk KW, Verhoeven A, Hankemeier T, et al. A mendelian randomization study of metabolite profiles, fasting glucose, and type 2 diabetes. Diabetes (2017) 66(11):2915–26. doi: 10.2337/db17-0199
16. Carreras-Torres R, Johansson M, Gaborieau V, Haycock PC, Wade KH, Relton CL, et al. The role of obesity, type 2 diabetes, and metabolic factors in pancreatic cancer: a mendelian randomization study. J Natl Cancer Inst (2017) 109(9):djx012. doi: 10.1093/jnci/djx012
17. Wright J, Small N, Raynor P, Tuffnell D, Bhopal R, Cameron N, et al. Cohort profile: the born in Bradford multi-ethnic family cohort study. Int J Epidemiol (2013) 42(4):978–91. doi: 10.1093/ije/dys112
18. Bird PK, McEachan RRC, Mon-Williams M, Small N, West J, Whincup P, et al. Growing up in Bradford: protocol for the age 7-11 follow up of the born in Bradford birth cohort. BMC Public Health (2019) 19(1):939. doi: 10.1186/s12889-019-7222-2
19. Lawlor DA, West J, Fairley L, Nelson SM, Bhopal RS, Tuffnell D, et al. Pregnancy glycaemia and cord-blood levels of insulin and leptin in Pakistani and white British mother-offspring pairs: findings from a prospective pregnancy cohort. Diabetologia (2014) 57(12):2492–500. doi: 10.1007/s00125-014-3386-6
20. Josse J, Husson F. missMDA: a package for handling missing values in multivariate data analysis. J Stat Software (2016) 70(1):1–31. doi: 10.18637/jss.v070.i01
21. Arciero E, Dogra SA, Malawsky DS, Mezzavilla M, Tsismentzoglou T, Huang QQ, et al. Fine-scale population structure and demographic history of British pakistanis. Nat Commun (2021) 12(1):7189. doi: 10.1038/s41467-021-27394-2
22. Marees AT, de Kluiver H, Stringer S, Vorspan F, Curis E, Marie-Claire C, et al. A tutorial on conducting genome-wide association studies: quality control and statistical analysis. Int J Methods Psychiatr Res (2018) 27(2):e1608. doi: 10.1002/mpr.1608
23. Uffelmann E, Huang QQ, Munung NS, de Vries J, Okada Y, Martin AR, et al. Genome-wide association studies. Nat Rev Methods Primers (2021) 1(1). doi: 10.1038/s43586-021-00056-9
24. Sheridan E, Wright J, Small N, Corry PC, Oddie S, Whibley C, et al. Risk factors for congenital anomaly in a multiethnic birth cohort: an analysis of the born in Bradford study. Lancet (2013) 382(9901):1350–9. doi: 10.1016/S0140-6736(13)61132-0
25. Yang J, Zaitlen NA, Goddard ME, Visscher PM, Price AL. Advantages and pitfalls in the application of mixed-model association methods. Nat Genet (2014) 46(2):100–6. doi: 10.1038/ng.2876
26. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet (2007) 81(3):559–75. doi: 10.1086/519795
27. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a toolset for whole-genome association and population-based linkage analysis. Am J Hum Genet (2007) 81
28. Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet (2011) 88(1):76–82. doi: 10.1016/j.ajhg.2010.11.011
29. van den Berg S, Vandenplas J, van Eeuwijk FA, Lopes MS, Veerkamp RF. Significance testing and genomic inflation factor using high-density genotypes or whole-genome sequence data. J Anim Breed Genet (2019) 136(6):418–29. doi: 10.1111/jbg.12419
30. Rivadeneira F, Uitterlinden AG. Chapter 18 - genetics of osteoporosis. In: Dempster DW, Cauley JA, Bouxsein ML, Cosman F, editors. Marcus And feldman’s osteoporosis, Fifth Edition. (London EC2Y 5AS, United Kingdom: Academic Press (2021). p. 405–51.
31. Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics (2010) 26(17):2190–1. doi: 10.1093/bioinformatics/btq340
32. Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun (2017) 8(1):1826. doi: 10.1038/s41467-017-01261-5
33. Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-base platform supports systematic causal inference across the human phenome. Elife (2018) 7:e34408. doi: 10.7554/eLife.34408
34. Machiela MJ, Chanock SJ. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics (2015) 31(21):3555–7. doi: 10.1093/bioinformatics/btv402
35. Myers TA, Chanock SJ, Machiela MJ. LDlinkR: an r package for rapidly calculating linkage disequilibrium statistics in diverse populations. Front Genet (2020) 11:157. doi: 10.3389/fgene.2020.00157
36. The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature (2015) 526(7571):68–74. doi: 10.1038/nature15393
37. R Development Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing (2010).
39. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience (2015) 4(1). doi: 10.1186/s13742-015-0047-8
40. Budu-Aggrey A, Brumpton B, Tyrrell J, Watkins S, Modalsli EH, Celis-Morales C, et al. Evidence of a causal relationship between body mass index and psoriasis: a mendelian randomization study. PloS Med (2019) 16(1):e1002739. doi: 10.1371/journal.pmed.1002739
41. Fox J, Kleiber C, Zeileis A. _ivreg: instrumental-variables regression by ‘2SLS’, ‘2SM’, or ‘2SMM’, with diagnostics. R package version 0.6-1 ed2021. (2020)
42. Skrivankova VW, Richmond RC, Woolf BAR, Davies NM, Swanson SA, VanderWeele TJ, et al. Strengthening the reporting of observational studies in epidemiology using mendelian randomisation (STROBE-MR): explanation and elaboration. BMJ (2021) 375:n2233. doi: 10.1136/bmj.n2233
43. Buniello A, MacArthur JAL, Cerezo M, Harris LW, Hayhurst J, Malangone C, et al. The NHGRI-EBI GWAS catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res (2019) 47:D1005–D1012. doi: 10.1093/nar/gky1120
44. Staley JR, Blackshaw J, Kamat MA, Ellis S, Surendran P, Sun BB, et al. PhenoScanner: a database of human genotype-phenotype associations. Bioinformatics (2016) 32(20):3207–9. doi: 10.1093/bioinformatics/btw373
45. Kamat MA, Blackshaw JA, Young R, Surendran P, Burgess S, Danesh J, et al. PhenoScanner V2: an expanded tool for searching human genotype-phenotype associations. Bioinformatics (2019) 35(22):4851–3. doi: 10.1093/bioinformatics/btz469
46. Davies NM, Holmes MV, Davey Smith G. Reading mendelian randomisation studies: a guide, glossary, and checklist for clinicians. Bmj (2018) 362:k601. doi: 10.1136/bmj.k601
47. Brion M-JA, Shakhbazov K, Visscher PM. Calculating statistical power in mendelian randomization studies. Int J Epidemiol (2012) 42(5):1497–501. doi: 10.1093/ije/dyt179
48. Howe KL, Achuthan P, Allen J, Allen J, Alvarez-Jarreta J, Amode MR, et al. Ensembl 2021. Nucleic Acids Res (2020) 49(D1):D884–D91. doi: 10.1093/nar/gkaa942
49. Ensembl. gene: SORCS1 e! ensembl: ensembl release 105; 2021 . Available at: https://www.ensembl.org/Homo_sapiens/Gene/Summary?g=ENSG00000108018;r=10:106573663-107164706.
50. Lawlor DA, Harbord RM, Sterne JA, Timpson N, Davey Smith G. Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Stat Med (2008) 27(8):1133–63. doi: 10.1002/sim.3034
51. Gulbins E, Szabo I, Baltzer K, Lang F. Ceramide-induced inhibition of T lymphocyte voltage-gated potassium channel is mediated by tyrosine kinases. Proc Natl Acad Sci USA (1997) 94(14):7661–6. doi: 10.1073/pnas.94.14.7661
52. Chai JF, Raichur S, Khor IW, Torta F, Chew WS, Herr DR, et al. Associations with metabolites in Chinese suggest new metabolic roles in alzheimer’s and parkinson’s diseases. Hum Mol Genet (2020) 29(2):189–201. doi: 10.1093/hmg/ddz246
53. Yki-Jarvinen H, Luukkonen PK, Hodson L, Moore JB. Dietary carbohydrates and fats in nonalcoholic fatty liver disease. Nat Rev Gastroenterol Hepatol (2021) 18(11):770–86. doi: 10.1038/s41575-021-00472-y
54. Pervjakova N, Moen G-H, Borges M-C, Ferreira T, Cook JP, Allard C, et al. Multi-ancestry genome-wide association study of gestational diabetes mellitus highlights genetic links with type 2 diabetes. Hum Mol Genet (2022) 31(19):3377–91. doi: 10.1093/hmg/ddac050
55. Rondanelli M, Nichetti M, Peroni G, Faliva MA, Naso M, Gasparri C, et al. Where to find leucine in food and how to feed elderly with sarcopenia in order to counteract loss of muscle mass: practical advice. Front Nutr (2021) 7:622391. doi: 10.3389/fnut.2020.622391
56. Nie C, He T, Zhang W, Zhang G, Ma X. Branched chain amino acids: beyond nutrition metabolism. Int J Mol Sci (2018) 19(4):954. doi: 10.3390/ijms19040954
57. Leenders M, van Loon LJ. Leucine as a pharmaconutrient to prevent and treat sarcopenia and type 2 diabetes. Nutr Rev (2011) 69(11):675–89. doi: 10.1111/j.1753-4887.2011.00443.x
58. Liu Y, Kuang A, Talbot O, Bain JR, Muehlbauer MJ, Hayes MG, et al. Metabolomic and genetic associations with insulin resistance in pregnancy. Diabetologia (2020) 63(9):1783–95. doi: 10.1007/s00125-020-05198-1
59. CDC. LDL and HDL cholesterol: “Bad” and “Good” cholesterol: US.gov (2020). Available at: https://www.cdc.gov/cholesterol/ldl_hdl.htm.
60. Ryckman KK, Spracklen CN, Smith CJ, Robinson JG, Saftlas AF. Maternal lipid levels during pregnancy and gestational diabetes: a systematic review and meta-analysis. BJOG (2015) 122(5):643–51. doi: 10.1111/1471-0528.13261
61. Wong NKP, Nicholls SJ, Tan JTM, Bursill CA. The role of high-density lipoproteins in diabetes and its vascular complications. Int J Mol Sci (2018) 19(6):1680. doi: 10.3390/ijms19061680
62. Mokkala K, Vahlberg T, Pellonperä O, Houttu N, Koivuniemi E, Laitinen K. Distinct metabolic profile in early pregnancy of overweight and obese women developing gestational diabetes. J Nutr (2020) 150(1):31–7. doi: 10.1093/jn/nxz220
63. Qi Q, Liang L, Doria A, Hu FB, Qi L. Genetic predisposition to dyslipidemia and type 2 diabetes risk in two prospective cohorts. Diabetes (2012) 61(3):745–52. doi: 10.2337/db11-1254
64. Haase CL, Tybjærg-Hansen A, Nordestgaard BG, Frikke-Schmidt R. HDL cholesterol and risk of type 2 diabetes: a mendelian randomization study. Diabetes (2015) 64(9):3328–33. doi: 10.2337/db14-1603
65. Alexopoulos A-S, Qamar A, Hutchins K, Crowley MJ, Batch BC, Guyton JR. Triglycerides: emerging targets in diabetes care? review of moderate hypertriglyceridemia in diabetes. Curr Diabetes Rep (2019) 19(4):13–. doi: 10.1007/s11892-019-1136-3
66. Laufs U, Parhofer KG, Ginsberg HN, Hegele RA. Clinical review on triglycerides. Eur Heart J (2020) 41(1):99–109c. doi: 10.1093/eurheartj/ehz785
67. O’Malley EG, Reynolds CME, Killalea A, O’Kelly R, Sheehan SR, Turner MJ. Maternal obesity and dyslipidemia associated with gestational diabetes mellitus (GDM). Eur J Obstet Gynecol Reprod Biol (2020) 246:67–71. doi: 10.1016/j.ejogrb.2020.01.007
68. Gonzalez-Soto M, Mutch DM. Diet regulation of long-chain PUFA synthesis: role of macronutrients, micronutrients, and polyphenols on Δ-5/Δ-6 desaturases and elongases 2/5. Adv Nutr (2021) 12(3):980–94. doi: 10.1093/advances/nmaa142
69. Hosseinkhani S, Dehghanbanadaki H, Aazami H, Pasalar P, Asadi M, Razi F. Association of circulating omega 3, 6 and 9 fatty acids with gestational diabetes mellitus: a systematic review. BMC Endocrine Disord (2021) 21(1):120. doi: 10.1186/s12902-021-00783-w
70. Bragg F, Kartsonaki C, Guo Y, Holmes M, Du H, Yu C, et al. Circulating metabolites and the development of type 2 diabetes in Chinese adults. Diabetes Care (2021) 45(2):477–80. doi: 10.2337/dc21-1415
71. Zulyniak MA, Fuller H, Iles MM. Investigation of the causal association between long-chain n-6 polyunsaturated fatty acid synthesis and the risk of type 2 diabetes: a mendelian randomization analysis. Lifestyle Genomics (2020) 13(5):146–53. doi: 10.1159/000509663
72. Pan X-F, Huang Y, Li X, Wang Y, Ye Y, Chen H, et al. Circulating fatty acids and risk of gestational diabetes mellitus: prospective analyses in China. Eur J Endocrinol (2021) 185(1):87–97. doi: 10.1530/eje-21-0118
73. MedlinePlus. APOB gene national library of medicine (US): NIH (2021). Available at: https://medlineplus.gov/genetics/gene/apob/.
74. Mensink RP, Zock PL, Kester AD, Katan MB. Effects of dietary fatty acids and carbohydrates on the ratio of serum total to HDL cholesterol and on serum lipids and apolipoproteins: a meta-analysis of 60 controlled trials. Am J Clin Nutr (2003) 77(5):1146–55. doi: 10.1093/ajcn/77.5.1146
75. Jacobson TA, Glickstein SB, Rowe JD, Soni PN. Effects of eicosapentaenoic acid and docosahexaenoic acid on low-density lipoprotein cholesterol and other lipids: a review. J Clin Lipidol. (2012) 6(1):5–18. doi: 10.1016/j.jacl.2011.10.018
76. Ooi EM, Ng TW, Watts GF, Barrett PH. Dietary fatty acids and lipoprotein metabolism: new insights and updates. Curr Opin Lipidol. (2013) 24(3):192–7. doi: 10.1097/MOL.0b013e3283613ba2
77. Schoenaker DA, Mishra GD, Callaway LK, Soedamah-Muthu SS. The role of energy, nutrients, foods, and dietary patterns in the development of gestational diabetes mellitus: a systematic review of observational studies. Diabetes Care (2016) 39(1):16–23. doi: 10.2337/dc15-0540
78. Koletzko B, Reischl E, Tanjung C, Gonzalez-Casanova I, Ramakrishnan U, Meldrum S, et al. FADS1 and FADS2 polymorphisms modulate fatty acid metabolism and dietary impact on health. Annu Rev Nutr (2019) 39:21–44. doi: 10.1146/annurev-nutr-082018-124250
79. Ameur A, Enroth S, Johansson A, Zaboli G, Igl W, Johansson AC, et al. Genetic adaptation of fatty-acid metabolism: a human-specific haplotype increasing the biosynthesis of long-chain omega-3 and omega-6 fatty acids. Am J Hum Genet (2012) 90(5):809–20. doi: 10.1016/j.ajhg.2012.03.014
80. Hu J, Gillies CL, Lin S, Stewart ZA, Melford SE, Abrams KR, et al. Association of maternal lipid profile and gestational diabetes mellitus: a systematic review and meta-analysis of 292 studies and 97,880 women. EClinicalMedicine (2021) 34:100830. doi: 10.1016/j.eclinm.2021.100830
Keywords: genetics, GDM, gestational diabetes, South Asian, metabolism, glucose, pregnancy
Citation: Fuller H, Iles MM, Moore JB and Zulyniak MA (2023) Metabolic drivers of dysglycemia in pregnancy: ethnic-specific GWAS of 146 metabolites and 1-sample Mendelian randomization analyses in a UK multi-ethnic birth cohort. Front. Endocrinol. 14:1157416. doi: 10.3389/fendo.2023.1157416
Received: 02 February 2023; Accepted: 01 May 2023;
Published: 15 May 2023.
Edited by:
Haifeng Hou, Shandong First Medical University, ChinaReviewed by:
Rachelle Elizabeth Irwin, Ulster University, United KingdomSinan Tanyolac, Istanbul University, Türkiye
Copyright © 2023 Fuller, Iles, Moore and Zulyniak. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Michael A. Zulyniak, bS5hLnp1bHluaWFrQGxlZWRzLmFjLnVr