Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 18 October 2021
Sec. Applied Genetic Epidemiology
This article is part of the Research Topic Diversity in Immune-Mediated Diseases View all 6 articles

Comparing Genetic and Socioenvironmental Contributions to Ethnic Differences in C-Reactive Protein

  • 1School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, United States
  • 2IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, United States
  • 3National Institute on Minority Health and Health Disparities, National Institutes of Health, Bethesda, MD, United States
  • 4PanAmerican Bioinformatics Institute, Cali, Colombia

C-reactive protein (CRP) is a routinely measured blood biomarker for inflammation. Elevated levels of circulating CRP are associated with response to infection, risk for a number of complex common diseases, and psychosocial stress. The objective of this study was to compare the contributions of genetic ancestry, socioenvironmental factors, and inflammation-related health conditions to ethnic differences in C-reactive protein levels. We used multivariable regression to compare CRP blood serum levels between Black and White ethnic groups from the United Kingdom Biobank (UKBB) prospective cohort study. CRP serum levels are significantly associated with ethnicity in an age and sex adjusted model. Study participants who identify as Black have higher average CRP than those who identify as White, CRP increases with age, and females have higher average CRP than males. Ethnicity and sex show a significant interaction effect on CRP. Black females have higher average CRP levels than White females, whereas White males have higher average CRP than Black males. Significant associations between CRP, ethnicity, and genetic ancestry are almost completely attenuated in a fully adjusted model that includes socioenvironmental factors and inflammation-related health conditions. BMI, smoking, and socioeconomic deprivation all have high relative effects on CRP. These results indicate that socioenvironmental factors contribute more to CRP ethnic differences than genetics. Differences in CRP are associated with ethnic disparities for a number of chronic diseases, including type 2 diabetes, essential hypertension, sarcoidosis, and lupus erythematosus. Our results indicate that ethnic differences in CRP are linked to both socioenvironmental factors and numerous ethnic health disparities.

Introduction

C-reactive protein (CRP) is synthesized by hepatocytes and secreted to the bloodstream in response to inflammation. CRP is employed as a serum biomarker for both acute and chronic inflammation, with important implications for immune response and overall health (Pepys and Hirschfield 2003; Black et al., 2004). Elevated levels of CRP have been shown to be associated with an increased risk of diabetes (Dehghan et al., 2007), cardiovascular disease (Danesh et al., 2004), psychological stress (Wium-Andersen et al., 2013), and all-cause mortality (Zacho et al., 2010).

CRP blood serum levels vary across ethnic groups (Nazmi and Victora 2007), with a number of studies showing that Black patients have higher average levels of circulating CRP than White patients (Wong et al., 2001; Abramson et al., 2002; Ford 2002; Danner et al., 2003; Khera et al., 2005; Matthews et al., 2005; Alley et al., 2006; Lakoski et al., 2006). Ethnic differences of this kind are likely to have multifactorial causes, including contributions from genetic, socioeconomic, and environmental factors. Given the fact that ethnicity co-varies with all of these classes of risk factors, it is difficult to tease apart the genetic and socioenvironmental contributions to ethnic health disparities. This is further complicated by the fact that socially defined ethnicity is an imprecise proxy for genetic diversity.

We use genetic ancestry inference as a means to disambiguate genetic and socioenvironmental effects on ethnic health disparities. Genetic ancestry refers to patterns of genetic diversity that are linked to the geographical origins of human populations (Mathieson and Scally 2020). Individuals who share common ancestors have genetic similarities, and distinct ancestry groups show correlated allele frequency differences (Bergstrom et al., 2020; Genomes Project et al., 2015). Genetic ancestry can be defined objectively, using comparative genomic analysis, without relying on socially defined ethnic groups (Yudell et al., 2016). Patterns of genetic ancestry can be compared to self-identified ethnicity to understand the extent to which they overlap and how they may differ (Tang et al., 2005; Banda et al., 2015; Fang et al., 2019). Modelling of health outcomes with genetic ancestry and socioenvironmental factors as independent (predictor) variables can be used to assess how each contribute to health disparities and how they may interact (Choudhry et al., 2007; Nagar et al., 2021).

The objective of this study was to characterize the effects of genetic ancestry, socioenvironmental factors, and health conditions on ethnic disparities in CRP serum levels. Participants from the United Kingdom Biobank (UKBB) prospective cohort study who self-identified as belonging to Black or White ethnic groups were characterized with respect to CRP levels, genome-wide genotypes, and a variety of socioenvironmental factors and health conditions that show prevalence disparities across groups. Multivariable regression and relative importance analysis were used in an effort to decompose genetic and socioenvironmental contributions to CRP ethnic disparities. Age and sex were considered as covariates in all models given their known associations with CRP serum levels. Given that genetic and socioenvironmental contributions to health outcomes can co-vary across ethnic groups, we hypothesized that the inclusion of socioenvironmental covariates could attenuate the association between ethnicity, genetic ancestry, and CRP levels.

Materials and Methods

Study Cohort

Study participants and data were taken from the United Kingdom Biobank (UKBB), a prospective cohort study on the effects of demography, environment, and genetics on health and disease (Bycroft et al., 2018). The UKBB database contains phenotypic, clinical, and genetic information on more than 500,000 participants between the ages of 40 and 70, enrolled from 2006 to 2010. Ethics approval for the UKBB was obtained from the North West Multi-centre Research Ethics Committee (MREC) for the United Kingdom, the Patient Information Advisory Group (PIAG) for England and Wales, and the Community Health Index Advisory Group (CHIAG) for Scotland1. UKBB participants self-identified as belonging to a single ethnic group upon enrollment2, and we included participants who identified as Black or White for this study. It should be noted that the UKBB ethnic group labels used here correspond directly to racial group labels from the United States. This study was conducted under the United Kingdom Biobank project #65206 to LMR and IKJ.

Participant Data

UKBB participants completed questionnaires, nurse-led interviews, and medical assessments upon enrollment and provided access to their electronic health records. We accessed participant information on age (Field 21003: Age when attended assessment center), BMI (Field 21001: Body mass index (BMI), ethnicity (Field 21000: Ethnic background), ICD-10 disease diagnosis codes (Fields 41270: Diagnoses–ICD10), insomnia (Field 1200: Sleeplessness/insomnia), recruitment year (Field 53: Date of attending assessment centre), sex (Field 31: Sex), smoking status (Field 20116: Smoking status), and Townsend deprivation index (Field 189: Townsend deprivation index at recruitment) from the UKBB data portal. If multiple instances of participant data were available from a follow-up visit, only data collected during the initial assessment visit (2006–2010) were used for analysis.

The Townsend deprivation index is a widely used measure of socioeconomic deprivation known to be associated with poor health outcomes (Foster et al., 2018). It combines four variables–unemployment, non-car ownership, non-home ownership, and household overcrowding–to generate a numerical score (Townsend et al., 1988), which ranges from −6.26 to 11.0 in the UKBB study cohort. Negative values indicate less socioeconomic deprivation, and relative affluence, whereas higher scores indicate greater socioeconomic deprivation.

UKBB participants provided whole blood samples for characterization of protein biomarkers and DNA as previously described (Elliott et al., 2008). C-reactive protein (CRP) blood serum levels were measured as mg/L units using the immuno-turbidimetric method with the Beckman Coulter AU5800 clinical chemistry analyzer3. This procedure corresponds to the high-sensitivity (hs) CRP test. DNA was extracted from 850 μl buffy coat blood aliquots4, and participant genome-wide genotypes were characterized using the UKBB Axiom Array or United Kingdom BiLEVE Array (Welsh et al., 2017).

Disease Case/Control Cohorts

Disease (or health condition) diagnoses for study participants were taken from UKBB ICD-10 diagnosis codes, which were then converted into disease-specific phenotype codes (phecodes) using the scheme developed by the PheWAS consortium (Carroll et al., 2014). Phecodes have been manually curated and validated by disease experts, and they are widely used for the analysis of electronic health record data (Wu et al., 2019). The phecode scheme provides ICD-10 code inclusion and exclusion criteria for each individual disease in order to define disease-specific case/control cohorts that can be confidently compared. For example, when studying participants with type 2 diabetes, participants with type 1 diabetes are removed from the control cohort to avoid any overlapping genetic or environmental signals that might be common to both. This approach improves power for the detection of disease-specific association signals when modelling case/control status. Phecode case/control cohorts were curated for a total of 1,537 diseases or health-related conditions.

Genetic Ancestry Inference

UKBB participant genome-wide genotypes were merged and harmonized with whole genome sequence data from global reference populations characterized as part of the 1,000 Genomes Project (1KGP) and the Human Genome Diversity Project (HGDP) (Bergstrom, et al., 2020; Genomes Project, et al., 2015). Global reference populations were grouped into six regional ancestry groups based on their genetic and geographic affinity, including African (sub-Saharan) and European reference population groups (Supplementary Table 1).

UKBB, 1KGP, and HGDP genomic variant data were merged to include variants present in all three data sets. Minor allele frequency >1% and variant sample missingness <5% filters were used for merging, with variant strand flips and identifier inconsistencies corrected as needed. The merged genome variant data set was pruned for linkage disequilibrium using the program PLINK v2 (Chang et al., 2015).

Principal component analysis (PCA) of the harmonized UKBB, 1KGP, and HGDP genome variant dataset was performed using the FastPCA program implemented in PLINK v2 (Galinsky et al., 2016). Data for the first ten PCs were used to infer UKBB participant genetic ancestry fractions for African, European, and other regional ancestry groups. Our PCA-based genetic ancestry inference approach compares PCA data from UKBB participants to PCA data from reference population individuals using non-negative least squares to assign genetic ancestry fractions for regional ancestry groups as previously described (Conley et al., 2017; Jordan et al., 2019). Participants showing >5% non-African or non-European ancestry fractions were excluded from the study cohort.

Statistical Modeling

All statistical analyses were performed using the R statistical language v3.6.1 (Team R. C, 2013). Forest plots were generated using the forestmodel R package (Kennedy 2020). Other plots were generated using the ggplot R package (Wickham 2009).

CRP blood serum levels are measured in mg/L units by the UKBB. Log transformed CRP blood serum levels were modeled as a continuous outcome using linear regression, and clinically elevated CRP (>3 mg/L) was modeled as a binary outcome using logistic regression using the “glm” function in R. Independent (predictor) variables for the base models included ethnicity, age, and sex. Ethnicity was modeled as a binary variable (Black or White), age was mean centered and modeled as a continuous variable, and sex was modeled as a binary variable (female or male). Independent (predictor) variables for the fully adjusted models included BMI, insomnia, major depressive disorder, PC1, and PC2, recruitment year, smoking status, and the Townsend deprivation index. BMI, PC1, and PC2, and the Townsend index were modeled as continuous variables, and insomnia and smoking status were modeled as categorical variables, and major depressive disorder was modeled as a binary variable. Relative importance analysis was used to assess the relative contributions of the different covariates in the fully adjusted model, while accounting for multicollinearity. The R package “relaimpo” was used for analysis.

The odds of prevalence of specific diseases or health conditions were modeled as the dependent (outcome) variable with multivariable logistic regression computed using the “glm” function in R. Independent (predictor) variables for disease models included ethnicity, mean centered age, sex, and log transformed CRP levels.

Model specifications and detailed results for all statistical models are provided in the Supplementary Material. For each model, we provide the regression equation and model coefficients, along with effect size estimates, standard errors, z-values, and p-values for each model coefficient.

Results

C-Reactive Protein, Ethnicity, Age, and Sex

The study cohort is made up of 433,298 United Kingdom Biobank participants who self-identify as Black (n = 6,456) or White (n = 426,842) (Table 1). Males make up 45.7% of the cohort compared to 54.3% females, and the mean age of cohort participants is 57. C-reactive protein (CRP) blood serum levels vary by ethnicity, age, and sex. Black participants show a mean CRP level of 2.75 mg/L, and White participants show mean CRP level of 2.59 mg/L (t = 2.72, p = 0.01; Table 1 and Figure 1A). 25% of Black participants have clinically elevated CRP compared to 22.6% of White participants (Table 1). Participant CRP levels increase with increasing age (Figure 1B), and females show higher mean CRP levels than males (t = 17.22, p < 2.2 × 10−16; Figure 1C). When CRP levels are modeled by ethnicity, age, and sex, Black ethnicity and age show significant positive associations with CRP, whereas male sex shows a significant negative association with CRP (ethnicity: β = 0.08, p = ∼0, age: β = 0.02, p = ∼0, sex: β = −0.06, p = ∼0; Figure 1D and Supplementary Table 2). These results are robust to a sensitivity analysis conducted by removing individuals with high levels of CRP (≥10 mg/L), indicative of acute infections (Supplementary Table 3). Similarly, when clinically elevated CRP (>3 mg/L) is modeled as a binary outcome, Black ethnicity and age show significant positive associations, and male sex shows a significant negative association (ethnicity: β = 0.04, p = ∼0, age: β = 0.003, p = ∼0, sex: β = −0.04, p = ∼0; Supplementary Table 4).

TABLE 1
www.frontiersin.org

TABLE 1. Characteristics of the United Kingdom Biobank participant cohort.

FIGURE 1
www.frontiersin.org

FIGURE 1. C-reactive protein (CRP), ethnicity, age, and sex. Average CRP serum levels (±95% CI) are shown for (A) Black and White participants (B) age ranges, and (C) female and male participants. (D) Forest plot showing the results of the multivariable linear regression model of participant CRP serum levels. Effect sizes, 95% CIs, and p-values are shown for ethnicity, age, and sex.

Inclusion of interaction terms in the CRP linear regression model revealed a significant interaction between ethnicity and sex (Supplementary Table 5). A likelihood ratio test showed a significantly better fit for a model with the ethnicity-sex interaction term compared to the model with no interaction term, providing additional support for the interaction (Supplementary Table 6). The observed ethnicity-sex interaction results from higher CRP for Black female participants compared to White female participants and lower CRP for Black male participants compared to White male participants (Figure 2).

FIGURE 2
www.frontiersin.org

FIGURE 2. C-reactive protein (CRP) interaction effect of sex and ethnicity. Average CRP serum levels (±95% CI) are shown for Black and White, female, and male participants.

Ethnicity and Genetic Ancestry

Black and White participants differ with respect to mean levels of African and European genetic ancestry (Table 1). Black participants show averages of 87.8% African ancestry and 11.7% European ancestry compared to averages of 99.8% European and 0.03% African ancestry for White participants (African ancestry: t = 671.91, p < 2.2 × 10−16; European ancestry: t = 672.38, p < 2.2 × 10−16).

The relationship between ethnicity and genetic diversity for Black and White participants can be visualized via principal components analysis of genome-wide genotype data (Figure 3A). Principal components one and two separate participants by ethnicity along a continuum of genetic diversity, whereas principal component two alone shows more within ethnic group differences among participants. Black participants show a range of admixture between African and European genetic ancestry fractions, whereas White participants show almost entirely European ancestry (Figure 3B). The probability of participant self-identification as Black or White shifts in the range of 23–44% African ancestry (Figure 3C). Participants are more likely to identify as Black, and less likely to identify as White, if they have ≥29% African ancestry.

FIGURE 3
www.frontiersin.org

FIGURE 3. Ethnicity and genetic ancestry. (A) Principal components analysis showing the relationship between participant ethnicity and genetic diversity. (B) Participant ethnicity (left) compared to participant genetic ancestry fractions (right). (C) Probability of participant ethnic self-identity (y-axis) compared to African genetic ancestry (x-axis). Dots are color-coded by participant ethnicity and each dot shows the probability of self-identification as Black or White across one hundred bins of African ancestry. Black and White self-identification probability trend lines were fit using loess regression.

Genetic and Socioenvironmental Effects on C-Reactive Protein Ethnic Disparities

Multivariable regression and relative importance analysis were used to model the effects of ethnicity, genetic ancestry, socioenvironmental factors, and health conditions that disproportionately affect different ethnic groups on CRP serum levels. Genetic ancestry was modeled using the first two principal components, which show the greatest separation between African and European ancestry components. Socioeconomic deprivation was modeled using the Townsend index, BMI, insomnia, and major depressive disorder (MDD) were included as disparate health conditions, and recruitment year and smoking were included as environmental exposures. The effect of recruitment year was considered to account for possible psychosocial stress associated with the financial crisis of 2007–2008. The effects of all covariates on CRP were considered separately using age and sex adjusted models and together with a fully adjusted model (Table 2 and Supplementary Table 7).

TABLE 2
www.frontiersin.org

TABLE 2. C-reactive protein multivariable linear regression models. Covariates are shown along with the category labels for categorical variables. The effect of each covariate on CRP serum levels was considered separately with age and sex adjusted models and together in the fully adjusted model. Effect point estimates are shown for each covariate along with standard errors and p-values in parentheses. The relative importance rank of each covariate in the fully adjusted model is shown.

The significant positive association between Black ethnicity and CRP levels is attenuated in the fully adjusted model, and ethnicity shows the lowest relative importance among the 11 covariates included in the model. The effects of genetic ancestry are similarly attenuated in the fully adjusted model and have the next lowest relative importance values (PC1 = 9, PC2 = 10). BMI has the highest relative effect on CRP in the fully adjusted model followed by age, smoking, socioeconomic deprivation measured by the Townsend index, and sex. Insomnia, recruitment year, and MDD all show significant but marginally important effects on CRP. Considered together, these results are consistent with the hypothesis that ethnic differences in socioenvironmental exposures and health outcomes explain the association between ethnicity, genetic ancestry, and CRP levels.

C-Reactive Protein and Ethnic Health Disparities

The relationship between CRP serum levels and ethnic health disparities was evaluated by independently modeling the effect of CRP and the effect of ethnicity on disease outcomes and comparing the results. There are 116 out of 1,537 diseases analyzed where both CRP and ethnicity showed significant associations with disease status, after correcting for multiple tests using the Bonferroni correction (Figure 4 and Supplementary Table 8). The effect size estimates for all diseases with significant CRP and ethnicity associations were evaluated to identify diseases where differences in CRP serum levels are implicated in ethnic health disparities (Figure 5). The combined effects of CRP and ethnicity on disease outcomes were quantified by summing the ranks of the individual effect sizes. The top 20 diseases ordered via descending effect size rank sums are shown in Table 3, and data for all diseases are shown in Supplementary Table 8. The top ranked diseases include examples of infectious disease (tuberculosis and HIV), metabolic diseases (type 2 diabetes and hypoglycemia), circulatory system diseases (hypertensive chronic kidney disease, hypertensive heart disease, and essential hypertension), schizophrenia, genitourinary diseases (nephrotic syndrome and chronic kidney disease), and dermatologic diseases (lupus erythematosus and sarcoidosis).

FIGURE 4
www.frontiersin.org

FIGURE 4. C-reactive protein (CRP) and ethnic health disparities. Manhattan plots showing the statistical significance levels (-log10 p-values) for associations of disease and CRP (left, red circles) and associations of disease and ethnicity (right, blue circles). Diseases are categorized as shown using the phecode scheme. Bonferroni corrected p-value thresholds are shown with black lines. Bonferroni corrected significant associations are indicated with larger, dark circles and select disease examples are annotated as shown.

FIGURE 5
www.frontiersin.org

FIGURE 5. Effects of C-reactive protein (CRP) and ethnicity on disease. Effect sizes for statistically significant CRP-disease associations (βCRP, y-axis) and significant ethnicity-disease associations (βEthnicity, x-axis) associations. βEthnicity > 0 shows diseases that are positively associated with Black ethnicity, and βEthnicity < 0 shows diseases that are positively associated with White ethnicity. Select disease examples are annotated as shown.

TABLE 3
www.frontiersin.org

TABLE 3. Top 20 diseases implicated for CRP-associated ethnic health disparities.

Discussion

Ethnic Differences in Inflammation are Explained by Socioenvironmental Factors

C-reactive protein (CRP) is a widely used clinical marker of inflammation. Our study of the UKBB found that CRP blood serum levels differ according to participant’s self-identification as belonging to Black or White ethnic groups, and ethnicity in our cohort is highly correlated with genetic ancestry. Given these results, it could be naively expected that differences in genetic ancestry between the Black and White ethnic groups explain differences in CRP levels. However, comparisons between ethnic groups with distinct genetic ancestry profiles can be confounded by gene-environment correlations. Indeed, genetic ancestry, socioenvironmental factors, and health conditions can all co-vary among ethnic groups. We used multivariable regression and relative importance analysis in an effort to decompose the contributions of genetic ancestry, socioenvironmental factors, and health conditions to CRP ethnic disparities.

When differences in socioenvironmental exposures and health outcomes are accounted for, the associations between ethnicity, genetic ancestry, and CRP levels are almost completely attenuated. These results indicate that the environment plays a more important role than genetics in shaping ethnic disparities in inflammation for this cohort. Possible socioenvironmental factors leading to higher levels of CRP observed for Black participants could include psychosocial stress linked to racial discrimination and poverty (Sanders-Phillips et al., 2009; Williams and Mohammed 2009; Lewis et al., 2010; Beatty et al., 2014). The high relative effects of BMI and smoking on CRP suggests that aspects of diet and lifestyle associated with socioeconomic deprivation could also be linked to ethnic differences in inflammation (Feinstein 1993; Inglis et al., 2005; Govil et al., 2009).

Although we are making a semantic distinction between genetic effects, as measured by genetic ancestry, and the effects of socioenvironmental factors and health conditions, it should be noted that the socioenvironmental and health covariates modeled here are also likely be influenced by genetics. For example, BMI and smoking are both highly heritable traits. The effect of sex also likely includes a genetic component based on chromosomal differences between males and females. In other words, our approach to decomposing genetic and environmental contributions to health disparities is limited by the pervasive contributions of both genetics and the environment to human traits.

Interaction Between Ethnicity and Sex

UKBB participant CRP blood serum levels vary by ethnicity, age, and sex. Modeling CRP levels with all of these factors reveled a highly significant interaction effect between ethnicity and sex. Black females show higher CRP levels than White females, whereas Black males have lower CRP than White males. Thus, Black females are at the highest risk of chronic inflammation, suggesting the possibility of exposure to particularly high levels of stress for this group. This finding is consistent with previous studies showing that Black women can experience worse health outcomes than Black men, White women, or White men owing to their relatively subordinate position in both ethnic and gender hierarchies (Woods-Giscombe 2010; Farmer et al., 2021). This perspective underscores the importance of an ethnic health disparities analysis framework that includes multiple, interacting demographic, genetic, and socioenvironmental factors (Bowleg 2012; Brown and Hargrove 2013; Bauer 2014; Richardson and Brown 2016).

Inflammation and Ethnic Health Disparities

We related inflammation and ethnic health disparities by independently modeling the effect of CRP and ethnicity on disease status and then looking for diseases that showed significant associations with both factors. There were 109 out of 1,537 diseases that showed significant associations with both CRP and ethnicity, and we explored the diseases that showed the strongest effects for both. This approach uncovered a number of diseases linked to immune response and inflammation, including infectious diseases and complex, common diseases. This suggests the possibility that ethnic differences in inflammation, related to environmental exposures and psychosocial stress, could be broadly related to ethnic health disparities.

It is important to note, however, that our observational study design and statistical modelling do not allow for unambiguous causal inference regarding the relationship between CRP and disease (Davey Smith et al., 2005; Pingault et al., 2018). For infectious diseases, CRP levels are expected to be elevated after infection, which would entail a kind of reverse causality with respect to how our regression models are specified. For chronic diseases, systemic inflammation could precede disease or contribute to disease progression, but it could also reflect the presence of disease. Our models cannot distinguish between these possibilities, and it is not known whether participant CRP levels measured at recruitment precede or follow the diagnosis and course of disease. Thus, it is possible that the observed ethnic differences in CRP reflect a higher overall burden of disease for ethnic minorities in the UKBB, linked to higher levels of socioeconomic deprivation, rather than a causal risk factor for ethnic health disparities.

Data Availability Statement

The datasets analyzed for this study can be found on the United Kingdom Biobank website (https://www.ukbiobank.ac.uk/register-apply/).

Ethics Statement

The studies involving human participants were reviewed and approved by Ethics approval for the UKBB was obtained from the North West Multi-centre Research Ethics Committee (MREC) for the United Kingdom, the Patient Information Advisory Group (PIAG) for England and Wales, and the Community Health Index Advisory Group (CHIAG) for Scotland. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

SN: Conceptualization, data curation, methodology, analysis, visualization, and writing. AC: Data curation, methodology, and analysis. SS: Data curation and analysis. LR: Data curation and analysis. IJ: Conceptualization, funding acquisition, methodology, project administration, visualization, and writing. LM-R: Conceptualization, funding acquisition, methodology, project administration, visualization, and writing.

Funding

LMR was supported by the National Institutes of Health (NIH) Distinguished Scholars Program (DSP). SS and LM-R were supported by the Division of Intramural Research (DIR) of the National Institute on Minority Health and Health Disparities (NIMHD) at NIH. SN, AC, LR, and IJ were supported by the IHRC-Georgia Tech Applied Bioinformatics Laboratory.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

This study was made possible by the United Kingdom Biobank application number 65206.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2021.738485/full#supplementary-material

Footnotes

1https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/about-us/ethics

2https://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=21000

3https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/serum_biochemistry.pdf

4https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/ukb_dna_processing.pdf

References

Abramson, J. L., Weintraub, W. S., and Vaccarino, V. (2002). Association between Pulse Pressure and C-Reactive Protein Among Apparently Healthy US Adults. Hypertension 39, 197–202. Epub 2002/02/16. doi:10.1161/hy0202.104270

PubMed Abstract | CrossRef Full Text | Google Scholar

Alley, D. E., Seeman, T. E., Ki Kim, J., Karlamangla, A., Hu, P., and Crimmins, E. M. (2006). Socioeconomic Status and C-Reactive Protein Levels in the US Population: NHANES IV. Brain Behav. Immun. 20, 498–504. Epub 2005/12/07. doi:10.1016/j.bbi.2005.10.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Banda, Y., Kvale, M. N., Hoffmann, T. J., Hesselson, S. E., Ranatunga, D., Tang, H., et al. (2015). Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort. Genet. Aug. 200, 1285–1295. Epub 2015/06/21. doi:10.1534/genetics.115.178616

PubMed Abstract | CrossRef Full Text | Google Scholar

Bauer, G. R. (2014). Incorporating Intersectionality Theory into Population Health Research Methodology: Challenges and the Potential to advance Health Equity. Soc. Sci. Med. 110, 10–17. Epub 2014/04/08. doi:10.1016/j.socscimed.2014.03.022

PubMed Abstract | CrossRef Full Text | Google Scholar

Beatty Moody, D. L., Brown, C., Matthews, K. A., and Bromberger, J. T. (2014). Everyday Discrimination Prospectively Predicts Inflammation across 7-Years in Racially Diverse Midlife Women: Study of Women's Health across the Nation. J. Soc. Issues. 70, 298–314. Epub 2014/10/25. doi:10.1111/josi.12061

PubMed Abstract | CrossRef Full Text | Google Scholar

Bergström, A., McCarthy, S. A., Hui, R., Almarri, M. A., Ayub, Q., Danecek, P., et al. (2020). Insights into Human Genetic Variation and Population History from 929 Diverse Genomes. Science 367, 367. Epub 2020/03/21. doi:10.1126/science.aay5012

CrossRef Full Text | Google Scholar

Black, S., Kushner, I., and Samols, D. (2004). C-reactive Protein. J. Biol. Chem. 279 (279), 48487–48490. Epub 2004/09/01. doi:10.1074/jbc.R400025200

CrossRef Full Text | Google Scholar

Bowleg, L. (2012). The Problem with the PhraseWomen and Minorities:Intersectionality-An Important Theoretical Framework for Public Health. Am. J. Public Health 102, 1267–1273. Epub 2012/05/19. doi:10.2105/ajph.2012.300750

CrossRef Full Text | Google Scholar

Brown, T. H., and Hargrove, T. W. (2013). Multidimensional Approaches to Examining Gender and Racial/ethnic Stratification in Health. Women, Gend. Families Color. 1, 180–206. doi:10.5406/womgenfamcol.1.2.0180

CrossRef Full Text | Google Scholar

Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., et al. (2018). The UK Biobank Resource with Deep Phenotyping and Genomic Data. Nature 562, 203–209. Epub 2018/10/12. doi:10.1038/s41586-018-0579-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Carroll, R. J., Bastarache, L., and Denny, J. C. (2014). R PheWAS: Data Analysis and Plotting Tools for Phenome-wide Association Studies in the R Environment. Bioinformatics 30, 2375–2376. Epub 2014/04/16. doi:10.1093/bioinformatics/btu197

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, C. C., Chow, C. C., Tellier, L. C., Vattikuti, S., Purcell, S. M., and Lee, J. J. (2015). Second-generation PLINK: Rising to the challenge of Larger and Richer Datasets. GigaSci 4, 7. Epub 2015/02/28. doi:10.1186/s13742-015-0047-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Choudhry, S., Seibold, M. A., Borrell, L. N., Tang, H., Serebrisky, D., Chapela, R., et al. (2007). Dissecting Complex Diseases in Complex Populations: Asthma in Latino Americans. Proc. Am. Thorac. Soc. 4, 226–233. Epub 2007/07/04. doi:10.1513/pats.200701-029aw

PubMed Abstract | CrossRef Full Text | Google Scholar

Conley, A. B., Rishishwar, L., Norris, E. T., Valderrama-Aguirre, A., Mariño-Ramírez, L., Medina-Rivas, M. A., et al. (2017). A Comparative Analysis of Genetic Ancestry and Admixture in the Colombian Populations of Chocó and Medellín. G3 Genes|Genomes|Genetics 7, 3435–3447. Epub 2017/09/01. doi:10.1534/g3.117.1118

PubMed Abstract | CrossRef Full Text | Google Scholar

Danesh, J., Wheeler, J. G., Hirschfield, G. M., Eda, S., Eiriksdottir, G., Rumley, A., et al. (2004). C-reactive Protein and Other Circulating Markers of Inflammation in the Prediction of Coronary Heart Disease. N. Engl. J. Med. 350, 1387–1397. Epub 2004/04/09. doi:10.1056/NEJMoa032804

CrossRef Full Text | Google Scholar

Danner, M., Kasl, S. V., Abramson, J. L., and Vaccarino, V. (2003). Association between Depression and Elevated C-Reactive Protein. Psychosomatic Med. 65, 347–356. Epub 2003/05/24. doi:10.1097/01.psy.0000041542.29808.01

CrossRef Full Text | Google Scholar

Davey Smith, G., Ebrahim, S., Lewis, S., Hansell, A. L., Palmer, L. J., and Burton, P. R. (2005). Genetic Epidemiology and Public Health: hope, Hype, and Future Prospects. Lancet 366, 1484–1498. Epub 2005/10/26. doi:10.1016/S0140-6736(05)67601-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Dehghan, A., Kardys, I., de Maat, M. P. M., Uitterlinden, A. G., Sijbrands, E. J. G., Bootsma, A. H., et al. (2007). Genetic Variation, C-Reactive Protein Levels, and Incidence of Diabetes. Diabetes 56, 872–878. Epub 2007/03/01. doi:10.2337/db06-0922

PubMed Abstract | CrossRef Full Text | Google Scholar

Elliott, P., Peakman, T. C., and Biobank, U. K. (2008). The UK Biobank Sample Handling and Storage Protocol for the Collection, Processing and Archiving of Human Blood and Urine. Int. J. Epidemiol. 37, 234–244. Epub 2008/04/03. doi:10.1093/ije/dym276

CrossRef Full Text | Google Scholar

Fang, H., Hui, Q., Lynch, J., Honerlaw, J., Assimes, T. L., Huang, J., et al. (2019). Harmonizing Genetic Ancestry and Self-Identified Race/Ethnicity in Genome-wide Association Studies. Am. J. Hum. Genet. 105, 763–772. Epub 2019/10/01. doi:10.1016/j.ajhg.2019.08.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Farmer, H. R., Wray, L. A., and Haas, S. A. (2021). Race, Gender, and Socioeconomic Variations in C-Reactive Protein Using the Health and Retirement Study. J. Gerontol. B Psychol. Sci. Soc. Sci. Feb. 17 (76), 583–595. Epub 2020/02/18. doi:10.1093/geronb/gbaa027

CrossRef Full Text | Google Scholar

Feinstein, J. S. (1993). The Relationship between Socioeconomic Status and Health: a Review of the Literature. Milbank Q. 71, 279–322. Epub 1993/01/01. doi:10.2307/3350401

PubMed Abstract | CrossRef Full Text | Google Scholar

Ford, E. S. (2002). Does Exercise Reduce Inflammation? Physical Activity and C-Reactive Protein Among U.S. Adults. Epidemiology 13, 561–568. Epub 2002/08/23. doi:10.1097/00001648-200209000-00012

PubMed Abstract | CrossRef Full Text | Google Scholar

Foster, H. M. E., Celis-Morales, C. A., Nicholl, B. I., Petermann-Rocha, F., Pell, J. P., Gill, J. M. R., et al. (2018). The Effect of Socioeconomic Deprivation on the Association between an Extended Measurement of Unhealthy Lifestyle Factors and Health Outcomes: a Prospective Analysis of the UK Biobank Cohort. The Lancet Public Health 3, e576–e585. Epub 2018/11/24. doi:10.1016/s2468-2667(18)30200-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Galinsky, K. J., Bhatia, G., Loh, P.-R., Georgiev, S., Mukherjee, S., Patterson, N. J., et al. (2016). Fast Principal-Component Analysis Reveals Convergent Evolution of ADH1B in Europe and East Asia. Am. J. Hum. Genet. 98, 456–472. Epub 2016/03/01. doi:10.1016/j.ajhg.2015.12.022

PubMed Abstract | CrossRef Full Text | Google Scholar

Genomes Project, C., Auton, A., Brooks, L. D., Durbin, R. M., Garrison, E. P., Kang, H. M., et al. (2015). A Global Reference for Human Genetic Variation. Nature 526, 68–74. Epub 2015/10/04. doi:10.1038/nature15393

PubMed Abstract | CrossRef Full Text | Google Scholar

Govil, S. R., Weidner, G., Merritt-Worden, T., and Ornish, D. (2009). Socioeconomic Status and Improvements in Lifestyle, Coronary Risk Factors, and Quality of Life: the Multisite Cardiac Lifestyle Intervention Program. Am. J. Public Health 99, 1263–1270. Epub 2008/10/17. doi:10.2105/ajph.2007.132852

PubMed Abstract | CrossRef Full Text | Google Scholar

Inglis, V., Ball, K., and Crawford, D. (2005). Why Do Women of Low Socioeconomic Status Have Poorer Dietary Behaviours Than Women of Higher Socioeconomic Status? A Qualitative Exploration. Appetite 45, 334–343. Epub 2005/09/21. doi:10.1016/j.appet.2005.05.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Jordan, I. K., Rishishwar, L., and Conley, A. B. (2019). Native American Admixture Recapitulates Population-specific Migration and Settlement of the continental United States. Plos Genet. 15, e1008225. Epub 2019/09/24. doi:10.1371/journal.pgen.1008225

PubMed Abstract | CrossRef Full Text | Google Scholar

Kennedy, N. (2020). Forestmodel: forest Plots from Regression Models. R Package Version 06.2. https://www.ukbiobank.ac.uk/register-apply/.

Google Scholar

Khera, A., McGuire, D. K., Murphy, S. A., Stanek, H. G., Das, S. R., Vongpatanasin, W., et al. (2005). Race and Gender Differences in C-Reactive Protein Levels. J. Am. Coll. Cardiol. 46 (46), 464–469. Epub 2005/08/02. doi:10.1016/j.jacc.2005.04.051

CrossRef Full Text | Google Scholar

Lakoski, S. G., Cushman, M., Criqui, M., Rundek, T., Blumenthal, R. S., D'Agostino, R. B., et al. (2006). Gender and C-Reactive Protein: Data from the Multiethnic Study of Atherosclerosis (MESA) Cohort. Am. Heart J. 152, 593–598. Epub 2006/08/23. doi:10.1016/j.ahj.2006.02.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Lewis, T. T., Aiello, A. E., Leurgans, S., Kelly, J., and Barnes, L. L. (2010). Self-reported Experiences of Everyday Discrimination Are Associated with Elevated C-Reactive Protein Levels in Older African-American Adults. Brain Behav. Immun. 24, 438–443. Epub 2009/12/01. doi:10.1016/j.bbi.2009.11.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Mathieson, I., and Scally, A. (2020). What Is Ancestry. Plos Genet. 16, e1008624. Epub 2020/03/10. doi:10.1371/journal.pgen.1008624

PubMed Abstract | CrossRef Full Text | Google Scholar

Matthews, K. A., Sowers, M. F., Derby, C. A., Stein, E., Miracle-McMahill, H., Crawford, S. L., et al. (2005). Ethnic Differences in Cardiovascular Risk Factor burden Among Middle-Aged Women: Study of Women's Health across the Nation (SWAN). Am. Heart J. 149, 1066–1073. Epub 2005/06/25. doi:10.1016/j.ahj.2004.08.027

PubMed Abstract | CrossRef Full Text | Google Scholar

Nagar, S. D., Nápoles, A. M., Jordan, I. K., and Mariño-Ramírez, L. (2021). Socioeconomic Deprivation and Genetic Ancestry Interact to Modify Type 2 Diabetes Ethnic Disparities in the United Kingdom. EClinicalMedicine.37, 100960. doi:10.1016/j.eclinm.2021.100960

PubMed Abstract | CrossRef Full Text | Google Scholar

Nazmi, A., and Victora, C. G. (2007). Socioeconomic and Racial/ethnic Differentials of C-Reactive Protein Levels: a Systematic Review of Population-Based Studies. BMC Public Health 7, 212. Epub 2007/08/21. doi:10.1186/1471-2458-7-212

PubMed Abstract | CrossRef Full Text | Google Scholar

Pepys, M. B., and Hirschfield, G. M. (2003). C-reactive Protein: a Critical Update. J. Clin. Invest. 111, 1805–1812. Epub 2003/06/19. doi:10.1172/jci200318921

CrossRef Full Text | Google Scholar

Pingault, J.-B., O’Reilly, P. F., Schoeler, T., Ploubidis, G. B., Rijsdijk, F., and Dudbridge, F. (2018). Using Genetic Data to Strengthen Causal Inference in Observational Research. Nat. Rev. Genet. 19, 566–580. Epub 2018/06/07. doi:10.1038/s41576-018-0020-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Richardson, L. J., and Brown, T. H. (2016). (En)gendering Racial Disparities in Health Trajectories: A Life Course and Intersectional Analysis. SSM - Popul. Health. 2, 425–435. Epub 2017/01/24. doi:10.1016/j.ssmph.2016.04.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Sanders-Phillips, K., Settles-Reaves, B., Walker, D., and Brownlow, J. (2009). Social Inequality and Racial Discrimination: Risk Factors for Health Disparities in Children of Color. Pediatrics 124 (Suppl. 3), S176–S186. Epub 2009/11/05. doi:10.1542/peds.2009-1100E

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, H., Quertermous, T., Rodriguez, B., Kardia, S. L. R., Zhu, X., Brown, A., et al. (2005). Genetic Structure, Self-Identified Race/ethnicity, and Confounding in Case-Control Association Studies. Am. J. Hum. Genet. 76, 268–275. Epub 2004/12/31. doi:10.1086/427888

CrossRef Full Text | Google Scholar

Team R C (2013). R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing.

Google Scholar

Townsend, P., Phillimore, P., and Beattie, A. (1988). Health and Deprivation: Inequality and the. North: Routledge.

Google Scholar

Welsh, S., Peakman, T., Sheard, S., and Almond, R. (2017). Comparison of DNA Quantification Methodology Used in the DNA Extraction Protocol for the UK Biobank Cohort. BMC Genomics 18, 26. Epub 2017/01/07. doi:10.1186/s12864-016-3391-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Wickham, H. (2009). Elegant Graphics for Data Analysis. Media 35, 10–1007.

Google Scholar

Williams, D. R., and Mohammed, S. A. (2009). Discrimination and Racial Disparities in Health: Evidence and Needed Research. J. Behav. Med. 32, 20–47. Epub 2008/11/26. doi:10.1007/s10865-008-9185-0

CrossRef Full Text | Google Scholar

Wium-Andersen, M. K., Ørsted, D. D., Nielsen, S. F., and Nordestgaard, B. G. (2013). Elevated C-Reactive Protein Levels, Psychological Distress, and Depression in 73 131 Individuals. JAMA Psychiatry 70, 176–184. Epub 2012/12/26. doi:10.1001/2013.jamapsychiatry.102

PubMed Abstract | CrossRef Full Text | Google Scholar

Wong, N. D., Pio, J., Valencia, R., and Thakal, G. (2001). Distribution of C-Reactive Protein and its Relation to Risk Factors and Coronary Heart Disease Risk Estimation in the National Health and Nutrition Examination Survey (NHANES) III. Prev. Cardiol. 4, 109–114. Epub 2002/02/06. doi:10.1111/j.1520-037x.2001.00570.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Woods-Giscombé, C. L. (2010). Superwoman Schema: African American Women's Views on Stress, Strength, and Health. Qual. Health Res. 20, 668–683. Epub 2010/02/16. doi:10.1177/1049732310361892

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, P., Gifford, A., Meng, X., Li, X., Campbell, H., Varley, T., et al. (2019). Mapping ICD-10 and ICD-10-CM Codes to Phecodes: Workflow Development and Initial Evaluation. JMIR Med. Inform. 7 (7), e14325. Epub 2019/09/26. doi:10.2196/14325

PubMed Abstract | CrossRef Full Text | Google Scholar

Yudell, M., Roberts, D., DeSalle, R., and Tishkoff, S. (2016). Taking Race Out of Human Genetics. Science 351, 564–565. Epub 2016/02/26. doi:10.1126/science.aac4951

PubMed Abstract | CrossRef Full Text | Google Scholar

Zacho, J., Tybjaerg-Hansen, A., and Nordestgaard, B. G. (2010). C-reactive Protein and All-Cause Mortality-Tthe Copenhagen City Heart Study. Eur. Heart J. 31, 1624–1632. Epub 2010/04/29. doi:10.1093/eurheartj/ehq103

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: C-reacitve protein, inflammation, race and ethnicity (minority issues), health disparities, genetic ancestry, socioeconomic stages

Citation: Nagar SD, Conley AB, Sharma S, Rishishwar L, Jordan IK and Mariño-Ramírez L (2021) Comparing Genetic and Socioenvironmental Contributions to Ethnic Differences in C-Reactive Protein. Front. Genet. 12:738485. doi: 10.3389/fgene.2021.738485

Received: 08 July 2021; Accepted: 05 October 2021;
Published: 18 October 2021.

Edited by:

Matthew Patrick, University of Michigan, United States

Reviewed by:

Michael Guo, University of Pennsylvania, United States
Piril Hepsomali, University of Roehampton London, United Kingdom
Magdalena Krajewska, Wroclaw Medical University, Poland
Kamil Sicinski, University of Wisconsin-Madison, United States

Copyright © 2021 Nagar, Conley, Sharma, Rishishwar, Jordan and Mariño-Ramírez. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: I. King Jordan, king.jordan@biology.gatech.edu; Leonardo Mariño-Ramírez, marino@nih.gov

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.