- 1Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- 2Strengthening Oncology Services Research Unit, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- 3National Cancer Registry, National Health Laboratory Service, Johannesburg, South Africa
- 4Clinical Research Unit of Nanoro, Institut de Recherche en Sciences de la Santé, Nanoro, Burkina Faso
- 5Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- 6Navrongo Health Research Centre, Navrongo, Ghana
- 7Department of Biochemistry and Forensic Sciences, School of Chemical and Biochemical Sciences, C. K. Tedam University of Technology and Applied Sciences, Navrongo, Ghana
- 8SAMRC/Wits Developmental Pathways for Health Research Unit, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- 9Medical Research Council/Wits University Rural Public Health and Health Transitions Research Unit (Agincourt), School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- 10African Population and Health Research Center, Nairobi, Kenya
- 11Department of Women’s and Children’s Health, Karolinska Institute, Stockholm, Sweden
- 12Department of Pathology and Medical Sciences, School of Healthcare Sciences, Faculty of Health Sciences, University of Limpopo, Polokwane, South Africa
- 13School of Electrical and Information Engineering, University of the Witwatersrand, Johannesburg, South Africa
- 14Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, The University of Manchester, Manchester, United Kingdom
- 15Wits Donald Gordon Medical Centre, School of Clinical Medicine, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
Background: Genome-wide association studies (GWAS) have predominantly focused on populations of European and Asian ancestry, limiting our understanding of genetic factors influencing kidney disease in Sub-Saharan African (SSA) populations. This study presents the largest GWAS for urinary albumin-to-creatinine ratio (UACR) in SSA individuals, including 8,970 participants living in different African regions and an additional 9,705 non-resident individuals of African ancestry from the UK Biobank and African American cohorts.
Methods: Urine biomarkers and genotype data were obtained from two SSA cohorts (AWI-Gen and ARK), and two non-resident African-ancestry studies (UK Biobank and CKD-Gen Consortium). Association testing and meta-analyses were conducted, with subsequent fine-mapping, conditional analyses, and replication studies. Polygenic scores (PGS) were assessed for transferability across populations.
Results: Two genome-wide significant (P < 5 × 10−8) UACR-associated loci were identified, one in the BMP6 region on chromosome 6, in the meta-analysis of resident African individuals, and another in the HBB region on chromosome 11 in the meta-analysis of non-resident SSA individuals, as well as the combined meta-analysis of all studies. Replication of previous significant results confirmed associations in known UACR-associated regions, including THB53, GATM, and ARL15. PGS estimated using previous studies from European ancestry, African ancestry, and multi-ancestry cohorts exhibited limited transferability of PGS across populations, with less than 1% of observed variance explained.
Conclusion: This study contributes novel insights into the genetic architecture of kidney disease in SSA populations, emphasizing the need for conducting genetic research in diverse cohorts. The identified loci provide a foundation for future investigations into the genetic susceptibility to chronic kidney disease in underrepresented African populations Additionally, there is a need to develop integrated scores using multi-omics data and risk factors specific to the African context to improve the accuracy of predicting disease outcomes.
Introduction
Chronic kidney disease (CKD) is a leading risk factor for years of life lost and premature mortality, with a 41.5% relative increase in mortality worldwide from 1990 to 2017 (GBD Chronic Kidney Disease Collaboration et al., 2020; Kovesdy, 2022). The estimated global prevalence of CKD is 9.1% and while predicted to be higher in Sub-Saharan Africa (SSA), the true prevalence and associated risk factors remain understudied (Kaze et al., 2018; GBD Chronic Kidney Disease Collaboration et al., 2020). The Africa Wits-INDEPTH partnership for Genomic Studies (AWI-Gen) cohort, which included ∼12,000 participants from four SSA countries in West, East, and Southern Africa, reported overall CKD prevalence as 10.7% (95% confidence interval [CI]: 9.9–11.7), with notable geographic regional differences. The most important risk factors for CKD in SSA were older age, female sex, diabetes, hypertension, and human immunodeficiency virus (HIV) infection (George et al., 2019).
Over the past decade, genome-wide association studies (GWAS) have identified numerous genetic loci associated with kidney function, namely, estimated glomerular filtration rate [eGFR], serum creatinine, and urine albumin-creatinine ratio [UACR] (Böger et al., 2011; Pattaro et al., 2012; 2016; Teumer et al., 2016; 2019; Hellwege et al., 2019; Tin and Köttgen, 2020). The majority of the GWAS for kidney function and disease have examined associations with eGFR, while UACR, as a measure for albuminuria, has been investigated less often (Mahajan et al., 2016; Pattaro et al., 2016; Gorski et al., 2017; Haas et al., 2018; Teumer et al., 2019; Wuttke et al., 2019; Zanetti et al., 2019). A recent GWAS in 564,257 individuals of multi-ancestry origins identified 68 associated risk loci for UACR and proposed a priority list of genes to explore as targets for the treatment of albuminuria (Teumer et al., 2019).
While the majority of kidney disease-associated risk loci have been identified in studies on participants of European and East Asian ancestry, and the African diaspora (Lee et al., 2018), few have focused on participants living in SSA (Böger et al., 2011; Pattaro et al., 2012; Lin et al., 2019; Morris et al., 2019). Recently, a study of genetic associations of eGFR in a Ugandan population-based cohort, (Fatumo et al., 2020), replicated the association between eGFR and the GATM locus.
Replication and transferability of GWAS signals across populations of different ancestries, and specifically with African ancestry populations, tend to be poor despite regional replication often identifying shared associated genomic regions (Pattaro et al., 2012). This may be due to differences in linkage disequilibrium (LD) with the causal variant, allele frequency differences between the populations, underlying population structure, and variabilities in environmental exposures. African populations, with their great genetic diversity and deep evolutionary roots, represent an opportunity for genetic discovery to identify and fine-map disease-associated risk variants (Gomez et al., 2014; Pereira et al., 2021).
Polygenic scores (PGS) are used to quantify and stratify populations according to genetic risk. A PGS based on 63 eGFR-associated alleles showed significant association with kidney disease-related phenotypes, such as chronic kidney failure and hypertensive kidney disease in the Million Veteran Study (US) on 192,868 white and non-Hispanic individuals (Hellwege et al., 2019). A PGS based on 64 urine UACR associated alleles was significantly associated with CKD (Teumer et al., 2019). Further analysis revealed positive associations of the PGS with an increased risk of hypertension (HT) and diabetes. However, PGS often translate poorly across different ancestries (Martin et al., 2017; Kamiza et al., 2022; Kachuri et al., 2023). Since most published GWAS for kidney disease and kidney function markers are based on European ancestry populations, the predictive accuracy of models developed from these studies is expected to be significantly diminished for African populations (Adam et al., 2022; Choudhury et al., 2022; Kamiza et al., 2023; Majara et al., 2023).
In this study, we present a GWAS for UACR conducted within resident Sub-Saharan African individuals. This population cross-sectional study includes a cohort of 8,970 individuals from four SSA countries from the AWI-Gen study (Ali et al., 2018), the African Research on Kidney Disease (ARK) study (Kalyesubula et al., 2020), with 9,705 individuals of African-ancestry from the UK Biobank (UKB) and African American participants from the CKD-Gen Consortium (Teumer et al., 2019). The primary objectives are to: (1) identify genetic loci associated with UACR as a marker of kidney disease in individuals from SSA and of African ancestry; (2) explore the replication of findings identified in previous GWAS; (3) perform analysis and comparison of PGS derived from non-African and multi-ancestry population studies and evaluate their transferability to African populations.
Participants and methods
Study participants
Africa Wits-INDEPTH partnership for genomic research (AWI-Gen)
The study participants are a subset of the population cross-sectional AWI-Gen study (Ramsay et al., 2016; Ali et al., 2018). The study recruited adults primarily between the ages of 40 and 80 years from six SSA study sites in West Africa (Nanoro, Burkina Faso and Navrongo, Ghana), East Africa (Nairobi, Kenya) and in South Africa (Bushbuckridge - hereinafter referred to as Agincourt, Mpumalanga Province; Dikgale, Limpopo Province; and Soweto, Gauteng). All participants were of self-identified black ethnicity. Data collection was described in detail previously (Ali et al., 2018; George et al., 2019). Detailed demographic data, health-related questionnaire data, and anthropometric measurements were collected. Peripheral blood samples and urine samples were collected for biomarker assays (the relevant assays are described below). DNA was extracted from peripheral blood-derived buffy coat samples and used for genotyping. Urine albumin was measured using a colorimetric method on the Cobas© 6000/c501 analyzer, and urine creatinine was measured by the modified Jaffe method (Craik et al., 2023). This study was approved by the Human Research Ethics Committee (Medical), University of the Witwatersrand, South Africa (M121029, M170880) and the ethics committees of all participating institutions. All participants provided written informed consent following community engagement and individual consenting processes.
African research on kidney disease (ARK)
The African Research Kidney Disease (ARK) study is a well characterised population-based cohort study of 2021 adults (20–80 years) of self-identified black ethnicity from Agincourt, (Mpumalanga, South Africa) with demographic data, health-related questionnaire data, and anthropometric measurements collected at enrolment (Fabian et al., 2022). Blood and urine were collected for biomarker assays (the relevant assays are described below). DNA was extracted from buffy coat samples and used for genotyping. Urine albumin was measured using a colorimetric method on the Cobas© 6000/c501 analyzer, and urine creatinine was measured by the modified Jaffe method (Craik et al., 2023). This study was approved by the Human Research Ethics Committee (Medical), University of the Witwatersrand, South Africa (M160939). All participants provided written informed consent following community engagement and individual consenting processes. The geographical area of recruitment overlaps with the Agincourt sub-cohort of AWI-Gen but there is no overlap in participants.
UK-Biobank (UKB)
Individuals of self-reported Caribbean and African ancestry from the UKB were identified for this study. Of this subset of UKB individuals, those with both genotyping and UACR data were retained for the analysis. UACR was derived using urinary levels of albumin and creatinine. In the UKB, albumin was measured using the immuno-turbidimetric analysis method (Randox Biosciences, UK) while creatinine was measured using the enzymatic analysis method (Beckman Coulter, UK) (Casanova et al., 2019).
Phenotype generation and harmonization
UACR was calculated for AWI-Gen and ARK studies using urinary levels of albumin and creatinine as previously described (George et al., 2019; Fabian et al., 2022). Participants with missing values for albumin and creatinine were excluded from this study. We applied filtering criteria similar to those employed by the CKD-Gen consortium (Köttgen and Pattaro, 2020). In cases where the values for urine albumin and urine creatinine fell outside the upper and lower limits of detection, the values were replaced with the respective upper and lower limits: for urine creatinine, the range was 3–400 mmol/L and for urine albumin, the range was 3.75–475 mg/L for AWI-Gen and ARK. For the UKB dataset, the upper limit was 6.7 mg/L for urine albumin. Albuminuria was defined as UACR >3.0 mg/mmol.
Genotyping
AWI-Gen and ARK
Genomic DNA was genotyped using the H3Africa custom genotyping array. The H3Africa custom array was designed as an African-common-variant-enriched GWAS array (https://www.h3abionet.org/h3africa-chip) (Illumina) with ∼2.3 million single nucleotide polymorphisms (SNPs).
UK-Biobank
Genotyping was performed by Affymetrix on two closely related purpose-designed arrays. ∼50,000 participants were genotyped using the UK BiLEVE Axiom array (Resource 149,600) and the remaining ∼450,000 were genotyped using the UK Biobank Axiom array (Resource 149,601). The dataset is a combination of results from both arrays. A total of 805,426 markers were released in the genotype data. We extracted individuals with self-reported (Data-Field in dataset 21,000) African Ancestry split between African (UKB-African) and Caribbean origins (UKB-Caribbean) from the raw dataset (Casanova et al., 2019).
Quality control
For each dataset, AWI-Gen (Choudhury et al., 2022), ARK and UKB, the following pre-imputation quality control (QC) steps were applied: removal of non-autosomal and mitochondrial SNPs; SNPs with genotype missingness greater than 0.05; minor allele frequency (MAF) less than 0.01; and Hardy-Weinberg equilibrium (HWE) p-value less than 0.0001. Individuals were excluded if they had more than 5% overall genotype missingness; heterozygosity lower than 0.150 and higher than 0.343; and discordant genotype and phenotype sex information. We used the GWAS QC workflow of the H3Africa Consortium Pan-African Bioinformatics Network to perform data QC (H3ABioNet H3AGWAS) (https://github.com/h3abionet/h3agwas) (Baichoo et al., 2018; Brandenburg et al., 2022a).
In our final QC step, we identified and excluded outliers, admixed and related individuals using PCASmart, a feature of the EIGENSOFT software (Price et al., 2006), Admixture software (Alexander et al., 2009) using AGV (Gurdasani et al., 2015) and 1000 Genomes Project data (Auton et al., 2015) and PLINK (Version 1.9) (Purcell et al., 2007; Chang et al., 2015). More detail on filter parameters for each software can be found in Supplementary Table S1.
Imputation
Genotype imputation was performed on each dataset separately (AWI-Gen, ARK, and UKB) using the Sanger Imputation Server with the African Genome Resources reference panel (https://www.sanger.ac.uk/tool/sanger-imputation-service/). EAGLE2 was used for pre-phasing and the PBWT algorithm was used for imputation (Loh et al., 2016). After imputation, poorly imputed SNPs with info scores less than 0.3 and with a HWE p-value less than 1 × 10−04 were removed. The genomic positions were mapped to GRCh37p11.
Phenotype transformation for association testing
For AWI-Gen, ARK, and UKB datasets, UACR was transformed on the logarithm scale. Linear regression of variables was performed with covariates in R (Version 3.6): ln (UACR) ∼age + sex + genetic principal components (PCs) 1-5. Residuals were extracted and transformed using Rank-Based Inverse Normal Transformation to ensure the normal distribution of residuals (Casanova et al., 2019). PCs were calculated using a sub-set of LD pruned pre-imputed SNPs in PLINK (Version 1.9) (Purcell et al., 2007; Chang et al., 2015). The sub-set was derived by LD pruning using PLINK (Version 1.9) (Purcell et al., 2007; Chang et al., 2015) with an LD (r2) threshold of 0.2 with windows of 50 kb and 10 kb for step size.
Association testing
Mixed model association testing was performed with imputed genotype probabilities using GEMMA (Version 0.98.1) (Zhou and Stephens, 2012). GEMMA uses a relatedness matrix to account for genetic structure and relatedness between individuals. The relatedness matrix was built with a sub-set of pre-imputed SNPs described above.
Mixed model association testing was performed independently on each dataset. A total of nine datasets were tested. The datasets were defined as follows: six datasets for AWI-Gen: AWI-Agincourt, AWI- Dikgale, AWI-Nanoro, AWI-Nairobi, AWI-Navrongo and AWI-Soweto; one dataset for ARK: ARK-Agincourt; and two datasets for UK Biobank: UKB-Caribbean and UKB-African. For each dataset, Quantile-to-quantile plots (QQ-plots) were generated, and inflation factors were calculated using SNPs with MAF>0.01 to verify that the association signals were not inflated due to unaccounted population sub-structure. The genome-wide significance level for novel discovery was considered at P < 5 × 10−08.
CKD-Gen
We used previously published meta-analysis summary statistics from the CKD-Gen Consortium. The CKD-Gen Consortium datasets consist of three meta-analysis summary statistics: 1) CKD-Gen European ancestry individuals (CKD-Gen-EA); 2) CKD-Gen African American ancestry individuals (CKD-Gen-AA); and 3) CKD-Gen multi-ancestry individuals (CKD-Gen-MA) which include individuals from CKD-Gen-EA and CKD-Gen-AA (Teumer et al., 2019). The CKD-Gen Consortium meta-analysis summary statistics were retrieved from http://ckdgen.imbi.uni-freiburg.de/.
Briefly, CKD-Gen-AA is a meta-analysis based on 7 studies with African American participants. For each study, genotyping was performed using genome-wide arrays followed by application of study-specific quality filters prior to phasing, imputation, and association analysis software [description can be found in Supplementary Tables 1, 2 from (Teumer et al., 2019)]. Meta-analysis was performed using fixed effects inverse-variance weighted meta-analysis of the study-specific GWAS result files with imputation quality (IQ) score > 0.6 and MAC > 10, effective sample size ≥ 100, and a beta < 10, using METAL [for more details see (Teumer et al., 2019)].
Meta-analysis
Fixed-effect meta-analyses were conducted using the METASOFT software (Han and Eskin, 2011). The first meta-analysis (MetaSSA) used the GWAS summary statistics generated from individual-level data from resident SSA populations. This included AWI-Agincourt, AWI-Dikgale, AWI-Nanoro, AWI-Nairobi, AWI-Navrongo, AWI-Soweto and ARK-Agincourt. The second meta-analysis (MetaNONRES) included data from individuals of African ancestry who are not residing in SSA. We used the GWAS summary statistics generated from individual-level data from the UK-Biobank (UKB-African and UKB-Caribbean) and CKD-Gen African American sub-set (CKD-Gen-AA). The third meta-analysis (MetaALL) consisted of a meta-analysis that pooled the summary statistics of all studies from AWI-Agincourt, AWI-Dikgale, AWI-Nanoro, AWI-Nairobi, AWI-Navrongo, AWI-Soweto, ARK-Agincourt, UKB-African, UKB-Caribbean and CKD-Gen-AA. Figure 1 outlines the meta-analysis workflow. As a secondary analysis, the role of heterogeneity had been investigated between cohorts from different regions of origin by performing separate meta-analyses for residents of Southern African (AWI-Agincourt, AWI-Dikgale, AWI-Soweto, and ARK-Agincourt) and residents of West Africa (AWI-Nanoro, AWI-Navrongo). Random-effects model from METASOFT (Han and Eskin, 2011) took into account potential heterogeneity between study sites, we performed Meta RE using all dataset (MetaALLRE) (Borenstein et al., 2010; Nikolakopoulou et al., 2014). The genome-wide significance level for novel discovery was considered at P < 5 × 10−08.
Post association analysis
Plotting
QQ-plots and Manhattan plots were generated using the FastMan library (Paria et al., 2022) (available at https://github.com/kaustubhad/fastman) and the Hudson library (available at https://github.com/anastasia-lucas/hudson). These visualizations were created using SNPs with a MAF threshold of 0.01 or more. For regional plots, we utilized the standalone version of the LocusZoom software (Pruim et al., 2010).
Genetic LD reference
For the estimation of the LD reference panel for conditional and joint (COJO) analysis, clumping, and fine-mapping, three LD reference panels were constructed using genotype data from the appropriate datasets. For resident SSA dataset comparisons, the LD reference panel (LDSSA) was constructed using AWI-Gen and ARK individual-level genotype data. For non-resident SSA dataset comparisons, the LD reference panel (LDNONRES) was constructed using UKB individual-level genotype data. For the combined datasets comparison, the LD reference panel (LDALL) was constructed using AWI-Gen, ARK, and UKB individual-level genotype data.
Fine-mapping and lead SNPs
For each locus with a lead SNP with a p-value below 5 × 10−08, fine-mapping was conducted using the H3ABioNet H3AGWAS pipeline and implementing a stepwise model selection procedure through GCTA (Yang et al., 2011; 2012; Brandenburg et al., 2022a) to identify independently associated SNPs. Subsequently, we utilized the FINEMAP software (Version 1.4) (Benner et al., 2016), considering one causal variant, to define the credible set with 99% confidence using a stochastic approach (Benner et al., 2016).
Conditional analyses (GCTA)
Conditional analyses used the GCTA software implemented within the H3AGWAS pipeline, with summary statistics obtained from the meta-analyses as input. In these analyses, the lead SNPs identified in each meta-analysis were conditioned upon lead SNPs found in previously published studies. Changes in the p-value, both increasing or decreasing significance, of the lead SNP, confirmed a relationship between the two SNPs.
Replication of previous findings
Replication was performed according to the following criteria: 1) Exact replication: if any genome-wide significant lead SNPs found in CKD-Gen-EA and CKD-Gen-MA reached statistical significance (p < 0.05) in MetaSSA, MetaNONRES or MetaALL after Bonferroni correction (A total of 60 independent lead SNPs were identified in the CKD-Gen datasets, of which 55 lead SNPs were from CKD-Gen-EA and 57 lead SNPs were from CKD-Gen-MA) and that the lead SNPs have same direction of effect. 2) LD Window replication: for a given genome-wide significant SNP found in the CKD-Gen datasets, SNPs were extracted from MetaSSA, MetaNONRES and MetaALL that are in LD with the said CKD-Gen lead SNP. LD pruning used the clump procedure in PLINK (Version 1.9) (r2 = 0.1, windows size 1000 kb, P1 = 5 × 10−08, P2 = 0.1). The lowest p-value(s) from SNPs within the given LD window were extracted and this LD window was considered statistically significant if the p-value was less than 5 × 10−04 in both datasets. Additionally, the direction of effect between the CKD-Gen and Meta-datasets (MetaSSA, MetaNONRES and MetaALL) must be consistent. Conditional analyses were performed between the genome-wide significant SNP(s) in CKD-Gen and lead SNP in our meta-analyses to confirm the replication.
For replication, the findings from MetaSSA were compared to CKD-Gen-MA and CKD-Gen-EA, and the findings from MetaNONRES and MetaALL were only compared to CKD-Gen-EA to avoid sample overlaps within the CKD-Gen datasets (as CKD-Gen-AA is contained within CKD-Gen-MA).
Annotation and expression quantitative trait locus (eQTL) analysis
Functional annotation of genome-wide significant SNPs found in MetaSSA, MetaNONRES and/or MetaALL was done using the ANNOVAR software (Wang et al., 2010). eQTL analysis was performed using the database of cis-eQTLs in both glomerular and tubulointerstitial tissues, derived from participants in the Nephrotic Syndrome Study Network (NEPTUNE) using SNPs with false discovery rate (FDR) < 0.05 (Han et al., 2023). In this analysis a 1000 kb window was defined around each genome-wide significant locus and an eQTL was considered significant if the LD (r2) was ≥0.01 between the lead SNP and significant eQTL, LD computation used the genetics data from the African populations from the 1000 Genomes Project (v5a, hg19) (Auton et al., 2015; Sudmant et al., 2015).
Polygenic scores
PGS were computed for each dataset independently (AWI-Agincourt, AWI-Dikgale, AWI-Nanoro, AWI-Nairobi, AWI-Navrongo, AWI-Soweto, ARK-Agincourt, UKB-African, and UKB-Caribbean). The effect sizes from 3 previous studies were used: CKD-Gen-AA, CKD-Gen-MA, and CKD-Gen-EA. PRS-CS (Ge et al., 2019), software that estimates posterior SNP effect sizes by implementing continuous shrinkage (CS) priors, was used to calculate the PGS. As external LD references are required for this analysis, the African LD data derived from the 1000 Genomes Project by the PRScs project was used for this purpose (accessible at https://github.com/getian107/PRScs). The PGS values were regressed against the residualized UACR value in a linear regression model that adjusted for age, sex, and the first five principal components to assess the performance of PGS.
Results
Study participants and phenotype data
Genomic and phenotypic data were accessible for 7,959 individuals in the AWI-Gen datasets, 1,011 individuals in the ARK dataset, and 2,916 individuals in the UK-Biobank dataset with 1,205 individuals and 1,711 individuals in UKB-African and UKB-Caribbean respectively (Supplementary Figure S1). CKD-Gen AA was a meta-analysis of 7 studies including 6,795 individuals in total. Overall, there was a higher prevalence of albuminuria (17.9%; median UACR 1.01 mg/mmol) among individuals from the UKB with African and Caribbean ancestry compared to individuals residing in SSA, where notable regional differences were observed. The highest prevalence of albuminuria occurred in AWI-Agincourt, South Africa (14.1%; median UACR 0.59 mg/mmol) while the lowest prevalence occurred in AWI-Nanoro, West Africa (prevalence of albuminuria 4.5%, median UACR 0.35 mg/mmol) (Table 1).
Table 1. Study participants and phenotype data. Participant characteristics for each AWI-Gen study site, ARK-Agincourt and UKB-African and UKB-Caribbean, with phenotype distributions of UACR (median) and covariables used in the study.
Meta-analysis
Meta-analyses were conducted to investigate the genetics of UACR in resident Sub-Saharan African datasets (MetaSSA) (Figure 2A), non-resident Sub-Saharan African datasets (MetaNONRES) (Figure 2B) and all African ancestry datasets (MetaALL) (Figure 2C).
Figure 2. Manhattan plot—GWAS of UACR in the (A) MetaSSA (B) MetaNONRES (C) MetaALL datasets using the fixed effect model. Lead genome-wide significant SNPs (P < 5 × 10−08) and gene annotations are highlighted.
No genomic inflation was observed for the individual-dataset association testing performed on the 9 datasets. All genomic inflation factors (lambda) were below 1.1. This was visually confirmed on the dataset specific QQ-plots and Manhattan plots (Supplementary Figure S2; Supplementary Figures S3A–I). Dataset-specific significant findings are reported in Supplementary Table S2; Supplementary Figures S4A–C.
One genome-wide significant locus with the lead SNP rs9505286 (p = 4.3.10−08) was identified in MetaSSA on chromosome 6. (Figure 3A). One genome-wide significant locus with the lead SNP rs73404549 was identified on chromosome 11 in MetaNONRES (p = 5.6.10−11) and MetaALL (p = 7.7 10−13) (Table 2; Figures 3B, C).
Figure 3. Regional plot using LocusZoom of genome-wide significant SNPs found in meta-analyses using the fixed effect model, (A) rs9505286 from the result of MetaSSA, (B) rs9966824 from the result MetaNONRES (C) rs9966824 from the result MetaALL.
Table 2. Lead genome-wide significantly associated SNPs for sub-Saharan African population meta-analysis (MetaSSA), non-resident African ancestry population meta-analysis (MetaNONRES) and the combined African ancestry population meta-analysis (MetaALL).
SNP rs9505286 (chr6:7820353) is located in the intronic region of BMP6. Two SNPs were identified in the 95% credible set using FINEMAP (Figure 3A; Supplementary Table S4). eQTLs in the region were found to be associated with the expression of two genes RREB1 and BMP6 (Table 2; Supplementary Figure S5; Supplementary Table S3; Supplementary Table S5).
SNP rs73404549 (chr11:5320654) is located near the HBE1, OR51B4, and HBB genes. This signal is primarily driven by results from West African ancestry datasets in the MetaNONRES and MetaALL (Figures 3B, C; Supplementary Figure S6). Notably, this SNP is monomorphic in the Southern African and East African datasets. Furthermore, rs73404549 is in LD with rs334 (r2 = 0.52; 72,422 bp apart), the SNP that defines the sickle cell mutation (HbS). SNP rs334 was also significant in MetaALL (PALL = 8.55 × 10−9).
In the window of 1000 kb around rs73404549, SNPs in the region colocalized with gene expression of TRIM6 and STIM1 in glomerular and tubulointerstitial tissues (Table 2; Supplementary Table S5).
Seven and two SNPs were identified in the 99% credible set using FINEMAP in the MetaNONRES and MetaALL results, respectively (Supplementary Table S3; Supplementary Figure S6).
Replication of previous findings
Replication analysis confirmed associations in three were identified in the 95% credible setpreviously identified region in THBS3, SPATA5L1/GATM, and ARL15 (Supplementary Table S3).
In the THBS3 region, the MetaALL meta-analysis rs370545 was the lead SNP in our dataset, with a p-value of 1 × 10−04. However, a conditional analysis using rs2974937 (lead SNP in CKD-Gen-EA) resulted in a decrease in significance level (Pconditional_analysis = 0.85). This suggests that the association in the THBS3 region was driven by rs2974937 in MetaALL even though it was not the lead SNP in this region (Supplementary Table S4; Supplementary Figure S7).
In the ARL15 region, a statistically significant association signal was observed in MetaSSA (rs1664781, p = 1.8 × 10−04). Conditional analysis using rs1694068 (lead SNP in CDK-Gen-EA) revealed a reduction in p-value for rs1664781 (Pconditional_analysis = 0.87), suggesting that rs1694068 and rs1664781 are in LD thus confirming the association in this region (Supplementary Table S4; Supplementary Figure S8).
In the SPATA5L1/GATM region, the MetaALL meta-analysis identified rs1694067 as the lead SNP in this region with a p-value of 7.0 × 10−05. Furthermore, the lead SNP rs1153847 identified in CKD-Gen-EA, was present in our dataset, and its association was replicated (PBonferoni_adjusted = 0.04). For the window-based replication, a conditional analysis using rs2467858 (genome-wide significant SNP in CKD-Gen-EA), a reduced p-value was observed (Pconditional_analysis = 0.87) confirming rs1694067 and rs2467858 are in LD and replicated the CKD-Gen signal. (Supplementary Table S4; Supplementary Figure S9).
Polygenic score analyses
The variance explained by the PGS for UACR residuals was between 0% and 0.82%. PGS constructed using the betas from CKD-Gen-EA and CKD-Gen-MA performed better for the non-SSA resident datasets, particularly in the UKB-African, showing the best predictivity (% variance: 0.82, p = 1 × 10−04) and statistically significant correlation between the PGS and the UACR residual (Figure 4; Supplementary Table S6).
Figure 4. Percent variance (r2) explained between PGS and residual phenotypes computed using age, sex and 5 PCs. Key- The negative relationship between PGS and the phenotype in the result of the linear model, *p < 0.05, **p < 0.01 and ***p < 0.001. Details in Supplementary Table S6.
Using the PGS constructed from CKD-Gen-MA, ARK-Agincourt (% variance: 0.61, p = 0.01) and AWI-Agincourt (%variance: 0.58, p = 0.002) demonstrated better predictivity in SSA populations. PGS constructed from CKD-Gen-AA did not improve the variance explained. Variance explained was lower using PGS constructed from CKD-GEN-AA than CKD-Gen-MA or CKD-Gen-EA.
Discussion
This study is the first GWAS for UACR conducted in Sub-Saharan African populations. Two genomic regions were identified to be significantly associated with UACR among 8,970 participants from West, East, and Southern Africa and among 9,705 non-resident African-ancestry participants from the UK Biobank and CKD-Gen Consortium.
For the first locus, the SNP rs9505286 reached genome-wide significance in resident African individuals MetaSSA and is located in the intronic region of BMP6. eQTLs in LD with rs9505286 were found to be associated with expression of two genes, namely bone morphogenetic protein 6 (BMP6) and ras-responsive element binding protein 1 (RREB1). Both genes are plausibly linked with kidney disease. BMP6 encodes a secreted ligand of the transforming growth factor (TGF-beta) superfamily of proteins, of which TGF-β1 is one of the most important regulators of kidney fibrosis, the pathological hallmark of irreversible loss of kidney function in CKD (Dendooven et al., 2011; Jenkins and Fraser, 2011). TGF-B1 is highly expressed in various fibrotic kidney diseases, including diabetic nephropathy (DN), hypertensive nephropathy, obstructive kidney disease, autosomal dominant polycystic kidney disease, immunoglobulin A nephropathy, crescentic glomerulonephritis, and focal segmental glomerulosclerosis. Because of its pivotal role in mediating kidney fibrosis, TGF-B1 is a potential target for drug discovery, and these results point towards similar potential in African populations for further exploration. RREB1, initially identified as a repressor of the angiotensinogen gene, is associated with type 2 diabetes in African Americans with end stage kidney disease (Bonomo et al., 2014). RREB1 polymorphisms have been shown to interact with APOL1, and are implicated in fat distribution and fasting glucose, a potential explanation for the association with type 2 diabetes. As obesity and type 2 diabetes prevalence emerge in many African communities undergoing rapid sociodemographic transition, these findings must inform future work (Bonomo et al., 2014). Unfortunately, neither of the eQTLs has strong LD support with the lead SNP rs9505286 (see Supplementary Table S5).
Variability of kidney function, confounding factors and allele frequency differences between datasets may explain why the rs9505286 signal was not replicated in MetaALL or MetaNONRES (Marigorta et al., 2018).
For the second locus, the SNP rs73404549 was found to be statistically significant in non-resident individuals with African Ancestry (MetaNONRES) and overall (MetaALL), but not in Sub-Saharan African individuals (MetaSSA). This can be explained by the fact that the variant allele of rs73404549 is extremely rare or absent in East and South African populations. This SNP was found to be in LD with rs334, the sickle cell trait mutation (HbS) in the HBB gene. The HbS mutation has been linked to malaria resistance among heterozygotes, with differences in allele frequency attributed to variations in selection pressures between Bantu-speaking populations in West and South/East Africa (Gurdasani et al., 2019; Choudhury et al., 2020). Notably, sickle cell trait and rs334 had been associated with various kidney function (eGFR) and kidney disease traits, including albuminuria, and chronic and end-stage kidney disease in African, African American and US Hispanic/Latino populations (Naik et al., 2014; Gurdasani et al., 2019; Fatumo et al., 2020; Masimango et al., 2022). Furthermore, an interaction between APOL1 high-risk genotypes and the sickle cell trait enhances the risk for low eGFR (Masimango et al., 2022).
In addition to the HBB region, our GWAS revealed transferability of three previously identified signals. Of the 60 UACR-associated loci identified in European and Multi-Ancestry studies, only three were replicated, including variants in GATM. This region was also associated with eGFR in a Ugandan population (Fatumo et al., 2020). We also replicated the association with ARL15 in the region of chromosome 1 ARL15 is a regulator of Mg2+ transport thereby promoting the complex N-glycosylation of cyclin M proteins (CNNM 1-4) and could play a role in the pathogenesis of hypertension mediated via altered tubular handling of magnesium in the kidney (Zolotarov et al., 2021).
Allelic heterogeneity is high in African ancestry populations, as demonstrated by the high genetic diversity in our study (Supplementary Figure S1). However, analysis of regional subgroups using meta-analysis (residents of South, West, or East Africa) did not reveal significant population-specific signatures (p < 5 x 10-8), likely due to small sample sizes within these subgroups (Supplementary Figures S3I, 10A, B). Interestingly, meta-analysis under a random-effects model that allows for heterogeneity in allelic effects between regions (MetaALLRE) did not improve the detection of specific signals already observed with the fixed-effects methods for HBB (Supplementary Figure S11). Consequently, the heterogeneity observed might be explained primarily by variations in LD or environmental factors rather than by the effect of a specific allele, such as the presence or absence of sickle cell trait (Nikolakopoulou et al., 2014; Kuchenbaecker et al., 2019; Choudhury et al., 2020).
The transferability of PGS developed using the effect sizes quantified in three previous association studies in European ancestry, African ancestry and multi-ancestry populations showed limited predictability, explaining less than 1% of the variability in UACR. PGS in resident African populations (AWI-Gen and ARK) explained between 0.58% and 0.60% of the variance of UACR compared to UKB-African, where best prediction was observed (0.80%). The poor predictability of UACR using summary statistics derived from African Americans was likely due to the small sample size of the discovery dataset. Unfortunately, there have been few studies on PGS approaches to compare findings with, and the genetic heritability of UACR is relatively low, estimated at 4.3% (Teumer et al., 2019).
The limited transferability of PGS and previous GWAS signals across ancestral groups could be due to differences in genetic architecture and/or pleiotropic effects. Different demographic histories and genetic selection pressures between European and African populations could modify the ability to replicate previous GWAS results due to differences in allele frequencies between non-African and African populations, with generally lower LDs in African genomes. Environmental factors and variability in the prevalence and aetiology of kidney and disease-related risk factors such as diabetes and hypertension (Fatumo et al., 2020) could also influence the genetic architecture of kidney disease in Africans populations (Limou et al., 2014; Teumer et al., 2019; Brandenburg et al., 2022b). Selection pressures have increased the frequencies of APOL1 kidney risk variants and HbS due to their protective properties in areas of Africa where trypanosomiasis and malaria are endemic. This may have contributed to shaping genetic susceptibility to kidney disease in African individuals. In our study, the APOL1 gene region did not exhibit significant associations with UACR. The indel rs71785313 was not imputed using the African Sanger reference for imputation, and a specific study had previously been published to describe APOL1 variant distribution in the AWI-Gen dataset using other imputation panels, but the locus did not reach genomic significance (5 x 10 -8) for association with eGFR and UACR (Brandenburg et al., 2022b).
While the burden of CKD in SSA is high, it is noteworthy that no prior GWAS on UACR has been conducted on the continent. Despite its uniqueness, our study is limited by its relatively modest sample size, which impacts statistical power to detect small-effect associations reaching genome-wide significance thresholds. Kidney and disease markers were measured at a single time point, and spot urine albumin and creatine levels are sensitive to incident infections and other environmental factors that could affect the prevalence of albuminuria.
It is important to note that our study populations are mainly treatment naïve in relation to kidney disease and other cardiometabolic conditions, which may be an advantage in detecting genetic associations (Pereira et al., 2021). Other studies, based on lipid-associated loci, attributed non-transferability of associated loci to pleiotropic effects, gene-environment interactions, and also to variability in allele frequencies and LD patterns (Kuchenbaecker et al., 2019; Choudhury et al., 2022), as we hypothesize for UACR.
In conclusion, this study describes genetic associations with UACR in a unique SSA cohort and non-resident individuals with African ancestry. CKD in African populations remains understudied but from available data, hypertension, rather than diabetes is the most commonly associated risk factor and in some regions, up to 60% of people with CKD do not have an associated “traditional” risk factor common to high-income settings, suggesting alternate underlying molecular pathways or aetiologies for CKD (Kalyesubula et al., 2018; Nakanga et al., 2019; Muiru et al., 2020). Our study identified two novel SNPs associated with UACR in populations of African ancestry. We further replicated three known UACR-associated loci. Regional genetic diversity due to different selection pressures appear to play a role in the genetic aetiology of CKD across the African continent. These factors likely contribute to the limited transferability of previous association signals and the poor transfer of polygenic scores developed in non-African populations to African populations. Larger genomic studies are necessary to better understand the genetic architecture of kidney function and chronic kidney disease across different African populations and inform region-specific kidney risk profiles. As demonstrated in this study, the low genetic heritability of UACR limits the predictive power of polygenic score for kidney disease in our setting. It is critical for future research to address these gaps by modelling integrative risk scores that incorporate locally relevant clinical risk factors that are powerful predictors of kidney disease, multiple kidney phenotypes (eGFRcystatin C, eGFRcreatinine, eGFRcreatinine+cystatin C, albuminuria, blood urea nitrogen), using multi-omics (Eddy et al., 2020), and the impacts of African-specific genetic risk for kidney disease, such as APOL1 high-risk genotypes and sickle cell trait or disease (Naik et al., 2014; Friedman and Pollak, 2016; Brandenburg et al., 2022b).
Data availability statement
Publicly available datasets were analyzed in this study. Summary statistics are available in the GWAS catalog (GCST90399578, GCST90399579 and GCST90399580) and scripts are available from the corresponding authors. The AWI-Gen data set is available from the European Genome-phenome Archive (EGA) database (https://ega-archive.org/), accession number EGAS00001002482 (phenotype dataset: EGAD00001006425; genotype dataset: EGAD00010001996). The availability of these datasets is subject to controlled access through the Data and Biospecimen Access Committee of the H3Africa Consortium. ARK data is available in WIReDSpace repository on request (https://wiredspace.wits.ac.za/), https://doi.org/10.54223/uniwitwatersrand-10539-33016 35 Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication). The processed data generated in this study are provided in Supplementary Material. Permission was obtained to access the genotype and phenotype dataset for UKB (research project number: 63215). Publicly available databases include Teumer et al. in CKD-GEN consortium website: https://ckdgen.imbi.uni-freiburg.de/.
Ethics statement
The studies involving humans were approved by Human Research Ethics Committee (Medical), University of the Witwatersrand/South Africa (M121029; M170880; M160939), Centre Muraz Institutional Ethics Committee/Burkina Faso (015-2014/CE- CM), National Ethics Committee For Health Research/Burkina Faso, Ghana Health Service Ethics Review Committee (ID No: GHS-ERC:05/05/2015), AMREF Health Ethics and Scientific Review Committee in Kenya (approval no. P114/2014), the Navrongo Institutional Review Board (ID No: NHRCIRB178). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
J-TB: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Supervision, Validation, Visualization, Writing–original draft, Writing–review and editing. WC: Writing–review and editing. PB: Validation, Writing–review and editing. MG: Writing–review and editing. GA: Writing–review and editing. LM: Writing–review and editing. HS: Writing–review and editing. ST: Writing–review and editing. GA: Writing–review and editing. FM: Writing–review and editing. SH: Writing–review and editing. AM: Conceptualization, Methodology, Supervision, Writing–original draft, Writing–review and editing. JF: Conceptualization, Methodology, Supervision, Writing–original draft, Writing–review and editing. MR: Conceptualization, Funding acquisition, Resources, Supervision, Writing–original draft, Writing–review and editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This study was funded by the National Institutes of Health (NIH) through the H3Africa AWI-Gen project (NIH grant number U54HG006938). AWI-Gen is supported by the National Human Genome Research Institute (NHGRI), Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD), and Office of the Director (OD) at the National Institutes of Health. PB is funded by the National Research Foundation/The World Academy of Sciences “African Renaissance Doctoral Fellowship” (Grant no. 100004). J-TB is supported by grants from the National Human Genome Research Institute (U54HG006938) as part of the H3A Consortium (AWI-Gen); and the Science For African Foundation—REACCT-CAN Grant (Del-22-008). MR is the South African Research Chair in Genomics and Bioinformatics of African populations hosted by the University of the Witwatersrand (SARChI), funded by the South African Department of Science and Innovation, and administered by the National Research Foundation. WC is supported by the Research Networks for Health Innovations in sub-Saharan Africa Funding Initiative of the German Federal Ministry of Education and Research (RHISSA) Grant and the Science For African Foundation—REACCT-CAN Grant (Del-22-008). The ARK study was jointly funded by the South African MRC, MRC UK (via the Newton Fund), and GSK Africa Non-Communicable Disease Open Lab (via a supporting grant; project number 074). The views expressed in this manuscript do not necessarily reflect the views of the funders.
Acknowledgments
We acknowledge the sterling contributions of our field workers, phlebotomists, laboratory scientists, administrators, data personnel, and all other staff who contributed to the AWI-Gen and ARK data and sample collection, processing, storage, and shipping, and the participants who generously agreed to be part of the studies. The UK Biobank kindly shared data under the UK Biobank Resource Application Number 63215 and approved submission of the paper.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The reviewer HQ declared a past collaboration with the authors MR to the handling editor.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2024.1372042/full#supplementary-material
References
Adam, Y., Sadeeq, S., Kumuthini, J., Ajayi, O., Wells, G., Solomon, R., et al. (2022). Polygenic risk score in african populations: progress and challenges. F1000Res 11, 175. doi:10.12688/f1000research.76218.2
Alexander, D. H., Novembre, J., and Lange, K. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664. doi:10.1101/gr.094052.109
Ali, S. A., Soo, C., Agongo, G., Alberts, M., Amenga-Etego, L., Boua, R. P., et al. (2018). Genomic and environmental risk factors for cardiometabolic diseases in Africa: methods used for Phase 1 of the AWI-Gen population cross-sectional study. Glob. Health Action 11, 1507133. doi:10.1080/16549716.2018.1507133
Auton, A., Abecasis, G. R., Altshuler, D. M., Durbin, R. M., Abecasis, G. R., Bentley, D. R., et al. (2015). A global reference for human genetic variation. Nature 526, 68–74. doi:10.1038/nature15393
Baichoo, S., Souilmi, Y., Panji, S., Botha, G., Meintjes, A., Hazelhurst, S., et al. (2018). Developing reproducible bioinformatics analysis workflows for heterogeneous computing environments to support African genomics. BMC Bioinforma. 19, 457. doi:10.1186/s12859-018-2446-1
Benner, C., Spencer, C. C. A., Havulinna, A. S., Salomaa, V., Ripatti, S., and Pirinen, M. (2016). FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinforma. Oxf. Engl. 32, 1493–1501. doi:10.1093/bioinformatics/btw018
Böger, C. A., Chen, M.-H., Tin, A., Olden, M., Köttgen, A., de Boer, I. H., et al. (2011). CUBN is a gene locus for albuminuria. J. Am. Soc. Nephrol. JASN 22, 555–570. doi:10.1681/ASN.2010060598
Bonomo, J. A., Guan, M., Ng, M. C. Y., Palmer, N. D., Hicks, P. J., Keaton, J. M., et al. (2014). The ras responsive transcription factor RREB1 is a novel candidate gene for type 2 diabetes associated end-stage kidney disease. Hum. Mol. Genet. 23, 6441–6447. doi:10.1093/hmg/ddu362
Borenstein, M., Hedges, L. V., Higgins, J. P. T., and Rothstein, H. R. (2010). A basic introduction to fixed-effect and random-effects models for meta-analysis. Res. Synthesis Methods 1, 97–111. doi:10.1002/jrsm.12
Brandenburg, J.-T., Clark, L., Botha, G., Panji, S., Baichoo, S., Fields, C., et al. (2022a). H3AGWAS: a portable workflow for genome wide association studies. BMC Bioinforma. 23, 498. doi:10.1186/s12859-022-05034-w
Brandenburg, J.-T., Govender, M. A., Winkler, C. A., Boua, P. R., Agongo, G., Fabian, J., et al. (2022b). Apolipoprotein L1 high-risk genotypes and albuminuria in sub-saharan african populations. Clin. J. Am. Soc. Nephrol. 17, 798–808. doi:10.2215/CJN.14321121
Casanova, F., Tyrrell, J., Beaumont, R. N., Ji, Y., Jones, S. E., Hattersley, A. T., et al. (2019). A genome-wide association study implicates multiple mechanisms influencing raised urinary albumin–creatinine ratio. Hum. Mol. Genet. 28, 4197–4207. doi:10.1093/hmg/ddz243
Chang, C. C., Chow, C. C., Tellier, L. C., Vattikuti, S., Purcell, S. M., and Lee, J. J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7. doi:10.1186/s13742-015-0047-8
Choudhury, A., Aron, S., Botigué, L. R., Sengupta, D., Botha, G., Bensellak, T., et al. (2020). High-depth African genomes inform human migration and health. Nature 586, 741–748. doi:10.1038/s41586-020-2859-7
Choudhury, A., Brandenburg, J.-T., Chikowore, T., Sengupta, D., Boua, P. R., Crowther, N. J., et al. (2022). Meta-analysis of sub-Saharan African studies provides insights into genetic architecture of lipid traits. Nat. Commun. 13, 2578. doi:10.1038/s41467-022-30098-w
Craik, A., Gondwe, M., Mayindi, N., Chipungu, S., Khoza, B., Gmez-Oliv, X., et al. (2023). Forgotten but not gone in rural South Africa: urinary schistosomiasis and implications for chronic kidney disease screening in endemic countries. Wellcome Open Res. 8, 68. doi:10.12688/wellcomeopenres.18650.3
Dendooven, A., van Oostrom, O., van der Giezen, D. M., Leeuwis, J. W., Snijckers, C., Joles, J. A., et al. (2011). Loss of endogenous bone morphogenetic protein-6 aggravates renal fibrosis. Am. J. Pathol. 178, 1069–1079. doi:10.1016/j.ajpath.2010.12.005
Eddy, S., Mariani, L. H., and Kretzler, M. (2020). Integrated multi-omics approaches to improve classification of chronic kidney disease. Nat. Rev. Nephrol. 16, 657–668. doi:10.1038/s41581-020-0286-5
Fabian, J., Gondwe, M., Mayindi, N., Chipungu, S., Khoza, B., Gaylard, P., et al. (2022). Chronic kidney disease (CKD) and associated risk in rural South Africa: a population-based cohort study. Wellcome Open Res. 7, 236. doi:10.12688/wellcomeopenres.18016.2
Fatumo, S., Chikowore, T., Kalyesubula, R., Nsubuga, R., Asiki, G., Nashiru, O., et al. (2020). Discovery and fine-mapping of kidney function loci in first genome-wide association study in Africans. doi:10.1101/2020.06.09.142463
Friedman, D. J., and Pollak, M. R. (2016). Apolipoprotein L1 and kidney disease in african Americans. Trends Endocrinol. metabolism TEM 27, 204–215. doi:10.1016/j.tem.2016.02.002
Gbd Chronic Kidney Disease Collaboration, B., Purcell, C. A., Levey, A. S., Smith, M., Abdoli, A., Abebe, M., et al. (2020). Global, regional, and national burden of chronic kidney disease, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet London, Engl. 395, 709–733. doi:10.1016/S0140-6736(20)30045-3
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C. A., and Smoller, J. W. (2019). Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776. doi:10.1038/s41467-019-09718-5
George, J. A., Brandenburg, J.-T., Fabian, J., Crowther, N. J., Agongo, G., Alberts, M., et al. (2019). Kidney damage and associated risk factors in rural and urban sub-Saharan Africa (AWI-Gen): a cross-sectional population study. Lancet Glob. Health 7, e1632–e1643. doi:10.1016/S2214-109X(19)30443-7
Gomez, F., Hirbo, J., and Tishkoff, S. A. (2014). Genetic variation and adaptation in Africa: implications for human evolution and disease. Cold Spring Harb. Perspect. Biol. 6, a008524. doi:10.1101/cshperspect.a008524
Gorski, M., van der Most, P. J., Teumer, A., Chu, A. Y., Li, M., Mijatovic, V., et al. (2017). 1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function. Sci. Rep. 7, 45040. doi:10.1038/srep45040
Gurdasani, D., Carstensen, T., Fatumo, S., Chen, G., Franklin, C. S., Prado-Martinez, J., et al. (2019). Uganda genome resource enables insights into population history and genomic discovery in Africa. Cell. 179, 984–1002. doi:10.1016/j.cell.2019.10.004
Gurdasani, D., Carstensen, T., Tekola-Ayele, F., Pagani, L., Tachmazidou, I., Hatzikotoulas, K., et al. (2015). The african genome variation project shapes medical genetics in Africa. Nature 517, 327–332. doi:10.1038/nature13997
Haas, M. E., Aragam, K. G., Emdin, C. A., Bick, A. G., Hemani, G., Davey Smith, G., et al. (2018). Genetic association of albuminuria with cardiometabolic disease and blood pressure. Am. J. Hum. Genet. 103, 461–473. doi:10.1016/j.ajhg.2018.08.004
Han, B., and Eskin, E. (2011). Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies. Am. J. Hum. Genet. 88, 586–598. doi:10.1016/j.ajhg.2011.04.014
Han, S. K., McNulty, M. T., Benway, C. J., Wen, P., Greenberg, A., Onuchic-Whitford, A. C., et al. (2023). Mapping genomic regulation of kidney disease and traits through high-resolution and interpretable eQTLs. Nat. Commun. 14, 2229. doi:10.1038/s41467-023-37691-7
Hellwege, J. N., Velez Edwards, D. R., Giri, A., Qiu, C., Park, J., Torstenson, E. S., et al. (2019). Mapping eGFR loci to the renal transcriptome and phenome in the VA Million Veteran Program. Nat. Commun. 10, 3842. doi:10.1038/s41467-019-11704-w
Jenkins, R. H., and Fraser, D. J. (2011). BMP-6 emerges as a potential major regulator of fibrosis in the kidney. Am. J. Pathol. 178, 964–965. doi:10.1016/j.ajpath.2010.12.010
Kachuri, L., Chatterjee, N., Hirbo, J., Schaid, D. J., Martin, I., Kullo, I. J., et al. (2023). Principles and methods for transferring polygenic risk scores across global populations. Nat. Rev. Genet. 25, 8–25. doi:10.1038/s41576-023-00637-2
Kalyesubula, R., Fabian, J., Nakanga, W., Newton, R., Ssebunnya, B., Prynn, J., et al. (2020). How to estimate glomerular filtration rate in sub-Saharan Africa: design and methods of the African Research into Kidney Diseases (ARK) study. BMC Nephrol. 21, 20. doi:10.1186/s12882-020-1688-0
Kalyesubula, R., Hau, J. P., Asiki, G., Ssebunya, B., Kusemererwa, S., Seeley, J., et al. (2018). Impaired renal function in a rural Ugandan population cohort. Wellcome Open Res. 3, 149. doi:10.12688/wellcomeopenres.14863.3
Kamiza, A. B., Toure, S. M., Vujkovic, M., Machipisa, T., Soremekun, O. S., Kintu, C., et al. (2022). Transferability of genetic risk scores in African populations. Nat. Med. 28, 1163–1166. doi:10.1038/s41591-022-01835-x
Kamiza, A. B., Touré, S. M., Zhou, F., Soremekun, O., Cissé, C., Wélé, M., et al. (2023). Multi-trait discovery and fine-mapping of lipid loci in 125,000 individuals of African ancestry. Nat. Commun. 14, 5403. doi:10.1038/s41467-023-41271-0
Kaze, A. D., Ilori, T., Jaar, B. G., and Echouffo-Tcheugui, J. B. (2018). Burden of chronic kidney disease on the African continent: a systematic review and meta-analysis. BMC Nephrol. 19, 125. doi:10.1186/s12882-018-0930-5
Köttgen, A., and Pattaro, C. (2020). The CKDGen Consortium: ten years of insights into the genetic basis of kidney function. Kidney Int. 97, 236–242. doi:10.1016/j.kint.2019.10.027
Kovesdy, C. P. (2022). Epidemiology of chronic kidney disease: an update 2022. Kidney Int. Suppl. 12 12, 7–11. doi:10.1016/j.kisu.2021.11.003
Kuchenbaecker, K., Telkar, N., Reiker, T., Walters, R. G., Lin, K., Eriksson, A., et al. (2019). The transferability of lipid loci across African, Asian and European cohorts. Nat. Commun. 10, 4330. doi:10.1038/s41467-019-12026-7
Lee, J., Lee, Y., Park, B., Won, S., Han, J. S., and Heo, N. J. (2018). Genome-wide association analysis identifies multiple loci associated with kidney disease-related traits in Korean populations. PLOS ONE 13, e0194044. doi:10.1371/journal.pone.0194044
Limou, S., Nelson, G. W., Kopp, J. B., and Winkler, C. A. (2014). APOL1 kidney risk alleles: population genetics and disease associations. Adv. Chronic Kidney Dis. 21, 426–433. doi:10.1053/j.ackd.2014.06.005
Lin, B. M., Nadkarni, G. N., Tao, R., Graff, M., Fornage, M., Buyske, S., et al. (2019). Genetics of chronic kidney disease stages across ancestries: the PAGE study. Front. Genet. 10, 494. doi:10.3389/fgene.2019.00494
Loh, P.-R., Danecek, P., Palamara, P. F., Fuchsberger, C., A Reshef, Y., K Finucane, H., et al. (2016). Reference-based phasing using the haplotype reference consortium panel. Nat. Genet. 48, 1443–1448. doi:10.1038/ng.3679
Mahajan, A., Rodan, A. R., Le, T. H., Gaulton, K. J., Haessler, J., Stilp, A. M., et al. (2016). Trans-ethnic fine mapping highlights kidney-function genes linked to salt sensitivity. Am. J. Hum. Genet. 99, 636–646. doi:10.1016/j.ajhg.2016.07.012
Majara, L., Kalungi, A., Koen, N., Tsuo, K., Wang, Y., Gupta, R., et al. (2023). Low and differential polygenic score generalizability among African populations due largely to genetic diversity. Hum. Genet. Genomics Adv. 4, 100184. doi:10.1016/j.xhgg.2023.100184
Marigorta, U. M., Rodríguez, J. A., Gibson, G., and Navarro, A. (2018). Replicability and prediction: lessons and challenges from GWAS. Trends Genet. 34, 504–517. doi:10.1016/j.tig.2018.03.005
Martin, A. R., Gignoux, C. R., Walters, R. K., Wojcik, G. L., Neale, B. M., Gravel, S., et al. (2017). Human demographic history impacts genetic risk prediction across diverse populations. Am. J. Hum. Genet. 100, 635–649. doi:10.1016/j.ajhg.2017.03.004
Masimango, M. I., Jadoul, M., Binns-Roemer, E. A., David, V. A., Sumaili, E. K., Winkler, C. A., et al. (2022). APOL1 renal risk variants and sickle cell trait associations with reduced kidney function in a large Congolese population-based study. Kidney Int. Rep. 7, 474–482. doi:10.1016/j.ekir.2021.09.018
Morris, A. P., Le, T. H., Wu, H., Akbarov, A., van der Most, P. J., Hemani, G., et al. (2019). Trans-ethnic kidney function association study reveals putative causal genes and effects on kidney-specific disease aetiologies. Nat. Commun. 10, 29. doi:10.1038/s41467-018-07867-7
Muiru, A. N., Charlebois, E. D., Balzer, L. B., Kwarisiima, D., Elly, A., Black, D., et al. (2020). The epidemiology of chronic kidney disease (CKD) in rural East Africa: a population-based study. PLoS One 15, e0229649. doi:10.1371/journal.pone.0229649
Naik, R. P., Derebail, V. K., Grams, M. E., Franceschini, N., Auer, P. L., Peloso, G. M., et al. (2014). Association of sickle cell trait with chronic kidney disease and albuminuria in african Americans. JAMA 312, 2115–2125. doi:10.1001/jama.2014.15063
Nakanga, W. P., Prynn, J. E., Banda, L., Kalyesubula, R., Tomlinson, L. A., Nyirenda, M., et al. (2019). Prevalence of impaired renal function among rural and urban populations: findings of a cross-sectional study in Malawi. Wellcome Open Res. 4, 92. doi:10.12688/wellcomeopenres.15255.1
Nikolakopoulou, A., Mavridis, D., and Salanti, G. (2014). How to interpret meta-analysis models: fixed effect and random effects meta-analyses. Evid. Based Ment. Health 17, 64. doi:10.1136/eb-2014-101794
Paria, S. S., Rahman, S. R., and Adhikari, K. (2022). fastman: a fast algorithm for visualizing GWAS results using Manhattan and Q-Q plots. bioRxiv. doi:10.1101/2022.04.19.488738
Pattaro, C., Köttgen, A., Teumer, A., Garnaas, M., Böger, C. A., Fuchsberger, C., et al. (2012). Genome-wide association and functional follow-up reveals new loci for kidney function. PLoS Genet. 8, e1002584. doi:10.1371/journal.pgen.1002584
Pattaro, C., Teumer, A., Gorski, M., Chu, A. Y., Li, M., Mijatovic, V., et al. (2016). Genetic associations at 53 loci highlight cell types and biological pathways relevant for kidney function. Nat. Commun. 7, 10023. doi:10.1038/ncomms10023
Pereira, L., Mutesa, L., Tindana, P., and Ramsay, M. (2021). African genetic diversity and adaptation inform a precision medicine agenda. Nat. Rev. Genet. 22, 284–306. doi:10.1038/s41576-020-00306-8
Price, A. L., Patterson, N. J., Plenge, R. M., Weinblatt, M. E., Shadick, N. A., and Reich, D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909. doi:10.1038/ng1847
Pruim, R. J., Welch, R. P., Sanna, S., Teslovich, T. M., Chines, P. S., Gliedt, T. P., et al. (2010). LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337. doi:10.1093/bioinformatics/btq419
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575. doi:10.1086/519795
Ramsay, M., Crowther, N., Tambo, E., Agongo, G., Baloyi, V., Dikotope, S., et al. (2016). H3Africa AWI-Gen Collaborative Centre: a resource to study the interplay between genomic and environmental risk factors for cardiometabolic diseases in four sub-Saharan African countries. Glob. health, Epidemiol. Genomics 1, e20. doi:10.1017/gheg.2016.17
Sudmant, P. H., Rausch, T., Gardner, E. J., Handsaker, R. E., Abyzov, A., Huddleston, J., et al. (2015). An integrated map of structural variation in 2,504 human genomes. Nature 526, 75–81. doi:10.1038/nature15394
Teumer, A., Li, Y., Ghasemi, S., Prins, B. P., Wuttke, M., Hermle, T., et al. (2019). Genome-wide association meta-analyses and fine-mapping elucidate pathways influencing albuminuria. Nat. Commun. 10, 4130. doi:10.1038/s41467-019-11576-0
Teumer, A., Tin, A., Sorice, R., Gorski, M., Yeo, N. C., Chu, A. Y., et al. (2016). Genome-wide association studies identify genetic loci associated with albuminuria in diabetes. Diabetes 65, 803–817. doi:10.2337/db15-1313
Tin, A., and Köttgen, A. (2020). Genome-wide association studies of CKD and related traits. Clin. J. Am. Soc. Nephrol. CJN 15, 1643–1656. doi:10.2215/CJN.00020120
Wang, K., Li, M., and Hakonarson, H. (2010). ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164. doi:10.1093/nar/gkq603
Wuttke, M., Li, Y., Li, M., Sieber, K. B., Feitosa, M. F., Gorski, M., et al. (2019). A catalog of genetic loci associated with kidney function from analyses of a million individuals. Nat. Genet. 51, 957–972. doi:10.1038/s41588-019-0407-x
Yang, J., Ferreira, T., Morris, A. P., Medland, S. E., Madden, P. A. F., Heath, A. C., et al. (2012). Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–S3. doi:10.1038/ng.2213
Yang, J., Lee, S. H., Goddard, M. E., and Visscher, P. M. (2011). GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82. doi:10.1016/j.ajhg.2010.11.011
Zanetti, D., Rao, A., Gustafsson, S., Assimes, T. L., Montgomery, S. B., and Ingelsson, E. (2019). Identification of 22 novel loci associated with urinary biomarkers of albumin, sodium, and potassium excretion. Kidney Int. 95, 1197–1208. doi:10.1016/J.KINT.2018.12.017
Zhou, X., and Stephens, M. (2012). Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 44, 821–824. doi:10.1038/ng.2310
Keywords: chronic kidney disease, GWAS, UACR, African diversity, Polygenic score
Citation: Brandenburg J-T, Chen WC, Boua PR, Govender MA, Agongo G, Micklesfield LK, Sorgho H, Tollman S, Asiki G, Mashinya F, Hazelhurst S, Morris AP, Fabian J and Ramsay M (2024) Genetic association and transferability for urinary albumin-creatinine ratio as a marker of kidney disease in four Sub-Saharan African populations and non-continental individuals of African ancestry. Front. Genet. 15:1372042. doi: 10.3389/fgene.2024.1372042
Received: 18 January 2024; Accepted: 12 April 2024;
Published: 15 May 2024.
Edited by:
Segun Fatumo, University of London, United KingdomReviewed by:
Hui-Qi Qu, Children’s Hospital of Philadelphia, United StatesOpeyemi Soremekun, MRC/UVRI and MRC/UVRI at London School of Health and Tropical Medicine, Uganda
Copyright © 2024 Brandenburg, Chen, Boua, Govender, Agongo, Micklesfield, Sorgho, Tollman, Asiki, Mashinya, Hazelhurst, Morris, Fabian and Ramsay. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jean-Tristan Brandenburg, amVhbi10cmlzdGFuLmJyYW5kZW5idXJnQHdpdHMuYWMuemE=; Michèle Ramsay, bWljaGVsZS5yYW1zYXlAd2l0cy5hYy56YQ==
†These authors have contributed equally to this work and share senior authorship