- 1DIADE, Université de Montpellier, IRD, CIRAD, Montpellier, France
- 2LNRPV, Institut Sénégalais de Recherches Agricoles (ISRA), Dakar, Senegal
- 3Laboratoire Mixte International LAPSE, Campus de Bel Air, route des Hydrocarbures, Dakar, Senegal
- 4CERAAS, Institut Sénégalais de Recherches Agricoles, Thiès, Senegal
- 5Human Evolution, Department of Organismal Biology, Uppsala University, Uppsala, Sweden
- 6CIRAD, UMR AGAP Institut, Montpellier, France
- 7UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
- 8DIADE, Institut de Recherche pour le Développement (IRD), Niamey, Niger
Pearl millet is among the top three-cereal production in one of the most climate vulnerable regions, sub-Saharan Africa. Its Sahelian origin makes it adapted to grow in poor sandy soils under low soil water regimes. Pearl millet is thus considered today as one of the most interesting crops to face the global warming. Flowering time, a trait highly correlated with latitude, is one of the key traits that could be modulated to face future global changes. West African pearl millet landraces, can be grouped into early- (EF) and late-flowering (LF) varieties, each flowering group playing a specific role in the functioning and resilience of Sahelian smallholders. The aim of this study was thus to detect genes linked to flowering but also linked to relevant traits within each flowering group. We thus investigated genomic and phenotypic diversity in 109 pearl millet landrace accessions, i.e., 66 early-flowering and 43 late-flowering, grown in the groundnut basin, the first area of rainfed agriculture in Senegal dominated by dry cereals (millet, maize, and sorghum) and legumes (groundnuts, cowpeas). We were able to confirm the role of PhyC gene in pearl millet flowering and identify several other genes that appear to be as much as important, such as FSR12 and HAC1. HAC1 and two other genes appear to be part of QTLs previously identified and deserve further investigation. At the same time, we were able to highlight a several genes and variants that could contribute to the improvement of pearl millet yield, especially since their impact was demonstrated across flowering cycles.
Introduction
By the end of this century, the frequency of heat waves in the Sahelian region will increase, with temperatures exceeding the maximums observed over the past century (Battisti and Naylor, 2009; Russo et al., 2016). Global warming has already led to significant crop yield losses in Africa—up to 20% for pearl millet and 15% for sorghum (Sultan et al., 2019). This trend is likely to accelerate with a further 8% yield loss or more predicted in Africa by the 2050s (Roudier et al., 2011; Knox et al., 2012). The main staple food for communities in sub-Saharan Africa, pearl millet (Cenchrus americanus (L.) Morrone syn. Pennisetum glaucum (L.) R. Br.) is the top-ranked crop in terms of cultivated area in Nigeria (2,000,000 ha), Burkina Faso (1,183,792 ha), Mali (2,164,374 ha), and Senegal (1,023,065 ha) in 2020 (FAOStat, 2016) due to its high tolerance to extreme climate conditions. However, its production remains low, with an average grain yield of 900 kg/ha (Pucher et al., 2015). In the wake of drought periods in the 1970s and 1980s, West African breeding efforts have mostly been focused on developing short-cycle and non-photoperiodic varieties (Matlon, 1990; Niangado, 2001; Evenson and Gollin, 2003). Yet, West African smallholders rely on both early-and late-flowering types (De Rouw and Winkel, 1998; De Rouw, 2004). Despite the drought episodes, late-flowering landraces have been conserved or reintroduced from the 2000s with the trend towards increased rainfall (Lalou et al., 2019). In terms of performance, early-flowering landraces ensure minimum production in the event of a short raining season, while late-flowering landraces benefit from late rains but others specificities such as adaptation to different types of soil, different culinary uses and secondary uses, e.g., fodder, also explaining why farmers conserved both flowering regimes. It has been demonstrated that flowering time highly contributes to the landraces genomic vulnerability to future climatic conditions and thus constitute a major trait for pearl millet adaptation (Rhoné et al., 2020). The pearl millet flowering cycle ranges from very early (<40 days) to very late (>120 days, Haussmann et al., 2006; Upadhyaya et al., 2017). Variations in flowering time are associated with different climatic conditions of cropping and the pattern usually follows a latitudinal gradient (Haussmann et al., 2006; Pucher et al., 2015). While a landrace can be classified as early-or late-flowering, there is still a great deal of heterogeneity. For instance, within landrace full-sib families, differences between the earliest and the latest flowering accessions were reported to be 16 days at minimum and 39 days at maximum; a flowering variability that appears to be genetically controlled (Haussmann et al., 2007, 2012). Landrace variability has already been exploited by farmers to shorten flowering cycles, as a response to the 1970s and 1980s drought episodes, while conserving their varieties (Vigouroux et al., 2011; Dussert et al., 2015). Substantial effort has been made in the past decades to investigate genes associated with flowering cycle in pearl millet (Saïdou et al., 2009; Vigouroux et al., 2011; Lakis et al., 2012; Kumar et al., 2017; Diack et al., 2020). To date, only the role of PgPHYC gene has been really validated and some other QTLs were identified (Saïdou et al., 2009; Kumar et al., 2017). Which other genes are involved in the flowering variability of pearl millet landraces remains an open question.
Although the difference in flowering time is one of the primary characteristics between the two types, many differences in other important agronomic traits distinguish the two groups and lead to different uses by farmers as exposed earlier. Late-flowering landraces also have more productive tillers, high-tillering and small panicle, characteristics that can improve drought avoidance strategy (Van Oosterom et al., 2006; Yadav, 2008). Therefore, pearl millet landraces display both individual and populational variability, an asset that needs to be valorized to ensure productivity and stability with the upcoming increase in climate uncertainty in Africa.
We previously used the core collection from Senegal to identify genes involved in the agromorphological differentiation of the two flowering groups (Diack et al., 2020). This study identified a number of genetic markers associated with biomass, but did not identify genes clearly involved in flowering. Here, we adopt a different strategy, by sampling early-and late-flowering accessions that are grown in the same locations, in the groundnut growing area (13–15°N) where they can be cultivated in mixtures allowing gene flow between the two groups (Lalou et al., 2019). By doing so, we minimized the drift effect and maximize the probability of highlighting the regions involved in the differentiation between the two groups. With the objective to identify key genes underlying the phenotypic differentiation between the two flowering groups, we implemented two complementary approaches: (1) genome-wide selection scan to identify key variants showing the highest genetic differentiation between early-and late-flowering landraces, and (2) genome-wide association analysis (GWAS) to detect relevant variants segregating in both groups. This combination of approaches allowed us to identify genes that are strongly involved in flowering including PgPHYC and other promising candidates such as FSR12 and HAC1; but also genes related to yield which magnitude of the effects can vary according to the genetic background in which they evolve.
Materials and methods
Plant material
At West African scale, three types of varieties according to the cycle length are usually described in pearl millet: early flowering (EF) varieties (70–90 days from sowing to harvest), semi-late flowering (SF) varieties (90–120 days) and late flowering (LF) varieties (120–180 days) (Bezançon et al., 2009). In this study, we sampled individuals from EF varieties, sensitive to photoperiod and adapted to drier zones in Senegal (low rainfall 350–600 mm), and LF varieties, less photoperiod-sensitive type adapted to wetter Sudanian zone (high rainfall 900–1,200 mm) (Niangado, 2001).
With the aim of revealing genetic differences between EF and LF pearl millet, we sampled the two flowering groups in Niakhar, an overlapping of the cultivation areas of the two varietal types. This maximizes the probability to detect genes linked to the agro-morphometric characteristics of the two functional groups. Based on information gathered in interviews with farmers, we collected only landraces that had been cultivated in the region for at least 5 years. A total of 109 pearl millet landrace accessions, including 66 early-flowering ‘and 43 late-flowering ‘Sanio’, were collected in four villages in Senegal (Figure 1).
Phenotyping
Three field experiments were performed in the 2016 rainy season in Senegal and in 2017 in both Niger and Senegal. In Senegal, trials were conducted at the Institut Sénégalais de Recherche Agricole (ISRA) field station in Bambey (14°70′N, −16°47′W). In Niger, the trial was conducted at the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) field station in Sadoré (13°14′N, 2°17′E). In both areas, the average monthly temperature in June are around 30–35°C during the rainy season, and 20–40°C during the dry season. In the Sahel, millet is generally grown on deep, sandy, red soils (tropical ferruginous soils with little leaching), containing more than 65% sand and less than 18% clay (Swindale, 1982). The trials included three repetitions fully randomized. Eight and 10 individuals per accession for each repetition were sown in Niger and Senegal, respectively. Spacing between each hill was 0.9 m × 0.9 m in the Bambey trial, and 1 m × 0.8 m in the Sadoré trial. To avoid side effects, two rows of cultivated pearl millet were used to border the plots. The sowing dates were 2 August 2016, 21 July 2017 in Bambey and 17 July 2017 in Sadoré. The trials were conducted under rainfall conditions with supplementary sprinkler irrigation when necessary. The Eperon fungicide (3.88% metalaxyl-M + 64% mancozeb) was used at the seedling stage to prevent mildew attacks. Thinning was done to two plants per hill 2 weeks after sowing. All trials were fertilized using the micro-dosing technique (6 g NPK – 15–15–15/hill, corresponding to 93 kg ha−1) applied at planting, followed by a 50 kg ha−1 urea topdressing after thinning. A total of 9,290 plants were phenotyped for 11 traits associated with plant morphology and fitness: heading date (i.e., number of days from sowing to heading), main stem length, main stem diameter, main panicle length, main panicle diameter, main panicle weight, total seed weight and 1,000 seed weight of the main panicle, total number of tillers and total number of productive and non-productive tillers. For each repetition, the mean trait value was calculated from 6.5 individuals on average after elimination of the minimal and maximal measures.
Broad-sense heritability estimates were calculated as:
with being the genotypic variance; , the genotype by environment (G X E) variance, and , the residual error variance for R replicates and E environments. Analyses were performed using the mmer function of the sommer package (Covarrubias-Pazaran, 2016, 2018) in the R environment (Team R Development, 2008). Average values across the nine repetitions for 11 traits of 109 accessions were used to perform a centered and scaled principal component analysis (PCA) and to test for phenotypic differences between EF and LF landraces using the R software ade4 package (Dray and Dufour, 2007).
Genotyping
Exome capture
An exome capture approach was used to target gene-bearing regions of the pearl millet genome. NGS libraries were prepared as previously described (Mariac et al., 2014). Enrichment by capture was performed as recommended by the provider using myBaits kits (Arbor Biosciences) but a single dose of bait was used to enrich the bulk of 48 normalized libraries. Probes were 80 bp long with 80 bp spacing (0.5 x tilling). Probes were designed on the 37,617 mRNA sequences of the pearl millet genome (version v1.0) using the annotation file (pmassemblyv1.020140816.gff available at http://dx.doi.org/10.5524/100192). Contig and scaffold sequences were excluded. For each mRNA target, only the first 1,000 bp of mRNA (starting from the 5′ of the annotation) was used to design up to seven capture probes. A final set of 152,619 probes from 31,895 loci was obtained. The total baited length was 12,209,520 bp, which covered 13.4% of the initial target loci. Enriched libraries were paired-end sequenced (2 × 150 bp) on an Illumina Hiseq2000 platform at Genotoul, Toulouse, France.
SNP calling, SNP filtering, and annotation
Demultiplexing based on 6 bp barcodes was performed using the freely available PYTHON script DEMULADAPT (https://github.com/Maillol/demultadapt), using a zero mismatch threshold. Adapters were then removed with Cutadapt 1.2.1 software (Martin, 2011) using a quality cut-off of 20, a minimum overlap of 7 and a minimum remaining length sequence of 35 bp. Reads with a mean quality of under 30 were discarded thereafter using a freely available PERL script (https://github.com/SouthGreenPlatform/arcad-hts/blob/master/scripts/arcad_hts_2_Filter_Fastq_On_Mean_Quality.pl). Reads were mapped to the pearl millet genome (Varshney et al., 2017) with BWA mem v0.7.2 (Li and Durbin, 2009). Only properly paired reads were kept. We used the rmdup module from Samtools 0.1.17 (Li et al., 2009) to remove duplicate reads. RealignerTargetCreator and IndelRealigner from GATK 2.4.7 (DePristo et al., 2011) were used to realign indels. SNPs and genotypes were called using UnifiedGenotyper and the resulting VCF file was filtered for biallelic SNPs only, clustered SNPs (no more than three SNPs per 10-bp window), and mapping quality (MQ0 ≥ 4 && ((MQ0/(1.0*DP)) > 0.1). Obtained raw SNPs were filtered out for low coverage (−min-meanDP ≤4) or excessively high coverage (−min-meanDP ≥100); for low quality (QUAL <60); for quality by depth (QD <2.0); for mean quality (MQ ≤ 40.0); and for Fisher strand score (FS ≥ 60). Finally, SNPs with less than 50% missing data were kept. The final VCF file contained 196,581 SNPs with an average depth per site per individual of 11.057. The maximum individual missing rate was 12.4%. To annotate SNPs and predict their effects, the genome annotation files available at http://dx.doi.org/10.5524/100192 (Varshney et al., 2017) and SNPeff 4.3 (Cingolani et al., 2012) were used considering a maximal distance of 1,000 bp for the SNP to be associated with a gene (Supplementary Tables 1, 2). Based on the available annotation (Varshney et al., 2017), we retrieved, when possible, homologous genes in Arabidopsis thaliana and their annotation that were extracted from the TAIR database (https://www.arabidopsis.org/, Supplementary Table 6).
Genomic diversity and structure
Genetic diversity was estimated by calculating the observed heterozygosity (HOBS), expected heterozygosity (HEXP), inbreeding coefficient (FIS), and differentiation (FST; Weir, 1996) between flowering groups using a homemade script.
To evaluate the genetic structure of our dataset, we first ran a PCA using the R package ‘SNPRelate’ v.1.16.0 (Zheng et al., 2012). We further examined genomic clustering patterns using the sNMF function (Frichot et al., 2014) implemented from the R package ‘LEA’ v.3.1 (Frichot and François, 2015). The number of K clusters was allowed to vary between 1 and 10. The estimation procedure for each assumed K was replicated tenfold. The K value and its best run were retained based on the lowest cross-entropy value (Frichot and François, 2015).
Selection and phenotypic association
Genomic scans for outlier SNP detection
In order to detect outlier SNPs, two different genomic scans implemented in the R package ‘PCAdapt’ v.4.1.0 (Luu et al., 2017) and in BayeScan v.2.1 software (Foll and Gaggiotti, 2008) were performed. Both approaches are geared towards identifying SNPs with extreme genetic differentiation values between gene pools. PCAdapt performs a PCA and calculates, for each SNP, a statistic that measures the proportion of genetic variance explained by the first K principal components. SNP showing significantly higher loads on the K axis are pinpointed as outliers. BayeScan quantifies the biological processes (migration rates, drift) of a demographic history that would lead to the observed allelic frequency variation between populations. BayeScan analysis involves first defining a priori populations in order to estimate the allele frequencies of the ancestral population. Outlier SNPs are identified as SNPs that show between-population differentiation that cannot be explained by the modeled demographic history. Each method has its own assumptions and hypotheses and thus is not sensitive to the same biological features. Comparing the results of the two detection methods and keeping only SNPs detected by both can be an efficient way to limit false detections.
We applied the PCAdapt method to SNPs with a MAF higher than 5% (n = 96,801) and retained the first principal component (K = 1), which differentiated EF and LF accessions. We used the Mahalanobis distance test statistics and applied a false discovery rate (FDR) of 0.001 to identify candidate outlier SNPs. All methods used (PCA on phenotypic or genetic data; sNMF) assigned accessions to the same and expected flowering group. This EF and LF classification was thus used to apply the BayeScan method. We set the prior odds of the model with selection at 10,000, with a thinning interval of 20 and a FDR of 0.05. High prior odds were applied to reduce the false-positive rate but at the expense of missing true loci under selection (Foll and Gaggiotti, 2008).
Genome-wide association study
To gain further insight into the genetic variants involved in relevant phenotypes and their variability within each flowering, a genotype–phenotype association analysis of each SNP with the 11 agronomic traits was performed. Three different models for this analysis were used: (i) an efficient mixed-model association (EMMA) implemented in the R package ‘emma’ (Zhou and Stephens, 2014); (ii) a latent factor mixed model (LFMM) implemented in the R package ‘lfmm’ v2 (Caye and Francois, 2018); and (iii) the compressed MLM (CMLM) model implemented in the R package GAPIT v2016.03.01. Kang et al. (2008) developed an efficient mixed-model association (EMMA) method that includes an identity-by-state allele sharing kinship matrix to control for neutral genetic background. Latent factor methods such as LFMM (Frichot et al., 2013), the generic relationship matrix can be described by K latent factors (analogous to the principal components in PCA) and can account for more subtle population structures (de Villemereuil et al., 2014; Lotterhos and Whitlock, 2015). CMLM model was run by taking in consideration the kinship and the population structure matrices. For structure, a PCA with two principal components was used. SNPs with MAF below 0.05 were filtered and missing values were imputed using the available function in the LEA v3.1 package (Frichot and François, 2015; Gain and François, 2021). The three models (LFMM, EMMA, CMLM) were implemented for each of the nine repetitions (3 trials × 3 repetitions). The resulting p-values of each trial were combined using a Fisher’s combined probability test and associations were considered significant if the Fisher’s combined p-values were < 10−8. For each SNP, the allelic effect on the heading date was estimated. A mixed linear model (MLM) using the GAPIT R package (Lipka et al., 2012) including, as co-factors, either the kinship matrix obtained from EMMA or a matrix derived from the latent factor matrix estimated by LFMM was used. The following transformation was applied to derived a kinship-like matrix from the latent factors matrix:
where, U is the matrix of latent factors and n is the number of genotypes. We then extracted the effect estimated by the mixed model. For CMLM, the effects were directly extracted from the GAPIT implementation of the model, using the same kinship matrix and assuming two principal components of a PCA to describe genetic structure.
Results
Phenotypic diversity and clustering
Broad sens heritability for heading date was very high (0.99), as well as for panicle length (0.94). The lowest heritability values were found for the number of non-productive tillers (0.53) and seed weight (0.55, Table 1). All traits were significantly different between early-flowering (EF) and late-flowering (LF) accessions (Table 1; Wilcoxon test, p < 0.05). All but 10 pairwise correlations between traits were significant (Supplementary Table 3). Heading was positively correlated with stem size (r > 0.77) and number of tillers (r > 0.6). For EF accessions, heading occurred on average 52 ± 2.1 days after sowing, which was 30 days earlier than for LF accessions (82 ± 2.6; Table 1). This marked differentiation in cycle length measured in terms of average number of days to 50% flowering after sowing is observed in most studies (Ouendeba et al., 1995; Haussmann et al., 2007; Pucher et al., 2015; Sy et al., 2015). These two maturity categories are found for most varieties in West Africa, in other regions early maturity is less than 50 days from flowering (Lakis et al., 2011). EF accessions showed shorter stems (240.2 ± 12.2), fewer tillers (4.1 ± 0.6) but longer panicles (61.7 ± 7.5) than LF accessions with longer stems (277.8 ± 17.7), more tillers (5.3 ± 0.7) and shorter panicles (54.2 ± 4.5). These results are similar to those of Akanvou et al. (2012) and Dancette (1983) who indicate that LFs are taller. This small size of EFs could be responsible for their greater drought tolerance marked by significant differences in stomatal conductance mainly in leaves (Ghatak et al., 2016, 2021). In addition to these quantitative characters, the LFs spikes aristation is a qualitative character allowing to distinguish LFs to EFs. The principal component analysis (PCA; Figure 2; Supplementary Figure 1) also revealed a clear morphological distinction between EF and LF accessions as largely captured by PC1, which explained 41.5% of the inertia. As expected, the heading date highly contributed to the phenotypic differentiation (PC1: 15% and PC2: 6%; Table 1), but other traits also seemed important for differentiation between the two groups, such as the number of productive tillers (PC1: 17%), panicle weight (PC1: 7% and PC2: 18%) and panicle diameter (PC2: 23%).
Table 1. Phenotypic diversity Heritability estimates (±SD) and mean values of phenotype diversity (±SD) in early- (EF) and late-flowering (LF) pearl millet accessions are given.
Figure 2. Phenotypic differentiation for early- and late-flowering pearl millet accessions. Mean phenotypic measures across nine repetitions for 11 traits of 109 accessions were used to perform PCA. PC1, and PC2 explained 41.5 and 28% of the inertia, respectively. Early- (red) and late-flowering (black) accessions are well separate on the first PCA axis.
Genomic diversity and structure
After applying different VCF filters, a total of 196,581 variants were identified with an average depth per site per individual of 11.057. All variants were within or close to 20,126 genes (Supplementary Table 2). Among these SNPs, 96,881 SNPs had a minimum allele frequency higher than 5% (MAF > 0.05, Supplementary Table 4). The average genomic expected and observed heterozygosities seemed slightly higher in EF accessions (HEXP: 0.167; HOBS: 0.167) than in LF accessions (HEXP: 0.155; HOBS: 0.158). The mean genetic differentiation FST was low but significant at 0.048.
The first two principal components of the PCA obtained from the genomic dataset explained 5.7 and 1.3% of the inertia, respectively (Figure 3A). The first axis (PC1) separated EF from LF accessions while PC2 explained the diversity among LF accessions. The lowest cross entropy value for sNMF clustering analysis was obtained for K = 2 (Figure 3B), separating EF from LF accessions. All but one individual had a high membership coefficient (≥0.80) with respect to their corresponding flowering group. Overall, PCA and sNMF results were largely congruent, indicating that the phenotypic groups of EF and LF landraces corresponded to highly distinguishable genetic groups.
Figure 3. Genomic differentiation for early and late-flowering pearl millet accessions. (A) Principal component analysis (PCA) of pearl millet showing first and second principal components. (B) Genetic structure for K = 2 using sNMF. Each individual is represented by a vertical bar, partitioned into K segments representing the amount of assignment of its genome in K clusters identified by different colors. Red circles and bars correspond to EF accessions and black circles and bars characterize LF accessions.
Genomic scans for SNP outliers
The PCAdapt analysis highlighted 253 outlier SNPs, with 54% of them localized on chromosome 2 and 19% on chromosome 5 (Figure 4). BayeScan analysis identified 23 outlier SNPs, while they were all also detected in the PCAdapt analysis (Supplementary Figure 2). All common outlier SNPs were nearly fixed in one variety and absent in the other one (FST > 0.88; Table 2). A total of 21 of these 23 SNPs localized in 12 genes on chromosome 2. The two remaining SNPs were in one gene that was found on chromosome 5. 12 of the 21 SNPs on chromosome 2 co-localized in a single 45 kb window (from 6,908,254 to 6,951,748). Among the 13 genes detected by both selection methods, four were linked to flowering time variation in pearl millet or in other species: PHYTOCHROME C (PhyC, Pgl_GLEAN_10016106); HEADING DATE 16/EARLY FLOWERING 1 (Hd16/EF1, Pgl_GLEAN_10033790); FAR1-RELATED SEQUENCE 12 (FRS12, Pgl_GLEAN_10004675) and a HISTONE ACETYL TRANSFERASE 1/HAC703 (HAC1/HAC703, Pgl_GLEAN_10020525).
Figure 4. Manhattan plots of p-values for PCAdapt analysis along the genome. The analysis was performed on the 96,881 SNPs with a MAF ≥ 5%. The red dashed line represents the 0.001 false discovery rate (FDR) value considered for significance.
Table 2. List of the 23 SNPs detected by both PCAdapt and BayeScan methods. Reference and alternate alleles are provided.
Genomic association analyses
We applied a stringent threshold of 10−8 to declare that a SNP was significantly associated with a trait. All SNPs significant at this threshold were also significant when considering a FDR of 1%. The LFMM approach detected 5,017 SNPs out of the 96,881 SNPs tested. A total of 72% were associated with the heading date (Figure 5A). The EMMA approach detected 5,576 SNPs and 70% were associated with the heading date (Figure 5B). The CMLM approach detected 860 SNPs and 32% only were associated with the heading date while 46% were found associated with panicle length (Figure 5C). The Q-Q plots and value of p distributions suggested a good fit of all three models (Supplementary Figures 3, 4). Only the 315 SNPs detected by all three GWAS methods were further considered. Among those 315 SNPs, 24% were found on chromosome 2 and 16% on chromosomes 1 and 5. Most SNPs detected by all three methods were linked to length (67%) and diameter (13%) of the panicle. A total of 18 SNPs (6%) were found to be associated with the heading date and were found in 12 genes with some of them being annotated as part of the RAB GTPase family and two as cyclin-dependent protein serine/threonine kinase. Allele effect of these 18 SNPs on heading date was estimated (Supplementary Table 5). A median effect of +11.4 days for LFMM, +4.7 days for EMMA and + 1.9 days for CLMM was found. Despite these differences in values, rank of SNPs was conserved across methods. Two SNPs stand out with an estimated effect of +23 days with CMLM. Those SNPs are found in the gene Pgl_GLEAN_10008450 on chromosome 2. This gene is encoding as a calcium dependent protein kinase (CDPK). Looking at the genotypes distribution of these SNPs, we observed that the LF accessions are found at the heterozygous state only while the EF are fixed for the alternate allele (Supplementary Figure 6). For diagnostic in breeding, lines with those SNPs (chr2_21407739, chr2_21407791 and chr2_7716171) can be used as donors/sources with, respectively, G/C, G/A, and TC favorable alleles for EF and LF group.
Figure 5. Manhattan plots of Fischer combined p-values for association with heading date. (A) with the EMMA approach, (B) with the LFMM approach, and (C) with the CMLM approach. Black and grey colors indicate chromosome. The red dashed line represents the 10−8 value considered for significance. Manhattan plots for the remaining traits are in the Supplementary Figure 5.
Discussion
In Senegal, late-flowering landraces compared to early-flowering landraces are preferred for fodder as well as for roof and fence construction because of longer and more robust stems. This implied a genetic structuring and different selection pressure on these two types. Data reported here supported a clear genetic differentiation between EF and LF landraces. Meanwhile, a large fraction of genetic diversity (95%) is still segregating inside each flowering group. Genetic clustering was consistent with the flowering structure; other phenotypic findings were also significant. Short cycle landraces had longer and heavier panicles compared to long cycle landraces, which had longer stems and shorter panicles. This finding was consistent with those of previous studies (Marchais, 1982; Dancette, 1983; Zongo et al., 1988; Akanvou et al., 2012; Diack et al., 2020) and was in accordance with the farmer’s uses.
To identify key genes related to flowering time and relevant agronomic traits, two complementary approaches were used: (1) genome-wide selection scanning and (2) genome-wide association studies (GWAS). Conservative thresholds (PCAdapt: FDR of 0.001, BayeScan: prior odds of the model with selection to 10,000) were applied for both genome-scanning methods to pick up the strongest selection signature and identify key variants. GWAS models were used while taking the genetic structure into account. As in our case the genetic structure very closely matched the classification into EF and LF landraces, GWAS was expected to be more effective for detecting SNPs segregating within the flowering groups. For instance, SNPs detected by the genomic approaches were fixed in both groups with allelic frequencies >0.99 (Table 2), and thus could not be detected by the GWAS approaches, notably by the conservative CMLM method which seems to overcorrect the population structure. GWAS analysis was thus able to highlight other phenotypic traits and notably to detect a number of polymorphisms and genes associated with yield (Supplementary Tables 5, 6). Noteworthy, our combination of approaches appears to have been effective and relevant since we were able to detect genes that had been identified in previous studies with different samples and spatial scales [five genes identified by Diack et al. (2020) and Rhoné et al. (2020)], suggesting false discoveries were reduced and that our results could be mainstreamed to other pearl millet landraces.
In relation to flowering time, we highlighted fives relevant genes all found on chromosome 2: PhyC (PHYTOCRHOME C), FRS12 (FAR1-RELATED SEQUENCE 12), HEADING DATE 16 (Hd16)/EARLY FLOWERING 1 (EF1), HEADING DATE 3 (Hd3a) and OsHAC1/OsHAC703 genes. The PhyC gene plays an important role in flowering time induction via photoperiodic cues in several cereal species (Izawa et al., 2002; rice: Takano et al., 2005; Ishikawa et al., 2011; wheat: Chen et al., 2014). PhyC has already been found to be a major flowering time gene in pearl millet (Saïdou et al., 2009, 2014; Vigouroux et al., 2011; Diack et al., 2017). In our study, SNP chr2_11155563 with is 4,745 bp away from the polymorphisms found in those previous studies, was detected by genomic scan approaches and in association with stem length by LFMM and EMMA methods. This gene was also reported to be associated with spike length and stem diameter in an inbred line panel (Saïdou et al., 2009), and with other traits including stay green, panicle diameter, panicle harvest index and panicle length under both well-watered and drought stress conditions (Sehgal et al., 2015; Debieu et al., 2018). Sequencing the entire PhyC gene from West African inbred lines would be interesting to investigate haplotypic diversity and the effect of the different variants of flowering time.
This is the first time that the role of FRS12 in pearl millet flowering is highlighted. Loss-of-function of FRS12 in A. thaliana results in early-flowering plants with overly elongated hypocotyls (Ritter et al., 2017). In pearl millet, the FRS12 gene is located in a 530 kb region encompassing 12 genes detected in our study. This genomic region may represent an important quantitative trait locus (QTL) for flowering time. Although FRS12 is a good candidate, we cannot overlook the possibility that other genes in this specific region could be directly causative of the phenotype. The gene Pgl_GLEAN_10033790 is annotated as HEADING DATE 16 (Hd16)/EARLY FLOWERING 1 (EF1). The role of Hd16 on flowering has been studied in rice (Dai and Xue, 2010) and linked to adaptation to high latitude (Hori et al., 2013). Interestingly, plants with non-functional Hd16 have delayed flowering and heavy straw and panicle weight in paddy fields with low nitrogen input and no fertilizer application (Tanaka et al., 2019). [AA] pearl millet genotypes for the SNP chr2_4843975 showed a + 1.5 days delay in flowering and longer ([AA] 279 cm vs [CC] 240 cm) and thicker ([AA] 1.8 cm vs [CC] 1.6 cm) stems. Another strong candidate is the gene Pgl_GLEAN_10020525, which is annotated as OsHAC1/OsHAC703 (histone acetyl transferase). This gene was also detected as associated to the heading date in pearl millet by Rhoné et al. (2020). By aligning on the reference genome the SSR markers used in the QTL analysis from (Kumar et al., 2017), we were able to find that the HAC1 gene is part of QTL1:166 identified in that study as strongly involved in flowering. In Arabidopsis HAC1 mutants flower late due to increased FLOWERING LOCUS C (FLC) expression (Deng et al., 2007). In rice, OsHAC1 may be involved in the abscisic acid signaling pathway for responses to environmental stress during rice seedling growth, as well as to salt and drought stress (Liu et al., 2012; Fang et al., 2014). Kumar et al. (2017) study identified two additional QTLs on chromosome 3 linked to flowering, the QTL3/62 and the QTL3.100. The QTL3/62 encompasses the gene Pgl_GLEAN_10027181 annotated as CyclinT1;3 which was found to be associated with heading date in our study. The mean CMLM effect of the alternate allele of SNP chr3_295789863 is +1.53 days. The Arabidopsis thaliana homolog AT1G27630 may be involved in pathway linking circadian clock and cell cycles. The QTL3/100 encompasses the gene Pgl_GLEAN_10032681 that was detected in our PCAdapt analysis. This gene is annotated as an ethylene-responsive factor like protein (ERFL1) from the APETALA2 family.
The GWAS analysis detected SNPs linked to yield through association with panicle and grain characteristics. The annotation of homologous genes in Arabidopsis thaliana revealed a number of potentially interesting genes which magnitude of effect seems to be also a function of the genetic background, i.e., early-vs. late-flowering landraces. Fourteen of these genes are annotated as being related to responses to abiotic stimuli (salt stress, water deprivation, osmotic stress) or related to the immune response. A total of 13 genes are related to reproduction. One of them is Pgl_GLEAN_10001878 and it is annotated as LORELEI in Arabidospis thaliana (Tsukamoto et al., 2010). When looking at its effect, it seems that the presence of the alternate allele on SNP chr3_18486380, increases 1,000 seed weight (Supplementary Figure 6). A second gene was found to be associated with 1,000 seeds weight, Pgl_GLEAN_10007024. This gene annotated as a PGR5-like protein and it may be involved in photosynthesis (DalCorso et al., 2008) in Arabidopsis. A clear decrease of 1,000 seeds weight is noted when individuals carry the alternative allele on SNP chr7_55923381 (Supplementary Figure 6). We have detected four genes whose annotation was linked to root development. Three of them were detected by the GWAS approach through their association with panicle size. The gene Pgl_GLEAN_10017314 is annotated as a global transcriptor factor in Arabidopsis thaliana. The alternate allele of SNP chr4_48223622 increases panicle diameter at the homozygous state (Supplementary Figure 6). Another of these genes is Pgl_GLEAN_10003976. This gene is homolog to the ERULUS gene in Arabidopsis thaliana. ERULUS gene has been described as a core root hair regulator, involved in the establishment of a functional apical [Ca2+] gradient (Bai et al., 2014; Kwon et al., 2018). The establishment of this gradient seems to impact the pollen tube growth and fertilization (Schoenaers et al., 2017).
Conclusion
Two complementary genome-wide approaches, i.e., selection scanning and association analysis (GWAS) were used in this study with the main objective to identify key genes linked to flowering and agromorphologic traits in pearl millet, a major staple cereal in sub-Saharan Africa. To date, PhyC was the only clearly identified flowering-related gene. We were able to identify several other genes that appear to be as much as important, such as FSR12 and HAC1. HAC1 and two other genes appear to be part of QTLs identified in previous studies and deserve further investigation. At the same time, we were able to identify a large number of genes and variants that could contribute to the improvement of pearl millet yield, especially since their impact was demonstrated across flowering cycles.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.
Author contributions
AB, CB-S, ADF, NK, and YV designed the study. AB, CB, CB-S, PC, ADF, and BR performed the analyses. AD, MC, CM, and LZ generated genomic datasets. AB, CB-S, AC, ADF, ALF, KO, DM, MT, and YV managed the field studies and generated phenotypic datasets. ADF and CB-S wrote the paper with all authors’ contributions. All authors contributed to the article and approved the submitted version.
Funding
This project was supported by Agropolis Fondation under reference ID 1403–057 through the Investissements d’avenir programme (Labex Agro: ANR-10-LABX-0001-01) in the framework of I-SITE MUSE (ANR-16-IDEX-0006), the CERAO project (ANR-13-AGRO-002), while the research leading to these results received funding from the UK National Environment Research Council (NERC)/Department for International Development (DFID) Future Climate For Africa Program, under the AMMA-2050 project (grant numbers NE/M020002/1; NE/M019934/1). YV was also funded under the CRP on Dryland Cereals.
Acknowledgments
We thank David Manley for English revision, Hélène Adam for useful discussions, and farmers for sharing their seed stocks.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Key message
Tuning flowering time plays a key role for adaptation to future climate change. We show that PhyC, FRS12 and HAC1 are key genes in the pearl millet flowering cycle.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.880631/full#supplementary-material
References
Akanvou, L., Akanvou, R., Kouakou, C. K., N’da, H. A., and Koffi, K. G. C. (2012). Evaluation de la diversité agro morphologique des accessions de mil [Pennisetum glaucum (L.) R. Br.] collectées en Côte d’Ivoire. J. Appl. Biosci. 50, 3468–3477.
Bai, L., Ma, X., Zhang, G., Song, S., Zhou, Y., Gao, L., et al. (2014). A receptor-like kinase mediates ammonium homeostasis and is important for the polar growth of root hairs in Arabidopsis. Plant Cell 26, 1497–1511. doi: 10.1105/tpc.114.124586
Battisti, D. S., and Naylor, R. L. (2009). Historical warnings of future food insecurity with unprecedented seasonal heat. Science 323, 240–244. doi: 10.1126/science.1164363
Bezançon, G., Pham, J.-L., Deu, M., Vigouroux, Y., Sagnard, F., Mariac, C., et al. (2009). Changes in the diversity and geographic distribution of cultivated millet (Pennisetum glaucum (L.) R. Br.) and sorghum (Sorghum bicolor (L.) Moench) varieties in Niger between 1976 and 2003. Genet. Resour. Crop Evol. 56, 223–236. doi: 10.1007/s10722-008-9357-3
Caye, K., and Francois, O. (2018). LFMM 2.0: latent factor models for confounder adjustment in genome and epigenome-wide association studies. Biorxiv :255893. doi: 10.1101/255893
Chen, A., Li, C., Hu, W., Lau, M. Y., Lin, H., Rockwell, N. C., et al. (2014). PHYTOCHROME C plays a major role in the acceleration of wheat flowering under long-day photoperiod. Proc. Natl. Acad. Sci. 111, 10037–10044. doi: 10.1073/pnas.1409795111
Cingolani, P., Platts, A., Wang, L. L., Coon, M., Nguyen, T., Wang, L., et al. (2012). A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 6, 80–92. doi: 10.4161/fly.19695
Covarrubias-Pazaran, G. (2016). Genome-assisted prediction of quantitative traits using the R package sommer. PLoS One 11:e0156744. doi: 10.1371/journal.pone.0156744
Covarrubias-Pazaran, G. (2018). Software update: moving the R package sommer to multivariate mixed models for genome-assisted prediction. bioRxiv :354639. doi: 10.1101/354639
Dai, C., and Xue, H. (2010). Rice early flowering1, a CKI, phosphorylates DELLA protein SLR1 to negatively regulate gibberellin signalling. EMBO J. 29, 1916–1927. doi: 10.1038/emboj.2010.75
DalCorso, G., Pesaresi, P., Masiero, S., Aseeva, E., Schünemann, D., Finazzi, G., et al. (2008). A complex containing PGRL1 and PGR5 is involved in the switch between linear and cyclic electron flow in Arabidopsis. Cell 132, 273–285. doi: 10.1016/j.cell.2007.12.028
Dancette, C. (1983). Estimation des besoins en eau des principales cultures pluviales en zone soudano-sahélienne. L’agronomie Trop. 38, 281–294.
De Rouw, A. (2004). Improving yields and reducing risks in pearl millet farming in the African Sahel. Agric. Syst. 81, 73–93. doi: 10.1016/j.agsy.2003.09.002
De Rouw, A., and Winkel, T. (1998). Drought avoidance by asynchronous flowering in pearl millet stands cultivated on-farm and on-station in Niger. Exp. Agric. 34, 19–39. doi: 10.1017/S0014479798001057
de Villemereuil, P., Frichot, É., Bazin, É., François, O., and Gaggiotti, O. E. (2014). Genome scan methods against more complex models: when and how much should we trust them? Mol. Ecol. 23, 2006–2019. doi: 10.1111/mec.12705
Debieu, M., Sine, B., Passot, S., Grondin, A., Akata, E., Gangashetty, P., et al. (2018). Response to early drought stress and identification of QTLs controlling biomass production under drought in pearl millet. PLoS One 13:e0201635. doi: 10.1371/journal.pone.0201635
Deng, W., Liu, C., Pei, Y., Deng, X., Niu, L., and Cao, X. (2007). Involvement of the histone acetyltransferase AtHAC1 in the regulation of flowering time via repression of FLOWERING LOCUS C in Arabidopsis. Plant Physiol. 143, 1660–1668. doi: 10.1104/pp.107.095521
DePristo, M. A., Banks, E., Poplin, R., Garimella, K. V., Maguire, J. R., Hartl, C., et al. (2011). A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498. doi: 10.1038/ng.806
Diack, O., Kane, N. A., Berthouly-Salazar, C., Gueye, M. C., Diop, B. M., Fofana, A., et al. (2017). New genetic insights into pearl millet diversity As revealed by characterization of early-and late-flowering landraces from Senegal. Front. Plant Sci. 8:818. doi: 10.3389/fpls.2017.00818
Diack, O., Kanfany, G., Gueye, M. C., Sy, O., Fofana, A., Tall, H., et al. (2020). GWAS unveils features between early-and late-flowering pearl millets. BMC Genomics 21, 1–11. doi: 10.1186/s12864-020-07198-2
Dray, S., and Dufour, A.-B. (2007). The ade4 package: implementing the duality diagram for ecologists. J. Stat. Softw. 22, 1–20. doi: 10.18637/jss.v022.i04
Dussert, Y., Snirc, A., and Robert, T. (2015). Inference of domestication history and differentiation between early-and late-flowering varieties in pearl millet. Mol. Ecol. 24, 1387–1402. doi: 10.1111/mec.13119
Evenson, R. E., and Gollin, D. (2003). Assessing the impact of the green revolution, 1960 to 2000. J. Sci. 300, 758–762. doi: 10.1126/science.1078710
Fang, H., Liu, X., Thorn, G., Duan, J., and Tian, L. (2014). Expression analysis of histone acetyltransferases in rice under drought stress. Biochem. Biophys. Res. Commun. 443, 400–405. doi: 10.1016/j.bbrc.2013.11.102
Foll, M., and Gaggiotti, O. (2008). A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: A Bayesian perspective. Genetics 180, 977–993. doi: 10.1534/genetics.108.092221
Frichot, E., and François, O. (2015). LEA: An R package for landscape and ecological association studies. Methods Ecol. Evol. 6, 925–929. doi: 10.1111/2041-210X.12382
Frichot, E., Mathieu, F., Trouillon, T., Bouchard, G., and François, O. (2014). Fast and efficient estimation of individual ancestry coefficients. Genetics 196, 973–983. doi: 10.1534/genetics.113.160572
Frichot, E., Schoville, S. D., Bouchard, G., and François, O. (2013). Testing for associations between loci and environmental gradients using latent factor mixed models. Mol. Biol. Evol. 30, 1687–1699. doi: 10.1093/molbev/mst063
Gain, C., and François, O. (2021). LEA 3: Factor models in population genetics and ecological genomics with R. Mol. Ecol. Resour. 21, 2738–2748. doi: 10.1111/1755-0998.13366
Ghatak, A., Chaturvedi, P., Bachmann, G., Valledor, L., Ramšak, Ž., Bazargani, M. M., et al. (2021). Physiological and proteomic signatures reveal mechanisms of superior drought resilience in pearl millet compared to wheat. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.600278
Ghatak, A., Chaturvedi, P., Nagler, M., Roustan, V., Lyon, D., Bachmann, G., et al. (2016). Comprehensive tissue-specific proteome analysis of drought stress responses in Pennisetum glaucum (L.) R. Br. (pearl millet). J. Proteome 143, 122–135. doi: 10.1016/j.jprot.2016.02.032
Haussmann, B. I. G., Boubacar, A., Boureima, S. S., and Vigouroux, Y. (2006). Multiplication and preliminary characterization of west and central African pearl millet landraces. Int. Sorghum Millets Newsl. 47, 110–112.
Haussmann, B. I. G., Boureima, S. S., Kassari, I. A., Moumouni, K. H., and Boubacar, A. (2007). Mechanisms of adaptation to climate variability in west African pearl millet landraces–a preliminary. J. SAT Agric. Res. 3, 1–3.
Haussmann, B. I. G., Fred Rattunde, H., Weltzien-Rattunde, E., Traoré, P. S. C., Vom Brocke, K., and Parzies, H. K. (2012). Breeding strategies for adaptation of pearl millet and sorghum to climate variability and change in West Africa. J. Agron. Crop Sci. 198, 327–339. doi: 10.1111/j.1439-037X.2012.00526.x
Hori, K., Ogiso-Tanaka, E., Matsubara, K., Yamanouchi, U., Ebana, K., and Yano, M. (2013). H d16, a gene for casein kinase I, is involved in the control of rice flowering time by modulating the day-length response. Plant J. 76, 36–46. doi: 10.1111/tpj.12268
Ishikawa, R., Aoki, M., Kurotani, K., Yokoi, S., Shinomura, T., Takano, M., et al. (2011). Phytochrome B regulates heading date 1 (Hd1)-mediated expression of rice florigen Hd3a and critical day length in rice. Mol. Gen. Genomics. 285, 461–470. doi: 10.1007/s00438-011-0621-4
Izawa, T., Oikawa, T., Sugiyama, N., Tanisaka, T., Yano, M., and Shimamoto, K. (2002). Phytochrome mediates the external light signal to repress FT orthologs in photoperiodic flowering of rice. Genes Dev. 16, 2006–2020. doi: 10.1101/gad.999202
Kang, H. M., Zaitlen, N. A., Wade, C. M., Kirby, A., Heckerman, D., Daly, M. J., et al. (2008). Efficient control of population structure in model organism association mapping. Genetics 178, 1709–1723. doi: 10.1534/genetics.107.080101
Knox, J., Hess, T., Daccache, A., and Wheeler, T. (2012). Climate change impacts on crop productivity in Africa and South Asia. Environ. Res. Lett. 7:34032. doi: 10.1088/1748-9326/7/3/034032
Kumar, S., Hash, C. T., Nepolean, T., Satyavathi, C. T., Singh, G., Mahendrakar, M. D., et al. (2017). Mapping QTLs controlling flowering time and important agronomic traits in pearl millet. Front. Plant Sci. 8:1731. doi: 10.3389/fpls.2017.01731
Kwon, T., Sparks, J. A., Liao, F., and Blancaflor, E. B. (2018). ERULUS is a plasma membrane-localized receptor-like kinase that specifies root hair growth by maintaining tip-focused cytoplasmic calcium oscillations. Plant Cell 30, 1173–1177. doi: 10.1105/tpc.18.00316
Lakis, G., Ousmane, A. M., Sanoussi, D., Habibou, A., Badamassi, M., Lamy, F., et al. (2011). Evolutionary dynamics of cycle length in pearl millet: the role of farmer’s practices and gene flow. Genetica 139, 1367–1380. doi: 10.1007/s10709-012-9633-1
Lakis, G., Navascués, M., Rekima, S., Simon, M., Remigereau, M.-S., Leveugle, M., et al. (2012). Evolution of neutral and flowering genes along pearl millet (Pennisetum glaucum) domestication. PLoS One 7:e36642. doi: 10.1371/journal.pone.0036642
Lalou, R., Sultan, B., Muller, B., and Ndonky, A. (2019). Does climate opportunity facilitate smallholder farmers’ adaptive capacity in the Sahel? Palgrave Commun. 5, 1–11. doi: 10.1057/s41599-019-0288-8
Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with burrows–Wheeler transform. Bioinformatics 25, 1754–1760. doi: 10.1093/bioinformatics/btp324
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079. doi: 10.1093/bioinformatics/btp352
Lipka, A. E., Tian, F., Wang, Q., Peiffer, J., Li, M., Bradbury, P. J., et al. (2012). GAPIT: genome association and prediction integrated tool. Bioinformatics 28, 2397–2399. doi: 10.1093/bioinformatics/bts444
Liu, X., Luo, M., Zhang, W., Zhao, J., Zhang, J., Wu, K., et al. (2012). Histone acetyltransferases in rice (Oryza sativa L.): phylogenetic analysis, subcellular localization and expression. BMC Plant Biol. 12:145. doi: 10.1186/1471-2229-12-145
Lotterhos, K. E., and Whitlock, M. C. (2015). The relative power of genome scans to detect local adaptation depends on sampling design and statistical method. Mol. Ecol. 24, 1031–1046. doi: 10.1111/mec.13100
Luu, K., Bazin, E., and Blum, M. G. B. (2017). Pcadapt: An R package to perform genome scans for selection based on principal component analysis. Mol. Ecol. Resour. 17, 67–77. doi: 10.1111/1755-0998.12592
Marchais, L. (1982). La diversité phénotypique des mils pénicillaires cultivés au Sénégal et au Mali. Agron Trop Paris. Available at: http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=PASCALAGROLINEINRA82X0271941
Mariac, C., Scarcelli, N., Pouzadou, J., Barnaud, A., Billot, C., Faye, A., et al. (2014). Cost-effective enrichment hybridization capture of chloroplast genomes at deep multiplexing levels for population genetics and phylogeography studies. Mol. Ecol. Resour. 14, 1103–1113. doi: 10.1111/1755-0998.12258
Martin, M. (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 17, 10–12. doi: 10.14806/ej.17.1.200
Matlon, P. J. (1990). Improving productivity in sorghum and pearl millet in semi-arid Africa. Food Res. Inst. Stud. 22, 1–43.
Niangado, O. (2001). “The state of millet diversity and its use in West Africa”, in Broadening Genet. base Crop Prod. eds. H. D. Cooper, C. Spillane, and T. Hodgin. (Rome, Italy: IPGRI/FAO), 147–157.
Ouendeba, B., Ejeta, G., Hanna, W. W., and Kumar, A. K. (1995). Diversity among African pearl millet landrace populations. Crop. Sci. 35, 919–924. doi: 10.2135/cropsci1995.0011183X003500030048x
Pucher, A., Sy, O., Angarawai, I. I., Gondah, J., Zangre, R., Ouedraogo, M., et al. (2015). Agro-morphological characterization of west and central African pearl millet accessions. Crop Sci. 55, 737–748. doi: 10.2135/cropsci2014.06.0450
Rhoné, B., Yves, V., Cecile, B.-S., and Concetta, B. (2020). Pearl millet genomic vulnerability to climate change in West Africa highlights the need of regional collaboration. Nat. Commun. 11, 1–9. doi: 10.1038/s41467-020-19066-4
Ritter, A., Iñigo, S., Fernández-Calvo, P., Heyndrickx, K. S., Dhondt, S., Shi, H., et al. (2017). The transcriptional repressor complex FRS7-FRS12 regulates flowering time and growth in Arabidopsis. Nat. Commun. 8, 1–14. doi: 10.1038/ncomms15235
Roudier, P., Sultan, B., Quirion, P., and Berg, A. (2011). The impact of future climate change on west African crop yields: what does the recent literature say? Glob. Environ. Chang. 21, 1073–1083. doi: 10.1016/j.gloenvcha.2011.04.007
Russo, S., Marchese, A. F., Sillmann, J., and Immé, G. (2016). When will unusual heat waves become normal in a warming Africa? Environ. Res. Lett. 11:54016. doi: 10.1088/1748-9326/11/5/054016
Saïdou, A.-A., Clotault, J., Couderc, M., Mariac, C., Devos, K. M., Thuillet, A.-C., et al. (2014). Association mapping, patterns of linkage disequilibrium and selection in the vicinity of the PHYTOCHROME C gene in pearl millet. Theor. Appl. Genet. 127, 19–32. doi: 10.1007/s00122-013-2197-3
Saïdou, A.-A., Mariac, C., Luong, V., Pham, J.-L., Bezançon, G., and Vigouroux, Y. (2009). Association studies identify natural variation at PHYC linked to flowering time and morphological variation in pearl millet. Genetics 182, 899–910. doi: 10.1534/genetics.109.102756
Schoenaers, S., Balcerowicz, D., Costa, A., and Vissenberg, K. (2017). The kinase ERULUS controls pollen tube targeting and growth in Arabidopsis thaliana. Front. Plant Sci. 8:1942. doi: 10.3389/fpls.2017.01942
Sehgal, D., Skot, L., Singh, R., Srivastava, R. K., Das, S. P., Taunk, J., et al. (2015). Exploring potential of pearl millet germplasm association panel for association mapping of drought tolerance traits. PLoS One 10:e0122165. doi: 10.1371/journal.pone.0122165
Sultan, B., Defrance, D., and Iizumi, T. (2019). Evidence of crop production losses in West Africa due to historical global warming in two crop models. Sci. Rep. 9, 12834–12815. doi: 10.1038/s41598-019-49167-0
Sy, O., Fofana, A., Cisse, N., Noba, K., Diouf, D., Ndoye, I., et al. (2015). Étude de la variabilité agromorphologique de la collection nationale de mils locaux du Sénégal. J. Appl. Biosci. 87, 8030–8046.
Swindale, L. D. (1982). A time for rainfed agriculture: Eleventh coromandel lecture. Fertil. News 27, 48–59.
Takano, M., Inagaki, N., Xie, X., Yuzurihara, N., Hihara, F., Ishizuka, T., et al. (2005). Distinct and cooperative functions of phytochromes A, 0B, and C in the control of deetiolation and flowering in rice. Plant Cell 17, 3311–3325. doi: 10.1105/tpc.105.035899
Tanaka, N., Yoshida, S., Takagi, H., Terauchi, R., Shimizu, A., and Fujiwara, T. (2019). Evidence for rice heading date 16 contribution to yield increase under low-nutrient conditions. Soil Sci. Plant Nutr. 65, 589–597. doi: 10.1080/00380768.2019.1647081
Team R Development, C. (2008). R: A Language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0.
Tsukamoto, T., Qin, Y., Huang, Y., Dunatunga, D., and Palanivelu, R. (2010). A role for LORELEI, a putative glycosylphosphatidylinositol-anchored protein, in Arabidopsis thaliana double fertilization and early seed development. Plant J. 62, 571–588. doi: 10.1111/j.1365-313X.2010.04177.x
Upadhyaya, H. D., Reddy, K. N., Ahmed, M. I., Kumar, V., Gumma, M. K., and Ramachandran, S. (2017). Geographical distribution of traits and diversity in the world collection of pearl millet [Pennisetum glaucum (L.) R. Br., synonym: Cenchrus americanus (L.) Morrone] landraces conserved at the ICRISAT genebank. Genet. Resour. Crop. Evol. 64, 1365–1381. doi: 10.1007/s10722-016-0442-8
Van Oosterom, E. J., Weltzien, E., Yadav, O. P., and Bidinger, F. R. (2006). Grain yield components of pearl millet under optimum conditions can be used to identify germplasm with adaptation to arid zones. F. Crop. Res. 96, 407–421. doi: 10.1016/j.fcr.2005.08.008
Varshney, R. K., Shi, C., Thudi, M., Mariac, C., Wallace, J., Qi, P., et al. (2017). Pearl millet genome sequence provides a resource to improve agronomic traits in arid environments. Nat. Biotechnol. 35, 969–976. doi: 10.1038/nbt.3943
Vigouroux, Y., Mariac, C., De Mita, S., Pham, J.-L., Gérard, B., Kapran, I., et al. (2011). Selection for earlier flowering crop associated with climatic variations in the Sahel. PLoS One 6:e19563. doi: 10.1371/journal.pone.0019563
Yadav, O. P. (2008). Performance of landraces, exotic elite populations and their crosses in pearl millet (Pennisetum glaucum) in drought and non-drought conditions. Plant Breed. 127, 208–210. doi: 10.1111/j.1439-0523.2007.01467.x
Zheng, X., Levine, D., Shen, J., Gogarten, S. M., Laurie, C., and Weir, B. S. (2012). A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326–3328. doi: 10.1093/bioinformatics/bts606
Zhou, X., and Stephens, M. (2014). Efficient multivariate linear mixed model algorithms for genome-wide association studies. Nat. Methods 11, 407–409. doi: 10.1038/nmeth.2848
Zongo, J. D., Sédogo, M. C., Sérémé, P., and Zangré, G. R. (1988). “Synthèse des prospections du mil (Pennisetum glaucum (L.) R. Br.) au Burkina Faso. In proceedings of the regional pearl millet improvement workshop,” in ICRISAT-Institute for Agricultural Research, Ahmadu Bello University (IAR). eds. L. K. Fussel and J. Werder (Nigeria: Zaria), 15–19.
Keywords: adaptation, flowering, GWAS, genomic scan, yield, PhyC, far-red light FRS12, HAC1
Citation: Faye A, Barnaud A, Kane NA, Cubry P, Mariac C, Burgarella C, Rhoné B, Faye A, Olodo KF, Cisse A, Couderc M, Dequincey A, Zekraouï L, Moussa D, Tidjani M, Vigouroux Y and Berthouly-Salazar C (2022) Genomic footprints of selection in early-and late-flowering pearl millet landraces. Front. Plant Sci. 13:880631. doi: 10.3389/fpls.2022.880631
Edited by:
Mahalingam Govindaraj, HarvestPlus – Alliance Bioversity-CIAT, ColombiaReviewed by:
Arindam Ghatak, University of Vienna, AustriaJulia Sibiya, University of KwaZulu-Natal, South Africa
Copyright © 2022 Faye, Barnaud, Kane, Cubry, Mariac, Burgarella, Rhoné, Faye, Olodo, Cisse, Couderc, Dequincey, Zekraouï, Moussa, Tidjani, Vigouroux and Berthouly-Salazar. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Adama Faye, adamafae@gmail.com; Cécile Berthouly-Salazar, cecile.berthouly@ird.fr