- 1Key Laboratory of Biological Resource Protection and Utilization of Tarim Basin, Xinjiang Production and Construction Group, Alar, China
- 2College of Life Sciences, Tarim University, Alar, China
- 3Desert Poplar Research Center of Tarim University, Alar, China
- 4Hubei Provincial Key Laboratory for Protection and Application of Special Plant Germplasm in Wuling Area of China, College of Life Sciences, South-Central University for Nationalities, Wuhan, China
Populus sect. Turanga (hereafter referred to as “Populus”), including Populus euphratica and Populus pruinosa, are the predominant tree species in desert riparian forests in northwestern China. These trees play key roles in maintaining ecosystem balance, curbing desertification, and protecting biodiversity. However, the distribution area of Populus forests has been severely diminished and degraded in recent years due to increased habitat destruction and human activity. Understanding the genetic diversity among Populus individuals and populations is essential for designing conservation strategies, but comprehensive studies of their genetic diversity in northwest China are lacking. Here, we assessed the population structures and genetic diversity of 1,620 samples from 85 natural populations of Populus (59 P. euphratica and 26 P. pruinosa populations) covering all of northwestern China using 120 single nucleotide polymorphism (SNP) markers. Analysis of population structure revealed significant differentiation between these two sister species and indicated that strong geographical distribution patterns, a geographical barrier, and environmental heterogeneity shaped the extant genetic patterns of Populus. Both P. euphratica and P. pruinosa populations in southern Xinjiang had higher genetic diversity than populations in other clades, perhaps contributing to local geographic structure and strong gene flow. Analysis of molecular variance (AMOVA) identified 15% variance among and 85% variance within subpopulations. Mantel tests suggested that the genetic variation among P. euphratica and P. pruinosa populations could be explained by both geographical and environmental distance. The genetic diversity of P. euphratica showed a significant negative correlation with latitude and longitude and a positive correlation with various environmental factors, such as precipitation of warmest quarter and driest month, temperature seasonality, and annual mean temperature. These findings provide insights into how the genetic differentiation of endangered Populus species was driven by geographical and environmental factors, which should be helpful for designing strategies to protect these genetic resources in the future.
Introduction
Populus euphratica Oliv. and Populus pruinosa Schrenk are sister species in the section Turanga (Populus sect. Turanga, hereafter referred to as “Populus”). Both species are predominant trees in the extremely arid desert areas in northwestern China and play important roles in maintaining ecosystem balance and protecting biodiversity (Lang et al., 2015; Zheng et al., 2016). P. euphratica and P. pruinosa diverged during the Pleistocene era due to climate oscillations and were further isolated by the barrier imposed by the Tianshan Mountains (Wang et al., 2014). Along with the hitchhiking of incidental variations, ancient genetic polymorphisms are thought to have driven the speciation of P. euphratica and P. pruinosa (Ma et al., 2018). Whereas, P. pruinosa is mainly restricted to Xinjiang province in China and the adjacent countries and regions (Wang et al., 1995), the natural distribution areas of P. euphratica range from western China and the Middle East to North Africa and southwestern Europe (Ma et al., 2018). Approximately 61% of P. euphratica forests worldwide are located in China, with 91.1% of those in Xinjiang province. Alarmingly, the distribution area of P. euphratica forests has been seriously diminished or degraded in recent years due to increased habitat destruction and human activity. The protection of Populus forests in northwestern China is an extremely urgent issue. One effective strategy for protection of a plant species is to identify priority protection areas or conservation units (Zhang et al., 2020). Assessing the genetic variations in Populus populations would be beneficial for the conservation and utilization of the genetic resources of these species and could provide useful basic data on Populus germplasm to facilitate breeding.
Studies of genetic diversity and population structure are crucial for exploring natural selection, adaptive evolution, and the genetic relationships of populations within or among P. euphratica and P. pruinosa. However, although several molecular population genetics studies have been performed on P. euphratica and P. pruinosa, these studies led to opposite conclusions about their genetic diversity (Wang et al., 2011, 2014; Ma et al., 2018; Zeng et al., 2018). For example, a preliminary investigation using a relatively small number of nuclear and chloroplast DNA markers to investigate the genetic diversity of P. euphratica populations (552 P. euphratica individuals of 33 natural populations) indicated that this was lower in southern Xinjiang (SX) than in northern Xinjiang (NX) (Zeng et al., 2018). However, whole-genome resequencing of 252 P. euphratica individuals from 27 natural populations showed the reverse (Jia et al., 2020). This inconsistency might be attributed to the relatively limited numbers of markers, individuals, or populations studied in narrow geographical areas (Wu et al., 2008; Eusemann et al., 2013; Xu et al., 2013; Wang et al., 2014), or the relatively less advanced methods utilized (e.g., AFLP, RAPD, and SRAP) (Saito et al., 2002; Vonlanthen and Bruelheide, 2010; Wang et al., 2011; Kansu and Kaya, 2020).
The development of molecular markers at the whole-genome level is becoming increasingly accurate and efficient (Verma et al., 2015). Genome-wide single nucleotide polymorphism (SNP) markers are widely used to assess the genotypes and genetic relationships of various populations (Lu et al., 2020; Zhang et al., 2020). Genotyping-in-Thousands by sequencing (GT-seq) has proven to be an efficient SNP genotyping technology (Campbell et al., 2015) and has been widely applied in many fields of research (Bootsma et al., 2020; McKinney et al., 2020; Powell and Campbell, 2020; Schmidt et al., 2020). To better understand the genetic diversity and population differentiation of P. euphratica and P. pruinosa to facilitate further conservation and breeding efforts, an analysis of their population genetics using larger-scale samples and more advanced molecular markers is needed.
The natural ranges of P. euphratica and P. pruinosa include different niches that show discontinuous geographical distribution. The barrier represented by the Tianshan Mountains, a major mountain system and biodiversity hotspot, has impeded gene flow between SX and NX in Populus populations (Zeng et al., 2018). The geographical differences caused by environmental heterogeneity and climatic fluctuations could drive plant-specific or intraspecific genetic divergence and distribution patterns (Muellner-Riehl, 2019). Geographical distance can also affect the gene flow from one population to another, while environmental factors can influence genetic diversity and help fix a population in a specific ecological niche (Liu et al., 2013). However, there is no report on how the genetic distance and differentiation of P. euphratica and P. pruinosa were influenced by geographical and environmental factors. Elucidating the relationship between genetic diversity and geographical or environmental heterogeneity will help explain the discontinuous distribution pattern of Populus in northwestern China.
In this study, we collected the leaves of 1,620 accessions from 85 natural populations throughout the natural distribution range of P. euphratica (1,183 individuals from 59 natural populations) and P. pruinosa (437 individuals from 26 natural populations) in northwest China. Based on our de novo assembled reference genome and published resequencing data, we developed 120 high-quality SNPs with MAF ≥ 0.4 for further genetic analysis. Our objectives were to (1) determine the optimal population structures of P. euphratica and P. pruinosa; (2) evaluate genetic diversity and genetic differentiation between and within the subpopulations; and (3) assess the relationships between population-level genetic diversity and geographical as well as environmental factors. Our findings provide new insights into the genetic differentiation of P. euphratica and P. pruinosa and lay a genetic foundation for the construction of priority conservation areas of Populus.
Materials and Methods
Sample Collection and DNA Extraction
Leaves were sampled from 1,183 P. euphratica individuals from 59 natural populations in northwest China, including 27 populations from southern Xinjiang (SX), 23 from northern Xinjiang (NX), and 9 total from Gansu (GS), Qinghai (QH), Ningxia (NiX), and Inner Mongolia (NMG) provinces (Supplementary Table 1). Leaves were sampled from 437 P. pruinosa individuals from 26 natural populations in Xinjiang province (23 from southern Xinjiang, three from northern Xinjiang), 22 of which co-occurred with P. euphratica. To reduce the impact of clonal ramets, all sampled individuals from each population were located at least 100 m apart. The leaves were dried with silica gel in the field. Total DNA was extracted from the leaves and purified using a modified CTAB method (Allen et al., 2006). The integrity of the DNA was evaluated by 0.8% agarose gel electrophoresis, and DNA quality was evaluated based on the A260/A280 absorbance ratio of the samples.
SNP Marker Development
Marker development and sequencing were carried out using the GT-seq method (Campbell et al., 2015). In brief, 120 SNP markers were selected and developed for the P. euphratica samples based on our reference genome and resequencing data from P. euphratica (Jia et al., 2020). These SNPs, which are uniformly distributed on the 19 P. euphratica chromosomes, were retained according to the following criteria (Supplementary Figure 1): (i) SNP interval >50 kb; (ii) minor allele frequency (MAF) ≥0.4; (iii) heterozygosity ≤25%; (iv) missing rates ≤0.25. Primer 3 software (version 2.5.0) was used to design amplification primers for each target site according to the following criteria: (i) the primers ranged from 17 to 32 bases in length; (ii) melting temperature (Tm) values ranged from 60°C to 64°C, with an optimal value of 62°C; (iii) product size was ≤500 bp; (iv) sequenced reads had to be able to cover target sites. Primers for each target site that could specifically amplify the target were obtained (Supplementary Table 2). Libraries were constructed and sequenced (PE150) on the Illumina HiSeq 2500 platform.
Sequencing Data Analysis
Reliable clean reads were obtained via quality control and filtering of raw reads. Quality control was performed based on strict criteria to remove the following types of reads: (i) adaptor sequences of reads were removed using Cutadapt software (version 1.13); (ii) low-quality bases were removed using Trimmomatic software (Version 0.36); (iii) reads ≤50 bp long were excluded. The reliable clean reads were aligned to the P. euphratica reference genome using the BWA-MEM algorithm (version 0.7.15-r1140). After alignment, GATK (version 3.7) (DePristo et al., 2011) and SAMtools (version 1.3.1) (Li and Durbin, 2009) were used for SNP calling. Genotypes at SNP loci were analyzed using an in-house Python script.
Analysis of Population Structure
Population structure analysis was implemented using STRUCTURE v2.3.4 (Earl and vonHoldt, 2011). The K-value was used to evaluate the number of clusters based on the genotyping data; the number of genetic clades was predefined using 10,000 iterations for each run. The optimal K-value was determined using Structure Harvester (Earl and vonHoldt, 2011). Principal component analysis (PCA) was carried out using GCTA (Yang et al., 2011). A neighbor-joining (NJ) tree was constructed and visualized using MEGA 7 (Tamura et al., 2013).
Analysis of Molecular Variance and Genetic Diversity
Analysis of Molecular Variance (AMOVA) was performed and Nei's genetic distance was calculated using GenAlEx 6.503 based on the number of subpopulations (Peakall and Smouse, 2012). Pairwise population differentiation (FST) was calculated. Multiple indicators of genetic diversity from each population, such as Shannon's information index (I), observed heterozygosity (Ho), expected heterozygosity (He), and polymorphism information content (PIC), were obtained using GenAlEx v6.503.
Analysis of Genetic Isolation
To study the relationships of isolation-by-environment vs. isolation-by-distance based on the genetic differentiation of P. euphratica and P. pruinosa, 19 bioclimatic variables with a resolution of 2.5 arc-min from the WorldClim Database (http://www.worldclim.org) were obtained (Fick and Hijmans, 2017). Their correlation coefficients were then calculated, and the seven least correlated bioclimatic variables (Spearman's r < 0.8) were retained (Supplementary Tables 3, 4). The correlation coefficients between pairwise genetic differentiation and the bioclimatic distance matrix, as well as the geographical distance matrix, were calculated. Mantel tests were performed with 1,000 permutations based on Genepop (https://genepop.curtin.edu.au/) (Rousset, 2008).
To examine the relationships between genetic diversity at the population level and geographical factors, as well as environmental factors according to a previous study (Zhang et al., 2020), the genetic diversity of each population was represented by expected heterozygosity (He). Latitude and longitude were considered to be the main geographical factors affecting the P. euphratica and P. pruinosa populations. All seven retained bioclimatic variables were environmental factors. PCA was used to analyze the climate variables of the sample sites, and the absolute values from standardized PC1 scores were obtained based on correlation analyses using R version 3.5.0.
Results
Population Structure and Genetic Relationships
To comprehensively investigate the genetic diversity of P. euphratica (1,183 individuals from 59 natural populations) and P. pruinosa (437 individuals from 26 natural populations), we sampled 1,620 individuals from 85 natural Populus populations covering the distribution regions throughout northwestern China (Figure 1). The number and distribution area of P. euphratica are larger in SX than in NX, and P. pruinosa is primarily distributed in SX. The habitat regions of P. euphratica showed discontinuous distribution on both banks of inland rivers in the arid regions (e.g., the Tarim River, Yarkant River, and Hotan River of southern Xinjiang and the Ili River, Manas River, and Eerqisi River of northern Xinjiang). In addition, for P. euphratica, compared to previous studies, we collected samples from the easternmost population (SZWQ) of China for the first time and added samples from Ningxia Province (ZW). For P. pruinosa, compared to previous studies, samples from three populations of P. pruinosa (CX, KKDL, and NLKX) were collected for the first time in northern Xinjiang (Figure 1; Supplementary Table 1). In summary, 1,620 samples with high coverage from northwestern China were collected to investigate the genetic diversity and population structures of P. euphratica and P. pruinosa.
Figure 1. Geographical distributions of the natural populations of P. euphratica and P. pruinosa analyzed in this study. Different font colors represent the two species: black for P. euphratica populations, light blue for P. pruinosa populations, and red for overlapping populations. The map was created using the ArcMap package in ArcGIS ver. 10.8 (http://www.esri.com/software/arcgis). The populations are described in Supplementary Table 1.
We then carried out population structure analysis based on our genotypic data (Figure 2). When K = 2, all individuals were clearly subdivided into two species-specific clusters, highlighting the significant differentiation of P. euphratica and P. pruinosa at the species level. Notably, a few P. euphratica individuals were mixed with the P. pruinosa lineage (e.g., MGTX, YPHX, BCX, ALE, YTX), implying the presence of recent gene flow in the two species. When K = 3, the P. euphratica populations were further divided into southern Xinjiang, northern Xinjiang, and mixed clusters (including GS, NMG, NiX, and QH populations). When K = 4, the P. euphratica populations were divided into four distinct clades that demonstrated strong geographical distribution patterns (NX, SX, GNM, and QH, with the GNM clade containing individuals from Gansu, Ningxia and Inner Mongolia).
PCA and neighbor-joining phylogenetic analysis confirmed our findings about population structure. PC1 mainly explained the variation between P. pruinosa and P. euphratica, and intermediate individuals (SX1) were primarily from the mixed forests of P. pruinosa and P. euphratica in southern Xinjiang (e.g., MGTX, ALE, and LPX) (Figure 3; Supplementary Table 5). The presence of these intermediate individuals provides further evidence of gene flow between the two species. PC2 strongly explained the differentiation among the four clades (SX, NX, QH, and GNM) of P. euphratica. The QH cluster demonstrated distinct genetic variation compared to the other clades, which might be related to the high altitude (2,709–2,936 m) of this region. The phylogenetic tree, which was constructed to represent the genetic distances between these populations, revealed distinctly different evolutionary relationships among the populations (Figure 4). All three complementary methods (STRUCTURE, PCA, and NJ tree analysis) indicated that distinct genetic differences existed with different geographical distribution patterns at the population level and that the two sister species (P. euphratica and P. pruinosa) showed significant differentiation at the species level regardless of geographical distance.
Figure 3. PCA based on genetic distance showing four clustered subpopulations within P. euphratica and P. pruinosa.
Figure 4. Neighbor-joining phylogenetic tree based on a genetic distance matrix representing the grouping of 1,183 P. euphratica and 437 P. pruinosa individuals.
Genetic Diversity of P. euphratica and P. pruinosa Populations
Based on the population structures of P. euphratica and P. pruinosa (Figure 2), we calculated the four genetic indices (He, Ho, I, and PIC) for each clade and population. The genetic diversity values revealed higher diversity for P. pruinosa (He = 0.256) than for the four clades of P. euphratica. For P. euphratica, the SX clade exhibited the highest genetic diversity (He = 0.216), followed by the GNM clade (He = 0.212) and the NX clade (He = 0.206), while the QH clade had the lowest genetic diversity (He = 0.112) (Table 1).
In addition, we calculated the genetic diversity of each population within P. euphratica and P. pruinosa (Supplementary Tables 6, 7). Among P. euphratica populations, the MGTX population (from the KS region of southern Xinjiang) had the highest genetic diversity (He = 0.29) and the TLX population (from the TC region of northern Xinjiang) had the lowest genetic diversity (He = 0.11). On the whole, the populations of southern Xinjiang had relatively high genetic diversity (e.g., MGTX, ALE, YPHX, LPX, PSX, and YTX), followed by one population from Inner Mongolia (EJN) and those from Ningxia (ZW) and Gansu (GZX and JTX), while the populations of northern Xinjiang (e.g., TLX, MLX, FYX, HBHX, and ALSK) and QH (GEM), and the second population from Inner Mongolia (SZWQ), had relatively low genetic diversity (Supplementary Table 6). Among P. pruinosa populations, the YTX population (HT region of southern Xinjiang) had the highest genetic diversity (He = 0.28), while the YLX population (BaZ region of southern Xinjiang) had the lowest (He = 0.12). The three P. pruinosa populations (CX, KKDL, and NLKX) found only in northern Xinjiang had relatively low genetic diversity (Supplementary Table 7). Overall, the populations of Populus in southern Xinjiang had higher genetic diversity than those of other clades. Of these, the MGTX, YTX, ALE, BCX, and KPX populations had relatively high genetic diversity, whereas the KCX, KKDL, AKTX, NLKX, and YLX populations had lower genetic diversity.
Genetic Differentiation of the Populations
The genetic divergence (FST from 0.118 to 0.160) between P. euphratica and P. pruinosa was high (Table 2). Of the four clades of P. euphratica, we detected high genetic differentiation between QH and GNM (FST = 0.064), moderate differentiation between NX and SX (FST = 0.044) and between NX and QH (FST = 0.044), and lower genetic differentiation between SX and GNM (FST = 0.010).
We further analyzed the genetic differentiation within P. euphratica and P. pruinosa via pairwise comparisons of different populations. For P. euphratica, the genetic differentiation coefficient within populations in southern Xinjiang was low except for MGTX, ALE, and CLX, indicative of strong gene flow among populations. Detailed PCA showed that most of the MGTX (9/11) and ALE (8/11) samples were in SX1 (Supplementary Table 5), suggesting that the higher genetic diversity of MGTX and ALE might have been caused by hybridization of P. euphratica and P. pruinosa. The genetic differentiation among P. euphratica populations in southern Xinjiang and populations of other clades (NX, GNM, and QH) was higher, especially for MLX, FYX, TLX, MQX, SZWQ, and GEM (Supplementary Table 8). However, higher levels of genetic differentiation occurred within the northern Xinjiang population between NX and other populations (GNM and QH). Similar results were obtained regarding the genetic differentiation between the GNM and QH populations. For P. pruinosa, the genetic differentiation within populations in southern Xinjiang was high except for MGTX and BCX, and the genetic differentiation among populations in SX and NX was high. Notably, QMX exhibited high genetic differentiation from all other populations of P. pruinosa (FST from 0.180 to 0.397), which is associated with the isolated geographic location of this region (Supplementary Table 9). Overall, the populations of Populus had relatively high genetic differentiation between SX and NX. AMOVA revealed higher genetic variation within populations (85%, a total of 36% among individuals and 49% within individuals) than among populations (15%) of P. euphratica, suggesting a high level of differentiation (Table 3). Nei's genetic distance analysis revealed moderate levels of population differentiation (FST = 0.147) accompanied by a high rate of gene flow between P. euphratica populations (Nm = 1.447).
Table 3. Analysis of molecular variance of the genetic variation among and within P. euphratica populations.
Potential Distributions and Genetic Isolation
To determine whether the genetic differentiation in Populus is significantly associated with isolation-by-distance and/or isolation-by-environment, we conducted Mantel tests between pairwise genetic differentiation [FST/(1 – FST)] and a geographical distance matrix, as well as a Euclidean bioclimatic distance matrix with the seven bioclimatic variables. Geographical distance (Figure 5A) and environmental distance (Figure 5B) had significant effects on genetic distance among P. euphratica populations. Positive correlations of genetic distance with geographical distance (Figure 5C) and environmental distance (Figure 5D) were also observed for the P. pruinosa populations. This finding implies that the genetic differentiation of Populus increased as the geographical and environmental distances between populations increased, leading to low gene flow. Also, the different correlations for P. euphratica and P. pruinosa suggest that environmental distance played a more important role in the genetic differentiation of the P. euphratica populations (R2 = 0.11) than the P. pruinosa populations (R2 = 0.05), while geographical distance contributed equally to the genetic differentiation of both the P. euphratica and P. pruinosa populations.
Figure 5. Tests of isolation-by-distance and isolation-by-environment among 58 P. euphratica populations (A,B) and 24 P. pruinosa populations (C,D), respectively. Correlation coefficients were analyzed with data matrixes of genetic distance × geographical distance and genetic distance × environmental distance from pairwise populations.
Finally, to test the effects of environmental and geographical factors on the genetic diversity (He) of P. euphratica and P. pruinosa at the population level, we performed correlation analysis between the genetic diversity of Populus and latitudinal, longitudinal, and environmental factors. He showed significant negative correlations with longitude and especially latitude for the P. euphratica populations (Figures 6A,B), suggesting that the genetic diversity of this species markedly decreased with increasing latitude and longitude. The He of the P. pruinosa populations was negatively correlated with latitude but not longitude (Figures 6D,E). Thus, the negative influence of latitude on the genetic diversity of Populus was more important than that of longitude. The genetic diversity values of both the P. euphratica and P. pruinosa populations showed significant positive correlations with environmental factors (Figures 6C,F), suggesting that such factors play important roles in the genetic diversity of both species.
Figure 6. Effects of geographical and environmental factors on genetic diversity in Populus populations. (A–C) P. euphratica. (D–F) P. pruinosa.
Discussion
Geographical isolation and environmental heterogeneity are considered to be the crucial drivers of allopatric variance (Evans et al., 2014; Jiang et al., 2018; Binks et al., 2019). A geographic barrier not only facilitates the local adaptation of populations but also can lead to the evolution of populations with unique genetic characteristic (Barnes et al., 2002; Zhang et al., 2020). Our results reveal distinct differentiation between P. euphratica populations of the NX and SX clades, suggesting that the Tianshan Mountains may limit gene flow between the two clades and that the differences in local environments and the long period of isolation facilitated their allopatric divergence. It is also thought that the Tianshan Mountains impede seed dispersal and pollen flow in these two regions (Zeng et al., 2018; Jia et al., 2020). Accumulating evidence suggests that climatic fluctuations during the Pleistocene era led to extreme drought and desert expansion in southern Xinjiang (Meng et al., 2015), resulting in habitat fragmentation of desert plant populations. This fragmentation likely affected the geographical distributions of different species, further promoting population divergence and affecting intraspecific genetic diversity (Larmuseau et al., 2009; Ye et al., 2019). Here, Mantel tests showed that the greater the geographical distance, the higher the genetic differentiation and the weaker the gene flow between populations (Figure 5). Moreover, in the current study, we collected CX, KKDL, and NLKX populations of P. pruinosa in northern Xinjiang for the first time, and we found that the above analyses were also applicable to P. pruinosa.
Genetic diversity refers to the variation of genetic characteristics in a population, which allows the population to adapt to environmental changes. The genetic diversities of each clade were ranked from high to low as follows: SX > GNM > NX > QH. Our results are similar to previous findings based on whole-genome resequencing (Jia et al., 2020) but are not quite consistent with findings based on analysis using 17 nuclear microsatellite loci from Populus trichocarpa (Zeng et al., 2018). Perhaps the relatively small number and the P. trichocarpa origin of these molecular markers limit their applicability for evaluating genetic diversity in P. euphratica and P. pruinosa.
Accumulating evidence suggests that gene flow is a creative force in evolution that functions via pollen spread or seed dispersal. Intraspecific genetic admixture can produce new allelic combinations, leading to novel genotypes and phenotypes (Hendry et al., 2002; Olson-Manning et al., 2012; Rius and Darling, 2014). Interestingly, the P. euphratica MGTX population showed variable leaf morphology intermediate between that of P. euphratica and P. pruinosa (Supplementary Figure 2), along with the highest genetic diversity among the P. euphratica populations, which could be attributed to gene flow and gene introgression between species (Supplementary Table 5). In addition, P. euphratica and P. pruinosa are dioecious, with males flowering earlier than females in each species, although the flowering periods basically overlap. However, P. euphratica flowers slightly earlier than P. pruinosa, suggesting that asymmetrical gene flow from male P. pruinosa to female P. euphratica might increase the probability of genetic variation (Wang et al., 1995, 2011). This observation likely explains why the co-occurring population of P. euphratica and P. pruinosa had higher genetic diversity than the other populations of each species.
Current assumptions suggest that palaeogeographical changes (Wang et al., 2013) or heterogeneous environments (Forester et al., 2016) might also affect the spatial genetic patterns of plants. We found here that both geographical distance and environmental heterogeneity significantly influenced the genetic variation of P. euphratica and P. pruinosa, suggesting that the genetic distance within the isolates increased with geographical distance. Changes in longitude and latitude represent the effects of geographical distance on genetic variation, which further strengthens our hypothesis that the presence of a geographical barrier facilitated the genetic differentiation of these populations. Local environmental differences and a long period of isolation facilitate allopatric divergence. For example, the environmental factors of NMG, NiX, and GS were intermediate between those of SX and NX (Jia et al., 2020), which might have contributed to the formation of the GNM clade. Meanwhile, analysis of the effects of geographical and environmental factors on the genetic diversity of P. euphratica and P. pruinosa showed that genetic diversity decreased with increasing latitude and longitude. In particular, the genetic diversity of Populus is more closely related to latitude, which might be related to differences in photoperiod or temperature in different latitudes. This notion requires further study.
A recent study of environmental factors influencing the geographical distribution of P. euphratica showed that this is affected by precipitation of the driest month and the warmest quarter, the soil moisture content (10–40 cm underground), soil moisture around the root system, and the evapotranspiration of soil water (Guo et al., 2020). Precipitation of the driest month and the warmest quarter is an important source of soil groundwater recharge during the growing season of P. euphratica. Soil water evaporation limits the effective recharge of soil groundwater, and soil water content and root soil moisture are important indices of effective soil groundwater recharge. Notably, precipitation of the driest month and the warmest month was opportunely encountered flooding season in southern Xinjiang (Zheng et al., 2016; Gai et al., 2020). In general, the peak of yearly seed release (at the end of July) coincides with the annual flooding period in that region (Eusemann et al., 2013). These favorable conditions may be conducive to the reproduction of P. euphratica and P. pruinosa, perhaps explaining why southern Xinjiang owns the largest P. euphratica and P. pruinosa forests in northwest China.
The high genetic diversity of the SX populations may result from its distinct local geographic structure and strong gene flow (Shen et al., 2014). Southern Xinjiang represents the widest distribution area of P. euphratica globally. The high genetic diversity of P. euphratica populations in this region reflects the large effective population size and large amount of endemic gene resources. To better protect these genetic Populus resources, more effort should focus on the intensity of flood diversion and irrigation and the protection of P. euphratica forests in southern Xinjiang, regardless of their ecological significance or importance to genetic conservation. More attention should also be paid to the replacement and renewal of P. euphratica forests in northern Xinjiang. Finally, the dramatic decreases in P. pruinosa populations underline the need to preserve this species, which is important not only in itself but as a precious genetic resource to enrich the gene pool of P. euphratica. Our genetic data here indicate that a comprehensive strategy is needed to maximize the protection of desert poplar.
Conclusion
In this study, we explored population structure and genetic diversity among 1,620 samples covering the full distribution ranges of P. euphratica and P. pruinosa populations in northwest China. Our study found that the natural populations of P. euphratica in northwest China show strong geographical distribution patterns and that P. euphratica and P. pruinosa populations in southern Xinjiang have higher genetic diversity than the populations of other clades. Mantel tests suggested that both geographical and environmental distance significantly influence genetic variation in P. euphratica and P. pruinosa: the genetic diversity of P. euphratica markedly decreases with increasing latitude and longitude and is positively correlated with environmental factors. Finally, the higher genetic diversity in southern Xinjiang may contribute to the local geographic structure and strong gene flow in these populations.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The reference genome of Populus euphratica is available at our Populus genome database (http://118.24.202.236:20003/). The names of the SNP repository and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/, PRJNA738601.
Author Contributions
ZL and ZW conceived and designed the study and provided suggestions and comments about the manuscript. ZG collected and analyzed the data and wrote the manuscript. JZ, XC, PJ, SZ, and JS collected the samples. RQ and HL provided constructive comments. ZW revised the manuscript. All authors read and approved the manuscript.
Funding
This work was financially supported by the National Natural Sciences Foundation of China (31860198 and 30660018), and Innovative Team Building Plan for Key Areas of Xinjiang Production and Construction Corps (2018CB003), Xinjiang Production and Construction Corps Key Laboratory of Protection and Utilization of Biological Resources in Tarim Basin (BRZD2003).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We are grateful to Dr. Hong-Xiang Zhang from the Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, for his assistance with data analysis. For the sample collection, we are grateful to the colleagues from the Forestry and Grassland Bureaus and the Management and Protection Stations of Populus euphratica forests in the five provinces (Xinjiang, Gansu, Ningxia, Qinghai, and Inner Mongolia), as well as Xinjiang EbinurHu National Nature Reserve Administration and Xinjiang Ganjiahu Haloxylon Forest National Nature Reserve Administration. We are grateful for technical support from Wuhan Genoseq Science and Technology Co., Ltd.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2021.705083/full#supplementary-material
Supplementary Table 1. Overview of information about the samples used in this study.
Supplementary Table 2. The primers used for each target SNP site.
Supplementary Table 3. Spearman's rank correlation coefficients (r) for the seven remaining bioclimatic variables in P. euphratica.
Supplementary Table 4. Spearman's rank correlation coefficients (r) for the seven remaining bioclimatic variables in P. pruinosa.
Supplementary Table 5. Information about the SX1 samples analyzed by PCA.
Supplementary Table 6. Genetic diversity analysis of P. euphratica samples from 59 natural populations.
Supplementary Table 7. Genetic diversity analysis of P. pruinosa samples from 26 natural populations.
Supplementary Table 8. Genetic differentiation coefficients among P. euphratica populations.
Supplementary Table 9. Genetic differentiation coefficients among P. pruinosa populations.
Supplementary Figure 1. Distribution of selected SNPs on the 19 P. euphratica chromosomes.
Supplementary Figure 2. Phenotypic variation in the leaves of P. euphratica and P. pruinosa natural populations in Maigaiti County, southern Xinjiang.
References
Allen, G. C., Flores-Vergara, M. A., Krasynanski, S., Kumar, S., and Thompson, W. F. (2006). A modified protocol for rapid DNA isolation from plant tissues using cetyltrimethylammonium bromide. Nat. Protoc. 1, 2320–2325. doi: 10.1038/nprot.2006.384
Barnes, I., Matheus, P., Shapiro, B., Jensen, D., and Cooper, A. (2002). Dynamics of Pleistocene population extinctions in Beringian brown bears. Science 295, 2267–2270. doi: 10.1126/science.1067814
Binks, R. M., Gibson, N., Ottewell, K. M., Macdonald, B., and Byrne, M. (2019). Predicting contemporary range-wide genomic variation using climatic, phylogeographic and morphological knowledge in an ancient, unglaciated landscape. J. Biogeogr. 46, 503–514. doi: 10.1111/jbi.13522
Bootsma, M. L., Gruenthal, K. M., McKinney, G. J., Simmons, L., Miller, L., Sass, G. G., et al. (2020). A GT-seq panel for walleye (Sander vitreus) provides important insights for efficient development and implementation of amplicon panels in non-model organisms. Mol. Ecol. Resour. 20, 1706–1722. doi: 10.1111/1755-0998.13226
Campbell, N. R., Harmon, S. A., and Narum, S. R. (2015). Genotyping-in-Thousands by sequencing (GT-seq): a cost effective SNP genotyping method based on custom amplicon sequencing. Mol. Ecol. Resour. 15, 855–867. doi: 10.1111/1755-0998.12357
DePristo, M. A., Banks, E., Poplin, R., Garimella, K. V., Maguire, J. R., Hartl, C., et al. (2011). A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498. doi: 10.1038/ng.806
Earl, D. A., and vonHoldt, B. M. (2011). STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361. doi: 10.1007/s12686-011-9548-7
Eusemann, P., Petzold, A., Thevs, N., and Schnittler, M. (2013). Growth patterns and genetic structure of Populus euphratica Oliv. (Salicaceae) forests in NW China – Implications for conservation and management. For. Ecol. Manag. 297, 27–36. doi: 10.1016/j.foreco.2013.02.009
Evans, L. M., Slavov, G. T., Rodgers-Melnick, E., Martin, J., Ranjan, P., Muchero, W., et al. (2014). Population genomics of Populus trichocarpa identifies signatures of selection and adaptive trait associations. Nat. Genet. 46, 1089–1096. doi: 10.1038/ng.3075
Fick, S. E., and Hijmans, R. J. (2017). WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 37, 4302–4315. doi: 10.1002/joc.5086
Forester, B. R., Jones, M. R., Joost, S., Landguth, E. L., and Lasky, J. R. (2016). Detecting spatial genetic signatures of local adaptation in heterogeneous landscapes. Mol. Ecol. 25, 104–120. doi: 10.1111/mec.13476
Gai, Z., Li, X., Zhai, J., Chen, X., and Li, Z. (2020). Effects of river bank soil physical and chemical factors on populus pruinosa schrenk clonal growth. Appl. Ecol. Environ. Res. 18, 4791–4806. doi: 10.15666/aeer/1803_47914806
Guo, F., Xu, G., Lu, M., Meng, Y., Yuan, C., and Guo, K. (2020). Prediction of potential suitable distribution areas for populus euphratica using the maxent model. Sci. Silvae Sin. 56, 184–192. doi: 10.11707/j.1001-7488.20200521
Hendry, A. P., Taylor, E. B., and McPhail, J. D. (2002). Adaptive divergence and the balance between selection and gene flow_lake and stream stickleback in the Misty system. Evolution 56, 1199–1216. doi: 10.1111/j.0014-3820.2002.tb01432.x
Jia, H., Liu, G., Li, J., Zhang, J., Sun, P., Zhao, S., et al. (2020). Genome resequencing reveals demographic history and genetic architecture of seed salinity tolerance in Populus euphratica. J. Exp. Bot. 71, 4308–4320. doi: 10.1093/jxb/eraa172
Jiang, X. L., An, M., Zheng, S. S., Deng, M., and Su, Z. H. (2018). Geographical isolation and environmental heterogeneity contribute to the spatial genetic patterns of Quercus kerrii (Fagaceae). Heredity 120, 219–233. doi: 10.1038/s41437-017-0012-7
Kansu, Ç., and Kaya, Z. (2020). Genetic diversity of marginal populations of Populus euphratica Oliv. from highly fragmented river ecosystems. Silvae Genet. 69, 139–151. doi: 10.2478/sg-2020-0019
Lang, P., Jeschke, M., Wommelsdorf, T., Backes, T., Lv, C., Zhang, X., et al. (2015). Wood harvest by pollarding exerts long-term effects on Populus euphratica stands in riparian forests at the Tarim River, NW China. For. Ecol. Manage. 353, 87–96. doi: 10.1016/j.foreco.2015.05.011
Larmuseau, M. H. D., Van Houdt, J. K. J., Guelinckx, J., Hellemans, B., and Volckaert, F. A. M. (2009). Distributional and demographic consequences of Pleistocene climate fluctuations for a marine demersal fish in the north-eastern Atlantic. J. Biogeogr. 36, 1138–1151. doi: 10.1111/j.1365-2699.2008.02072.x
Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760. doi: 10.1093/bioinformatics/btp324
Liu, Y., Webber, S., Bowgen, K., Schmaltz, L., Bradley, K., Halvarsson, P., et al. (2013). Environmental factors influence both abundance and genetic diversity in a widespread bird species. Ecol. Evol. 3, 4683–4695. doi: 10.1002/ece3.856
Lu, R. S., Chen, Y., Tamaki, I., Sakaguchi, S., Ding, Y. Q., Takahashi, D., et al. (2020). Pre-quaternary diversification and glacial demographic expansions of Cardiocrinum (Liliaceae) in temperate forest biomes of Sino-Japanese Floristic Region. Mol. Phylogenet. Evol. 143:106693. doi: 10.1016/j.ympev.2019.106693
Ma, T., Wang, K., Hu, Q., Xi, Z., Wan, D., Wang, Q., et al. (2018). Ancient polymorphisms and divergence hitchhiking contribute to genomic islands of divergence within a poplar species complex. Proc. Natl. Acad. Sci. U.S.A. 115, E236–E243. doi: 10.1073/pnas.1713288114
McKinney, G. J., Seeb, J. E., Pascal, C. E., Schindler, D. E., Gilk-Baumer, S. E., and Seeb, L. W. (2020). Y-chromosome haplotypes are associated with variation in size and age at maturity in male Chinook salmon. Evol. Appl. 13, 2791–2806. doi: 10.1111/eva.13084
Meng, H.-H., Gao, X.-Y., Huang, J.-F., and Zhang, M.-L. (2015). Plant phylogeography in arid Northwest China: retrospectives and perspectives. J. Syst. Evol. 53, 33–46. doi: 10.1111/jse.12088
Muellner-Riehl, A. N. (2019). Mountains as evolutionary arenas: patterns, emerging approaches, paradigm shifts, and their implications for plant phylogeographic research in the Tibeto-Himalayan region. Front. Plant Sci. 10:195. doi: 10.3389/fpls.2019.00195
Olson-Manning, C. F., Wagner, M. R., and Mitchell-Olds, T. (2012). Adaptive evolution: evaluating empirical support for theoretical predictions. Nat. Rev. Genet. 13, 867–877. doi: 10.1038/nrg3322
Peakall, R., and Smouse, P. E. (2012). GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research–an update. Bioinformatics 28, 2537–2539. doi: 10.1093/bioinformatics/bts460
Powell, J. H., and Campbell, M. R. (2020). Contemporary genetic structure affects genetic stock identification of steelhead trout in the Snake River basin. Ecol. Evol. 10, 10520–10531. doi: 10.1002/ece3.6708
Rius, M., and Darling, J. A. (2014). How important is intraspecific genetic admixture to the success of colonising populations? Trends Ecol. Evol. 29, 233–242. doi: 10.1016/j.tree.2014.02.003
Rousset, F. (2008). genepop'007: a complete re-implementation of the genepop software for Windows and Linux. Mol. Ecol. Resour. 8, 103–106. doi: 10.1111/j.1471-8286.2007.01931.x
Saito Y. Shiraishi S. Tanimoto T. Yin L. and S. W. Ide Y. (2002). Genetic diversity of Populus euphratica populations in northwestern China determined by RAPD DNA analysis. New For. 23, 97–103. doi: 10.1023/A:1015605928414
Schmidt, D. A., Campbell, N. R., Govindarajulu, P., Larsen, K. W., and Russello, M. A. (2020). Genotyping-in-Thousands by sequencing (GT-seq) panel development and application to minimally invasive DNA samples to support studies in molecular ecology. Mol. Ecol. Resour. 20, 114–124. doi: 10.1111/1755-0998.13090
Shen, D., Bo, W., Xu, F., and Wu, R. (2014). Genetic diversity and population structure of the Tibetan poplar (Populus szechuanica var. tibetica) along an altitude gradient. BMC Genet. 15, 1471–2156. doi: 10.1186/1471-2156-15-S1-S11
Tamura, K., Stecher, G., Peterson, D., Filipski, A., and Kumar, S. (2013). MEGA6: molecular evolutionary genetics analysis version 6.0. Mol. Biol. Evol. 30, 2725–2729. doi: 10.1093/molbev/mst197
Verma, S., Gupta, S., Bandhiwal, N., Kumar, T., Bharadwaj, C., and Bhatia, S. (2015). High-density linkage map construction and mapping of seed trait QTLs in chickpea (Cicer arietinum L.) using Genotyping-by-Sequencing (GBS). Sci. Rep. 5:17512. doi: 10.1038/srep17512
Vonlanthen, Z. X., and Bruelheide, H. (2010). Clonal structure and genetic diversity of three desert phreatophytes. Am. J. Bot. 97, 234–242. doi: 10.3732/ajb.0800329
Wang, J., Kallman, T., Liu, J., Guo, Q., Wu, Y., Lin, K., et al. (2014). Speciation of two desert poplar species triggered by Pleistocene climatic oscillations. Heredity (Edinb) 112, 156–164. doi: 10.1038/hdy.2013.87
Wang, J., Wu, Y., Ren, G., Guo, Q., Liu, J., and Lascoux, M. (2011). Genetic differentiation and delimitation between ecologically diverged Populus euphratica and P. pruinosa. PLoS ONE 6:e26530. doi: 10.1371/journal.pone.0026530
Wang, Q., Abbott, R. J., Yu, Q. S., Lin, K., and Liu, J. Q. (2013). Pleistocene climate change and the origin of two desert plant species, Pugionium cornutum and Pugionium dolabratum (Brassicaceae), in northwest China. New Phytol. 199, 277–287. doi: 10.1111/nph.12241
Wang, S. J., Chen, B. H., and Li, H. Q. (1995). Euphrates Poplar Forest. Beijing: China Environmental Science Press.
Wu, Y., Wang, J., and Liu, J. (2008). Development and characterization of microsatellite markers in Populus euphratica (Populaceae). Mol. Ecol. Resour. 8, 1142–1144. doi: 10.1111/j.1755-0998.2008.02202.x
Xu, F., Feng, S., Wu, R., and Du, F. K. (2013). Two highly validated SSR multiplexes (8-plex) for Euphrates' poplar, Populus euphratica (Salicaceae). Mol. Ecol. Resour. 13, 144–153. doi: 10.1111/1755-0998.12030
Yang, J., Lee, S. H., Goddard, M. E., and Visscher, P. M. (2011). GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82. doi: 10.1016/j.ajhg.2010.11.011
Ye, J. W., Zhang, Z. K., Wang, H. F., Bao, L., and Ge, J. P. (2019). Phylogeography of Schisandra chinensis (Magnoliaceae) Reveal Multiple Refugia With Ample Gene Flow in Northeast China. Front. Plant Sci. 10:199. doi: 10.3389/fpls.2019.00199
Zeng, Y. F., Zhang, J. G., Abuduhamiti, B., Wang, W. T., and Jia, Z. Q. (2018). Phylogeographic patterns of the desert poplar in Northwest China shaped by both geology and climatic oscillations. BMC Evol. Biol. 18:75. doi: 10.1186/s12862-018-1194-1
Zhang, H. X., Li, X. S., Wang, J. C., and Zhang, D. Y. (2020). Insights into the aridification history of Central Asian Mountains and international conservation strategy from the endangered wild apple tree. J. Biogeogr. 48, 332–344. doi: 10.1111/jbi.13999
Keywords: Populus euphratica, Populus pruinosa, population structure, genetic diversity, geographical and environmental factors
Citation: Gai Z, Zhai J, Chen X, Jiao P, Zhang S, Sun J, Qin R, Liu H, Wu Z and Li Z (2021) Phylogeography Reveals Geographic and Environmental Factors Driving Genetic Differentiation of Populus sect. Turanga in Northwest China. Front. Plant Sci. 12:705083. doi: 10.3389/fpls.2021.705083
Received: 04 May 2021; Accepted: 14 July 2021;
Published: 11 August 2021.
Edited by:
Niels Thevs, World Agroforestry Centre, KenyaReviewed by:
Itzi Fragoso-Martínez, Instituto de Ecología (INECOL), MexicoJia Hui Xia, Chinese Academy of Forestry, China
Copyright © 2021 Gai, Zhai, Chen, Jiao, Zhang, Sun, Qin, Liu, Wu and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhijun Li, bGl6aGlqdW4wMjAyJiN4MDAwNDA7MTI2LmNvbQ==; Zhihua Wu, emh3dSYjeDAwMDQwO3NjdWVjLmVkdS5jbg==