Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci. , 06 March 2025

Sec. Plant Genetics, Epigenetics and Chromosome Biology

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1500759

Contrasting levels of transcriptome-wide SNP diversity and adaptive molecular variation among conifers

  • 1Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology and Forest Research Centre, Université Laval, Québec, QC, Canada
  • 2Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada
  • 3Département de Médecine Moléculaire, Faculté de Médecine, Université Laval, Québec, QC, Canada
  • 4Gydle Inc., Québec, QC, Canada
  • 5Département de Mathématiques et de Statistiques, Faculté des Sciences et de Génie, Université Laval, Québec, QC, Canada
  • 6Department of Biology, University of Oxford, Oxford, United Kingdom

Adaptive convergence can arise when response to natural selection involves shared molecular or functional mechanisms among multiple taxa. Conifers are archaic species of ancient origin with delayed sexual maturity related to their woody perennial nature. Thus, they represent a relevant plant group to assess if convergence from selection may have become disconnected between molecular and functional levels. In this purpose, transcriptome-wide SNP diversity was assessed in seven partially sympatric and reproductively isolated conifer species (118 individuals from 67 populations) populating the temperate and boreal forests of northeastern North America. SNP diversity was found highly heterogeneous among species, which would relate to variation in species-specific demography and history. Rapidly evolving genes with signatures of positive selection were identified, and their relative abundance among species reflected differences in transcriptome-wide SNP diversity. The analysis of sequence homology also revealed very limited convergence among taxa in spite of sampling same tissues at same age. However, convergence increased gradually at the levels of gene families and biological processes, which were largely related to stress response and regulatory mechanisms in all species. Given their multiple small to large gene families and long time since inception, conifers may have had sufficient gene network flexibility and gene functional redundancy for evolving alternative adaptive genes for similar metabolic responses to environmental selection pressures. Despite a long divergence time of ~350 Mya between conifers and Angiosperms, we also uncovered a set of 17 key genes presumably under positive selection in both lineages.

Introduction

Adaptive genetic variation allows organisms to cope with natural selective pressures and thrive in their environment. This is especially true for long-lived woody plants, such as conifers from mid-northern latitudes, that must contend with delayed sexual maturity to adapt to highly heterogeneous and changing climatic conditions (Depardieu et al., 2021). Therefore, identifying and characterizing adaptive genetic variation within species is crucial to understand the molecular mechanisms underlying their response to environmental pressures. Molecular convergence can arise when such molecular mechanisms are shared by multiple species (Stern, 2013). This process may occur at different hierarchical levels, such as specific nucleotides, protein-coding genes (often referred to as ‘gene reuse’), gene families, or genes belonging to the same biological pathways (Hao et al., 2019; Sackton and Clark, 2019). As a general trend, molecular convergence is expected to increase with hierarchical levels under similar positive selection pressures (Stern, 2013; He et al., 2020; Xu et al., 2020) in spite of divergent adaptive evolution at the gene level.

However, the many determinants of molecular convergence complicate the prediction of patterns of adaptive evolution at both intraspecific and interspecific taxonomic levels. The most influential determinants include ancestry (the probability of convergence decreases along with taxa divergence time), effective population size (taxa of small effective population size are less likely to converge due to increased genetic drift), gene flow/introgression (gene flow usually increases convergence by constraining differentiation among taxa, but can also prevent or delay local adaptation), selection landscape (convergence is expected to decrease when the number of selective pressures increases in a given habitat), and many-to-one mapping (convergence is expected to decrease as the number of traits governing a given functional output increases) (reviewed by Bolnick et al., 2018).

Many-to-one mapping is a particularly relevant determinant of molecular convergence when studying the adaptive trajectories of lineages and species. Indeed, considering that the link between phenotypic and molecular convergence is well established in a variety of taxa (see Martin and Orgogozo, 2013 for a catalog of genetic hotspots of phenotypic variation in animals, plants, and yeast), it is reasonable to assume that molecular convergence reflects shared adaptive response to similar selective pressures. However, the opposite is not necessarily true. The fact that adaptative traits are usually highly polygenic (e.g. Le Corre and Kremer, 2012; Csilléry et al., 2018; Barghi et al., 2020; Depardieu et al., 2021) suggests that plant taxa have typically many genetic solutions available to solve the adaptive challenges they face in nature (Arendt and Reznick, 2008; Losos, 2011; Tenaillon et al., 2012; Storz, 2016), including in closely-related populations from the same species (e.g. Manceau et al., 2010; Elmer and Meyer, 2011). Most plant groups such as conifers are also characterized by large gene families and redundancy of gene function (Guillet-Claude et al., 2004; Bedon et al., 2010; Pavy et al., 2012a; Stival Sena et al., 2018; Van Ghelder et al., 2019). Therefore, lineages and species may follow similar adaptive trajectories, while showing reduced levels convergence at the molecular level. Hence, assessing the extent of functional convergence of adaptive genes in multiple lineages and species can complement the picture derived from molecular convergence alone, and reveal otherwise hidden adaptive patterns.

To address these fundamental questions related to adaptive convergence, conifers from northeastern North America represent an ideal framework for several reasons. First, contrary to European forests for instance, these forests have been generally characterized by low levels of anthropic disturbance up to the twentieth century (i.e. reduced urbanization and forest management) and the regional landscape is of relative topographic homogeneity, thus facilitating gene flow, compared to western North America for instance. The most significant barriers to gene flow in eastern North America include the Great Lakes and the Appalachian Mountains, which are thought to be responsible for the genetic divergence of most historical lineages still observable nowadays in eastern North American conifers (Jaramillo-Correa et al., 2009). The two main glacial refugia in this region would have been located south of the Great Lakes and east of the Appalachian Mountains along the Atlantic coast (Jaramillo-Correa et al., 2009). With the limited potential of confounding factors from long-term human activity and the lack of significant barriers constraining migration during the Holocene, tree species could track their most suitable current habitats and evolve local adaptations in response to environmental selective pressures, as evidenced by several empirical studies (e.g. Namroud et al., 2008; Prunier et al., 2011; Hornoy et al., 2015; Nadeau et al., 2016). Second, none of the conifers in the boreal forest of northeastern North America are known to hybridize, although they are sympatric in most of their range: potential hybrid zones are all located at the southern or western edge of the species ranges, and they have been quite well delimited and were therefore easy to avoid by using an adequate sampling strategy (Jaramillo-Correa et al., 2009), which would minimize the risk that molecular signatures of natural selection within species are confounded by interspecific introgression. Also, the extensive range overlap of conifer species across the mid-latitude forests of northeastern North America indicates that these species generally face common environmental pressures, of which harsh and heterogenous climatic conditions are a large component (Hornoy et al., 2015; Depardieu et al., 2021). Thus, these conifers represent relevant models to address questions about long-term evolution and adaptation from a comparative perspective.

However, studying molecular convergence in woody perennial plant taxa with such large and complex genomes is highly challenging. Over the last 20 years, our understanding of conifer genomes has progressed significantly through the sequencing and analyses of their genome structure, evolution and functions (De La Torre et al., 2014; Prunier et al., 2016). Nonetheless, extensive resequencing has been restricted to only a few conifer species belonging primarily to the Picea and Pinus genera. This limits the potential to conduct exhaustive comparative studies across conifers, which are essential to understand the common determinants of adaptive evolution. To date, the main findings indicate a rather limited convergence among adaptive genes identified from species belonging to the same or different genera (Mosca et al., 2012; Yeaman et al., 2016; Bousquet et al., 2021; Gagalova et al., 2022).

In this study, we investigated adaptive molecular convergence at the transcriptome-wide level among six sympatric Pinaceae species native of northeastern North America, namely white spruce (Picea glauca), black spruce (Picea mariana), eastern white pine (Pinus strobus), jack pine (Pinus banksiana), balsam fir (Abies balsamea), and tamarack (Larix laricina), as well as one sympatric Cupressaceae taxon, eastern white cedar (Thuja occidentalis), for a total of 118 individuals representing 67 populations. Considering that the Cupressaceae and the Pinaceae diverged ~315 Mya (Leslie et al., 2018), while taxa divergence within the Pinaceae did not take place before ~185 Mya (divergence of the Abies genus from its sister taxa; Leslie et al., 2018), we included a Cupressaceae taxon to qualitatively assess the effect of phylogenetic relatedness on our inferences. In this study, we first identified gene nucleotide polymorphisms within each species, and assessed their level of overall genetic diversity across much of the transcriptome. We then identified genes with sequence signatures of positive selection in each species in order to estimate the level of adaptive convergence among species from a molecular and functional perspectives. This approach also allowed us to identify shared drivers of adaptive molecular evolution among species. We also investigated the extent of adaptive molecular convergence between Angiosperms and this group of conifers despite their ancient phylogenetic divergence (Savard et al., 1994).

Materials and methods

Biological materials

Seven conifer species were sampled, namely Picea glauca, Picea mariana, Pinus strobus, Pinus banksiana, Abies balsamea, Larix laricina, and Thuja occidentalis. For each species, seeds from ten provenances were obtained from the National Tree Seed Center (Fredericton, New-Brunswick, Canada), paying special attention to avoid provenances located within sympatric or paratric zones in species known to spontaneously hybridize with related taxa (Figure 1; Supplementary Methods S1). For each species, between 15 and 18 diploid embryos representing ten distinct provenances (two seeds per provenance on average) were extracted and flash frozen in separate tubes. All species considered, a total of 118 individuals from 67 populations were sequenced. In addition, four provenances were randomly selected per species and one seed per provenance was dissected to extract the haploid megagametophyte, which was flash frozen in liquid nitrogen in separate tubes. Megagametophytes were barcoded individually, while embryos were pooled at equimolar concentration and barcoded as a single library, prior to the sequencing step (Supplementary Methods S1). All SNPs identified subsequently in the pool of embryos and in at least one megagametophyte were discarded so to filter out paralogous non-mendelian SNPs (see next section 'SNP calling').

Figure 1
www.frontiersin.org

Figure 1. Characteristics of the seven conifer species analyzed in this study. The approximate natural range of each species is represented in green besides its associated tree silhouette, and populations sampled are mapped as red dots. For illustrative purposes only, the figure shows species divergence times derived from fossil-calibrated molecular clocks (Bouillé and Bousquet, 2005; Leslie et al., 2018; Li et al., 2019). Colors at branch tips represent the three groups of overall SNP diversity identified in this study at the transcriptome-wide level (see Results). Source of tree silhouettes: https://tidcf.nrcan.gc.ca/; https://www.phylopic.org/.

Sequencing

Total RNA was extracted using the MasterPure™ Plant RNA Purification kit (Epicenter, Madison, WI, USA). RNAs were sequenced in paired-end mode (2×125 bp) with an Illumina HiSeq 2500 (Methods S2). Raw sequencing data (reads) were deposited in the public database ENA (European Nucleotide Archive, https://www.ebi.ac.uk/ena/browser/home, accessions ERS16017105-ERS16017139 and ERS16049778-ERS16049791) and vcf files containing variants identified in each species were deposited in DRYAD ('https://doi.org/10.5061/dryad.p8cz8w9w1). We assessed the good representativity of the analyzed transcriptomes based on sequence similarity searches (Methods S3).

SNP calling

After sequence quality controls and filtering (Supplementary Methods S4), reads were aligned to the reference transcriptomes of each species previously published (Van Ghelder et al., 2019). SNPs were called using HaplotypeCaller version 3.4, from the GATK tool kit (McKenna et al., 2010; DePristo et al., 2011) and subsequently quality-filtered (Supplementary Methods S5). Since the megagametophyte is a haploid tissue in all seven conifer species investigated herein, SNPs identified within megagametophyte libraries were likely indicative of variations between paralogous gene sequences also occurring in embryos. Thus, these SNPs were considered as false-positives and were subtracted from SNPs identified in pools of embryos before subsequent analyses were carried out. In total, above 1.4 million raw SNPs were identified (Supplementary Methods S5, Supplementary Table S5.1). After removal of paralogous SNPs identified in haploid megagametophytes, around 867K SNPs remained (Supplementary Table S5.1). Among them, ~398K high-quality SNPs (Supplementary Methods S5), were located in coding sequences representing almost ~97K Open Reading Frames in total (~16K transcripts per species on average) (Supplementary Table S1). All species considered and on average, ~82% of the transcripts and ~70% of the ORFs carried SNPs (Supplementary Table S1).

Transcripts that were included in the comparison of molecular genetic diversity across species and the analysis of the functional annotations of polymorphic gene sequences had to exhibit an average coverage of 10 reads or more, at least 300 nucleotides with a depth of 10 or more reads, and contain coding sequences.

SNP abundance in transcripts and comparison across species

The length and read depth of transcripts were significantly heterogenous across the seven conifer species investigated (Supplementary Methods S6). It is essential to control for such effects before analyzing SNP abundance differences across species (Eo and DeWoody, 2012). In this purpose, we applied a regression model assuming that the number of SNPs within transcripts follows a negative binomial distribution (Eo and DeWoody, 2012).

The model corrected efficiently for variations among transcripts depth and length, thus enabling a more rigorous comparison of the SNP abundance across species (Supplementary Methods S7). Then, SNP rate heterogeneity among species was tested using a Kruskall-Wallis test. To group species based on their level of total SNP diversity, Kolmogorov-Smirnov and Cramer-von Mises tests were conducted.

Estimation of gene SNP A/S ratios to identify positively selected genes (PSGs)

Based on the premise that nonsynonymous substitutions are predicted to contribute more to adaptive evolution than synonymous substitutions (Stern and Orgogozo, 2008), one way to study molecular convergence is to compare the ratio of substitution rates at nonsynonymous (Ka) versus synonymous (Ks) sites in orthologous protein-coding sequences betwen species. Similar inferences can be drawn within taxa from gene SNP A/S ratios, since both ratios have been shown to be strongly positively correlated (Liu et al., 2008).

The SNP A/S ratio was then calculated for each gene as the number of SNPs per nonsynonymous site (A) divided by the number of SNPs per synonymous site (S). An adjusted SNP A/S ratio was used to include genes with no synonymous SNPs following the empirical logit principle (Agresti, 2013):

Adj. SNP A/S ratio=(number of nonsynonymous SNPs + 0.5)/(La+1)(number of synonymous SNPs+0.5)/(Ls+1) 

The SNP A/S ratio was calculated over the longest open reading frame predicted for each transcript in each species. In order to identify putative positively selected genes (PSGs), we first retained those with A/S values exceeding 1, the threshold usually considered as evidence for positive selection (Kimura, 1983), and applied a subsequent filtering step to ensure that the probability that a transcript has an A/S ratios exceeding 1 by chance alone was lower than 5%, thus reducing much the number of false positives (see in Supplementary Methods S8). The resulting sets of PSGs were then used to analyze and compare the sequences and functional annotations of these deemed PSGs across the seven conifer species considered in the study.

Sequence annotation and analyses

Predicted protein sequences were clustered into orthogroups with OrthoFinder version 2.3.8 (Emms and Kelly, 2019) run with default settings. Functional annotations of ORFs were derived from sequence similarity searches conducted with blastp version 2.13.0 against Uniprot (E-value <e-15) and PFAM (El-Gebali et al., 2019). Sequences were also assigned to Gene Ontology (GO) classes by using the mapping between the UniprotKB sequences and the GO terms. The heatmaps were generated using the pheatmap R package (Kolde, 2019).

Enrichment tests were conducted with the R (version 4.0.2) package topGO (version 2.42.0; Alexa et al., 2006; https://bioconductor.org/packages/release/bioc/html/topGO.html), in order to identify GO terms enriched among annotations of the genes with high SNP A/S values (Supplementary Methods S9).

Sequence comparisons with positively selected genes in Brassica or poplar

PSGs were identified in Brassica (Guo et al., 2017). Their Arabidopsis orthologs were retrieved (https://www.arabidopsis.org/) for a total of 621 sequences. PSGs were also identified in poplar (Lin et al., 2018). The Populus trichocarpa sequences were retrieved from PopGenIE.org. Sequences of the 2,047 conifer genes under positive selection were then compared at the protein level to poplar and Arabidopsis proteins. Overall, pairs of homologous sequences between these dicots and conifers were identified following a blastp search (E-value<1E-30). When one dicot gene sequence was found homologous to several conifer gene sequences, or when one conifer gene sequence was homologous to several dicot gene sequences, only the best match was selected.

Results

SNP diversity

This study enabled the identification of nearly 1.5 million of SNPs across the transcriptomes of seven conifers (Picea glauca and Picea mariana, Pinus strobus and Pinus banksiana, Larix laricina, Abies balsamea, Thuja occidentalis). However, we retained the ~867K SNPs with highest quality (see Methods S5) to conduct the subsequent analyses. Among them, ~398K high-quality SNPs were located in coding sequences representing almost ~97K Open Reading Frames (equivalent to ~16K transcripts per species, on average) (Table 1). All species considered, ~82% of the transcripts and ~70% of the ORFs carried SNPs. In such transcriptome sequencing endeavor, it appeared important to minimize the effects caused by sequencing depth before undertaking any analysis of SNP diversity. We carefully adjusted the SNP diversity by both sequence length and sequencing depth before proceeding to data analysis and comparison across genes and across species (see Methods). After these adjustments, SNP diversity was estimated for each species and it was found significantly heterogeneous among the seven conifer taxa. A Kruskal-Wallis test revealed significant differences in rates of synonymous, nonsynonymous and total SNPs across the seven species (Supplementary Table S2). Based on overall SNP diversity, three groups were delineated based on the results of Kolmogorov-Smirnov and Cramer-von Mises tests (Figure 1; Table 1; Supplementary Methods S7). The group of species with the highest level of overall SNP diversity included the two Picea species, the group with the lowest diversity included Pinus strobus and Thuja occidentalis, while the three remaining species, Abies balsamea, Larix laricina, and Pinus banksiana, had intermediate overall SNP diversity (Table 1; Supplementary Table S1).

Table 1
www.frontiersin.org

Table 1. Metrics about high-quality SNPs for the seven conifer transcriptome datasets, including adjusted metrics for variations in sequence length and read coverage.

Detection of genes with high SNP A/S ratios and relationship with overall SNP diversity

Synonymous and nonsynonymous sites were identified and rates of synonymous (S) and nonsynonymous SNPs (A) were estimated to calculate the gene SNP A/S ratio (see Methods). A ratio above 1 is indicative of positive or balancing selection related to adaptive evolution (Kimura, 1983; Fay et al., 2001). Ratios above 1 were found in around 19% and 25% of genes depending on the species (Supplementary Table S8.2). Moreover, an excess of nonsynonymous SNPs was significant in 2,047 genes (around 2% of genes within each species) (Supplementary Table S8.2) that were considered as genes under putative positive selection (PSGs, Positively Selected Genes) for subsequent analyses. In PSGs, the mean A/S values were in the range of 2.70-2.89 (Supplementary Table S8.2). The proportion of PSGs was highly correlated with the level of overall SNP diversity detected within each species (R2 = 0.93; p-value < 0.01; Figure 2). It should however be noted that while high A/S ratios are indicative of positive selection, further investigation would be required to validate the role of positive selection.

Figure 2
www.frontiersin.org

Figure 2. Relationship between overall SNP diversity and molecular adaptive variation. In each species, the proportion of genes under positive selection (PSGs) was calculated relatively to the total number of ORFs in the species considered. The number of SNPs was adjusted for sequence length and depth.

Annotations of positively selected genes in conifers

Out of the 2,047 putative conifer PSGs, 73.5% (1,504 genes) had a significant match (blastp E-value <1E-15) with a SwissProt-Uniprot protein, a proportion consistent with other studies in conifers (Hart et al., 2020). Moreover, 932 genes (45.5%) had a match with 434 PFAM families (match E-value <1E-15). These annotated genes had a wide variety of functions (Supplementary Figure S1). Among GO terms, 1,219 Biological Processes (BPs) were assigned to these 2,047 genes. The most represented processes were directly related to signal transduction, responses to biotic stresses, as well as various related processes (Supplementary Figure S2). For instance, six terms describing plant responses to pathogens were found 576 times. Unsurprisingly, signal transduction, which is a common denominator of cell response to a stimulus, was the most predominant term (10.3% of the genes), along with defense response (7.7% of the genes) (Supplementary Figures S1-S2). The nicotinamide adenine dinucleotide (NAD) catabolic process was also highly represented (4.3% of the genes), which is consistent with the central role of NAD in plant defense responses (Pétriacq et al., 2013).

Comparison of sequences, functions and processes of PSGs across conifer species

The complete dataset of gene sequences was successfully clustered into orthogroups, demonstrating both a high clustering capability at the intraspecific level (with few unassigned genes) and at the interspecific level (with few species-specific orthogroups) (Supplementary Table S3). Altogether, 94.7% of the gene sequences were assigned to an orthogroup and the remainder were orphans, as previously observed in conifers (Gagalova et al., 2022). Out of the 16,982 orthogroups delimited in total, 8,647 contained gene sequences from the seven species, and 1,034 others contained gene sequences from all species except the more phylogenetically distant Cupresseae taxon Thuja occidentalis (Figure 3A). The number of species-specific genes in our dataset (hereafter referred to as ‘species-specific genes’ to simplify terminology) was low in Pinaceae taxa (between 3% and 9%) and higher (19%) inThuja occidentalis, which was expected given that this taxon belongs to the more divergent Cupressaceae family (Figure 3A). These species-specific gene sequences could either not be assigned to any orthogroup, or represented species-specific orthogroups in our dataset (Supplementary Table S3). In contrast to the trend observed in the complete dataset, orthogroups derived from PSGs in each species showed a much lower overlap among species (Figure 3B). The vast majority of them were species-specific (total of 699; 68.7%), with only eight orthogroups shared across all seven species (Figure 3B; Table 2). No particular trend between gene convergence and phylogenetic relatedness among species was detected. For instance, shared PSGs were not more predominant within genera than among genera (Supplementary Table S4).

Figure 3
www.frontiersin.org

Figure 3. Overlap among species of orthogroups (A, B), PFAM families (C, D) and Gene Ontology Biological Processes (E, F). (A, B) Orthogroups were identified by clustering the complete dataset of 139k gene sequences of the seven conifer species (Picea glauca, Picea mariana, Pinus banksiana, Abies balsamea, Larix laricina, Pinus strobus, Thuja occidentalis) (A), and by clustering the 2,047 sequences identified as positively selected genes (B). The number reported in each intersection corresponds to the number of positively selected orthogroups shared by species, while the number reported in each species-specific zone corresponds to the number of singleton positively selected sequences and species-specific positively selected orthogroups. (C, D) Protein families were identified based on similarities against the PFAM database across the overall transcript datasets (C) and across positively selected genes (D). (E, F) Biological processes GO terms across the overall sequence dataset (E) and across positively selected genes (F).

Table 2
www.frontiersin.org

Table 2. Percentage of shared or species-specific genes and annotations among the complete dataset (all ORFs) and among the 2,047 positively selected genes.

Similar trends were observed at the gene family level. In the overall dataset, only 102 PFAM accessions (3.2%) were species-specific, indicating that protein families or domains were predominantly shared among the seven conifer species (Figure 3C). In contrast, in PSGs, a majority of PFAM accessions (407 accessions, 68.8%) were species-specific (Figure 3D). Nevertheless, the proportion of shared families among species increased as compared to that for orthogroups (Table 2; Supplementary Figure S3).

Among GO terms, 4,504 BPs and 814 BPs were associated with the overall gene sequence dataset and 814 BPs for the dataset of PSGs (Figure 3E; Supplementary Figure S4). Species-specific BPs were few in the overall sequence dataset (total of 389, 8.6%) but relatively more abundant (333, 40.9%) in PSGs, although at a much lower rate than that observed for orthogroups or PFAM families. In spite of more convergence observed at the level of BPs, these results are indicative of a high level of functional diversity in PSGs (Figure 3F; Table 2).

The eight orthogroups of PSGs shared by all conifer species (Figure 3B) were homologous to sequences of known functions except one. They included three disease resistance genes, a member of pentatricopeptide repeat family, a C-type lectin receptor-like tyrosine-protein kinase, a cytochrome P450 and an ABC transporter family member.

The shared PFAM accessions included the NB-ARC domain (PF00931), the protein kinase domain (PF00069) and the TIR domain (PF01582), which are frequently found in combination in proteins involved in defense responses, as well as the cytochrome P450 family (PF00067), and UDP-glucoronosyl and UDP-glucosyl transferase family (PF00201) (Figure 3D; Supplementary Figure S3). The most abundantly represented and shared families by at least two species are detailed in Supplementary Figure S3.

Regarding the gene ontologies of PSGs, 29 BPs (2.38% of the total number of BPs observed in PSGs) were shared by all conifer species (Figure 3F) and were categorized into eight groups (Supplementary Figure S4). Eleven BPs were related to responses to biotic (like response to fungi, hypersensitive response) or abiotic stresses (like oxidative stress, cold) (Supplementary Figure S4). Four terms were related to protein modification (like protein ubiquitination) and three others were related to growth and development (like pollen development). Main shared BPs also encompassed other mechanism important for plant life and survival such as seed development and germination or response to light (Figure 4 and Supplementary Figure S5).

Figure 4
www.frontiersin.org

Figure 4. Summary of the conserved biological processes across the 2,047 positively selected genes in conifers. The outer circle shows the biological processes found with some degree of conservation across the seven species, the inner circle shows examples of genes in each functional category. Genes in black were found in a single conifer species, those in blue were found in two species, those in red were found in three to six species and green font indicates genes found in all species. Source of illustrations: https://tidcf.nrcan.gc.ca/; https://www.vecteezy.com/; https://www.freepik.com/; https://pixabay.com/.

PSGs were enriched in several GO classes, including 16 BPs, 15 molecular functions and 5 cellular components (Figure 5). Among them, five terms were enriched in the seven conifer species and are involved in defense against pathogens. Half of the enriched BPs were related to stress responses (Figure 4). The enriched molecular functions were involved in several enzymatic activities or in binding (Figure 5). Noticeably, terms involving nucleotidases, ADP or ATP binding were enriched in almost all seven species (Figure 5).

Figure 5
www.frontiersin.org

Figure 5. Gene Ontology terms enriched in the 2,047 positively selected genes in the seven conifer species analyzed, compared with the overall gene set for these species. (PG: Picea glauca, PM: Picea mariana, PB: Pinus banksiana, AB: Abies balsamea, LL: Larix laricina, PS: Pinus strobus, TO: Thuja occidentalis). The heatmap is based on the p-values of the enrichment tests (Fisher tests) and the color scale illustrates statistical significance. White cells represent non-significant tests at a threshold of 0.005.

Abundance of conifer PSGs in defense mechanisms

Several gene families representing PSGs were involved in disease resistance. Among resistance genes, there were 72 genes homologous to run1 and 41 homologous to rpv1 conferring resistance to mildew (Supplementary Table S5). Several gene families were involved in resistance against Pseudomonas syringae (rps2, rps5, rfl1, tao1, rpp3) or against viruses (eds1, tmvrn) (Supplementary Table S5, Figure 4). Among families involving PSGs shared by all species, the C-Lectin (CLEC) and the DRL28 protein are also involved in disease resistance. CLEC proteins have a diverse range of functions including cell-cell adhesion, immune response to pathogens and apoptosis.

Several genes belonged to the chitin pathway involved in fungal wall degradation (Supplementary Table S6). They included several chitinases, an homologue of the LYK5 chitin receptor (Cao et al., 2014), and an homologue of CERK1 (Chitin Elicitor Receptor Kinase 1) required as a cell surface receptor for chitin elicitor signaling leading to innate immunity in response to biotic stresses (Hu et al., 2021).

Also, several PSGs were found among gene families implicated in the secondary metabolite pathways (Figure 4, Supplementary Table S7). They included proteins with roles in wood formation and defense response against insects such as the cytochrome P450 76T24 involved in the monoterpenoid synthetic pathway (Miettinen et al., 2014), the abietadienol/abietadienal oxidase (Ro et al., 2005), and the delta-selinene synthase (Steele et al., 1998).

Conifer PSGs homologous to Brassica or poplar PSGs

Conifer PSGs had 54 homologs also positively selected in both poplar and Brassica. This set of 54 PSGs matched a total of 17 distinct Arabidopsis genes including five transcription factors, three glucosyltransferases, two peroxidases and a range of other gene families (Table 3).

Table 3
www.frontiersin.org

Table 3. Description of 17 Arabidopsis genes whose homologs are positively selected genes in at least one conifer species and in both poplar (Lin et al., 2018) and Brassica (Guo et al., 2017).

In addition, 384 conifer PSGs were homologous to PSGs in either poplar or Brassica. Their annotations were diverse (Figure 6) and related to a variety of BPs (Supplementary Table S8). The most represented BPs in this group were related to stress responses (164 genes) and metabolism (127 genes) (Figure 6). Several genes were homologous to disease resistance genes including several members of the rps family and homologs to summ2 (SUPPRESSOR OF MKK1 MKK2 2) conferring resistance against Pseudomonas syringae, as well as lrks4 and lrks7 (lectin-domain containing receptor kinase involved in resistance response to the pathogenic oomycetes Phytophthora) (Supplementary Table S5).

Figure 6
www.frontiersin.org

Figure 6. Most represented GO terms (found 10 times or more) in annotations of the 384 positively selected conifer genes found homologous (blastp E-value<1E-30) to positively selected genes in poplar (Lin et al., 2018) or Brassica (Guo et al., 2017). (A) biological processes, (B) cellular components, (C) molecular functions.

Discussion

Contrasting levels of molecular genetic diversity among conifers

Several lines of evidence indicated that levels of total molecular genetic diversity differed substantially among the seven conifers (Table 1; Supplementary Methods S7; Supplementary Table S2). Pinus strobus and Thuja occidentalis had the lowest overall SNP diversity in their transcriptome, the two spruces were the most diverse, and the three remaining species were intermediate (Figure 1). Several factors related to the experimental design may result in an underestimation of the intraspecific diversity estimates reported herein. For instance, rare alleles may be missed out due to their low frequencies in sequencing pools. Moreover, diversity estimates rely only on gene expressed in the embryo, which nonetheless represents roughly 60% of the transcriptome (Rigault et al., 2011). However, our comparative analyses should not be biased by these limitations, as they would affect all species equally. Interpreting transcriptome-wide genetic diversity patterns is not straightforward because intraspecific variation results from the complex interplay between mutation rate, effective population size (long-term Ne, which depends itself on historical and demographic factors), and linked selection (the molecular genetic diversity-reducing effect of selective sweeps on neutral loci in linkage with loci under natural selection) (Ellegren and Galtier, 2016).

The seven conifer species analyzed here are not expected to have significantly different mutation rates, as they are all long-lived woody perennials (Petit and Hampe, 2006; Sung et al., 2012) and considering that our genetic diversity estimates derive from transcriptome-wide SNP data rather than a limited number of genes. However, in relation to the neutral theory of evolution which assumes that much of the standing genetic variation derives from neutral or nearly neutral mutations (e.g. Kimura, 1983; Ohta, 1992), part of the observed interspecific differences in molecular genetic diversity likely relates to historical effective population sizes (Bousquet et al., 1992). Indeed, the minimum historical population size (Ne) of Picea glauca and Picea mariana, the most diverse species group in this study, was estimated at ~100,000 or more individuals (Bouillé and Bousquet, 2005; Chen et al., 2010), while that of Pinus strobus, which belongs to the low diversity group, has been estimated to be an order of magnitude lower (5000 to 10,000 individuals; Zinck and Rajora, 2016). Likewise, Thuja occidentalis, the less genetically diverse species studied herein, also harbors a low Ne (Pandey and Rajora, 2012). There was also an apparent relation between the level of intraspecific SNP diversity and the geographical extent of the species range (Figure 1), though the study of more species would be needed to confirm this trend. For instance, the two most diverse species have wide transcontinental distributions, while the two less diverse species are the most geographically restricted. As the number of glacial lineages of the Pleistocene era is usually positively related with range size in North American tree taxa (Jaramillo-Correa et al., 2009), widely-distributed species are likely to have retained larger historical population size and standing genetic variation than species with currently more restricted natural ranges.

Since our molecular genetic diversity estimates originate from transcriptomic data, it is very likely that selection also played a role in shaping molecular genetic diversity because the transcriptome mostly encodes functional information. Indeed, natural selection can constrain intraspecific molecular genetic diversity through selective sweeps (Smith and Haigh, 1974; Hermisson and Pennings, 2005). Although selective sweeps are presumably uncommon in plants (Wright and Gaut, 2005; Grossman et al., 2010) including in conifers (Palme et al., 2008; Pavy et al., 2012b; Eckert et al., 2013), evidence for selective sweeps has been reported previously in some conifers (Eckert et al., 2009; Namroud et al., 2010; Wang et al., 2020; De La Torre et al., 2021; Gagalova et al., 2022). The pool sequencing data obtained for each species in our study does not allow to test directly for the existence of such sweeps. However, it is unlikely that selective sweeps are a major determinant of transcriptome-wide genetic diversity in conifers because, on average, linkage disequilibrium decays rapidly within gene limits in conifer natural populations (e.g. Pavy et al., 2012b; De La Torre et al., 2017), therefore restricting the possible loss of neutral diversity surrounding selected loci. Hence, although both Ne and linked selection may have contributed to shaping molecular genetic diversity at the intraspecific level, the former is more likely to have been the main driver of differences in overall SNP diversity observed herein among the transcriptomes of the studied species.

Relationships between overall SNP diversity and molecular variation of adaptive nature

The positive relationship observed between the overall SNP diversity of the transcriptome of each species and their proportions of PSGs (Figure 2) suggests that standing genetic variation can constrain variation of more adaptive nature. It is well established that the fixation probability of favorable alleles in a finite population increases along with the effective population size by reducing the strength of genetic drift and therefore limiting the loss of beneficial alleles as well as the fixation of deleterious ones (Charlesworth, 2009). Consistently, our results show that species of presumably larger historical population sizes carry the most adaptive molecular variation. This is also in agreement with more common local adaptation in large plant populations than in small ones (Leimu and Fischer, 2008), which has been reported for several species investigated herein, namely white spruce (Namroud et al., 2008; Hornoy et al., 2015; Depardieu et al., 2021), black spruce (Prunier et al., 2011; 2012), jack pine (Cullingham et al., 2014), and eastern white pine (Nadeau et al., 2016). Because environmental adaptation is highly polygenic in conifers and involves heterogeneous gene responses (e.g. Hornoy et al., 2015; Yeaman et al., 2016; Depardieu et al., 2021), high standing genetic variation associated with large historical population size likely improves species adaptative potential by increasing the number of possible genetic trajectories to achieve adaptation. Hence, such flexibility may allow species to cope with a wider range of environmental conditions (i.e. gain the ability to colonize larger natural range and/or increase their ecological amplitude) and selective pressures (i.e. biotic and abiotic pressures encountered across their range).

Extent of molecular and functional convergence among conifer adaptive genes

Despite the high overlap among gene sequences of the seven conifer species (Figure 3A), we only found limited molecular convergence among their genes under positive selection (Figure 3B). Convergence appeared also limited at the protein family level (Figure 3D), a result consistent with the pattern of species-specific expansion of large paralogous gene families (reviewed by De La Torre et al., 2020) and high functional redundancy in conifers (Guillet-Claude et al., 2004; Bedon et al., 2010; Stival Sena et al., 2018; Van Ghelder et al., 2019). The extent of molecular genetic convergence among taxa would also be expected to increase with their phylogenetic proximity, as a result of shared ancestry (Losos, 2011; Storz, 2016). However, no pattern related to phylogenetic relatedness among taxa was evident, with no sign of increased convergence among the two pairs of congeneric taxa that would have diverged the most recently (divergence between Picea glauca and Picea mariana ~10 Mya (Bouillé and Bousquet, 2005); divergence between Pinus strobus and Pinus banksiana ~85 Mya (Leslie et al., 2018) (Supplementary Table S4). Similarly, convergence was only marginally higher among the Pinaceae taxa that between the Pinaceae taxa and the Cupressaceae taxon despite their more recent divergence (Leslie et al., 2018) (Supplementary Table S4). These observations suggest that each species followed a largely distinct adaptive path, and that adaptive convergence at the molecular genetic level appears to be limited in such reproductively isolated and phylogenetically distant conifers.

There is also strong evidence that the marked difference in sets of PSGs among species would be primarily driven by natural selection, rather than by stochastic processes such as mutation or genetic drift (e.g. Losos, 2011; Mosca et al., 2012; Storz, 2016). The low levels of convergence observed among the species gene sets under positive selection either indicate that gene functional redundancy would allow species to cope with similar selective pressures using alternative genes, and/or that species experienced heterogeneous selective pressures throughout their historical and extent natural ranges. Our data support the first hypothesis, as functional convergence among biological processes associated with genes under positive selection was quite higher that molecular convergence among these genes (Figure 3). This partly decoupled pattern indicates that the studied species would have had sufficient metabolic and gene network flexibility to evolve alternative responses to the various selective pressures they faced under temperate and boreal climate regimes. It is also consistent with gene family expansions in conifers, implying some redundancy in gene functions (Guillet-Claude et al., 2004; Bedon et al., 2010; Pavy et al., 2012a; Stival Sena et al., 2018; Van Ghelder et al., 2019). This redundancy at the functional level may have had significant evolutionary implications for the persistence of these northern conifer species during millions of years, in the face of geological climate instability and in spite of demographic fluctuations. For instance, with the multiple glaciation cycles of the Pleistocene era in northeastern North America, signatures of demographic fluctuations such as bottleneck effects or founder events have been detected in various conifer species from this region, which would have implied more or less important losses of overall genetic diversity (Perron et al., 2000; Gamache et al., 2003; Jaramillo-Correa and Bousquet, 2003; Godbout et al., 2010; Namroud et al., 2010).

In addition to the gene redundancy hypothesis, it is possible that the low convergence observed among conifer PSG sets also reflects that the species have coped with specific long-term selective pressures throughout their large natural ranges. Some species such as the two Picea spp. have large ecological amplitude and a transcontinental range across which they may encounter a variety of selective pressures related to biotic and abiotic stresses (Nienstaedt and Zasada, 1990; Viereck and Johnston, 1990). Also, Picea glauca would prefer mesic sites (Hornoy et al., 2015) while Picea mariana could adapt to a larger variety of site conditions (Lo et al., 2024). Others have more distinct preferred habitats, such as wetter sites for Larix laricina (Cheliak et al., 1988), dryer sites for Pinus banksiana (Rudolph and Laidly, 1990), or both for the more extremophile Thuja occidentalis (Matthes-Sears and Larson, 1991), which can trigger specific adaptive responses and explain the finite extent of adaptive convergence in gene sets under positive selection among the conifer species studied.

Hence, taken together, our results suggest that the adaptive trajectories of these conifer species were likely shaped by the interplay of gene redundancy and heterogeneous selection landscape, and that these two drivers likely contributed to the low convergence observed in terms of gene sets accumulating nonsynonymous SNPs, but higher convergence at the functional level.

This pattern aligns well with other reports of low estimates of molecular genetic convergence among conifers. A study of adaptive traits in four alpine conifers from the Pinus, Abies and Larix genera identified only seven climate-associated genes shared by two or more species out of several hundreds of sequences analyzed (Mosca et al., 2012). Another study uncovered only 47 convergent genes (representing between 10% and 18% of all genes putatively under selection) involved in local adaptation in a spruce and a pine taxon that diverged ~150 Mya (Yeaman et al., 2016). Also, a transcriptome-wide survey of genetic variation in the two quite closely-related but ecologically contrasted Picea glauca and the coastal Picea sitchensis from the Pacific Northwest, revealed only 15 shared genes out of hundreds of genes showing selection footprints (Gagalova et al., 2022). Similar modest molecular genetic convergence was also reported in Angiosperms such as in Arabidopsis (Guggisberg et al., 2018; Preite et al., 2019), or among taxa from the Brassicaceae family (Rellstab et al., 2020).

Genes under positive selection in conifers and overlap with angiosperms

Functional annotations of genes under positive selection were associated with a great variety of molecular functions and biological processes (Supplementary Figure S1), in agreement with the polygenic nature of adaptive traits and the many empirical studies reporting a wide range of genes and functions underlying them in conifers (e.g. Prunier et al., 2011; Mosca et al., 2012; Hornoy et al., 2015; Depardieu et al., 2021).

Despite the large functional diversity observed, the core set of functions and processes shared by all conifer species analyzed revealed a clear pattern of shared adaptive evolution at the functional level. Indeed, half of the 29 shared BPs by all conifer taxa examined herein were related to environmental stress responses, and mechanisms related to defense against pathogens (responses to biotic stress, programmed cell death) were widely represented and enriched in genes under positive selection (Figure 4; Figure 5). We found many homologs of disease resistance genes with the NB-ARC domain (Supplementary Figure S4), several gene families involved in resistance to Xanthomonas, Pseudomonas or rusts (Figure 4; Supplementary Table S5), and several genes encoding enzymes from the secondary metabolite pathways that are known to be involved in defense mechanisms as well as wood formation (Supplementary Table S7). This indicates that selection pressures exerted by pathogens are likely ubiquitous in conifers and play a prominent role in their adaptation to changing environments. Similarly, several shared BPs were linked to abiotic stress response, and more specifically to water and oxygen stimuli (i.e. cellular response to hypoxia, response to oxidative stress, or response to water deprivation), suggesting that drought and flooding could also be drivers of adaptive evolution at the functional level in conifers.

Mechanisms with crucial roles in stress response such as RNA modification and regulatory mechanisms were also found in conifer genes under selection (Figure 4). Regarding RNA modification, we identified 59 sequences encoding pentatricopeptide repeat (PPR) proteins, among which two were also reported as convergent adaptive genes in pine and spruce taxa (Figure 4, Supplementary Figure S4; Yeaman et al., 2016). The PPR proteins have fundamental roles in organelle biogenesis and function, being involved in photosynthesis, respiration, development and environmental responses (Barkan and Small, 2014). Positive regulation of programmed cell death, a process known to be involved in response to biotic and abiotic stresses in plants (Daneva et al., 2016), was also enriched in all Pinaceae species (Figure 5). Transcriptional regulators included several MYB and WRKY gene sequences. Homologs of genes involved in seasonal transitions in Arabidopsis are likely prime targets of natural selection given that they contribute to the adaptation of plants to their environment, assuming that their functions are conserved across seed plants. Indeed, a FRIGIDA-like protein (accession FRL3_ARATH) was found among conifer genes under positive selection (Figure 4). In Arabidopsis, it regulates phase transition during shoot, flower and seed development such that FRIGIDA gene sequences are required for the winter-annual habit (Michaels et al., 2004). The identification of several homologous regulators involved in survival in Angiosperms also suggests a possible key role in conifers and makes them a relevant class of genes to target in future molecular functional studies.

We uncovered 17 PSGs shared by conifers and two Angiosperm taxa (Brassicaceae and poplar) (Table 3). This level of molecular convergence was higher than expected, given that Angiosperms and Gymnosperms (to which belong conifers) diverged ~350 Mya (Li et al., 2019). Disease resistance genes against pathogens are known to evolve rapidly in flowering plants (Meyers et al., 2005). Although well represented among conifer PSGs, they were not predominant among the core set of convergent genes between conifers and Angiosperms. In contrast, several genes encoding enzymes (transferases, peptidase, peroxidases), one PPR gene involved in RNA editing, and several transcription factors such as MYBs showed up in this set of genes (Table 3). To our knowledge, such widely spread molecular signatures of adaptation have not been reported to date and may be interpreted as a sign that adaptive convergence at the molecular level, though limited, can take place at a very broad taxonomic level. Consequently, these genes represent valuable candidates for future evolutionary studies aiming to characterize molecular and functional convergence among seed plants.

Conclusions

Transcriptome-wide SNP diversity was assessed for seven partially sympatric and reproductively isolated conifers. We found marked variation in overall SNP diversity among species, that would reflect mainly differences in demography and historical population size. Little overlap in sets of adaptive genes under positive selection was noted among species, suggesting distinct evolutionary trajectories. In contrast, their biological functions were much convergent and largely related to stress response and regulatory mechanisms. This trend indicates high molecular plasticity in response to similar climate and natural selective pressures. Several adaptive gene homologs were shared between conifers and Angiosperms, despite their ancient divergence ~350 Mya.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ebi.ac.uk/ena, ERS16017105-ERS16017139 and ERS16049778-ERS16049791; https://doi.org/10.5061/dryad.p8cz8w9w1, Dryad.

Author contributions

NP: Formal Analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing. SG: Formal Analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing. JP: Project administration, Resources, Writing – review & editing. PR: Resources, Writing – review & editing. JL: Resources, Writing – review & editing. GD: Methodology, Software, Writing – review & editing. BB: Resources, Writing – review & editing. JM: Funding acquisition, Writing – review & editing, Conceptualization. JB: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was part of the GenAC project supported by a research grant from the Québec Ministry for the Economy, Science and Innovation (PSR-SIIRI-836) to JM, PR and JB, and from the Natural Sciences and Engineering Research Council of Canada to JB.

Acknowledgments

We are grateful to Aida Azaiez, France Gagnon and Isabelle Giguère (Canada Research Chair in Forest Genomics, Univ. Laval), as well as Manuel Lamothe (Laurentian Forestry Centre) for providing technical assistance, as well as Jean Beaulieu (CRC Forest Genomics, Univ. Laval) for his valuable insights into early statistical analyses.

Conflict of interest

Author PR was employed by Gydle Inc.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1500759/full#supplementary-material

References

Agresti, A. (2013). Categorical Data Analysis. 3rd edition (Hoboken, NJ: John Wiley and Sons).

Google Scholar

Alexa, A., Rahnenführer, J., Lengauer, T. (2006). Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics 22, 1600–1607. doi: 10.1093/bioinformatics/btl140

PubMed Abstract | Crossref Full Text | Google Scholar

Arendt, J., Reznick, D. (2008). Convergence and parallelism reconsidered: what have we learned about the genetics of adaptation? Trends Ecol. Evol. 23, 26–32. doi: 10.1016/j.tree.2007.09.011

PubMed Abstract | Crossref Full Text | Google Scholar

Barghi, N., Hermisson, J., Schlötterer, C. (2020). Polygenic adaptation: a unifying framework to understand positive selection. Nat. Rev. Genet. 21, 769–781. doi: 10.1038/s41576-020-0250-z

PubMed Abstract | Crossref Full Text | Google Scholar

Barkan, A., Small, I. (2014). Pentatricopeptide repeat proteins in plants. Annu. Rev. Plant Biol. 65, 415–442. doi: 10.1146/annurev-arplant-050213-040159

PubMed Abstract | Crossref Full Text | Google Scholar

Bedon, F., Bomal, C., Caron, S., Levasseur, C., Boyle, B., Mansfield, S. D., et al. (2010). Subgroup 4 R2R3-MYBs in conifer trees: gene family expansion and contribution to the isoprenoid-oriented response. J. Exp. Bot. 61, 3847–3864. doi: 10.1093/jxb/erq196

PubMed Abstract | Crossref Full Text | Google Scholar

Bolnick, D. I., Barrett, R. D. H., Oke, K. B., Rennison, D. J., Stuart, Y. E. (2018). (Non)parallel evolution. Annu. Rev. Ecol. Syst. 49, 303–330. doi: 10.1146/annurev-ecolsys-110617-062240

Crossref Full Text | Google Scholar

Bouillé, M., Bousquet, J. (2005). Trans-species shared polymorphisms at orthologous nuclear gene loci among distant species in the conifer Picea (Pinaceae): implications for the long-term maintenance of genetic diversity in trees. Am. J. Bot. 92, 63–73. doi: 10.3732/ajb.92.1.63

PubMed Abstract | Crossref Full Text | Google Scholar

Bousquet, J., Gérardi, S., de Lafontaine, G., Jaramillo-Correa, J. P., Pavy, N., Prunier, J., et al. (2021). “Spruce population genomics,” in Population Genomics: Forest Trees. Ed. Rajora, O. P. (Springer Nature, Switzerland), 1–64. doi: 10.1007/13836_2021_96

Crossref Full Text | Google Scholar

Bousquet, J., Strauss, S. H., Doerksen, A. H., Price, R. A. (1992). Extensive variation in evolutionary rate of rbcL gene sequences among seed plants. Proc. Natl. Acad. Sci. U.S.A. 89, 7844–7848. doi: 10.1073/pnas.89.16.7844

PubMed Abstract | Crossref Full Text | Google Scholar

Cao, Y., Liang, Y., Tanaka, K., Nguyen, C. T., Jedrzejczak, R. P., Joachimiak, A., et al. (2014). The kinase LYK5 is a major chitin receptor in Arabidopsis and forms a chitin-induced complex with related kinase CERK1. elife 3, e03766. doi: 10.7554/eLife.03766

PubMed Abstract | Crossref Full Text | Google Scholar

Charlesworth, B. (2009). Effective population size and patterns of molecular evolution and variation. Nat. Rev. Genet. 17, 195–205. doi: 10.1038/nrg2526

PubMed Abstract | Crossref Full Text | Google Scholar

Cheliak, W. M., Wang, J., Pitel, J. A. (1988). Population structure and genic diversity in tamarack, Larix laricina (Du Roi) K. Koch. Can. J. For. Res. 18, 1318–1324. doi: 10.1139/x88-203

Crossref Full Text | Google Scholar

Chen, J., Källman, T., Gyllenstrand, N., Lascoux, M. (2010). New insights on the speciation history and nucleotide diversity of three boreal spruce species and a Tertiary relict. Heredity 104, 3–14. doi: 10.1038/hdy.2009.88

PubMed Abstract | Crossref Full Text | Google Scholar

Csilléry, K., Rodríguez-Verdugo, A., Rellstab, C., Guillaume, F. (2018). Detecting the genomic signal of polygenic adaptation and the role of epistasis in evolution. Mol. Ecol. 27, 606–612. doi: 10.1111/mec.14499

PubMed Abstract | Crossref Full Text | Google Scholar

Cullingham, C. I., Cooke, J. E. K., Coltman, D. W. (2014). Cross-species outlier detection reveals different evolutionary pressures between sister species. New Phytol. 204, 215–229. doi: 10.1111/nph.12896

PubMed Abstract | Crossref Full Text | Google Scholar

Daneva, A., Gao, Z., Van Durme, M., Nowack, M. K. (2016). Functions and regulation of programmed cell death in plant development. Annu. Rev. Cell Dev. Biol. 32, 441–468. doi: 10.1146/annurev-cellbio-111315-124915

PubMed Abstract | Crossref Full Text | Google Scholar

De La Torre, A. R., Birol, I., Bousquet, J., Ingvarsson, P., Jansson, S., Jones, S. J. M., et al. (2014). Insights into conifer giga-genomes. Plant Physiol. 166, 1724–1732. doi: 10.1104/pp.114.248708

PubMed Abstract | Crossref Full Text | Google Scholar

De La Torre, A. R., Ling, Y. C., Van de Peer, Y., Ingvarsson, P. K. (2017). Contrasting rates of molecular evolution and patterns of selection among gymnosperms and flowering plants. Mol. Biol. Evol. 34, 1363–1377. doi: 10.1093/molbev/msx069

PubMed Abstract | Crossref Full Text | Google Scholar

De La Torre, A. R., Piot, A., Liu, B., Wilhite, B., Weiss, M., Porth, I. (2020). Functional and morphological evolution in gymnosperms: A portrait of implicated gene families. Evol. Appl. 13, 210–227. doi: 10.1111/eva.12839

PubMed Abstract | Crossref Full Text | Google Scholar

De La Torre, A. R., Sekhwal, M. K., Neale, D. B. (2021). Selective sweeps and polygenic adaptation drive local adaptation along moisture and temperature gradients in natural populations of coast redwood and giant sequoia. Genes 12, 1826. doi: 10.3390/genes12111826

PubMed Abstract | Crossref Full Text | Google Scholar

Depardieu, C., Gérardi, S., Nadeau, S., Parent, G. J., Mackay, J., Lenz, P., et al. (2021). Connecting tree-ring phenotypes, genetic associations and transcriptomics to decipher the genomic architecture of drought adaptation in a widespread conifer. Mol. Ecol. 30, 3898–3917. doi: 10.1111/mec.15846

PubMed Abstract | Crossref Full Text | Google Scholar

DePristo, M. A., Banks, E., Poplin, R., Garimella, K. V., Maguire, J. R., Hartl, C., et al. (2011). A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498. doi: 10.1038/ng.806

PubMed Abstract | Crossref Full Text | Google Scholar

Eckert, A. J., Bower, A. D., Jermstad, K. D., Wegrzyn, J. L., Knaus, B. J., Syring, J. V., et al. (2013). Multilocus analyses reveal little evidence for lineage-wide adaptive evolution within major clades of soft pines (Pinus subgenus Strobus). Mol. Ecol. 22, 5635–5650. doi: 10.1111/mec.12514

PubMed Abstract | Crossref Full Text | Google Scholar

Eckert, A. J., Wegrzyn, J. L., Pande, B., Jermstad, K. D., Lee, J. M., Liechty, J. D., et al. (2009). Multilocus patterns of nucleotide diversity and divergence reveal positive selection at candidate genes related to cold hardiness in coastal Douglas fir (Pseudotsuga menziesii var. menziesii). Genetics 183, 289–298. doi: 10.1534/genetics.109.103895

PubMed Abstract | Crossref Full Text | Google Scholar

El-Gebali, S., Mistry, J., Bateman, A., Eddy, S. R., Luciani, A., Potter, S. C., et al. (2019). The Pfam protein families database in 2019. Nuc. Acids Res. 47, D427–D432. doi: 10.1093/nar/gky995

PubMed Abstract | Crossref Full Text | Google Scholar

Ellegren, H., Galtier, N. (2016). Determinants of genetic diversity. Nat. Rev. Genet. 17, 422–433. doi: 10.1038/nrg.2016.58

PubMed Abstract | Crossref Full Text | Google Scholar

Elmer, K. R., Meyer, A. (2011). Adaptation in the age of ecological genomics: insights from parallelism and convergence. Trends Ecol. Evol. 26, 298–306. doi: 10.1016/j.tree.2011.02.008

PubMed Abstract | Crossref Full Text | Google Scholar

Emms, D. M., Kelly, S. (2019). Orthofinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 238. doi: 10.1186/s13059-019-1832-y

PubMed Abstract | Crossref Full Text | Google Scholar

Eo, S. H., DeWoody, J. A. (2012). The effects of contig length and depth on the estimation of SNP frequencies, and the relative abundance of SNPs in protein-coding and non-coding transcripts of tiger salamanders (Ambystoma tigrinum). BMC Genomics 13, 1. doi: 10.1186/1471-2164-13-259

PubMed Abstract | Crossref Full Text | Google Scholar

Fay, J. C., Wyckoff, G. J., Wu, C. I. (2001). Positive and negative selection on the human genome. Genetics 158, 1227–1234. doi: 10.1093/genetics/158.3.1227

PubMed Abstract | Crossref Full Text | Google Scholar

Gagalova, K. K., Warren, R. L., Coombe, L., Wong, J., Nip, K. M., Yuen, M. M., et al. (2022). Spruce giga-genomes: structurally similar yet distinctive with differentially expanding gene families and rapidly evolving genes. Plant J. 111, 1469–1485. doi: 10.1111/tpj.15889

PubMed Abstract | Crossref Full Text | Google Scholar

Gamache, I., Jaramillo-Correa, J. P., Payette, S., Bousquet, J. (2003). Diverging patterns of mitochondrial and nuclear DNA diversity in subarctic black spruce: imprint of a founder effect associated with postglacial colonization. Mol. Ecol. 12, 891–901. doi: 10.1046/j.1365-294x.2003.01800.x

PubMed Abstract | Crossref Full Text | Google Scholar

Godbout, J., Beaulieu, J., Bousquet, J. (2010). Phylogeographic structure of jack pine (Pinus banksiana; Pinaceae) supports the existence of a coastal glacial refugium in northeastern North America. Am. J. Bot. 97, 1903–1912. doi: 10.3732/ajb.1000148

PubMed Abstract | Crossref Full Text | Google Scholar

Grossman, S. R., Shylakhter, I., Karlsson, E. K., Byrne, E. H., Morales, S., Frieden, G., et al. (2010). A composite of multiple signals distinguishes causal variants in regions of positive selection. Science 327, 883–886. doi: 10.1126/science.1183863

PubMed Abstract | Crossref Full Text | Google Scholar

Guggisberg, A., Liu, X., Suter, L., Mansion, G., Fischer, M. C., Fior, S., et al. (2018). The genomic basis of adaptation to calcareous and siliceous soils in Arabidopsis lyrata. Mol. Ecol. 27, 5088–5103. doi: 10.1111/mec.14930

PubMed Abstract | Crossref Full Text | Google Scholar

Guillet-Claude, C., Isabel, N., Pelgas, B., Bousquet, J. (2004). The evolutionary implications of knox-I gene duplications in conifers: correlated evidence from phylogeny, gene mapping, and analysis of functional divergence. Mol. Biol. Evol. 21, 2232–2245. doi: 10.1093/molbev/msh235

PubMed Abstract | Crossref Full Text | Google Scholar

Guo, Y., Liu, J., Zhang, J., Liu, S., Du, J. (2017). Selective modes determine evolutionary rates, gene compactness and expression patterns in Brassica. Plant J. 91, 34–44. doi: 10.1111/tpj.13541

PubMed Abstract | Crossref Full Text | Google Scholar

Hao, Y., Qu, Y., Song, G., Lei, F. (2019). Genomic insights into the adaptive convergent evolution. Curr. Genomics 20, 81–89. doi: 10.2174/1389202920666190313162702

PubMed Abstract | Crossref Full Text | Google Scholar

Hart, A. J., Ginzburg, S., Xu, M., Fisher, C. R., Rahmatpour, N., Mitton, J. B., et al. (2020). EnTAP: bringing faster and smarter functional annotation to non-model eukaryotic transcriptomes. Mol. Ecol. 20, 591–604. doi: 10.1111/1755-0998.13106

PubMed Abstract | Crossref Full Text | Google Scholar

He, Z., Xu, S., Zhang, Z., Guo, W., Lyu, H., Zhong, C., et al. (2020). Convergent adaptation of the genomes of woody plants at the land–sea interface. Natl. Sci. Rev. 7, 978–993. doi: 10.1093/nsr/nwaa027

PubMed Abstract | Crossref Full Text | Google Scholar

Hermisson, J., Pennings, P. S. (2005). Soft sweeps: molecular population genetics of adaptation from standing genetic variation. Genetics 169, 2335–2352. doi: 10.1534/genetics.104.036947

PubMed Abstract | Crossref Full Text | Google Scholar

Hornoy, B., Pavy, N., Gerardi, S., Beaulieu, J., Bousquet, J. (2015). Genetic adaptation to climate in white spruce involves small to moderate allele frequency shifts in functionally diverse genes. Genome Biol. Evol. 7, 3269–3285. doi: 10.1093/gbe/evv218

PubMed Abstract | Crossref Full Text | Google Scholar

Hu, S. P., Li, J. J., Dhar, N., Li, J. P., Chen, J. Y., Jian, W., et al. (2021). Lysin motif (LysM) proteins: interlinking manipulation of plant immunity and fungi. Int. J. Mol. Sci. 22, 3114. doi: 10.3390/ijms22063114

PubMed Abstract | Crossref Full Text | Google Scholar

Jaramillo-Correa, J. P., Beaulieu, J., Khasa, D. P., Bousquet, J. (2009). Inferring the past from the present phylogeographic structure of North American forest trees: seeing the forest for the genes. Can. J. For. Res. 39, 286–307. doi: 10.1139/X08-181

Crossref Full Text | Google Scholar

Jaramillo-Correa, J. P., Bousquet, J. (2003). New evidence from mitochondrial DNA of a progenitor-derivative species relationship between black spruce and red spruce (Pinaceae). Amer. J. Bot. 90, 1801–1806. doi: doi : 10.3732/ajb.90.12.1801

PubMed Abstract | Crossref Full Text | Google Scholar

Jaramillo-Correa, J. P., Gérardi, S., Beaulieu, J., Ledig, F. T., Bousquet, J. (2015). Inferring and outlining past population declines with linked microsatellites: a case study in two spruce species. Tree Genet. Genomes 11, 12p. doi: 10.1007/s11295-015-0835-4

Crossref Full Text | Google Scholar

Kimura, M. (1983). The neutral theory of molecular evolution (New York: Cambridge University Press). doi: 10.1017/CBO9780511623486

Crossref Full Text | Google Scholar

Kolde, R. (2019). pheatmap: pretty heatmaps. Available online at: https://CRAN.R-project.org/package=pheatmap (Accessed February 21, 2025).

Google Scholar

Le Corre, V., Kremer, A. (2012). The genetic differentiation at quantitative trait loci under local adaptation. Mol. Ecol. 217, 1548–1566. doi: 10.1111/j.1365-294X.2012.05479.x

PubMed Abstract | Crossref Full Text | Google Scholar

Leimu, R., Fischer, M. (2008). A meta-analysis of local adaptation in plants. PloS One 3, e4010. doi: 10.1371/journal.pone.0004010

PubMed Abstract | Crossref Full Text | Google Scholar

Leslie, A. B., Beaulieu, J., Holman, G., Campbell, C. S., Mei, W., Raubeson, L. R., et al. (2018). An overview of extant conifer evolution from the perspective of the fossil record. Am. J. Bot. 105, 1531–1544. doi: 10.1002/ajb2.1143

PubMed Abstract | Crossref Full Text | Google Scholar

Li, H. T., Yi, T. S., Gao, L. M., Ma, P. F., Zhang, T., Yang, J. B., et al. (2019). Origin of angiosperms and the puzzle of the Jurassic gap. Nat. Plants 5, 461–470. doi: 10.1038/s41477-019-0421-0

PubMed Abstract | Crossref Full Text | Google Scholar

Lin, Y. C., Wang, J., Delhomme, N., Schiffthaler, B., Sundström, G., Zuccolo, A., et al. (2018). Functional and evolutionary genomic inferences in Populus through genome and population sequencing of American and European aspen. Proc. Natl. Acad. Sci. U.S.A. 115, 10970–10978. doi: 10.1073/pnas.1801437115

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, J., Zhang, Y., Lei, X., Zhang, Z. (2008). Natural selection of protein structural and functional properties: a single nucleotide polymorphism perspective. Genome Biol. 9, 1–17. doi: 10.1186/gb-2008-9-4-r69

PubMed Abstract | Crossref Full Text | Google Scholar

Lo, T., Coombe, L., Gagalova, K., Marr, A., Warren, R. L., Kirk, H., et al. (2024). Assembly and annotation of the black spruce genome provide insights on spruce phylogeny and evolution of stress response. G3 – Genes Genomes Genet. 14, jkad247. doi: 10.1093/g3journal/jkad247

PubMed Abstract | Crossref Full Text | Google Scholar

Losos, J. B. (2011). Convergence, adaptation, and constraint. Evolution 65, 1827–1840. doi: 10.1111/j.1558-5646.2011.01289.x

PubMed Abstract | Crossref Full Text | Google Scholar

Manceau, M., Domingues, V. S., Linnen, C. R., Rosenblum, E. B., Hoekstra, H. E. (2010). Convergence in pigmentation at multiple levels: mutations, genes and function. Phil. Trans. R. Soc B: Biol. Sci. 365, 2439–2450. doi: 10.1098/rstb.2010.0104

PubMed Abstract | Crossref Full Text | Google Scholar

Martin, A., Orgogozo, V. (2013). The loci of repeated evolution: a catalog of genetic hotspots of phenotypic variation. Evolution 67, 1235–1250. doi: 10.1111/evo.12081

PubMed Abstract | Crossref Full Text | Google Scholar

Matthes-Sears, U., Larson, D. W. (1991). Growth and physiology of Thuja occidentalis L. from cliffs and swamps: is variation habitat or site specific? Bot. Gazette 152, 500–508. doi: 10.1086/337912

Crossref Full Text | Google Scholar

McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., et al. (2010). The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303. doi: 10.1101/gr.107524.110

PubMed Abstract | Crossref Full Text | Google Scholar

Meyers, B. C., Kaushik, S., Nandety, R. S. (2005). Evolving disease resistance genes. Curr. Opin. Plant Biol. 8, 129–134. doi: 10.1016/j.pbi.2005.01.002

PubMed Abstract | Crossref Full Text | Google Scholar

Michaels, S. D., Bezerra, I. C., Amasino, R. M. (2004). FRIGIDA-related genes are required for the winter-annual habit in Arabidopsis. Proc. Natl. Acad. Sci. U.S.A. 101, 3281–3285. doi: 10.1073/pnas.0306778101

PubMed Abstract | Crossref Full Text | Google Scholar

Miettinen, K., Dong, L., Navrot, N., Schneider, T., Burlat, V., Pollier, J., et al. (2014). The seco-iridoid pathway from Catharanthus roseus. Nat. Commun. 5, 3606. doi: 10.1038/ncomms4606

PubMed Abstract | Crossref Full Text | Google Scholar

Mosca, E., Eckert, A. J., Di Pierro, E. A., Rocchini, D., La Porta, N., Belletti, P., et al. (2012). The geographical and environmental determinants of genetic diversity for four alpine conifers of the European Alps. Mol. Ecol. 21, 5530–5545. doi: 10.1111/mec.12043

PubMed Abstract | Crossref Full Text | Google Scholar

Nadeau, S., Meirmans, P. G., Aitken, S. N., Ritland, K., Isabel, N. (2016). The challenge of separating signatures of local adaptation from those of isolation by distance and colonization history: The case of two white pines. Ecol. Evol. 6, 8649–8664. doi: 10.1002/ece3.2550

PubMed Abstract | Crossref Full Text | Google Scholar

Namroud, M. C., Beaulieu, J., Juge, N., Laroche, J., Bousquet, J. (2008). Scanning the genome for gene single nucleotide polymorphisms involved in adaptive population differentiation in white spruce. Mol. Ecol. 17, 3599–3613. doi: 10.1111/j.1365-294X.2008.03840.x

PubMed Abstract | Crossref Full Text | Google Scholar

Namroud, M. C., Guillet-Claude, C., Mackay, J., Isabel, N., Bousquet, J. (2010). Molecular evolution of regulatory genes in spruces from different species and continents: heterogeneous patterns of linkage disequilibrium and selection but correlated recent demographic changes. J. Mol. Evol. 70, 371–386. doi: 10.1007/s00239-010-9335-1

PubMed Abstract | Crossref Full Text | Google Scholar

Nienstaedt, H., Zasada, J. C. (1990). Picea glauca (Moench) Voss. white spruce. Silvics North America 1, 204–226.

Google Scholar

Ohta, T. (1992). The nearly neutral theory of molecular evolution. Annu. Rev. Ecol. Syst. 23, 263–286. doi: 10.1146/annurev.es.23.110192.001403

Crossref Full Text | Google Scholar

Palme, A. E., Wright, M., Savolainen, O. (2008). Patterns of divergence among conifer ESTs and polymorphism in Pinus sylvestris identify putative selective sweeps. Mol. Biol. Evol. 25, 2567–2577. doi: 10.1093/molbev/msn194

PubMed Abstract | Crossref Full Text | Google Scholar

Pandey, M., Rajora, O. P. (2012). Higher fine-scale genetic structure in peripheral than in core populations of a long-lived and mixed-mating conifer-eastern white cedar (Thuja occidentalis L.). BMC Evol. Biol. 12, 1–14. doi: 10.1186/1471-2148-12-48

PubMed Abstract | Crossref Full Text | Google Scholar

Pavy, N., Namroud, M. C., Gagnon, F., Isabel, N., Bousquet, J. (2012b). The heterogeneous levels of linkage disequilibrium in white spruce genes and comparative analysis with other conifers. Heredity 108, 273–284. doi: 10.1038/hdy.2011.72

PubMed Abstract | Crossref Full Text | Google Scholar

Pavy, N., Pelgas, B., Laroche, J., Rigault, P., Isabel, N., Bousquet, J. (2012a). A spruce gene map infers ancient plant genome reshuffling and subsequent slow evolution in the gymnosperm lineage leading to extant conifers. BMC Biol. 10, 84. doi: 10.1186/1741-7007-10-84

PubMed Abstract | Crossref Full Text | Google Scholar

Perron, M., Perry, D. J., Andalo, C., Bousquet, J. (2000). Evidence from sequence-tagged-site markers of a recent progenitor-derivative species pair in conifers. Proc. Natl. Acad. Sci. U.S. A. 97, 11331–11336. doi: 10.1073/pnas.200417097

PubMed Abstract | Crossref Full Text | Google Scholar

Petit, R. J., Hampe, A. (2006). Some evolutionary consequences of being a tree. Annu. Rev. Ecol. Syst. 37, 187–214. doi: 10.1146/annurev.ecolsys.37.091305.110215

Crossref Full Text | Google Scholar

Pétriacq, P., de Bont, L., Tcherkez, G., Gakière, B. (2013). NAD: not just a pawn on the board of plant-pathogen interactions. Plant Signal. Behav. 8, e22477. doi: 10.4161/psb.22477

PubMed Abstract | Crossref Full Text | Google Scholar

Preite, V., Sailer, C., Syllwasschy, L., Bray, S., Ahmadi, H., Krämer, U., et al. (2019). Convergent evolution in Arabidopsis halleri and Arabidopsis arenosa on calamine metalliferous soils. Phil. Trans. R. Soc B: Biol. Sci. 374, 20180243. doi: 10.1098/rstb.2018.0243

PubMed Abstract | Crossref Full Text | Google Scholar

Prunier, J., Gerardi, S., Laroche, J., Beaulieu, J., Bousquet, J. (2012). Parallel and lineage-specific molecular adaptation to climate in boreal black spruce. Mol. Ecol. 21, 4270–4286. doi: 10.1111/j.1365-294X.2012.05691.x

PubMed Abstract | Crossref Full Text | Google Scholar

Prunier, J., Laroche, J., Beaulieu, J., Bousquet, J. (2011). Scanning the genome for gene SNPs related to climate adaptation and estimating selection at the molecular level in boreal black spruce. Mol. Ecol. 20, 1702–1716. doi: 10.1111/j.1365-294X.2011.05045.x

PubMed Abstract | Crossref Full Text | Google Scholar

Prunier, J., Verta, J. P., MacKay, J. J. (2016). Conifer genomics and adaptation: at the crossroads of genetic diversity and genome function. New Phytol. 209, 44–62. doi: 10.1111/nph.13565

PubMed Abstract | Crossref Full Text | Google Scholar

Rellstab, C., Zoller, S., Sailer, C., Tedder, A., Gugerli, F., Shimizu, K. K., et al. (2020). Genomic signatures of convergent adaptation to Alpine environments in three Brassicaceae species. Mol. Ecol. 29, 4350–4365. doi: 10.1111/mec.15648

PubMed Abstract | Crossref Full Text | Google Scholar

Rigault, P., Boyle, B., Lepage, P., Cooke, J. E., Bousquet, J., MacKay, J. J. (2011). A white spruce gene catalog for conifer genome analyses. Plant Phys. 157, 14–28. doi: 10.1104/pp.111.179663

PubMed Abstract | Crossref Full Text | Google Scholar

Ro, D. K., Arimura, G. I., Lau, S. Y., Piers, E., Bohlmann, J. (2005). Loblolly pine abietadienol/abietadienal oxidase PtAO (CYP720B1) is a multifunctional, multisubstrate cytochrome P450 monooxygenase. Proc. Natl. Acad. Sci. U.SA. 102, 8060–8065. doi: 10.1073/pnas.0500825102

PubMed Abstract | Crossref Full Text | Google Scholar

Rudolph, T. D., Laidly, P. R. (1990). Pinus banksiana Lamb. jack pine. Silvics North America 1, 280–293.

Google Scholar

Sackton, T. B., Clark, N. (2019). Convergent evolution in the genomics era: new insights and directions. Phil. Trans. R. Soc B: Biol. Sci. 374, 20190102. doi: 10.1098/rstb.2019.0102

PubMed Abstract | Crossref Full Text | Google Scholar

Savard, L., Li, P., Strauss, S. H., Chase, M. W., Michaud, M, Bousquet, J. (1994). Chloroplast and nuclear gene sequences indicate Late Pennsylvanian time for the last common ancestor of extant seed plants. Proc. Natl. Acad. Sci. U.S.A. 91, 5163–5167. doi: 10.1073/pnas.91.11.5163

PubMed Abstract | Crossref Full Text | Google Scholar

Smith, J. M., Haigh, J. (1974). The hitch-hiking effect of a favourable gene. Genet. Res. 23, 23–35. doi: 10.1017/S0016672308009579

PubMed Abstract | Crossref Full Text | Google Scholar

Steele, C. L., Katoh, S., Bohlmann, J., Croteau, R. (1998). Regulation of oleoresinosis in grand fir (Abies grandis) differential transcriptional control of monoterpene, sesquiterpene, and diterpene synthase genes in response to wounding. Plant Phys. 116, 1497–1504. doi: 10.1104/pp.116.4.1497

PubMed Abstract | Crossref Full Text | Google Scholar

Stern, D. L. (2013). The genetic causes of convergent evolution. Nat. Rev. Genet. 14, 751–764. doi: 10.1038/nrg3483

PubMed Abstract | Crossref Full Text | Google Scholar

Stern, D. L., Orgogozo, V. (2008). The loci of evolution: how predictable is genetic evolution? Evolution 62, 2155–2177. doi: 10.1111/j.1558-5646.2008.00450.x

PubMed Abstract | Crossref Full Text | Google Scholar

Stival Sena, J., Giguère, I., Rigault, P., Bousquet, J., Mackay, J. (2018). Expansion of the dehydrin gene family in conifers is associated with considerable structural diversity and drought responsive expression. Tree Physiol. 38, 442–456. doi: 10.1093/treephys/tpx125

PubMed Abstract | Crossref Full Text | Google Scholar

Storz, J. F. (2016). Hemoglobin-oxygen affinity in high-altitude vertebrates: is there evidence for an adaptive trend? J. Exp. Biol. 219, 3190–3203. doi: 10.1242/jeb.127134

PubMed Abstract | Crossref Full Text | Google Scholar

Sung, W., Ackerman, M. S., Miller, S. F., Doak, T. G., Lynch, M. (2012). Drift-barrier hypothesis and mutation-rate evolution. Proc. Natl. Acad. Sci. U.S.A. 109, 18488–18492. doi: 10.1073/pnas.1216223109

PubMed Abstract | Crossref Full Text | Google Scholar

Tenaillon, O., Rodriguez-Verdugo, A., Gaut, R. L., McDonald, P., Bennett, A. F., Long, A. D., et al. (2012). The molecular diversity of adaptive convergence. Science 335, 457–461. doi: 10.1126/science.1212986

PubMed Abstract | Crossref Full Text | Google Scholar

Van Ghelder, C., Parent, G. J., Rigault, P., Prunier, J., Giguère, I., Caron, S., et al. (2019). The large repertoire of conifer NLR resistance genes includes drought responsive and highly diversified RNLs. Sci. Rep. 9, 1–3. doi: 10.1038/s41598-019-47950-7

PubMed Abstract | Crossref Full Text | Google Scholar

Viereck, L. A., Johnston, W. F. (1990). Picea mariana (Mill.) BSP black spruce. Silvics North America 1, 227–237.

Google Scholar

Wang, X., Bernhardsson, C., Ingvarsson, P. K. (2020). Demography and natural selection have shaped genetic variation in the widely distributed conifer Norway spruce (Picea abies). Genome Biol. Evol. 12, 3803–3817. doi: 10.1093/gbe/evaa005

PubMed Abstract | Crossref Full Text | Google Scholar

Wright, S. I., Gaut, B. S. (2005). Molecular population genetics and the search for adaptive evolution in plants. Mol. Biol. Evol. 22, 506–519. doi: 10.1093/molbev/msi035

PubMed Abstract | Crossref Full Text | Google Scholar

Xu, S., Wang, J., Guo, Z., He, Z., Shi, S. (2020). Genomic convergence in the adaptation to extreme environments. Plant Commun. 1, 100117. doi: 10.1016/j.xplc.2020.100117

PubMed Abstract | Crossref Full Text | Google Scholar

Yeaman, S., Hodgins, K. A., Lotterhos, K. E., Suren, H., Nadeau, S., Degner, J. C., et al. (2016). Convergent local adaptation to climate in distantly related conifers. Science 353, 1431–1433. doi: 10.1126/science.aaf7812

PubMed Abstract | Crossref Full Text | Google Scholar

Zinck, J. W. R., Rajora, O. P. (2016). Post-glacial phylogeography and evolution of a wide-ranging highly-exploited keystone forest tree, eastern white pine (Pinus strobus) in North America: single refugium, multiple routes. BMC Evol. Biol. 16, 56. doi: 10.1186/s12862-016-0624-1

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: intraspecific molecular genetic diversity, molecular and functional adaptive variation, SNP A/S ratio, comparative genomics, expressed genes, Cupressaceae, Pinaceae, positively selected genes

Citation: Pavy N, Gérardi S, Prunier J, Rigault P, Laroche J, Daigle G, Boyle B, MacKay J and Bousquet J (2025) Contrasting levels of transcriptome-wide SNP diversity and adaptive molecular variation among conifers. Front. Plant Sci. 16:1500759. doi: 10.3389/fpls.2025.1500759

Received: 23 September 2024; Accepted: 13 February 2025;
Published: 06 March 2025.

Edited by:

Zhikai Liang, North Dakota State University, United States

Reviewed by:

Kristian Karsten Ullrich, Max Planck Institute for Evolutionary Biology, Germany
Yuan Feng, St. Jude Children’s Research Hospital, United States

Copyright © 2025 Pavy, Gérardi, Prunier, Rigault, Laroche, Daigle, Boyle, MacKay and Bousquet. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jean Bousquet, jean.bousquet@sbf.ulaval.ca

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

Man ultramarathon runner in the mountains he trains at sunset

94% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more