- Graduate School of Science and Technology, Kumamoto University, Kumamoto, Japan
Short secreted plant peptides act as key signaling molecules and control a plethora of developmental and physiological processes. The ROOT GROWTH FACTOR (RGF)/GOLVEN (GLV)/CLE-Like (CLEL) family of peptides was discovered to be involved in root development in Arabidopsis thaliana. In contrast to active research efforts, which have been revealing receptors and downstream signaling components, little attention has been paid to evolutionary processes that shaped the RGF signaling system as we know it in angiosperms today. As a first step toward understanding how RGF signaling emerged and evolved, this study aimed to elucidate the phylogenetic distribution and functional conservation of RGF-like sequences. Using publicly available, genome and transcriptome data, RGF-like sequences were searched in 27 liverworts, 22 mosses, 8 hornworts, 23 lycophytes, 23 ferns, 38 gymnosperms, and 8 angiosperms. This led to the identification of more than four hundreds of RGF-like sequences in all major extant land plant lineages except for hornworts. Sequence comparisons within and between taxonomic groups identified lineage-specific characters. Notably, one of the two major RGF subgroups, represented by A. thaliana RGF6/GLV1/CLEL6, was found only in vascular plants. This subgroup, therefore, likely emerged in a common ancestor of vascular plants after its divergence from bryophytes. In bryophytes, our results infer independent losses of RGF-like sequences in mosses and hornworts. On the other hand, a single, highly similar RGF-like sequence is conserved in liverworts, including Marchantia polymorpha, a genetically tractable model species. When constitutively expressed, the M. polymorpha RGF-like sequence (MpRGF) affected plant development and growth both in A. thaliana and M. polymorpha. This suggests that MpRGF can exert known RGF-like effects and that MpRGF is under transcriptional control so that its potent activities are precisely controlled. These data suggest that RGFs are conserved as signaling molecules in both vascular plants and bryophytes and that lineage-specific diversification has increased sequence variations of RGFs. All together, our findings form a basis for further studies into RGF peptides and their receptors, which will contribute to our understandings of how peptide signaling pathways evolve.
Introduction
Debate is ongoing regarding phylogenetic relationships among land plant lineages. Gaining more support from recent molecular data, one hypothesis proposes that vascular and non-vascular plants are both monophyletic and stem from a deeply rooted split that took place early in land plant evolution (for schematic representation, refer to Figure 8). Morphologically, non-vascular plants (i.e., bryophytes) and angiosperms are quite different; homology at the organ level is not generally accepted or remains unresolved between the dominant gametophytic body of bryophytes and dominant sporophytic body of angiosperms. Nevertheless, recent molecular genetics studies have revealed cases where homologous genetic modules control development of functionally analogous organs in the two bodies of different lineages and generations (Koi et al., 2016; Rovekamp et al., 2016; Whitewoods et al., 2018; Yasui et al., 2019; Hirakawa et al., 2020). These observations could be accounted for by a common origin of organs in study or independent recruitment of the same genetic module to analogous organs. Either way, knowledge of molecular bases that underlie biological processes in diverse species is instrumental in understanding genetic mechanisms of how plants evolve and diversify.
Since the first discovery of an endogenous short signaling peptide, genetic, biochemical, and bioinformatic approaches have continued to add new signaling peptide families that act in various biological processes (Olsson et al., 2019; Segonzac and Monaghan, 2019). They mature from short precursor proteins through post-translational modifications and proteolytic cleavage into peptides of typically 10-to-20 amino acid residues (post-translationally modified peptides; PTMPs for short). More than ten PTMP families have been reported to date and all share common gene structures; PTMPs are encoded near the N-terminus. Outside the short PTMP-encoding region, PTMP precursor sequences are highly variable among family members. This structural feature has been hampering confident identification of PTMP homologs especially between two distantly related species. Nonetheless, studies started to reveal conservation of PTMPs across land plants (e.g., Miwa et al., 2009; Ghorbani et al., 2015; Tavormina et al., 2015; Goad et al., 2017).
ROOT GROWTH FACTOR (RGF)/GOLVEN (GLV)/CLE-Like (CLEL) is one of the widely conserved PTMP families (Shinohara, 2021). The RGF/GLV/CLEL (hereafter referred to as RGF for clarity) peptides were discovered independently in Arabidopsis thaliana (Matsuzaki et al., 2010; Meng et al., 2012; Whitford et al., 2012). Followed by the identification of their receptors, RGF1-INSENSITIVE (RGI)/RGF1 RECEPTOR (RGFR), the RGF-RGI module has been studied intensively, which in a short period of time has led to the independent elucidation of the downstream events after the receptor activation in A. thaliana (Ou et al., 2016; Shinohara et al., 2016; Song et al., 2016; Fernandez et al., 2020; Lu et al., 2020; Shao et al., 2020; Yamada et al., 2020). Gain-of-function alleles and exogenous application of RGF result in developmental defects in the root, and previous studies primarily focus on the roles of RGFs in root development. It has been noted from early on, however, that the expression of the RGF family genes is not restricted to the root (Ghorbani et al., 2016; Busatto et al., 2017). Likewise, although the RGF-RGI module has been studied so far only in angiosperms, bioinformatics analyses identified RGF homologs also in gymnosperm and lycophyte lineages of vascular plants (Strabala et al., 2014; Ghorbani et al., 2015). These limit our view on how this peptide signaling pathway evolved to play their roles as known today. Meanwhile, increasing availability of complete genome sequences enabled larger-scale bioinformatics analyses and led to the identification of RGF homolog in another vascular plant lineage, ferns (Furumizu et al., 2021). An RGF-like sequence was also found in a non-vascular plant, Marchantia polymorpha (designated as MpRGF for ROOT GROWTH FACTOR/GOLVEN/CLE-Like-FLAVORED) (Furumizu et al., 2021). These discoveries open up opportunities for studying RGF signaling pathways in different developmental or evolutionary settings. Yet, limiting searches to completely sequenced genomes excludes the possibility to fully consider the large diversity of land plants; a better understanding of the evolution of RGFs comes with a more comprehensive investigation among phylogenetically diverse species.
This study, therefore, searched for RGF-like sequences extensively in all major extant land plant lineages using a large-scale transcriptome data collected by the 1000 Plant Genomes Project (1KP1) as well as other valuable genome and transcriptome resources (Rensing et al., 2008; Banks et al., 2011; Goodstein et al., 2012; Amborella Genome Project, 2013; Fernandez-Pozo et al., 2015; Ming et al., 2015; Cheng et al., 2017; Filiault et al., 2018; Dong et al., 2019; One Thousand Plant Transcriptomes Initiative, 2019; Hetherington et al., 2020; Li et al., 2020; Zhang J. et al., 2020; Zhang L. et al., 2020; Geng et al., 2021). As a result, more than four hundreds of RGF-like sequences were retrieved from all major extant land plant lineages excepting hornworts. Mapping these RGF-like sequences on the species phylogeny revealed independent losses in mosses and hornworts and evolution of lineage-specific variations in vascular plants. We further demonstrated that MpRGF possesses known RGF-like activities and can affect plant growth with constitutive expression experiments in A. thaliana and M. polymorpha. This suggests that a role as a signaling molecule is conserved between vascular plant and bryophyte RGF-like sequences. Our findings form a basis for advancing studies into RGF peptides and their receptors and contribute to our understandings of how peptide signaling pathways evolve.
Materials and Methods
Identification of RGF-Like Sequences
Initial BLAST (Basic Local Alignment Search Tool) queries were assembled based on the previously published RGF-like sequences (Furumizu et al., 2021) and used to set up BLASTP or tBLASTn searches. The finalized query file is available as Supplementary Data Sheet 1. Fasta files of the published transcriptome and proteome data were downloaded to set up databases for BLAST searches. BLASTP was used to search against the proteome with the following parameters: -task blastp-short -max_target_seqs 30. tBLASTn was used to search against the transcriptome data with the parameters as follows: -word_size 2 -gapopen 9 -gapextend 1 -matrix PAM30 -threshold 16 -comp_based_stats 0 -window_size 15 -max_target_seqs 30. BLAST results were manually examined, and RGF-like candidates were listed. These hits were assessed by the sequence conservation of the presumptive mature peptides, which we defined to begin with the first aspartic acid and second tyrosine residues. When a potential full-length precursor sequence was available, the location of the mature peptide-encoding region and the presence of signal peptide were examined. Given that the processing and secretion modes of RGF-like peptides have not yet been elucidated outside of angiosperms, predicted presence of signal peptide was not considered as prerequisite. All amino acid sequences were aligned with the online version of MAFFT version 7 (Katoh et al., 2019) and manually edited and visualized with Jalview version 2.11.1.3 (Waterhouse et al., 2009). WebLogo version 2.8.22 was used to generate sequence logos (Crooks et al., 2004) with the same clustalw color scheme used in Jalview. RGF-like sequences presented in this study are available in Supplementary Data Sheet 2.
Plant Materials and Growth Conditions
A. thaliana Col-0 and M. polymorpha Tak-1 (Ishizaki et al., 2008) were used as wild type. Plants were grown at 23°C under continuous light. A. thaliana seeds were surface-sterilized and germinated on 140 mm × 100 mm × 14.5 mm square dishes (sterile No. 2 square Schale, Eiken Chemical) containing half-strength Murashige and Skoog (MS) Basal Medium (M5519, Sigma-Aldrich) solidified with 1% (w/v) agar (01028-85, Nacalai Tesque). The pH of the medium was adjusted to pH 5.7 with potassium hydroxide. Transformants were selected on half-strength MS plates containing 15 μg/mL glufosinate ammonium (079-05371, FUJIFILM Wako Pure Chemical) and 12.5 μg/mL carbenicillin sodium salt (037-23693, FUJIFILM Wako Pure Chemical). Plates were placed vertically during plant cultivation. For root observation, 1-week-old seedlings were transferred to the half-strength MS medium containing 1.0% sucrose (190-00013, FUJIFILM Wako Pure Chemical) and 1.8% agar, with (for transformants) or without (for Col-0) 15 μg/mL glufosinate ammonium and 12.5 μg/mL carbenicillin sodium salt. Plates were inclined at an angle of approximately 45° to the vertical. M. polymorpha plants were grown on 90 mm × 20 mm Petri dishes (BIO-BIK I-90-20, Ina⋅Optica) containing half-strength Gamborg’s B5 Medium (399-00621, FUJIFILM Wako Pure Chemical) solidified with 1.4% (w/v) agar. The pH of the medium was adjusted to pH 5.5 with potassium hydroxide. Transformants were selected on half-strength Gamborg’s B5 plates containing 10 μg/mL hygromycin B (085-06153, FUJIFILM Wako Pure Chemical) and 100 μg/mL cefotaxime sodium salt (030-16113, FUJIFILM Wako Pure Chemical). All nutrient agar plates were double wrapped with surgical tape (Micropore Surgical Tape 1530-0, 3M) during plant cultivation. Plant images were obtained by scanning plates using a flat-bed scanner (GT-X830, Epson).
Construction of Plasmids
For constitutive expression in A. thaliana, the AtRGF1, AtRGF6, and MpRGF coding sequences were PCR amplified from the Col-0 root-derived complementary DNA (cDNA), Col-0 genomic DNA, and Tak-1 gemmae-derived cDNA, respectively, cloned into the pART7 plasmid digested with BamHI and XhoI using the In-Fusion HD cloning kit (639649, Takara Bio), and sequenced. The pART7 vector contains the cauliflower mosaic virus 35S promoter sequence and the terminator sequence from the octopine synthase gene. A 15-bp deletion was introduced into pART7 carrying MpRGF by inverse PCR and confirmed by sequencing. Subsequently, the NotI fragment of the obtained plasmid was cloned into the NotI site of the pMLBART binary vector using the DNA ligation kit, mighty mix (6023, Takara Bio). For constitutive expression in M. polymorpha, the MpRGF coding sequence was PCR amplified from the Tak-1 gemmae-derived cDNA, cloned into the pENTR4 dual selection vector (Invitrogen A10561, Thermo Fisher Scientific) digested with BamHI and XhoI using the In-Fusion HD cloning kit, and sequenced. Subsequently, the MpRGF coding sequence was subcloned into pMpGWB103 (Ishizaki et al., 2015) using the Gateway LR clonase II enzyme mix (Invitrogen 11791-100, Thermo Fisher Scientific). Sequences of the primers used in this study are listed in Supplementary Table 1.
Plant Transformation
Constructs were introduced into the Agrobacterium tumefaciens strain GV3101 (pMP90) by electroporation to be used for plant transformation. A. thaliana plants were transformed by the floral dip method using surfactant Silwet L-77 (BMS-SL7755, Bio Medical Sciences) (Clough and Bent, 1998). M. polymorpha gemmalings were transformed as previously described (Kubota et al., 2013).
Results
RGF-Like Sequences Are Highly Conserved in Liverworts
Mature RGF peptides in A. thaliana (hereafter, AtRGFs) share several characteristic residues: the first aspartic acid (D), the second tyrosine (Y) to be sulfated, the tenth proline (P), and the thirteenth asparagine (N). The fifth and ninth P as well as the eighth histidine (H) residues are also highly conserved (Figure 1A). All these residues are conserved in the RGF-like sequence found in a liverwort, M. polymorpha, MpRGF (Furumizu et al., 2021; Figure 1A). A notable difference is the twelfth residue, which is either H or N in AtRGFs but is serine (S) in MpRGF, pointing to RGF sequence diversification between two evolutionary distant lineages. This prompted us to further explore the presence and sequence of putative RGFs in other liverworts.
Figure 1. Liverwort RGF-like sequences. (A) Alignment of the predicted mature RGF peptides encoded in the A. thaliana and M. polymorpha genomes. Conserved residues are highlighted with graded shadings. Sequence titles are preceded by the abbreviated species name consisting of the first two letters of the generic and specific names. The sequence of A. thaliana RGF4 is based on a previous study (Matsuzaki et al., 2010). (B) Shown are the RGF-like peptide sequences identified in the selected liverworts. Phylogenetic relationships of the analyzed species are depicted as a simplified cladogram based on the published phylogenetic analyses (Forrest et al., 2006; Dong et al., 2019). A heavy asterisk indicates that the proteome predicted from the sequenced genome was analyzed for M. polymorpha (Bowman et al., 2017). Daggers indicate that the published transcriptome data was analyzed for these species (Dong et al., 2019). The 1KP dataset was analyzed for other species. See also Supplementary Figure 1.
The liverwort transcriptomes in the 1KP dataset was searched for RGF-like sequences. The 16 amino-acid sequences, encompassing the predicted mature peptide and three preceding residues, of AtRGFs and previously identified RGF-like sequences were used as query in the BLAST (Basic Local Alignment Search Tool) searches against the transcriptome data, translated in all six frames. This identified RGF-like sequences in 18 transcriptomes (Figure 1B). These findings corroborate the utility of the 1KP dataset in searching signaling peptide-encoding transcripts, which are often expressed at low levels. In the alignment of these sequences, the conservation stretches beyond the presumptive peptide-encoding region (Supplementary Figure 1), validating that several partial sequences lacking the N-terminus are indeed homologous to RGFs. In addition, the alignment identified two potential proteolytic cleavage sites in the variable, middle region of nascent polypeptides (Supplementary Figure 1). Equivalent residues are targeted by subtilisin-like serine proteases, and these processing events are crucial for the AtRGF biogenesis (Ghorbani et al., 2016; Stuhrwohldt et al., 2020). This suggests that liverwort RGF-like sequences can be processed to PTMPs by molecular machineries shared between vascular plants and bryophytes.
The identified RGF-like sequences are nearly identical. No RGF-like sequences were found in Radula lindenbergiana, Monoclea gottschei, and Blasia species listed in the 1KP dataset. In order to examine the presence of liverwort RGFs more widely, additional BLAST searches were performed using the transcriptome data in an independent study (Dong et al., 2019). This led to the finding of RGF-like sequences in the following five species: Frullania orientalis, Radula japonica, Blasia pusilla, Haplomitrium mnioides, and Treubia lacunosa (Figure 1B). Given that highly similar RGF-like sequences were uniformly conserved across liverworts, the absence of significant hits in the BLAST searches of several 1KP datasets is likely explained by the lack or the loss of low-abundance transcripts in the transcriptome data.
RGF-Like Sequences Were Lost Independently During Moss and Hornwort Evolution
The conservation of RGF between angiosperms and liverworts indicates that the ancestral RGF gene(s) should have been present in the common ancestor of vascular plants and bryophytes (liverworts, mosses, and hornworts) while RGF-like sequences were not identified in the sequenced genomes of a moss, Physcomitrium patens, and Anthoceros hornworts (Rensing et al., 2008; Li et al., 2020; Zhang J. et al., 2020; Furumizu et al., 2021). Therefore, we investigated the possibility of finding moss or hornwort RGFs in the 1KP dataset.
RGF-like sequences were found in several but not all moss transcriptomes used in our searches (Figure 2 and Supplementary Figure 2). In order to examine these findings in the evolutionary context, the presence or absence as well as the identified sequences were mapped on the moss species phylogeny. This revealed that the RGF distribution is limited to the species that diverged early during the moss evolution (Figure 2). RGF-like sequences were not found in the species that belong to evolutionary younger lineages, collectively known as Bryophytina, suggesting that the RGF-like sequences were lost before the divergence of Bryophytina from Andreaeophytina. This is consistent with the previous report that RGF-like sequences are absent in the sequenced genome of P. patens, which belongs to Bryophytina. Alternatively, evolutionary changes in the expression pattern could also account for the lack of RGF-like sequences in Bryophytina in our results, through affecting the amount of RGF transcripts in a given transcriptome.
Figure 2. Moss RGF-like sequences. The presence (+) or absence (n.d., not detected) of RGF-like sequences was examined in mosses, and the identified, presumptive RGF-like peptide sequences are listed. Phylogenetic relationships of the analyzed species are depicted as a simplified cladogram based on the published phylogenetic analyses (Liu et al., 2019). A heavy asterisk indicates that the proteome predicted from the sequenced genome was analyzed for P. patens (Rensing et al., 2008). The 1KP dataset was analyzed for other species. See also Supplementary Figure 2.
Searches in the hornwort transcriptomes did not find any RGF-like sequences. Taken together with the previous reports, in which RGF was not identified in the sequenced four genomes of Anthoceros species (Li et al., 2020; Zhang J. et al., 2020; Furumizu et al., 2021), these data suggest that during the hornwort evolution, RGF was possibly lost before extant lineages diversified (Figure 3). It is still yet possible that the low to zero abundance in the transcriptome data did not allow us to identify hornwort RGFs in this study.
Figure 3. RGF-like sequences were not found in hornworts. Phylogenetic relationships of the analyzed species are depicted as a simplified cladogram based on the published phylogenetic analyses (Frangedakis et al., 2020). Heavy asterisks indicate that the proteome data predicted from the completely sequenced genomes were analyzed for the Anthoceros species (Li et al., 2020; Zhang J. et al., 2020). The 1KP dataset was examined for other species.
RGF-Like Sequences Diversified in Lycophytes
The C terminus of the liverwort and moss RGF-like sequences contains SSN and R/K-G/S-N, respectively (Figures 1, 2). These sequence features were not previously recognized for seed plant RGFs and can be explained by lineage-specific gene family evolution (Strabala et al., 2014; Ghorbani et al., 2015). Thus, we examined non-seed vascular plant lineages for hitherto unidentified sequence diversity. Lycophyte is a sister group to the ferns and seed plants and represents the earliest-diverging lineage of extant vascular plants (Spencer et al., 2020). Selaginella moellendorffii is the only lycophyte species of which genome was sequenced, and previous studies reported that RGF-like sequences are encoded in its genome (Banks et al., 2011; Ghorbani et al., 2015). To test other lycophyte species, lycophyte transcriptomes were selected from the 1KP dataset so that the large phylogenetic diversity of this lineage was captured and searched for RGF-like sequences. This led to the identification of RGF-like sequences from 17 out of 21 transcriptome data. Additional RGF-like sequences were retrieved by searching against the predicted Selaginella moellendorffii proteome and the Isoetes echinospora transcriptome data (Banks et al., 2011; Hetherington et al., 2020; Figure 4 and Supplementary Figure 3). No RGF-like sequences were found in four species of Lycopodiaceae (Figure 4). In the case of Huperzia selago (designated as A in Figure 4), a single RGF-like sequence was found in another transcriptome (B). This example suggests that the low transcript level can possibly hinder RGF identification in the 1KP dataset.
Figure 4. Lycophyte RGF-like sequences. The presence (+) or absence (n.d., not detected) of RGF-like sequences was examined in lycophytes. The identified, presumptive RGF-like peptide sequences are classified into two groups and listed. Phylogenetic relationships of the analyzed species are depicted as a simplified cladogram based on the published phylogenetic analyses (Wikstrom and Kenrick, 2000; Field et al., 2016; Weststrand and Korall, 2016; Schafran et al., 2018). A heavy asterisk indicates that the proteome predicted from the sequenced genome was analyzed for S. moellendorffii (Banks et al., 2011). A dagger indicates that the published transcriptome data was examined for Isoetes echinospora (Hetherington et al., 2020). The 1KP dataset was analyzed for other species. See also Supplementary Figure 3.
The lycophyte RGF-like sequences are classified into two major groups. Group H is characterized by the highly conserved eighth H residue in the predicted mature peptide-encoding region. The last H residue is also conserved with exceptions being five sequences found in the three Isoetes transcriptomes (marked with short black vertical line in Supplementary Figure 3). Unique residues at the C terminus differentiate these from more canonical Group H sequences. Besides, several Group H sequences have three P residues instead of the otherwise highly conserved two consecutive P residues at the ninth and tenth positions and could generate atypical 14 amino acid-long RGF peptides. The other, Group R, is characterized by the central R/K residues and the invariable twelfth and thirteenth N residues.
Differences between Group H and R are also found in the sequences adjacent to the predicted signaling peptide-encoding region. Most Group H sequences end with the predicted peptide-encoding region whereas additional 9-to-18 residues follow the peptide-encoding region of Group R, suggesting different modes of peptide maturation processes. These distinctive features indicate that two groups underwent significant sequence diversification if they share the same origin and thus are homologous. Pertinent to this is the presence of Group R sequences being limited to Selaginellaceae and Isoetaceae. This observation raises two possibilities: gain of Group R in the lineage leading to Selaginellaceae and Isoetaceae after its split from Lycopodiaceae (Huperzioideae and Lycopodioideae); and loss of Group R in the common ancestor of Lycopodiaceae. It is of note that A. thaliana RGFs can also be classified into Group H and Group R (Figure 1A). These will be discussed later in conjunction with our findings in other vascular plant lineages.
Fern RGF-Like Sequences All Belong to Group H
The living sister group to lycophytes, euphyllophytes, consists of two major groups: seed plants and ferns. Ferns diverged from the seed plants 400 million years ago, and extant ferns are highly diverse (Plackett et al., 2015). Their early emergence and present-day diversity, however, has not always been sufficiently considered in previous phylogenetic studies, let alone in the studies of plant peptide signaling. It was recently reported that RGF-like sequences are encoded in the sequenced genomes of Azolla filiculoides and Salvinia cucullata (Furumizu et al., 2021). These are all similar to the Group H sequences in lycophytes, and it was not clear whether ferns have Group R-type sequences. We addressed this question by searching for RGF-like sequences in the selected fern transcriptomes of the 1 KP project as well as in the Ceratopteris richardii transcriptome reported independently (Geng et al., 2021).
As shown in Figure 5 and Supplementary Figure 4, nearly all fern RGF-like sequences are characterized by the eighth H residue in the predicted mature form. No sequence was found to carry consecutive R/K residues at the seventh and eighth positions as seen in the lycophyte Group R sequences. The fern RGF-like sequences are classified into two groups according to the last residue of the predicted mature form: N in Group H-N, and P in the other, Group H-P (Supplementary Figure 4). These findings were mapped onto the fern species phylogeny (Figure 5). Either Group H-N or H-P sequences are absent in several fern species. In particular, Group H-N sequences were not found in Angiopteris evecta and Marattia sp., both of which belong to Marattiales. Group H-N may have been lost in this lineage, and validation of this possibility requires more species sampling.
Figure 5. Fern RGF-like sequences. The presence (+) or absence (n.d., not detected) of RGF-like sequences was examined in ferns. The identified, presumptive RGF-like peptide sequences are classified into two groups and listed. Phylogenetic relationships of the analyzed species are depicted as a simplified cladogram based on the published phylogenetic analyses (Pryer et al., 2004; Knie et al., 2015; Schuettpelz et al., 2016). Heavy asterisks indicate that the proteome data predicted from the completely sequenced genomes were analyzed (Li et al., 2018). A dagger indicates that the published transcriptome data was examined for Ceratopteris richardii (Geng et al., 2021). The 1KP dataset was analyzed for other species. See also Supplementary Figure 4.
Gymnosperms Have Both Group H and Group R RGF-Like Sequences
The lack of Group R sequences in ferns and its presence in lycophytes and angiosperms obscures the evolutionary history of Group R RGF-like sequences in vascular plants. To gain an insight into how Group R sequences emerged and evolved, RGF-like sequences were analyzed in gymnosperms that represent an informative node, bridging seed-free vascular plants and angiosperms. Gymnosperms comprise only 1,000 species but consist of a wide variety of groups. Species were therefore sampled broadly from the available 1 KP dataset.
Collected gymnosperm RGF-like sequences include both Group H and Group R sequences (Figure 6 and Supplementary Figure S5). This is consistent with a previous study, which also identified RGF-like genes of both types in coniferous species (Strabala et al., 2014). Several transcriptomes lack Group R sequences. The corresponding species do not belong to specific taxonomic groups, and Group R includes sequences from all major taxonomic lineages. This suggests that lower expression levels but not their absence in the genome accounts for the lack of Group R sequences in these transcriptomes. In fact, the sequenced Picea abies genome encodes seven RGF-like sequences in total, four of which belong to Group R. A mere number of zero-to-two in other species appears consistent with the hypothesis of lower transcript accumulation levels of Group R sequences in general.
Figure 6. Gymnosperm RGF-like sequences. The presumptive RGF-like peptide sequences identified in gymnosperms are classified into two groups and listed. Sequences in magenta in a given species share similar N-terminal sequences, suggesting the common origin. R-type sequences were not found in several species (n.d., not detected). Phylogenetic relationships of the analyzed species are depicted as a simplified cladogram based on the published phylogenetic analyses (Lu et al., 2014; Ran et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019). Names of taxonomic groups are color coded consistently with Supplementary Figure 5. A heavy asterisk indicates that the proteome predicted from the genome sequence was analyzed (Nystedt et al., 2013). The 1KP dataset was analyzed for other species. See also Supplementary Figures 5, 6.
Gymnosperm Group H sequences were further classified into five subgroups (Supplementary Figure 5). Subgroup H1 sequences were found only in Cupressaceae and Taxaceae, possibly having originated recently in the common ancestor of these evolutionary young lineages. Similar to Group R, other subgroups are more uniformly distributed in gymnosperms. Exceptions to this are subgroup H3 lacking cycad and Ginkgo sequences and subgroup H5 without Araucariaceae sequences. Given the lack of completely sequenced genomes in these lineages and the large size of gymnosperm genomes, it requires additional investigation to confirm lineage-specific sequence variations. Group H as a whole comprises sequences from all major gymnosperm lineages. Therefore, both Group H and R sequences must have been present in the common ancestor of extant gymnosperm species.
Manual curation of gymnosperm RGF-like precursor sequences revealed that several group H and R sequences show striking similarities (Supplementary Figures 5 and 6). These paired sequences were recognized so far only in several gymnosperm lineages in our searches and could possibly result from alternative splicing as reported by Strabala et al. (2014). Cross comparison between paired examples of different taxonomic groups did not show sequence conservation beyond mature-peptide encoding regions. This suggests their possible independent origins and raises a question as to what drove their evolution. Further investigations are needed to elucidate how commonly these pairs exist both within and outside gymnosperms.
Cryptic Sequence Diversity in Angiosperm RGFs
RGF-like sequences were originally identified in A. thaliana and has since been reported in other angiosperm species (Matsuzaki et al., 2010; Meng et al., 2012; Whitford et al., 2012; Ghorbani et al., 2015; Tadiello et al., 2016). These angiosperm sequences, however, have not been characterized from an evolutionary perspective by comparing their profiles with those of RGF-like sequences from non-flowering plants. In view of the rapidly growing availability of plant genome data, RGF-like sequences were searched in diverse angiosperm species (Goodstein et al., 2012; Amborella Genome Project, 2013; Fernandez-Pozo et al., 2015; Ming et al., 2015; Cheng et al., 2017; Filiault et al., 2018; Zhang L. et al., 2020; Figure 7 and Supplementary Figure 7). This resulted in identifying both Group H and R sequences in all species examined. In addition, similar but distinct sequences were found to constitute a new subclass of Group H. Their presumptive mature forms in 13 amino acids end with PP and are hereby named Group H-PP. Some Group H-PP sequences have a C-terminal extension longer than 20 amino acids. It remains to be tested whether they encode secreted short peptides.
Figure 7. Angiosperm RGF-like sequences. The presumptive RGF-like peptide sequences identified in the sequenced genomes of angiosperms are classified into two groups and listed. Sequences in dull yellow belong to the group H-PP. Phylogenetic relationships of the analyzed species are depicted as a simplified cladogram based on the published phylogenetic analyses (One Thousand Plant Transcriptomes Initiative, 2019). See also Supplementary Figure 7.
Conserved and Lineage-Specific Features of RGF-Like Sequences
As summarized in Figure 8, RGF-like sequences show both conserved and lineage-specific characteristics. Group H sequences are found in all major lineages of extant land plants, suggesting an early origin of Group H in the common ancestor of land plants. Group R sequences, on the other hand, are present only in vascular plants. It is possible Group R evolved early in vascular plant lineage and was subsequently lost in ferns. Alternatively, Group R sequences may have evolved independently at least twice in ancestral lycophytes and seed plants. The latter could explain the difference in the twelfth position of predicted, mature RGF-like peptides, which is dominated by N in lycophytes and H in seed plants. Other sequence characteristics are highly similar between seed plant and lycophyte Group R sequences, in favor of their common origin.
Figure 8. Lineage-specific features of RGF-like sequences. Sequence logos of the identified RGF-like sequences are shown along with a plausible phylogenetic tree of the extant, major land plant groups. The branch lengths are to the scale of the estimated divergence dates (Morris et al., 2018).
Compared with Group R, Group H sequences show more diversity. This is of particularly the case in vascular plants. It has to be emphasized that except for A. thaliana RGFs, secreted, bioactive forms have not been determined (Matsuzaki et al., 2010; Whitford et al., 2012). Such structural information will be critical to delve into the sequence diversity of RGF-like sequences.
M. polymorpha RGF-Like Sequence Elicits RGF-Like Activities in A. thaliana
As a first step toward understanding biological functions of RGF-like sequences in non-flowering plants, the M. polymorpha RGF-like sequence was chosen for molecular experiments for the following reasons. First, liverworts including M. polymorpha and other bryophyte groups form a presumably monophyletic lineage sister to vascular plants and serve as an informative reference to complement our knowledge on angiosperm RGFs (Figure 8). Second, a single, highly similar RGF-like sequence is present in the liverwort species analyzed in this study (Figure 1). This observation points to a conserved role for liverwort RGF-like sequences. Third, canonical proteolytic cleavage sites are identified at similar positions in the liverwort and A. thaliana RGF precursors (Ghorbani et al., 2016; Stuhrwohldt et al., 2020; Supplementary Figure 1). This led us to reason that conserved molecular players or analogous mechanisms for the biogenesis of RGF-like peptides could exist in M. polymorpha and A. thaliana. Fourth, the predicted M. polymorpha RGF-like sequence has several residues not found in angiosperm RGFs (Figures 1, 7, 8). These variations could possibly affect ligand-receptor interactions, which can be tested by heterologous expression.
In A. thaliana, it has been reported that constitutive expression and exogenous application of RGFs affect root development (Matsuzaki et al., 2010; Meng et al., 2012; Whitford et al., 2012; Fernandez et al., 2013, 2015). We generated transgenic A. thaliana plants constitutively expressing either A. thaliana RGF (AtRGF1/GLV11/CLEL8 of Group H or AtRGF6/GLV1/CLEL6 of Group R) or M. polymorpha RGF (MpRGF, Group H) and compared their root morphologies (Figure 9). In order to circumvent differences in the number and expression level of the transgene among independent lines, plants in the T1 generation were used. When grown on inclined, hard-agar plates, MpRGF-expressing plants showed enhanced wavy growth. The overall direction of their root growth was bended. These phenotypes are similar to those caused by AtRGF1/GLV11/CLEL8 over-expression (Meng et al., 2012). AtRGF1/GLV11/CLEL8-expressing plants formed irregular waves including loops as previously reported (Whitford et al., 2012). These morphologies are hardly observed in MpRGF-expressing plants. Common to all three transgenic lines are the phenotypes such as shorter roots and lower density of lateral roots (Fernandez et al., 2013; Figure 9B). These phenotypes were not observed in A. thaliana plants constitutively expressing a mutated form of MpRGF, of which predicted mature form lacks the first five amino acids (Mprgf, Figure 9A). Therefore, the observed biological activities of MpRGF in A. thaliana are attributable to the putative peptide-encoding region. Despite notable sequence differences from angiosperm RGFs, our finding suggests that MpRGF can signal through the A. thaliana RGF receptors.
Figure 9. Overexpression of MpRGF in A. thaliana. (A) A mutated form of MpRGF, designated as Mprgf, used in this experiment. Shown are the nucleotide and translated amino acid sequences encompassing the mature peptide encoding region. The deduced mature peptide sequences are highlighted in bold magenta letters. (B) Constitutive expression of AtRGF1/GLV11/CLEL8, AtRGF6/GLV1/CLEL6, MpRGF, or its mutated form, Mprgf in A. thaliana seedlings. A Col-0 plant is shown for comparison. Note the difference in plant vigor due to the chemicals added in the medium for selecting transgenic plants. White arrow heads indicate the position of the root tip at the time of seedling transfer to the hard agar medium plate. Emerged lateral roots are marked with black circles. Scale bars: 1 cm.
Over-Expression of MpRGF Results in Dwarfing in M. polymorpha
Our current understanding of the roles of RGFs is based on the previous studies focusing on the sporophytic organs of angiosperms, and it has not been reported to date that RGFs have any roles during gametophyte development. In contrast to angiosperms, in which the sporophyte generation is dominated, liverworts have a life cycle with the dominant gametophytic phase. It is thus more feasible to study RGF functions in the gametophyte using liverworts. Detection of RGF-like sequences in the 1KP dataset (Figure 1) corroborates that liverwort RGF-like sequences are expressed in their gametophytic bodies, from which samples are typically collected for transcriptome analyses (One Thousand Plant Transcriptomes Initiative, 2019). In M. polymorpha, a genetically tractable liverwort model (Bowman et al., 2017; Kohchi et al., 2021), publicly available RNA-Seq data shows that the MpRGF transcript (Mp6g09180.1) is detected both in vegetative and reproductive organs with higher mRNA abundance in gametophytic tissues than in sporophytic tissues (MarpolBase3).
To explore gametophytic RGF roles, the impact of activated RGF signaling was evaluated. For this purpose, transgenic M. polymorpha plants expressing MpRGF under the constitutive MpEF1 promoter (Althoff et al., 2014) were generated by transforming regenerating thalli. Independent twelve T1 lines were distinguishable from wild-type plants by their small size. Gemma cup and gemmae formation was delayed but eventually observed in the MpRGF overexpressors. This allowed us to grow wild-type and transgenic G1 gemmae simultaneously, which confirmed the observations in the T1 generation (Figure 10). That is, excessive MpRGF interfers with normal growth and development. We infer that MpRGF is under strict transcriptional regulation so that its potent activities are controlled tightly.
Figure 10. Overexpression of MpRGF in M. polymorpha. M. polymorpha plants ectopically expressing the MpRGF cDNA are shown with wild-type Tak-1 plants. G1 gemmalings plated simultaneously were observed. Scale bar: 1 cm.
Discussion
Framing Sequence Diversity of Signaling Peptides in the Context of Land Plant Evolution
Despite being as short as a chain of 10-20 amino acids, plant signaling peptides carry a large amount of information for binding with interacting partners to induce downstream signaling events. Much attention has been paid to understand the effect of peptide sequence variation on ligand-receptor interactions and resultant signaling responses within a few model species. These efforts have been contributing to an understanding of signaling peptide language, so to say, and even to the creation of a new one or a new word (peptide). The latter, for instance, was realized through a serendipitous discovery using a chemical synthetic approach (Hirakawa et al., 2017). These previous studies were driven by collecting, comparing, and clustering peptide sequences based on similarities. There exist, however, potential pitfalls associated with the common features found in signaling peptide encoding genes; namely, except for the short mature peptide encoding region, a large part of precursor proteins lack sequence conservation. This poses a challenge when identifying signaling peptides and analyzing their sequence variation with conventional homology-based and molecular phylogenetic approaches. As an attempt to overcome these shortcomings, we placed RGF-like peptides in a species phylogeny as summarized in Figure 8. This offers an opportunity to decipher evolutionary trajectories of RGF-like peptides and lay a foundation for further exploration of driving forces that underlie changes in peptide sequences.
Challenges in Translating Bioinformatic Knowledge Into an Understanding of Living Systems
The unparalleled 1KP project as well as other valuable resources allowed us to identify more than four hundreds of RGF-like sequences in all major extant land plant lineages excepting hornworts. The absence of RGF-like sequences in our searches can result from the nature of the original transcriptome data, such as the lack or the loss of low-abundance transcripts in the tissue used for RNA extraction. Therefore, additional investigation by PCR-based homolog detection and ultimately by genome sequencing is necessary to confirm the lack of RGF-like sequences in a given species. This poses an intrinsic limitation to the transcriptome data. Moreover, a question remains unsolved as to whether these genes are all homologous due to the overall low sequence similarity. Homology itself is not a clear-cut concept (Inkpen and Doolittle, 2016), which one confronts when assessing similarities of sequences found in distantly related species. Pertinent to this is likely independent evolution of sequences similar to plant signaling peptides in animals and microbes (Mitchum et al., 2012; Ronald and Joe, 2018). To our knowledge, horizontal gene transfer has not been demonstrated in any of these cases of so-called peptide mimics, which can hijack the host peptide-receptor signaling pathways. A problem will thus linger of how to define a signaling peptide family, for instance, by homology or by interacting receptor.
Experimental validations are fundamental to confirm that the newly identified genes encode secreted short peptides and to determine amino acid sequences of their bioactive forms before postulating mechanisms that drive and generate a peptide sequence diversity. These biochemical analyses, however, are not always feasible in every species. We took an alternative and complementary approach, heterologous expression in A. thaliana, and demonstrated that the M. polymorpha RGF sequence shows RGF-like activities in plants (Figure 9). The result also showed that MpRGF-induced responses are more similar to the phenotypes caused by constitutively expressing AtRGF6/GLV1/CLEL6 than AtRGF1/GLV11/CLEL8, while MpRGF resembles the latter A. thaliana RGF in sequence (Figure 1). Such discrepancies at first glance can result from differences in expression levels or heterologous nature of the experiment; they could be reconciled in future studies if we gain a better understanding of ligand-receptor interactions both in A. thaliana and in M. polymorpha.
Evolutionary Understanding Facilitates Mechanistic Studies in Model Species
In A. thaliana, multiple receptors have been reported for RGF peptides (Ou et al., 2016; Shinohara et al., 2016; Song et al., 2016). As pointed out previously, it remains unknown whether the RGF receptors have different affinities for specific RGF peptides in vivo (Pruitt et al., 2017). It was reported, however, that one of the A. thaliana RGF receptors (At4g26540) interacts with RGFs in vitro with different binding affinities (Song et al., 2016). Overall, H-type RGFs have higher affinities than R-type RGFs, suggestive of inherent differences among RGF ligand-receptor pairs in signaling outputs. Functional differences among RGFs have been noted since their discovery. For example, the H-type, and not R-type, acts as primary RGF in maintaining the meristematic activities in the A. thaliana root (Matsuzaki et al., 2010). Given that Group H RGF-like sequences are more widely conserved across land plants, with its emergence predating the split between vascular plants and bryophytes (Figure 8), when studying roles of RGFs in biological processes of interest, it can be helpful to keep in mind evolutionary differences between two major RGF subgroups; they show structural differences as well as changes in expression patterns. A recent study reported a novel role for R-type RGF, A. thaliana RGF7, in triggering innate immunity through interactions with a subset of RGF receptors (Wang et al., 2021). This opens up a new avenue to dissect how RGF signaling diversified.
Mechanisms Underlying Functional Evolution of Peptide Ligand-Receptor Signaling
In the two examples of known RGF functions described above, the angiosperm root possibly evolved in the common ancestor of ferns and seed plants. Innate immunity in non-flowering plants is just beginning to be addressed (Ponce de León and Montesano, 2017; Carella et al., 2019; Matsui et al., 2020; Poveda, 2020). Therefore, it is not yet straightforward to transfer these knowledges in A. thaliana into studies on bryophyte and non-flowering vascular plant RGFs. In order to understand the evolution of RGF signaling, it is crucial to reveal biological roles of RGFs especially in the non-vascular plant lineage, bryophytes. Losses of RGF in hornworts and the moss lineage including P. patens directed our attention to the liverwort, M. polymorpha. Our results indicate that RGF-like sequences exist possibly as a single gene in liverworts. The high sequence similarities suggest their conserved roles in this lineage. The altered growth and development of MpRGF-overexpressing plants indeed verifies its potency as a signaling molecule (Figure 10).
In future studies, it is of critical importance to identify MpRGF receptor(s). Homologs of A. thaliana RGF receptors are strong candidates as demonstrated for another signaling peptide family (Whitewoods et al., 2018; Hirakawa et al., 2019, 2020). In accordance, when expressed in A. thaliana, MpRGF appears to activate RGF receptor signaling (Figure 9). Assigning receptors to conserved signaling peptides in diverse lineages helps us dissect pairing rules of peptide ligands and receptors (Song et al., 2016; Zhang et al., 2016). The understanding of ligand-receptor interactions, in turn, provides insight into how they co-evolve.
Contrasting phylogenetically widespread RGFs, several signaling peptide families, such as LURE, PEP, and SCR/SP11, appear less conserved (Bartels and Boller, 2015; Nasrallah, 2017; Takeuchi, 2021). These presumably evolutionary-young families share intertwined features in common: rapid sequence diversification and participation in inter- or intra-species interactions with each influenced by the other (Takeuchi and Higashiyama, 2012; Lori et al., 2015; Ma et al., 2016; Murase et al., 2020). Understanding structural details of the evolving interface of homologous ligand-receptor pairs as well as underlying genetic changes has been advanced (Ma et al., 2016; Durand et al., 2020; Murase et al., 2020). Studies into RGF-like molecules and their receptors could offer opportunities to study ligands and receptors co-evolving over a longer period of time, from which related yet functionally distinct ligand-receptor pairs may emerge, for instance (van Kesteren et al., 1996).
In addition to ligand-receptor binding, other protein-protein interactions are also indispensable for functional peptide signaling and fine-tuning its activities; processing and modification enzymes involved in RGF peptide maturation have been studied in A. thaliana (Matsuzaki et al., 2010; Ghorbani et al., 2016; Stuhrwohldt et al., 2020). Assuming that the nascent MpRGF protein produces a short secreted peptide similar to mature AtRGFs, our finding supports that similar processing machineries are responsible for the MpRGF biogenesis (Supplementary Figure 1). Consequently, residues needed to interact with processing or modification enzymes should be conserved in the variable region of peptide precursors, where sequence similarities are seldom recognized, thereby constraining the primary and possibly higher structures of peptide precursors. Alternatively, changes in interactions can add new moieties to the signaling peptidome. These point to potential values of comparative evolutionary approaches in studying signaling peptide biogenesis.
Another key issue is to determine whether angiosperm and liverwort RGF signaling pathways involve any conserved cellular processes both in the sporophyte (dominant in angiosperms) and gametophyte (dominant in liverworts) generations (Fernandez et al., 2020; Lu et al., 2020; Shao et al., 2020; Yamada et al., 2020). Answers to this question will shed light on molecular mechanisms underlying diversification of land plants and, along with elucidation of RGF signaling components, will contribute to our understandings of how peptide signaling pathways evolve.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Author Contributions
CF conceived and designed the study and collected and analyzed the data. SS contributed to the materials. CF wrote the article with contributions from SS. Both authors contributed to the article and approved the submitted version.
Funding
This work was supported by JSPS KAKENHI (17H03967, 18H04841, 18H04625, 18H05487, and 20H00422 to SS and 20K06770 to CF) and Grant for Basic Science Research Projects from the Sumitomo Foundation to CF (200078).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors would like to thank John L. Bowman for his help in identifying RGF homologs in liverworts, Eduardo Flores-Sandoval and Facundo Romani for stimulating discussions, and Reira Suzuki for providing the Arabidopsis thaliana cDNA.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2021.703012/full#supplementary-material
Footnotes
References
Althoff, F., Kopischke, S., Zobell, O., Ide, K., Ishizaki, K., Kohchi, T., et al. (2014). Comparison of the MpEF1alpha and CaMV35 promoters for application in Marchantia polymorpha overexpression studies. Transgenic Res. 23, 235–244. doi: 10.1007/s11248-013-9746-z
Amborella Genome Project (2013). The Amborella genome and the evolution of flowering plants. Science 342:1241089. doi: 10.1126/science.1241089
Banks, J. A., Nishiyama, T., Hasebe, M., Bowman, J. L., Gribskov, M., dePamphilis, C., et al. (2011). The Selaginella genome identifies genetic changes associated with the evolution of vascular plants. Science 332, 960–963. doi: 10.1126/science.1203810
Bartels, S., and Boller, T. (2015). Quo vadis, Pep? Plant elicitor peptides at the crossroads of immunity, stress, and development. J. Exp. Bot. 66, 5183–5193. doi: 10.1093/jxb/erv180
Bowman, J. L., Kohchi, T., Yamato, K. T., Jenkins, J., Shu, S., Ishizaki, K., et al. (2017). Insights into land plant evolution garnered from the Marchantia polymorpha genome. Cell 171, 287–304.e15. doi: 10.1016/j.cell.2017.09.030
Busatto, N., Salvagnin, U., Resentini, F., Quaresimin, S., Navazio, L., Marin, O., et al. (2017). The peach RGF/GLV signaling peptide pCTG134 is involved in a regulatory circuit that sustains auxin and ethylene actions. Front. Plant Sci. 8:1711. doi: 10.3389/fpls.2017.01711
Carella, P., Gogleva, A., Hoey, D. J., Bridgen, A. J., Stolze, S. C., Nakagami, H., et al. (2019). Conserved biochemical defenses underpin host responses to oomycete infection in an early-divergent land plant lineage. Curr. Biol. 29, 2282–2294.e5. doi: 10.1016/j.cub.2019.05.078
Cheng, C. Y., Krishnakumar, V., Chan, A. P., Thibaud-Nissen, F., Schobel, S., and Town, C. D. (2017). Araport11: a complete reannotation of the Arabidopsis thaliana reference genome. Plant J. 89, 789–804. doi: 10.1111/tpj.13415
Clough, S. J., and Bent, A. F. (1998). Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 16, 735–743. doi: 10.1046/j.1365-313x.1998.00343.x
Crooks, G. E., Hon, G., Chandonia, J. M., and Brenner, S. E. (2004). WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190. doi: 10.1101/gr.849004
Dong, S., Zhao, C., Zhang, S., Wu, H., Mu, W., Wei, T., et al. (2019). The amount of RNA editing sites in liverwort organellar genes is correlated with GC content and nuclear PPR protein diversity. Genome Biol. Evol. 11, 3233–3239. doi: 10.1093/gbe/evz232
Durand, E., Chantreau, M., Le Veve, A., Stetsenko, R., Dubin, M., Genete, M., et al. (2020). Evolution of self-incompatibility in the Brassicaceae: lessons from a textbook example of natural selection. Evol. Appl. 13, 1279–1297. doi: 10.1111/eva.12933
Fernandez, A. I., Vangheluwe, N., Xu, K., Jourquin, J., Claus, L. A. N., Morales-Herrera, S., et al. (2020). GOLVEN peptide signalling through RGI receptors and MPK6 restricts asymmetric cell division during lateral root initiation. Nat. Plants 6, 533–543. doi: 10.1038/s41477-020-0645-z
Fernandez, A., Drozdzecki, A., Hoogewijs, K., Nguyen, A., Beeckman, T., Madder, A., et al. (2013). Transcriptional and functional classification of the GOLVEN/ROOT GROWTH FACTOR/CLE-like signaling peptides reveals their role in lateral root and hair formation. Plant Physiol, 161, 954–970. doi: 10.1104/pp.112.206029
Fernandez, A., Drozdzecki, A., Hoogewijs, K., Vassileva, V., Madder, A., Beeckman, T., et al. (2015). The GLV6/RGF8/CLEL2 peptide regulates early pericycle divisions during lateral root initiation. J. Exp. Bot. 66, 5245–5256. doi: 10.1093/jxb/erv329
Fernandez-Pozo, N., Menda, N., Edwards, J. D., Saha, S., Tecle, I. Y., Strickler, S. R., et al. (2015). The sol genomics network (SGN)–from genotype to phenotype to breeding. Nucleic Acids Res. 43, D1036–D1041. doi: 10.1093/nar/gku1195
Field, A. R., Testo, W., Bostock, P. D., Holtum, J. A. M., and Waycott, M. (2016). Molecular phylogenetics and the morphology of the Lycopodiaceae subfamily Huperzioideae supports three genera: Huperzia, Phlegmariurus and Phylloglossum. Mol. Phylogenet. Evol. 94(Pt B), 635–657. doi: 10.1016/j.ympev.2015.09.024
Filiault, D. L., Ballerini, E. S., Mandakova, T., Akoz, G., Derieg, N. J., Schmutz, J., et al. (2018). The Aquilegia genome provides insight into adaptive radiation and reveals an extraordinarily polymorphic chromosome with a unique history. eLife 7:e36426. doi: 10.7554/eLife.36426
Forrest, L. L., Davis, E. C., Long, D. G., Crandall-Stotler, B. J., Clark, A., and Hollingsworth, M. L. (2006). Unraveling the evolutionary history of the liverworts (Marchantiophyta): multiple taxa, genomes and analyses. Bryologist 109, 303–334.
Frangedakis, E., Shimamura, M., Villarreal, J. C., Li, F. W., Tomaselli, M., Waller, M., et al. (2020). The hornworts: morphology, evolution and development. New Phytol. 229, 735–754. doi: 10.1111/nph.16874
Furumizu, C., Krabberød, A. K., Hammerstad, M., Alling, R. M., Wildhagen, M., Sawa, S., et al. (2021). The sequenced genomes of non-flowering land plants reveal the innovative evolutionary history of peptide signaling. Plant Cell 173. doi: 10.1093/plcell/koab173
Geng, Y., Cai, C., McAdam, S. A. M., Banks, J. A., Wisecaver, J. H., and Zhou, Y. (2021). A De Novo transcriptome assembly of Ceratopteris richardii provides insights into the evolutionary dynamics of complex gene families in land plants. Genome Biol. Evol. 13:evab042. doi: 10.1093/gbe/evab042
Ghorbani, S., Hoogewijs, K., Pecenkova, T., Fernandez, A., Inze, A., Eeckhout, D., et al. (2016). The SBT6.1 subtilase processes the GOLVEN1 peptide controlling cell elongation. J. Exp. Bot. 67, 4877–4887. doi: 10.1093/jxb/erw241
Ghorbani, S., Lin, Y. C., Parizot, B., Fernandez, A., Njo, M. F., Van de Peer, Y., et al. (2015). Expanding the repertoire of secretory peptides controlling root development with comparative genome analysis and functional assays. J. Exp. Bot. 66, 5257–5269.
Goad, D. M., Zhu, C., and Kellogg, E. A. (2017). Comprehensive identification and clustering of CLV3/ESR-related (CLE) genes in plants finds groups with potentially shared function. New Phytol. 216, 605–616. doi: 10.1111/nph.14348
Goodstein, D. M., Shu, S., Howson, R., Neupane, R., Hayes, R. D., Fazo, J., et al. (2012). Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 40, D1178–D1186. doi: 10.1093/nar/gkr944
Hetherington, A. J., Emms, D. M., Kelly, S., and Dolan, L. (2020). Gene expression data support the hypothesis that Isoetes rootlets are true roots and not modified leaves. Sci. Rep. 10:21547. doi: 10.1038/s41598-020-78171-y
Hirakawa, Y., Fujimoto, T., Ishida, S., Uchida, N., Sawa, S., Kiyosue, T., et al. (2020). Induction of multichotomous branching by CLAVATA peptide in Marchantia polymorpha. Curr. Biol. 30, 3833–3840e3834. doi: 10.1016/j.cub.2020.07.016
Hirakawa, Y., Shinohara, H., Welke, K., Irle, S., Matsubayashi, Y., Torii, K. U., et al. (2017). Cryptic bioactivity capacitated by synthetic hybrid plant peptides. Nat. Commun. 8:14318. doi: 10.1038/ncomms14318
Hirakawa, Y., Uchida, N., Yamaguchi, Y. L., Tabata, R., Ishida, S., Ishizaki, K., et al. (2019). Control of proliferation in the haploid meristem by CLE peptide signaling in Marchantia polymorpha. PLoS Genet. 15:e1007997. doi: 10.1371/journal.pgen.1007997
Inkpen, S. A., and Doolittle, W. F. (2016). Molecular phylogenetics and the perennial problem of homology. J. Mol. Evol. 83, 184–192. doi: 10.1007/s00239-016-9766-4
Ishizaki, K., Chiyoda, S., Yamato, K. T., and Kohchi, T. (2008). Agrobacterium-mediated transformation of the haploid liverwort Marchantia polymorpha L., an emerging model for plant biology. Plant Cell Physiol. 49, 1084–1091. doi: 10.1093/pcp/pcn085
Ishizaki, K., Nishihama, R., Ueda, M., Inoue, K., Ishida, S., Nishimura, Y., et al. (2015). Development of gateway binary vector series with four different selection markers for the liverwort Marchantia polymorpha. PLoS One 10:e0138876. doi: 10.1371/journal.pone.0138876
Katoh, K., Rozewicki, J., and Yamada, K. D. (2019). MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 20, 1160–1166. doi: 10.1093/bib/bbx108
Knie, N., Fischer, S., Grewe, F., Polsakiewicz, M., and Knoop, V. (2015). Horsetails are the sister group to all other monilophytes and Marattiales are sister to leptosporangiate ferns. Mol. Phylogenet. Evol. 90, 140–149. doi: 10.1016/j.ympev.2015.05.008
Kohchi, T., Yamato, K. T., Ishizaki, K., Yamaoka, S., and Nishihama, R. (2021). Development and molecular genetics of Marchantia polymorpha. Annu. Rev. Plant Biol. 72, 677–702. doi: 10.1146/annurev-arplant-082520-094256
Koi, S., Hisanaga, T., Sato, K., Shimamura, M., Yamato, K. T., Ishizaki, K., et al. (2016). An evolutionarily conserved plant RKD factor controls germ cell differentiation. Curr. Biol. 26, 1775–1781. doi: 10.1016/j.cub.2016.05.013
Kubota, A., Ishizaki, K., Hosaka, M., and Kohchi, T. (2013). Efficient Agrobacterium-mediated transformation of the liverwort Marchantia polymorpha using regenerating thalli. Biosci. Biotechnol. Biochem. 77, 167–172. doi: 10.1271/bbb.120700
Li, F. W., Brouwer, P., Carretero-Paulet, L., Cheng, S., de Vries, J., Delaux, P. M., et al. (2018). Fern genomes elucidate land plant evolution and cyanobacterial symbioses. Nat. Plants 4, 460–472. doi: 10.1038/s41477-018-0188-8
Li, F. W., Nishiyama, T., Waller, M., Frangedakis, E., Keller, J., Li, Z., et al. (2020). Anthoceros genomes illuminate the origin of land plants and the unique biology of hornworts. Nat. Plants 6, 259–272. doi: 10.1038/s41477-020-0618-2
Liu, Y., Johnson, M. G., Cox, C. J., Medina, R., Devos, N., Vanderpoorten, A., et al. (2019). Resolution of the ordinal phylogeny of mosses using targeted exons from organellar and nuclear genomes. Nat. Commun. 10:1485. doi: 10.1038/s41467-019-09454-w
Lori, M., van Verk, M. C., Hander, T., Schatowitz, H., Klauser, D., Flury, P., et al. (2015). Evolutionary divergence of the plant elicitor peptides (Peps) and their receptors: interfamily incompatibility of perception but compatibility of downstream signalling. J. Exp. Bot. 66, 5315–5325. doi: 10.1093/jxb/erv236
Lu, X., Shi, H., Ou, Y., Cui, Y., Chang, J., Peng, L., et al. (2020). RGF1-RGI1, a peptide-receptor complex, regulates Arabidopsis root meristem development via a MAPK signaling cascade. Mol. Plant 13, 1594–1607. doi: 10.1016/j.molp.2020.09.005
Lu, Y., Ran, J. H., Guo, D. M., Yang, Z. Y., and Wang, X. Q. (2014). Phylogeny and divergence times of gymnosperms inferred from single-copy nuclear genes. PLoS One 9:e107679. doi: 10.1371/journal.pone.0107679
Ma, R., Han, Z., Hu, Z., Lin, G., Gong, X., Zhang, H., et al. (2016). Structural basis for specific self-incompatibility response in Brassica. Cell Res. 26, 1320–1329. doi: 10.1038/cr.2016.129
Matsui, H., Iwakawa, H., Hyon, G. S., Yotsui, I., Katou, S., Monte, I., et al. (2020). Isolation of natural fungal pathogens from Marchantia polymorpha reveals antagonism between salicylic acid and jasmonate during liverwort-fungus interactions. Plant Cell Physiol. 61:442. doi: 10.1093/pcp/pcz214
Matsuzaki, Y., Ogawa-Ohnishi, M., Mori, A., and Matsubayashi, Y. (2010). Secreted peptide signals required for maintenance of root stem cell niche in Arabidopsis. Science 329, 1065–1067. doi: 10.1126/science.1191132
Meng, L., Buchanan, B. B., Feldman, L. J., and Luan, S. (2012). CLE-like (CLEL) peptides control the pattern of root growth and lateral root development in Arabidopsis. Proc. Natl. Acad. Sci. U.S.A. 109, 1760–1765. doi: 10.1073/pnas.1119864109
Ming, R., VanBuren, R., Wai, C. M., Tang, H., Schatz, M. C., Bowers, J. E., et al. (2015). The pineapple genome and the evolution of CAM photosynthesis. Nat. Genet. 47, 1435–1442. doi: 10.1038/ng.3435
Mitchum, M. G., Wang, X., Wang, J., and Davis, E. L. (2012). Role of nematode peptides and other small molecules in plant parasitism. Annu. Rev. Phytopathol. 50, 175–195. doi: 10.1146/annurev-phyto-081211-173008
Miwa, H., Tamaki, T., Fukuda, H., and Sawa, S. (2009). Evolution of CLE signaling: origins of the CLV1 and SOL2/CRN receptor diversity. Plant Signal. Behav. 4, 477–481. doi: 10.4161/psb.4.6.8391
Morris, J. L., Puttick, M. N., Clark, J. W., Edwards, D., Kenrick, P., Pressel, S., et al. (2018). The timescale of early land plant evolution. Proc. Natl. Acad. Sci. U.S.A. 115, E2274–E2283. doi: 10.1073/pnas.1719588115
Murase, K., Moriwaki, Y., Mori, T., Liu, X., Masaka, C., Takada, Y., et al. (2020). Mechanism of self/nonself-discrimination in Brassica self-incompatibility. Nat. Commun. 11:4916. doi: 10.1038/s41467-020-18698-w
Nasrallah, J. B. (2017). Plant mating systems: self-incompatibility and evolutionary transitions to self-fertility in the mustard family. Curr. Opin. Genet. Dev. 47, 54–60. doi: 10.1016/j.gde.2017.08.005
Nystedt, B., Street, N. R., Wetterbom, A., Zuccolo, A., Lin, Y. C., Scofield, D. G., et al. (2013). The Norway spruce genome sequence and conifer genome evolution. Nature 497, 579–584. doi: 10.1038/nature12211
Olsson, V., Joos, L., Zhu, S., Gevaert, K., Butenko, M. A., and De Smet, I. (2019). Look closely, the beautiful may be small: precursor-derived peptides in plants. Annu. Rev. Plant Biol. 70, 153–186. doi: 10.1146/annurev-arplant-042817-040413
One Thousand Plant Transcriptomes Initiative (2019). One thousand plant transcriptomes and the phylogenomics of green plants. Nature 574, 679–685. doi: 10.1038/s41586-019-1693-2
Ou, Y., Lu, X., Zi, Q., Xun, Q., Zhang, J., Wu, Y., et al. (2016). RGF1 INSENSITIVE 1 to 5, a group of LRR receptor-like kinases, are essential for the perception of root meristem growth factor 1 in Arabidopsis thaliana. Cell Res. 26, 686–698. doi: 10.1038/cr.2016.63
Plackett, A. R., Di Stilio, V. S., and Langdale, J. A. (2015). Ferns: the missing link in shoot evolution and development. Front. Plant Sci. 6:972. doi: 10.3389/fpls.2015.00972
Ponce de León, I., and Montesano, M. (2017). Adaptation mechanisms in the evolution of moss defenses to microbes. Front. Plant Sci. 8:366. doi: 10.3389/fpls.2017.00366
Poveda, J. (2020). Marchantia polymorpha as a model plant in the evolutionary study of plant-microorganism interactions. Curr. Plant Biol. 23:100152. doi: 10.1016/j.cpb.2020.100152
Pruitt, R. N., Joe, A., Zhang, W. G., Feng, W., Stewart, V., Schwessinger, B., et al. (2017). A microbially derived tyrosine-sulfated peptide mimics a plant peptide hormone. New Phytol. 215, 725–736. doi: 10.1111/nph.14609
Pryer, K. M., Schuettpelz, E., Wolf, P. G., Schneider, H., Smith, A. R., and Cranfill, R. (2004). Phylogeny and evolution of ferns (monilophytes) with a focus on the early leptosporangiate divergences. Am. J. Bot. 91, 1582–1598. doi: 10.3732/ajb.91.10.1582
Ran, J. H., Shen, T. T., Wang, M. M., and Wang, X. Q. (2018). Phylogenomics resolves the deep phylogeny of seed plants and indicates partial convergent or homoplastic evolution between Gnetales and angiosperms. Proc. Biol. Sci. 285:20181012. doi: 10.1098/rspb.2018.1012
Rensing, S. A., Lang, D., Zimmer, A. D., Terry, A., Salamov, A., Shapiro, H., et al. (2008). The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants. Science 319, 64–69. doi: 10.1126/science.1150646
Ronald, P., and Joe, A. (2018). Molecular mimicry modulates plant host responses to pathogens. Ann. Bot. 121, 17–23. doi: 10.1093/aob/mcx125
Rovekamp, M., Bowman, J. L., and Grossniklaus, U. (2016). Marchantia MpRKD regulates the gametophyte-sporophyte transition by keeping egg cells quiescent in the absence of fertilization. Curr. Biol. 26, 1782–1789. doi: 10.1016/j.cub.2016.05.028
Schafran, P. W., Zimmer, E. A., Taylor, W. C., and Musselman, L. J. (2018). A whole chloroplast genome phylogeny of diploid species of isoetes (Isoetaceae, Lycopodiophyta) in the Southeastern United States. Castanea 83, 224–235. doi: 10.2179/17-132
Schuettpelz, E., Schneider, H., Smith, A. R., Hovenkamp, P., Prado, J., Rouhan, G., et al. (2016). A community-derived classification for extant lycophytes and ferns. J. Syst. Evol. 54, 563–603. doi: 10.1111/jse.12229
Segonzac, C., and Monaghan, J. (2019). Modulation of plant innate immune signaling by small peptides. Curr. Opin. Plant Biol. 51, 22–28. doi: 10.1016/j.pbi.2019.03.007
Shao, Y., Yu, X., Xu, X., Li, Y., Yuan, W., Xu, Y., et al. (2020). The YDA-MKK4/MKK5-MPK3/MPK6 cascade functions downstream of the RGF1-RGI ligand-receptor pair in regulating mitotic activity in the root apical meristem. Mol. Plant 13, 1608–1623. doi: 10.1016/j.molp.2020.09.004
Shinohara, H. (2021). Root meristem growth factor RGF, a sulfated peptide hormone in plants. Peptides 142:170556. doi: 10.1016/j.peptides.2021.170556
Shinohara, H., Mori, A., Yasue, N., Sumida, K., and Matsubayashi, Y. (2016). Identification of three LRR-RKs involved in perception of root meristem growth factor in Arabidopsis. Proc. Natl. Acad. Sci. U.S.A. 113, 3897–3902.
Song, W., Liu, L., Wang, J., Wu, Z., Zhang, H., Tang, J., et al. (2016). Signature motif-guided identification of receptors for peptide hormones essential for root meristem growth. Cell Res. 26, 674–685. doi: 10.1038/cr.2016.62
Spencer, V., Venza, Z. N., and Harrison, C. J. (2020). What can lycophytes teach us about plant evolution and development? Modern perspectives on an ancient lineage. Evol. Dev. 23, 174–196. doi: 10.1111/ede.12350
Strabala, T. J., Phillips, L., West, M., and Stanbra, L. (2014). Bioinformatic and phylogenetic analysis of the CLAVATA3/EMBRYO-SURROUNDING REGION (CLE) and the CLE-LIKE signal peptide genes in the Pinophyta. BMC Plant Biol. 14:47. doi: 10.1186/1471-2229-14-47
Stuhrwohldt, N., Scholl, S., Lang, L., Katzenberger, J., Schumacher, K., and Schaller, A. (2020). The biogenesis of CLEL peptides involves several processing events in consecutive compartments of the secretory pathway. eLife 9:e55580. doi: 10.7554/eLife.55580
Tadiello, A., Ziosi, V., Negri, A. S., Noferini, M., Fiori, G., Busatto, N., et al. (2016). On the role of ethylene, auxin and a GOLVEN-like peptide hormone in the regulation of peach ripening. BMC Plant Biol. 16:44. doi: 10.1186/s12870-016-0730-7
Takeuchi, H. (2021). The role of diverse LURE-type cysteine-rich peptides as signaling molecules in plant reproduction. Peptides 142:170572. doi: 10.1016/j.peptides.2021.170572
Takeuchi, H., and Higashiyama, T. (2012). A species-specific cluster of defensin-like genes encodes diffusible pollen tube attractants in Arabidopsis. PLoS Biol. 10:e1001449. doi: 10.1371/journal.pbio.1001449
Tavormina, P., De Coninck, B., Nikonorova, N., De Smet, I., and Cammue, B. P. A. (2015). The plant peptidome: an expanding repertoire of structural features and biological functions. Plant Cell 27, 2095–2118. doi: 10.1105/tpc.15.00440
van Kesteren, R. E., Tensen, C. P., Smit, A. B., van Minnen, J., Kolakowski, L. F., Meyerhof, W., et al. (1996). Co-evolution of ligand-receptor pairs in the vasopressin/oxytocin superfamily of bioactive peptides. J. Biol. Chem. 271, 3619–3626. doi: 10.1074/jbc.271.7.3619
Wang, X. Y., Zhang, N., Zhang, L. N., He, Y. X., Cai, C., Zhou, J. G., et al. (2021). Perception of the pathogen-induced peptide RGF7 by the receptor-like kinases RGI4 and RGI5 triggers innate immunity in Arabidopsis thaliana. New Phytol. 230, 1110–1125. doi: 10.1111/nph.17197
Waterhouse, A. M., Procter, J. B., Martin, D. M., Clamp, M., and Barton, G. J. (2009). Jalview version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191. doi: 10.1093/bioinformatics/btp033
Weststrand, S., and Korall, P. (2016). Phylogeny of Selaginellaceae: there is value in morphology after all! Am. J. Bot. 103, 2136–2159. doi: 10.3732/ajb.1600156
Whitewoods, C. D., Cammarata, J., Nemec Venza, Z., Sang, S., Crook, A. D., Aoyama, T., et al. (2018). CLAVATA was a genetic novelty for the morphological innovation of 3D growth in land plants. Curr. Biol. 28, 2365–2376.e5. doi: 10.1016/j.cub.2018.05.068
Whitford, R., Fernandez, A., Tejos, R., Perez, A. C., Kleine-Vehn, J., Vanneste, S., et al. (2012). GOLVEN secretory peptides regulate auxin carrier turnover during plant gravitropic responses. Dev. Cell 22, 678–685. doi: 10.1016/j.devcel.2012.02.002
Wikstrom, N., and Kenrick, P. (2000). Phylogeny of epiphytic Huperzia (Lycopodiaceae): paleotropical and neotropical clades corroborated by rbcL sequences. Nord. J. Bot. 20, 165–171. doi: 10.1111/j.1756-1051.2000.tb01561.x
Yamada, M., Han, X., and Benfey, P. N. (2020). RGF1 controls root meristem size through ROS signalling. Nature 577, 85–88. doi: 10.1038/s41586-019-1819-6
Yasui, Y., Tsukamoto, S., Sugaya, T., Nishihama, R., Wang, Q., Kato, H., et al. (2019). GEMMA CUP-ASSOCIATED MYB1, an ortholog of axillary meristem regulators, is essential in vegetative reproduction in Marchantia polymorpha. Curr. Biol. 29, 3987–3995e3985. doi: 10.1016/j.cub.2019.10.004
Zhang, H., Han, Z., Song, W., and Chai, J. (2016). Structural insight into recognition of plant peptide hormones by receptors. Mol. Plant 9, 1454–1463. doi: 10.1016/j.molp.2016.10.002
Zhang, J., Fu, X. X., Li, R. Q., Zhao, X., Liu, Y., Li, M. H., et al. (2020). The hornwort genome and early land plant evolution. Nat. Plants 6, 107–118. doi: 10.1038/s41477-019-0588-4
Keywords: RGF/GLV/CLEL, signaling peptide, land plant evolution, 1KP project, Marchantia polymorpha, homology, convergence
Citation: Furumizu C and Sawa S (2021) The RGF/GLV/CLEL Family of Short Peptides Evolved Through Lineage-Specific Losses and Diversification and Yet Conserves Its Signaling Role Between Vascular Plants and Bryophytes. Front. Plant Sci. 12:703012. doi: 10.3389/fpls.2021.703012
Received: 30 April 2021; Accepted: 28 June 2021;
Published: 20 July 2021.
Edited by:
Fuminori Takahashi, Tokyo University of Science, JapanReviewed by:
Lin Xu, Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences (CAS), ChinaHervé Canut, UMR 5546 Laboratoire de Recherche en Sciences Vegetales (LRSV), France
Copyright © 2021 Furumizu and Sawa. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chihiro Furumizu, Y2hpaGlyby5mdXJ1bWl6dUBtZWQuYWtpdGEtdS5hYy5qcA==
†Present address: Chihiro Furumizu, Department of Immunology, Graduate School of Medicine, Akita University, Akita, Japan