- 1Molecular and Biological Agricultural Sciences Program, Taiwan International Graduate Program, Academia Sinica and National Chung Hsing University, Taipei, Taiwan
- 2Agricultural Biotechnology Research Center, Academia Sinica, Taipei, Taiwan
- 3Graduate Institute of Biotechnology, National Chung Hsing University, Taichung, Taiwan
- 4Biotechnology Center, National Chung Hsing University, Taichung, Taiwan
The number of genes encoding receptor-like kinases (RLKs) has expanded in the plant lineage. Their expansion has resulted in the emergence of diverse domain architectures that function in signaling cascades related to growth, development, and stress response. In this study, we focused on receptor-like cytoplasmic kinase subfamily XI (RLCK XI) in plants. We discovered an exceptionally long kinase insert domain (KID), averaging 280 amino acids, between subdomains VII and VIII of the conserved protein kinase domain. Using sequence homology search, we identified members of RLCK XI with the unique KID architecture in terrestrial plants, up to a single copy in several hornwort and liverwort species. The KID shows a high propensity for being disordered, resembling the activation segment in the model kinase domain. Several conserved sequence motifs were annotated along the length of the KID. Of note, the KID harbors repetitive nuclear localization signals capable of mediating RLCK XI translocation from the plasma membrane to the nucleus. The possible physiological implication of dual localization of RLCK XI members is discussed. The presence of a KID in RLCK XI represents a unique domain architecture among RLKs specific to land plants.
Introduction
Protein phosphorylation by protein kinases is a fundamental means of cell signaling across all living organisms (Kobir et al., 2011; Jin and Pawson, 2012; Esser et al., 2016). Protein kinases regulate cellular processes through the catalytic transfer of ƴ-phosphate from ATP to serine, threonine, or tyrosine residues on their target substrates (Taylor and Kornev, 2011). Protein kinases are classified into two groups based on differences in their protein kinase fold, eukaryotic protein kinases (EPKs) with conserved subdomains and atypical protein kinases lacking conserved subdomains (Kanev et al., 2019; Modi and Dunbrack, 2019).
The protein kinase domain of EPKs has a bilobal structure with an average length of 250 amino acids (Taylor et al., 2012). It is comprised of 11 conserved subdomains (I-XI) that fold into an N-terminal and C-terminal lobe (Hanks et al., 1988; Taylor et al., 2012). Subdomains I-IV are part of the N-terminal lobe that anchors and orients ATP. Subdomain V forms the linker region between the two lobes. Subdomains VI-XI are part of the C-terminal lobe that binds substrates and brings them close to ATP for phosphorylation. A key regulatory region known as the activation segment is found within the C-terminal lobe (Johnson et al., 1996; Adams, 2003). The activation segment is flanked by the conserved tripeptide motifs Asp-Phe-Gly (DFG) and Ala-Pro-Glu (APE) of subdomain VII and subdomain VIII, respectively (Nolen et al., 2004). The activation segment does not typically form a stable secondary or tertiary structure (Gogl et al., 2019). In this regard, the activation segment exhibits a high propensity for protein disorder, which is found in sequences referred to as intrinsically disordered regions (IDRs). IDRs are protein segments that do not fold into a recognizable secondary or tertiary structure in their native state (Babu, 2016). As an IDR, the activation segment functions as a flexible region containing phosphorylation sites that can regulate protein kinase activity (Kurotani et al., 2014; Wright and Dyson, 2015).
The activation segment has an average length of 20-35 amino acids (Nolen et al., 2004; Taylor et al., 2012); however, there are a few reports of longer and much less conserved versions of the activation segments, called the kinase insert domains (KIDs), found between subdomain VI and VII in mammalian receptor tyrosine kinases (RTKs) (Locascio and Donoghue, 2013). In RTKs, KIDs can be classified according to their length. The boundaries between the conserved subdomains and the KID have been defined by sequence homology among the members of RTK families (Locascio and Donoghue, 2013). Phosphorylation within the KID of RTKs has been shown to regulate cellular processes, such as mitogenesis and chemotaxis (Locascio and Donoghue, 2013). Much less is known about KIDs in other protein kinase groups.
In plants, the largest proportion of protein kinases belong to a group known as receptor-like kinases (RLKs) that play key roles in growth, development, and stress response (Jamieson et al., 2018; Dievart et al., 2020). RLKs structurally resemble RTKs in having an extracellular domain followed by a single-pass transmembrane region and a C-terminal protein kinase domain (Lehti-Shiu et al., 2009; Jamieson et al., 2018; Jose et al., 2020). The closest homologs of RLKs in animals are Pelle and interleukin receptor-associated kinase (IRAK) (Shiu and Bleecker, 2001). However, RLKs form a monophyletic family distinct from other groups of protein kinases but closely related to Rapidly Accelerated Fibrosarcoma (Raf) kinases and RTKs from animals (Shiu and Bleecker, 2001). The number of RLKs has dramatically expanded in plants from as few as one member in the chlorophyte Ulva mutabilis, to more than 610 members in Arabidopsis thaliana and more than 1,000 copies in Oryza. sativa (Shiu et al., 2004; Dievart et al., 2020). Their expansion has led to diverse protein domains interacting with a wide range of ligands across multiple signaling pathways (Dievart et al., 2020; Gong and Han, 2021).
RLKs can be classified based on the presence of conserved domains in their sequences. Those with one or several extracellular ligand-binding domains are called receptor kinases (RKs), and those without are called receptor-like cytoplasmic kinases (RLCKs) (Yamaguchi et al., 2013; Liang and Zhou, 2018). There are 50 RLK subfamilies based on the sequence similarity of their protein kinase domain (Shiu and Bleecker, 2001). The largest subfamilies contain multiple leucine-rich repeats (LRR) domains, which interact with ligands involved in developmental and stress response pathways (Man et al., 2020). For example, CLAVATA1 (CLV1) controls meristem development (Hazak and Hardtke, 2016). Flagellin-sensitive 2 (FLS2) and BRASSINOSTEROID INSENSITIVE 1-associated kinase 1 (BAK1) undergo conformational changes following signal perception that leads to heterodimerization and reciprocal phosphorylation during pathogen perception (Schulze et al., 2010; Sun et al., 2013; Oh et al., 2018).
RLCKs work in tandem with RKs to transduce signals perceived at the plasma membrane and regulate intracellular processes involving fellow RLCKs (Liang and Zhou, 2018). One of the best-characterized plant RLCKs is the BOTRYTIS-INDUCED KINASE 1 (BIK1), which transmits signals upon the perception of microbe-associated molecular patterns by FLS2/BAK1 kinases at the cell surface to downstream intracellular events (Lu et al., 2010; Ma et al., 2020). RLCKs also act as decoys that interact with effectors to reduce pathogen severity and trigger downstream immune response pathways (van der Hoorn and Kamoun, 2008; Paulus and van der Hoorn, 2018).
Plant RLKs have a diverse collection of domain architectures that dictate their function in several signaling pathways (Jose et al., 2020). In this study, we characterize one RLCK subfamily, RLCK XI. We showed that RLCK XI is conserved in land plants and identified a previously unannotated region, referred to as the RLCK XI-KID, which splits the conserved protein kinase domain into two parts. The RLCK XI-KIDs have an average length of 280 amino acids. The RLCK XI-KID is located between protein kinase subdomains VII and VIII, making the KID equivalent to the activation segment in canonical protein kinases. Conventionally, the activation segments have no well-defined tertiary structure and display intrinsically disordered properties, which was also observed in RLCK XI-KIDs. Notably, we observed repetitive nuclear localization motifs in the KIDs, which were functional in mediating nuclear translocation. The full-length RLCK XI showed both plasma membrane and nuclear localization. However, deletion of the transmembrane domain leads to complete nuclear localization. Our study presents evidence of a newfound domain architecture in the RLCK XI members with a unique KID. Furthermore, functional nuclear localization signals were embedded within the KID of RLCK XI members, which might be implicated in the signaling of various kinase cascades.
Materials and methods
Sequence retrieval
Protein sequences were gathered from the Joint Genome Institute (JGI) Phytozome 12 database (https://phytozome-next.jgi.doe.gov) unless specified otherwise (Goodstein et al., 2012). Sequences from Antheros agrestis, Antheros angustus, and Antheros punctatus were gathered from the hornwort sequence database (https://www.hornworts.uzh.ch/en/download.html) of the Szovenyi group. Sequences from Salvinia cucullate were gathered from the Fernbase (https://www.fernbase.org/). Sequences from Picea abies were gathered from the gymnosperm PLAZA plant comparative genomics database (https://bioinformatics.psb.ugent.be/plaza) (Proost et al., 2009).
Identification of RLCK XI candidates
RLCK XI candidates were identified using the hmmsearch program from the HMMER3 package (Eddy, 2009). The hmmsearch was conducted on 60 proteomes (Supplementary Table 1) across the plant lineage against an RLCK XI profile Hidden Markov Model (HMM). The RLCK XI profile HMM comprised the four known members of RLCK XI from A. thaliana. The resulting 152 sequences that passed an E-value threshold of 0.01 were classified as RLCK XI candidates (Supplementary Table 2).
Sequence analysis and annotation for domains and motifs
RLCK XI candidates were annotated for domains using the Conserved Domain Search Tool (https://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi) (Lu et al., 2020). Nuclear localization signals within RLCK XI sequences were identified using the Eukaryotic Linear Motif (ELM) resource (Kumar et al., 2022). Conserved motifs were mapped using the Multiple Em for Motif Elicitation (MEME) online toolbox (http://meme-suite.org/) (Bailey et al., 2015). Transmembrane regions were annotated using Deep Transmembrane Hidden Markov Model (DeepTMHMM) (https://dtu.biolib.com/app/DeepTMHMM/) (Hallgren et al., 2022). Intrinsically disordered regions (IDRs) were analyzed using the IUPred2A program configured to the default (long) setting (Erdos and Dosztanyi, 2020). Sequence logos were built using WebLogo (https://weblogo.berkeley.edu/logo.cgi) (Crooks et al., 2004).
Sequence alignment and phylogenetic analysis
For protein alignment of AthRLCK XI and representative RLKs in Figure 1, we aligned members of RLCK XI from A. thaliana together with well-studied RLK AthCLV1 from LRR-RLK XI-1 subfamily, AthBIK1 from RLCK VIIa-2 subfamily, and the distant RLK homolog HsaIRAK1 from humans. For Supplementary Figure 1, we used the RLK dataset from Dievart et al. (2020), which included grouped subfamilies of RLKs from the iTAK database of protein kinases (http://itak.feilab.net/cgi-bin/itak/index.cgi). To identify suitable RLK representatives, each RLK subfamily was aligned with AthRLCK XI-2. The sequence with the highest pairwise comparison score with AthRLCK XI-2 was chosen to represent their subfamily. AthRLCK XI-2 was selected because it has the highest average pairwise similarity score among all AthRLCK XI members. Each RLK representative selected for alignment contained all eleven conserved protein kinase subdomains. RLK subfamilies whose members lacked one or more conserved protein kinase subdomains were excluded from the alignment. To determine the evolutionary conservation of the RLCK XI subfamily in plants, full RLCK XI protein sequences were aligned using the L-INS-i strategy from MAFFT version 7 (Katoh and Standley, 2013). The resulting alignment was used to construct a phylogenetic tree using the maximum likelihood method from IQ-TREE (Nguyen et al., 2015). The Phylogenetic tree was illustrated using iTOL (https://itol.embl.de) (Letunic and Bork, 2021).
Figure 1 The members of receptor-like cytoplasmic kinase subfamily XI from A. thaliana possess a kinase insert domain. Sequence alignment of the protein kinase domains of RLCK XI members, AthRLCK XI-1 (AT1G80870), AthRLCK XI-2 (AT2G45590), AthRLCK XI-3 (AT4G25390), AthRLCK XI-4 (AT5G51770), together with representative RLKs found in A. thaliana. CLV1 (AT1G75820) represents AthLRR-RLK XI-1, BIK1 (AT2G39660) represents AthRLCK VIIa-2, and HasIRAK1 represents RLK found in mammals. The location of conserved protein kinase subdomains (I-XI) on each sequence (orange) was based on sequence alignment with well-characterized protein kinase sequences. An insertion domain in the RLCK XI members was identified between the conserved motifs Asp-Phe-Gly (DFG) and Ala-Pro-Glu (APE) of subdomain VII and subdomain VIII, respectively.
BLAST search for KID, KID-motif, and NLS-containing sequences
The KID, M2, M3, and NLS sequences found in AthRLCK XI-2 were used as queries for BLAST search against the NCBI non-redundant (nr) protein database (Altschul et al., 2005). The BLAST search was performed using the BLASTP algorithm. The E-value threshold was set to 1 to broaden the coverage of the database search due to the lack of prior knowledge involving sequences containing the RLCK XI-KID.
Subcellular localization of RLCK XI from Arabidopsis thaliana and rice
Full-length protein-coding sequences of AthRLCK XI members and different individual domains were PCR amplified, subcloned into pCR8/GW/TOPO vector, and subsequently recombined into the binary vector by LR reaction following the manufacturer’s instructions (constructs are listed in Supplementary Table 3). Next, the constructs were transfected into Arabidopsis protoplasts by polyethylene glycol for transient expression (Yoo et al., 2007). Sixteen hours later, transfected cells were imaged using a Zeiss LSM 780 confocal microscope (Germany). Image analysis was performed with LSM software ZEN.
Inducible expression of AthRLCK XI in Arabidopsis
The full-length protein-coding sequences of AthRLCK XI-2 and AthRLCK XI-3 with the GFP tag at the C-terminal end were cloned into pMDC7 vector (Curtis and Grossniklaus, 2003) and transformed into A. thaliana Columbia (Col-0) accession for inducible expression by estradiol. Transgenic plants were generated through an Agrobacterium tumefaciens (GV3101) dipping procedure (Clough and Bent, 1998). The successful transgenic T2 lines were grown on agar plates with half-strength modified Hoagland nutrient solution (Chiou et al., 2006). Ten-day-old seedlings were transferred to the same medium containing 10 μM β-estradiol. The expression of GFP-tagged AthRLCK XI-2 and AthRLCK XI-3 in the root was observed two days after β-estradiol treatment. The confocal microscopy images of roots were taken using an Andor Dragonfly spinning disk confocal microscope with an objective lens PL APO 40X/1.10 W CORR. Excitation/emission wavelengths were 488 nm/500–550 nm for GFP.
Expression profile of RLCK XI from Arabidopsis thaliana
Publicly available RNA-seq data were obtained from the Arabidopsis RNA-seq database (ARS) (Zhang et al., 2020) (http://ipf.sustech.edu.cn/pub/athrna/). Data from Arabidopsis tissues and biotic stress response were collected for RLCK XI members from A. thaliana.
Results
Members of RLCK XI from Arabidopsis thaliana possess a kinase insert domain
Receptor-like kinases possess a well-defined protein kinase domain that has a length of 250-300 amino acids and includes eleven conserved subdomains (I-XI) (Hanks and Hunter, 1995; Shiu and Bleecker, 2001; Lehti-Shiu and Shiu, 2012). There are over 600 RLKs found in the model organism Arabidopsis thaliana that can be classified into more than 50 subfamilies (Shiu and Bleecker, 2001; Shiu and Bleecker, 2003; Dievart et al., 2020). However, only a handful of these RLKs have been studied (Fan et al., 2018; Dievart et al., 2020). Among the RLKs found in A. thaliana, we noticed that the four members of RLCK XI, namely, AthRLCK XI-1 (AT1G80870), AthRLCK XI-2 (AT2G45590), AthRLCK XI-3 (AT4G25390), and AthRLCK XI-4 (AT5G51770), had a longer protein sequence compared to other RLCKs by an average of 280 amino acids. We then investigated the origin of the difference in sequence length by aligning the protein kinase domain of AthRLCK XI members with well-studied RLKs AthCLV1, AthBIK1, and HsaIRAK1 to represent the different structural classifications of RLKs (Shiu and Bleecker, 2001; Dievart et al., 2020). The alignment revealed an unannotated region of low similarity that only aligned within members of RLCK XI (Figure 1). Closer inspection showed that this region resided in the activation segment between the conserved subdomain VII and VIII of the protein kinase domain. The unannotated region within the protein kinase domain of RLCK XI-1 to 4 had an average length of 276 amino acids, compared to the 20-35 amino acids in the activation segment of AthCLV1, BIK1, and HsaIRAK1 in the same region. We refer to this region as the RLCK XI kinase insert domain (RLCK XI-KID). To reinforce the uniqueness of the RLCK XI-KID among RLKs, we performed additional multiple sequence alignment with representatives from several RLK and RLCK subfamilies from A. thaliana. The resulting alignment revealed that none of these other RLCK or RLK subfamilies in A. thaliana had sequences similar to the KID (Supplementary Figure 1). These results suggest that members of AthRLCK XI possess a unique kinase insert domain located between conserved subdomain VII and VIII within their protein kinase domain.
The RLCK XI subfamily is conserved in land plants
To understand whether the members of AthRLCK XI are conserved, we built a profile Hidden Markov Model (HMM) using AthRLCK XI members to search the Phytozome 12 database for potential homologs in the plant lineage (Supplementary Table 1). A total of 152 RLCK XI candidate sequences were retrieved using the hmmsearch program of the HMMER3 package (Supplementary Table 2). Similar to the naming convention of AthRLCKs, these RLCK XI candidates were assigned names based on their physical locations on the genome of their respective plant species. Annotation of individual protein sequences revealed that all RLCK XI candidates possessed the same domain architecture as those seen in AthRLCK XI members. These results show that RLCK XI is evolutionarily conserved in land plants (Figure 2).
Figure 2 Receptor-like cytoplasmic kinase subfamily XI is conserved in land plants. The phylogenetic tree was constructed based on the protein sequence alignment of 152 full-length RLCK XI candidates from land plants. The alignment was used as input for IQ-TREE with the maximum-likelihood method. The tree is divided into two major clades (Clades 1 and 2), each containing members from different plant lineages represented by different colors. Members from A. thaliana are pointed at in the cladogram. Bootstrap replicates were set to 1000. Nodes showing bootstrap confidence of 80 or higher are shown. Visualization and annotation were conducted using iTOL.
The tree was divided into two major clades. Clade 1 consisted of members from ferns, gymnosperms, basal angiosperms, monocots, and dicots. On the other hand, clade 2 consisted of bryophytes, lycophytes, a basal angiosperm, monocots, and dicots. We identified eight RLCK XI members from the bryophytes Anthoceros agrestis, Anthoceros angustus, Anthoceros punctatus, Marchantia polymorpha, Physcomitrella patens, and Sphagnum fallax. The number of RLCK XI members increased from a single member in bryophytes to two members in basal angiosperms. In turn, the RLCK XI subfamily increased in number in monocots and dicots, with 2-3 members in monocots, and up to 8 members in the genome of dicot Kalanchoe laxiflora (Figure 2). Interestingly, AthRLCK XI-1 was grouped separately from the other AthRLCK XI members in clade 2. RLCK XI members from the same monocot and dicot species could be found in both clades, but RLCK XI members from gymnosperms, ferns, lycophytes, and bryophytes from the same species were exclusively found in one clade but not both. We did not detect RLCK XI members in the chlorophytes, such as Chlamydomonas reinhardtii, Dunaliella salina, Volvox carteri, Coccomyxa subellipsoidea, Micromonas pusilla, and Ostreococcus lucimarinus. Our results show that the RLCK XI subfamily is conserved in land plants. The presence of RLCK XI among land plants suggest that they may play a crucial role in the evolution of the land plant lineage.
To test whether the RLCK XI-KID influenced phylogenetic relationships, we performed the same procedure using only their KID sequences (Supplementary Figure 2). The constructed RLCK XI-KID tree demonstrated similarities to the full-length tree. The RLCK-KID tree remained split into two major clades where lineages and plant species were grouped similarly to the full-length tree. RLCK XI members from monocots and dicots clustered similarly in the RLCK XI-KID tree. However, RLCK XI from M. polymorpha, P. patens, and S. fallax became a sister of ferns, gymnosperms, and angiosperms in clade 1 (Supplementary Figure 2). These results suggest that the RLCK XI-KID is a conserved region among members of RLCK XI.
Additionally, we performed a BLAST search using the KID of AthRLCK XI-2 to examine whether the RLCK XI-KID is present in sequences other than RLCK XI. A total of 1346 hits that included both hypothetical and predicted proteins were identified and all of them could be traced as members of the RLCK XI subfamily based on their sequence length and coverage of the AthRLCK XI-2 KID (Supplementary Table 4). These results show that the RLCK XI-KID is exclusively found in members of RLCK XI within the plant lineage.
The RLCK XI-KID contains intrinsically disordered regions and conserved sequence motifs
We observed that the length of the RLCK XI-KID has been dynamic in the course of land plant evolution, ranging from 199-527 amino acids, with 280 amino acids on average. The longest KID belonged to MpoRLCK XI-1, which is the only RLCK XI homolog found in M. polymorpha (Supplementary Table 2).
Previous studies have shown that the activation segments of protein kinases function as flexible contact sites with high intrinsic disorder relative to the surrounding protein kinase core (Gogl et al., 2019). We thus examined whether the RLCK XI-KID also exhibits this property. The full-length sequences of RLCK XI members were used to predict the distribution of IDRs through IUPred2A (Dosztanyi et al., 2005). Based on the propensity for disordered regions, the kinase domain-containing conserved subdomains I-VII and VII-XI have a low propensity for being disordered. In contrast, the KIDs clearly show a high tendency for disorder among most RLCK XI members (Figure 3). This result reveals that, like the activation segment of protein kinases, RLCK XI-KIDs also presented qualities found in IDRs.
Figure 3 The intrinsically disordered region (IDR) of RLCK XI members corresponds to the kinase insertion domain. Energy estimation of full-length RLCK XI sequences based on IUPred2A. The color gradient represents the propensity of a protein region to be disordered, with darker shades of blue indicating lower disorder and darker shades of red indicating higher disorder. RLCK XI members are arranged according to their sequence length. RLCK XI members from different species are indicated by different colors in the left column.
Next, we wanted to know whether conserved motifs existed within the RLCK XI-KID. We observed that the 152 RLCK XI-KID sequences we collected had an average sequence identity of 40%. We examined these sequences for ungapped sequence motifs using the Multiple Em for Motif Elicitation (MEME) suite (http://meme-suite.org/) (Bailey et al., 2015) as a means to determine the origin of the observed sequence similarity. We found twelve conserved motifs with E-values of less than 0.01 across all RLCK XI-KID sequences (Figure 4A). These motifs were numbered from lowest E-value (M1) to highest (M12). Interestingly, sub-motif D/EW and D/EWW variations were observed in motifs M2, M3, and M6, while motifs with polylysine residues were detected in M4 and M11. We used the D/EWW-containing M2 and M3 sequences of AthRLCK XI-2 to perform a BLAST search against the NCBI non-redundant protein database to determine whether these motifs could be found outside of RLCK XI. We only found M2 or M3-containing sequences belonging to members of RLCK XI as determined by sequence homology search (Supplementary Tables 5, 6). These results suggest that the conserved motifs within the KID are unique to members of RLCK XI.
Figure 4 Members of RLCK XI contain conserved motifs in their kinase insert domain. (A) Conserved motifs in the RLCKXI-KID were arranged according to increasing E-value (M1-M12). The amino acids are color-coded as follows: acidic residues (D, E), red; basic residues (K, R), blue; polar residues (S, T, Y), orange; others, black. Sequence logos were built using WebLogo. (B) Position of conserved motifs along the kinase insert domain of RLCK XI. The frequencies of each of the twelve motifs (M1-M12) were mapped along twelve positions (P1-P12). The motif with the highest frequency in each of the twelve positions is enclosed in a box.
In our motif search, we observed that each of the 12 motifs identified only occurred once in all 152 RLCK XI-KID sequences. We wanted to investigate whether these motifs were positioned at random or arranged in a discernable pattern along the RLCK XI-KID. We divided the RLCK XI-KID into twelve positions (P1-P12) based on the single occurrence of each of the twelve motifs in all RLCK XI-KID sequences. If a discernable pattern exists in the KID, then each position would reflect a high frequency of specific motifs in that position. When we explored the position or arrangement of these motifs (Figure 4B), we found that M8 and M9 were positioned most frequently at the N- and C-terminal ends of the RLCK XI-KID. Interestingly, M3 and M4 had the highest frequency in multiple positions. M3 had the highest frequency in P3, P4, and P5. Similarly, M4 had the highest frequency in P9, P10, and P11. We noticed a prevalence of charged residues in M1, M2, M3, M4, and M11. On the other hand, M10 was composed of mostly serine residues. M10, M11, and M12 were found across multiple positions on the KID. We hypothesize that the prevalence of M10, M11, and M12 was in part due to the short sequence that made up their respective motifs. These results show that conserved motifs within the RLCK XI-KID have distinct sequence and spatial properties. The conservation of these motifs implies the involvement of potential functions.
The KID of RLCK XI members possesses nuclear localization signals
Disordered regions have been shown to contain a high frequency of interfaces for protein-protein interaction. Some of these sites exhibit both regulatory and signaling functions and are referred to as eukaryotic linear motifs (ELMs) (Kumar et al., 2022). ELMs are evolutionarily malleable sequences and interact with relatively low affinity due to the limited number of residues that make direct contact with the binding partner (Kumar et al., 2022). These features confer ELMs with the ability to mediate transient interactions to maintain robust cell signaling (Davey et al., 2012; Kumar et al., 2022).
To explore the functional relevance of RLCK XI-KID, we searched the ELM resources (Kumar et al., 2022) to examine whether the KIDs of four Arabidopsis RLCK XI members contain any ELMs. We identified several ELMs, annotated as “TRG_NLS_Bipartite_1”, “TRG_NLS_MonoCore_2”, “TRG_NLS_MonoExtC_3” or “TRG_NLS_MonoExtN_4” (Supplementary Table 7), which belong to bipartite variants of the classical basic and charged nuclear localization signal (NLS). Interestingly, these NLSs repeatedly occur in the KID (Figure 5A) and overlap with the conserved motifs M2 and M3, suggesting that the conserved motifs found in RLCK XI members could be functionally relevant. Similarly, the BLAST search for sequences containing the NLS that overlapped M2 and M3 were members of RLCK XI in plants (Supplementary Table 8). In addition to the KID, NLS could be identified in the N and/or C termini of several AthRLCK XI members except for AthRLCK XI-1 (Figure 5A). Similar to Arabidopsis, two RLCK XI members from rice also contain NLSs in their KIDs.
Figure 5 Subcellular localization of the KID from different RLCK XI members. (A) Protein domain model of RLCK XI members from A. thaliana and O. sativa. The transmembrane domain (TM), protein kinase (PK) domain, KID and repetitive NLS are indicated. The protein kinase domain for RLCK XI sequences was annotated from the Conserved Domain Database (CDD). Nuclear localization signals were predicted using the Eukaryotic Linear Motif (ELM) resource. (B) Subcellular localization of GFP-tagged KID from different RLCK XI members expressed in Arabidopsis protoplasts. The fluorescence signals of GFP and chloroplasts (Chl) are merged on the bright field (BF) images. Scale bars: 5 µm.
Members of AthRLCK XI are dually localized on the plasma membrane and in the nucleus
To examine whether the observed NLSs in the KID of AthRLCK XI members are functional, we tagged the protein with a green fluorescent protein (GFP) at the C-terminus. We performed a transient expression assay in Arabidopsis protoplasts to observe their subcellular localization using confocal microscopy. While the free GFP was distributed in the cytosol and nucleus, we found the GFP-tagged KIDs of all four AthRLCK XI members and one OsaRLCK XI member were strictly confined in the nucleus (Figure 5). These results indicate that the predicted NLSs in the KID of RLCK XI are functional in mediating nuclear import.
The nuclear localization of the KID in AthRLCK XI members was unexpected due to the presence of a single predicted transmembrane domain in all four members. To validate whether RLCK XI members are membrane-bound RLKs, we transiently expressed the full-length AthRLCK XI tagged with GFP at the C-terminus and observed their subcellular localization in Arabidopsis protoplasts. In the following experiments, we only focused on AthRLCK XI-2 and -3 because we were unable to clone the full-length transcripts of AthRLCK XI-1 and -4 due to unknown reasons. Transient expression of full-length AthRLCK XI-2-GFP and -3-GFP revealed their localization on the plasma membrane and in the nucleus with a lesser signal in the cytosol (Figures 6B, D). On the other hand, protoplasts expressing AthRLCK XI-2 and -3 without a transmembrane region (ΔTM-GFP) showed localization signals exclusively in the nucleus, similar to the GFP-tagged KID sequence. However, expression of AthRLCK XI-2 and -3 without the KID (ΔKID-GFP) resulted in signals on the plasma membrane and cytosol, where the signals often formed aggregates (Figures 6B, D). Intriguingly, expression of AthRLCK XI-2 and -3 that lacked a transmembrane region and KID (ΔTMΔKID-GFP) resulted in punctate signals in the nucleus (AthRLCK XI-2) or the cytosol with less uniform nuclear signals (AthRLCK XI-3) (Figures 6B, D). We presume this nuclear localization may result from the predicted NLS sequences in the C-terminal region of AthRLCK XI-2 and -3. Indeed, further removal of the C-terminus (ΔTMΔKIDΔC-GFP) resulted in punctate signals in the cytosol and none in the nucleus (Figure 6A). These results suggest KID is essential for the proper nuclear localization of AthRLCK XI-2 and -3. Consistent with the results observed in the protoplasts, full-length AthRLCK XI-2 and -3 mainly localized on the plasma membrane and in the nucleus of transgenic Arabidopsis root tips when induced by β-estradiol (Figures 6C, E).
Figure 6 Subcellular localization of AthRLCK XI‐2 and AthRLCK XI‐3. (A) Truncation and deletion variants of AthRLCK XI-2 and -3. AthRLCK XI-2 was used as a model to show the different deletion variants used, which include: deleted N-terminal transmembrane domain (ΔTM), deleted kinase insertion domain alone (ΔKID), deleted TM and KID (ΔTMΔKID), and ΔTMΔKID without C-terminal end (ΔTMΔKIDΔC). Subcellular localization of GFP‐tagged AthRLCK XI‐2 (B) and AthRLCK XI‐3 (D) variants with domain/region deletions expressed in Arabidopsis protoplasts. The fluorescence signals of GFP and chloroplasts (Chl) are merged on the bright field (BF) images. Scale bars: 5 μm. Subcellular localization of estradiol‐induced GFP‐tagged full‐length AthRLCK XI‐2 (C) and AthRLCK XI‐3 (E) in Arabidopsis root tips. The GFP signals are merged on the bright field images (bottom panels). Scale bars: 40 μm.
The observation of dual localization of full-length AthRLCK XI-2 and -3 on the plasma membrane and nucleus raises a possibility that the plasma membrane-anchored N-terminal portions of AthRLCK XI-2 and -3 may be proteolytically cleaved, triggered by an unknown signal. As a result, the transmembrane domain-truncated C-terminal portion of AthRLCK XI is redirected to the nucleus via the NLS signals in the KID to transmit and relay the signal. It is worth noting that although NLS sequences could be found in the KID and the C-terminal end of AthRLCK XI-2 and -3, the C-terminal NLS sequences alone were insufficient to mediate the nuclear localization observed in full-length AthRLCK XI-2 and -3 (Figure 6A).
Discussion
Receptor-like kinases perceive and transduce extracellular signals into downstream cascades that allow plants to respond to biotic and abiotic stresses, as well as developmental cues (Jose et al., 2020). Much of the diversity in signal perception associated with RLKs is due to their extracellular domain, which can bind to an equally large array of peptides and ligands (Hirakawa and Sawa, 2019; Zhou et al., 2019). In contrast, little is known about the possible structural variations in their protein kinase domain (Bradley et al., 2021). The protein kinase domain contains well-characterized lobes, regions, and subdomains that facilitate the proper folding of its protein structure and catalytic activity (Modi and Dunbrack, 2019). Compared to the otherwise stable structure of the N-terminal and C-terminal lobes, the activation segment found in the C lobe exhibits the property of IDRs and is essential for functional activation (Taylor et al., 2019; Dyla and Kjaergaard, 2020). In this study, we characterize a unique subfamily of RLK, RCLK XI, whose members possess an unusually long activation segment within the kinase domain, designated as KIDs. Several lines of evidence characterize the RLCK XI-KID as an IDR containing functional nuclear localization signals wedged within their protein kinase domain. RLCK XI members present an evolutionarily conserved family in land plants with a unique protein kinase domain architecture.
RLCK XI is a novel model for a KID-containing protein kinase in land plants
Members of the RLCK XI subfamily from A. thaliana have been classified in previous literature as part of large-scale phylogenetic studies aiming to place RLKs as a whole in the protein kinase superfamily (Shiu and Bleecker, 2001; Shiu and Bleecker, 2003; Lehti-Shiu et al., 2009). However, to our knowledge, there has been no specific report of a KID in any of them. Full-length protein alignment of representative protein kinases alongside members of RLCK XI clearly shows the presence of the KID with an average length of 280 amino acids that splits the conserved subdomains I-VII from subdomains VIII-XI (Figure 1 and Supplementary Figure 1). In protein kinases, this region corresponds to the activation segment, which is 20-35 amino acids long and is less conserved within the kinase domain (Nolen et al., 2004). The activation segment, usually exhibiting qualities similar to IDRs, functions as a site for domain interaction and for regulating kinase activity (Taylor and Kornev, 2011; Gogl et al., 2019). KIDs were previously described as anomalous sequences exhibiting qualities present in IDRs with little sequence similarity between protein kinase domains of RTKs (Locascio and Donoghue, 2013; Koike et al., 2020). Although RLCK XI KIDs exhibited properties similar to IDRs (Figure 3), the presence of several conserved motifs that overlapped with predicted NLS within this region suggests that the RLCK XI KID has a role in RLCK XI function (Figure 4 and Supplementary Table S8). Indeed, classical NLS have often been found in IDRs of cargo proteins (Wubben et al., 2020). Furthermore, KIDs found in RTKs contained phosphorylation sites that regulated protein function (Locascio and Donoghue, 2013).
It would be worth exploring whether the KID affects RLCK XI protein kinase activity. To conduct the in vitro kinase activity assay, we attempted to produce recombinant proteins of AthRLCK XI-2 and AthRLCK XI-3 without the N-terminal transmembrane domain (ΔTM) by expressing them in E. coli. Unfortunately, the recombinant proteins were not expressed for unknown reasons. We suspect the intrinsically disordered property of the KID may hinder their expression in E. coli. Nevertheless, we could glean some insight from previous studies. First, we’ve observed that members of AthRLCK XI have all the subdomains (from subdomains I to XI) found in functional protein kinases. In contrast, pseudokinases have been shown to lack one or more of these subdomains in their protein sequence (Kannan and Taylor, 2008; McClendon et al., 2014; Kwon et al., 2019). Second, the location of the KID corresponds to the activation segment, which contains regulatory regions and phosphorylation sites that regulate protein kinase activity (Nolen et al., 2004; Taylor et al., 2012). We hypothesize that the RLCK XI-KID has the potential to influence protein kinase activity due to being in the same region as the activation segment in other protein kinases. Interestingly, members of RLCK XI are also classified as non-RD protein kinases. This property refers to protein kinase subdomain VI, which follows the HxD motif wherein x could be R or another amino acid (Kornev et al., 2006). RD protein kinases have been shown to require autophosphorylation on the activation loop for regulation (Adams, 2003; Steichen et al., 2010). In contrast, non-RD protein kinases can be regulated through mechanisms other than autophosphorylation (Dardick et al., 2012; Bender et al., 2021). The presence of the KID and the non-RD subdomain VI in RLCK XI may indicate that these protein kinases are regulated through an alternate mechanism from phosphorylation within the activation segment.
The RLK family of protein kinases has expanded in land plants, which resulted in the various subfamilies observed in present-day plant species (Shiu and Bleecker, 2003; Lehti-Shiu and Shiu, 2012; Dievart et al., 2020). Recent studies have identified sequences belonging to the RLK family in glaucophytes (Gong and Han, 2021). However, we have only traced members of RLCK XI up until bryophytes. The conservation and small number of the RLCK XI subfamily is similar to previously described RLK subfamilies found in plants (Lehti-Shiu and Shiu, 2012). However, none of these other RLK subfamilies have the KID (Supplementary Figure 1). Having identified the unique domain architecture in members of RLCK XI, we speculate that during an event in the course of land plant evolution, likely aquatic-to-land plant transition, this RLK acquired the KID we observe today. How the ancestral RLK acquired the KID requires further study. Nevertheless, their conservation among land plants suggests they may have played a role in land plant evolution.
Functional implication of RLCK XI
A single-pass transmembrane region in the AthRLCK XI sequences suggested their localization on the plasma membrane (Figure 5A). However, detecting multiple NLSs within the RLCK XI-KID, the juxtamembrane region, and their C-terminal sequence also suggested their nuclear localization (Figure 5A). Different truncated variants of RLCK XI proteins (Figure 6A) proved that the NLS were functional in directing their nuclear localization. The dual localization of full-length AthRLCK XI-2 and -3 on the plasma membrane and in the nucleus further verified the functionality of both the transmembrane region and NLS (Figures 6B–E). We attempted to tag either the N-terminus or C-terminus of full-length AthRLCK XI-2 and AthRLCK XI-3. However, the N-terminal GFP-tagged protein resulted in irregular aggregates in the cytosol of protoplasts. We speculate that N-terminal GFP tagging of AthRLCK XI disrupt their targeting on the plasma membrane. We, therefore, only showed the result of the C-terminal GFP-tagged proteins. In the future, adding a different fluorescent protein tag downstream of the transmembrane domain but upstream of the kinase domain of the C-terminal GFP-tagged AthRLCK XI may allow us to track changes in their subcellular localization.
The dual localization of AthRLCK XI-2 and -3 on the plasma membrane and in the nucleus suggests that these proteins may undergo proteolytic cleavage before translocating to the nucleus. Indeed, proteolytic cleavage of plant RLKs to release the intracellular fragment has been reported. In rice, the LRR-RLK Xa21 confers broad-spectrum resistance to Xanthomonas oryzae pv. Oryzae (Song et al., 1995). Xa21 undergoes proteolytic cleavage, leading to the translocation of its intracellular kinase domain to the nucleus. Xa21 has been shown to bind WRKY62 in the nucleus of rice protoplasts (Peng et al., 2008; Park and Ronald, 2012). Several RLCK members were also shown to undergo translocation from plasma membranes to the nucleus as a part of their signaling cascade. BIK1 is translocated to the nucleus and interacts with WRKY transcription factors that regulate Jasmonic acid and salicylic acid levels (Lal et al., 2018). Similarly, chitin perception-induced PBS1-LIKE 19 (PBL19) can be translocated to the nucleus, where it interacts with WRKY8 to regulate its own transcription (Li et al., 2022). In another case, the RLCK Puccinia striiformis-Induced Protein Kinase 1 (PsIPK1) found in wheat interacts with the fungal effector PsSpg1 from Puccinia striiformis to promote its nuclear localization to phosphorylate CCAAT-binding transcription factor 1d (TaCBF1d) and enhance susceptibility to the pathogen (Wang et al., 2022). Since RLKs are widely recognized as receptors and mediators for plant development and stress response, we speculate that upon being triggered by a specific signal, RLCK XI may be cleaved by a protease and undergo translocation from the plasma membrane to the nucleus. RLCK XI may function to transmit the extracellular signal to the nucleus for a signaling cascade. The signal and protease responsible for inducing this translocation mechanism have yet to be determined.
It would be fascinating to know the physiological role of RLCK XI members. Unfortunately, to the best of our knowledge, there have been no prior studies on them. According to the publicly available expression profiles (Arabidopsis RNA-seq database, http://ipf.sustech.edu.cn) (Zhang et al., 2020), all the members of AthRLCK XI were expressed ubiquitously except in the pollen (Supplementary Figures 3A–D). Members of AthRLCK XI were also expressed under different biotic stresses with some variations (Supplementary Figures 3E–H). We attempted to characterize the T-DNA insertion Arabidopsis mutants of all four AthRLCK XI members individually, but none of the single mutants showed developmental defects. Higher-order mutants or specific stress conditions may be employed to examine their function in the future. On the other hand, the transgenic Arabidopsis lines with inducible expression of AthRLCK XI (Figures 6C, E) will be excellent materials for functional characterization in the future.
In summary, although the function of the RLCK XI members remains elusive, our characterization of RLCK XI in this study has expanded on the structural diversity known for the protein kinase domain and the assortment of evolutionary changes undergone by receptor-like kinases in plants.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Author contributions
T-JC conceived and supervised the experiments. CC advised on the project. JY, C-MS and S-FC performed the experiments and analyzed data. JY, CC and T-JC wrote the manuscript. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by grants from the Ministry of Science and Technology of the Republic of China (MOST 107-2311-B-001-038-MY3) and Academia Sinica, Taiwan. Academia Sinica Advanced Optics Microscope Core Facility is funded by Academia Sinica Core Facility and Innovative Instrument Project (AS-CFII-111-208).
Acknowledgments
We are grateful to Jia Ling Li for assisting in handling microscopy images. We thank Academia Sinica Advanced Optics Microscope Core Facility for microscope imaging technical support.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1117059/full#supplementary-material
References
Adams, J. A. (2003). Activation loop phosphorylation and catalysis in protein kinases: Is there functional evidence for the autoinhibitor model? Biochemistry 42 (3), 601–607. doi: 10.1021/bi020617o
Altschul, S. F., Wootton, J. C., Gertz, E. M., Agarwala, R., Morgulis, A., Schaffer, A. A., et al. (2005). Protein database searches using compositionally adjusted substitution matrices. FEBS J. 272 (20), 5101–5109. doi: 10.1111/j.1742-4658.2005.04945.x
Babu, M. M. (2016). The contribution of intrinsically disordered regions to protein function, cellular complexity, and human disease. Biochem. Soc. Trans. 44 (5), 1185–1200. doi: 10.1042/BST20160172
Bailey, T. L., Johnson, J., Grant, C. E., Noble, W. S. (2015). The MEME suite. Nucleic Acids Res. 43 (W1), W39–W49. doi: 10.1093/nar/gkv416
Bender, K. W., Couto, D., Kadota, Y., Macho, A. P., Sklenar, J., Derbyshire, P., et al. (2021). Activation loop phosphorylation of a non-RD receptor kinase initiates plant innate immune signaling. Proc. Natl. Acad. Sci. U.S.A. 118 (38), e2108242118. doi: 10.1073/pnas.2108242118
Bradley, D., Vieitez, C., Rajeeve, V., Selkrig, J., Cutillas, P. R., Beltrao, P. (2021). Sequence and structure-based analysis of specificity determinants in eukaryotic protein kinases. Cell Rep. 34 (2), 108602. doi: 10.1016/j.celrep.2020.108602
Chiou, T. J., Aung, K., Lin, S. I., Wu, C. C., Chiang, S. F., Su, C. L. (2006). Regulation of phosphate homeostasis by MicroRNA in arabidopsis. Plant Cell 18 (2), 412–421. doi: 10.1105/tpc.105.038943
Clough, S. J., Bent, A. F. (1998). Floral dip: A simplified method for agrobacterium-mediated transformation of arabidopsis thaliana. Plant J. 16 (6), 735–743. doi: 10.1046/j.1365-313x.1998.00343.x
Crooks, G. E., Hon, G., Chandonia, J. M., Brenner, S. E. (2004). WebLogo: A sequence logo generator. Genome Res. 14 (6), 1188–1190. doi: 10.1101/gr.849004
Curtis, M. D., Grossniklaus, U. (2003). A gateway cloning vector set for high-throughput functional analysis of genes in planta. Plant Physiol. 133 (2), 462–469. doi: 10.1104/pp.103.027979
Dardick, C., Schwessinger, B., Ronald, P. (2012). Non-arginine-aspartate (non-RD) kinases are associated with innate immune receptors that recognize conserved microbial signatures. Curr. Opin. Plant Biol. 15 (4), 358–366. doi: 10.1016/j.pbi.2012.05.002
Davey, N. E., Van Roey, K., Weatheritt, R. J., Toedt, G., Uyar, B., Altenberg, B., et al. (2012). Attributes of short linear motifs. Mol. Biosyst. 8 (1), 268–281. doi: 10.1039/c1mb05231d
Dievart, A., Gottin, C., Perin, C., Ranwez, V., Chantret, N. (2020). Origin and diversity of plant receptor-like kinases. Annu. Rev. Plant Biol. 71, 131–156. doi: 10.1146/annurev-arplant-073019-025927
Dosztanyi, Z., Csizmok, V., Tompa, P., Simon, I. (2005). IUPred: Web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content. Bioinformatics 21 (16), 3433–3434. doi: 10.1093/bioinformatics/bti541
Dyla, M., Kjaergaard, M. (2020). Intrinsically disordered linkers control tethered kinases via effective concentration. Proc. Natl. Acad. Sci. U.S.A. 117 (35), 21413–21419. doi: 10.1073/pnas.2006382117
Eddy, S. R. (2009). A new generation of homology search tools based on probabilistic inference. Genome Inf. 23 (1), 205–211. doi: 10.1142/9781848165632_0019
Erdos, G., Dosztanyi, Z. (2020). Analyzing protein disorder with IUPred2A. Curr. Protoc. Bioinf. 70 (1), e99. doi: 10.1002/cpbi.99
Esser, D., Hoffmann, L., Pham, T. K., Brasen, C., Qiu, W., Wright, P. C., et al. (2016). Protein phosphorylation and its role in archaeal signal transduction. FEMS Microbiol. Rev. 40 (5), 625–647. doi: 10.1093/femsre/fuw020
Fan, M., Ma, W., Liu, C., Zhang, C., Wu, S., Chen, M., et al. (2018). Evolution and expression characteristics of receptor-like cytoplasmic protein kinases in maize, rice and arabidopsis. Int. J. Mol. Sci. 19 (11), 3680. doi: 10.3390/ijms19113680
Gogl, G., Kornev, A. P., Remenyi, A., Taylor, S. S. (2019). Disordered protein kinase regions in regulation of kinase domain cores. Trends Biochem. Sci. 44 (4), 300–311. doi: 10.1016/j.tibs.2018.12.002
Gong, Z., Han, G. Z. (2021). Flourishing in water: The early evolution and diversification of plant receptor-like kinases. Plant J 106 (1), 174–184. doi: 10.1111/tpj.15157
Goodstein, D. M., Shu, S., Howson, R., Neupane, R., Hayes, R. D., Fazo, J., et al. (2012). Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 40 (Database issue), D1178–D1186. doi: 10.1093/nar/gkr944
Hallgren, J., Tsirigos, K. D., Pedersen, M. D., Almagro Armenteros, J. J., Marcatili, P., Nielsen, H., et al. (2022). DeepTMHMM predicts alpha and beta transmembrane proteins using deep neural networks. doi: 10.1101/2022.04.08.487609
Hanks, S. K., Hunter, T. (1995). Protein kinases 6. the eukaryotic protein kinase superfamily: Kinase (catalytic) domain structure and classification. FASEB J. 9 (8), 576–596. doi: 10.1096/fasebj.9.8.7768349
Hanks, S. K., Quinn, A. M., Hunter, T. (1988). The protein kinase family: Conserved features and deduced phylogeny of the catalytic domains. Science 241 (4861), 42–52. doi: 10.1126/science.3291115
Hazak, O., Hardtke, C. S. (2016). CLAVATA 1-type receptors in plant development. J. Exp. Bot. 67 (16), 4827–4833. doi: 10.1093/jxb/erw247
Hirakawa, Y., Sawa, S. (2019). Diverse function of plant peptide hormones in local signaling and development. Curr. Opin. Plant Biol. 51, 81–87. doi: 10.1016/j.pbi.2019.04.005
Jamieson, P. A., Shan, L., He, P. (2018). Plant cell surface molecular cypher: Receptor-like proteins and their roles in immunity and development. Plant Sci. 274, 242–251. doi: 10.1016/j.plantsci.2018.05.030
Jin, J., Pawson, T. (2012). Modular evolution of phosphorylation-based signalling systems. Philos. Trans. R Soc. Lond B Biol. Sci. 367 (1602), 2540–2555. doi: 10.1098/rstb.2012.0106
Johnson, L. N., Noble, M. E., Owen, D. J. (1996). Active and inactive protein kinases: Structural basis for regulation. Cell 85 (2), 149–158. doi: 10.1016/s0092-8674(00)81092-2
Jose, J., Ghantasala, S., Roy Choudhury, S. (2020). Arabidopsis transmembrane receptor-like kinases (RLKs): A bridge between extracellular signal and intracellular regulatory machinery. Int. J. Mol. Sci. 21 (11), 4000. doi: 10.3390/ijms21114000
Kanev, G. K., de Graaf, C., de Esch, I. J. P., Leurs, R., Wurdinger, T., Westerman, B. A., et al. (2019). The landscape of atypical and eukaryotic protein kinases. Trends Pharmacol. Sci. 40 (11), 818–832. doi: 10.1016/j.tips.2019.09.002
Kannan, N., Taylor, S. S. (2008). Rethinking pseudokinases. Cell 133 (2), 204–205. doi: 10.1016/j.cell.2008.04.005
Katoh, K., Standley, D. M. (2013). MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30 (4), 772–780. doi: 10.1093/molbev/mst010
Kobir, A., Shi, L., Boskovic, A., Grangeasse, C., Franjevic, D., Mijakovic, I. (2011). Protein phosphorylation in bacterial signal transduction. Biochim. Biophys. Acta 1810 (10), 989–994. doi: 10.1016/j.bbagen.2011.01.006
Koike, R., Amano, M., Kaibuchi, K., Ota, M. (2020). Protein kinases phosphorylate long disordered regions in intrinsically disordered proteins. Protein Sci. 29 (2), 564–571. doi: 10.1002/pro.3789
Kornev, A. P., Haste, N. M., Taylor, S. S., Eyck, L. F. (2006). Surface comparison of active and inactive protein kinases identifies a conserved activation mechanism. Proc. Natl. Acad. Sci. U.S.A. 103 (47), 17783–17788. doi: 10.1073/pnas.0607656103
Kumar, M., Michael, S., Alvarado-Valverde, J., Meszaros, B., Samano-Sanchez, H., Zeke, A., et al. (2022). The eukaryotic linear motif resource: 2022 release. Nucleic Acids Res. 50 (D1), D497–D508. doi: 10.1093/nar/gkab975
Kurotani, A., Tokmakov, A. A., Kuroda, Y., Fukami, Y., Shinozaki, K., Sakurai, T. (2014). Correlations between predicted protein disorder and post-translational modifications in plants. Bioinformatics 30 (8), 1095–1103. doi: 10.1093/bioinformatics/btt762
Kwon, A., Scott, S., Taujale, R., Yeung, W., Kochut, K. J., Eyers, P. A., et al. (2019). Tracing the origin and evolution of pseudokinases across the tree of life. Sci. Signal 12 (578), eaav3810. doi: 10.1126/scisignal.aav3810
Lal, N. K., Nagalakshmi, U., Hurlburt, N. K., Flores, R., Bak, A., Sone, P., et al. (2018). The receptor-like cytoplasmic kinase BIK1 localizes to the nucleus and regulates defense hormone expression during plant innate immunity. Cell Host Microbe 23 (4), 485–497.e485. doi: 10.1016/j.chom.2018.03.010
Lehti-Shiu, M. D., Shiu, S. H. (2012). Diversity, classification and function of the plant protein kinase superfamily. Philos. Trans. R Soc. Lond B Biol. Sci. 367 (1602), 2619–2639. doi: 10.1098/rstb.2012.0003
Lehti-Shiu, M. D., Zou, C., Hanada, K., Shiu, S. H. (2009). Evolutionary history and stress regulation of plant receptor-like kinase/pelle genes. Plant Physiol. 150 (1), 12–26. doi: 10.1104/pp.108.134353
Letunic, I., Bork, P. (2021). Interactive tree of life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49 (W1), W293–W296. doi: 10.1093/nar/gkab301
Li, Y., Xue, J., Wang, F. Z., Huang, X., Gong, B. Q., Tao, Y., et al. (2022). Plasma membrane-nucleo-cytoplasmic coordination of a receptor-like cytoplasmic kinase promotes EDS1-dependent plant immunity. Nat. Plants 8 (7), 802–816. doi: 10.1038/s41477-022-01195-x
Liang, X., Zhou, J. M. (2018). Receptor-like cytoplasmic kinases: Central players in plant receptor kinase-mediated signaling. Annu. Rev. Plant Biol. 69, 267–299. doi: 10.1146/annurev-arplant-042817-040540
Locascio, L. E., Donoghue, D. J. (2013). KIDs rule: Regulatory phosphorylation of RTKs. Trends Biochem. Sci. 38 (2), 75–84. doi: 10.1016/j.tibs.2012.12.001
Lu, S., Wang, J., Chitsaz, F., Derbyshire, M. K., Geer, R. C., Gonzales, N. R., et al. (2020). CDD/SPARCLE: The conserved domain database in 2020. Nucleic Acids Res. 48 (D1), D265–D268. doi: 10.1093/nar/gkz991
Lu, D., Wu, S., Gao, X., Zhang, Y., Shan, L., He, P. (2010). A receptor-like cytoplasmic kinase, BIK1, associates with a flagellin receptor complex to initiate plant innate immunity. Proc. Natl. Acad. Sci. U.S.A. 107 (1), 496–501. doi: 10.1073/pnas.0909705107
Ma, X., Claus, L. A. N., Leslie, M. E., Tao, K., Wu, Z., Liu, J., et al. (2020). Ligand-induced monoubiquitination of BIK1 regulates plant immunity. Nature 581 (7807), 199–203. doi: 10.1038/s41586-020-2210-3
Man, J., Gallagher, J. P., Bartlett, M. (2020). Structural evolution drives diversification of the large LRR-RLK gene family. New Phytol. 226 (5), 1492–1505. doi: 10.1111/nph.16455
McClendon, C. L., Kornev, A. P., Gilson, M. K., Taylor, S. S. (2014). Dynamic architecture of a protein kinase. Proc. Natl. Acad. Sci. U.S.A. 111 (43), E4623–E4631. doi: 10.1073/pnas.1418402111
Modi, V., Dunbrack, R. L., Jr. (2019). A structurally-validated multiple sequence alignment of 497 human protein kinase domains. Sci. Rep. 9 (1), 19790. doi: 10.1038/s41598-019-56499-4
Nguyen, L. T., Schmidt, H. A., von Haeseler, A., Minh, B. Q. (2015). IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32 (1), 268–274. doi: 10.1093/molbev/msu300
Nolen, B., Taylor, S., Ghosh, G. (2004). Regulation of protein kinases; controlling activity through activation segment conformation. Mol. Cell 15 (5), 661–675. doi: 10.1016/j.molcel.2004.08.024
Oh, E. S., Lee, Y., Chae, W. B., Rameneni, J. J., Park, Y. S., Lim, Y. P., et al. (2018). Biochemical analysis of the role of leucine-rich repeat receptor-like kinases and the carboxy-terminus of receptor kinases in regulating kinase activity in arabidopsis thaliana and brassica oleracea. Molecules 23 (1), 236. doi: 10.3390/molecules23010236
Park, C. J., Ronald, P. C. (2012). Cleavage and nuclear localization of the rice XA21 immune receptor. Nat. Commun. 3, 920. doi: 10.1038/ncomms1932
Paulus, J. K., van der Hoorn, R. A. L. (2018). Tricked or trapped-two decoy mechanisms in host-pathogen interactions. PloS Pathog. 14 (2), e1006761. doi: 10.1371/journal.ppat.1006761
Peng, Y., Bartley, L. E., Chen, X., Dardick, C., Chern, M., Ruan, R., et al. (2008). OsWRKY62 is a negative regulator of basal and Xa21-mediated defense against xanthomonas oryzae pv. oryzae in rice. Mol. Plant 1 (3), 446–458. doi: 10.1093/mp/ssn024
Proost, S., Van Bel, M., Sterck, L., Billiau, K., Van Parys, T., Van de Peer, Y., et al. (2009). PLAZA: A comparative genomics resource to study gene and genome evolution in plants. Plant Cell 21 (12), 3718–3731. doi: 10.1105/tpc.109.071506
Schulze, B., Mentzel, T., Jehle, A. K., Mueller, K., Beeler, S., Boller, T., et al. (2010). Rapid heteromerization and phosphorylation of ligand-activated plant transmembrane receptors and their associated kinase BAK1. J. Biol. Chem. 285 (13), 9444–9451. doi: 10.1074/jbc.M109.096842
Shiu, S. H., Bleecker, A. B. (2001). Receptor-like kinases from arabidopsis form a monophyletic gene family related to animal receptor kinases. Proc. Natl. Acad. Sci. U.S.A. 98 (19), 10763–10768. doi: 10.1073/pnas.181141598
Shiu, S. H., Bleecker, A. B. (2003). Expansion of the receptor-like kinase/Pelle gene family and receptor-like proteins in arabidopsis. Plant Physiol. 132 (2), 530–543. doi: 10.1104/pp.103.021964
Shiu, S. H., Karlowski, W. M., Pan, R., Tzeng, Y. H., Mayer, K. F., Li, W. H. (2004). Comparative analysis of the receptor-like kinase family in arabidopsis and rice. Plant Cell 16 (5), 1220–1234. doi: 10.1105/tpc.020834
Song, W.-Y., Wang, G.-L., Chen, L.-L., Kim, H.-S., Pi, L.-Y., Holsten, T., et al. (1995). A receptor kinase-like protein encoded by the rice disease resistance gene, Xa21. Science 270, 1804. doi: 10.1126/science.270.5243.1804
Steichen, J. M., Iyer, G. H., Li, S., Saldanha, S. A., Deal, M. S., Woods, V. L., Jr., et al. (2010). Global consequences of activation loop phosphorylation on protein kinase a. J. Biol. Chem. 285 (6), 3825–3832. doi: 10.1074/jbc.M109.061820
Sun, Y., Li, L., Macho, A. P., Han, Z., Hu, Z., Zipfel, C., et al. (2013). Structural basis for flg22-induced activation of the arabidopsis FLS2-BAK1 immune complex. Science 342 (6158), 624–628. doi: 10.1126/science.1243825
Taylor, S. S., Keshwani, M. M., Steichen, J. M., Kornev, A. P. (2012). Evolution of the eukaryotic protein kinases as dynamic molecular switches. Philos. Trans. R Soc. Lond B Biol. Sci. 367 (1602), 2517–2528. doi: 10.1098/rstb.2012.0054
Taylor, S. S., Kornev, A. P. (2011). Protein kinases: Evolution of dynamic regulatory proteins. Trends Biochem. Sci. 36 (2), 65–77. doi: 10.1016/j.tibs.2010.09.006
Taylor, S. S., Meharena, H. S., Kornev, A. P. (2019). Evolution of a dynamic molecular switch. IUBMB Life 71 (6), 672–684. doi: 10.1002/iub.2059
van der Hoorn, R. A., Kamoun, S. (2008). From guard to decoy: a new model for perception of plant pathogen effectors. Plant Cell 20 (8), 2009–2017. doi: 10.1105/tpc.108.060194
Wang, N., Tang, C., Fan, X., He, M., Gan, P., Zhang, S., et al. (2022). Inactivation of a wheat protein kinase gene confers broad-spectrum resistance to rust fungi. Cell 185 (16), 2961–2974. doi: 10.1016/j.cell.2022.06.027
Wright, P. E., Dyson, H. J. (2015). Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol. 16 (1), 18–29. doi: 10.1038/nrm3920
Wubben, J. M., Atkinson, S. C., Borg, N. A. (2020). The role of protein disorder in nuclear transport and in its subversion by viruses. Cells 9 (12), 2654. doi: 10.3390/cells9122654
Yamaguchi, K., Yamada, K., Kawasaki, T. (2013). Receptor-like cytoplasmic kinases are pivotal components in pattern recognition receptor-mediated signaling in plant immunity. Plant Signal Behav. 8 (10), e25662. doi: 10.4161/psb.25662
Yoo, S. D., Cho, Y. H., Sheen, J. (2007). Arabidopsis mesophyll protoplasts: A versatile cell system for transient gene expression analysis. Nat. Protoc. 2 (7), 1565–1572. doi: 10.1038/nprot.2007.199
Zhang, H., Zhang, F., Yu, Y., Feng, L., Jia, J., Liu, B., et al. (2020). A comprehensive online database for exploring approximately 20,000 public arabidopsis RNA-seq libraries. Mol. Plant 13 (9), 1231–1233. doi: 10.1016/j.molp.2020.08.001
Keywords: nuclear localization signal, kinase insert domain, receptor-like kinase (RLK), evolution, intrisically disodered region, land plant adaptation, receptor-like cytoplasmic kinase (RLCK)
Citation: Yayen J, Chan C, Sun C-M, Chiang S-F and Chiou T-J (2023) Conservation of land plant-specific receptor-like cytoplasmic kinase subfamily XI possessing a unique kinase insert domain. Front. Plant Sci. 14:1117059. doi: 10.3389/fpls.2023.1117059
Received: 06 December 2022; Accepted: 30 January 2023;
Published: 22 February 2023.
Edited by:
Ning Li, Hong Kong University of Science and Technology, Hong Kong SAR, ChinaReviewed by:
Yasuhiro Kadota, Riken Center for Sustainable Resource Science (CSRS), JapanDongping Lu, Center for Agricultural Resources Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences (CAS), China
Copyright © 2023 Yayen, Chan, Sun, Chiang and Chiou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tzyy-Jen Chiou, tjchiou@gate.sinica.edu.tw
†ORCID: Tzyy-Jen Chiou, orcid.org/0000-0001-5953-4144