- School of Agricultural Science and Engineering, Liaocheng University, Liaocheng, China
Very long chain fatty acids (VLCFAs) are fatty acids with chain lengths of 20 or more carbon atoms, which are the building blocks of various lipids that regulate developmental processes and plant stress responses. 3-ketoacyl-CoA synthase encoded by the KCS gene is the key rate-limiting enzyme in VLCFA biosynthesis, but the KCS gene family in soybean (Glycine max) has not been adequately studied thus far. In this study, 31 KCS genes (namely GmKCS1 - GmKCS31) were identified in the soybean genome, which are unevenly distributed on 14 chromosomes. These GmKCS genes could be phylogenetically classified into seven groups. A total of 27 paralogous GmKCS gene pairs were identified with their Ka/Ks ratios indicating that they had undergone purifying selection during soybean genome expansion. Cis-acting element analysis revealed that GmKCS promoters contained multiple hormone- and stress-responsive elements, indicating that GmKCS gene expression levels may be regulated by various developmental and environmental stimuli. Expression profiles derived from RNA-seq data and qRT-PCR experiments indicated that GmKCS genes were diversely expressed in different organs/tissues, and many GmKCS genes were found to be differentially expressed in the leaves under cold, heat, salt, and drought stresses, suggesting their critical role in soybean resistance to abiotic stress. These results provide fundamental information about the soybean KCS genes and will aid in their further functional elucidation and exploitation.
Introduction
Fatty acids with a hydrocarbon chain lengths of 20 or more carbon atoms are known as VLCFAs (Scott et al., 2022). The VLCFAs and their derivatives are the main components of numerous lipids that function in specific cell/tissue types, such as triacylglycerols that accumulate in the seeds, sphingolipids, and phospholipids that are essential for membrane homeostasis, cuticular waxes deposited on plant aerial surfaces, and suberin monomers that are indispensable for root development (Haslam and Kunst, 2013). Cuticular wax forms a hydrophobic layer covering most aerial plant surfaces. It plays a vital role in preventing excessive water loss, regulating transpiration, resistance against pathogen invasion, and tolerance against various environmental stresses (Kong et al., 2020; Arya et al., 2021; Batsale et al., 2021). The biosynthesis of VLCFAs is catalyzed by a fatty acid acyl-CoA elongase (FAE) multienzyme complex comprising four enzymes, including 3-ketoacyl-CoA synthase (KCS), β-ketoacyl-CoA reductase (KCR), β-hydroxy acyl-CoA dehydratase (HCD) and trans-2,3-enoyl-CoA reductase (ECR) (Bach and Faure, 2010; Kim et al., 2013). Using the plastid-generated C16 or C18 fatty acids as substrates, the endoplasmic reticulum-localized FAE sequentially adds two carbon units to a growing acyl chain through a repetitive cycle of condensation, keto-reduction, dehydration and enoyl-reduction (Kunst and Samuels, 2009). During the elongation process of VLCFAs, KCS catalyzes the condensation of C16 or C18 fatty acids with malonyl-CoA. Moreover, KCS exhibits strict substrate specificity and tissue specificity, and it is the key rate-limiting enzyme that determines the ultimate chain length and contents of VLCFAs (Shi et al., 2017; Ozseyhan et al., 2018). In contrast, the other three enzymes do not directly control VLCFA synthesis (Kunst and Samuels, 2009).
The first identified and characterized gene in the KCS family was FAE1/AtKCS18, specifically expressed in developing seeds, which catalyzes the synthesis of C20 and C22 fatty acids for energy storage in Arabidopsis thaliana (James et al., 1995; Rossak et al., 2001). Subsequently, numerous KCS genes have been cloned and functionally characterized in plants. The KCS proteins have two essential and conserved domains: the FAE1/Type III polyketide synthase-like protein domain (FAE1_CUT1_RppA) and the 3- Oxoacyl-[acyl-carrier-protein (ACP)] synthase III C-terminal domain (ACP_syn_III_C) (Sagar et al., 2015). KCS gene families have been identified in various higher plants, and the number of these genes varies greatly between species (Guo et al., 2016). For instance, 21 KCS genes were identified in Arabidopsis thaliana (Costaglioli et al., 2005; Joubès et al., 2008), 22 in rice (Oryza sativa; Yang et al., 2023), 25 in sorghum (Sorghum bicolor; Zhang et al., 2022a), 26 in maize (Zea mays; Campbell et al., 2019), 28 in apple (Malus × domestica Borkh.; Lian et al., 2020), 30 in peanut (Arachis hypogaea; Huai et al., 2020) and cabbage (Brassica oleracea; Xue et al., 2020), 33 in turnip (Brassica rapa; Xue et al., 2020) and barley (Hordeum vulgare; Tong et al., 2021), and as many as 58 in upland cotton (Gossypium hirsutum; Xiao et al., 2016) and rapeseed (Brassica napus; Xue et al., 2020).
The KCS gene family in Arabidopsis thaliana has been the most thoroughly studied and characterized in terms of physiological and molecular functions. It is divided into four classes (KCS1-like, FAE-like, FDH-like, CER6) or eight subclasses (α, β, γ, δ, ε, ζ, η, and θ) according to their phylogeny and duplication history (Costaglioli et al., 2005; Joubès et al., 2008). Previous studies have demonstrated that they catalyze the formation of diverse VLCFAs and play important roles in Arabidopsis developmental and metabolic processes, such as cuticular wax production (AtKCSs 1, 2, 3, 4, 5, 6, 9, 16 and 20) (Todd et al., 1999; Fiebig et al., 2000; Kim et al., 2013; Hegebarth et al., 2017; Kim et al., 2021; Huang et al., 2022; Huang et al., 2023), suberin metabolism in roots (AtKCSs 2, 9 and 20) (Franke et al., 2009; Lee et al., 2009; Kim et al., 2013), oil biosynthesis in seeds (AtKCS18) (James et al., 1995), and development of epidermis (AtKCS10 and AtKCS13) (Gray et al., 2000; Pruitt et al., 2000). Moreover, several AtKCS genes have been confirmed to play crucial roles in abiotic stress responses and tolerance. For example, AtKCS2 and AtKCS20 expression levels were drastically elevated under various abiotic stresses, indicating that they may be involved in stress adaptation and tolerance (Lee et al., 2009; Lee and Suh, 2015a). AtKCS1 was reported to be an important player involved in chilling tolerance (Chen et al., 2020c), and AtKCS4 plays a key role in the regulation of lipid metabolism under heat and dark conditions (Luzarowska et al., 2023).
In other species, particularly in crop plants, numerous KCS genes have been demonstrated to be vital for developmental and stress tolerance mechanisms. LeCER6 is an essential KCS gene that regulates fruit cuticular wax production in tomato (Smirnova et al., 2013). The maize GL4, a homolog of AtKCS6, is involved in cuticular wax accumulation in seedling leaves (Lee and Suh, 2015b). HMS1 expression increased pollen viability under low moisture stress in rice (Chen et al., 2020b). Overexpression of GhKCS13 enhanced the sensitivity to cold stress of transgenic cotton plants by modulating lipid and oxylipin biosynthesis (Wang et al., 2020a). Ectopic expression of VvKCS11 from grape (Vitis vinifera) or CcKCS6 from navel orange (Citrus clementina) enhanced the tolerance of transgenic Arabidopsis plants to various abiotic stresses (Guo et al., 2020; Yang et al., 2020). MtKCS12, predominantly expressed in the Medicago truncatula seed coat, controls the production of very long-chain lipid species in the seed coat and is essential for seed physical dormancy (Chai et al., 2021).
Soybean (Glycine max) is an economically important food and oil crop with high nutritional value, with its seed oil accounting for about 30% of all vegetable oil consumption worldwide (Zhan et al., 2020). It provides abundant nutrients for human nutrition, has various industrial uses, and is considered a suitable candidate crop for biodiesel production (Woyann et al., 2019). Moreover, soybean is one of the most commonly cultivated crops in arid and semi-arid areas where its growth and yield are usually hindered by various abiotic stresses, such as water deficiency, salt, and temperature extremes (Deshmukh et al., 2014; Gavili et al., 2018). To the best of our knowledge, there are limited studies regarding the soybean KCS gene family to date. As the KCS family genes are known to be crucial players in regulatory networks controlling biological and developmental processes as well as stress tolerance of plants, a thorough investigation of the KCS gene family in soybean is of great importance. In this study, 31 soybean KCS genes were identified using bioinformatics techniques. Their chromosomal distribution, gene structure, conserved motif distribution, gene synteny, and phylogenetic relationships were systematically analyzed. To further analyze the prospective functions of soybean’s KCS genes, their tissue-specific expression profiles were examined, as well as their response to cold, heat, salt, and drought stresses. These findings establish the groundwork for understanding the functions of the GmKCS gene family and offer crucial insights into their involvement in soybean’s stress tolerance mechanisms.
Materials and methods
Identification of the KCS genes in the genome of soybean
The Hidden Markov Models search tool (HMMER ver3.3.2; http://hmmer.org/) and Basic Local Alignment Search Tool for proteins (BLASTP; https://blast.ncbi.nlm.nih.gov/Blast.cgi) were used to retrieve and obtain the KCS sequences from the soybean genome (Wm82.a4.v1). The gff3, proteins, coding sequence, and genome files of soybean were downloaded from the phytozome database (https://phytozome-next.jgi.doe.gov/). The hidden Markov model (HMM) profiles of the two structural domains of KCS proteins, ACP_syn_III_C (Pfam: PF08541) and the FAE1_CUT1_RppA (Pfam: PF08392), were obtained from the Pfam database (http://pfam.xfam.org/). The 21 KCS protein sequences of Arabidopsis thaliana were downloaded from the Arabidopsis Information Resource (TAIR) database (https://www.arabidopsis.org/) and used for BLASTP comparisons against the soybean protein database. After integrating the results of HMMER and BLASTP searches, the non-redundant protein sequences were submitted to NCBI CD-search (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) and the SMART server (https://smart.embl-heidelberg.de/) to investigate whether the two conserved domains are present. Proteins including both ACP_syn_III_C and the FAE1_CUT1_RppA domains were considered as KCS gene family members in soybean, and they were named according to their chromosomal location.
Prediction of the physicochemical properties, subcellular localization, and TAH of GmKCS proteins
The ProtParam tool in the ExPASy website (https://web.expasy.org/protparam/) was used to predict the physical and chemical properties of GmKCS proteins, such as molecular weight, isoelectric point, instability index, aliphatic index, and the grand average of hydropathy index (GRAVY). The subcellular location and the transmembrane alpha-helix (TAH) of GmKCS proteins were predicted using the WoLF PSORT web server (https://wolfpsort.hgc.jp/) and the TMHMM website (https://services.healthtech.dtu.dk/service.php?TMHMM-2.0), respectively.
Chromosomal localization and synteny analysis of GmKCS genes
The chromosome locations of GmKCS genes were retrieved from soybean genome annotation files. In the synteny analysis, all-against-all comparisons were performed in the soybean genome using BLASTP search with E-values less than 10−10 to obtain information on the gene pairs. Next, McScanX in TBtools (Chen et al., 2020a) was employed based on the BLASTP results to analyze synthetic blocks and gene duplication events. A circular map of GmKCS gene pairs was drawn using the Advanced Circos plugin in TBtools. Duplicated GmKCS gene pairs were displayed by connected red solid lines. The Ka and Ks values of the duplicated GmKCS gene pairs were calculated using the Ka/Ks Calculator in TBtools to analyze the divergence of duplicated genes. The duplication time was calculated according to a published method using the following formula: Time = Ks/(2 × 6.1 × 10−9) (Fan et al., 2013).
Phylogenetic analysis of KCS proteins
Alignment and phylogenetic analysis of KCS protein sequences from Arabidopsis (Joubès et al., 2008), turnip (Xue et al., 2020), soybean, rice (Yang et al., 2023) and sorghum (Zhang et al., 2022a) was performed using the MEGA 11.0 software (https://www.megasoftware.net/). The aligned sequences were subjected to neighbor-joining (NJ) tree construction with 1000 bootstrap replications, pairwise deletions, based on the Poisson model (Kumar et al., 2016).
Analysis of gene structure, cis-acting elements and conserved motif of GmKCS genes
The gff3 file of soybean was used to analyze the nucleotide sequence structure of GmKCS genes. The exon-intron distribution chart was generated using the TBtools software. The 2.0 kb upstream sequence from each GmKCS gene was selected as the promoter region and extracted from the soybean genome. The PlantCARE website (https://bioinformatics.psb.ugent.be/webtools/plantcare/html/) was used to predict the cis-acting elements. The functions and sequences of putative cis-acting elements of GmKCS genes were categorized into hormone- and stress-responsive elements. Conserved motifs of the GmKCS proteins were identified using the Multiple Expectation Maximization for Motif Elicitation (MEME) online website (http://meme-suite.org/) with the following parameters: a maximum number of 10 motifs and an optimum motif width of 6 - 200 amino acid residues. TBtools software was used to visualize the conserved motifs.
Expression pattern of GmKCS genes in various tissues/organs
RNA-seq data from nine tissues/organs of soybean were downloaded from the Phytozome database, including the roots, root hairs, nodules, stems, leaves, shoot apical meristem (SAM), pods, and seeds. The expression levels of each gene were represented by its fragments per kilobase of exon per million fragments mapped (FPKM) values, and a heatmap showing tissues-specific expression profiles was generated using the log2-transformed (FPKM + 1) values of GmKCS genes in the TBtools software.
Plant growth and stress treatments of soybean seedlings
The seeds of soybean cultivar Williams 82 were germinated on absorbent paper for 3 days and then sown on a planting tray for culture with 1/2 Hoagland nutrient solution under a 16-h-light/8-h-dark photoperiod at 25°C/18°C (day/night). After growing to the opening of the first leaf (V1 stage, Fehr and Caviness, 1977), plants with consistent growth were selected and subjected to four abiotic stress treatments. For cold and heat stress, the seedlings were exposed to 4°C and 37°C, respectively (Liu et al., 2015; Wang et al., 2016b). For salt stress, the seedlings were cultured in a 150 mM NaCl solution (Hu et al., 2022). For drought stress, the seedlings were hydroponically cultured in a 20% PEG6000 (w/v) solution (Wang et al., 2021). The control group was cultured in a 1/2 Hoagland nutrient solution. After 24 hours of treatment, the soybean leaves were sampled separately and frozen immediately in liquid nitrogen, then stored at –80°C until RNA extraction. Each sample was collected from five individual plants, and the biological assays were repeated independently three times.
RNA extraction, cDNA synthesis, and qRT-PCR analysis
Total RNA was extracted from leaves at different plant growth stages under the stress and control treatments using an RNA plant extraction kit (TSINGKE, cat.TSP401, China) according to the manufacturer’s instructions. Briefly, leaves were ground into fine powder in liquid nitrogen, and then 100 mg of powder was transferred to 0.45 µl Buffer RL. The RNA was purified using the RNase-Free Columns CS and finally eluted in 30 µl of RNase-free ddH2O. One microgram of RNA was reverse-transcribed to generate the first-strand cDNA using a Prime Script RT reagent kit (TSINGKE, cat. TSK301S, China) and stored at –20 °C until use. The quantitative real-time polymerase chain reaction (qRT-PCR) was carried out for each sample using a SYBR Green qRT-PCR kit (TSINGKE, cat.TSE201, China) with a LightCycler480 instrument (Roche). The qRT-PCR amplification program used was set as follows: stage 1, initial denaturation at 95°C for 30 s; stage 2, 40 cycles of 95°C for 5 s (denaturation), 60°C for 30 s (annealing); stage 3, melting curve. Relative quantification was performed using the 2−ΔΔCt method (Derveaux et al., 2010) using the GmTub gene (Glyma.05G157300, Wang et al., 2016a) as the internal control. The data were analyzed and illustrated using Graphpad Prism 8.0 (https://www.graphpad.com/). The values were the average from three technical measurements, and the data were presented as mean ± standard error (SE). Gene-specific primers used in this study are listed in Supplementary Table S1. Statistical significance of the data was analyzed using independent-samples t-test. Error bars indicate SE and p-value < 0.05 (*) or < 0.01 (**).
Results
Identification of KCS genes in soybean and characterization of their protein physicochemical properties
After integrating the search results of HMMER and BLASTP, 56 non-redundant proteins were obtained and subjected to conservative domain identification (Supplementary Table S2). A total of 31 proteins were found to contain both the ACP_syn_III_C and the FAE1_CUT1_RppA domains, and they were considered as KCS gene family members of soybean. The corresponding genes were designated as GmKCS1 to GmKCS31 according to their positions on soybean chromosomes (Table 1; Figure 1). The nucleotide sequence length of GmKCS genes varied from 1332 bp (GmKCS7) to 6891 bp (GmKCS6), and their coding sequence ranged from 1332 bp (GmKCS7) to 1611 bp (GmKCS3). Furthermore, the physicochemical properties of the GmKCS proteins were characterized. GmKCS7 and GmKCS3 were the shortest and the longest GmKCS proteins, containing 443 and 536 amino acid residues, respectively. The GmKCS protein molecular weight ranged from 49.65 kDa (GmKCS11) to 60.95 (GmKCS3) kDa, and the theoretical isoelectric point ranged from 8.49 (GmKCS17) to 9.40 (GmKCS12), indicating that all GmKCS proteins tend to be weakly basic. Analysis of the instability index indicated that nine proteins are potentially unstable (> 40), while the remaining 22 are probably stable proteins (ranging from 31.70 to 39.88). The aliphatic index, which ranged from 86.54 (GmKCS8) to 103.78 (GmKCS29), indicated the thermal stability of GmKCS proteins. The GRAVY values of GmKCS proteins ranged from -0.228 (GmKCS3) to 0.093 (GmKCS29), suggesting that they are amphoteric proteins (Table 1).
Figure 1 Chromosome location and the syntenic relationships among GmKCS genes. The positions of the GmKCS genes on the chromosomes are shown on the outside. The green colored boxes indicate the different chromosomes (Chr1-Chr20) of soybean. The red lines connecting genes from different chromosomes represent 27 segmental duplication events among the GmKCS genes.
Subcellular localization and TAH domains of GmKCS proteins
The subcellular localization prediction analysis revealed that GmKCS proteins were mainly localized in the plasma membrane, endoplasmic reticulum, chloroplast, and vacuole (Supplementary Figure S1), which suggested that GmKCS proteins might be mostly expressed and function in these organelles. The KCS proteins have been reported to have transmembrane localization, and their transmembrane-spanning domains are necessary for their biological function (Ghanevati and Jaworski, 2002). Prediction of the TAH domains of GmKCS proteins suggested that most of them (14, 45.16%) had 2 TAHs, 7 had 3 TAHs, 5 had 1 TAH, and GmKCS13/17/21/22/29 had no TAH domains (Supplementary Figure S2). Most TAH domains of the GmKCS proteins are located in the 100 amino acid residues of the N-terminus, and only a few are located in the middle of the protein sequence (Supplementary Figure S2).
Chromosomal distribution and duplication analysis of GmKCS genes
To map the chromosomal distribution of GmKCS genes, their physical location on the chromosomes was investigated. As shown in Figure 1, GmKCS genes were unevenly distributed on 14 of the 20 soybean chromosomes. Chr10 had the highest number of GmKCS genes (5), followed by Chr15 (4) and Chr20 (4), Chr6 (3) and Chr17 (3). Each of Chr4, Chr5, and Chr13 had 2 GmKCS genes, and only one GmKCS gene was located on each of Chr2, Chr3, Chr8, Chr11, Chr12, and Chr14. It is well-accepted that segmental and tandem duplications are two important mechanisms driving the expansion of gene families (Lynch and Conery, 2000). To further investigate the gene duplication events within the GmKCS family, a collinearity relationship analysis was performed. Twenty-seven pairs of segmentally duplicated genes were identified (Figure 1). The nonsynonymous (Ka) and synonymous (Ks) substitution ratios (Ka/Ks) were calculated to evaluate the selection pressure on GmKCS gene duplications. The Ka/Ks ratios for the 27 duplicated pairs ranged from 0.03 to 0.29 (Supplementary Table S3), implying that GmKCS genes evolved under strong purifying selection. Furthermore, it was estimated that the duplication events in the GmKCS gene family might have occurred 8.80 to 143.53 million years ago (Mya) (Supplementary Table S3).
Phylogenetic analysis of GmKCS genes
To elucidate the evolutionary relationships of the KCS genes, a total of 131 KCS protein sequences from five species were used to construct a phylogenetic tree, inlcuding 21 from Arabidopsis, 33 from turnip, 31 from soybean, 21 from rice, and 25 from sorghum (Supplementary Data Sheet 1). As shown in Figure 2 and Table 2, these KCSs were divided into nine groups (α-θ), which were named based on the eight Arabidopsis KCS subclasses (Joubès et al., 2008). Interestingly, group β only existed in in two cruciferous plants, Arabidopsis and turnip. Groups α, γ, δ, ε, ζ and θ existed in all five species, while group λ and η only existed in monocotyledonous and dicotyledonous plants, respectively. GmKCS proteins were disproportionally distributed in seven groups except for groups β and λ. Group ζ contained the highest number (10) of GmKCSs, followed by group θ (8), ϵ (4), α and γ (3 each), η (2), and δ (1) (Figure 2; Table 2).
Figure 2 Phylogenetic analysis of KCS proteins from Glycine max (Gm) and Arabidopsis thaliana (At). 21 AtKCS and 31 GmKCS were aligned with the Clustal W program, and a neighbor-joining phylogenetic tree with 1000 bootstrap replications was constructed using the MEGA 11.0 software.
Table 2 The total number of KCS genes in each group of Arabidopsis, turnip, soybean, rice and sorghum.
Gene structure and motif composition of GmKCSs
To analyze the structural composition of GmKCS genes, their exon-intron arrangements were obtained by comparing each gene’s coding sequence and DNA sequences. The results showed that most GmKCS genes were relatively simple in structure, e.g., the number of introns ranged from 0 to 3. Most GmKCS genes (21, 67.74%) contained no introns in their open reading frame sequences. Five genes (GmKCSs 1, 11, 12, 26, and 30) from group ζ contained one intron, four genes (GmKCSs 3, 8, 19, and 24) from group ϵ and one (GmKCS6) from group ζ contained two introns (Figures 3A, B). Motif analysis was conducted to identify the conserved features of GmKCS proteins, with ten conserved motifs identified, ranging from 15 to 113 amino acid residues in length (Figure 3C; Supplementary Figure S3). Motifs 1, 2, 3, 4, and 8 were present in all GmKCS proteins, among which motif 1 corresponds to the ACP_syn_III_C domain, while motifs 2, 3, and 4 correspond to a part of the FAE1_CUT1_RppA domain. Although motifs 5, 9, and 10 were present in different GmKCS proteins, they were also included in the FAE1_CUT1_RppA domain, indicating that sequence variation has occurred in the conserved domains of GmKCS proteins. Motifs 6 and 7 were present in the N-terminus of most group α, γ, δ, ζ, and ε GmKCS proteins (except GmKCSs 6, 7, and 26) but were absent in the members of the other two groups (Figure 3C; Supplementary Figure S3).
Figure 3 Schematic diagrams of GmKCS genes and motif composition of GmKCS proteins. (A) A rootless neighbor-joining tree was constructed based on the complete sequences of the 31 GmKCS proteins using the MEGA11.0 software. (B) Structural analyses of the exon-intron boundaries of the GmKCS genes with the yellow boxes and the black lines indicating the exons and introns, respectively. (C) Distribution map of the conserved motifs in GmKCS proteins. The ten putative motifs and the FAE1_CUT1_RppA and ACP_syn_III_C domains are represented by different colored boxes and red and blue lines, respectively. The length of the exon-intron junctions and amino acid sequences is inferred by the ruler at the bottom.
Distribution of cis-acting elements in GmKCS gene promoters
To explore the potential regulatory cues of gene’s expression, 1.5 - 2.0 kb sequences upstream of the start codon were often considered as its promoter sequence and used for cis-acting elements analysis (Lian et al., 2020; Zhang et al., 2022a; Sun et al., 2023; Yang et al., 2023). In this study, 2.0 kb sequences upstream of the start codon of GmKCS genes were extracted and submitted to the PlantCARE website. A total of 18 cis-acting elements that related to hormone- and stress-responsiveness were obtained (Figure 4). Among the hormone-responsive elements, the number of abscisic acid-responsive elements was the highest (79), followed by the ethylene- (68) and salicylic acid-responsive elements (36). Cis-acting elements involved in abscisic acid-, ethylene-, and salicylic acid-responsiveness were found in the promoter of 25, 23, and 22 GmKCS genes, respectively. The promoters of 11, 15, and 17 GmKCS genes contain auxin-, methyl jasmonate-, and gibberellin-responsive elements, respectively. All GmKCS promoters contained at least one type of hormone-responsive element, and GmKCS29 contained all the hormone-responsive elements mentioned above. Five stress-responsive elements were identified in GmKCS promoters. Among them, the number of anaerobic induction-responsive elements was the highest (53), followed by defense/stress- (51), wound- (36), drought- (27), and cold-responsive elements (16). These elements were found in 24, 20, 20, 17, and 15 GmKCS promoters, respectively. All GmKCS promoters contained at least two types of stress-responsive elements (Figure 4).
Figure 4 Cis-acting elements in the promoter regions of the 31 GmKCS genes. The color and number of the grid indicated numbers of different cis-acting elements in GmKCS genes.
Expression profiles of GmKCS genes in different tissues of soybean
To further investigate and elucidate the roles of GmKCS genes in soybean development, their expression profiles were analyzed in nine soybean tissues/organs based on published data collected from the Phytozome database. The gene expression values are represented by FPKM. Genes with FPKM values lower than 1.00 were considered to be not expressed, and the others were considered as low (1.00 ≤ FPKM < 5.00), moderate (5.00 ≤ FPKM < 15.00), and highly (FPKM ≥ 15.00) expressed genes (Girard et al., 2017). As shown in Figure 5 and Supplementary Table S4, GmKCS genes exhibited different expression profiles among the nine soybean tissues/organs, regardless of whether they belonged to the same group. Except for GmKCSs 22, 23, 24, and 29, the other 27 GmKCS genes are expressed in at least one of the nine tissues/organs. GmKCSs 2, 5, 7, 20, 25, and 27 were expressed at different levels throughout the soybean plant, indicating that they could be widely involved in multiple developmental processes in soybean. GmKCSs 3, 4, 8, 14, 25, and 27 were highly expressed in the roots, and GmKCSs 3, 14, and 28 were expressed at high levels in all aerial plant parts except for the seeds, suggesting that these genes play a critical role in soybean vegetative growth. Several genes are expressed predominantly in specific tissues/organs, such as GmKCSs 6, 15, and 26, expressed explicitly in the SAM, and GmKCS17 expressed in the leaves, implying that these genes play an indispensable role in tissue-specific developmental and metabolic processes.
Figure 5 Expression profiles of the GmKCS genes. The transcription levels of GmKCS genes in nine selected tissues or organs of soybean plants were analyzed based on data collected from Phytozome. A heatmap was generated using TBtools. The color scale from blue to red indicates the increased expression levels of the genes.
Expression of GmKCS genes in response to abiotic stress
To further elucidate the putative functions of GmKCS genes in response to abiotic stresses, we examined, using qRT-PCR, the expression patterns of 15 leaf-expressed GmKCS genes in soybean leaves subjected to cold, heat, salt, and drought, 24 h after the stress treatments. A greater than two-fold change in the transcriptional levels of a gene in any of the treated samples relative to that of the control and P < 0.05 was considered to be significantly differentially expressed. As shown in Figure 6, GmKCSs 4, 14, and 28 were differentially expressed under all four abiotic stresses, indicating that they may be essential for enhancing soybean plants’ tolerance under stress conditions. Among the three genes, GmKCS14 and GmKCS28 were significantly down-regulated under all four stress conditions. GmKCS4 was significantly up-regulated by salt and drought stress, while it was repressed under cold- and heat-induced stress. Under cold stress, GmKCSs 4, 7, 12, 14, 17, 20, 27, and 28 were down-regulated. The relative expression of GmKCS5 was induced by 4.77-fold compared to the control, while the other 6 genes were not differentially expressed. Under heat stress, the expression level of GmKCSs 4, 14, 20, and 28 significantly decreased. Specifically, the relative expression of GmKCS25 increased by 1.74-fold compared to the control, while the expression of the remaining genes did not significantly change. Under salt stress, GmKCS4 and GmKCS5 were up-regulated, with GmKCS4 being drastically up-regulated by 18.39-fold compared to the control. In contrast, GmKCSs 3, 14, 17, and 28 showed varying degrees of down-regulation, and the expression level of the other 9 GmKCS genes did not significantly change under salt stress. Except for GmKCS2 and GmKCS25, the expression level of the other 13 GmKCS genes was significantly altered under drought stress, among which 9 and 4 genes were up- and down-regulated, respectively. Notably, the expression level of GmKCSs 1, 3, 5, 12, and 27 under drought stress showed greater than 5-fold up-regulation compared with the control.
Figure 6 Relative expression levels of GmKCS genes under heat, cold, salt, and drought stress in soybean. qRT-PCR was used to study the expression levels of fifteen leaf-expressed GmKCS genes in triplicates. Error bars indicate SE and p-value < 0.05 (*) or < 0.01 (**).
Discussion
Cuticular wax forms the first protective barrier for the aerial parts of all terrestrial plants and plays a vital role in plant resistance against various biotic and abiotic stresses (Kong et al., 2020; Batsale et al., 2021). VLCFAs and their derivatives, including alkanes, ketones, alcohols, aldehydes, and wax esters, are the main components of cuticular wax. Thus, their content and composition are tightly associated with plant growth, development, and stress tolerance capability (Haslam and Kunst, 2013; Scott et al., 2022). In Arabidopsis, drought, salt and abscisic acid stress treatments lead to a significant increase in the amount of leaf cuticular wax (Kosma et al., 2009). Total leaf cuticular wax of tree tobacco (Nicotiana glauca) increased approximately 2-fold under drought stress, and leaves subjected to drying stress were more resistant to water loss than that of control (Cameron et al., 2006). Various stress conditions can elicit diverse impacts on the content, composition, and structure of plant cuticular wax (Shepherd and Wynne, 2006). The systematic identification and analysis of genes involved in plant cuticular wax synthesis and their expression patterns under stress are of great significance for understanding the molecular regulatory mechanisms of plant response to stress.
3-ketoacyl-CoA synthases, encoded by the KCS gene family, have been confirmed to be the key rate-limiting enzymes that determine the tissue and substrate specificities during VLCFA production, which are present in nearly all plants (Ozseyhan et al., 2018). The genome-wide identification and functional characterization of the KCS gene family have been described in many plant species, and large differences have been observed in the number of KCS family genes (ranging from 7 to 58) in higher plants (Guo et al., 2016; Campbell et al., 2019; Lian et al., 2020; Xue et al., 2020; Zhang et al., 2022a). However, our understanding of the structural and functional properties of the KCS gene family in soybean, one of the most important food and oil crops worldwide, is greatly limited. Identification and characterization of the soybean KCS family genes at the whole genome level could provide fundamental information for their functions and facilitate the genetic improvement of soybean.
In this study, a total of 31 KCS genes were identified in the soybean genome (Table 1; Figure 1), which is the highest number among all the diploid plants analyzed to date (Joubès et al., 2008; Guo et al., 2016; Campbell et al., 2019; Zhang et al., 2022a; Yang et al., 2023). A higher gene number occurrence has also been reported for other gene families in soybean, such as the B-box (Shan et al., 2022), the cytokinin oxidase (Du et al., 2023), and the actin-depolymerizing factor gene families (Sun et al., 2023). The KCS gene family expansion observed in soybean is possibly due to the two whole genome duplication events that occurred during soybean evolution (Schmutz et al., 2010). It appears that the KCS family genes have been significantly subjected to duplication events and expansion in soybean. Tandem and segmental duplications are considered to be the two main modes of gene family expansion during species evolution (Lynch and Conery, 2000). The former often results in a cluster of genes in the same or adjacent intergenic regions (Lynch and Conery, 2000), which has been observed among KCS genes in many species, such as Arabidopsis (Joubès et al., 2008), apple (Lian et al., 2020), sorghum (Zhang et al., 2022a) and barely (Tong et al., 2021). However, segmental duplications are the main driving force in KCS gene family expansion. In this study, a synteny analysis showed that 27 duplicated gene pairs covering 28 GmKCS genes were found to be generated by segmental duplication (Figure 1), which was also supported by their similar exon-intron structures (Figure 3). However, no tandem duplication event was identified as responsible for the expansion of the GmKCS family in soybean. Therefore, segmental duplications were primarily responsible for the KCS family’s expansion during soybean evolution. Moreover, these duplicated gene pairs were not only clustered into the same branches of the phylogenetic tree but also exhibited low Ka/Ks ratios (Figure 1; Figure 2; Supplementary Table S3), implying that GmKCS genes underwent a strong purifying selection during evolution and their functions might be evolutionarily conserved.
Generally, genes clustered in the same evolutionary clade are thought to perform similar biological functions (Liu et al., 2018). Based on their phylogenetic relationships, 131 KCS proteins from Arabidopsis, turnip, soybean, rice and sorghum were clustered into nine groups (Figure 2), consistent with previously published reports (Joubès et al., 2008; Xiao et al., 2016; Xue et al., 2020; Yang et al., 2023). Group λ and η KCS genes are unique to monocotyledonous and dicotyledonous plants, respectively, suggesting that these genes might be formed after the separation of monocots and dicots and functions between these genes may have differentiated (Figure 2; Table 2). Moreover, we found that KCSs in group β only existed in in two cruciferous plants, Arabidopsis and turnip. To explore whether group β KCS genes have been lost only in soybean, rice and sorghum, we searched for articles on the KCS gene family in other plant species. Interestingly, the loss of the β group KCS gene was also reported in cotton (Xiao et al., 2016), apple (Lian et al., 2020), peanut (Huai et al., 2020), barely (Tong et al., 2021), yellow horn (Liu et al., 2022), sorghum (Zhang et al., 2022a), passion fruit (Rizwan et al., 2022), and rice (Yang et al., 2023). Consistent with our results, KCS genes in group β have also been found in rapeseed and cabbage (Xue et al., 2020), implying that group β KCS genes may be unique to cruciferous plants. Genes in the other six groups might have undergone similar evolutionary diversification across plant families and species. The GmKCS gene homologs of known function in Arabidopsis provided clues or references for predicting and exploring the biological function of GmKCS genes (Figure 2). For example, the group α GmKCS genes were clustered with AtKCS4 and AtKCS9, which are widely involved in the biosynthesis of cuticular wax, suberin, sphingolipids, and phospholipids (Kim et al., 2013; Kim et al., 2021). GmKCS2 in the group δ is closely related to AtKCS1, which has been shown to regulate cuticular wax biosynthesis and tolerance to chilling stress in Arabidopsis (Todd et al., 1999; Chen et al., 2020c). The group ζ GmKCS genes are closely related to AtKCS2 and AtKCS20, which are involved in the biosynthesis of cuticular wax and root suberin and tolerance to osmotic stress (Lee et al., 2009; Lee and Suh, 2015a), GmKCS3 and GmKCS8, classified in group ε, are closely related to AtKCS10, which is required for normal development of the epidermis (Pruitt et al., 2000).
FAE1_CUT1_RppA and ACP_syn_III_C are the two major conserved domains in KCS proteins. The former had been reported to contain the motif that specifically binds to the substrate and thus may be related to the substrate specificity of each KCS enzyme, whereas the latter is important for initiating the fatty acid synthase chain reactions (Funa et al., 2002; Sagar et al., 2015). Our motif analysis showed that although the ACP_syn_III_C domain is conserved among GmKCS proteins, diverse sequence variants were found in the FAE1_CUT1_RppA domain (Figure 3C). Moreover, we also found the N-terminal was poorly conserved among GmKCS proteins. For example, there is a significant difference in sequence length before the FAE1_CUT1_RppA domain, and some of the GmKCS contain motifs 6 and 7. In contrast, other gene family members don’t contain any of the motifs (Figure 3C). The different distribution of motifs within KCS proteins was also reported in previous studies (Lian et al., 2020; Liu et al., 2022; Rizwan et al., 2022). Differences in GmKCS protein sequence characteristics imply that functional differentiation through evolution may have occurred, and further research on these sequence variations is needed for their functional elucidation and exploration.
Cis-acting elements play critical roles in signal transduction and regulation of gene transcription initiation (Narusaka et al., 2003). The mechanisms underlying the expression regulation of each KCS gene could be better understood by identifying the upstream cis-acting elements of the GmKCS genes. In agreement with previous reports concerning the KCS gene family (Lian et al., 2020; Rizwan et al., 2022; Zhang et al., 2022a), cis-acting elements involved in hormone- and stress-responsiveness were found to be widely distributed in GmKCS promoters, and all GmKCS promoters contain at least one type of hormone- and stress-responsive elements (Figure 4). Previous studies demonstrated that these elements are involved in gene expression regulation under various stresses (Jiang et al., 2022; Dong et al., 2023; Zhang et al., 2023). Our results suggest that GmKCS expression is closely associated with abiotic stress and hormone-signaling responses.
The diverse expression patterns of KCS genes in different tissues have been described in many plants. For example, all 21 AtKCS genes were found to be expressed but had diverse expression levels and tissue distributions (Joubès et al., 2008). Furthermore, certain rice KCS genes exhibit specific expression in different organs or different stages of growth and development (Yang et al., 2023). Exploring the expression profiles of GmKCS genes may help uncover their potential roles in plant physiological processes and development. We obtained the expression patterns of each GmKCS gene in nine tissues/organs from a public database. 27 out of the 31 GmKCS genes were found to be expressed in at least one tissue/organ, and six of which (i.e., GmKCSs 2, 5, 7, 20, 25 and 27) were differentially expressed throughout the different organs of the soybean plants (Figure 4; Supplementary Table S3). Several members that were constitutive expressed throughout the plants (GmKCSs 2, 5, 7, 20, 25, and 27) or predominantly expressed in specific tissue/organs (GmKCSs 6, 15, 17, and 26) were identified (Figure 4; Supplementary Table S3). The differential expression of GmKCS genes in different tissues/organs, and for certain genes, their tissue specificity, may indicate that they perform different functions in different plant organs and remains to be further investigated.
Unfavorable abiotic stress environmental conditions are often detrimental to growth and development. Drought, salt, and temperature stresses are the main environmental factors that limit the productivity of agricultural plants and threaten food security (Zhang et al., 2022b). Increasing evidence suggests that plant KCS genes are involved in abiotic stress responses and adaptation (Batsale et al., 2021). For instance, Arabidopsis kcs1 mutants exhibited reduced resistance to low humidity conditions at a young age (Todd et al., 1999), while overexpression of drought-induced KCS genes, such as CsKCS6 from orange, BnKCS1 from rapeseed, or AhKCS1 from groundnut, enhanced the drought tolerance of transgenic plants (Lokesh et al., 2019; Guo et al., 2020; Wang et al., 2020b). These genes can be targeted in crop breeding to improve drought tolerance. In our study, we examined the transcriptional profiles of the 15 leaf-expressed GmKCS gene family members under four abiotic stress conditions. Our qRT-PCR results showed that the expression levels of 9, 5, 6, and 13 GmKCS genes were altered more than two-fold under cold, heat, salt, and drought stress, respectively (Figure 6). Some of them exhibited a greater than 5 fold up-regulation under stress conditions, i.e., GmKCS5 under cold stress, GmKCS4 under salt stress, and GmKCSs 1, 3, 5, 12, and 27 under drought stress (Figure 6). The results indicated that the GmKCS genes may widely be involved in abiotic stress adaptation and resistance of soybean. Currently, there is limited evidence regarding the potential link between the differential expression of GmKCS genes and the accumulation of cuticular wax in leaves, as well as the response to stress in soybean. Further research is required to investigate the alterations in VLCFAs content and components in soybean plants following stress treatment. Additionally, it is important to identify the specific KCS genes that are instrumental in regulating soybean stress resistance. By targeting these key GmKCS genes, soybean breeding practices can enhance the plant’s tolerance to stress.
Conclusions
In this study, a total of 31 KCS genes were identified in the soybean genome, and their physicochemical properties, chromosomal location, collinearity, evolutionary relationships, gene structure, the presence of motifs and cis-acting elements, and gene expression patterns in different tissues/organs and under abiotic stress conditions were analyzed. The GmKCS genes were distributed on 14 chromosomes, and segmental duplication during evolution was primarily responsible for their expansion during soybean genome evolution. GmKCS genes could be divided into seven groups based on their phylogenetic relationships. GmKCS genes exhibited diverse expression patterns in the various tissues/organs of soybean plants. Many of them were found to respond to varied abiotic stresses, implying that they may be crucial for stress adaptation and tolerance in soybean. These findings will contribute to future research regarding the functions and regulatory mechanisms underlying soybean KCS genes in response to abiotic stress and their potential exploitation towards improving soybean stress tolerance.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.
Author contributions
YG: Investigation, Methodology, Writing – original draft. DW: Investigation, Software, Writing – original draft. HX: Conceptualization, Formal Analysis, Writing – original draft. ZZ: Investigation, Writing – original draft. YC: Investigation, Writing – original draft. DZ: Investigation, Writing – original draft. YJ: Investigation, Writing – original draft. MS: Software, Writing – original draft. PL: Software, Writing – original draft. QS: Software, Writing – original draft. JY: Investigation, Software, Writing – original draft. PC: Writing – original draft. YS: Investigation, Methodology, Software, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was financially supported by National Natural Science Foundation of China (32101788), the Open Project of Liaocheng Universtiy Landscape Architecture Discipline (319462212), the Higher Educational Program of Shandong Province for Introduction and Cultivation of Young Innovative Talents (S202310447030; CXCY2023267), and the Doctoral Research Start-up Foundation of Liaocheng University (318052021).
Acknowledgments
We appreciate the linguistic assistance provided by ZYEdit during the preparation of this manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1291731/full#supplementary-material
Supplementary Data Sheet 1 | KCS protein sequences from five species used for phylogenetic analysis.
References
Arya, G. C., Sarkar, S., Manasherova, E., Aharoni, A., Cohen, H. (2021). The plant cuticle: An ancient guardian barrier set against long-standing rivals. Front. Plant Sci. 12. doi: 10.3389/fpls.2021.663165
Bach, L., Faure, J. D. (2010). Role of very-long-chain fatty acids in plant development, when chain length does matter. C R Biol. 333, 361–370. doi: 10.1016/j.crvi.2010.01.014
Batsale, M., Bahammou, D., Fouillen, L., Mongrand, S., Joubès, J., Domergue, F. (2021). Biosynthesis and functions of very-long-chain fatty acids in the responses of plants to abiotic and biotic stresses. Cells 10, 1284. doi: 10.3390/cells10061284
Cameron, K. D., Teece, M. A., Smart, L. B. (2006). Increased accumulation of cuticular wax and expression of lipid transfer protein in response to periodic drying events in leaves of tree tobacco. Plant Physiol. 140, 176–183. doi: 10.1104/pp.105.069724
Campbell, A. A., Stenback, K. E., Flyckt, K., Hoang, T., Perera, M. A. D., Nikolau, B. J. (2019). A single-cell platform for reconstituting and characterizing fatty acid elongase component enzymes. PloS One 14, e0213620. doi: 10.1371/journal.pone.0213620
Chai, M., Queralta Castillo, I., Sonntag, A., Wang, S., Zhao, Z., Liu, W., et al. (2021). A seed coat-specific β-ketoacyl-coa synthase, KCS12, is critical for preserving seed physical dormancy. Plant Physiol. 186, 1606–1615. doi: 10.1093/plphys/kiab152
Chen, C., Chen, H., Zhang, Y., Thomas, H. R., Frank, M. H., He, Y., et al. (2020a). TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13, 1194–1202. doi: 10.1016/j.molp.2020.06.009
Chen, H., Zhang, Z., Ni, E., Lin, J., Peng, G., Huang, J., et al. (2020b). HMS1 interacts with HMS1I to regulate very-long-chain fatty acid biosynthesis and the humidity-sensitive genic male sterility in rice (Oryza sativa). New Phytol. 225, 2077–2093. doi: 10.1111/nph.16288
Chen, L., Hu, W., Mishra, N., Wei, J., Lu, H., Hou, Y., et al. (2020c). AKR2A interacts with KCS1 to improve VLCFAs contents and chilling tolerance of Arabidopsis thaliana. Plant J. 103, 1575–1589. doi: 10.1111/tpj.14848
Costaglioli, P., Joubès, J., Garcia, C., Stef, M., Arveiler, B., Lessire, R., et al. (2005). Profiling candidate genes involved in wax biosynthesis in Arabidopsis thaliana by microarray analysis. Biochim. Biophys. Acta 1734, 247–258. doi: 10.1016/j.bbalip.2005.04.002
Derveaux, S., Vandesompele, J., Hellemans, J. (2010). How to do successful gene expression analysis using real-time PCR. Methods 50, 227–230. doi: 10.1016/j.ymeth.2009.11.001
Deshmukh, R., Sonah, H., Patil, G., Chen, W., Prince, S., Mutava, R., et al. (2014). Integrating omic approaches for abiotic stress tolerance in soybean. Front. Plant Sci. 5. doi: 10.3389/fpls.2014.00244
Dong, X., Liu, Z., Wei, J., Zheng, G., Li, H., Wang, Y., et al. (2023). The BrAFP1 promoter drives gene-specific expression in leaves and stems of winter rapeseed (Brassica rapa L.) under cold induction. Plant Sci. 331, 111669. doi: 10.1016/j.plantsci.2023.111669
Du, Y., Zhang, Z., Gu, Y., Li, W., Wang, W., Yuan, X., et al. (2023). Genome-wide identification of the soybean cytokinin oxidase/dehydrogenase gene family and its diverse roles in response to multiple abiotic stress. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1163219
Fan, C., Wang, X., Hu, R., Wang, Y., Xiao, C., Jiang, Y., et al. (2013). The pattern of Phosphate transporter 1 genes evolutionary divergence in Glycine max L. BMC Plant Biol. 13, 48. doi: 10.1186/1471-2229-13-48
Fehr, W. R., Caviness, C. E. (1977). Stages of soybean. Iowa State University Cooperative Extension Service Special Report 80 (Ames, IA: Iowa State University).
Fiebig, A., Mayfield, J. A., Miley, N. L., Chau, S., Fischer, R. L., Preuss, D. (2000). Alterations in CER6, a gene identical to CUT1, differentially affect long-chain lipid content on the surface of pollen and stems. Plant Cell. 12, 2001–2008. doi: 10.1105/tpc.12.10.2001
Franke, R., Höfer, R., Briesen, I., Emsermann, M., Efremova, N., Yephremov, A., et al. (2009). The DAISY gene from Arabidopsis encodes a fatty acid elongase condensing enzyme involved in the biosynthesis of aliphatic suberin in roots and the chalaza-micropyle region of seeds. Plant J. 57, 80–95. doi: 10.1111/j.1365-313X.2008.03674.x
Funa, N., Ohnishi, Y., Ebizuka, Y., Horinouchi, S. (2002). Alteration of reaction and substrate specificity of a bacterial type III polyketide synthase by site-directed mutagenesis. Biochem. J. 367, 781–789. doi: 10.1042/BJ20020953
Gavili, E., Moosavi, A. A., Kamgar Haghighi, A. A. (2018). Does biochar mitigate the adverse effects of drought on the agronomic traits and yield components of soybean? Ind. Crops Products. 128, 445–454. doi: 10.1016/j.indcrop.2018.11.04
Ghanevati, M., Jaworski, J. G. (2002). Engineering and mechanistic studies of the Arabidopsis FAE1 beta-ketoacyl-coa synthase, FAE1 KCS. Eur. J. Biochem. 269, 3531–3539. doi: 10.1046/j.1432-1033.2002.03039.x
Girard, I. J., Tong, C., Becker, M. G., Mao, X., Huang, J., de Kievit, T., et al. (2017). RNA sequencing of Brassica napus reveals cellular redox control of sclerotinia infection. J. Exp. Bot. 68, 5079–5091. doi: 10.1093/jxb/erx338
Gray, J. E., Holroyd, G. H., van der Lee, F. M., Bahrami, A. R., Sijmons, P. C., Woodward, F. I., et al. (2000). The HIC signalling pathway links CO2 perception to stomatal development. Nature 408, 713–716. doi: 10.1038/35047071
Guo, H. S., Zhang, Y. M., Sun, X. Q., Li, M. M., Hang, Y. Y., Xue, J. Y. (2016). Evolution of the KCS gene family in plants: the history of gene duplication, sub/neofunctionalization and redundancy. Mol. Genet. Genomics 291, 739–752. doi: 10.1007/s00438-015-1142-3
Guo, W., Wu, Q., Yang, L., Hu, W., Liu, D., Liu, Y. (2020). Ectopic expression of CsKCS6 from navel orange promotes the production of very-long-chain fatty acids (VLCFAs) and increases the abiotic stress tolerance of Arabidopsis thaliana. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.564656
Haslam, T. M., Kunst, L. (2013). Extending the story of very-long-chain fatty acid elongation. Plant Sci. 210, 93–107. doi: 10.1016/j.plantsci.2013.05.008
Hegebarth, D., Buschhaus, C., Joubès, J., Thoraval, D., Bird, D., Jetter, R. (2017). Arabidopsis ketoacyl-CoA synthase 16 (KCS16) forms C36/C38 acyl precursors for leaf trichome and pavement surface wax. Plant Cell Environ. 40, 1761–1776. doi: 10.1111/pce.12981
Hu, J., Zhuang, Y., Li, X., Li, X., Sun, C., Ding, Z., et al. (2022). Time-series transcriptome comparison reveals the gene regulation network under salt stress in soybean (Glycine max) roots. BMC Plant Biol. 22 (1), 157. doi: 10.1186/s12870-022-03541-9
Huai, D., Xue, X., Li, Y., Wang, P., Li, J., Yan, L., et al. (2020). Genome-wide identification of peanut KCS genes reveals that AhKCS1 and AhKCS28 are involved in regulating VLCFA contents in seeds. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.00406
Huang, H., Ayaz, A., Zheng, M., Yang, X., Zaman, W., Zhao, H., et al. (2022). Arabidopsis KCS5 and KCS6 play redundant roles in wax synthesis. Int. J. Mol. Sci. 23, 4450. doi: 10.3390/ijms23084450
Huang, H., Yang, X., Zheng, M., Chen, Z., Yang, Z., Wu, P., et al. (2023). An ancestral role for 3-KETOACYL-COA SYNTHASE3 as a negative regulator of plant cuticular wax synthesis. Plant Cell. 35, 2251–2270. doi: 10.1093/plcell/koad051
James, D. W., Lim, E., Keller, J., Plooy, I., Ralston, E., Dooner, H. K. (1995). Directed tagging of the Arabidopsis FATTY ACID ELONGATION1 (FAE1) gene with the maize transposon activator. Plant Cell. 7, 309–319. doi: 10.1105/tpc.7.3.309
Jiang, H., Zhou, L. J., Gao, H. N., Wang, X. F., Li, Z. W., Li, Y. Y. (2022). The transcription factor MdMYB2 influences cold tolerance and anthocyanin accumulation by activating SUMO E3 ligase MdSIZ1 in apple. Plant Physiol. 189, 2044–2060. doi: 10.1093/plphys/kiac211
Joubès, J., Raffaele, S., Bourdenx, B., Garcia, C., Laroche-Traineau, J., Moreau, P., et al. (2008). The VLCFA elongase gene family in Arabidopsis thaliana: phylogenetic analysis, 3D modelling and expression profiling. Plant Mol. Biol. 67, 547–566. doi: 10.1007/s11103-008-9339-z
Kim, J., Jung, J. H., Lee, S. B., Go, Y. S., Kim, H. J., Cahoon, R., et al. (2013). Arabidopsis 3-ketoacyl-coenzyme A synthase9 is involved in the synthesis of tetracosanoic acids as precursors of cuticular waxes, suberins, sphingolipids, and phospholipids. Plant Physiol. 162, 567–580. doi: 10.1104/pp.112.210450
Kim, J., Lee, S. B., Suh, M. C. (2021). Arabidopsis 3-ketoacyl-CoA synthase 4 is essential for root and pollen tube growth. J. Plant Biol. 64, 155–165. doi: 10.1007/s12374-020-09288-w
Kong, L., Liu, Y., Zhi, P., Wang, X., Xu, B., Gong, Z., et al. (2020). Origins and evolution of cuticle biosynthetic machinery in land plants. Plant Physiol. 184, 1998–2010. doi: 10.1104/pp.20.00913
Kosma, D. K., Bourdenx, B., Bernard, A., Parsons, E. P., Lu, S., Joubes, J., et al. (2009). The impact of water deficiency on leaf cuticle lipids of Arabidopsis. Plant Physiol. 151, 1918–1929. doi: 10.1104/pp.20.00913
Kumar, S., Stecher, G., Tamura, K. (2016). MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874. doi: 10.1093/molbev/msw054
Kunst, L., Samuels, L. (2009). Plant cuticles shine: advances in wax biosynthesis and export. Curr. Opin. Plant Biol. 12, 721–727. doi: 10.1016/j.pbi.2009.09.009
Lee, S. B., Jung, S. J., Go, Y. S., Kim, H. U., Kim, J. K., Cho, H. J., et al. (2009). Two Arabidopsis 3-ketoacyl CoA synthase genes, KCS20 and KCS2/DAISY, are functionally redundant in cuticular wax and root suberin biosynthesis, but differentially controlled by osmotic stress. Plant J. 60, 462–475. doi: 10.1111/j.1365-313X.2009.03973.x
Lee, S. B., Suh, M. C. (2015a). Advances in the understanding of cuticular waxes in Arabidopsis thaliana and crop species. Plant Cell Rep. 34, 557–572. doi: 10.1007/s00299-015-1772-2
Lee, S. B., Suh, M. C. (2015b). Cuticular wax biosynthesis is up-regulated by the MYB94 transcription factor in Arabidopsis. Plant Cell Physiol. 56, 48–60. doi: 10.1093/pcp/pcu142
Lian, X. Y., Wang, X., Gao, H. N., Jiang, H., Mao, K., You, C. X., et al. (2020). Genome wide analysis and functional identification of MdKCS genes in apple. Plant Physiol. Biochem. 151, 299–312. doi: 10.1016/j.plaphy.2020.03.034
Liu, J., Feng, L., Li, J., He, Z. (2015). Genetic and epigenetic control of plant heat responses. Front. Plant Sci. 6. doi: 10.3389/fpls.2015.00267
Liu, C., Wright, B., Allen-Vercoe, E., Gu, H., Beiko, R. (2018). Phylogenetic clustering of genes reveals shared evolutionary trajectories and putative gene functions. Genome Biol. Evol. 10, 2255–2265. doi: 10.1093/gbe/evy178
Liu, X., Zhao, Z., Yang, Y., Xu, H., Bi, Q., Wang, L. (2022). Genome-wide identification and expression analysis of the KCS gene family in yellow horn reveal their putative function on abiotic stress responses and wax accumulation. Horticulturae 9, 25. doi: 10.3390/horticulturae9010025
Lokesh, U., Venkatesh, B., Kiranmai, K., Nareshkumar, A., Amarnathareddy, V., Rao, G.L., et al. (2019). Overexpression of ß-Ketoacyl Co-A synthase1 gene improves tolerance of drought susceptible groundnut (Arachis hypogaea L.) cultivar K-6 by increased leaf epicuticular wax accumulation. Front Plant Sci. 91869doi: 10.3389/fpls.2018.01869
Luzarowska, U., Ruß, A. K., Joubès, J., Batsale, M., Szymański, J., P Thirumalaikumar, V., et al. (2023). Hello darkness, my old friend: 3-ketoacyl-coenzyme A synthase4 is a branch point in the regulation of triacylglycerol synthesis in Arabidopsis thaliana. Plant Cell. 35, 1984–2005. doi: 10.1093/plcell/koad059
Lynch, M., Conery, J. S. (2000). The evolutionary fate and consequences of duplicate genes. Science 290, 1151–1155. doi: 10.1126/science.290.5494.1151
Narusaka, Y., Nakashima, K., Shinwari, Z. K., Sakuma, Y., Furihata, T., Abe, H., et al. (2003). Interaction between two cis-acting elements, ABRE and DRE, in ABA-dependent expression of Arabidopsis rd29A gene in response to dehydration and high-salinity stresses. Plant J. 34, 137–148. doi: 10.1046/j.1365-313x.2003.01708.x
Ozseyhan, M. E., Kang, J., Mu, X., Lu, C. (2018). Mutagenesis of the FAE1 genes significantly changes fatty acid composition in seeds of Camelina sativa. Plant Physiol. Biochem. 123, 1–7. doi: 10.1016/j.plaphy.2017.11.021
Pruitt, R. E., Vielle-Calzada, J. P., Ploense, S. E., Grossniklaus, U., Lolle, S. J. (2000). FIDDLEHEAD, a gene required to suppress epidermal cell interactions in Arabidopsis, encodes a putative lipid biosynthetic enzyme. Proc. Natl. Acad. Sci. 97, 1311–1316. doi: 10.1073/pnas.97.3.1311
Rizwan, H. M., Shaozhong, F., Li, X., Bilal Arshad, M., Yousef, A. F., Chenglong, Y., et al. (2022). Genome-wide identification and expression profiling of KCS gene family in passion fruit (Passiflora edulis) under Fusarium kyushuense and drought stress conditions. Front. Plant Science. 13. doi: 10.3389/fpls.2022.872263
Rossak, M., Smith, M., Kunst, L. (2001). Expression of the FAE1 gene and FAE1 promoter activity in developing seeds of Arabidopsis thaliana. Plant Mol. Biol. 46, 717–725. doi: 10.1023/a:1011603923889
Sagar, M., Pandey, N., Qamar, N., Singh, B., Shukla, A. (2015). Domain analysis of 3 keto acyl-CoA synthase for structural variations in Vitis vinifera and Oryza brachyantha using comparative modelling. Interdiscip Sci. 7, 7–20. doi: 10.1007/s12539-013-0017-8
Schmutz, J., Cannon, S. B., Schlueter, J., Ma, J. S., Mitros, T., Nelson, W., et al. (2010). Genome sequence of the palaeopolyploid soybean. Nature 463, 178–183. doi: 10.1038/nature08670
Scott, S., Cahoon, E. B., Busta, L. (2022). Variation on a theme: the structures and biosynthesis of specialized fatty acid natural products in plants. Plant J. 111, 954–965. doi: 10.1111/tpj.15878
Shan, B., Bao, G., Shi, T., Zhai, L., Bian, S., Li, X. (2022). Genome-wide identification of BBX gene family and their expression patterns under salt stress in soybean. BMC Genomics 23, 820. doi: 10.1186/s12864-022-09068-5
Shepherd, T., Wynne, D. (2006). The effects of stress on plant cuticular waxes. New Phytol. 171 (3), 469–499. doi: 10.1111/j.1469-8137.2006.01826.x
Shi, J., Lang, C., Wang, F., Wu, X., Liu, R., Zheng, T., et al. (2017). Depressed expression of FAE1 and FAD2 genes modifies fatty acid profiles and storage compounds accumulation in Brassica napus seeds. Plant Sci. 263, 177–182. doi: 10.1016/j.plantsci.2017.07.014
Smirnova, A., Leide, J., Riederer, M. (2013). Deficiency in a very-long-chain fatty acid β-ketoacyl-coenzyme A synthase of tomato impairs microgametogenesis and causes floral organ fusion. Plant Physiol. 161, 196–209. doi: 10.1104/pp.112.206656
Sun, Y., Wang, D., Shi, M., Gong, Y., Yin, S., Jiao, Y., et al. (2023). Genome-wide identification of actin-depolymerizing factor gene family and their expression patterns under various abiotic stresses in soybean (Glycine max). Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1236175
Todd, J., Post-Beittenmiller, D., Jaworski, J. G. (1999). KCS1 encodes a fatty acid elongase 3-ketoacyl-CoA synthase affecting wax biosynthesis in Arabidopsis thaliana. Plant J. 17, 119–130. doi: 10.1046/j.1365-313x.1999.00352.x
Tong, T., Fang, Y. X., Zhang, Z., Zheng, J., Zhang, X., Li, J., et al. (2021). Genome-wide identification and expression pattern analysis of the KCS gene family in barley. Plant Growth Regul. 93, 89–103. doi: 10.1007/s10725-020-00668-3
Wang, F., Sun, X., Shi, X., Zhai, H., Tian, C., Kong, F., et al. (2016a). A global analysis of the polygalacturonase gene family in soybean (Glycine max). PloS One 11, e0163012. doi: 10.1371/journal.pone.0163012
Wang, N., Zhong, X., Cong, Y., Wang, T., Yang, S., Li, Y., et al. (2016b). Genome-wide analysis of phosphoenolpyruvate carboxylase gene family and their response to abiotic stresses in soybean. Sci. Rep. 6, 38448. doi: 10.1038/srep38448
Wang, Q., Du, X., Zhou, Y., Xie, L., Bie, S., Tu, L., et al. (2020a). The β-ketoacyl-CoA synthase KCS13 regulates the cold response in cotton by modulating lipid and oxylipin biosynthesis. J. Exp. Bot. 71, 5615–5630. doi: 10.1093/jxb/eraa254
Wang, S., Guo, J., Zhang, Y., Guo, Y., Ji, W. (2021). Genome-wide characterization and expression analysis of TOPP-type protein phosphatases in soybean (Glycine max L.) reveal the role of GmTOPP13 in drought tolerance. Genes Genomics. 43, 783–796.
Wang, Y., Jin, S., Xu, Y., Li, S., Zhang, S., Yuan, Z., et al. (2020b). Overexpression of BnKCS1-1, BnKCS1-2, and BnCER1-2 promotes cuticular wax production and increases drought tolerance in Brassica napus. Crop J. 8, 26–37. doi: 10.1016/j.cj.2019.04.006
Woyann, L. G., Meira, D., Zdziarski, A. D., et al. (2019). Multiple-trait selection of soybean for biodiesel production in Brazil. Ind. Crops Products. 140, 111721. doi: 10.1016/j.indcrop.2019.111721
Xiao, G. H., Wang, K., Huang, G., Zhu, Y. X. (2016). Genome-scale analysis of the cotton KCS gene family revealed a binary mode of action for gibberellin A regulated fiber growth. J. Integr. Plant Biol. 58, 577–589. doi: 10.1111/jipb.12429
Xue, Y., Jiang, J., Yang, X., Jiang, H., Du, Y., Liu, X., et al. (2020). Genome-wide mining and comparative analysis of fatty acid elongase gene family in Brassica napus and its progenitors. Gene 747, 144674. doi: 10.1016/j.gene.2020.144674
Yang, L., Fang, J., Wang, J., Hui, S., Zhou, L., Xu, B., et al. (2023). Genome-wide identification and expression analysis of 3-ketoacyl-CoA synthase gene family in rice (Oryza sativa L.) under cadmium stress. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1222288
Yang, Z., Yang, X., Dong, S., Ge, Y., Zhang, X., Zhao, X., et al. (2020). Overexpression of β-ketoacyl-coa synthase from Vitis vinifera L. improves salt tolerance in Arabidopsis thaliana. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.564385
Zhan, Y., Li, H., Sui, M., Zhao, X., Jing, Y., Luo, J., et al. (2020). Genome wide association mapping for tocopherol concentration in soybean seeds across multiple environments. Ind. Crops Products. 154, 112674. doi: 10.1016/j.indcrop.2020.112674
Zhang, A., Xu, J., Xu, X., Wu, J., Li, P., Wang, B., et al. (2022a). Genome-wide identification and characterization of the KCS gene family in sorghum (Sorghum bicolor (L.) Moench). PeerJ 10, e14156. doi: 10.7717/peerj.14156
Zhang, Y., Zhao, Y., Hou, X., Ni, C., Han, L., Du, P., et al. (2023). Wheat ABA receptor TaPYL5 constitutes a signaling module with its downstream partners TaPP2C53/TaSnRK2.1/TaABI1 to modulate plant drought response. Int. J. Mol. Sci. 24, 7969. doi: 10.3390/ijms24097969
Keywords: very long chain fatty acids, 3-ketoacyl-CoA synthase, gene family, soybean, abiotic stress, expression profile
Citation: Gong Y, Wang D, Xie H, Zhao Z, Chen Y, Zhang D, Jiao Y, Shi M, Lv P, Sha Q, Yang J, Chu P and Sun Y (2023) Genome-wide identification and expression analysis of the KCS gene family in soybean (Glycine max) reveal their potential roles in response to abiotic stress. Front. Plant Sci. 14:1291731. doi: 10.3389/fpls.2023.1291731
Received: 10 September 2023; Accepted: 01 November 2023;
Published: 05 December 2023.
Edited by:
Saroj Kumar Sah, Brookhaven National Laboratory (DOE), United StatesReviewed by:
Chun Li, Henan Academy of Agricultural Sciences (HNAAS), ChinaMan Wu, Chinese Academy of Agricultural Sciences (CAAS), China
Copyright © 2023 Gong, Wang, Xie, Zhao, Chen, Zhang, Jiao, Shi, Lv, Sha, Yang, Chu and Sun. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Pengfei Chu, chupengfei@lcu.edu.cn; Yongwang Sun, sunyongwang@lcu.edu.cn
†These authors have contributed equally to this work