Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 23 August 2022
Sec. Plant Abiotic Stress
This article is part of the Research Topic Environmental Extremes Threatening Food Crops View all 37 articles

Genome-wide identification of myeloblastosis gene family and its response to cadmium stress in Ipomoea aquatica

Zheng Liu&#x;Zheng Liu1Yuxin Zhang&#x;Yuxin Zhang1Muhammad Ahsan AltafMuhammad Ahsan Altaf1Yuanyuan HaoYuanyuan Hao1Guangzhen ZhouGuangzhen Zhou2Xinyu LiXinyu Li2Jie ZhuJie Zhu1Wuqiang MaWuqiang Ma1Zhiwei Wang,Zhiwei Wang1,3Wenlong Bao
Wenlong Bao1*
  • 1Key Laboratory for Quality Regulation of Tropical Horticultural Crops of Hainan Province, College of Horticulture, Hainan University, Haikou, China
  • 2College of Tropical Crops, Hainan University, Haikou, China
  • 3Hainan Yazhou Bay Seed Laboratory, Sanya Nanfan Research Institute of Hainan University, Sanya, China

The myeloblastosis (MYB) proteins perform key functions in mediating cadmium (Cd) tolerance of plants. Ipomoea aquatica has strong adaptability to Cd Stress, while the roles of the I. aquatica MYB gene family with respect to Cd stress are still unclear. Here, we identified a total of 183 MYB genes in the I. aquatica genome (laMYB), which were classified into 66 1R-type IaMYB, 112 2R-type IaMYB, four 3R-type IaMYB, and one 4R-type IaMYB based on the number of the MYB repeat in each gene. The analysis of phylogenetic tree indicated that most of IaMYB genes are associated with the diverse biological processes including defense, development and metabolism. Analysis of sequence features showed that the IaMYB genes within identical subfamily have the similar patterns of the motif distributions and gene structures. Analysis of gene duplication events revealed that the dispersed duplication (DSD) and whole-genome duplication (WGD) modes play vital roles in the expansion of the IaMYB gene family. Expression profiling manifests that approximately 20% of IaMYB genes had significant role in the roots of I. aquatica under Cd stress. Promoter profiling implied that the differentially expressed genes might be induced by environmental factors or inherent hormones and thereby execute their function in Cd response. Remarkably, the 2R-type IaMYB157 with abundant light-responsive element G-box and ABA-responsive element ABRE in its promoter region exhibited very strong response to Cd stress. Taken together, our findings provide an important candidate IaMYB gene for further deciphering the molecular regulatory mechanism in plant with respect to Cd stress.

Introduction

The increasing discharge of heavy metals into the ecosystems has created global environmental problems with the intensification of industrial and agricultural activities. Cadmium (Cd) is an ubiquitous heavy metal contaminant, which poses serious threats to the biosphere due to its long biological half-life, non-biodegradability, water solubility, and high mobility (Sarwar et al., 2010; Cirmi et al., 2021; Kumar et al., 2021). Cd restrains the normal functioning of plants by perturbing the physiological function of plasma membrane, reducing the bioactivity of enzymes, and causing the burst of intracellular reactive oxygen species (ROS) (Janicka-Russak et al., 2008; Singh et al., 2018; Guo et al., 2022). To better adapt to the Cd-contaminated circumstances, plants struggling with Cd have evolved sophisticated regulatory networks to mitigate Cd-induced deleterious effects.

Transcription factors (TFs) act as crucial regulators for plant Cd accumulation and tolerance, which execute their functions by switching “on” or “off” downstream gene expression. Several TFs that involved in plant response to Cd stress have been reported, including basic helix–loop–helix protein (bHLH) (Xu et al., 2017), dehydration-responsive element-binding protein (DREB) (Akbudak et al., 2018), metal-responsive element-binding transcription factor (MTF) (Sun et al., 2015), ethylene-responsive factor (ERF) (Tian et al., 2022) basic leucine Zipper (bZIP) (Lu et al., 2022), WRKY (Sheng et al., 2019), and MYB (Sapara et al., 2019).

Myeloblastosis (MYB) proteins comprise a superfamily of TFs in a broad range of species and perform versatile functions in diverse biological processes. As reported, the first member of the MYB gene family, an oncogene v-MYB, was discovered from the avian myeloblastosis virus more than 40 years ago (Lipsick and Wang, 1999). Subsequently, the proto-oncogene c-MYB, a cellular homolog of v-MYB gene, was identified (Lipsick, 2010). In 1987, the first MYB gene in plants was found in Zea mays (Paz-Ares et al., 1987). Since then, an increasing number of MYB genes in diverse plant species have been widely identified and studied. Structurally, MYB proteins are composed of a highly conserved DNA-binding domain (DBD) that is located at the N-terminus and a variable transcriptional regulation domain (TRD) that is located at the C-terminus. The imperfect repeat (R) sequences of 50–55 amino acid residues are a core component of the DBD within MYB proteins. Each R structure has evolutionarily conserved tryptophan residues (W) and interval sequences which was consisted of 18–19 variable amino acids, forming a helix-turn-helix (HTH) motif to participate in DNA binding. Based on the similarity to the Rs in c-MYB, the Rs of MYB in the plant were designated as R1, R2, and R3, respectively (Saikumar et al., 1990; Dubos et al., 2010). According to the arrangement of the Rs, MYB TFs in the plant can be subdivided into four distinct types, i.e., 1R-type MYB (R1/2/3-MYB), 2R-type MYB (R2R3-MYB), 3R-type MYB (R1R2R3-MYB), and 4R-type MYB (R1R2R2R1/2-MYB). Remarkably, the 2R-type MYB genes are specific to plants and are the richest MYB in most plant species, which underwent a rapid expansion during the evolutionary process and thereby greatly facilitated the evolution of MYB genes in plant realms (Ambawat et al., 2013; Jiang and Rao, 2020). Several studies demonstrated that MYB TFs perform key functions in plant tolerance to Cd stress. For instance, Cd-induced AtMYB49 can significantly increase Cd accumulation in Arabidopsis thaliana by directly or indirectly regulating the expression of heavy metal-associated isoprenylated plant proteins (HIPPs) and ABA-inhibited metal transporter iron-regulated transporter 1 (IRT1), which is also involved in a feedback mechanism to modulate Cd uptake in plants by interacting with the basic region/Leu zipper TF abscisic acid-insensitive 5 (ABI5; Zhang et al., 2019). Genome-wide identification of AtMYB genes and its expression profiling showed that approximately 20% of AtMYB genes were engaged in Cd response. The transcript abundance of a Boehmeria nivea BnMYB2 was highly enhanced by Cd stress. Further study confirmed that the overexpression of 35S:BnMYB2 can impart Cd tolerance to A. thaliana (Zhu et al., 2020). In general, these studies have deepened our knowledge to understand the molecular regulatory function of MYBs in plant response to Cd stress.

Ipomoea aquatica is one of the only two edible plants in the family Convolvulaceae, which is widely distributed in tropical and subtropical areas (Hao et al., 2021). The characteristic of the high capacity of Cd accumulation together with strong adaptability to Cd stress brought I. aquatica into focus for studying plant response to Cd stress (Wang et al., 2007; Shen et al., 2017). However, little is known about the MYB gene family in I. aquatica and its expression patterns in response to Cd stress due to a lack of genomic information. In our previous work, we constructed a high-quality chromosome-level genome assembly of I. aquatica (Hao et al., 2021), which provides excellent sequence resources for comprehensive analysis of the MYB gene family in I. aquatica. In the present study, we performed genome-wide identification of IaMYB genes by conducting the bioinformatic search of I. aquatica genome and analyzed their sequence features, gene duplication modes, selective pressure, expression patterns under Cd stress, and cis-acting elements. Remarkably, 2R-type IaMYB157 with abundant light-responsive element G-box and ABA-responsive element ABRE in its promoter region exhibited very strong response to Cd stress. Our findings provide an important candidate IaMYB gene for further deciphering the molecular regulatory mechanism in plant with respect to Cd stress.

Materials and methods

Identification of MYB genes in Ipomoea aquatica (IaMYBs)

A high-quality chromosome-level genome database of I. aquatica (BioProject: PRJCA002216) that we constructed in our previous work was used for the identification of IaMYBs. The MYB genes in A. thaliana (AtMYBs) were retrieved from TAIR1 and used for the phylogeny analysis in this study. The hidden Markov model (HMM) profile of the MYB protein domain (PF00249) was downloaded from the Pfam database.2 The IaMYB candidates were preliminarily identified using HMM search (E-value < 10−5) according to the previously published methods (Krogh et al., 2001). Subsequently, the putative IaMYBs were further confirmed by using the NCBI CD-search,3 SMART,4 and Pfam5 database.

Analysis of physicochemical parameters of IaMYB proteins

The ExPASy6 and TBtools were performed to analyze the physicochemical parameters of each IaMYB proteins, including the number of amino acids, molecular weight (MW), theoretical isoelectric point (pI), instability index, aliphatic index and grand average of hydropathicity (GRAVY). In addition, the PIant-mPLoc online program7 was employed to predict the subcellular localization of each IaMYB protein.

Analysis of phylogenetic relationships of MYB genes in Ipomoea aquatica and Arabidopsis thaliana

Multiple sequence alignments were carried out using ClustalW with default parameters, and then a neighbor-joining (NJ) method within MEGA-X (Kumar et al., 2018) was conducted to construct a phylogenetic tree. The reliability of the phylogenetic tree was tested by 1,000 bootstrap replications. The Evolview8 program was performed to further manipulate and annotate the phylogenetic tree (He et al., 2016).

Analysis of conserved motifs, gene structures, and chromosome distribution of IaMYBs

The MEME online tool9 was performed to analyze conserved motifs shared among IaMYB genes. The parameters of MEME were set as follows: the site distribution: Zero or one occurrence per sequence; the number of motifs = 10; the width of motifs = 50–53 residues. The WebLogo program10 was used to envision the sequence logo of R2 and R3 domains distributed in 2R-type IaMYB proteins. The TBtools software was employed to map exon-intron structures of IaMYBs based on CDS of each IaMYB gene and its corresponding genomic DNA sequences (Chen et al., 2020). Furthermore, The TBtools software was performed to visualize the information of genome-wide chromosomal density and the distribution of IaMYBs across all chromosomes of I. aquatica based on the genome annotation files. The genes distributed on scaffolds were excluded.

Analysis of syntenic relationships, gene duplication events, and Ka/Ks of MYBs

Syntenic relationships of MYB genes in interspecies (I. aquatica vs. Ipomoea batatas, I. aquatica vs. A. thaliana, and I. aquatica vs. Oryza sativa) and intraspecies were analyzed using TBtools software with default parameters. Gene duplication events of IaMYB duplicated gene pairs were detected by performing the DupGen_finder pipeline, which includes whole-genome duplication (WGD), tandem duplication (TD), proximal duplication (PD), transposed duplication (TRD), and dispersed duplication (DSD; Qiao et al., 2019). In addition, the Nei-Gojobori method within TBtools was used to calculate the non-synonymous substitution rates (Ka) and the synonymous substitution rates (Ks) of IaMYBs.

Analysis of cis-acting regulatory elements

Upstream sequences (2,000 bp) from the start codon of CDS of IaMYB genes were considered as promoter regions, which were extracted from the genome database of I. aquatica. Afterward, the cis-acting elements of those sequences were detected using the PlantCARE program.11

Sampling of plant materials

Seeds of I. aquatica were soaked in 55°C water for 20 min, and washed with tap water 3 times. Subsequently, the seeds were put on the blotting paper and moistened with tap water for germination. Germinated seeds were planted in Hoagland nutrient solution and cultivated for 25 days under the growth conditions of 16 h light (31°C) and 8 h darkness (26°C) with the replacement of fresh nutrient solution by every 3 days. The 25-day seedlings were divided into two groups: Control group (CK), the seedlings were cultivated in Hoagland nutrient solution; Cd-treated group, the seedlings were cultivated in Hoagland nutrient solution supplemented with CdCl2·2H2O to the final concentration of 5 mg/l Cd2+. The roots of seedlings from the control group and Cd-treated group were collected after 4-day treatment, which was immediately frozen in liquid nitrogen and stored at −80°C until use. Each treatment was independently replicated three times.

Transcriptome analysis of IaMYB genes

Based on the RNA-seq data (BioProject: PRJNA812778), the transcriptome analyses were carried out to evaluate the expression patterns of IaMYB genes under Cd treatment. In this section, we quantified the transcript abundances of IaMYB genes by calculating the value of FPKM (fragments per kilobase per million mapped reads) of each gene, and defined the differentially expressed genes (DEGs) by the criteria as follows: |log2FC| > 1, FDR < 0.05, and p-value < 0.05. The TBtools software was used to generate the gene expression heatmap based on the FPKM value of each gene.

Quantitative real-time PCR (qRT-PCR) analysis of randomly selected IaMYBs

Total RNA was extracted from the roots of seedlings in the CK and Cd-treated group by using RNAprep pure Plant Kit (Tiangen, China) according to the manufacturer’s protocol. The concentration, quality, and integrity of total RNA were evaluated on a NanoDrop 2000 (ThermoFisher, United states) and checked by using gel electrophoresis. The high-quality total RNA was used as a template to synthesize cDNA using the Reverse Transcription Kit (Tiangen, China). The specific primers used for quantitative real-time PCR (qRT-PCR) were designed by using Primer5 software (Supplementary Table S9). The total reaction volume of qRT-PCR was 20 μl, containing 10 μl of 2 × SYBR Green Master Mix (TaKaRa, China), 0.5 μl cDNA template, 0.5 μl of forward and reverse primers, and 8.5 μl PCR-grade water. The final results were analyzed by using the 2−ΔΔCt method. The GAPDH gene encoding the glyceraldehyde-3-phosphate dehydrogenase was used as an internal housekeeping gene. GraphPad Prism 8 was used to produce the final figures. Each experiment was independently replicated three times.

Results

Identification and characterization of IaMYB genes

To identify the member of IaMYB TFs, we used PF00249 as seed sequence to conduct the bioinformatic search of I. aquatica genome database. After removing the redundant sequences based on E value and protein structure, a total of 183 IaMYB genes were confirmed, which were renamed as IaMYB1-IaMYB183 based on the order of their distribution on chromosomes and scaffolds (Supplementary Table S1). According to the arrangement of imperfect repeats (Rs), 183 IaMYB genes were classified into distinct four types, namely 1R-type IaMYB, 2R-type IaMYB, 3R-type IaMYB, and 4R-type IaMYB, respectively. As shown in Figure 1A, the number of MYB gene within each type of MYB in I. aquatica was consistent with that in A. thaliana. The 2R-type IaMYB had the largest number of genes with 61.2% (112/183) of the total IaMYB genes, while the 4R-type IaMYB had the lowest number with only one gene. In addition, the 1R-type IaMYB and 3R-type IaMYB had 66 and 4 genes, respectively. Besides, as shown in Figure 1B and Supplementary Table S2, the number of amino acids of IaMYB proteins ranged from 88 (IaMYB23) to 1,106 (IaMYB103), and their molecular weight (MW) varied from 10.11 kDa to 123.94 kDa. The value of the theoretical isoelectric point (pI) is a vital parameter for protein purification, which varied from 4.68 (IaMYB56) to 12.06 (IaMYB19). The instability index of 12 out of 183 IaMYB proteins were lower than 40, while the rest were larger than 40. There are 57 of 183 IaMYB TFs have a value of aliphatic index (Ai) greater than 71. IaMYB5 had the highest Ai with 88.43, while IaMYB24 had the lowest Ai with 45.38. The analysis of the grand average of hydropathicity (GRAVY) showed that all IaMYB proteins had negative hydrophobicity. The prediction of subcellular localization showed that all IaMYB proteins localized in the nucleus.

FIGURE 1
www.frontiersin.org

Figure 1. Identification of IaMYB genes. (A) The number of MYB gene in Ipomoea aquatica and Arabidopsis thaliana. The MYB genes were classified into four types accordding to the arrangement of MYB repeat in each gene, namely 1R-type MYB, 2R-type MYB, 3R-type MYB, and 4R-type MYB (Dubos et al., 2010). (B) The physicochemical properties of IaMYB proteins.

To illustrate the genomic distribution of the IaMYB genes, their location on each chromosome were marked based on their physical positions. As shown in Figure 2A, 158 of 183 IaMYB genes were mapped across 15 chromosomes (Chr1 to Chr15), while the rest were traced on the scaffolds (Supplementary Table S1). The rank of chromosomes with different number of IaMYB genes were as follows: Chr 1 (containing 24 IaMYBs) > Chr 14 (containing 22 IaMYBs) > Chr 6 (containing 17 IaMYBs) > Chr 10 (containing 12 IaMYBs) = Chr 3 (containing 12 IaMYBs) > Chr 8 (containing 11 IaMYBs) > Chr 12 (containing 9 IaMYBs) > Chr 7 (containing 8 IaMYBs) = Chr 9 (containing 8 IaMYBs) > Chr 13 (containing 7 IaMYBs) = Chr 2 (containing 7 IaMYBs) > Chr 11 (containing 6 IaMYBs) = Chr 4 (containing 6 IaMYBs) > Chr 15 (containing 5 IaMYBs) > Chr 5 (containing 4 IaMYBs).

FIGURE 2
www.frontiersin.org

Figure 2. Analysis of IaMYB genes at the chromosome level. (A) Chromosome distribution of IaMYB genes and their collinear duplications in I. aquatica genome. The outer circle with different colors show 15 chromosomes of I. aquatica; the inner circle shows the gene density of each chromosome. (B–D) Synteny analysis of IaMYB genes between I. batatas, A. thaliana, and O. sativa genomes, respectively. The gray line linked with each chromosome show all collinear gene pairs between I. aquatica and other plant genomes, while the red line show the collinear gene pairs with respect to IaMYB genes. Please refer to the Supplementary Table S3 for the detailed information of each gene pairs.

To uncover the distribution of IaMYB collinear duplications on each chromosome, synteny analyses were performed. The results showed that a total of 77 IaMYB collinear duplications were identified in the genome of I. aquatica. Of these, 61 IaMYB gene pairs were distributed on almost all chromosomes except for Chr 5, while the rest were spread on the scaffolds (Figure 2A). To provide more insights into the potential evolutionary process of IaMYB genes, the synteny relationships of IaMYB genes with three representative species were investigated at the whole-genome levels, including dicot I. batatas (belongs to the family Convolvulaceae), dicot A. thaliana (belongs to the family Brassicaceae), and monocot O. sativa (belongs to the family Poaceae) (Supplementary Table S3). The largest number of IaMYB homologous gene pairs were detected between I. aquatica and I. batatas (255; Figure 2B), followed by I. aquatica and A. thaliana (187; Figure 2C). While, only 39 IaMYB homologous gene pairs were found between I. aquatica and O. sativa (Figure 2D).

Phylogenetic tree analysis of MYB proteins in Ipomoea aquatica and Arabidopsis thaliana

To predict the possible function of the IaMYB proteins, we constructed a phylogenetic tree using amino acid sequences of MYB proteins in I. aquatica and A. thaliana. As shown in Figure 3A, the phylogenetic tree contained the 117 IaMYB proteins (including 112 2R-type IaMYBs, four 3R-type IaMYBs, and one 4R-type IaMYB) and the 131 AtMYB proteins (including 125 2R-type AtMYB, five 3R-type AtMYBs, and one 4R-type AtMYB). According to the topology of the phylogenetic tree together with the classification of the AtMYB proteins, 117 IaMYBs were subdivided into 23 subgroups (G1–G23). The total number of MYB protein in different subgroups varied to some extent (Figure 3B). The subgroup G1 had the largest number of IaMYB proteins with 21, while the subgroup G3, G4, G13 and G21 had the lowest number with only one IaMYB proteins (Figure 3C). Except for the IaMYB proteins in the subgroup G2, G3 and G15, all the IaMYB proteins in the different subgroups were well matched with the functional defined AtMYB subgroups that associated with four major function including defense, development, metabolism and differentiation (Supplementary Table S4). For example, the AtMYBs in the subgroup S14 were defined as development-related proteins, which were clustered together with the IaMYBs in subgroup G1. The AtMYBs in the subgroup S2 related to defense responses, which were grouped together with the IaMYBs in the subgroup G5. The AtMYB in the subgroup S7 gathered with the IaMYB in the subgroup G9, which was defined as metabolism that mainly involved in phenylpropanoid pathway and flavanol biosynthesis. The four 3R-type IaMYB proteins (IaMYB39, IaMYB40, IaMYB92, and IaMYB120) and the five 3R-type AtMYB proteins were gathered in the subgroup G22. Intriguingly, the phylogenetic distance between the four 3R-type MYB proteins and the development-associated AtMYB proteins in the subgroup S25 was close. The 4R-type MYB protein from I. aquatica and A. thaliana were clustered in the same clade within subgroup G23. Besides, the ten AtMYB proteins were not clustered with any IaMYB proteins.

FIGURE 3
www.frontiersin.org

Figure 3. Phylogenetic relationships of MYB proteins in I. aquatica and A. thaliana. (A) The phylogenetic tree contains the 117 IaMYB proteins (including 112 2R-type IaMYB, 4 3R-type IaMYB, and one 4R-type IaMYB) and the 131 AtMYB proteins (including 125 2R-type AtMYB, five 3R-type AtMYBs, and one 4R-type AtMYB). (B) The number distribution of MYB proteins in each subgroup. (C) The number distribution of IaMYB proteins in each subgroup. The function of the AtMYB proteins in each subgroup were consistent with previous report in A. thaliana (Dubos et al., 2010). The green circles represent the IaMYB proteins. The red stars represent the AtMYB proteins. The red letters show the 3R-type MYB in I. aquatica and A. thaliana. The blue letters show the 4R-type MYB in I. aquatica and A. thaliana. The green, yellow and gray quadrangle represents the bootstrap value ranged from 80 to 100, 40 to 80, and 0 to 40, respectively.

The sequence features of the R2 and R3 MYB repeat

To clarify the sequence features of the R2 and R3 MYB repeats, we performed multiple sequence alignment using amino acid sequences of 112 2R-type IaMYB proteins. As shown in Figure 4A, the primary structure of the R2 MYB repeat was expressed as [–W–(X19)–W–(X19)–W–], where the W represents the highly conserved tryptophan residues (located on position 6, 26 and 46) and the X19 represents 19 regular interval sequences. In addition, some other highly conserved amino acids with frequency greater than 90% were detected at the different position of the R2 MYB repeat, such as G (position 4, 22 and 39), E (position 10), D (position 11), L (position 14, 44 and 50), R (position 37, 43 and 45), K (position 40), S (position 41), C (position 42), N (position 48), and P (position 52). As shown in Figure 4B, the primary structure of the R3 MYB repeat was expressed as [–F–(X18)–W–(X18)–W–], which contained one relatively conserved phenylalanine (position 54) and two highly conserved W (position 78 and 97) intercalated within 18 regular interval sequences. Furthermore, the results showed that some other amino acids appeared in the R3 MYB repeat with frequency higher than 90%, including E (position 63), G (position 75 and 87), I (position 81), A (position 82), R (position 88), T (position 89), D (position 90), N (position 91 and 95), and K (position 94).

FIGURE 4
www.frontiersin.org

Figure 4. The sequence characteristics of the R2 and R3 MYB repeats. (A) The sequence logo of the R2 MYB repeat and (B) the R3 MYB repeat. The letters within each stack represent different amino acids and their height indicates the relative frequency of each amino acid at the respective position. The percentage on the histograms show the frequency of the highest conserved amino acid within each stack. The red stars indicate the conserved the highly conserved tryptophan (W) or the relatively conserved phenylalanine (F). The arrows indicate the highly conserved amino acids with frequency greater than 90%.

Conserved motifs, and exon-intron structures of IaMYB genes

To investigate the sequence characteristics of the IaMYB TFs, their conserved motifs were analyzed by using MEME program. As a result, a total of ten conserved motifs (Motif 1–10) were detected in 117 IaMYB proteins (including 112 2R-type IaMYBs, four 3R-type IaMYBs, and one 4R-type IaMYBs). As shown in Figure 5A,B, IaMYB proteins in the same subgroup had similar motifs. The Motif 1 and Motif 2 were detected as the most common motif appeared in the IaMYB proteins. The Motif 8 and Motif 9 were specific to 3R-type IaMYBs in the subgroup G22. The Motif 4 only presented in the 3R- and 4R-type IaMYB proteins. The Motif 6 was only detected in the subgroup G20. The analysis of exon-intron structures showed that the number distribution of intron in the 117 IaMYB genes varied from 0 to 11 (Figure 5C and Supplementary Table S1). More than 67% of IaMYB genes had 2 introns and 3 exons. The 3R-type IaMYB92 in the subgroup G22 and the IaMYB154 in the subgroup G23 had the largest number of introns with 11, while the six IaMYB genes in the subgroup G20 (IaMYB25, IaMYB58, IaMYB59, IaMYB67, IaMYB73, and IaMYB162) were not disrupted by an intron, which were clustered with the defense responsive AtMYB proteins in the subgroup S22.

FIGURE 5
www.frontiersin.org

Figure 5. The phylogenetic tree, conserved motif, and exon-intron structures of the IaMYB genes. (A) The phylogenetic tree of the 117 IaMYB proteins (including 112 2R-type IaMYBs, four 3R-type IaMYBs, and one 4R-type IaMYB). (B) The conserved motif of the 117 IaMYB proteins. The rectangles with different colors represent the Motif 1–10; the black solid line represents non-conserved regions; the bar scale shows the length of the amino acids. (C) The exon-intron structures of the 117 IaMYB genes. The green ellipses represent the exons; the orange ellipses represent an untranslated region (UTR); the black solid lines among the ellipse represent the introns; the bar scale shows the length of the gene.

Analysis of gene duplication events of IaMYB genes

To understand the primary driven-force in the evolutionary process of IaMYB genes, the five gene duplication events (i.e., WGD, TD, PD, TRD, and DSD) were analyzed by the Dup Gen finder pipeline (Supplementary Table S5). As shown in Figure 6A, the number distribution of the duplicated genes in the five duplication modes varied greatly. The DSD mode had the largest number of duplicated genes with 139, followed by the WGD mode with 77. In addition, 35 duplicated genes were found in the TRD mode, and 5 duplicated genes were detected in the TD mode. The PD mode had the lowest number of IaMYB duplicated gene pairs with only two. The analysis of chromosome distribution showed that the duplicated genes of each gene duplication mode were unevenly spread on the different chromosomes (Figure 6B).

FIGURE 6
www.frontiersin.org

Figure 6. Gene duplication events of IaMYB genes. (A) The number distribution of the duplicated genes in the five duplication events. (B) Chromosome distribution of the duplicated genes in each event. The blue line, green line, purple line, yellow line, and red line linked with each chromosome show the duplicated genes originated from DSD, PD, TD, TRD, and WGD, respectively. (C–E) The Ka, Ks, and Ka/Ks distribution of the duplicated genes in the five duplication events.

To disclose the direction of evolution, we calculated the value of Ka (non-synonymous substitution rate), Ks (synonymous substitution rate) and Ka/Ks of IaMYB duplicated genes originated from the five duplication events. The results showed that the different modes of duplicated genes presented divergent Ka, Ks, and Ka/Ks distributions (Supplementary Table S5). Overall, the Ka values ranged from 0.035 to 0.86, and the Ks values varied from 0.053 to 4.74. The PD event had the highest median Ka value (0.536), followed by DSD (0.458), TRD (0.456), WGD (0.278), and TD (0.168) (Figure 6C). The rank of the median Ks values were as follows: TRD (2.237) > DSD (2.216) > WGD (1.321) > TD (0.252) (Figure 6D). As shown in Figure 6E, the Ka/Ks ratio of all the IaMYB gene pairs were lower than 1, indicating purifying selection played a vital role in the evolutionary process of IaMYB gene family.

Expression patterns of IaMYB genes in the root under Cd treatment

To illustrate the expression patterns of IaMYB genes in response to Cd stress, we quantified the transcript abundances of IaMYB genes by calculating the value of FPKM (fragments per kilobase per million mapped reads) of each gene based on the RNA-seq data (Supplementary Table S6). As shown in Figure 7A, a total of 36 differentially expressed genes (DEGs) were identified. Of these, more than 70% (26/36) of DEGs belonged to the 2R-type IaMYB. Moreover, 19 DEGs (including the seventeen 2R-type IaMYBs and two 1R-type IaMYBs) were up-regulated in the Cd-treated root, while 17 DEGs (including the nine 2R-type IaMYBs and eight 1R-type IaMYBs) were enhanced in the root of control group. According to the phylogenetic tree analysis, we found that most of DEGs were associated with defense (including drought, salt, hormone-mediated, light, wounding, and pathogen response), metabolism (including phenylpropanoid pathway, lignin biosynthesis), and development (including axillary meristem regulation, Lateral organ formation, and hypocotyl elongation) (Supplementary Table S7). The transcript abundances of four DEGs (IaMYB47, IaMYB86, IaMYB142, and IaMYB147) in the roots of the Cd-treated group were 5-fold higher than that of the control group, while five DEGs (IaMYB7, IaMYB110, IaMYB121, IaMYB173, and IaMYB176) were down-regulated more than five times in the root under Cd treatment.

FIGURE 7
www.frontiersin.org

Figure 7. The expression patterns of the 36 DEGs. (A) Heatmap showing expression patterns of the 36 DEGs. (B) The expression of the 12 randomly selected DEGs validated by qRT-PCR. The CK represents the root samples in the control group, whereas the Cd represents the root samples in the Cd-treated group. Left Y-axis and black histograms indicate the relative expression levels quantified by qRT-PCR. The right Y-axis and red lines indicate FPKM values obtained by RNA-seq. The statistical analysis was performed using the student’s t-test. Data shows mean ± SD of three replicates (ns: no significance, *p < 0.05, **p < 0.01, and ***p < 0.01).

To verify the accuracy and reliability of RNA-seq data, we randomly selected 12 DEGs for the relative quantification by using qRT-PCR. As shown in Figure 7B, the variation trend of FPKM obtained by RNA-seq datasets were consistent with the relative quantity of qRT-PCR, suggesting the RNA-seq data were accurate and reliable. Remarkably, according to the RNA-seq results together with the qRT-PCR analysis, we found that 2R-type IaMYB157 was dramatically enhanced in the roots under Cd stress.

Analysis of cis-acting regulatory elements of DEGs

To explore the potential regulatory factors of 36 DEGs, the cis-acting regulatory elements of their promoter regions were analyzed by using Plant CARE online program. As shown in Figure 8 and Supplementary Table S8, a total of 43 types of cis-acting elements were found in 36 DEGs, which were classified into four major categories, including light-responsive elements, phytohormone-responsive elements, stress-responsive elements, and plant growth and development-related elements. Of these, the light-responsive elements are prevalent with 22 types (including Box-4, TCT-motif, and GT1-motif), followed by the phytohormone-responsive elements with 11 types (including the abscisic acid-responsive elements such as ABRE; the methyl jasmonic acid-responsive elements such as CGTCA-motif and TGACG-motif; the zeatin-responsive elements such as O2-site; the auxin-responsive elements such as AuxRR-core; the salicylic acid-responsive element such as TCA element; the gibberellin-responsive element such as P-box, TATC-box and GARE-motif; the other phytohormone-related element such as AT-rich and SARE), the stress-responsive elements with 7 types (including the anaerobic responsive elements such as ARE; the drought-responsive element such as MBS; the cold-responsive elements such as LTR; the wound-responsive element such as WUN-motif). While, only two types of the plant growth and development-related elements were detected, namely CAT-box and circadian. Moreover, IaMYB9 had the largest number of light-responsive elements and phytohormone responsive elements, and IaMYB72 had the most stress-responsive elements. IaMYB53 and IaMYB90 had the most plant growth and development-related elements. Remarkably, the 2R-type IaMYB157 varied greatly under Cd stress, which harbored the abundant light-responsive element G-box and ABA-responsive element ABRE.

FIGURE 8
www.frontiersin.org

Figure 8. Cis-acting elements in the promoter region of the 36 DEGs. The histograms show the total number of the corresponding cis-acting elements in the promoter region of each DEG, whereas the orange dots show the total number of the DEGs containing the corresponding cis-acting elements. The heatmap was generated based on the number of the corresponding cis-acting elements in the promoter region of each DEG.

Discussion

Myeloblastosis TFs are also known as trans-acting elements, which play a crucial role in regulating plant growth and development, biotic and abiotic stress tolerance, and biosynthesis of secondary metabolites. The release of a great deal of plant genome data provides important sequence resources for the genome-wide identification of the MYB genes family in plants. At present, the different numbers of MYB genes have been validated in several plant species with various genome sizes, including A. thaliana (MYB genes: 197; genome size: 125 Mb) (Stracke et al., 2001; Chen et al., 2006), O. sativa (MYB genes: 155; genome size: 466 Mb) (Katiyar et al., 2012) Solanum lycopersicum (MYB genes: 127; genome size: 828.349 Mb) (Li et al., 2016), Solanum tuberosum (MYB genes: 217; genome size: 840 Mb) (Li et al., 2021) Morella rubra (MYB genes: 174; genome size: 313 Mb) (Cao et al., 2021), and Pyrus bretschneideri (MYB genes: 129; genome size: 512 Mb) (Cao et al., 2016). In this study, a total of 183 MYB genes in 550.05 Mb genome of I. aquatica were identified. These results indicated that the number of MYB genes in different plant species were divergent, which were not proportional to the genome size of plant species.

Generally, MYB TFs were classified into four different types namely 1R-type MYB, 2R-type MYB, 3R-type MYB, and 4R-type MYB based on the arrangement of 1–4 incomplete repeats in their DBDs [18]. Among these, the 2R-type MYB genes are predominant in plants, which are probably derived from an ancestral 3R-type MYB gene through loss of the imperfect MYB repeat R1 (Jiang et al., 2004; Katiyar et al., 2012). In conformity with these findings, the 2R-type IaMYB contained the highest number of IaMYB genes, with 61.2% (112/183) of the total MYB genes in I. aquatica. The 4R-type MYB gene is also known as “Atypical MYB gene,” which was detected in diverse plant species. For instance, one 4R-type MYB genes in A. thaliana (Katiyar et al., 2012), two in Gossypium hirsutum (Salih et al., 2016), and four in Brassica rapa (Saha et al., 2016). However, there was no 4R-type MYB gene detected in some plants, including rice (Katiyar et al., 2012), sesame (Mmadi et al., 2017), and pepper (Arce-Rodriguez et al., 2021). In contrast to the 2R-type IaMYBs, the 4R-type IaMYBs constitute the smallest subfamily of MYB TF superfamily in I. aquatica, which contained only one gene. In addition, a MYB gene with five Rs was identified in Arabidopsis (Katiyar et al., 2012) and pepper (Arce-Rodriguez et al., 2021). While, we did not detect any 5R-type MYB gene in I. aquatica. The MYB gene containing only one MYB repeat or two separated MYB repeats are defined as MYB-related gene or 1R-type MYB (Jiang et al., 2004; Islam et al., 2021). 1R-type IaMYB constituted the second largest subfamily of MYB TF superfamily in I. aquatica. This finding is inconsistent with previous reports in sesame and chickpea where the 1R-type MYB was the largest subfamily of MYB genes family (Ramalingam et al., 2015; Mmadi et al., 2017). Additionally, some plants have more 3R-type MYB genes, while some plants have fewer (Qing et al., 2019). Overall, these findings indicate that the expansion level of different type of MYB genes varied in diverse plant species, which may be related to their distinct evolutionary strategies. The versatile functions of most MYB TFs can be comprehensively predicted by performing systematic evolutionary analysis (Wang et al., 2021b). Here, the evolutionary analysis is consistent with the previous report on A. thaliana. Most of IaMYB proteins were clustered together with well-defined AtMYB proteins, providing an excellent reference to predict the biological functions of IaMYB genes. Whereas, ten AtMYB proteins did not fit into any subgroups of IaMYB proteins, indicating the proteins with similar function might be lost in the genome of I. aquatica during its evolutionary process.

The analysis of physicochemical properties are helpful for deciphering the functional properties of proteins. The theoretical isoelectric point (pI) of a protein refers to the pH value at which the total charge carried by the protein is zero, which is a vital parameter for protein purification. Here, we found that pI values of IaMYB proteins were similar to those of MYB proteins in previous reports, suggesting the acidic (pI > 7.0) or basic (pI < 7.0) nature of IaMYBs (Halligan, 2009; Mohanta et al., 2019; Qing et al., 2019). The aliphatic index (Ai) reflects the thermostability of proteins. The protein with Ai value greater than 71 is considered thermostable (Hoda et al., 2021). Here, we found that 57 of 183 IaMYB TFs have a value of Ai greater than 71, indicating a thermostable nature of those IaMYBs. The instability index (Ii) reflects the stability of protein, and protein with Ii value less than 40 is considered stable. Consistent with previous reports, most of IaMYBs (171/183) were considered unstable (Li et al., 2022). The grand average of hydropathicity (GRAVY) reflects the hydrophobic or hydrophilic character of proteins. Proteins are referred to as hydrophobic if their GRAVY value is greater than zero, while hydrophilic if it is less than zero. In this study, we found that all IaMYB proteins had negative GRAVY values, suggesting those protein are soluble, an important nature that is necessary for TFs (Katiyar et al., 2012). As TFs, MYB proteins are mostly localized in the nucleus, where they play a crucial role in regulating downstream gene expression. Nevertheless, some MYB proteins were localized in the cytoplasm, mitochondria, or endoplasmic reticulum, and their roles in these organelles need to be further elucidated (Avila et al., 1993; Zhang et al., 2021; Sabir et al., 2022). Our results showed that all IaMYB proteins were localized in the nucleus. These findings implied that the MYB genes may performs their versatile functions in different organelles of plants. As mentioned above (Figure 4), IaMYB genes in the identical subgroup generally possessed similar motif compositions, which may share a similar function (Wang et al., 2021a). Remarkably, some IaMYB genes contained specific motif, suggesting these motifs may impart specific biological functions to those IaMYB genes. Understanding the exon-intron organizations within gene families allows us to obtain more clues about their evolutionary trajectory. Although the gene structures of each subgroup exist differences, most of IaMYB genes clustered together share similar exon-intron structural patterns, implying pivotal roles of these features in their evolutionary process and functional divergence. As reported, the Trp residues within MYB DBDs play a key role in DNA-binding specificity (Zhang et al., 2021). Here, we found that evolutionarily conserved Trp residues are regularly distributed between variable interval sequences within R2 and R3 MYB repeats of the 2R-type IaMYB, which is in line with previous studies. Noteworthy, the first Trp residue within R3 repeat was replaced by other amino acids, which may affect the sequence-specific binding of the MYB TFs.

The gene duplication events are the driving force for the expansion of gene families (Wang et al., 2012). Our results demonstrated that the DSD and WGD modes play key roles in the expansion of IaMYB gene families, which is similar to previous reports in different plants (Cannon et al., 2004; Liu et al., 2014; Sun et al., 2019). Furthermore, the analysis of Ka/Ks ratio showed that IaMYB duplicated gene pairs endured purifying selection, suggesting highly conserved evolution of IaMYB genes. The analysis of syntenic relationships between IaMYBs and three representative plant species including I. batatas (belongs to the family Convolvulaceae, dicot), A. thaliana (belongs to the family Brassicaceae, dicot), and O. sativa (belongs to the family Poaceae, monocot) confirmed that the same type of angiosperms from the identical family have closer evolutionary distance.

Heavy metal contamination has become a critical environmental problem. The accumulation of heavy metals in plants not only hampers plant yield and productivity, but threatens human health throughout the food chain (Alam et al., 2022). Cd is one of the most toxic heavy metal contaminants due to its long biological half-life and non-biodegradability (Cirmi et al., 2021; Altaf et al., 2022). Previous studies revealed that MYB proteins play a vital role in plant response to heavy metal stress. For instance, R2R3 AtMYB49 can directly or indirectly regulate the expression of HIPP and IRT1 proteins to enhance plant Cd accumulation, which also participate in ABA-mediated repression of Cd accumulation in plants by interacting with ABI5 (Zhang et al., 2019). The BnMYB2 gene from Boehmeria niveaw was greatly enhanced by Cd treatment, which has been further demonstrated to improve plant Cd tolerance (Zhu et al., 2020). The gene encoding the heavy metal transporter (natural resistance associated macrophage protein, NRAMP) harbors MYB binding sites in their promoter region, indicating MYB TF may modulate its expression and thereby affecting heavy metal uptake and transportation in plants (Tian et al., 2021). Like in Arabidopsis, approximately 20% (36/183) of IaMYB genes were dramatically changed when the I. aquatica seedlings were subjected to Cd treatment (Chen et al., 2006). Of these, more than 70% (26/36) of significantly altered genes belonged to the 2R-type IaMYBs, inferring their significant effect on plant response to Cd stress. Analysis of cis-acting elements within the promoter region of the gene helps us to better understand the plant stress response. Studies showed that the light-responsive element G-box had the role of stress-responsive element, and the ABA-responsive element ABRE act as an important regulator of ABA-mediated stress response in plants (Abdullah-Zawawi et al., 2021). Here, we found that the 2R-type IaMYB157 with abundant G-box and ABRE elements in its promoter region exhibited very strong response to Cd stress, indicating the possible function of IaMYB157 in plant response to Cd stress.

Conclusion

In conclusion, 183 MYB genes in I. aquatica were identified at whole-genome level, which were classified into four types (including 66 1R-type MYB, 112 2R-type MYB, four 3R-type MYB, and one 4R-type MYB) and categorized into 23 subfamilies (G1-G23). Bioinformatics analysis was conducted including chromosome distribution, syntenic relationship, sequence features, gene duplication events, and selection pressure. Furthermore, RNA-seq and qRT-PCR results demonstrated that approximately 20% of IaMYB genes had a significant role in I. aquatica roots under Cd stress. Remarkably, the 2R-type IaMYB157 with abundant light-responsive element G-box and ABA-responsive element ABRE in its promoter region exhibited very strong response to Cd stress. Our findings provide an important candidate IaMYB gene for further deciphering the molecular regulatory mechanism in plant with respect to Cd stress.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary material.

Author contributions

WB and ZL: conceptualization. WB, ZL, and YZ: methodology. ZL and YZ: software, validation, and writing—original draft preparation. ZL, YZ, MA, YH, GZ, XL, JZ, WM, and ZW: investigation. WB: funding acquisition, writing—review and editing, and supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Hainan Provincial Natural Science Foundation of China (320MS008 and 2019RC059) and the Initial Funds for the High-level Talents of Hainan University (KYQD(ZR)1935).

Acknowledgments

Special thanks to the reviewers for their precious time and valuable suggestions.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.979988/full#supplementary-material

Footnotes

References

Abdullah-Zawawi, M. R., Ahmad-Nizammuddin, N. F., Govender, N., Harun, S., Mohd-Assaad, N., and Mohamed-Hussein, Z. A. (2021). Comparative genome-wide analysis of WRKY, MADS-box and MYB transcription factor families in Arabidopsis and rice. Sci. Rep. 11, 19678. doi: 10.1038/s41598-021-99206-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Akbudak, M. A., Filiz, E., and Kontbay, K. (2018). DREB2 (dehydration-responsive element-binding protein 2) type transcription factor in sorghum (Sorghum bicolor): genome-wide identification, characterization and expression profiles under cadmium and salt stresses. Biotech 8, 426. doi: 10.1007/s13205-018-1454-1

CrossRef Full Text | Google Scholar

Alam, I., Wu, X. T., and Ge, L. F. (2022). Comprehensive genomic survey, evolution, and expression analysis of GIF gene family during the development and metal ion stress responses in soybean. Plants-Basel 11, 570. doi: 10.3390/plants11040570

PubMed Abstract | CrossRef Full Text | Google Scholar

Altaf, M. A., Shahid, R., Ren, M. X., Naz, S., Altaf, M. M., Khan, L. U., et al. (2022). Melatonin mitigates cadmium toxicity by promoting root architecture and mineral homeostasis of tomato genotypes. J. Soil Sci.Plant Nutr. 22, 1112–1128. doi: 10.1007/s42729-021-00720-9

CrossRef Full Text | Google Scholar

Ambawat, S., Sharma, P., Yadav, N. R., and Yadav, R. C. (2013). MYB transcription factor genes as regulators for plant responses: an overview. Physiol. Mol. Biol. Pla. 19, 307–321. doi: 10.1007/s12298-013-0179-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Arce-Rodriguez, M. L., Martinez, O., and Ochoa-Alejo, N. (2021). Genome-wide identification and analysis of the MYB transcription factor gene family in chili pepper (capsicum spp.). Int. J. Mol. Sci. 22, 2229. doi: 10.3390/ijms22052229

CrossRef Full Text | Google Scholar

Avila, J., Nieto, C., Canas, L., Benito, M. J., and Paz-Ares, J. (1993). Petunia hybrida genes related to the maize regulatory C1 gene and to animal myb proto-oncogenes. Plant J. 3, 553–562. doi: 10.1046/j.1365-313X.1993.03040553.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Cannon, S. B., Mitra, A., Baumgarten, A., Young, N. D., and May, G. (2004). The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 4:10. doi: 10.1186/1471-2229-4-10

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao, Y., Han, Y., Li, D., Lin, Y., and Cai, Y. (2016). MYB transcription factors in Chinese pear (Pyrus bretschneideri Rehd.): genome-wide identification, classification, and expression profiling during fruit development. Front. Plant Sci. 7:577. doi: 10.3389/fpls.2016.00577

CrossRef Full Text | Google Scholar

Cao, Y., Jia, H., Xing, M., Jin, R., Grierson, D., Gao, Z., et al. (2021). Genome-wide analysis of MYB gene family in Chinese bayberry (Morella rubra) and identification of members regulating flavonoid biosynthesis. Front. Plant Sci. 12:691384. doi: 10.3389/fpls.2021.691384

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, C., Chen, H., Zhang, Y., Thomas, H. R., Frank, M. H., He, Y., et al. (2020, 2020). TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13, 1194–1202. doi: 10.1016/j.molp.2020.06.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, Y. H., Yang, X. Y., He, K., Liu, M. H., Li, J. G., Gao, Z. F., et al. (2006). The MYB transcription factor superfamily of Arabidopsis: expression analysis and phylogenetic comparison with the rice MYB family. Plant Mol. Biol. 60, 107–124. doi: 10.1007/s11103-005-2910-y

CrossRef Full Text | Google Scholar

Cirmi, S., Maugeri, A., Micali, A., Marini, H. R., Puzzolo, D., Santoro, G., et al. (2021). Cadmium-induced kidney injury in mice is counteracted by a flavonoid-rich extract of bergamot juice, alone or in association with curcumin and resveratrol, via the enhancement of different defense mechanisms. Biomedicine 9, 1797. doi: 10.3390/biomedicines9121797

PubMed Abstract | CrossRef Full Text | Google Scholar

Dubos, C., Stracke, R., Grotewold, E., Weisshaar, B., Martin, C., and Lepiniec, L. (2010). MYB transcription factors in Arabidopsis. Trends Plant Sci. 15, 573–581. doi: 10.1016/j.tplants.2010.06.005

CrossRef Full Text | Google Scholar

Guo, Z., Lv, J., Zhang, H., Hu, C., Qin, Y., Dong, H., et al. (2022). Red and blue light function antagonistically to regulate cadmium tolerance by modulating the photosynthesis,antioxidant defense system and Cd uptake in cucumber (Cucumis sativus L.). J. Hazard. Mater. 429:128412. doi: 10.1016/j.jhazmat.2022.128412

CrossRef Full Text | Google Scholar

Halligan, B. D. (2009). ProMoST: a tool for calculating the pI and molecular mass of phosphorylated and modified proteins on two-dimensional gels. Methods Mol. Biol. 527, 283–298. doi: 10.1007/978-1-60327-834-8_21

PubMed Abstract | CrossRef Full Text | Google Scholar

Hao, Y. Y., Bao, W. L., Li, G. L., Gagoshidze, Z., Shu, H. Y., Yang, Z., et al. (2021). The chromosome-based genome provides insights into the evolution in water spinach. Sci Hortic-Amsterdam 289:110501. doi: 10.1016/j.scienta.2021.110501

CrossRef Full Text | Google Scholar

He, Z., Zhang, H., Gao, S., Lercher, M. J., Chen, W. H., and Hu, S. (2016). Evolview v2: an online visualization and management tool for customized and annotated phylogenetic trees. Nucleic Acids Res. 44, W236–W241. doi: 10.1093/nar/gkw370

PubMed Abstract | CrossRef Full Text | Google Scholar

Hoda, A., Tafaj, M., and Sallaku, E. (2021). In silico structural, functional and phylogenetic analyses of cellulase from Ruminococcus albus. J. Genet. Eng. Biotechnol. 19, 58. doi: 10.1186/s43141-021-00162-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Islam, K., Rawoof, A., Ahmad, I., Dubey, M., Momo, J., and Ramchiary, N. (2021). Capsicum chinense MYB transcription factor genes: identification, expression analysis, and their conservation and diversification With other Solanaceae genomes. Front. Plant Sci. 12:721265. doi: 10.3389/fpls.2021.721265

PubMed Abstract | CrossRef Full Text | Google Scholar

Janicka-Russak, M., Kabala, K., Burzynski, M., and Klobus, G. (2008). Response of plasma membrane H+-ATPase to heavy metal stress in Cucumis sativus roots. J. Exp. Bot. 59, 3721–3728. doi: 10.1093/jxb/ern219

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, C., Gu, J., Chopra, S., Gu, X., and Peterson, T. (2004). Ordered origin of the typical two- and three-repeat Myb genes. Gene 326, 13–22. doi: 10.1016/j.gene.2003.09.049

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, C. K., and Rao, G. Y. (2020). Insights into the diversification and evolution of R2R3-MYB transcription factors in plants. Plant Physiol. 183, 637–655. doi: 10.1104/pp.19.01082

PubMed Abstract | CrossRef Full Text | Google Scholar

Katiyar, A., Smita, S., Lenka, S. K., Rajwanshi, R., Chinnusamy, V., and Bansal, K. C. (2012). Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis. BMC Genom. 13, 544. doi: 10.1186/1471-2164-13-544

PubMed Abstract | CrossRef Full Text | Google Scholar

Krogh, A., Larsson, B., von Heijne, G., and Sonnhammer, E. L. (2001). Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 305, 567–580. doi: 10.1006/jmbi.2000.4315

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, S., Stecher, G., Li, M., Knyaz, C., Tamura, K., and Mega, X. (2018). Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549. doi: 10.1093/molbev/msy096

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, A., Subrahmanyam, G., Mondal, R., Cabral-Pinto, M. M. S., Shabnam, A. A., Jigyasu, D. K., et al. (2021). Bio-remediation approaches for alleviation of cadmium contamination in natural resources. Chemosphere 268:128855. doi: 10.1016/j.chemosphere.2020.128855

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., Liang, J., Zeng, X., Guo, H., Luo, Y., Kear, P., et al. (2021). Genome-wide analysis of MYB gene family in potato provides insights into tissue-specific regulation of anthocyanin biosynthesis. Hortic. Plant J. 7, 129–141. doi: 10.1016/j.hpj.2020.12.001

CrossRef Full Text | Google Scholar

Li, Z., Peng, R., Tian, Y., Han, H., Xu, J., and Yao, Q. (2016). Genome-wide identification and analysis of the MYB transcription factor superfamily in Solanum lycopersicum. Plant Cell Physiol. 57, 1657–1677. doi: 10.1093/pcp/pcw091

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J., Zhou, H., Xiong, C., Peng, Z., Du, W., Li, H., et al. (2022). Genome-wide analysis R2R3-MYB transcription factors in Xanthoceras sorbifolium Bunge and functional analysis of XsMYB30 in drought and salt stresses tolerance. Ind. Crop. Prod. 178:114597. doi: 10.1016/j.indcrop.2022.114597

CrossRef Full Text | Google Scholar

Lipsick, J. S. (2010). The C-MYB story--is it definitive? Proc. Natl. Acad. Sci. U. S. A. 107, 17067–17068. doi: 10.1073/pnas.1012402107

PubMed Abstract | CrossRef Full Text | Google Scholar

Lipsick, J. S., and Wang, D. M. (1999). Transformation by v-Myb. Oncogene 18, 3047–3055. doi: 10.1038/sj.onc.1202745

CrossRef Full Text | Google Scholar

Liu, C., Wang, X., Xu, Y., Deng, X., and Xu, Q. (2014). Genome-wide analysis of the R2R3-MYB transcription factor gene family in sweet orange (Citrus sinensis). Mol. Biol. Rep. 41, 6769–6785. doi: 10.1007/s11033-014-3563-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, Z., Qiu, W., Jin, K., Yu, M., Han, X., He, X., et al. (2022). Identification and analysis of bZIP family genes in sedum plumbizincicola and their potential roles in response to cadmium stress. Front. Plant Sci. 13:859386. doi: 10.3389/fpls.2022.859386

PubMed Abstract | CrossRef Full Text | Google Scholar

Mmadi, M. A., Dossa, K., Wang, L. H., Zhou, R., Wang, Y. Y., Cisse, N., et al. (2017). Functional characterization of the versatile MYB gene family uncovered their important roles in plant development and responses to drought and waterlogging in sesame. Genes-Basel 8, 362. doi: 10.3390/genes8120362

CrossRef Full Text | Google Scholar

Mohanta, T. K., Khan, A., Hashem, A., Abd-Allah, E. F., and Al-Harrasi, A. (2019). The molecular mass and isoelectric point of plant proteomes. BMC Genom. 20, 631. doi: 10.1186/s12864-019-5983-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Paz-Ares, J., Ghosal, D., Wienand, U., Peterson, P. A., and Saedler, H. (1987). The regulatory c1 locus of Zea mays encodes a protein with homology to myb proto-oncogene products and with structural similarities to transcriptional activators. EMBO J. 6, 3553–3558. doi: 10.1002/j.1460-2075.1987.tb02684.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiao, X., Li, Q., Yin, H., Qi, K., Li, L., Wang, R., et al. (2019). Gene duplication and evolution in recurring polyploidization–diploidization cycles in plants. Genome Biol. 20, 38. doi: 10.1186/s13059-019-1650-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Qing, J., Dawei, W., Zhou, J., Yulan, X., Bingqi, S., and Fan, Z. (2019). Genome-wide characterization and expression analyses of the MYB superfamily genes during developmental stages in Chinese jujube. Peer J. 7:e6353. doi: 10.7717/peerj.6353

PubMed Abstract | CrossRef Full Text | Google Scholar

Ramalingam, A., Kudapa, H., Pazhamala, L. T., Garg, V., and Varshney, R. K. (2015). Gene expression and yeast two-hybrid studies of 1R-MYB transcription factor mediating drought stress response in chickpea (Cicer arietinum L.). Front. Plant Sci. 6:1117. doi: 10.3389/fpls.2015.01117

CrossRef Full Text | Google Scholar

Sabir, I. A., Manzoor, M. A., Shah, I. H., Liu, X., Zahid, M. S., Jiu, S., et al. (2022). MYB transcription factor family in sweet cherry (Prunus avium L.): genome-wide investigation, evolution, structure, characterization and expression patterns. BMC Plant Biol. 22:2. doi: 10.1186/s12870-021-03374-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Saha, G., Park, J. I., Ahmed, N. U., Kayum, M. A., Kang, K. K., and Nou, I. S. (2016). Characterization and expression profiling of MYB transcription factors against stresses and during male organ development in Chinese cabbage (Brassica rapa ssp. pekinensis). Plant Physiol. Biochem. 104, 200–215. doi: 10.1016/j.plaphy.2016.03.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Saikumar, P., Murali, R., and Reddy, E. P. (1990). Role of tryptophan repeats and flanking amino acids in MYB-DNA interactions. Proc. Natl. Acad. Sci. U. S. A. 87, 8452–8456. doi: 10.1073/pnas.87.21.8452

PubMed Abstract | CrossRef Full Text | Google Scholar

Salih, H., Gong, W., He, S., Sun, G., Sun, J., and Du, X. (2016). Genome-wide characterization and expression analysis of MYB transcription factors in Gossypium hirsutum. BMC Genet. 17, 129. doi: 10.1186/s12863-016-0436-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Sapara, K. K., Khedia, J., Agarwal, P., Gangapur, D. R., and Agarwal, P. K. (2019). SbMYB15 transcription factor mitigates cadmium and nickel stress in transgenic tobacco by limiting uptake and modulating antioxidative defence system. Funct. Plant Biol. 46, 702–714. doi: 10.1071/FP18234

PubMed Abstract | CrossRef Full Text | Google Scholar

Sarwar, N., Saifullah Malhi, S. S., Zia, M. H., Naeem, A., Bibi, S., and Farid, G. (2010). Role of mineral nutrition in minimizing cadmium accumulation by plants. J. Sci. Food Agric. 90, 925–937. doi: 10.1002/jsfa.3916

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, C., Huang, Y. Y., He, C. T., Zhou, Q., Chen, J. X., Tan, X., et al. (2017). Comparative analysis of cadmium responsive microRNAs in roots of two Ipomoea aquatica Forsk. Cultivars with different cadmium accumulation capacities. Plant Physiol. Biochem. 111, 329–339. doi: 10.1016/j.plaphy.2016.12.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Sheng, Y. B., Yan, X. X., Huang, Y., Han, Y. Y., Zhang, C., Ren, Y. B., et al. (2019). The WRKY transcription factor, WRKY13, activates PDR8 expression to positively regulate cadmium tolerance in Arabidopsis. Plant Cell Environ. 42, 891–903. doi: 10.1111/pce.13457

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, P., Singh, I., and Shah, K. (2018). Reduced activity of nitrate reductase under heavy metal cadmium stress in rice: an in silico answer. Front. Plant Sci. 9:1948. doi: 10.3389/fpls.2018.01948

CrossRef Full Text | Google Scholar

Stracke, R., Werber, M., and Weisshaar, B. (2001). The R2R3-MYB gene family in Arabidopsis thaliana. Curr. Opin. Plant Biol. 4, 447–456. doi: 10.1016/S1369-5266(00)00199-0

CrossRef Full Text | Google Scholar

Sun, N., Liu, M., Zhang, W., Yang, W., Bei, X., Ma, H., et al. (2015). Bean metal-responsive element-binding transcription factor confers cadmium resistance in tobacco. Plant Physiol. 167, 1136–1148. doi: 10.1104/pp.114.253096

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, W. J., Ma, Z. T., Chen, H., and Liu, M. Y. (2019). MYB gene family in potato (Solanum tuberosum L.): genome-wide identification of hormone-responsive reveals their potential functions in growth and development. Int. J. Mol. Sci. 20, 4847. doi: 10.3390/ijms20194847

CrossRef Full Text | Google Scholar

Tian, W., He, G., Qin, L., Li, D., Meng, L., Huang, Y., et al. (2021). Genome-wide analysis of the NRAMP gene family in potato (Solanum tuberosum): identification, expression analysis and response to five heavy metals stress. Ecotox. Environ. Safe 208:111661. doi: 10.1016/j.ecoenv.2020.111661

CrossRef Full Text | Google Scholar

Tian, W., Huang, Y., Li, D., Meng, L., He, T., and He, G. (2022). Identification of StAP2/ERF genes of potato (Solanum tuberosum) and their multiple functions in detoxification and accumulation of cadmium in yeast: implication for genetic-based phytoremediation. Sci. Total Environ. 810:152322. doi: 10.1016/j.scitotenv.2021.152322

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Fang, W., Yang, Z., Yuan, J., Zhu, Y., and Yu, H. (2007). Inter- and intraspecific variations of cadmium accumulation of 13 leafy vegetable species in a greenhouse experiment. J. Agric. Food Chem. 55, 9118–9123. doi: 10.1021/jf0716432

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, A., Liang, K., Yang, S., Cao, Y., Wang, L., Zhang, M., et al. (2021a). Genome-wide analysis of MYB transcription factors of Vaccinium corymbosum and their positive responses to drought stress. BMC Genom. 22, 565. doi: 10.1186/s12864-021-07850-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y., Tang, H., Debarry, J. D., Tan, X., Li, J., Wang, X., et al. (2012). MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49. doi: 10.1093/nar/gkr1293

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y., Zhang, Y., Fan, C., Wei, Y., Meng, J., Li, Z., et al. (2021b). Genome-wide analysis of MYB transcription factors and their responses to salt stress in Casuarina equisetifolia. BMC Plant Biol. 21, 328. doi: 10.1186/s12870-021-03083-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, Z., Liu, X., He, X., Xu, L., Huang, Y., Shao, H., et al. (2017). The soybean basic helix-loop-helix transcription factor ORG3-Like enhances cadmium tolerance via increased iron and reduced cadmium uptake and transport from roots to shoots. Front. Plant Sci. 8:1098. doi: 10.3389/fpls.2017.01098

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, T., Cui, Z., Li, Y., Kang, Y., Song, X., Wang, J., et al. (2021). Genome-wide identification and expression analysis of MYB transcription factor superfamily in Dendrobium catenatum. Front. Genet. 12:714696. doi: 10.3389/fgene.2021.714696

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, P., Wang, R., Ju, Q., Li, W., Tran, L. P., and Xu, J. (2019). The R2R3-MYB transcription factor MYB49 regulates cadmium accumulation. Plant Physiol. 180, 529–542. doi: 10.1104/pp.18.01380

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, S. J., Shi, W. J., Jie, Y. C., Zhou, Q. M., and Song, C. B. (2020). A MYB transcription factor, BnMYB2, cloned from ramie (Boehmeria nivea) is involved in cadmium tolerance and accumulation. PLoS One 15:e0233375. doi: 10.1371/journal.pone.0233375

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: cis-acting element, gene duplication mode, gene expression, heavy metal stress, transcription factor

Citation: Liu Z, Zhang Y, Altaf MA, Hao Y, Zhou G, Li X, Zhu J, Ma W, Wang Z and Bao W (2022) Genome-wide identification of myeloblastosis gene family and its response to cadmium stress in Ipomoea aquatica. Front. Plant Sci. 13:979988. doi: 10.3389/fpls.2022.979988

Received: 28 June 2022; Accepted: 29 July 2022;
Published: 23 August 2022.

Edited by:

Nasim Ahmad Yasin, University of the Punjab, Pakistan

Reviewed by:

Muhammad Aamir Manzoor, Anhui Agricultural University, China
Aqeel Ahmad, University of Florida, United States

Copyright © 2022 Liu, Zhang, Altaf, Hao, Zhou, Li, Zhu, Ma, Wang and Bao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wenlong Bao, wlbao@hainanu.edu.cn

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.