- 1Guangxi Key Laboratory of Medicinal Resources Protection and Genetic Improvement, Guangxi Botanical Garden of Medicinal Plants, Nanning, China
- 2Guangxi Engineering Research Center of TCM Resource Intelligent Creation, Guangxi Botanical Garden of Medicinal Plants, Nanning, China
- 3College of Pharmacy, Guangxi Medical University, Nanning, China
Mesona chinensis Benth (MCB) (or Platostoma palustre or Platostoma chinense) is an important edible and medicinal plant in China. However, the mitochondrial genome (mitogenome, or mtDNA) of MCB has not been characterized or reported yet. In this study, we first sequenced and characterized the complete mitogenome of MCB. The MCB mitogenome was 494,599 bp in length and encoded 59 genes containing 37 protein-coding genes (PCGs), 19 tRNAs, and 3 rRNAs. Gene transfer analysis revealed that a total of 12 transfer segments with more than 93% identity (total length of 25,427 bp) were detected in the MCB mitogenome. Simple sequence repeats (SSR) analysis showed that 212 simple sequence repeats (SSR) were identified. Repeat sequence analysis revealed 305 repeat sequences (158 forward and 147 palindromic repeats) ranging from 30 bp to 48,383 bp and the 30–39 bp repeats were the majority type. Relative synonymous codon usage (RSCU) analysis uncovered that in total, 9,947 codons were encoding the protein-coding genes (PCGs). Serine (909, 9.1%) and leucine (879, 8.8%) were the two most abundant amino acids, while terminator (32, .3%) was the least abundant amino acid. Ka/Ks analysis indicated that almost all genes were subject to purification selection, except ccmB. Analysis of Lamiaceae mitogenomes constitution revealed that atpB and atpE were unique to the Rotheca serrata and Salvia miltiorrhiza mitogenomes. mttB gene loss was unique to the Boea hygrometrica mitogenome. The core fragments of the Lamiaceae mitogenomes harbored a higher GC content than the specific and variable fragments. In addition, phylogenetic analysis revealed that MCB was closely related to Salvia miltiorrhiza based on the mitogenomes. The current study provided valuable genomic resources for understanding and utilizing this important medicinal plant in the future.
Introduction
Mitochondria, semi-autonomous organelles in eukaryotic cells, have relatively independent genetic systems (Mitochondrial DNA, mtDNA) (Gray, 2012; Chen et al., 2017) and have always been a hot topic in the field of molecular biology. Mitogenome is widely used in system and evolutionary biology, population genetics, and conservation biology (Zhong et al., 2020). The mitogenomes of higher plants vary dramatically in size (Backert et al., 1997), even among species of the same genus, ranging from 66 Kbp of Viscum scurruloideum (Skippington et al., 2015) to 11.3 Mbp of Silene conica (Sloan et al., 2012). And the number of genes in plant mitogenomes ranges from 19 (Viscum album) to 221 (Capsicum annuum, Solanaceae) (Jo et al., 2011; Petersen et al., 2015). Despite plant mitogenomes differ in size and number of genes, they encode roughly the same gene products, including rRNAs, tRNAs, and protein subunits required for respiratory chain complexes (Li et al., 2011; Bi et al., 2020). Plant mitogenomes are usually large in comparison with animal or fungal mitogenomes and contain fewer genes than their plastid counterpart and are also complex due to the presence of a large number of non-coding regions and the introgression of foreign DNA from the nuclear or chloroplast genome (Alverson et al., 2010). Plant mitogenome is characterized by dramatic and rapid structural changes, which makes the conventional modes of sequencing and assembly less effective. Currently, there are two strategies to obtain a complete plant mitogenome. One is to separately isolate mitochondria first, then extract mitochondrial DNA and sequence the mitochondrial DNA (Wang et al., 2008). The other one is to directly extract the total genomic DNA without isolation and purification of mitochondria and then screen out the mitochondrial genome for assembly based on a sequencing library (Férandon et al., 2013; Song et al., 2019). As the assembly techniques improve and the cost of sequencing reduces, the number of plant mitogenomes should raise rapidly (Yuan et al., 2016).
Mesona chinensis Benth (MCB) is a traditional Chinese herbal medicine, which has been used for thousands of years in China, India, Indonesia, Malaysia, Philippines, Thailand, etc (Lin et al., 2018). MCB is an annual herbaceous plant and distributes in South China and Southeast Asian countries (Huang et al., 2019; Ren et al., 2019; Tang et al., 2020). In China, it is one of the most important medicinal and edible plants (Tang et al., 2022a). MCB belongs to the Lamiaceae family, which is the largest family-level clade in the Lamiales and consists of more than 7,000 species of 236 genera globally (Li et al., 2016; Tang et al., 2022b). However, compared with a large number of Lamiaceae species, the number of mitogenomes sequenced within this family is still minimal. To date, the mitogenomes of Lamiaceae plants that have been released and uploaded to NCBI (National Center for Biotechnology Information) only include Ajuga reptans, Dracocephalum moldavica, R. serrata, S. miltiorrhiza, S. splendens, etc. Therefore, it is necessary to obtain the MCB mitogenome to enrich the database of plant mitogenome. In addition, in recent years, although there have been more and more research reports on MCB, suggesting that it is getting more and more attention, the research foundation of MCB still needs to be further strengthened, especially the data on the molecular biology of this species need to be further explored.
Hence, in this study, we sequenced and characterized the complete mitogenome of MCB using the Next-Generation Sequencing and Nanopore technologies, and performed the phylogenetic analysis based on mtDNA. To the best of our knowledge, this is the first assembly of the MCB mitogenome, which provides valuable information for the molecular biology research of this species and also enriches the database of plant mitogenome.
Materials and methods
DNA sequencing and genome assembly
M. chinensis Benth (MCB) was planted in Guangxi Botanical Garden of Medicinal Plants (N 22°51′, E 108°19′), Nanning city, China. Total genomic DNA isolation was carried out using a modified cetyltrimethylammonium bromide (CTAB) (Allen et al., 2006) method and applied to 500 bp paired-end library construction using the NEBNext® Ultra™ DNA Library Prep Kit for Illumina sequencing. Sequencing was performed on the Illumina NovaSeq 6,000 platform. About 4.1 Gb of raw data from MCB were produced with 150 bp paired-end read lengths. For Oxford Nanopore library construction and sequencing, purified DNA was sheared to 20-kb fragments using a Covaris g-tube (Covaris) and purified with AMPure beads. The DNA concentration was measured with a Qubit fluorometer and end-repaired using the NEBNext End Repair Module. The final library was sequenced using the MinION device.
GetOrganelle v1.6.4 (Jin et al., 2020) was employed for de novo assembly using the mitogenome of the closely related species Salvia miltiorrhiza (GenBank Acc. No. KF177345.1) as the seed sequence. We extracted the potential mitochondrion reads from the pool of illumina reads using BLAST searches against the mitogenome of S. miltiorrhiza and the GetOrganelle results. Then the mitochondrial illumina reads were obtained to perform mitogenome de novo assembly using the SPAdes-3.14.0 (Antipov et al., 2016) package, which generated ∼50 contigs of mitogenome.
Clean Nanopore long-reads were aligned against the GetOrganelle and SPAdes assembled scaffolds using the BWA men program. All aligned Nanopore reads were extracted to perform self-correction and mitogenome de novo assembly using the Canu v2.0 (Koren et al., 2017) package, followed by error correction using the racon v1.4.3 (Fang et al., 2020) and pilon v1.21. The Nanopore assembly sequences were aligned against a nucleotide sequence database to remove the chloroplast and contaminating sequences and then were checked if the sequences have overlaps and connections between them (Li et al., 2021). Then the retained mitochondrial long contigs were connected into a ring. Finally, the complete MCB mitogenome was obtained after polishing with both Nanopore long reads and Illumina short reads.
Genome annotation
The online GeSeq tool with default parameters was used to predict the protein-coding genes, transfer RNA (tRNA) genes, and ribosome RNA (rRNA) genes (Tillich et al., 2017; Tang et al., 2021b). The position of each coding gene was determined by BLAST (Altschul et al., 1990) searches against the reference mitochondrial genes. Manual corrections of genes for start/stop codons and intron/exon boundaries were conducted in SnapGene Viewer by comparing them with the reference genome. The OrganellarGenomeDRAW (https://ogdraw.mpimp-golm.mpg.de/cgi-bin/ogdraw.pl) (Greiner et al., 2019) tool was used to draw the mitochondrial genome map of MCB. Functional annotations were conducted using sequence-similarity blast searches with a typical cut-off E-value of 10−5 against several publicly available protein databases: Swiss-Prot, NCBI non-redundant (Nr) protein database, Clusters of Orthologous Groups (COGs), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Gene Ontology (GO) terms (Fang et al., 2020).
Gene transfer between the mitogenome and chloroplast genome
The MCB mitogenome was compared with its chloroplast genome (Tang et al., 2021b) by BLASTn and the selected parameter E value was less than 1e−5.
Repeat sequence analysis
REPuter (Kurtz et al., 2001) (https://bibiserv.techpak.uni-bielefeld.de/computer/) software was used for repeat sequence analysis. The MISA-MIcroSAtellite identification tool (Beier et al., 2017) (https://pgrc.ipk-gatersleben.de/misa/) was employed to identify the SSR sequences.
Collinearity analysis
The genome alignment between the assembly sequences and other sequences was performed using nucmer (4.0.0beta2) software with the --maxpatch parameter to generate dot plot. The BLASTN (2.10.1 +) software (-word_size is 7, E-value is 1e−5, and the fragments with alignment length greater than 300 bp are screened) was used to sequentially compare the assembled species with the selected species and draw a collinearity map.
Relative synonymous codon usage analysis
RSCU refers to the actual usage frequency of a codon divided by the theoretical usage frequency of the codon. The unique coding sequences (CDS) were filtered and calculated using a Perl script written by ourselves.
KaKs analysis
The mafft v7.310 (Katoh and Standley, 2013) (https://mafft.cbrc.jp/alignment/software/) software was used for gene sequence alignment. The KaKs_Calculator v2.0 (https://sourceforge.net/projects/kakscalculator2/) software was employed to calculate the KaKs value of the gene with the MLWL calculation method.
Lamiaceae mitogenomes constitution analysis
To study the constitution variation of the Lamiaceae mitogenomes, we performed multiple whole genome alignment of the mitogenomes using Mugsy (Angiuoli and Salzberg, 2011). The Lamiaceae mitogenomes were parsed to three different types of fragments including core fragments, specific fragments, and variable fragments from the output of Mugsy using the customized shell scripts. In this study, the core fragments were shared by all 9 Lamiaceae mitogenomes and the variable fragments were shared by a few mitogenomes. And the specific fragments were unique to only one mitogenome. We obtained the size accumulation curves of the pan mitogenome and core mitogenome of 9 Lamiaceae species using PanGP v1.0.1 (Zhao et al., 2014) as well as multiple whole genome alignment analysis by random sampling of up to 500 replicates for each group. The size information of core mitogenome was the accumulation of the sizes of the core fragments of each group and the size information of pan mitogenome was the accumulation of the sizes of specific fragments, variable fragments, and core fragments, of which the sizes of homologous sequences among the core and variable fragments calculated once.
Phylogenetic analysis
To determine the phylogenetic position of MCB, we constructed the phylogenetic tree based on the whole mitogenomes of 41 species. The GenBank accession numbers of all the studied species were listed in Supplementary Table S1. The PhyML V3.0 software (https://www.atgc-montpellier.fr/phyml/) was employed for phylogenetic analysis by maximum likelihood (ML) method, Bayes correction, and 1,000 bootstrap replicates to calculate bootstrap values (Guindon et al., 2010).
In addition, a total of 24 protein-coding genes were selected for phylogenetic analysis. Alignment of conservative genes was performed with MUSCLE (codons), and then Gblocks_0.91b (Gblocks: https://www.plob.org/article/7933.html) was used for extraction of conservative regions. jModelTest was employed to evaluate the optimal nucleotide substitution model of each gene, and the evaluation result of Akaike information criterion (AIC) shall prevail. The gene tree was constructed with PhyML.
Results
MCB mitogenome sequencing and assembly
In this study, the MCB mitogenome was sequenced by Illumina and Nanopore sequencing platforms. Overall, 4,136,459,100 raw data (Q30 = 88.88%) and 3,487,825,086 bp clean data (Q30 = 91.65%) were obtained using the Illumina platform (Supplementary Table S2) (SRA, SRR21196490). Regarding the Nanopore sequencing, a total of 72,273,957 bases were generated and the total reads number was 9,458 (SRA, SRR21196489). The subreads with N50 and N90 were 8,697 bp and 4,741 bp, respectively. The largest length of subreads was 65,388 bp (Supplementary Table S3). Through de novo assembling, the MCB mitogenome sequence was obtained and deposited in GenBank with accession number OP537517.
The complete mitogenome of MCB was 494,599 bp in length (Figure 1). The MCB mitogenome organization and base composition were summarized in Table 1. The total GC content of the MCB mitogenome was 44.21%, while the gene’s GC content and intergenic’s GC content were 42.73% and 44.32%, respectively. The gene total length was 33,166 bp, while the intergenic region length was 461,433 bp.
FIGURE 1. The circular mitochondrial genome of Mesona chinensis Benth (MCB). Genes shown outside of the circle are transcribed clockwise, whereas genes on the inside are transcribed counterclockwise. Genes belonging to different functional groups are color-coded. GC content is represented on the inner circle by the dark gray plot.
The mitogenome of MCB encoded 59 genes containing 37 PCGs, 19 tRNAs, and 3 rRNAs (Table 2; Supplementary Table S4). Among these, three PCGs (cox3, atp9, ccmFn) and two tRNAs (trnK-UUU and trnM-CAU) contained two copies, while the remaining genes had one copy. Three ribosomal subunits (rrn18, rrn5, and rrn26) were encoded with lengths of 1,282 bp, 112 bp, and 2,875 bp, respectively (Supplementary Table S4). Of the PCGs, nad1, nad2, nad4, nad5, nad7, cox1, ccmFc, rps3, and matR contained introns (Supplementary Table S5). nad2 included four introns, while cox1, ccmFc, rps3, and matR had only one intron and the other four genes contained three introns. As for tRNAs, the length of all 19 tRNAs was 1,433 bp (Supplementary Table S4) and the length of each tRNA ranged from 65 bp to 90 bp (Supplementary Table S6).
Gene transfer between the mitogenome and chloroplast genome of MCB
In eukaryotic cells, genetic information transfer and biomass exchanges occurred between subcellular units or organelles (Schuldiner and Guo, 2015). Recently, studies showed that information exchanges and transfers between the mitochondria and chloroplasts existed in plant cells (Zhao et al., 2018). In this study, the phenomenon of gene exchange and the transfer was found between the chloroplast genome and mitogenome of MCB (Figure 2). In total, 11 segments (247,42 bp) transferred from the chloroplast genome to 12 mitochondrial segments (25,427 bp) were found with the 93%–97% sequence similarity, respectively (Supplementary Table S7). The longest and shortest transfer segments were 7,723 bp and 402 bp in length, respectively. Of these, there were two mitochondrial segments (141,923–141,239 bp and 492,819–493,503 bp) were transferred from the chloroplast segment (24,521–25,205 bp) with 97% sequence similarity.
SSR and repeat sequence analysis
A total of 212 SSRs were identified in the MCB mitogenome. Among them, more than half of the SSRs (122, 57.5%) belonged to monomers, while the dimers, trimers, tetramers, and pentamers were detected with lower frequency, accounting for 13.2%, 9.4%, 17.0%, and 2.8%, respectively (Figure 3A). Of the monomers, 112 A/T sequences (91.8%) occupied the main proportion, while G/C was only 10 (8.2%) (Figure 3C). Further, only six pentamers were observed, distributing in the IGS, cob, and, matR regions of the MCB mitogenome. The specific size and location of pentamers were shown in Supplementary Table S8.
FIGURE 3. SSR and repeat analysis in MCB mitogenome. (A), SSR number statistics of different SSR types; (B) Statistics of repeat sequence with different lengths; (C), SSR motif analysis; (D), Statistics of repeat sequence types.
A total of 305 repeat sequences were identified in the MCB mitogenome with a total length of 139,861 bp and the length of each repeat ranged from 30 bp to 48,383 bp (Figure 4). As shown in Figure 3B, the 30–39 bp repeats were the majority type, accounting for 69.2% (211) of the total repeats. In addition, in terms of repeat types, there were 158 forward (F) (total length: 119,084 bp) and 147 palindromic (P) repeats (total length: 20,777 bp) in MCB mitogenome (Figure 3D).
The large repeat sequences (>1 kb) were notable because they were associated with reversible genomic structural changes (Bi et al., 2016). In this study, there were 12 repeats longer than 1 kb with a total length of 127,969 bp. Of these, 4 long palindromic repeats with a length of 15,475 bp and 8 long forward repeats with a length of 112,494 bp were detected, respectively (Supplementary Table S9).
Collinearity analysis of the mitochondrial sequences
To evaluate the degree of structural rearrangement between MCB and the same family species, the mitochondrial genome of MCB was compared with those of Olea europaea (LR743801), dorcoceras hygrometricum (NC_016741), Mimulus guttatus (NC_018041), Ajuga reptans (NC_023103), Salvia miltiorrhiza (NC_023209), hesperelaea palmeri (NC_031323), castilleja paramensis (NC_031806), and rotheca serrata (NC_049064). As shown in Figure 5, when using MCB as a reference genome, the dot-plot analysis showed short stretches (less than 12 kb) of synteny across all species. In addition, there were many sequence rearrangements among these species, which indicated that the sequence arrangement order of the mitochondrial genome was not conservative. However, there were many homologous sequences among the species, especially the gene segments, which showed the conservative gene sequence of the mitochondrial genome (Figure 6).
RSCU analysis of the MCB mitogenome
The codon usage pattern of the MCB mitogenome was summarized (Figure 7). Overall, a total of 9,947 codons encoded the protein-coding genes in the MCB mitogenome. There were 64 different codons in the MCB mitogenome. Serine (909, 9.1%) and leucine (879, 8.8%) were the two most common amino acids, while terminator (32, .3%) was the least abundant amino acid in the MCB mitogenome. In addition, almost all the A/U-ending codons had RSCU values greater than one (RSCU > 1), while the C/G-ending codons possessed RSCU values less than one (RSCU < 1) (Supplementary Table S10).
KaKs analysis
In this study, a total of 32 genes were employed to compute the Ka/Ks ratio among the mitogenomes of Mesona chinensis Benth, Olea europaea, dorcoceras hygrometricum, Mimulus guttatus, Ajuga reptans, Salvia miltiorrhiza, hesperelaea palmeri, castilleja paramensis, and rotheca serrata. The results showed that almost all genes were subject to purification selection, except ccmB. Compared with other genes, ccmB might be subjected to greater positive selection pressure (Figure 8).
Variation in the gene content and constitution in the Lamiaceae mitogenomes
In this study, we carried out a comparative analysis of mitochondrial gene content and genome size of the 9 Lamiaceae mitogenomes. The genome size variability of more than two-fold was observed among Lamiaceae mitogenomes with a range from 352,069 bp in Ajuga reptans to 755,572 bp in Olea europaea and the GC content of the 9 Lamiaceae mitogenomes ranged from 43.27% in Boea hygrometrica to 45.54% in Rotheca serrata (Supplementary Table S11). atpB and atpE were unique to the Rotheca serrata and Salvia miltiorrhiza mitogenomes and the mttB gene loss was unique to the Boea hygrometrica mitogenome (Figure 9A). Moreover, the analysis of pan-mitogenome and mitochondrial constitution showed that the pan-mitogenome curve indicated an open pan-mitogenome (Figure 9B). The specific, core and variable fragments have a large variation in size with ranges from 128,410 bp (∼19.50% of Hesperelaea palmeri mitogenome) to 296,682 bp (∼59.98% of Mesona chinensis mitogenome), 78,279 bp (∼22.23% of Ajuga reptans mitogenome) to 112,771 bp (∼17.12% of Hesperelaea palmeri mitogenome), and 49,949 bp (∼14.19% of Ajuga reptans mitogenome) to 417,418 bp (∼63.39% of Hesperelaea palmeri mitogenome), respectively (Supplementary Table S11; Figure 9C). These specific, core and variable fragments contained stable GC contents of 44.0%, 46.5%, and 44.0% on average, respectively (Supplementary Table S11; Figure 9D). The core fragments of the Lamiaceae mitogenomes harbored a higher GC content than the variable and specific fragments.
FIGURE 9. Variation in the gene content and constitution in Lamiaceae mitogenomes. (A), The gene content of 9 Lamiaceae mitogenomes; (B), Increase and decrease in the genome size of the pan mitogenome (blue) and core mitogenome (green); (C), The size variation of different components in the 9 Lamiaceae mitogenomes; (D), The GC content variation of different components in the 9 Lamiaceae mitogenomes.
Phylogenetic analysis
We used the maximum likelihood (ML) method to construct the phylogenetic tree based on the mitogenomes of 41 plant species. Of these, Nelumbo nucifera was magnoliopsida, Eleusine indica and Tripsacum dactyloides belonged to monocots, and the other 38 higher plants were eudicots (SupplementaryTable S1). As shown in Figure 10, the ML tree was divided into three clades, and the phylogenetic tree strongly supported the separation of the magnoliopsida, monocots, and eudicots plants. MCB belonged to eudicots Clade and stayed closest to Salvia miltiorrhiza. To further explore the utility of the mt genes in phylogenetic reconstruction, a total of 24 mt genes were selected for gene tree analysis. Results showed that many of the 24 gene trees were congruent with the previous reconstruction based on the mitogenomes, meanwhile, several gene trees had topologies inconsistent with the mitochondrial tree (Supplementary Figure S1). For example, in the rps12 tree, Salvia miltiorrhiza was a sister to Castilleja paramensis, which was a sister of MCB in the mitochondria tree. And in the nad7 tree, Salvia miltiorrhiza was a sister to Rotheca serrata, which was clustered as a sister to MCB in the mitochondria tree.
Discussion
In the present study, we used the total genomic DNA to sequence and assembled the complete mitogenome of MCB based on Illumina and Nanopore sequencing data. This would lay a foundation for the whole genome analysis of MCB in the future. Generally, mitogenomes were described as single circular molecules (Kitazaki and Kubo 2010), while many other configurations of plant mitochondrial chromosomes had also been reported, such as highly branched and sigma-like morphologies, diverse linear and circular forms, and multichromosomal structures (Chen et al., 2017). Here, the mitogenome of MCB seemed to be a circular molecule (Figure 1). Circular arrangements appeared to be the common form among mitochondrial genomes assembled thus far in Lamiaceae (Qian, 2014; Zhu et al., 2014; Yu et al., 2022).
Plant mitochondrial genomes may vary enormously in size in different families and genera (Zhang et al., 2020), even within a single plant family (Chen et al., 2017). For example, in the Cucurbitaceae, mitogenomes varied over 7-fold in size, from 379 kb in Citrullus lanatus (Alverson et al., 2010) to 2,740 kb in Cucumis melo (Rodriguez-Moreno et al., 2011). In this study, as the members of the Lamiaceae family, the mitogenomes of Ajuga reptans (352,069 bp, NC_023103.1/KF709392.1), Rotheca serrata (482,114 bp, NC_049064.1/MT075727.1), Salvia miltiorrhiza (499,236 bp, NC_023209.1/KF177345.1), and Mesona chinensis Benth (494,599 bp) also varied enormously. The mitogenome of MCB was about 1.4 times that of Ajuga reptans, which supported the view that the mitogenome size was variable in angiosperm (Skippington et al., 2015). In addition, the mitogenome size of MCB was similar to that of Salvia miltiorrhiza and it might be due to their close genetic relationship.
Plant mitogenomes possessed a characteristic of their inclination to become a genetic “dumping ground” for sequences from both the chloroplast and nuclear genomes. Usually, sequence transfer from chloroplast to mitochondrial genomes typically was unidirectional (Tsunewaki, 2011; Chen et al., 2017). cpDNA-like sequences were detected in many mitogenomes from various species (Alverson et al., 2010; Rodriguez-Moreno et al., 2011). In the grape mitochondrial genome, there were 30 chloroplast fragments with a total length of 68,237 bp, accounting for 8.8% of the whole mitogenome and 42.4% of the whole chloroplast genome (Goremykin et al., 2009). Similarly, 17 chloroplast fragments with a total length of 22,593 bp and with a size range of 32–6,635 bp, account for 6.3% of the rice mitogenome (Notsu et al., 2002). In our investigation, 12 transfer segments were found with a total length of 22,075 bp (about 4.5% of the mitogenome) (Figure 2; Supplementary Table S7), supporting the conclusion that about .1%–10.3% of mtDNA originated from the chloroplast genome (Sloan and Wu, 2014).
Repetitive sequences including SSR, tandem repeats, short repeats, and large repeats were common in plant mitogenomes and about 38% of the mitogenome was obtained by the repeats of copy number and variable size (Mower et al., 2012). The presence of a large number of repeated sequences could promote the occurrence of mitochondrial sequence recombination, resulting in the generation of isomeric genomic sequences in mitochondria and sub-genomic sequences derived from some small circular molecular structures (Fauron et al., 1995). SSRs sequences were widely used in genetic analysis and species identification of individuals and populations due to their strong repeatability and relatively conserved characteristic (Choi and Park, 2015). SSRs were abundant in plants, however, the frequency of SSR in different plants varied greatly. In the mitogenome of G. raimondii, a total of 674 SSRs were identified and the single nucleotide SSR was the most (44.5%) (Bi et al., 2016). In this study, a total of 212 SSRs were detected in the MCB mitogenome, and 112 A/T sequences (52.8%) occupied the main proportion of the SSRs (Figure 3C). It was indicated that these mitochondrial SSRs could lay a foundation for the development of mitochondrial molecular markers, accurate identification and protection of genetic resources, and the evolution of flora in MCB.
Great variations in the genomic structure, gene content, and composition existed in the mitochondrial genomes of angiosperms (Mower et al., 2012). In this study, we could see that the size of the Lamiaceae mitogenomes was highly variable (Supplementary Table S11). The variation in mitogenome size could be explained by the differences in intergenic regions to a large extent (Mower et al., 2012) rather than the variable gene content (Figure 9A). Meanwhile, the differences in mitogenome size could also be attributed to the presence of foreign fragments and sizeable repetitive sequences (Park et al., 2015). Moreover, the relatively large mitogenomes (Hesperelaea palmeri and Olea europaea) always had larger variable fragments (H. palmeri 417,418 bp, 63.39% of the mitogenome and O. Europaea 409,030 bp, 54.14% of the mitogenome) (Supplemetary Table S11; Figure 9C). The differences in variable fragments could be explained by the genetic escape from the common ancestor of two or more species (Wang et al., 2018). In addition, the variable fragments showed relatively variable GC content and percent and low GC content among the Lamiaceae mitogenomes, indicating that these fragments might contain some foreign genetic materials with low GC content.
The characterization of the mitogenome of MCB stimulated a reassessment of the systematic relationships of MCB using the mt genomic/proteomic datasets. Given the demonstrated utility of mt proteomic datasets, high phylogenetic signal, and strong statistical support in trees (Jex et al., 2010; Park et al., 2011), there was a chance to test the phylogenetic relationships of MCB using the expanded mt datasets. In this study, the phylogenetic analysis showed that MCB stayed closest to Salvia miltiorrhiza (Figure 10). This result seemed to confirm the conclusion that the two genomes were similar in size.
Conclusion
The MCB mitogenome was 494,599 bp in length and encoded 59 genes including 37 PCGs, 19 tRNAs, and 3 rRNAs. A total of 12 transfer segments and 212 SSR and 305 repeats were identified in the MCB mitogenome. In total, 9,947 codons were encoding the protein-coding genes in the MCB mitogenome. Serine and leucine were the two most abundant amino acids, while terminator was the least abundant amino acid. Almost all genes were subject to purification selection, except ccmB. atpB and atpE were unique to the Rotheca serrata and Salvia miltiorrhiza mitogenomes. mttB gene loss was unique to the Boea hygrometrica mitogenome. The core fragments of the Lamiaceae mitogenomes harbored a higher GC content than the variable and specific fragments. Phylogenetic analysis showed that MCB was closely related to Salvia miltiorrhiza. The current study provided valuable genomic resources for understanding and utilizing this important medicinal plant in the future.
Data availability statement
The MCB mitogenome sequence has been deposited in GenBank with accession numbers SRR21196489, SRR21196490, and OP537517.
Author contributions
DT and FW conceived and designed the experiment. FW, CQ, YH, and SH helped to carry out the experimental and laboratory analysis. DT analyzed the data and prepared and revised the manuscript. JM provided funding support. All authors agreed on the final version.
Funding
This study was supported by the National Natural Science Foundation of China (82260749; 82260750), Fund Projects of the Central Government in Guidance of Local Science and Technology Development (GuiKeZY22096020), Guangxi Innovation-Driven Development Project (GuiKeAA18242040), and the Scientific Research Funding Project of Guangxi Botanical Garden of Medicinal Plants (GuiYaoJi202011).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.1056389/full#supplementary-material
References
Allen, G. C., Flores-Vergara, M. A., Krasynanski, S., Kumar, S., and Thompson, W. F. (2006). A modified protocol for rapid DNA isolation from plant tissues using cetyltrimethylammonium bromide. Nat. Protoc. 1, 2320–2325. doi:10.1038/nprot.2006.384
Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990). Basic local alignment search tool. J. Mol. Biol. 215, 403–410. doi:10.1016/S0022-2836(05)80360-2
Alverson, A. J., Wei, X., Rice, D. W., Stern, D. B., Barry, K., and Palmer, J. D. (2010). Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Mol. Biol. Evol. 27, 1436–1448. doi:10.1093/molbev/msq029
Angiuoli, S. V., and Salzberg, S. L. (2011). Mugsy: Fast multiple alignment of closely related whole genomes. Bioinformatics 27, 334–342. doi:10.1093/bioinformatics/btq665
Antipov, D., Korobeynikov, A., McLean, J. S., and Pevzner, P. A. (2016). hybridSPAdes: An algorithm for hybrid assembly of short and long reads. Bioinformatics 32 (7), 1009–1015. doi:10.1093/bioinformatics/btv688
Backert, S., Nielsen, B. L., and Börner, T. (1997). The mystery of the rings: Structure and replication of mitochondrial genomes from higher plants. Trends Plant Sci. 2, 477–483. doi:10.1016/s1360-1385(97)01148-5
Beier, S., Thiel, T., Münch, T., Scholz, U., and Mascher, M. (2017). MISA-Web: A web server for microsatellite prediction. Bioinformatics 33, 2583–2585. doi:10.1093/bioinformatics/btx198
Bi, C., Lu, N., Xu, Y., He, C., and Lu, Z. (2020). Characterization and analysis of the mitochondrial genome of common bean (Phaseolus vulgaris) by comparative genomic approaches. Int. J. Mol. Sci. 21, 3778. doi:10.3390/ijms21113778
Bi, C., Paterson, A. H., Wang, X., Xu, Y., Wu, D., Qu, Y., et al. (2016). Analysis of the complete mitochondrial genome sequence of the diploid cotton Gossypium raimondii by comparative genomics approaches. Biomed. Res. Int. 2016, 5040598. doi:10.1155/2016/5040598
Chen, Z. W., Zhao, N., Li, S. S., Grover, C. E., Nie, H. S., Wendel, J. F., et al. (2017). Plant mitochondrial genome evolution and cytoplasmic male sterility. Crit. Rev. Plant Sci. 36, 55–69. doi:10.1080/07352689.2017.1327762
Choi, K. S., and Park, S. (2015). The complete chloroplast genome sequence of Aster spathulifolius (Asteraceae); genomic features and relationship with Asteraceae. Gene 572, 214–221. doi:10.1016/j.gene.2015.07.020
Fang, J. P., Lin, A. T., Yuan, X., Chen, Y. Q., He, W. J., Huang, J. H., et al. (2020). The complete chloroplast genome of Isochrysis galbana and comparison with related haptophyte species. Algal Res. 50, 101989. doi:10.1016/j.algal.2020.101989
Fauron, C., Casper, M., Gao, Y., and Moore, B. (1995). The maize mitochondrial genome: Dynamic, yet functional. Trends Genet. 11, 228–235. doi:10.1016/s0168-9525(00)89056-3
Férandon, C., Xu, J. P., and Barroso, G. (2013). The 135k bp mitochondrial genome of Agaricus bisporus is the largest known eukaryotic reservoir of group I introns and plasmid-related sequences. Fungal Genet. Biol. 55, 85–91. doi:10.1016/j.fgb.2013.01.009
Goremykin, V. V., Salamini, F., Velasco, R., and Viola, R. (2009). Mitochondrial DNA of Vitis vinifera and the issue of rampant horizontal gene transfer. Mol. Biol. Evol. 26, 99–110. doi:10.1093/molbev/msn226
Gray, M. W. (2012). Mitochondrial evolution. Cold Spring Harb. Perspect. Biol. 4, a011403. doi:10.1101/cshperspect.a011403
Greiner, S., Lehwark, P., and Bock, R. (2019). OrganellarGenomeDRAW (OGDRAW) version 1.3.1: Expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 47, W59–W64. doi:10.1093/nar/gkz238
Guindon, S., Dufayard, J. F., Lefort, V., Anisimova, M., Hordijk, W., and Gascuel, O. (2010). New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321. doi:10.1093/sysbio/syq010
Huang, L. X., Huang, M., Shen, M. Y., Wen, P. W., Wu, T., Hong, Y. Z., et al. (2019). Sulfated modification enhanced the antioxidant activity of Mesona chinensis Benth polysaccharide and its protective effect on cellular oxidative stress. Int. J. Biol. Macromol. 136, 1000–1006. doi:10.1016/j.ijbiomac.2019.06.199
Jex, A. R., Hall, R. S., Littlewood, D. T., and Gasser, R. B. (2010). An integrated pipeline for next-generation sequencing and annotation of mitochondrial genomes. Nucleic Acids Res. 38, 522–533. doi:10.1093/nar/gkp883
Jin, J. J., Yu, W. B., Yang, J. B., Song, Y., dePamphilis, C. W., Yi, T. S., et al. (2020). GetOrganelle: A fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21 (1), 241. doi:10.1186/s13059-020-02154-5
Jo, Y. D., Park, J., Kim, J., Song, W., Hur, C. G., Lee, Y. H., et al. (2011). Complete sequencing and comparative analyses of the pepper (Capsicum annuum L.) plastome revealed high frequency of tandem repeats and large insertion/deletions on pepper plastome. Plant Cell Rep. 30 (2), 217–229. doi:10.1007/s00299-010-0929-2
Katoh, K., and Standley, D. M. (2013). MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30 (4), 772–780. doi:10.1093/molbev/mst010
Kitazaki, K., and Kubo, T. (2010). Cost of having the largest mitochondrial genome: Evolutionary mechanism of plant mitochondrial genome. J. Bot. 2010, 1–12. doi:10.1155/2010/620137
Koren, B. P., Walenz, B. P., Berlin, K., Miller, J. R., Bergman, N. H., and Phillippy, A. M. (2017). Canu: Scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27 (5), 722–736. doi:10.1101/gr.215087.116
Kurtz, S., Choudhuri, J. V., Ohlebusch, E., Schleiermacher, C., Stoye, J., and Giegerich, R. (2001). REPuter: The manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 29 (22), 4633–4642. doi:10.1093/nar/29.22.4633
Li, B., Cantino, P. D., Olmsead, R. G., Bramley, G. L. C., Xiang, C. H. L., Ma, Z. H., et al. (2016). A large-scale chloroplast phylogeny of the Lamiaceae sheds new light on its subfamilial classification. Sci. Rep. 6, 34343. doi:10.1038/srep34343
Li, S. N., Yang, Y. Y., Xu, L., Xing, Y. P., Zhao, R., Ao, W. L., et al. (2021). The complete mitochondrial genome of Aconitum kusnezoffii Rchb. (Ranales, Ranunculaceae). Mitochondrial DNA Part B 6 (3), 779–781. doi:10.1080/23802359.2021.1882894
Li, Y. Q., Zhao, H. K., Tan, H., Liu, X. D., Zhang, C. B., and Dong, Y. S. (2011). Analysis and comparison on characteristic of mitochondrial genome of eight plants. Biotechnol. Bull. 10, 156–162.
Lin, L. H., Shen, M. Y., Liu, S. C., Tang, W., Wang, Z. J., Xie, M. Y., et al. (2018). An acidic heteropolysaccharide from Mesona chinensis: Rheological properties, gelling behavior and texture characteristics. Int. J. Biol. Macromol. 107, 1591–1598. doi:10.1016/j.ijbiomac.2017.10.029
Mower, J. P., Sloan, D., and Alverson, A. (2012). “Plant mitochondrial genome diversity: The genomics revolution,” in Plant genome diversity volume 1. Editors J. F. Wendel, J. Greilhuber, J. Dolezel, and I. J. Leitch (Vienna: Springer), 123–144.
Notsu, Y., Masood, S., Nishikawa, T., KuboN., , Akiduki, G., NakazonoM., , et al. (2002). The complete sequence of the rice (Oryza sativa L) mitochondrial genome: Frequent DNA sequence acquisition and loss during the evolution of flowering plants. Mol.Genet. Genomics. 268, 434–445. doi:10.1007/s00438-002-0767-1
Park, J. K., Sultana, T., Lee, S. H., Kang, S., Kim, H. K., Min, G. S., et al. (2011). Monophyly of clade III nematodes is not supported by phylogenetic analysis of complete mitochondrial genome sequences. BMC Genomics 12, 392. doi:10.1186/1471-2164-12-392
Park, S., Grewe, F., Zhu, A., Ruhlman, T. A., Sabir, J., Mower, J. P., et al. (2015). Dynamic evolution of Geranium mitochondrial genomes through multiple horizontal and intracellular gene transfers. New Phytol. 208, 570–583. doi:10.1111/nph.13467
Petersen, G., Cuenca, A., Moller, I. M., and Seberg, O. (2015). Massive gene loss in mistletoe (Viscum, Viscaceae) mitochondria. Sci. Rep. 5, 17588. doi:10.1038/srep17588
Qian, J. (2014). Study on chloroplast and mitochondrial genomes of Salvia miltiorrhiza. Beijing, China: Chinese Academy of Medical Sciences & Peking Union Medical College.
Ren, Y. M., Jiang, L., Wang, W. J., Xiao, Y. H., Liu, S. C., Luo, Y., et al. (2019). Effects of Mesona chinensis Benth polysaccharide on physicochemical and rheological properties of sweet potato starch and its interactions. Food Hydrocoll. 99, 105371. doi:10.1016/j.foodhyd.2019.105371
Rodriguez-Moreno, L., Gonzalez, V. M., Benjak, A., Marti, M. C., Puigdomenech, P., Aranda, M. A., et al. (2011). Determination of the melon chloroplast and mitochondrial genome sequences reveals that the largest reported mitochondrial genome in plants contains a significant amount of DNA having a nuclear origin. BMC Genomics 12, 424. doi:10.1186/1471-2164-12-424
Schuldiner, M., and Guo, W. (2015). Editorial overview: Cell organelles: Organelle communication: New means and new views. Curr. Opin. Cell Biol. 35, v–vi. doi:10.1016/j.ceb.2015.07.008
Skippington, E., Barkman, T. J., Rice, D. W., and Palmer, J. D. (2015). Miniaturized mitogenome of the parasitic plant Viscum scurruloideum is extremely divergent and dynamic and has lost all nad genes. Proc. Natl. Acad. Sci. U. S. A. 112, E3515–E3524. doi:10.1073/pnas.1504491112
Sloan, D. B., Alverson, A. J., Chuckalovcak, J. P., Wu, M., McCauley, D. E., Palmer, J. D., et al. (2012). Rapid evolution of enormous, multichromosomal genomes in flowering plant mitochondria with exceptionally high mutation rates. PLoS Biol. 10, e1001241. doi:10.1371/journal.pbio.1001241
Sloan, D. B., and Wu, Z. (2014). History of plastid DNA insertions reveals weak deletion and at mutation biases in angiosperm mitochondrial genomes. Genome Biol. Evol. 6, 3210–3221. doi:10.1093/gbe/evu253
Song, X. X., Zhao, Y., Song, C. Y., Chen, M. J., Huang, J. C., Bao, D. P., et al. (2019). Mitogenome types of two Lentinula edodes sensu lato populations in China. Sci. Rep. 9, 9421. doi:10.1038/s41598-019-45922-5
Tang, D. F., Wei, F., Xie, J. H., Zhu, Y. X., Wei, K. H., and Miao, J. H. (2022a). Key technologies and problems in conservation of Mesona chinensis Benth in Guangxi. Mod. Chin. Med. 24 (3), 407–411.
Tang, D. F., Lin, Y., Wei, F., Quan, C. Q., Wei, K. H., Wei, Y. Y., et al. (2022b). Characteristics and comparative analysis of Mesona chinensis Benth chloroplast genome reveals DNA barcode regions for species identification. Funct. Integr. Genomics 22, 467–479. doi:10.1007/s10142-022-00846-8
Tang, D. F., Wei, F., Cai, Z. Q., Wei, Y. Y., Khan, A., Miao, J. H., et al. (2020). Analysis of codon usage bias and evolution in the chloroplast genome of Mesona chinensis Benth. Dev. Genes Evol. 231 (1-2), 1–9. doi:10.1007/s00427-020-00670-9
Tillich, M., Lehwark, P., Pellizzer, T., Ulbricht-Jones, E. S., Fischer, A., Bock, R., et al. (2017). GeSeq-versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 45, W6–W11. doi:10.1093/nar/gkx391
Tsunewaki, K. (2011). Interorganellar DNA transfer in wheat: Dynamics and phylogenetic origin. Proc. Jpn. Acad. Ser. B. Phys. Biol. Sci. 87, 529–549. doi:10.2183/pjab.87.529
Wang, S. B., Song, Q. W., Li, S. S., Hu, Z. Q., Dong, G. Q., Song, C., et al. (2018). Assembly of a complete mitogenome of Chrysanthemum nankingense using oxford nanopore long reads and the diversity and evolution of Asteraceae mitogenomes. Genes 9 (11), 547. doi:10.3390/genes9110547
Wang, Y., Zeng, F. Y., Hon, C. C., Zhang, Y. Z., and Leung, F. C. C. (2008). The mitochondrial genome of the Basidiomycete fungus Pleurotus ostreatus (Oyster mushroom). FEMS Microbiol. Lett. 280, 34–41. doi:10.1111/j.1574-6968.2007.01048.x
Yu, X. L., Wang, Y., Zhang, Q., Duan, Z. G., Wang, Y. J., Zhang, Q. X., et al. (2022). Sequence analysis of the complete mitochondrial genome of a medicinal plant, Vitex rotundifolia Linnaeus f. (Lamiales: Lamiaceae). Genes (Basel) 13 (5), 839. doi:10.3390/genes13050839
Yuan, S., Xia, Y., Zheng, Y., and Zeng, X. (2016). Next-generation sequencing of mixed genomic DNA allows efcient assembly of rearranged mitochondrial genomes in Amolops chunganensis and Quasipaa boulengeri. Peer J. 4, e2786. doi:10.7717/peerj.2786
Zhang, D. C., Xing, Y. P., Xu, L., Zhao, R., Yang, Y. Y., Zhang, T. T., et al. (2020). Comparative analysis of the mitochondrial genome sequences of two medicinal plants: Arctium Lappa and A. Tomentosum. doi:10.21203/rs.3.rs-28443/v1
Zhao, Y., Jia, X., Yang, J., Ling, Y., Zhang, Z., Yu, J., et al. (2014). PanGP: A tool for quickly analyzing bacterial pan-genome profile. Bioinformatics 30, 1297–1299. doi:10.1093/bioinformatics/btu017
Zhao, Y. N., Luo, L. L., Xu, J. S., Xin, P. Y., Guo, H. Y., Wu, J., et al. (2018). Malate transported from chloroplast to mitochondrion triggers production of ROS and PCD in Arabidopsis thaliana. Cell Res. 28, 448–461. doi:10.1038/s41422-018-0024-8
Zhong, Y., Zhou, M., Ouyang, B., Zeng, C. J., Zhang, M., and Yang, J. D. (2020). Complete mtDNA genome of Otus sunia (Aves, Strigidae) and the relaxation of selective constrains on Strigiformes mtDNA following evolution. Genomics 112, 3815–3825. doi:10.1016/j.ygeno.2020.02.018
Keywords: Mesona chinensis Benth, mitogenome, characteristics, RSCU, KaKs, constitution analysis, phylogenetic analysis
Citation: Tang D, Huang S, Quan C, Huang Y, Miao J and Wei F (2023) Mitochondrial genome characteristics and phylogenetic analysis of the medicinal and edible plant Mesona chinensis Benth. Front. Genet. 13:1056389. doi: 10.3389/fgene.2022.1056389
Received: 28 September 2022; Accepted: 15 December 2022;
Published: 12 January 2023.
Edited by:
Changwei Bi, Nanjing Forestry University, ChinaReviewed by:
Yanshu Qu, Nanjing Forestry University, ChinaXu Zhenggang, Northwest A&F University, China
Copyright © 2023 Tang, Huang, Quan, Huang, Miao and Wei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Fan Wei, d2ZtYW51c2NyaXB0QDE2My5jb20=; Jianhua Miao, bWpobWFudXNjcmlwdEAxNjMuY29t
†These authors have contributed equally to this work