- 1Chinese Medicine Research Institute of Beijing Tcmages Pharmaceutical Co., Ltd., Beijing, China
- 2Nanjing Institute of Environmental Sciences, Ministry of Ecology and Environment of the People’s Republic of China, Nanjing, China
- 3College of Life Sciences, South China Agricultural University, Guangzhou, China
Introduction: Spuriopimpinella brachycarpa (Kom.) Kitag., a member of the Apiaceae family, is a perennial aromatic herb native to Northeast Asia with applications in culinary and traditional medicine. Despite its significance, most studies on S. brachycarpa have primarily focused on its phytochemical properties, with limited insights into its molecular and genomic characteristics.
Methods: This study presents the sequencing and assembly of the mitochondrial genome (mitogenome) of S. brachycarpa using second- and third-generation high-throughput sequencing technologies. Comprehensive analyses were performed on its structural organization, RNA editing sites, relative synonymous codon usage (RSCU), and repeat sequences. Comparative analyses with closely related species were also conducted.
Results: The mitogenome exhibited a multi-branched structure, with a total length of 523,512 bp and a GC content of 43.37%. Annotation revealed 30 unique protein-coding genes, 21 tRNA genes, and three rRNA genes. Comparative analysis indicated that the S. brachycarpa mitogenome contains structural variations but shares collinear features with other Apiaceae species. We identified 618 potential RNA editing sites involving C-to-U conversions and discovered 59 homologous fragments between the mitogenome and plastome, comprising 8.13% of the mitogenome.
Discussion: These results enrich the genomic database of Apiaceae, providing valuable insights into the evolutionary relationships and genetic diversity within the family.
1 Introduction
The Apiaceae (syn. Umbelliferae) family, known for its abundant species and remarkable morphological diversity, consists of 466 recognized genera and roughly 3,800 species (Plunkett et al., 2018). Members of this family range from small herbs to trees and are predominantly found in temperate regions, with a significant presence in Central Asia (Pimenov and Leonov, 1993; Sheh et al., 2005). Additionally, Apiaceae holds significant economic value, serving various medicinal, culinary, and spice purposes (Clarkson et al., 2021), including well-known species such as carrots (Daucus carota), coriander (Coriandrum sativum), and cumin (Cuminum cyminum). The genus Spuriopimpinella was first established by Kitagawa (Kitagawa, 1941), having initially been part of Pimpinella, one of the largest genera in the subfamily Apioideae (Boissieu, 1906; Pimenov and Leonov, 1993). Based on molecular and morphological evidence, Spuriopimpinella has been described as an independent lineage distinct from Pimpinella and is now widely accepted (Downie et al., 2010; Wang et al., 2013). Spuriopimpinella is a small genus, with six accepted species, mainly distributed across East Asia, particularly in China, the Korean Peninsula, and Japan (Govaerts et al., 2021). Spuriopimpinella brachycarpa (Kom.) Kitag., a perennial aromatic herb, is native to Northeast Asia. Based on morphological and cytological similarities, Wang et al. reinstated S. brachycarpa from Pimpinella, supported by molecular evidence from ITS and plastid intron sequences. Molecular phylogenetic analyses also led to the reclassification of Pimpinella arguta into Spuriopimpinella, implying that molecular evidence plays an important role in taxonomic research of this genus (Wang et al., 2013). Additionally, S. brachycarpa holds economic significance, serving as both a culinary herb and a traditional medicine. For instance, the plant contains flavonoids, alkaloids, and phenolic compounds that can be used to treat colds, coughs, indigestion, and abdominal pain (Wu et al., 2023). The leaves and stems of S. brachycarpa are consumed as vegetables and seasonings (Zheng, 2016). Despite its economic and medicinal significance, most research on S. brachycarpa has been limited to analyzing its phytochemical properties with little exploration at the molecular level, particularly in the genome.
Mitochondria are thought to have originated from a primordial endosymbiotic event and now play a vital role in plant cells by acting as the primary sites for aerobic respiration (Margulis, 1970; Gray et al., 1999; Wallace, 1999; Dyall et al., 2004; Poole and Penny, 2007; Koonin, 2010). Mitochondria facilitate the oxidation of saccharides, fats, and amino acids to release energy necessary for cellular activities (Nicholls and Budd, 2000; Houten and Auwerx, 2004). In addition to energy production, mitochondria also contribute to cell differentiation, signal transduction, apoptosis, growth, and cell cycle regulation (Scorrano et al., 2002; Mannella, 2006). Plant mitogenomes are notably diverse, reflecting lineage-specific evolutionary processes, and differ significantly from their animal counterparts in terms of size and structure (Chen et al., 2017). Although the mitochondrial genome (mitogenome) is typically represented as a circular double-stranded structure, it can also include multiple independent chromosomes and linear or multi-branched structures (Gualberto and Newton, 2017; Kozik et al., 2019). Generally, plant mitochondria tend to incorporate foreign DNA, leading to a large number of repeats that often serve as sites of genomic recombination, thereby contributing to their structural complexity (Jiang et al., 2023; Li et al., 2024). Compared to plastomes, the structural complexity of mitogenomes poses significant challenges for sequencing and assembly (Arrieta-Montiel and Mackenzie, 2011; Štorchová and Krüger, 2024). Recent advancements in long-read sequencing technologies, such as PacBio and Oxford Nanopore, have made the accurate assembly of complex plant mitogenomes feasible, overcoming limitations posed by traditional short-read sequencing (Karin et al., 2023). These technological improvements have enhanced our understanding of the structural intricacies and functions of plant mitogenomes. Despite the growing body of research on plant mitogenomes, studies remain limited in certain plant families, including the Apiaceae family. To date, the NCBI database contains approximately 2,500 mitogenomes, but only 17 are from the Apiaceae family. This reveals a significant gap in genomic data, highlighting the need for further research in this area.
In this study, we sequenced the mitogenome of S. brachycarpa and revealed that it is composed of five circular chromosomes. We then conducted comprehensive analyses of its structure, RNA editing sites, relative synonymous codon usage (RSCU), and repeats and compared these features with those of closely related species. Examining these features, we aimed to reveal specific evolutionary patterns and structural variations that contribute to the unique properties of S. brachycarpa. Our study enhances the understanding of genetic diversity and evolutionary dynamics within the Spuriopimpinella genus and the broader Apiaceae family. Moreover, the findings offer valuable insights into the potential functional significance of S. brachycarpa’s mitogenome, particularly concerning its medicinal and economic applications.
2 Materials and methods
2.1 Sampling, DNA & RNA extraction, and sequencing
Samples of S. brachycarpa were collected from Liaoning Province, China (42.525°N, 124.148°E; Supplementary Figure S1). These samples were deposited at the Chinese Medicine Research Institute of Beijing Tcmages Pharmaceutical Co., Ltd. (Beijing, China) with voucher specimen SB01. Total DNA was extracted using a modified CTAB method, and RNA was extracted using a BioTeke RNA extraction kit (Raimundo et al., 2018). The high-quality extracted DNA and RNA samples were then sent to Wuhan Benagene Technology Co., Ltd. for Illumina and Oxford Nanopore Technologies (ONT) genome sequencing.
2.2 Genome assembly and annotation
The mitogenome contigs of S. brachycarpa were assembled with Flye using long-read sequences (Kolmogorov et al., 2019). Long- and short-read sequences were aligned to the contigs using BWA (Li and Durbin, 2009). Unicycler was used to assemble the aligned reads into a complete mitogenome (Wick et al., 2017), which was visualized and exported with Bandage (Wick et al., 2015). The mitogenome was annotated for protein-coding genes, tRNAs, and rRNAs using IPMGA, tRNAscan-SE, and BLASTn, respectively (Lowe and Eddy, 1997; Johnson et al., 2008). After manual correction in Apollo, the annotation files were submitted to NCBI with accession numbers PQ273107 to PQ273111 (Dunn et al., 2019). Additionally, the plastome of S. brachycarpa was assembled and annotated using GetOrganelle and CPGAVAS2 (Shi et al., 2019; Jin et al., 2020), with the corrected annotation uploaded to NCBI under accession number PQ213365.
2.3 Intraspecific mitogenome analysis
The relative codon usage of protein-coding sequences in the mitogenome was extracted and analyzed using CPStools (Huang et al., 2024a). Additionally, simple sequence repeats (SSRs) in the mitogenome were identified using CPStools with specific parameters: a minimum of 10 repeats for mononucleotides, 5 for dinucleotides, 4 for trinucleotides, and 3 for tetranucleotides, pentanucleotides, and hexanucleotides (Huang et al., 2024a). Tandem and long sequence repeats (LSRs) were detected using TRF and REPuter, with TRF configured for match, mismatch, and indel weights of 2, 7, and 7, detection parameters including a matching probability of 80% and an indel probability of 10%, a minimum alignment score of 50, and a maximum period size of 500, while REPuter was set with a Hamming distance of 3 and a minimum repeat size of 30 bp (Benson, 1999; Kurtz et al., 2001). After quality control and adapter sequence removal, transcriptome data were aligned to the assembled mitogenome, and RNA editing sites were detected with Bcftools, applying a filter to exclude variants with QUAL < 20 and depth < 10 (Danecek and McCarthy, 2017; Ou et al., 2024).
2.4 Comparative mitogenome analysis
Homologous fragments of mitochondrial plastid DNA segments (MTPTs) between the mitogenome and plastome were compared using BLASTn to identify regions of similarity and potential horizontal gene transfer events, with an e-value threshold of 1e-5 and a similarity of at least 70%. The results were then visualized using Circos (Krzywinski et al., 2009). Mitogenome sequences from four closely related species were downloaded from NCBI, and conserved homologous sequences over 500 bp across these species were identified using BLASTn. These conserved collinear blocks were then visualized by MCScanX (Wang et al., 2012). RNA editing sites in related species without transcriptome support were predicted using DeepRed-Mt (Edera et al., 2021). Common protein-coding genes were extracted with CPStools and aligned using MAFFT (Katoh and Standley, 2013). Phylogenetic trees were constructed with RAxML using the maximum likelihood method and 1000 bootstrap replicates (Stamatakis, 2014), with two Aquifoliales species (Ilex pubescens and Ilex metabaptista) as the outgroup.
3 Results
3.1 Genome assembly and annotation
To assemble the genome, 10.6 GB of ONT clean reads and 13.4 GB of Illumina clean reads were used. The mitogenome of S. brachycarpa was found to be multi-chromosomal. After excluding repeated regions from the ONT sequences, five main circular chromosomes were obtained, with a total length of 523,512 bp and a GC content of 43.37% (Figure 1; Supplementary Table S1). The lengths of the five chromosomes were 217,371 bp for chromosome 1, 124,759 bp for chromosome 2, 116,895 bp for chromosome 3, 46,425 bp for chromosome 4, and 18,062 bp for chromosome 5 (Table 1). The genome was annotated, revealing 30 unique protein-coding genes, including 24 core and six non-core genes, 21 tRNA genes (seven with multiple copies), and three rRNA genes (Table 2). The plastome of S. brachycarpa showed a typical tetrad structure, totaling 158,449 bp, with a GC content of 37.67% (Supplementary Figure S2). The large single-copy region, small single-copy region, and inverted repeats regions were 88,249 bp, 17,688 bp, and 26,256 bp, respectively, and a total of 133 genes were annotated, including 88 protein-coding genes, 37 tRNA genes, and eight rRNA genes (Supplementary Table S2).
Figure 1. The mitogenome map of S. brachycarpa. Chromosomes 1-5 are indicated by five different contigs. The arrows show transcriptional direction of the mtDNA. Genes with different functions were represented using different colors.
3.2 RNA editing
In the mitogenome of S. brachycarpa, RNA editing was identified across 30 unique protein-coding genes (Figure 2A). A total of 618 potential C-to-U RNA editing sites were detected, all of which were C-to-U conversions. The nad4 gene exhibited the highest number of edits, with 49 sites, followed closely by mttB, which had 46 sites (Supplementary Table S3). Notably, RNA editing was observed at six sites within start codons (cox1 and nad4L) and stop codons (atp9, matR, atp6, and ccmFC). The majority of the RNA editing sites were located at the first and second codon positions, with 95.63% resulting in amino acid changes. Second-position edits were particularly frequent, occurring at 62.23%. The amino acid changes showed a strong bias towards specific codon edits. For example, 123 amino acids were altered from proline (Pro) to leucine (Leu), accounting for 19.87% of the total RNA editing events (Figure 2B). In addition, RNA editing sites were also predicted in four related species, with 416 to 540 sites identified (Supplementary Table S3). Similar to S. brachycarpa, the two most frequent types of conversions in these species were from Pro to Leu and from serine (Ser) to Leu, representing 38.29% to 43.88% of all edits.
Figure 2. RNA editing events in S. brachycarpa. (A), Number of RNA editing sites in each gene, (B), Number of amino acid changes before and after RNA editing.
3.3 Codon preference analysis
In the complete mitogenome of S. brachycarpa, 7,679 codons from 30 unique protein-coding genes were extracted and analyzed for codon preference. Sixty-four codons encoding 21 amino acids were identified. Of these, 28 codons had RSCU values greater than 1, indicating a higher preference in the mitogenome of S. brachycarpa (Figure 3; Supplementary Table S4). The codon GCU (encoding alanine) showed a strong preference with an RSCU value of 1.59. Codon usage was also analyzed in four related species: Cuminum cyminum (12,192 codons), Daucus carota subsp. sativus (10,187 codons), Oenanthe linearis (8,870 codons), and Oenanthe thomsonii (8,730 codons) (Supplementary Table S4). The most preferred codon in O. linearis and O. thomsonii matched that of S. brachycarpa, while UAA (a stop codon) was the most preferred in C. cyminum and D. carota subsp. sativus.
Figure 3. Codon usage bias of mitochondrial protein-coding genes in S. brachycarpa. The X-axis represents 21 amino acids, while the Y-axis shows the codons for each amino acid and their corresponding RSCU values.
3.4 Repeat analysis
In the S. brachycarpa mitogenome, a total of 49, 34, 23, 12, and four SSRs were identified on chromosomes 1 to 5, respectively. These SSRs exhibit varying distributions across the chromosomes, with chromosome 1 containing the highest number and chromosome 5 the least. In addition to SSRs, tandem repeats were also found in similar patterns, with counts of 50, 31, 31, 11, and 0 on chromosomes 1 to 5, respectively (Figure 4A). LSRs were detected in all five chromosomes, showing diverse repeat types and counts. Specifically, chromosome 1 exhibited 1,094 LSR pairs, including 484 palindromic, 609 direct, and one inverted repeat. Chromosome 2 had 1,076 LSR pairs, comprising 509 palindromic and 567 direct repeats. Chromosome 3 showed 919 LSR pairs, with 493 palindromic and 426 direct repeats. Chromosome 4 contained 49 LSR pairs, made up of 16 palindromic, 32 direct, and one inverted repeat, while chromosome 5 had 17 LSR pairs, with three palindromic and 14 direct repeats (Figure 4B). No complementary repeats were detected in any of the chromosomes, highlighting the absence of this repeat type in the S. brachycarpa mitogenome.
Figure 4. The distribution of SSRs, LSRs, and tandem repeats across the five chromosomes. (A), The number of SSRs in each of the five chromosomes. (B), The number of LSRs and tandem repeats in each of the five chromosomes.
3.5 Plastome-derived mitogenomic sequence
A total of 59 homologous fragments were identified between the mitogenome and plastome of S. brachycarpa, covering a combined length of 42,553 bp and representing 8.13% of the total mitogenome length (Figure 5). The homologous fragments varied in length, with the longest fragment, MTPT15, measuring 10,520 bp, while the shortest fragments, MTPT36 and MTPT37, were each 34 bp. Among these homologous sequences, 20 complete genes were detected, including 11 protein-coding genes and nine tRNA genes. The identified protein-coding genes comprised atpB, petG, petL, psbC, psbD, rpl2, rpl23, rpoB, rps7, ycf2, and ycf15, while the tRNA genes included trnD-GUC, trnE-UUC, trnH-GUG, trnI-CAU, trnL-CAA, trnN-GUU, trnT-GGU, trnW-CCA, and trnY-GUA.
Figure 5. Schematic representation of MTPT (mitochondrial plastid DNA) in S. brachycarpa. The comparison illustrates the MTPT sequences from the mitogenome (light blue) and the plastome (light green). Arcs indicate the connections between corresponding sequence fragments between the genomes.
3.6 Phylogenetic analysis
Mitogenome data for species closely related to S. brachycarpa is limited. Within the Apiales order, 19 mitogenomes had been completely sequenced and deposited in NCBI, including 15 species from the Apiaceae family and four from the Araliaceae family. To better understand the phylogenetic relationships of S. brachycarpa, we expanded the analysis by incorporating mitogenome sequences from 33 species across the Apiales, Asterales, Dipsacales, and Aquifoliales orders, in which included all previously published sequences from the Apiales. After removing ambiguously aligned regions, a total of 30 shared protein-coding genes were aligned, totaling 25,820 bp in length, comprising 734 parsimony-informative sites (Supplementary Table S5). The phylogenetic topology revealed four monophyletic lineages at the order level with strong support values (BS > 98), of which Aguifoliales was the outgrop (Figure 6). Within Apiales, the topology strongly supported the reciprocally monophyletic clade of the two family arrangement, namely Apiaceae and Araliaceae (BS = 100). Moreover, S. brachycarpa was closely related to the clade that comprised O. linearis and O. thomsonii from Apiaceae, with moderate support (BS = 87).
3.7 Collinearity analysis
Using the mitogenome of S. brachycarpa as a reference, we analyzed the collinear relationships among C. cyminum, D. carota subsp. sativus, O. linearis, and O. thomsonii. The analysis revealed numerous homologous collinear blocks, although these blocks were relatively short (Figure 7). Additionally, we identified distinct gaps representing regions that are unique to S. brachycarpa, with no homologous sequences found in the other species. The results showed inconsistent collinear block arrangements among the six mitogenomes, indicating that S. brachycarpa has undergone significant genomic rearrangements compared to its close relatives. Furthermore, the mitogenomes of the five Apiaceae species displayed a high degree of structural variability, suggesting that their sequence arrangements are extremely non-conservative.
Figure 7. Collinearity analysis of S. brachycarpa and four closely related species. Red arcs highlight inverted regions, while gray arcs indicate homologous regions with high similarity.
4 Discussion
Plants are indispensable to human society, serving as vital sources of food, medicine, and economic value (Gray et al., 1999; Wallace, 1999; Poole and Penny, 2007). Understanding the genetic foundations that contribute to these valuable traits is crucial for maximizing their potential. Mitogenomes, in particular, play a significant role in revealing how plants adapt and evolve, given their involvement in essential cellular processes like energy production, respiration, and stress responses (Houten and Auwerx, 2004; Koonin, 2010), offering insights into the diversification and functional significance of various species (Gualberto et al., 2014; Gualberto and Newton, 2017). S. brachycarpa emerges as a species of considerable economic and medicinal value within the Apiaceae family. It is recognized as a valuable species within the Spuriopimpinella genus, which has been reclassified as an independent genus based on both molecular and morphological evidence (Downie et al., 2010; Wang et al., 2013). In this study, we sequenced, assembled, and reported, for the first time, the complete mitogenome of S. brachycarpa, providing novel insights into the unique properties of this valuable plant.
Plant mitogenomes are known to vary greatly in size, ranging from as small as 6.6 kb in some Plasmodium species to nearly 19 Mb in Cathaya argyrophylla (Hikosaka et al., 2010; Huang et al., 2024b). Such variation is often attributed to differences in non-coding regions and the presence of repetitive sequences (Gualberto et al., 2014; Gualberto and Newton, 2017), and mitogenome size is often correlated with specific evolutionary adaptations and properties (Huang et al., 2024b). In this study, the mitogenome of S. brachycarpa was assembled to a total length of 523,512 bp. This size is notably larger than those of closely related species such as C. cyminum (246,721 bp), D. carota subsp. sativus (250,368 bp), O. linearis (367,287 bp), and O. thomsonii (384,782 bp). This difference suggests unique evolutionary and functional characteristics in S. brachycarpa. To explore the reasons behind the larger mitogenome of S. brachycarpa, we compared coding sequence lengths, non-coding regions, and the abundance of LSRs among five closely related species. The total coding sequence lengths were 35,958 bp (15.57%), 37,321 bp (14.91%), 38,693 bp (10.57%), 43,410 bp (11.28%), and 43,492 bp (8.31%) in C. cyminum, D. carota subsp. sativus, O. linearis, O. thomsonii, and S. brachycarpa, respectively. We identified 209 LSRs totaling 12,039 bp in C. cyminum, 303 LSRs (15,223 bp) in D. carota subsp. sativus, 492 LSRs (33,095 bp) in O. linearis, 599 LSRs (57,329 bp) in O. thomsonii, and 3,155 LSRs (213,335 bp) in S. brachycarpa. Among the 3,155 LSRs in S. brachycarpa, 3,144 (99.65%) were located in non-coding regions. While the number of genes and the lengths of coding sequences were relatively consistent across species, S. brachycarpa exhibited a significant expansion in non-coding regions due to an abundance of LSRs. Repetitive sequences, such as inverted, palindromic, and direct repeats, are known to influence genome size, gene arrangement, and evolutionary dynamics in plant mitogenomes, as well as contribute to cytoplasmic male sterility and impact pollen development, which is vital for seedling cultivation and genetic improvement (Smith and Keeling, 2015; Dong et al., 2018; Martins et al., 2019; Yang et al., 2023; Wang et al., 2024). The extensive presence of LSRs in S. brachycarpa suggests that they have played a significant role in the expansion and structural complexity of its mitogenome. This observation indicates that LSR expansion may be a key driver of mitogenome size variation in S. brachycarpa.
RNA editing can affect gene expression and protein function, potentially influencing mitochondrial activity and plant adaptation (Gray et al., 1999; Mackenzie and McIntosh, 1999). The number of RNA editing sites in land plant mitogenomes varies widely, from none in Marchantia polymorpha to 2152 in Selaginella moellendorffii (Zhang et al., 2020; Lai et al., 2022). In S. brachycarpa, 618 C-to-U editing sites were detected across 30 protein-coding genes, demonstrating significant gene-specific variability. Most of these sites were located at the first and second codon positions, with 95.63% leading to amino acid changes. These edits, especially those in energy-related genes, may enhance mitochondrial efficiency and contribute to the plant’s distinct traits. The conserved C-to-U conversion pattern across Apiaceae suggests a shared RNA editing mechanism within the family, potentially reflecting an adaptive advantage. Frequent edits at the second codon position could play a key role in modulating protein structure, hydrophobicity, and stability (Jiang et al., 2022; Wu et al., 2022). Similarly, codon usage also plays an essential role in shaping mitogenome evolution and adaptation. In S. brachycarpa and related species, codon usage is influenced by factors such as gene expression level, gene length, tRNA abundance, and codon position, which further drive species-specific adaptation (Li et al., 2023). The shared preference for the codon GCU in S. brachycarpa, O. linearis, and O. thomsonii suggests conserved translational selection pressures within these closely related species of the Apiaceae family. Conversely, C. cyminum and D. carota subsp. sativus displayed the highest preference for the stop codon UAA, consistent with previous findings (Chen et al., 2024). In the mitogenome of S. brachycarpa, 28 codons were identified as high frequency, with 27 ending in A/T; while the ratios for C. cyminum, D. carota subsp. sativus, O. linearis, and O. thomsonii were 28/30, 28/30, 27/29, and 28/30, respectively. The predominance of codons ending in either A or T, with RSCU values of 1.00 or higher, highlights a strong AT bias at the third codon position, a common pattern in higher plants (Yang et al., 2021).
The evolutionary patterns of plant mitogenomes differ from those of animals, with lower mutation rates and frequent integration of foreign DNA, including plastome sequences (Zhang, 1995). This gene transfer is crucial for biological evolution, adaptation, and diversity (Xiong et al., 2008). In S. brachycarpa, we identified 59 homologous fragments between the mitochondrial and chloroplast genomes, totaling 42,553 bp and representing 8.13% of the mitogenome. These findings suggest horizontal gene transfer events, which are common in plant mitogenomes and contribute to genomic diversity and functional adaptation (Smith and Keeling, 2015; Kozik et al., 2019). Such transfers could influence the functional repertoire of the mitogenome and impact metabolic pathways relevant to the plant’s economic and medicinal value.
Further, our phylogenetic analysis, based on shared mitochondrial protein-coding genes, confirmed the placement of S. brachycarpa within the Apiaceae family, revealing its close relationships. This finding not only supports the taxonomic position of S. brachycarpa but also aligns with previous molecular studies on Apiaceae phylogeny (Downie et al., 2010; Wang et al., 2013). Previous phylogenetic analyses inferred from cpDNA sequences indicated that Pimpinella brachycarpa should be placed in the genus Spuriopimpinella (Wang et al., 2013), which is consistent with our result. Given the limited mitogenomic reports for Apiaceae, we also reconstructed a phylogenetic tree using rps16 and rpl16 intron sequences from the newly sequenced plastome, along with previously available data from Wang et al. (2013) (Supplementary Figure S3). The results resolved that two accessions of S. brachycarpa formed a clade and is sister to S. arguta, with a strong support value (BS = 100). Here, as the shared mitochondrial genes generated a higher-resolution topology compared with short DNA fragment, a more comprehensive phylogenetic tree can be further constructed based on extending sampling with mitogenome data.
Mitogenomes play a crucial role in energy metabolism, stress responses, and other essential cellular processes (Houten and Auwerx, 2004; Koonin, 2010). Alterations in mitogenome structure and function can have significant effects on plant physiology and adaptation, potentially influencing traits that are valuable to humans. Our findings provide a foundational understanding of the mitogenome of S. brachycarpa, highlighting its unique features and evolutionary trajectory. The complex genomic structure, characterized by extensive gene rearrangements, an abundance of LSRs leading to an enlarged genome size, and evidence of horizontal gene transfer through MTPTs, suggests a dynamic unique evolutionary history. The high number of RNA editing sites further underscores the potential for functional adaptations within the mitogenome. These findings suggest that the unique characteristics of the S. brachycarpa mitogenome may be linked to its economic and medicinal value. Further functional studies are necessary to elucidate the specific roles of these mitogenome features in S. brachycarpa and their contributions to its valuable traits. Understanding these connections could provide valuable insights into the genetic basis of the plant’s medicinal properties and inform breeding and conservation strategies. In conclusion, our comprehensive analysis of the mitogenome of S. brachycarpa provides valuable insights into its evolutionary processes, genetic diversity, and potential links to its unique economic and medicinal properties. This research not only enhances our understanding of mitogenome evolution in the Apiaceae family but also underscores the significance in the study of plant biology and its applications in agriculture and medicine.
5 Conclusion
This study presents a comprehensive analysis of the mitogenome of S. brachycarpa. Leveraging both second and third-generation high-throughput sequencing technologies, we successfully assembled and annotated the complex multi-chromosome mitogenome, which spans 523,512 bp with a GC content of 43.37%. The genome includes 30 unique protein-coding genes, 21 tRNA genes, and three rRNA genes. Comparative analysis revealed that S. brachycarpa shares some collinear features with other Apiaceae species. However, it also exhibits significant gene location rearrangements and structural variations. These findings suggest that S. brachycarpa possesses one of the most complex genome structures among the analyzed Apiaceae species, characterized by relatively short homologous regions. Additionally, we identified 618 potential RNA editing sites, all of which involved C-to-U editing. Fifty-nine homologous fragments between the mitogenome and plastome were discovered spanning 42,553 bp and constituting 8.13% of the mitogenome. Overall, this study underscores the uniqueness and complexity of the S. brachycarpa mitogenome, offering valuable insights into the evolutionary relationships and genetic diversity within the Apiaceae family. These findings not only enhance the existing genomic database for Apiaceae but also provide a solid theoretical foundation for future research in molecular systematics and conservation genetics.
Data availability statement
The complete sequences of the mitogenome and plastome of S. brachycarpa are available in the GenBank nucleotide database. The accession number for the plastome is PQ213365, while the accession numbers for the mitogenome chromosomes are filed under accession numbers PQ273107 to PQ273111. Additionally, the sequencing reads used in the assembly for this study are deposited in the NCBI repository under the following identifiers: BioProject PRJNA1149306, BioSample SAMN43240799, and Sequence Read Archive (SRA) data SRR30284393 to SRR30284395.
Author contributions
JH: Formal analysis, Investigation, Resources, Writing – original draft, Writing – review & editing. WX: Data curation, Formal Analysis, Investigation, Project administration, Resources, Software, Validation, Writing – original draft, Writing – review & editing. HY: Funding acquisition, Validation, Writing – original draft, Writing – review & editing. YH: Writing – original draft, Writing – review & editing. MZ: Conceptualization, Data curation, Formal analysis, Visualization, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. The study was supported by the Special Fund of the Chinese Central Government for Basic Scientific Research Operations in the Commonweal Research Institute (Grant No. GYZX240417), the National Key Research and Development Program of China (Grant No. SQ2020YFF0426320).
Conflict of interest
Authors JH and WX were employed by the company Beijing Tcmages Pharmaceutical Co., Ltd.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2024.1492723/full#supplementary-material
Supplementary Figure 1 | The morphological characteristics of S. brachycarpa.
Supplementary Figure 2 | The plastome map of S. brachycarpa.
Supplementary Figure 3 | Phylogenetic relationships inferred from rpl16 and rps16 intron sequences.
Supplementary Table 1 | Analysis of repeat regions supporting multi-copy genes in ONT Data.
Supplementary Table 2 | Genes predicted in the plastome of S. brachycarpa.
Supplementary Table 3 | RNA editing sites identified in S. brachycarpa and four related species.
Supplementary Table 4 | RSCU values in S. brachycarpa and four related species.
Supplementary Table 5 | Number of conserved genes among 33 species.
References
Arrieta-Montiel, M. P., Mackenzie, S. A. (2011). “Plant Mitochondrial Genomes and Recombination,” in Plant Mitochondria, vol. 1 . Ed. Kempken, F. (Springer, New York, NY, USA), 65–82.
Benson, G. (1999). Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580. doi: 10.1093/nar/27.2.573
Boissieu, H. (1906). Note sur quelques Ombellifères de la Chine, d’après les collections du Muséum d’Histoire naturelle de Paris. Bull. Soc Bot. Fr. 53, 428. doi: 10.1080/00378941.1906.10831189
Chen, L., Dong, X., Huang, H., Xu, H., Rono, P. C., Cai, X., et al. (2024). Assembly and comparative analysis of the initial complete mitochondrial genome of Primulina hunanensis (Gesneriaceae): a cave-dwelling endangered plant. BMC Genomics 25, 322. doi: 10.1186/s12864-024-10247-9
Chen, Z. W., Zhao, N., Li, S. S., Grover, C. E., Nie, H. S., Wendel, J. F., et al. (2017). Plant mitochondrial genome evolution and cytoplasmic male sterility. Crit. Rev. Plant Sci. 36, 55–69. doi: 10.1080/07352689.2017.1327762
Clarkson, J. J., Zuntini, A. R., Maurin, O., Downie, S. R., Plunkett, G. M., Nicolas, A. N., et al. (2021). A higher-level nuclear phylogenomic study of the carrot family (Apiaceae). Am. J. Bot. 108, 1252–1269. doi: 10.1002/ajb2.1701
Danecek, P., McCarthy, S. A. (2017). BCFtools/csq: haplotype-aware variant consequences. Bioinformatics 33, 2037–2039. doi: 10.1093/bioinformatics/btx100
Dong, S., Zhao, C., Chen, F., Liu, Y., Zhang, S., Wu, H., et al. (2018). The complete mitochondrial genome of the early flowering plant Nymphaea colorata is highly repetitive with low recombination. BMC Genomics 19, 614. doi: 10.1186/s12864-018-4957-6
Downie, S. R., Spalik, K., Katz-Downie, D. S., Reduron, J.-P. (2010). Major clades within Apiaceae subfamily Apioideae as inferred by phylogenetic analysis of nrDNA ITS sequences. Plant Divers. Evol. 128, 111–136. doi: 10.1127/1869-6155/2010/0128-0005
Dunn, N. A., Unni, D. R., Diesh, C., Munoz-Torres, M., Harris, N. L., Yao, E., et al. (2019). Apollo: democratizing genome annotation. PloS Comput. Biol. 15, e1006790. doi: 10.1371/journal.pcbi.1006790
Dyall, S. D., Brown, M. T., Johnson, P. J. (2004). Ancient invasions: from endosymbionts to organelles. Science 304, 253–257. doi: 10.1126/science.1094884
Edera, A. A., Small, I., Milone, D. H., Sanchez-Puerta, M. V. (2021). Deepred-Mt: Deep representation learning for predicting C-to-U RNA editing in plant mitochondria. Comput. Biol. Med. 136, 104682. doi: 10.1016/j.compbiomed.2021.104682
Govaerts, R., Nic Lughadha, E., Black, N., Turner, R., Paton, A. (2021). The World Checklist of Vascular Plants, a continuously updated resource for exploring global plant diversity. Sci. Data 8, 215. doi: 10.1038/s41597-021-00997-6
Gray, M. W., Burger, G., Lang, B. F. (1999). Mitochondrial evolution. Science 283, 1476–1481. doi: 10.1126/science.283.5407.1476
Gualberto, J. M., Mileshina, D., Wallet, C., Niazi, A. K., Weber-Lotfi, F., Dietrich, A. (2014). The plant mitochondrial genome: Dynamics and maintenance. Biochimie 100, 107–120. doi: 10.1016/j.biochi.2013.09.016
Gualberto, J. M., Newton, K. J. (2017). Plant mitochondrial genomes: dynamics and mechanisms of mutation. Annu. Rev. Plant Biol. 68, 225–252. doi: 10.1146/annurev-arplant-043015-112232
Hikosaka, K., Watanabe, Y., Tsuji, N., Kita, K., Kishine, H., Arisue, N, et al. (2010). Divergence of the mitochondrial genome structure in the apicomplexan parasites, Babesia and Theileria, Mol. Biol. Evol. 27, 1107–1116. doi: 10.1093/molbev/msp320
Houten, S. M., Auwerx, J. (2004). PGC-1α: turbocharging mitochondria. J. Clin. Invest. 119, 5–7. doi: 10.1016/j.cell.2004.09.016
Huang, K., Xu, W., Hu, H., Jiang, X., Sun, L., Zhao, W., et al. (2024b). The mitochondrial genome of Cathaya argyrophylla reaches 18.99 Mb: Analysis of super-large mitochondrial genomes in Pinaceae. arXiv. Available at: https://arxiv.org/abs/2410.07006v1.
Huang, L., Yu, H., Wang, Z., Xu, W. (2024a). CPStools: A package for analyzing chloroplast genome sequences. iMetaOmics, e25. doi: 10.1002/imo2.25
Jiang, H., Lu, Q., Qiu, S., Yu, H., Wang, Z., Yu, Z., et al. (2022). Fujian cytoplasmic male sterility and the fertility restorer gene OsRf19 provide a promising breeding system for hybrid rice. Proc. Natl. Acad. Sci. U.S.A. 119, e2208759119. doi: 10.1073/pnas.2208759119
Jiang, M., Ni, Y., Li, J., Liu, C. (2023). Characterisation of the complete mitochondrial genome of taraxacum mongolicum revealed five repeat-mediated recombinations. Plant Cell Rep. 42, 775–789. doi: 10.1007/s00299-023-02994-y
Jin, J.-J., Yu, W.-B., Yang, J.-B., Song, Y., dePamphilis, C. W., Yi, T.-S., et al. (2020). GetOrganelle: A fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21, 241. doi: 10.1186/s13059-020-02154-5
Johnson, M., Zaretskaya, I., Raytselis, Y., Merezhuk, Y., McGinnis, S., Madden, T. L. (2008). NCBI BLAST: A better web interface. Nucleic Acids Res. 36, W5–W9. doi: 10.1093/nar/gkn201
Karin, B. R., Arellano, S., Wang, L., Walzer, K., Pomerantz, A., Vasquez, J. M., et al. (2023). Highly-multiplexed and efficient long-amplicon PacBio and Nanopore sequencing of hundreds of full mitochondrial genomes. BMC Genomics. 24, 229. doi: 10.1186/s12864-023-09277-6
Katoh, K., Standley, D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780. doi: 10.1093/molbev/mst010
Kitagawa, M. (1941). Miscellaneous Notes on Apiaceae (Umbelliferae) of Japan and Manchuria (IV). J. Jpn. Bot. 17, 558. doi: 10.51033/jjapbot.17_10_2588
Kolmogorov, M., Yuan, J., Lin, Y., Pevzner, P. A. (2019). Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37, 540–546. doi: 10.1038/s41587-019-0072-8
Koonin, E. V. (2010). The origin and early evolution of eukaryotes in the light of phylogenomics. Genome Biol. 11, 209. doi: 10.1186/gb-2010-11-5-209
Kozik, A., Rowan, B. A., Lavelle, D., Berke, L., Schranz, M. E., Michelmore, R. W., et al. (2019). The alternative reality of plant mitochondrial DNA: one ring does not rule them all. PloS Genet. 15, e1008373. doi: 10.1371/journal.pgen.1008373
Krzywinski, M., Schein, J., Birol, I., Connors, J., Gascoyne, R., Horsman, D., et al. (2009). Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645. doi: 10.1101/gr.092759.109
Kurtz, S., Choudhuri, J. V., Ohlebusch, E., Schleiermacher, C., Stoye, J., Giegerich, R. (2001). REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 29, 4633–4642. doi: 10.1093/nar/29.22.4633
Lai, C., Wang, J., Kan, S., Zhang, S., Li, P., Reeve, W. G., et al. (2022). Comparative analysis of mitochondrial genomes of Broussonetia spp. (Moraceae) reveals heterogeneity in structure, synteny, intercellular gene transfer, and RNA editing. Front. Plant Sci. 13, 1052151. doi: 10.3389/fpls.2022.1052151
Li, H., Durbin, R. (2009). Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics 25, 1754–1760. doi: 10.1093/bioinformatics/btp324
Li, Y., Hu, X., Xiao, M., Huang, J., Lou, Y., Hu, F., et al. (2023). An analysis of codon utilization patterns in the chloroplast genomes of three species of coffea. BMC Genomic Data 24, 42. doi: 10.1186/s12863-023-01108-w
Li, Y.-Y., Liu, Y.-Y., Zeng, X., Wu, P., Li, Q.-M., Guo, S.-X., et al. (2024). Complete mitochondrial genome of angelica dahurica and its implications on evolutionary analysis of complex mitochondrial genome architecture in apiaceae. Front. Plant Sci. 15. doi: 10.3389/fpls.2024.1367299
Lowe, T. M., Eddy, S. R. (1997). tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964. doi: 10.1093/nar/25.5.955
Mackenzie, S., McIntosh, L. (1999). Higher plant mitochondria. Plant Cell 11, 571–586. doi: 10.1105/tpc.11.4.571
Mannella, C. A. (2006). The relevance of mitochondrial membrane topology to mitochondrial function. Biochim. Biophys. Acta 1762, 140–147. doi: 10.1016/j.bbadis.2005.07.001
Martins, G., Balbino, E., Marques, A., Almeida, C. (2019). Complete mitochondrial genomes of the Spondias tuberosa Arr. Cam and Spondias mombin L. reveal highly repetitive DNA sequences. Gene 720, 144026. doi: 10.1016/j.gene.2019.144026
Nicholls, D. G., Budd, S. L. (2000). Mitochondria and neuronal survival. Physiol. Rev. 80, 315–360. doi: 10.1152/physrev.2000.80.1.315
Ou, T., Wu, Z., Tian, C., Yang, Y., Li, Z. (2024). Complete mitochondrial genome of Agropyron cristatum reveals gene transfer and RNA editing events. BMC Plant Biol. 24, 830. doi: 10.1186/s12870-024-05558-8
Pimenov, M. G., Leonov, M. V. (1993). The Genera of the Umbelliferae: A Nomenclator (Kew, UK: Royal Botanic Gardens).
Plunkett, G. M., Pimenov, M. G., Reduron, J.-P., Kljuykov, E. V., van Wyk, B.-E., Ostroumova, T. A., et al. (2018). “Apiaceae,” in Flowering Plants. Eudicots. Eds. Kadereit, J. W., Bittrich, V. (Springer International Publishing, Cham, Switzerland), 9–206.
Poole, A. M., Penny, D. (2007). Evaluating hypotheses for the origin of eukaryotes. Bioessays 29, 74–84. doi: 10.1002/bies.20516
Raimundo, J., Reis, C. M. G., Ribeiro, M. M. (2018). Rapid, simple and potentially universal method for DNA extraction from Opuntia spp. Fresh cladode tissues suitable for PCR amplification. Mol. Biol. Rep. 45, 1405–1412. doi: 10.1007/s11033-018-4303-8
Scorrano, L., Ashiya, M., Buttle, K., Weiler, S., Oakes, S. A., Mannella, C. A., et al. (2002). A Distinct Pathway Remodels Mitochondrial Cristae and Mobilizes Cytochrome c during Apoptosis. Dev. Cell 2, 55–67. doi: 10.1016/s1534-5807(01)00116-2
Sheh, M.-L., Pu, F.-D., Pan, Z.-H., Watson, M. F., Cannon, J. F. M., Holmes-Smith, et al. (2005). “Apiaceae,” in Editorial Committee, Flora of China, vol. 14. (Missouri Botanical Garden Press, St. Louis, MI, USA), 1–205.
Shi, L., Chen, H., Jiang, M., Wang, L., Wu, X., Huang, L., et al. (2019). CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res. 47, W65–W73. doi: 10.1093/nar/gkz345
Smith, D. R., Keeling, P. J. (2015). Mitochondrial and plastid genome architecture: Reoccurring themes, but significant differences at the extremes. Proc. Natl. Acad. Sci. U.S.A. 112, 10177–10184. doi: 10.1073/pnas.1422049112
Stamatakis, A. (2014). RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313. doi: 10.1093/bioinformatics/btu033
Štorchová, H., Krüger, M. (2024). Methods for assembling complex mitochondrial genomes in land plants. J. Exp. Bot. 75, 5169–5174. doi: 10.1093/jxb/erae034
Wallace, D. C. (1999). Mitochondrial diseases in man and mouse. Science 283, 1482–1488. doi: 10.1126/science.283.5407.1482
Wang, Z. X., Downie, S. R., Tan, J. B., Liao, C. Y., Yu, Y., He, X. J. (2013). Molecular phylogenetics of Pimpinella and allied genera (Apiaceae), with emphasis on Chinese native species, inferred from nrDNA ITS and cpDNA intron sequence data. Nord. J. Bot. 32, 642–657. doi: 10.1111/j.1756-1051.2013.00343.x
Wang, J., Kan, S., Liao, X., Daniell, H., Jin, S., Wu, Z. (2024). Plant organellar genomes: much done, much more to do. Trends Plant Sci. 29, 754–769. doi: 10.1016/j.tplants.2023.12.014
Wang, Y., Tang, H., Debarry, J. D., Tan, X., Li, J., Wang, X., et al. (2012). MCScanX: A toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49. doi: 10.1093/nar/gkr1293
Wick, R. R., Judd, L. M., Gorrie, C. L., Holt, K. E. (2017). Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PloS Comput. Biol. 13, e1005595. doi: 10.1371/journal.pcbi.1005595
Wick, R. R., Schultz, M. B., Zobel, J., Holt, K. E. (2015). Bandage: interactive visualization of de novo genome assemblies. Bioinformatics 31, 3350–3352. doi: 10.1093/bioinformatics/btv383
Wu, J. J., Cao, Z., Hassan, S. S. U., Zhang, H., Ishaq, M., Yu, X., et al. (2023). Emerging biopharmaceuticals from pimpinella genus. Molecules 28, 1571. doi: 10.3390/molecules28041571
Wu, Z. Q., Liao, X. Z., Zhang, X. N., Tembrock, L. R., Broz, A. (2022). Genomic architectural variation of plant mitochondria - a review of multichromosomal structuring. J. Syst. Evol. 60, 160–168. doi: 10.1111/jse.12655
Xiong, A. S., Peng, R. H., Zhuang, J., Gao, F., Zhu, B., Fu, X. Y., et al. (2008). Gene duplication and transfer events in plant mitochondria genome. Biochem. Biophys. Res. Commun. 376, 427–430. doi: 10.1016/j.bbrc.2008.08.116
Yang, H., Li, W., Yu, X., Zhang, X., Zhang, Z., Liu, Y., et al. (2021). Insights into molecular structure, genome evolution and phylogenetic implication through mitochondrial genome sequence of Gleditsia sinensis. Sci. Rep. 11, 14850. doi: 10.1038/s41598-021-93920-9
Yang, H., Ni, Y., Zhang, X., Li, J., Chen, H., Liu, C. (2023). The mitochondrial genomes of panax notoginseng reveal recombination mediated by repeats associated with DNA replication. Int. J. Biol. Macromol. 252, 126359. doi: 10.1016/j.ijbiomac.2023.126359
Zhang, S. H. (1995). The structural differences between animal and plant mitochondrial genomes - two evolutionary scenarios. Zool. Res. 16, 132–145.
Zhang, J., Fu, X. X., Li, R. Q., Zhao, X., Liu, Y., Li, M. H., et al. (2020). The hornwort genome and early land plant evolution. Nat. Plants 6, 107–118. doi: 10.1038/s41477-019-0588-4
Keywords: Spuriopimpinella, Pimpinella, mitochondrial genome, phylogenetic analysis, RNA editing, chloroplast-mitochondrial homologous fragments
Citation: Han J, Xu W, Yu H, Han Y and Zhu M (2024) Structural and evolutionary analyses of the mitochondrial genome of Spuriopimpinella brachycarpa. Front. Plant Sci. 15:1492723. doi: 10.3389/fpls.2024.1492723
Received: 07 September 2024; Accepted: 06 November 2024;
Published: 26 November 2024.
Edited by:
Andan Zhu, Chinese Academy of Sciences (CAS), ChinaReviewed by:
Luke R. Tembrock, Colorado State University, United StatesZhiqiang Wu, Chinese Academy of Agricultural Sciences, China
Peng-Fei Ma, Chinese Academy of Sciences (CAS), China
Copyright © 2024 Han, Xu, Yu, Han and Zhu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Wenbo Xu, eHdiNzUzM0AxNjMuY29t; Ming Zhu, bXpodUBzY2F1LmVkdS5jbg==
†These authors have contributed equally to this work