- 1State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of New Drug Design, East China University of Science and Technology, Shanghai, China
- 2College of Pharmacy, Dali University, Dali, China
Aconitum is an important medicinal group of the Ranunculaceae family and has been used as conventional medicine in Bai, Yi, and other ethnic groups of China. There are about 350 Aconitum species globally and about 170 species in China. It is challenging to identify the species in morphology, and the lack of molecular biology information hinders the identification and rational utilization of the germplasm of this genus. Therefore, it is necessary to increase the molecular data of Aconitum species. This paper acquired the complete chloroplast (CP) genome sequence of ten medicinal plants of Aconitum species from Yunnan by Illumina paired-end (PE) sequencing technology and compared it with other species in the same family and genus. These CP genomes exhibited typical circular quadripartite structure, and their sizes ranged from 155,475 (A. stylosum) to 155,921 bp (A. vilmoinianum), including a large single-copy region (LSC), a small single-copy region (SSC), and two inverted repeat regions (IRs). Their gene content, order, and GC content (38.1%) were similar. Moreover, their number of genes ranged from 129 (A. vilmoinianum) to 132 (A. ramulosum), including 83 to 85 protein-coding genes (PCGs), 37 tRNA genes (tRNAs), eight rRNA genes (rRNAs), and two pseudogenes. In addition, we performed repeated sequence analysis, genomic structure, and comparative analysis using 42 Aconitum chloroplast genomes, including ten Aconitum chloroplast genomes and other sequenced Aconitum species. A total of 48–79 simple sequence repeats (SSRs) and 17 to 77 long repeat sequences were identified. IR regions showed higher variability than the SSC region and LSC region. Seven mutational hotspots were screened out, including trnK-UUU-trnQ-UGG, psbD, ndhJ-ndhK, clpP, psbH-petB, ycf1, and trnA-UGC-trnI-GAU, respectively. The phylogenetic trees of ten Aconitum species and other Aconitum species revealed that the complete CP genome was beneficial in determining the complex phylogenetic relationships among Aconitum species. This study provides a potential molecular marker and genomic resource for phylogeny and species identification of Aconitum species and an important reference and basis for Ranunculaceae species identification and phylogeny.
Introduction
Aconitum belonging to the Ranunculaceae comprises approximately 350 species mainly distributed in Asia, followed by Europe and North America. There are about 170 species of Aconitum in China. The Qinghai-Tibet Plateau and its adjacent areas are its most extensive distribution region, and the northern provinces are its second-largest distribution region (Li and Kadota, 2001).
Aconitum carmichaelii Debx. has been used in China for more than 2,000 years. It was first recorded in Shennong Bencao Jing (Wu and Sun, 1936). Seventy-six species of Aconitum plants were used in folk medicine through textual research, mainly for treating heart failure, rheumatism, joint pain, falling injury, stroke, paralysis, sore boils and poison, and wind and cold syndrome (Yang, 2012). However, there were differences in application, such as A. episcopale could detoxify, has anti-alcoholic properties, and detoxifies opium (Li et al., 1995); A. brachypodum was used for fractures, sprain, rheumatism, fall injury (Yang et al., 2016); and A. alboviolaceum was used for hypertension joint pain (Lu, 1995). It is important to note that many of the species of Aconitum plants are also reported to be toxic. The application process is highly susceptible to endangering human life safety due to misused species (He et al., 2010). The variety of Aconitum species and the diversity of the natural environment in the distribution area make its morphological variation extremely complex. Thus, it is difficult to accurately identify the Aconitum species based on their morphological characteristics. As for molecular identification, previous studies have demonstrated that barcoding sequences (ITS2, psbA-trnH) are also not ideal for accurate identification of Aconitum spp. (Zhu et al., 2020). Therefore, finding an accurate identification marker for this genus is necessary. Chloroplasts are independent organelles in the plant cells, have a complete set of the genome, which is relatively conserved in the genetic composition structure and contains more abundant mutation sites. These structural features, which allow chloroplast genomes to occupy a vital position in plant species’ discrimination and evolutionary study, have been widely used as super barcodes for species identification and phylogenetic studies (Sugiura, 1992; Yurina and Odintsova, 1998; Moore Michael et al., 2007).
In this study, we assembled and analyzed the complete chloroplast genomes sequence of ten medicinal plants of Aconitum species from Yunnan using Illumina PE sequencing and reported the CP genomes of ten medicinal plants of Aconitum species, including Aconitum stapfianum Hand.-Mazz., A. episcopale Leveille, A. ramulosum W.T.Wang, A. vilmoinianum Kom., A. nagarum Stapf., A. ouvrardianum Hand.-Mazz., A. delavayi Franch., A. duclouxii Levl., A. stylosum Stapf., and A. weixiense W.T.Wang. We performed a general characteristic analysis of chloroplast genomes for the ten Aconitum species sequenced in this study. In addition, we performed repeated sequence analysis, genome structure, comparative analysis, and phylogenetic analysis of the ten Aconitum species in this study and 32 chloroplast genomes of Aconitum submitted to the GeneBank database. We aim to expand our understanding of the genome divergence of Aconitum, provide insights into the phylogenetic relationships of Aconitum species, and identify potential DNA barcodes for identifying Aconitum species. These results will provide a theoretical basis for molecular phylogenetic and evolutionary studies at the species level of Aconitum.
Materials and Methods
Plant Materials
Plant materials from ten Aconitum spp. were collected from the Weixi County, Lijiang County, and Zhongdian County, Yunnan Province, in September 2020 (Table 1; Figure 1). Detailed information about the materials used and sequences obtained in the study is provided in Table 1. Fresh leaf material without lesions was collected and stored dry with discoloured silica gel. Professor Conglong Xia of Dali University identified the species according to the morphological characteristics recorded in the Flora of China. The voucher specimens were deposited in the Plant and Medicinal Herbology, College of Pharmacy, Dali University. The voucher specimens number was recorded as WT001∼WT0010.
DNA Extraction and Illumina Sequencing
Total genomic DNA was extracted from clean leaves from samples frozen at −80°C using the E.Z.N.A® Plant DNA kit (OMEGA, Beijing). Shanghai Origingene Co., Ltd. did the work. The DNA quality was assessed by electrophoresis in a 1% (w/v) agarose gel. The OD260/280 values ranged from 1.8 to 2.0, and >2 μg of DNA was equally pooled from individuals of the ten species to generate shotgun libraries. DNA samples were randomly sheared, incubated with fragmentation buffer, and broken into 300–500 bp fragments in an M220 focused ultrasonicator (Covaris, Woburn, MA, United States). The A & B connectors at both ends of the DNA fragment were connected, the segments were screened, and the self-connecting segments of the connector were removed. Fragment screening by electrophoresis in agarose gel keeps the fragment with A connector at one end, and B connector at the other end; the subsequent additions of NaOH to denatures produces single-stranded DNA fragments. This DNA was further paired-end sequenced using the Illumina Novaseq 2500 (Shanghai Origingene Co., Ltd., China) platform.
Genome Quality and Control and De Novo Assembly
Raw image data obtained by Illumina sequencing was converted into FASTQ format sequencing data files by Base Calling (Paul, 2015). The FASTQ data file used FastQC v0.11.4 (www.bioinformatics.babraham.ac.uk/bugzilla/) for quality control. The sequenced reads were spliced with several iterations using NOVOPlasty (https://github.com/ndierckx/NOVOPlasty) stitching software (Dierckxsens et al., 2017). The seed sequence required for the assembly was MN556604.1 (complete chloroplast genome of Delphinium grandiflorum), and the optimal assembly result was obtained.
Genome Annotation and Submission
PGA software (https://github.com/quxiajian/PGA) was used for gene annotation (Qu et al., 2019), and manual corrections were made for all sample annotation results. The annotated chloroplast genome GB file was submitted to the online tool (https://chlorobox.mpimp-golm.mpg.de/OGDraw.html) to generate a physical map (Lohse et al., 2007). Finally, these TBL files were generated to submit the sequences to NCBI. The complete and correct CP genome sequences of the ten Aconitum species were deposited in the GenBank database (https://www.ncbi.nlm.nih.gov/genbank/) of NCBI, and accession numbers are OM289057.1 (A. ouvrardianum), OM289058.1 (A. delavayi), OM289059.1 (A. ramulosum), OM328065.1 (A. vilmoinianum), OM328066.1 (A. episcopale), OM328067.1 (A. stapfianum), OM328068.1 (A. nagarum), OM328069.1 (A. weixiense), OM328070.1 (A. duclouxii), and OM328071.1 (A. stylosum), respectively.
Codon Usage Analysis
The CodonW software (University of Texas, Houston, TX, United States) was used to investigate the distribution of codons based on the relative synonymous codon usage (RSCU) ratio and the effective number of codon usage (ENC) to analyze the codon usage preference (Sharp et al., 1986).
Repeat Sequence Analysis
The FASTA files of whole chloroplast genome sequences of 42 Aconitum species were sorted out. Simple sequence repeats (SSRs) were detected using the MISA software (http://pgrc.ipk-gatersleben.de/misa/) (Beier et al., 2017). The minimum values for the number of repeats of mono -, di -, tri -, tetra -, penta -, and hexanucleotide repeats were set to 10, 5, 4, 3, 3, and 3 respectively. The sizes and locations of repeat sequences in the CP genomes of the 42 Aconitum spp. were identified using REPuter (http://bibiserv.techfak.uni-bielefeld.de/reputer/) (Kurtz et al., 2001) with the parameters set to a similarity percentage of scattered repeat copies ≥90%, a minimal repeat size of 30 bp, the hamming distance of 3 and maximum computed repeats of 5,000.
Genome Structure and Comparative Genome Analysis
The mVISTA online program (https://genome.lbl.gov/vista/mvista/submit.shtml) (Frazer et al., 2004) was used in Shuffle-LAGAN model to compare 42 Aconitum species CP genome using A. vilmorinianum CP genome as a reference. The IRscope (https://irscope.shinyapps.io/irapp/) online program (Amiryousefi et al., 2018) was used to obtain IR regions comparative analysis of the chloroplast genome.
Sliding Window Analysis
The DnaSP software was used for sliding window analysis (Rozas et al., 2017), the nucleotide diversity values (Pi) were calculated, and interspecific high variation sequences (hotspots) were screened according to the analysis results. The size of the windows length was set to 600 bp, and the step size was set to 200 bp.
Phylogenetic Analysis
To determine the phylogenetic positions of the ten Aconitum species within Ranunculaceae, we analyzed the CP genomes of 44 species, encompassing 34 additional taxa within this lineage. The CP genome sequences of 34 species were downloaded from Genebank database. The complete CP genome sequences and PCGs were used to reconstruct the Aconitum species phylogenetic tree. The chloroplast genome sequences were aligned using MAFFT online tool (Katoh and Standley, 2013). We used the CP genomes of Delphinium anthriscifolium Hance (MK253461.1) and Delphinium grandiflorum L. (NC_049872.1) as outgroups. The MEGA X software (Kumar et al., 2018) was used to construct phylogenetic trees employing 44 CP genomes sequences based on neighbour-joining (NJ) with 1,000 bootstrap replicates, and the model was Kimura 2-parameter. The IQ-tree software (Nguyen et al., 2015) was used to construct phylogenetic trees employing 44 CP genomes sequences based on maximum likelihood (ML) with 1,000 bootstrap replicates.
Results
Chloroplast Genomes of Ten Aconitum spp.
The genome sequences assembled using the reads obtained from the Illumina sequencing platform ranged from 155,475 bp for A. stylosum to 155,921 bp for A. vilmorinianum (Table 2). The genome exhibited a typical cyclic tetramer structure, including the SSC and LSC regions separated by two IR regions (Figure 2). The sequence length of LSC regions ranged from 86,098 bp for A. stylosum to 86,524 bp for A. vilmorinianum, the sequence length of SSC regions ranged from 16,903 bp for A. stylosum to 16,983 bp for A. nagarum, and the sequence length of IRs regions ranged from 52,218 bp for A. duclouxii to 52,476 bp for A. ouvrardianum (Table 2).
FIGURE 2. Gene map of the complete chloroplast genomes of Aconitum species. Genes inside the circle are transcribed clockwise, and those on the outside are transcribed counter-clockwise. Genes belonging to different functional groups have been colour-coded. The darker grey area in the inner circle corresponds to GC content, whereas the lighter grey corresponds to AT content.
The chloroplast genomes of ten Aconitum species can be divided into the coding and non-coding regions; the coding region sequence lengths ranged from 79,014 to 78,996 bp, and the non-coding region sequence lengths ranged from 76,569 to 77,475 bp. The ten Aconitum chloroplast genome sequences all encoded 129–132 genes (including IR region duplicates), and if 18 duplicates in the IR regions were excluded, a total of 111–114 genes were encoded, including 83 to 85 PCGs (seven duplicated genes), 37 tRNAs (seven duplicated genes), eight rRNAs (four duplicated genes). It is noteworthy that two pseudogenes (ycf1 and rps19) were found in A. ouvrardianum, A. delavayi, and A. ramulosum, whereas only the pseudogene ycf1 was found in the remaining species. Moreover, the GC contents of the ten Aconitum CP genomes were 38.1%, and the IR regions (43%) had higher GC contents than the single-copy regions (LSC: 36.2% and SSC: 32.6%). Furthermore, we also analyzed the length and frequency of four bases that included adenine (A), thymine (T), guanine (G), and cytosine (C) in the chloroplast genome of ten Aconitum species. The results showed that the lengths of adenine ranged from 47,585 to 47,840 bp, the lengths of cytosine ranged from 29,781 to 30,212 bp, and the lengths of guanine ranged from 29,058 to 29,524 bp, and the lengths of thymine ranged from 48,391 to 48,825 bp, respectively. A and T were used significantly more frequently than G and C.
Base Composition Analysis of Genome
The base composition of the entire coding region and different position codons in the chloroplast genomes of the ten Aconitum species were analyzed (Table 3). The coding region length ranged from 63,372 bp for A. duclouxii to 79,014 bp for A. delavayi. The total AT content in the coding region was 60.6–61.7%, and the total GC content was 38.3–39.4%. The length of the first, second, and third codons ranged from 21,124 bp to 26,338 bp, and GC contents ranged were 45.9–47.3% (first codon), 38.5–39.4% (second codon), and 30.7–31.5% (third codon), respectively. Moreover, the third codon site has AT preference.
Furthermore, we also analyzed the base composition in different regions of the ten Aconitum species chloroplast genomes (Supplementary Table S1). The AT content of SSC was the highest, followed by the LSC region and IRs region. The AT content of the protein-coding gene region was the highest, followed by the tRNAs region and rRNAs region. The results indicated that the AT content of the chloroplast genome was higher than the GC content and also demonstrated the codon preference for using bases A and T(U).
Genome Function and Classification
Chloroplast genomes of ten Aconitum species were annotated, and their genes were functionally classified (Table 4). The number of chloroplast genome coding genes in the ten Aconitum species ranged from 129 to 132, including 83–84 protein-coding genes (PCGs), 37 tRNA genes, and eight rRNA genes. The first group of genes involved in transcription and translation consists of 69–70 genes, including small subunit of ribosome genes (rps2, rps3, rps4, rps7 × 2, rps8, rps11, rps12 × 2, rps14, rps15, rps16, rps18, and rps19), ten large subunit of ribosome genes (rpl2 × 2, rpl14, rpl16, rpl20, rpl22, rpl23 × 2, rpl33, and rpl36), one transcription initiation factor genes (infA), four RNA polymerase genes (rpoA, rpoB, rpoC1, rpoC2), eight rRNA genes (rrn16 × 2, rrn23 × 2, rrn4.5 × 2, and rrn5 × 2), and 37 tRNA genes (trnA-UGC × 2, trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnfM-CAU, trnG-GCC, trnG-UCC, trnH-GUG, trnI-CAU × 2, trnI-GAU × 2, trnK-UUU, trnL-CAA × 2, trnL-UAA, trnL-UAG, trnM-CAU, trnN-GUU × 2, trnP-UGG, trnQ-UUG, trnR-ACG × 2, trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-GAC × 2, trnV-UAC, trnW-CCA, and trnY-GUA). The second group of 45 genes involved in the photosynthesis, including 20 Photosystem I and Photosystem II genes (psaA, psaB, psaC, psaI, psaJ, psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, and psbZ), six Cytochrome b/f complex genes (petA, petB, petD, petG, petL, petN), six ATP synthase genes (atpA, atpB, atpE, atpF, atpH, atpI), one ATP-dependent protease subunit gene (clpP), one RubiscoCO large subunit gene (rbcL), and 12 NADH oxidoreductase genes (ndhA, ndhB × 2, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK). The third group of four genes involved in the biosynthesis of amino acids, fatty acids, etc., included one Maturase gene (matK), one Envelop membrane protein gene (cemA), one Subunit Acetyl-CoA-Carboxylate gene (accD), and one c-type cytochrome synthesis gene (ccsA); another group of unknown function genes has 6–8. Among the ten Aconitum species, A. ouvrardianum, A. delavayi, and A. ramulosum contained ycf15 and rps16 genes, and the other seven species did not have ycf15 and rps16 genes.
In eukaryotic and semi-prokaryotic systems, gene expression occurs in the cytoplasm and organelles of the nucleus, respectively. Introns play a vital role in the regulation of gene expression. Previous studies have proved that introns can improve the expression level of extraneous genes in eukaryotic genomes (Cui, 2020). A total of 17 genes contained introns in the ten Aconitum CP genomes, clpP, ycf3, and rps12 genes contained two introns and three exons, whereas the other 14 genes all contained one intron and two exons (Supplementary Table S2). In addition, the introns and exons contained by the genes of ten Aconitum were essentially the same or had minor differences in length, as shown by the results in Supplementary Table S2.
Codon Usage Analysis
The codon usage and codon recognition patterns of the ten Aconitum CP genomes are shown in Figure 3 and Supplementary Table S3. The CP protein-coding genes of these ten species contained 61 codons encoding 20 amino acids. Leucine (10.4%–10.5%) was the most frequent, and cysteine (1.18%–1.19%) was the least, which was consistent with the results of amino acids coding of chloroplast genomes in most plants (Somaratne et al., 2020; Wang Z. et al., 2021). The statistical results of the frequency of codon usage in the chloroplasts of the ten Aconitum species showed that the coding regions (CDS) comprised 26,086 codons in A. ouvrardianum to 26,338 codons in A. vilmorinianum (Supplementary Table S5). Among amino acids encoded by the codon, Leu was the most frequent amino acid, encoding 2,701 times in A. ouvrardianum to 2,732 times in A. stylosum and A. nagarum; The least frequently coded amino acid was Cys, only encoding 302 times in A. ouvrardianum, A. ramulosum, and A. delavayi to 307 times in A. vilmorinianum. The results are consistent with those in Figure 3.
FIGURE 3. Frequency of amino acids encoded in the coding region by codons of chloroplast genes from ten Aconitum species.
Furthermore, we analyzed the relative synonymous codon usage (RSCU) and codon base composition of protein-coding gene sequences (CDS) of ten Aconitum species. There were 64 codons in CDS sequences of ten Aconitum species genes, among which 61 codons encoded 20 amino acids and the other three codons were stop codons (Figure 4B). An RSCU value <1.00 indicates that the codon usage frequency is lower than expected, whereas an RSCU value >1.00 indicates that the codon usage frequency is higher than expected (Sharp et al., 1986). In this study, the RSCU values of 31 codons were greater than 1, RSCU values of 31 codons were less than 1, and RSCU values of two codons were 1 (Figure 4B). Among the 64 codons, there were 16 codons, each ending with A, U, G, and C. Meanwhile, among the codons with RSCU value >1, 13 codons ended in A, 16 codons ended in U, one codon ended in G, and one codon ended in C (Figure 4A). These results indicated that the chloroplast genome codon of Aconitum species prefers to end in A/U.
FIGURE 4. Analysis of codon bias in the chloroplast genome of ten Aconitum species. (A) Analysis of terminal bases at codons in chloroplast genomes; (B) RSCU value analysis of codons in chloroplast genomes.
SSRs in the Chloroplast Genomes of the 42 Aconitum spp.
SSRs are widely present in CP genomes, consisting of 1-6-nucleotide repeat units, and are valuable molecular markers of high variation within the same species (Jiao et al., 2012). In order to explore the distribution and differences of SSRs among Aconitum species, we detected SSRs in the chloroplast genomes of 42 Aconitum species using MISA. Ten of them were the species sequenced in this study, and the sequences of the remaining 32 species were downloaded from NCBI. This study analyzed the number, type, and regional distribution of SSRs in 42 Aconitum chloroplast genomes (Figure 5). The total SSR loci in chloroplast genomes of 42 Aconitum species ranged from 48 in A. volubile to 79 in A. coreanum and A. reclinatum.
FIGURE 5. Analysis of simple sequence repeat (SSR) in the chloroplast genomes of 42 Aconitum species.
Furthermore, we analyzed SSRs loci in different regions of the chloroplast genome (LSC/SSC/IRs). The results showed that the distribution of SSRs significantly varied in different regions (Figure 6). The number of SSRs loci in the LSC region ranged from 38 to 65, accounting for 66.67%–85.92% of the total number of SSRs; SSC regions ranged from six to fifteen, accounting for 9.38%–20.00%; Two IR regions ranged from one to ten, accounting for 1.32%–17.54%. Most of the SSR sequences in the chloroplast genomes of 42 Aconitum plants were composed of mononucleotide and dinucleotide repeat units. The number of mononucleotide repeats ranged from 19 in A. volubile to 52 in A. finetianum. These were followed by dinucleotide (10–17), trinucleotide (6–12), tetranucleotide (6–9), pentanucleotide (1–5), and hexanucleotide (1–3).
FIGURE 6. Distribution types and numbers of SSRs in LSC, SSC, IR regions in the chloroplast genomes of 42 Aconitum spp.
We analyzed the repeat units of different types of repeat sequences in ten Aconitum species and found that A/T (96.67%–100% of the total SSRs of each species) and AT/TA (100%) dominated the repeat sequence types of mononucleotide and dinucleotide SSRs, respectively. The repeated trinucleotide SSRs were mainly composed of A/T base combinations (AAT/ATT, 71.43%–85.71%) (Supplementary Table S5). These results were consistent with previous reports that CP SSRs usually consist of short poly-A or poly-T repeats and rarely contain tandem G or C repeats in many plants (Kuang et al., 2011).
Long Repeat Analysis
In this study, the analysis results of long repeats in chloroplast genome sequences of 42 Aconitum species showed 1,467 long repeats in all species, and each species had 17–77 long repeats, respectively (Figure 7). Among them, A. piepunense has the most significant number of long repeats, including 18 forward, 19 palindromic, 35 reverse repeats, and five complement repeats; A. monanthum is the least, including six forward and 11 palindromic repeats. Among the 42 Aconitum species, A. duclouxii, A. barbatum var. hispidum, A. barbatum var. puberulum, A. chiisanense, A. finetianum, A. coreanum, A. piepunense, A. pseudolaeve, A. puchonroenicum, A. quelpaertense, A. reclinatum, A. scaposum var. vaginatum, and A. scaposum which contained 30–39 bp complement repeat sequence.
FIGURE 7. Type and number of long repeat sequences distributed in the chloroplast genomes of 42 Aconitum species.
Of the 42 species, the length range of the repeats was divided into five types (30–39, 40–49, 50–59, 60–69, ≥70 bp), with the highest number of repeats of 30–39 bp in length. All Aconitum species contained 30–39 bp repeat sequences, accounting for 41.18%–85.71% of the total repeat sequences. Among them, 30–39 bp long repeats in A. piepunense, and A. monanthum are the most and least abundant, respectively.
The 60–69 bp repeat sequences were the least. Only in A. nagarum, A. duclouxii, A. ouvrardianum, A. austrokoreense, A. brachypodum, A. chiisanense, A. ciliare, A. contortum, A. flavum, A. hemsleyanum, A. jaluense subsp. jaluense, A. japonicum subsp. napiforme, A. longecassidatum, A. pendulum containing repeats of 60–69 bp in length, accounting for 3.57% , 3.70%, 2.22%, 4.00%, 3.70%, 3.45%, 5.26%, 4.17%, 4.55%, 3.13%, 5.00%, 5.00%, 3.45%, and 3.57% of the total repeat sequence.
IR Boundary Variation Analysis
During the adaptive evolution of plant species, the expansion and contraction of the IR boundary of the chloroplast genome resulted in different levels of sequence replication at the four boundaries (LSC/SSC/IRa/IRb) (Hu and Zhang, 2021). Analysis of IR boundaries of chloroplast genome sequences of 42 Aconitum species showed that the four boundaries, the structure of the chloroplast genome, and the connection between IR regions were very conserved; LSC/IR and SSC/IR boundary distribution genes included rps19, rpl22, rpl2, ycf1, ndhF, trnH, and psbA (Figure 8). The rps19 gene in the chloroplast genomes of 42 Aconitum species showed various degrees of contraction and expansion at the LSC/IRb boundary. And then, the rps19 gene in chloroplast genomes of most Aconitum species exhibited a 3 bp protrusion in the IRb region. As the ycf1 gene straddles SSC/IRa boundary, a pseudogene ψycf1 was generated in the IRb region. No ycf1 pseudogene was found in A. reclinatum, A. scaposum var. vaginatum, A. sinomontanum, A. finetianum, A. barbatum, and A. angustius at the IRb region.
The length of the ndhF gene in A. duclouxii was 2,237 bp, extending across the SSC/IRa boundary to 30 bp in the IRa region, and the other Aconitum species all had ndhF gene of different lengths in the SSC region. These results indicated that some variation occurred in A. duclouxii. The rpl22, trnH, and psbA genes in the chloroplast genomes of the 42 Aconitum species were all located in the LSC region, the rpl2 gene was all located in the IRa region, and these genes were essentially unchanged in length across species, which can be used as a common feature of Aconitum chloroplast genomes for species identification.
Genomic Variation Analysis
In this study, the chloroplast genome of A. vilmorinianum was used as the reference sequence to compare the chloroplast genome in pairs with that of the other 41 species. The Shuffle—LAGAN model of mVISTA was used to map the chloroplast genomes of 42 Aconitum species (Figure 9) and observe the approximate gene composition and sequence. The results showed that the chloroplast genome sequences of 42 Aconitum plants were similar, revealing that the chloroplast genome of Aconitum was highly conserved. The IR region of the Aconitum chloroplast genome was more conserved than LSC and SSC regions, the coding region was more conserved than the non-coding region, and the variation degree of the intergenic spacer region was greater than that of the gene region. The intergenic spacer region trnK-UUU-trnQ-UUG, trnS-GCU-trnG-UCC, atpH-atpI, petN-psbM, rps18-rpl20, rpl16-rps3, trnL-CAA-trnL-CAA, ndhB-trnL-CAA had the largest variation; The protein-coding gene region was the most conserved, and only ycf1 and ycf2 had the largest variation; Four rRNA genes (rrn4.5, rrn5, rrn16, and rrn23) were the most conserved. These highly variable genes and intergenic spacers can be used as molecular markers for 42 Aconitum species phylogenetic and population genetics studies.
FIGURE 9. Global chloroplast genome alignments for 42 Aconitum species using the mVISTA program, with A. vilmorinianum as the reference. Y-axis shows the range of sequence identity (50–100%).
Nucleotide Polymorphism Analysis
The recognition of highly variable sites in the whole chloroplast genome can be used as molecular markers for species identification and phylogenetic studies (Li et al., 2019). This study screened the highly variable sites of 42 Aconitum species by sliding window analysis. Seven hotspots with high variation were screened from the chloroplast genome sequence, including four intergenic spacer regions and three protein-coding gene regions (Figure 10B). They were distributed in the LSC region (trnK-UUU-trnQ-UGG, psbD, ndhJ-ndhK, clpP, and psbH-petB) and SSC region (ycf1 and trnA-UGC-trnI-GAU). The LSC region had the most variation sites, followed by the SSC region. In contrast, the IRb and IRa regions had relatively few variation sites. The distribution of Pi value was significantly lower than in LSC and SSC regions, which also revealed that in the chloroplast genome, the IRs region variation was conservative compared with LSC and SSC regions.
FIGURE 10. Sliding window analyses of the whole plastomes of Aconitum species. X-axis: position of the midpoint of a window, Y-axis: nucleotide diversity (Pi) of each window. (A): Sliding window analysis of ten Aconitum species sequenced in this study. (B): Sliding window analyses the whole plastomes of 42 Aconitum species (comprising ten species for this study).
In addition, we performed chloroplast genome sliding window analyses of the ten Aconitum species included in this study (Figure 10A). We found that the variable sites were more divergent when variation analysis was performed on 42 Aconitum species. Analysis of the variant sites in all the 42 Aconitum species that have been sequenced allows screening of more suitable molecular identification markers for Aconitum species.
Phylogenetic Analysis
This study used Maximum Likelihood (ML) to construct phylogenetic trees based on two data sets (complete chloroplast genome and 84–85 protein-coding genes) from 44 species to determine the phylogenetic position of ten Aconitum species (Figures 11A,B). In addition, NJ tree with complete chloroplast genome sequences of 44 species were constructed. The topological structure of the NJ tree is essentially consistent with that of the ML tree (Supplementary Figure S1). The information on 44 species involved in phylogenetic tree construction is shown in Supplementary Table S6. The ML tree topology constructed based on the two data sets was essentially consistent, and a high support rate was detected on most nodes, but the support rate of the ML trees constructed based on different data sets was different (Figures 11A,B). The clustering of the two phylogenetic trees was the same, which strongly supports that Aconitum mainly consists of Subgen. Aconitum and Subgen. ParAconitum, with support values ≥100%. Two outgroup species (Delphinium grandiflorum and Delphinium anthriscifolium) were independent. Forty-two Aconitum species were clustered into a single large clade with support values ≥60%. Of these, A. stapfianum, A. weixiense, A. vilmorinianum, and A. episcopale were clustered together. A. stapfianum and A. episcopale exhibited a sister relationship with 96% and 100% support values, indicating the close relationship between the four Aconitum species. A. delavayi, A. ramulosum, and A. ouvrardianum clustered together, with 98% and 100% support values. A. delavayi and A. ramulosum exhibited a sister relationship. However, A. stylosum, A. stapfianum, A. weixiense, A. vilmorinianum, and A. episcopale were clustered together in ML phylogenetic tree based on PCGs. Our results will provide genomic resources for the phylogeny of Aconitum species and facilitate future phylogenetic studies and other studies of Ranunculaceae plants.
FIGURE 11. Phylogenetic relationship of 42 Aconitum species inferred from Maximum Likelihood (ML) based on two data sets. (A) ML tree constructed based on complete chloroplast genome. (B) ML tree constructed based on protein-coding genes (PCGs).
Discussion
The chloroplast genomes of Nicotiana tabacum (Shinozaki et al., 1986) and Marchantia polymorpha (Ohyama et al., 1986) were sequenced in 1986. As of November 2021, more than 6,500 chloroplast genomes have been recorded in the GeneBank database. The whole chloroplast genome is simple and easy to obtain (Badenes and Parfitt, 1995). Moreover, the chloroplast genome is more conservative and shorter than the nuclear and mitochondrial genome. It has often been used as a DNA super barcode for the identification, classification, and phylogenetic research of medicinal plants in recent years (Rønsted et al., 2005; Yuan, 2020). Hu et al. used the whole chloroplast genome as DNA super barcode to identify four medicinal plants of Gentian from Yunnan, and the results showed that DNA super barcode has excellent advantages in species identification of Gentian complex groups (Hu and Zhang, 2021). Cui et al. compared the whole genome sequences of 32 Cardamom species, revealing that the chloroplast whole genome can be used to accurately identify Cardamom species (Cui et al., 2019). Guo et al. determined the sister relationship between Schisandra and Illicium by constructing ML and BI phylogenetic trees (Guo et al., 2017). In this study, the chloroplast genomes of ten medicinal plants of Aconitum from Yunnan were sequenced in the next-generation sequencing. The genomic characteristics and composition of ten Aconitum species were analyzed. Moreover, we also performed a comparative analysis with other sequenced species in the GenBank database. Interspecific genetic studies and species identification of Aconitum species will be facilitated by chloroplast genome alignment, comparative analysis, and phylogenetic analysis.
Aconitum species are very diverse, and many species are highly similar in plant morphology and medicinal site roots, which are difficult to identify morphologically. Some authors have attempted to identify some Aconitum species with the DNA barcodes ITS2 and psbA-trnH with some limitations and difficulty in making accurate identifications (Zhu et al., 2020). Park et al. (2017) found highly divergent regions, including trnK-trnQ, ycf1-ndhF, and ycf4-cemA, by comparing the chloroplast genomes of three Aconitum medicinal plants. Developed InDel markers based on the hypervariable region using indel sequences in trnK-trnQ and ycf1-ndhF and confirmed that A. pseudolaeve, A. longecassidatum, and A. barbatum could be clearly distinguished using novel InDel markers AcoTT (Aconitum trnK-trnQ) and AcoYN (Aconitum ycf1-ndhF). Therefore, molecular marker studies of chloroplast genomes of Aconitum species are relatively lacking. The ten Aconitum species used in this study have a long medicinal use history in Yunnan and surrounding areas. They have many applications in folk, such as Yi, Bai, Tibetan, and Naxi, and root tubers are often used to treat rheumatic arthritis pain, fall injury mainly, and so on (Jiang et al., 2016). However, several medicinal herbs are toxic, and mixed-use may endanger human life safety. Therefore, the addition of molecular studies to ten Aconitum species will benefit the identification and characterization of the species. Based on the ten Aconitum species sequenced in this study, genome comparison with other sequenced Aconitum species and sliding window analysis were performed to obtain seven highly divergent regions (trnK-UUU-trnQ-UGG, ndhJ-ndhK, psbH-petB, trnA-UGC-trnI-GAU, psbD, clpP, and ycf1). These highly divergent regions provide helpful information for molecular marker development in plant identification and investigating the phylogenetic relationships of Aconitum.
The chloroplast genome variation of Aconitum species was generally conserved. In the Aconitum chloroplast genome structure, the reverse repeat sequence (IRs) was separated by LSC and SSC (Mower and Vickrey, 2018). In different plants, the IRs region was highly conserved, the length ranging from 20,000 bp to 25,000 bp (Stewart A et al., 2019). In addition, intron sequences in chloroplast genomes were highly conserved. In this study, the IRs region was highly conservative compared to the LSC/SSC region, and the intron region was relatively conserved compared to the gene spacer region. The results of this study were consistent with the results of other scholars’ studies on the chloroplast genome of Aconitum species (Kong et al., 2017). It is also revealed that more highly variable segments could be mined from the chloroplast genome structure except the IRs and intron gene regions for inter-species identification.
Codon usage preference (CUB) is an important evolutionary feature in genomes and has been widely documented in many organisms, from prokaryotes to eukaryotes (Sharp et al., 1998). Some scholars analyzed the codons of chloroplast genomes of 19 Panicum species by bias and cluster analysis and revealed the overall evolutionary differences between chloroplast genomes of Panicum (Li et al., 2021). Wang et al. explored the codon usage preference of chloroplasts in the Paris polyphylla var. yunnanensis, providing a reference for heterologous expression and gene function in the P. polyphylla var. yunnanensis (Wang et al., 2021). Relative synonymous codon usage (RSCU) is an essential metric for determining codon usage preferences, and RSCU values are limited to 1; If RSCU >1, indicating a high frequency of codon usage; If RSCU <1, this codon is used less frequently; Or RSCU = 1, the codon usage is unbiased (Sharp et al., 1986). In the present study, the total GC content of the ten Aconitum chloroplast genomes was 38.61%. Of the 31 high-frequency codons with RSCU >1, 13 ended with A, 16 ended with U, one ended with G, and one ended with G. Our results showed that the third base of codon tended to end with A/U and the composition of bases at different positions of codons, which was consistent with the codon bias of other medicinal plants such as Delphinium grandiflorum (Duan et al., 2021). Some scholars have suggested that the codon usage pattern in plant genomes is closer to that of humans and higher eukaryotes than in other unicellular organisms, and this similarity is due to the overall preference of the third base of the codon for G+C content (Murray et al., 1988).
Pseudogenes play an essential role in gene expression regulation and genome evolution (Liu et al., 2010). Pseudogenes ycf15, rps16, infA, rps19, and ycf1 were commonly found in the Aconitum genome (Kong et al., 2017; Meng et al., 2018). However, only two pseudogenes of rps19 and ycf1 were found in this study, and the ycf1 was located in the IRb region. Some scholars have proved that their location caused pseudogenes rps19 and ycf1 at the boundary of each region of the chloroplast genome, and a repeat occurs at the boundary of IRs and SSC/LSC, leading to the incomplete gene in the IRs region, which was called boundary effect (Li et al., 2018). Pseudogene rps19 was found only in A. ouvrardianum, A. delavayi, and A. ramulosum among the ten Aconitum species in this study, indicating specific differences in pseudogene phenomenon among species.
SSRs mainly distributed in the non-coding region, and the degree of sequence variation was higher than in the coding region (Powell et al., 1995). In addition, SSRs could be used to study conservation genetics of endangered plant species, molecular identification, and genetic relationships among related species (Clark et al., 2000; Huang et al., 2015). This study obtained 48–79 SSRs loci from 42 Aconitum species. Most of these SSRs were located in the LSC region, followed by the SSC and IR regions. The most abundant were mononucleotide repeats, which contributed to A/T richness. These results are consistent with most reported angiosperms (Nunzio et al., 2018; Gu et al., 2019; Tang et al., 2021). Analysis of the SSRs identified in the chloroplast genome of Aconitum showed some differences in the number of SSRs among the 42 species. Long repeats play an essential role in the complete chloroplast genome’s variation, expansion, and rearrangement (Asaf et al., 2017). We also identified 17–77 long repeats in the complete chloroplast genomes of 42 Aconitum species. The results suggest that long repeats were also divergent among different lineages. The SSRs and long repeats of the 42 Aconitum chloroplast genome exhibited abundant variation. They thus might help detect polymorphisms at the intraspecific level and develop molecular markers for Aconitum species for future evolutionary and genetic diversity studies.
We constructed the phylogenetic trees of 42 Aconitum species based on two data sets. There were no apparent conflicts between the phylogenetic trees constructed by different datasets, but the most support values of the branches based on the complete CP genomes dataset were higher than those based on the PCGs dataset. In addition, the sister relationships of A. contortum and A. stylosum changed in the ML phylogenetic tree constructed based on PCGs, and the A. stylosum was clustered with A. stapfianum, A. weixiense, A. vilmorinianum, and A. episcopale into one branch. The results suggest that determining the phylogenetic relationships of Aconitum species based on chloroplast whole genomes is more efficient than using PCGs alone. The ML phylogenetic trees based on the whole chloroplast genome and protein-coding gene sequences proved that Aconitum mainly consists of Subgen. Aconitum and Subgen. ParAconitum, the results were similar to phylogenetic studies on Aconitum by other scholars. (Meng et al., 2018; Kim et al., 2019). The genus Aconitum in Flora of China was classified into the Subgen. Aconitum, Subgen. Gymnaconitum, Subgen. ParAconitum, and the phylogenetic tree support subgenera classification in Aconitum spp. (Li and Kadota, 2001). In the phylogenetic tree constructed based on the whole chloroplast genome, the related species of Aconitum are clustered into different branches with Ser. Ambigua, Ser. Brachypoda, Ser. Bullatifolia, Ser. Stylosa, Ser. Tangutica, Ser. Volubilia. The results revealed that the related species of Ser. Ambigua (A. piepunense, A. delavayi, A. ramulosum, A. ouvrardianum) and Ser. Volubilia (A. hemsleyanum, A. weixiense, A. vilmorinianum, A. episcopale, A. stapfianum) are clustered into sister branches and have a close genetic relationship. Ser. Brachypoda (A. brachypodum, A. pendulum, A. flavum), Ser. Bullatifolia (A. nagarum, A. duclouxii), and Ser. Stylosa (A. stylosum, A. contortum) were clustered into a single clade with 97% support values, where Ser. Bullatifolia and Ser. Stylosa exhibited close phylogenetic relationships. It is worth noting that A. habaense, classified as the Ser. Volubilia, was clustered in the branch of the Ser. Ambigua. A. tanguticum was singly clustered as a branch of the Ser. Tangutica. Ten Aconitum species were sequenced in current were divided into four groups: Ser. Volubilia, Ser. Stylosa, Ser. Ambigua, and Ser. Bullatifolia. For the first time, a phylogenetic tree was constructed for A. stylosum, A. nagarum, A. duclouxii, A. stapfianum, A. weixiense, A. ouvrardianum, and A. delavayi with other Aconitum species. A. delavayi and A. ramulosum were sister specie; A. episcopale and A. stapfianum were sister species; A. delavayi and A. ramulosum as sister species. The complete chloroplast genome resolved the more complex phylogeny among Aconitum species, and it also shows a high support rate, indicating that the way to employ the chloroplast genome to explore the phylogenetic relationships among Aconitum species is effective.
Conclusion
This study reported that the CP genomes from ten Aconitum species ranged from 155,475 (A. stylosum) to 155,921 bp (A. vilmoinianum), and the structure and composition of the CP genomes are highly similar. The CP genomes genes from 129 (A. vilmoinianum) to 132 (A. ramulosum), including 83–85 protein-coding genes, 37 tRNA genes, eight rRNA genes, and two pseudogenes. The RSCU values of 31 codons were greater than one, RSCU values of 31 codons were less than one, RSCU values of two codons were one, and the high-frequency codons preferred to end with the A/T base. 48–79 SSRs and 17–77 long repeats were identified in the chloroplast genome of 42 Aconitum species. The variation of IRs was more conservative than that of LSC and SSC. Four highly variable intergenic spacers (trnK-UUU-trnQ-UGG, ndhJ-ndhK, psbH-petB, and trnA-UGC-trnI-GAU) and three gene regions (psbD, clpP, and ycf1) can be used as appropriate DNA barcodes for species identification and genetic diversity studies of Aconitum species. Among the ten Aconitum species, A. nagarum had the closest relationship with A. duclouxii, A. stapfianum had the closest relationship with A. episcopale, A. delavayi had the closest relationship with A. ramulosum, respectively. This study enriched the complete chloroplast genome resources of Aconitum species and could provide scientific evidence for developing molecular markers, species identification, and phylogeny of Aconitum species.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Author Contributions
CX and JL conceived and designed the study. CX acquired the founding. CX collected samples and determined the species. CX, MW and YG conducted the experiment(s). CX and YG performed the genome assembly and analysis of the data. CX wrote the manuscript. JL supervised the manuscript. All authors have read and approved the final manuscript.
Funding
This research was funded by the Major Projects of Science and Technology Plan of Dali state (No. D2019NA03) and LJ Expert Workstation of Yunnan Province (No. 202005AF150013).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We sincerely thank Jun Qian of Biozeron Biotech Co. Ltd., Shanghai, China, for the assistance provided with this study. We would like to thank ManuscriptEdit (https://www.manuscriptedit.com) for English language editing (ORDER11439).
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.878182/full#supplementary-material
References
Amiryousefi, A., Hyvönen, J., and Poczai, P. (2018). IRscope: An Online Program to Visualize the Junction Sites of Chloroplast Genomes. Bioinformatics 34, 3030–3031. doi:10.1093/bioinformatics/bty220/4961430
Asaf, S., Khan, A. L., Khan, M. A., Waqas, M., Kang, S.-M., Yun, B.-W., et al. (2017). Chloroplast Genomes of Arabidopsis Halleri Ssp. Gemmifera and Arabidopsis Lyrata Ssp. Petraea: Structures and Comparative Analysis. Sci. Rep. 7, 7556. doi:10.1038/s41598-017-07891-5
Badenes, M. L., and Parfitt, D. E. (1995). Phylogenetic Relationships of Cultivated Prunus Species from an Analysis of Chloroplast DNA Variation. Theor. Appl. Genet. 90, 1035–1041. doi:10.1007/BF00222918
Beier, S., Thiel, T., Münch, T., Scholz, U., and Mascher, M. (2017). MISA-web: a Web Server for Microsatellite Prediction. Bioinformatics 33, 2583–2585. doi:10.1093/bioinformatics/btx198
Clark, C. M., Wentworth, T. R., and O'Malley, D. M. (2000). Genetic Discontinuity Revealed by Chloroplast Microsatellites in Eastern North AmericanAbies(Pinaceae). Am. J. Bot. 87, 774–782. doi:10.2307/2656885
Cui, X. Y. (2020). Structural Analysis of Complete Chloroplast Genome of Medicinal and Edible Oriiginal Plants of Amomum, Chinese Wolfberry, Hawthorn and Ginger. Beijing: Chinese Academy of Medical Sciences, Peking Union Medical College.
Cui, Y., Chen, X., Nie, L., Sun, W., Hu, H., Lin, Y., et al. (2019). Comparison and Phylogenetic Analysis of Chloroplast Genomes of Three Medicinal and Edible Amomum Species. Ijms 20, 4040. doi:10.3390/ijms20164040
D’Agostino, N., Tamburino, R., Cantarella, C., De Carluccio, V., Sannino, L., Cozzolino, S., et al. (2018). The Complete Plastome Sequences of Eleven Capsicum Genotypes: Insights into DNA Variation and Molecular Evolution. Genes. 9, 503. doi:10.3390/genes9100503
Dierckxsens, N., Mardulyn, P., and Smits, G. (2017). NOVOPlasty:de novoassembly of organelle genomes from whole genome data. Nucleic Acids Res. 45, gkw955. doi:10.1093/nar/gkw955
Duan, H., Zhang, Q., Wang, C., Li, F., Tian, F., Lu, Y., et al. (2021). Analysis of Codon Usage Patterns of the Chloroplast Genome in Delphinium Grandiflorum L. Reveals a Preference for AT-Ending Codons as a Result of Major Selection Constraints. PeerJ 9, e10787. doi:10.7717/peerj.10787
Frazer, K. A., Pachter, L., Poliakov, A., Rubin, E. M., and Dubchak, I. (2004). VISTA: Computational Tools for Comparative Genomics. Nucleic Acids Res. 32, W273–W279. doi:10.1093/nar/gkh458
Gu, C., Ma, L., Wu, Z., Chen, K., and Wang, Y. (2019). Comparative Analyses of Chloroplast Genomes from 22 Lythraceae Species: Inferences for Phylogenetic Relationships and Genome Evolution within Myrtales. Bmc Plant Biol. 19, 1–19. doi:10.1186/s12870-019-1870-3
Guo, H., Liu, J., Luo, L., Wei, X., Zhang, J., Qi, Y., et al. (2017). Complete Chloroplast Genome Sequences of Schisandra Chinensis: Genome Structure, Comparative Analysis, and Phylogenetic Relationship of Basal Angiosperms. Sci. China Life Sci. 60, 1286–1290. doi:10.1007/s11427-017-9098-5
He, J., Wong, K.-L., Shaw, P.-C., Wang, H., and Li, D.-Z. (2010). Identification of the Medicinal Plants inAconitumL. By DNA Barcoding Technique. Planta Med. 76, 1622–1628. doi:10.1055/s-0029-1240967
Hu, H. S., and Zhang, D. Q. (2021). DNA Super-barcoding of Several Medicinal Species in Gentiana from Yunnan Province. Zhongguo Zhong Yao Za Zhi 46, 5260–5269. doi:10.19540/j.cnki.cjcmm.20210720.103
Huang, J., Yang, X., Zhang, C., Yin, X., Liu, S., and Li, X. (2015). Development of Chloroplast Microsatellite Markers and Analysis of Chloroplast Diversity in Chinese Jujube (Ziziphus Jujuba Mill.) and Wild Jujube (Ziziphus Acidojujuba Mill.). Plos One 10, e0134519. doi:10.1371/journal.pone.0134519
Jiang, B., Zhao, G., and Zhang, D. Q. (2016). An Illustrated Book of Bainationality Medicinal Plants. Beijing: Chinese Medicine Press, 156.
Jiao, Y., Jia, H.-m., Li, X.-w., Chai, M.-l., Jia, H.-j., Chen, Z., et al. (2012). Development of Simple Sequence Repeat (SSR) Markers from a Genome Survey of Chinese Bayberry (Myrica Rubra). Bmc Genomics 13, 201. doi:10.1186/1471-2164-13-201
Katoh, K., and Standley, D. M. (2013). MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol. Biol. Evol. 30, 772–780. doi:10.1093/molbev/mst010
Kim, Y., Yi, J.-S., Min, J., Xi, H., Kim, D. Y., Son, J., et al. (2019). The Complete Chloroplast Genome of Aconitum Coreanum (H. Lév.) Rapaics (Ranunculaceae). Mitochondrial DNA Part B 4, 3404–3406. doi:10.1080/23802359.2019.1674213
Kong, H., Liu, W., Yao, G., and Gong, W. (2017). A Comparison of Chloroplast Genome Sequences in Aconitum (Ranunculaceae): a Traditional Herbal Medicinal Genus. PeerJ 5, e4018. doi:10.7717/peerj.4018
Kuang, D.-Y., Wu, H., Wang, Y.-L., Gao, L.-M., Zhang, S.-Z., and Lu, L. (2011). Complete Chloroplast Genome Sequence of Magnolia Kwangsiensis (Magnoliaceae): Implication for DNA Barcoding and Population Genetics. Genome 54, 663–673. doi:10.1139/g11-026
Kumar, S., Stecher, G., Li, M., Knyaz, C., and Tamura, K. (2018). MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 35, 1547–1549. doi:10.1093/molbev/msy096/4990887
Kurtz, S., Choudhuri, J. V., Ohlebusch, E., Schleiermacher, C., Stoye, J., and Giegerich, R. (2001). REPuter: the Manifold Applications of Repeat Analysis on a Genomic Scale. Nucleic Acids Res. 29, 4633–4642. doi:10.1093/nar/29.22.4633
Li, G., Zhang, L., and Xue, P. (2021). Codon Usage Pattern and Genetic Diversity in Chloroplast Genomes of Panicum Species. Gene 802, 145866. (prepublish). doi:10.1016/j.gene.2021.145866
Li, L. Q., and Kadota, Y. (2001). “Aconitum Linnaeus, Sp,” in Flora of China (Beijing: Science Press), 149–222.
Li, Q. L., Yan, N., Song, Q., and Guo, Z. J. (2018). Complete Chloroplast Genome Sequence and Characteristics Analysis of Morus Multicaulis. Bull. Bot. 53, 94–103. doi:10.11983/CBB16247
Li, X. H., Wang, M. M., and Wang, Z. K. (1995). Yi Nationality Medicine. Kunming: Yunnan national press, 566.
Li, X., Tan, W., Sun, J., Du, J., Zheng, C., Tian, X., et al. (2019). Comparison of Four Complete Chloroplast Genomes of Medicinal and Ornamental Meconopsis Species: Genome Organization and Species Discrimination. Sci. Rep. 9, 1–12. doi:10.1038/s41598-019-47008-8
Liu, G. Q., Bai, Y. B. L. G., and Xing, Y. Q. (2010). Advances in Pseudogene Research. Prog. Biochem. Biophys. 37, 1165–1174. doi:10.1016/j.mechrescom.2009.09.001
Lohse, M., Drechsel, O., and Bock, R. (2007). OrganellarGenomeDRAW (OGDRAW): a Tool for the Easy Generation of High-Quality Custom Graphical Maps of Plastid and Mitochondrial Genomes. Curr. Genet. 52, 267–274. doi:10.1007/s00294-007-0161-y
Meng, J., Li, X., Li, H., Yang, J., Wang, H., and He, J. (2018). Comparative Analysis of the Complete Chloroplast Genomes of Four Aconitum Medicinal Species. Molecules 23, 1015. doi:10.3390/molecules23051015
Moore, M. J., Bell, C. D., Soltis, P. S., and Soltis, D. E. (2007). Using Plastid Genome-Scale Data to Resolve Enigmatic Relationships Among Basal Angiosperms. Proc. Natl. Acad. Sci. U.S.A. 104, 19363–19368. doi:10.1073/PNAS.0708072104
Morley, S. A., Peralta-Castro, A., Brieba, L. G., Miller, J., Ong, K. L., Ridge, P. G., et al. (2019). Arabidopsis thaliana Organelles Mimic the T7 Phage DNA Replisome with Specific Interactions between Twinkle Protein and DNA Polymerases Pol1A and Pol1B. Bmc Plant Biol. 19, 1–16. doi:10.1186/s12870-019-1854-3
Mower, J. P., and Vickrey, T. L. (2018). Structural Diversity Among Plastid Genomes of Land Plants. Adv. Bot. Res. 85, 263–292. doi:10.1016/bs.abr.2017.11.013
Murray, E. E., Lotzer, J., and Eberle, M. (1988). Codon Usage in Plant Genes. Nucleic Acids Res. 17, 477–498. doi:10.1093/nar/17.2.477
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A., and Minh, B. Q. (2015). IQ-TREE: a Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol. Biol. Evol. 32, 268–274. doi:10.1093/molbev/msu300
Ohyama, K., Fukuzawa, H., Kohchi, T., Shirai, H., Sano, T., Sano, S., et al. (1986). Chloroplast Gene Organization Deduced from Complete Sequence of Liverwort Marchantia Polymorpha Chloroplast DNA. Nature 322, 572–574. doi:10.1038/322572a0
Park, I., Yang, S., Choi, G., Kim, W., and Moon, B. (20172012). The Complete Chloroplast Genome Sequences of Aconitum Pseudolaeve and Aconitum Longecassidatum, and Development of Molecular Markers for Distinguishing Species in the Aconitum Subgenus Lycoctonum. Molecules 22, 2012. doi:10.3390/molecules22112012
Paul, C. (2015). Base Calling, Read Mapping, and Coverage Analysis. In Clinical Genomics. Editors K. Shashikant, and P. John (New York, NY: Academic Press), 91–107.
Powell, W., Morgante, M., Mcdevitt, R., Vendramin, G. G., and Rafalski, J. A. (1995). Polymorphic Simple Sequence Repeat Regions in Chloroplast Genomes: Applications to the Population Genetics of Pines. Proc. Natl. Acad. Sci. U.S.A. 92, 7759–7763. doi:10.1073/pnas.92.17.7759
Qu, X.-J., Moore, M. J., Li, D.-Z., and Yi, T.-S. (2019). PGA: a Software Package for Rapid, Accurate, and Flexible Batch Annotation of Plastomes. Plant Methods 15, 1–12. doi:10.1186/s13007-019-0435-7
Rønsted, N., Law, S., Thornton, H., Fay, M. F., and Chase, M. W. (2005). Molecular Phylogenetic Evidence for the Monophyly of Fritillaria and Lilium (Liliaceae; Liliales) and the Infrageneric Classification of Fritillaria. Mol. Phylogenetics Evol. 35, 509–527. doi:10.1016/j.ympev.2004.12.023
Rozas, J., Ferrer-Mata, A., Sánchez-DelBarrio, J. C., Guirao-Rico, S., Librado, P., Ramos-Onsins, S. E., et al. (2017). DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets. Mol. Biol. Evol. 34, 3299–3302. doi:10.1093/molbev/msx248
Sharp, P. M., Cowe, E., Higgins, D. G., Shields, D. C., Wolfe, K. H., and Wright, F. (1998). Codon Usage Patterns in Escherichia coli, Bacillus Subtiiis, Saccharomyces Certvisiae, Schizjosaccharomyces Pombt, Drosophila Mclanogaster and Homo sapiens; a Review of the Considerable Within-Species Diversity. Nucleic Acids Res. 16, 8207–8211.
Sharp, P. M., Tuohy, T. M. F., and Mosurski, K. R. (1986). Codon Usage in Yeast: Cluster Analysis Clearly Differentiates Highly and Lowly Expressed Genes. Nucl. Acids Res. 14, 5125–5143. doi:10.1093/nar/14.13.5125
Shinozaki, K., Ohme, M., Tanaka, M., Wakasugi, T., Hayshida, N., Matsubayasha, T., et al. (1986). The Complete Nucleotide Sequence of the Tobacco Chloroplast Genome. Plant Mol. Biol. Rep. 4, 111–148. doi:10.1007/BF02669253
Somaratne, Y., Guan, D.-L., Wang, W.-Q., Zhao, L., and Xu, S.-Q. (2020). The Complete Chloroplast Genomes of Two Lespedeza Species: Insights into Codon Usage Bias, RNA Editing Sites, and Phylogenetic Relationships in Desmodieae (Fabaceae: Papilionoideae). Plants 9, 51. doi:10.3390/plants9010051
Tang, D., Wei, F., Cai, Z., Wei, Y., Khan, A., Miao, J., et al. (2021). Analysis of Codon Usage Bias and Evolution in the Chloroplast Genome of Mesona Chinensis Benth. Dev. Genes. Evol. 231, 1–9. doi:10.1007/s00427-020-00670-9
Wang, Y. F., Jiang, Y., Yang, C. J., Wang, J., Xu, Z. C., and Duan, B. Z. (2021). Analysis of Chloroplast Genome Characteristic and Codon Usage Bias in Paris Polyphylla Var. Yunnanensis. Mol. Plant Breed. 19, 7448–7458. doi:10.13271/j.mpb.019.007448
Wang, Z., Wang, G., Cai, Q., Jiang, Y., Wang, C., Xia, H., et al. (2021). Genomewide Comparative Analysis of Codon Usage Bias in Three Sequenced Jatropha Curcas. J. Genet. 100, 20. doi:10.1007/s12041-021-01271-9
Yang, C. H. (2012). A Review of Medicinal Plant Resources of Aconitum. Shanghai, China: Science & Technology Vision, 417–423. doi:10.19694/j.cnki.issn2095-2457.2012.29.328
Yang, L. G., Xia, W. J., Tang, M. Y., and Cui, R. (2016). Overview of Chemical Composition and Pharmacological Effects of Aconitum Brachypodum Diels. J. Yunnan Univ. Traditional Chin. Med. 39, 95–97+102. doi:10.19288/j.cnki.issn.1000-2723.2016.06.023
Yuan, J. (2020). Sequence Analysis and Phylogenetic Relationships of Chloroplast Genome of Coptis. Chongqing: Southwest University.
Yurina, N., and Odintsova, M. S. (1998). Comparative Structural Organization of Plant Chloroplast and Mitochondrial Genomes. Russ. J. Genet.+. 34, 5–22.
Keywords: Aconitum, medicinal plants, chloroplast genome, comparative analysis, phylogenomics
Citation: Xia C, Wang M, Guan Y and Li J (2022) Comparative Analysis of the Chloroplast Genome for Aconitum Species: Genome Structure and Phylogenetic Relationships. Front. Genet. 13:878182. doi: 10.3389/fgene.2022.878182
Received: 18 February 2022; Accepted: 27 April 2022;
Published: 31 May 2022.
Edited by:
Sunil Kumar Sahu, Beijing Genomics Institute (BGI), ChinaReviewed by:
Gang Yao, South China Agricultural University, ChinaJian-Li Zhao, Yunnan University, China
Copyright © 2022 Xia, Wang, Guan and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Conglong Xia, bG9uZzc0ODRAMTI2LmNvbQ==; Jian Li, amlhbmxpQGVjdXN0LmVkdS5jbg==