- 1CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- 2Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- 3School of Life Sciences, Yunnan University, Kunming, China
- 4School of Traditional Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou, China
- 5Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
Ultra-barcoding is a technique using whole plastomes and nuclear ribosomal DNA (nrDNA) sequences for plant species identification. Paris yunnanensis is a medicinal plant of great economic importance for the pharmaceutical industry. However, the alpha taxonomy of P. yunnanensis is still uncertain, hindering effective conservation and management of the germplasm. To resolve long-standing taxonomic disputes regarding this species, we newly generated the complete plastomes and nrDNA sequences from 22 P. yunnanensis accessions. Ultra-barcoding analyses suggest that P. yunnanensis as currently circumscribed is made up of two distinct genetic lineages, corresponding to the two phenotypes (“typical” and “high stem” form) identified early in our study. With distinct morphologies and distribution, the “high stem” form should be recognized as a previously unrecognized species; here it is described as a new species, P. liiana sp. nov. Moreover, the ultra-barcoding data do not support treatment of P. yunnanensis as a conspecific variety under Paris polyphylla. Our study represents a guiding practical application of ultra-barcoding for discovery of cryptic species in taxonomically challenging plant taxa. The findings highlight the great potential of ultra-barcoding as an effective tool for resolving perplexing problems in plant taxonomy.
Introduction
DNA barcoding involves the standardized use of one or a few DNA regions for identification and discrimination of species (Hebert et al., 2003; Hollingsworth, 2011; Hollingsworth et al., 2016), as well as the discovery of cryptic or novel species (Hebert et al., 2003; Bell et al., 2012). Although the mitochondrial gene cytochrome oxidase 1 (COI) performs well as a standard animal DNA barcode (Ward et al., 2005; Hajibabaei et al., 2006; Pons et al., 2006), reliable species discrimination based on standard DNA barcodes (i.e., rbcL, matK, trnH-psbA, and ITS) remains problematic in plants (Hollingsworth et al., 2009, 2016; Hollingsworth, 2011; Li et al., 2011; Coissac et al., 2016). With the advent of next-generation DNA sequencing (NGS) technologies, the concept of DNA barcoding for plant species has been extended from one or several sequence loci to large amounts of genomic data (Li et al., 2015; Coissac et al., 2016; Hollingsworth et al., 2016). Complete plastid genomes (plastomes) and entire nuclear ribosomal DNA (nrDNA) sequences harbor many more sequence variations, making them far more sensitive and effective than standard DNA barcodes, especially among very closely related taxa (Nock et al., 2011; Kane et al., 2012; Ruhsam et al., 2015; Ji et al., 2019a; Zhu et al., 2019; Li et al., 2020). The extension of standard DNA barcodes to whole plastomes and nrDNA sequences has been referred to as “ultra-barcoding” (Kane et al., 2012). However, practical application of this technique for discovery of cryptic or novel species in taxonomically difficult plant taxa is still absent from literature.
Paris yunnanensis Franch. (Melanthiaceae), a perennial rhizomatous herb distributed in southwestern China and northern Myanmar (Li, 1998), has great economic value. Dried rhizome of this plant, bearing the pharmaceutical name “Rhizoma Paridis,” is a traditional medicine in China with hemostatic, anti-inflammatory, analgesic, antipyretic, and other therapeutic properties (China Pharmacopoeia Commission, 2015). Phytochemical investigations revealed steroidal saponins as the main components responsible for the bioactivities of this plant (Yang Z. Y. et al., 2019). There are about 70 commercial drugs and health products that use Rhizoma Paridis as raw materials, including “Yunnan Baiyao,” a famous Chinese medicine, and “Gongxuening Capsule,” a gynecological hemostatic based on extractions of Rhizoma Paridis. The value of these pharmaceutical products is estimated to be more than 10 billion CNY (∼1.5 billion USD) per year (Huang et al., 2012).
Although Paris is morphologically distinct from other angiosperm genera, the rhizome, leaf, flower, stamen, ovary, fruit, and seed morphologies, which have been widely used for classification, are highly divergent among species (Hara, 1969; Takhtajan, 1983; Li, 1998; Ji et al., 2006, 2019b). Since the description of P. yunnanensis by Franchet (1888), its taxonomic rank has been in dispute. Handel-Mazzetti (1936) proposed that the morphologies of P. yunnanensis are largely homologous to those of Paris polyphylla, and thus reduced it to a conspecific variety (P. polyphylla var. yunnanensis) of the latter species; this treatment was followed by Hara (1969) and Li (1998). However, Takhtajan (1983) argued that P. yunnanensis is morphologically different from P. polyphylla and should be treated as a separate species. We detected a number of morphological differences among P. yunnanensis accessions during the early stage of our study, based on which we identified two phenotypes (“typical” form and “high stem” form, Figure 1). The morphological variation within P. yunnanensis suggests that the taxonomic delimitation of this economically important plant needs to be re-assessed. Given the great economic importance of P. yunnanensis, satisfactory resolution of these taxonomic issues will be conductive to exploration and protection of its germplasm.
Figure 1. Comparison of morphological features between “typical” Paris yunnanensis (P. yunnanensis s.s.) and “high stem” form (P. liiana sp. nov.). (A) aerial shoot. (B) leaf shape and size. (C) flower. (D) sepals. (E) young fruit. (F) mature fruit.
Genome skimming, involving a relatively low coverage shotgun sequencing of genomic DNA, is an efficient and cost-effective approach to recover highly repetitive genome components such as nrDNA or organelle genomes (Straub et al., 2012). The genome skimming approach using NGS can recover plastomes, nrDNA clusters and sometimes even the complete nuclear genome at relatively low sequencing depth, and these sequence data can be both backwards-compatible with the standard plant barcodes, and forward-compatible with whole genome sequencing (Straub et al., 2012; Coissac et al., 2016; Hollingsworth et al., 2016). Because of the significant advantages, this approach has great promise for extending the concept of DNA barcoding from one or a few DNA regions to genomes (Hollingsworth et al., 2016). Recently, genome skimming has been employed to genomic data for species discrimination in several taxonomically challenging plant groups, for instance, Theobroma (Kane et al., 2012), Araucaria (Ruhsam et al., 2015), Diospyros (Turner et al., 2016), and Panax (Ji et al., 2019a).
In this study, we sampled 22 P. yunnanensis individuals, representing the two phenotypes identified, and generated complete plastomes and nrDNA sequences for these individuals using a genome skimming approach. Based on ultra-barcoding analyses, we aimed to elucidate (1) whether P. yunnanensis is related closely enough to P. polyphylla to warrant taxonomic treatment as conspecific varieties, and (2) whether the two phenotypes within P. yunnanensis represent distinct taxa.
Materials and Methods
Plant Materials and Low-Coverage Shotgun Sequencing of Genomes
A total of 22 individuals of P. yunnanensis as currently circumscribed (16 accessions of “typical” form and 6 accessions of “high stem” form) were collected from the wild according to records of herbarium specimens, approximately covering the geographic range of the species (Table 1). The vouchers were identified by Dr. Yunheng Ji and deposited at the herbarium of the Kunming Institute of Botany, Chinese Academy of Sciences (KUN).
Genomic DNA was extracted from ∼20 mg silica-dried leaf tissues, using the cetyltrimethylammonium bromide (CTAB) method (Doyle and Doyle, 1987). Approximately 5 μg purified genomic DNA was sheared to fragments of 300–500 bp by sonication. Paired-end libraries with an average insert size of 350 bp were prepared using a TruSeq DNA Sample Prep Kit (Illumina, Inc., United States), according to the manufacturer’s instructions. The libraries were paired-end sequenced on the Illumina HiSeq 2000 platform. Raw reads were filtered to remove adaptors and low-quality reads using the NGS QC Toolkit (Patel and Jain, 2012), setting the cutoff value for percentage read length to 80 and Phred quality score to 30.
Recovery and Annotation of Plastomes
High-quality reads were assembled to generate complete plastomes with GetOrganelle pipeline developed by Jin et al. (2018). The plastome sequence of P. yunnanensis (GenBank accession: MN125587) was used as a reference for plastome assembly. All of the plastid-like reads were assembled into contigs by SPAdes v3.10.1 (Bankevich et al., 2012) with the k-mer defined as 75, 85, 95, and 105. A customized python script (Jin et al., 2018), which uses BLAST and a built-in library to search the plastid-like contig, was employed to connect verified contigs into plastomes in Bowtie 2 (Langmead and Salzberg, 2012), with its default parameters.
The assembled plastomes were annotated using the Dual Organellar Genome Annotator database (Wyman et al., 2004). Start and stop codons and intron/exon boundaries for protein-coding genes were checked manually. Annotated tRNA genes were further verified using tRNAscan-SE 1.21 (Schattner et al., 2005) with default parameters. Gene content and arrangement of P. yunnanensis plastomes were visualized and compared using MUMmer 3.0 (Kurtz et al., 2004). Boundaries of the large single copy (LSC), inverted repeat (IR), and small single copy (SSC) regions in each plastome were compared using Geneious v10.2.3 (Kearse et al., 2012).
Recovery of rDNA Sequences
Before the assembly of nrDNA clusters, all plastid-like reads were excluded from the Illumina data. The complete nrDNA sequence (including 26S, 18S, and 5.8S ribosomal RNA genes and ITS regions) of P. yunnanensis (MN174873) was used as a reference. Contigs mapping to reference nrDNA sequences were assembled using the processes described above. Nuclear ribosomal RNA genes and their boundaries with ITS regions were annotated and defined by comparison with the reference sequence using Geneious v10.2.3 (Kearse et al., 2012).
Data Analysis
The efficiency of the complete plastomes and nrDNA sequences for species identification were investigated using tree-based methods. Based on the tree topologies, species-level monophyly of P. yunnanensis as currently circumscribed and its relationships with congeneric species were examined. In addition to the 22 P. yunnanensis plastomes and nrDNA sequences newly sequenced in this study (Table 1), 31 plastomes and nrDNA sequences determined from our previous studies (Huang et al., 2016; Ji et al., 2019a; Yang L. F. et al., 2019) and representing species in the genus Paris were included in the phylogenetic analyses. Plastome and nrDNA sequences were respectively aligned using the program MAFFT (Katoh and Standley, 2013) with manual adjustment where necessary. Alignment of sequences are deposited in the online database Treebase1.
Phylogenetic analysis of each dataset was performed using maximum likelihood (ML) and Bayesian inference (BI). The complete plastome (MN125577) and nrDNA (MN174897) sequences of Trillium tschonoskii were used as the outgroup to root the plastome and nuclear trees, respectively. Conflict between plastid and nuclear datasets was statistically tested using the incongruence length difference (ILD) test (Farris et al., 1994) implemented in PAUP∗ 4.0b10 (Swofford, 2002) for 1,000 replicates.
The best-fit substitution model for plastomes (GTR + G) and nrDNA (GTR + G + I) was determined using MODELTEST 3.7 (Posada and Crandall, 1998) with the Akaike information criterion (Posada and Buckley, 2004). ML analyses were performed in the software RAxML-HPC BlackBox v8.1.24 (Stamatakis, 2006). The best-scoring ML tree for each dataset was generated with 1,000 replicates to provide bootstrap percentage (BP) support values. BI analyses were performed using MrBayes v3.2 (Ronquist and Huelsenbeck, 2003). Two independent Markov Chain Monte Carlo (MCMC) simulations were run with 1,000,000 generations, sampling every 100 generations. An initial 25% of the sampled trees were discarded as burn-in. Posterior probability (PP) values were computed from the remaining trees. Stationarity was considered to be reached when the average standard deviation of the split frequencies was <0.01.
Results
Illumina Sequencing
Illumina sequencing generated between 9,448,962 and 25,342,424 paired-end clean reads per sample. Of those, 158,172–1,067,241 and 8,299–22,466 reads were mapped to the reference plastome and ribosomal DNA sequences, respectively (Supplementary Table S1). De novo assembly based on these data covered the entire plastome and nrDNA for all samples, with average coverage ranging from 44,917 to 1,011 and 211,637 to 572,407 times, respectively. The sequences newly generated in this study were deposited in NCBI GenBank, and their accession numbers are shown in Table 1.
Phylogenies Based on nrDNA Sequences
Assembly of nrDNA sequences entirely covered the 18S rDNA, ITS1, 5.8S rDNA, ITS2, and 26S rDNA clusters. Sequence lengths for the “typical” P. yunnanensis (16 accessions) and “high stem” phenotype (6 accessions) were 5,856 and 5,857 bp, respectively. Phylogenetic trees based on maximum likelihood (ML) and Bayesian inference (BI) analyses had a very similar topology overall, but exhibited minor differences within interior nodes. Both ML and BI analyses failed to recover all P. yunnanensis accessions as a monophyletic lineage, instead grouping them into two phylogenetically disparate clades (Figure 2). The first clade consisted of all “high stem” accessions while the second clade included all “typical” P. yunnanensis accessions. The monophyly of both clades received full branch support (BP = 100, PP = 1), and the clades were separated from each other by Paris yanchii, Paris lancifolia (≡ P. polyphylla var. stenophylla), and the clade comprising Paris tengchongensis, Paris forrestii, Paris rugosa, Paris mairei, P. polyphylla, Paris luquanensis, and Paris marmorata. Moreover, the nrDNA phylogenies indicated that P. polyphylla is sister to P. mairei (BP = 100, BI = 1) and closely related to P. luquanensis and P. marmorata (BP = 100, BI = 1). However, not only P. yunnanensis accessions but also Paris chinensis (≡ P. polyphylla var. chinensis) and P. lancifolia, once treated as conspecific varieties of P. polyphylla (Hara, 1969; Li, 1998), were phylogenetically disparate from P. polyphylla in both ML and BI trees (Figure 2).
Figure 2. Phylogenetic tree reconstructed via maximum-likelihood (ML) and Bayesian inference (BI) analyses of nuclear ribosomal DNA (nrDNA) sequences. Numbers above branches indicate likelihood bootstrap percentages (BP) and Bayesian posterior probabilities (PP).
Plastome Phylogenies
In this study, 22 P. yunnanensis plastomes were recovered, using the genome skimming approach. The plastome size of “typical” P. yunnanensis and “high stem” accessions varied from 157,641 to 158,254 bp and 157,951 to 158,526 bp, which possessed the typical quadripartite structure of flowering plants, consisting of a LSC, a SSC, and a pair of IRs (Figure 3). All plastomes contained the same 114 unique genes, including 80 protein-coding genes, 30 tRNA genes, and four rRNA genes (Supplementary Table S2). Several internal stop codons in coding regions of the cemA gene identified it as a pseudogene in all newly generated plastomes.
Figure 3. The plastome map of “typical” Paris yunnanensis (P. yunnanensis s.s.) accessions (A) and “high stem” form (P. liiana sp. nov) accessions (B).
The incongruence length difference (ILD) test revealed strong discordance between plastome and nrDNA datasets (p < 0.001). Although both ML and BI analyses similarly grouped “high stem” and “typical” P. yunnanensis plastomes into two phylogenetically independent clades, the relationships of these two groups with congeneric species differed greatly from those revealed by nrDNA phylogenies. Since P. luquanensis was nested into “typical” P. yunnanensis accessions, both ML and BI analyses failed to resolve the latter as monophyletic (Figure 4). In addition, plastome phylogenies did not recover the sister relationships between “typical” P. yunnanensis accessions and P. yanchii, as well as between “high stem” accessions and P. lancifolia. Instead, P. yanchii and P. lancifolia formed a well-supported clade (BP = 99, PP = 1) sister to the clade consisting of P. mairei, P. marmorata, and P. polyphylla (BP = 84, PP = 1). Similar to nrDNA phylogenies, P. chinensis, P. lancifolia, P. polyphylla, and P. yunnanensis accessions were resolved as phylogenetically disparate in both tree topologies (Figure 4).
Figure 4. Phylogenetic tree reconstructed via maximum-likelihood (ML) and Bayesian inference (BI) analyses of complete plastomes. Numbers above branches indicate likelihood bootstrap percentages (BP) and Bayesian posterior probabilities (PP).
Discussion
Paris yunnanensis is a medicinal plant with great economic importance to the pharmaceutical industry. In this study, we aimed to resolve long-standing taxonomic disputes regarding this species using ultra-barcoding technique. The complete plastomes and nuclear ribosomal DNA regions from 22 Paris yunnanensis accessions were newly generated to investigate the species-level monophyly of the plant and its relationships with the congeneric species. Our data not only allowed recognition of a cryptic species in P. yunnanensis, but also led us to resolve long-standing controversies regarding the taxonomic status of this species. This study represents a guiding practical application of ultra-barcoding technique for discovery of cryptic or novel species. The findings highlight the great potential of ultra-barcoding as an effective tool for resolving perplexing problems in taxonomically difficult plant taxa, and have implications for the conservation and management of P. yunnanensis germplasm.
Putative Hybridization
Similar to previous studies (Ji et al., 2006, 2019b), we found that nrDNA and plastome phylogenies were largely incongruent in Paris. With respect to the target species of this study, the cytonuclear incongruence primarily involved the non-monophyly of “typical” P. yunnanensis accessions in plastome trees. Notably, cytonuclear discordance is a commonly investigated phenomenon in plant phylogenetics (Rieseberg and Soltis, 1991), which can be attributed to incomplete sorting of cytoplasmic polymorphisms or “chloroplast capture” resulting from hybridization (Wendel and Doyle, 1998). Ji et al. (2006) proposed that natural hybridization between some sympatric Paris species is feasible if the pollination mechanisms are compatible. In addition, observation of morphological intermediates between P. yunnanensis and P. luquanensis suggests that natural hybridization may occur between these two species (Ji et al., 2006).
Plastome tree topologies indicated that the non-monophyly of “typical” P. yunnanensis accessions results from clustering of P. luquanensis with P. yunnanensis accessions collected from northern Yunnan and southwestern Sichuan. Within these regions, P. yunnanensis is sympatric with P. luquanensis. Therefore, the non-monophyly of “typical” P. yunnanensis plastomes may have been caused by chloroplast capture, with the plastome from P. yunnanensis being introgressed into the nuclear background of P. luquanensis by hybridization (Rieseberg and Soltis, 1991; Rieseberg and Wendel, 1993). This assumption can be further tested through analyzing multiple loci of nuclear genes and sampling populations of both species.
Evidence for a Cryptic Species Within Paris yunnanensis
The plasticity of morphological characteristics and lack of taxonomically robust characters among Paris species have made the taxonomy of this genus historically difficult to reconstruct, especially for Chinese and Himalayan species (Franchet, 1888; Hara, 1969; Takhtajan, 1983; Li, 1998). Despite the great commercial value of P. yunnanensis to the pharmaceutical industry, the alpha taxonomy of this plant is still uncertain. In this study, we used complete plastomes and nrDNA sequences as ultra-barcodes to assess the species-level monophyly of P. yunnanensis as currently circumscribed. Our data failed to group all accessions into a single and monophyletic clade, but resolved them as two phylogenetically disparate and well-supported clades corresponding to the two phenotypes identified in P. yunnanensis. This suggests that both “typical” and “high stem” forms represent two evolutionarily distinct lineages.
The “high stem” form shows significant morphological differences from “typical” P. yunnanensis, which include plant height, leaf-blade shape, length, and width, sepal shape, petal color and width, and color of fruit at maturity (Figure 1 and Table 2). Interestingly, their aerial shoots also exhibit distinct growth patterns (Figure 5). Specifically, opening of flowers in “high stem” form is usually 5–15 days earlier than the leaf unfolding, when the pedicels extend out 10–30 cm above stem apex. On the contrast, flowering and leaf expansion synchronize in “typical” P. yunnanensis, whose pedicels do not obviously elongate until the full expansion of leaves. In addition, the two phenotypes possess distinct distribution ranges. The “high stem” populations occur in southern Yunnan, western Guangxi, and southwestern Guizhou, whereas “typical” P. yunnanensis (P. yunnanensis s.s.) is mainly distributed in central, northern, northwestern, and western Yunnan, southwestern Sichuan, and southeastern Tibet (Figure 6). There is little overlap between their respective distribution ranges. This evidence justifies the “high stem” form being recognized as a distinct taxon. Moreover, the phylogenetic relationships of the “high stem” form with related, well-defined, congeneric species suggest that recognition of it as a distinct species is appropriate.
Figure 5. Comparison of the development of aerial shoot between “typical” Paris yunnanensis (P. yunnanensis s.s.) and “high stem” form (P. liiana sp. nov.).
Figure 6. The distribution of Paris yunnanensis s.s. (“typical” form, blue cycle) and P. liiana sp. nov. (“high stem” form, red cycle).
The nrDNA and plastome tree topologies both indicated that P. yunnanensis s.s. (≡ P. polyphylla var. yunnanensis), P. chinensis (≡ P. polyphylla var. chinensis), and P. lancifolia (≡ P. polyphylla var. stenophylla) are genetically distinct from P. polyphylla. The ultra-barcoding data provide no support for treating these four taxa as conspecific varieties (Handel-Mazzetti, 1936; Hara, 1969; Li, 1998), but justify that they should be recognized as distinct species. It is notable that our sampling of P. chinensis and P. lancifolia (one individual per species) might be limiting for molecular study. Further study of their species-level monophyly by sampling multiple accessions per species is warranted.
With more variable characters than standard DNA barcodes, genomic data have been recommended as next-generation DNA barcodes for plants (Kane and Cronk, 2008; Nock et al., 2011; Kane et al., 2012; Ruhsam et al., 2015; Hollingsworth et al., 2016) and utilization of these extended barcodes in plant species identification is referred to as ultra-barcoding (Kane et al., 2012) or “plant barcoding 2.0” (Hollingsworth et al., 2016). However, the efficiency of ultra-barcodes for the discovery of cryptic and novel species has seldom been evaluated (Kane and Cronk, 2008; Kane et al., 2012; Hollingsworth et al., 2016). The practical application of ultra-barcodes in this study not only allowed recognition of a cryptic species in P. yunnanensis, but also led us to infer possible hybridization between P. luquanensis and “typical” P. yunnanensis (P. yunnanensis s.s.). Therefore, the ultra-barcoding approach has great promise for discovery of novel taxa, and offers significant advantages in interpreting possible hybridization events and identifying hybrids.
Implications for the Management of Paris yunnanensis Germplasm Resources
Germplasm resources are the genetic material basis for plant breeding and crop improvement (Nass et al., 2012). Proper circumscription of a species and identification of germplasm diversity is a critical prerequisite to conservation and management efforts. We propose a narrow species delimitation for P. yunnanensis, based on successful distinction of P. yunnanensis s.s. from the cryptic species using complete plastome and nrDNA sequences as ultra-barcodes. In addition, both datasets possess high levels of infraspecific sequence variation in P. yunnanensis s.s. Thus, ultra-barcoding could be an effective tool for identifying P. yunnanensis s.s. and for investigating its germplasm diversity.
As we discussed above, cytonuclear discordance observed in P. yunnanensis s.s. implies that natural hybridization may occur between this plant and its sympatric congeneric species, P. luquanensis. From the perspective of germplasm conservation and management, great attention should be paid to the protection of “genetically genuine” individuals and populations. Ultra-barcoding may help exclude possible hybrids for construction of a core germplasm resource. Based on this, we could search for elite germplasm that is highly productive and contains high levels of steroidal saponins for breeding needs. Given that distant hybridization can either result in parental advantages or create heterosis (Cicin, 1954; Whitney et al., 2010), it will not only extend the germplasm resources of P. yunnanensis s.s., but can also improve the performance of the outcrossing offspring. Therefore, hybrids are indispensable complements to the core germplasm resources. Ultra-barcoding will serve as a useful tool for genotyping hybrid germplasm and interpreting parentage. Elucidating these issues will improve future cross-breeding research.
Taxonomic Treatment
Paris liiana Y. H. Ji sp. nov. (Figures 1, 5, 7).
Type: China. Yunnan: Yuanyang County, Xiaoxinjie, 24° 43′ 53.76″ N, 104° 21′ 06.01″ E, 1599 m, 7 August 2016, Y. H. Ji 2016457 (holotype, KUN!); Qiubei County, 24° 03′ 55.45″ N, 104° 10′ 57.57″ E, 1530 m, 12 July 2016, Y. L. Huang 006 (paratype, KUN!).
Paris liiana can be distinguished from P. yunnanensis by its elliptic or oblong-obovate leaf blade, 20–30 cm × 8–15 cm, oblong or obovate-oblong sepals, petals filiform to linear, distally slightly widened to 2–3 mm, and capsule dark red or brown at the top.
Perennial herbs with cylindrical, oblique or horizontal rhizomes, yellowish brown outside, and white inside, 3.0–7.0 cm in diameter, 5.0–20.0 cm long, bearing a bud at the top, roots up to 30.0 cm long. Stem erect, cylindrical, purplish red or green, 50.0–150.0 cm tall. Leaves 5–12 in an apical whorl, green; petiole light green, 8.0–3.0 cm long; leaf blades elliptic or oblong-obovate, apex acute, 20–30 cm × 8–15 cm oblong, 6.0–19.0 × 3.5–9.5 cm; two pairs of lateral veins, basally developed. Flower solitary and terminal, basic merosity 5–10. Peduncle green or light purple, 25.0–50.0 cm; sepals 5–10, oblong or obovate-oblong, green, ca. 5–12 × 2.5–5 cm 3.5–8.6 × 1.6–2.2 cm; petals 5–10, filiform-linear, green at low portion, greenish yellow at upper portion, distally slightly widened to 2–3 mm, shorter or slightly longer than sepals. Stamens 2 × petal number, filament greenish yellow, 3.0–6.0 mm, anthers golden yellow, dehiscing by a lateral slit, 1.5–4.0 cm long. Ovary pale green at base, purplish red apically, with 5–10 slight ridges, carpels 5–10, unilocular with parietal placenta; style 4.0–5.0 mm, with an enlarged base, purplish red, stigmas 5–10-lobed, dark brown. Capsule dehiscent, subglobose, green, dark red or brown at the top. Seeds numerous, with a red and juicy sarcotesta.
Additional specimens examined: China. Guangxi: Longlin, 01 Jun. 1957, Liang CF and Wu DL 32471 (IBSC); Nanning, 07 Jul. 1973, Huang XC 5833 (GXMG). Guizhou: Anlong, 27° 44′ 34.7″ N, 98° 36′ 17.1″ E, 1800 m, 09 Jun. 1960, Guizhou Expedition 3167 (KUN); loc. eodem, 09 Jun. 1960, Zhang ZS and Zhang YT 4155 (PE); Xingyi, 20 Jul. 1960, Zhang ZS and Zhang YT 6408 (PE). Yunnan: Eshan, 1350 m, 11 Jul. 1989, Yuxi Expedition 89-516 (KUN); loc. eodem, 1300 m, 29 Apr. 1988, Eshan Expedition 88-101 (KUN); Guangnan, 05 Jan. 2016, Guangnan Expedition 5326270519 (IMDY); loc. eodem, 20 Jan. 2016, Guangnan Expedition 5326270534 (IMDY); Jingdong, 21 Oct. 1956, Qiu BY 52963 (KUN); loc. eodem, 28 Apr. 1959, Xu SG 5049 (KUN); loc. eodem, 06 Dec. 1939, Li MG 2263 (KUN); Jinghong, Sept. 1936, Wang CW 78693 (PE); loc. eodem, May 1984, Tao GD 44099 (HITBC); Menghai, May 1936, Wang CW 74278 (PE); loc. eodem, 14 Jun. 2012, Menghai Census 5328220509 (IMDY); loc. eodem, 24 Apr. 2012, Menghai Census 5328220065 (IMDY); loc. eodem, 03 Oct. 1959, Cai XT 59-10459 (KUN); loc. eodem, 1650 m, 17 Jun. 1960, Yunnan Tropic Expedition 60-11693 (KUN); Mengla, 10 Nov. 1959, Pei SJ 59-11386 (KUN); Lancang, 20 Aug. 2015, Yang YP yi-173-1 (KUN); Longling, 17 May 2015, Longling Census 530523150517045LY (IMDY); Lüchun, 16 Oct. 1973, Tao DD 856 (KUN); Luoping, 27 May 1989, Hongshui River Expedition 1722 (KUN); loc. eodem, 2480 m, 28 May 1989, Hongshui River Expedition 1922 (KUN); Fengqing, 18 Jun. 1938, Yu TT 16348 (PE); Pingbian, 05 Oct. 1939, Wang CW 82319 (PE); Simao, 29 May 2012, Simao Census 5308020534 (IMDY); loc. eodem, 14 Jun. 2012, Simao Census 5308020705 (IMDY); loc. eodem, 17 May 2012, Simao Census 5308020404 (IMDY); Xichou, 2200 m, 24 Sept. 1947, Feng GM 11993 (KUN); Xinping, 03 Jun. 2012, Xinping Census 5304270419 (IMDY); Yanshan, 1250–1320 m, 09 Oct. 1939, Wang QW 84252 (KUN); Yuanjiang, 08 Jun. 2012, Yuanjiang Census 5304280573 (IMDY).
Etymology: The species is named in honor of Prof. Heng Li, who carried out the most recent and comprehensive taxonomic revision on the genus Paris.
Distribution: Guangxi, southwestern Guizhou, and southern Yunnan, China (Figure 6).
Habitat: Evergreen broad-leaved forests dominated by Castanopsis, Lithocarpus, Quercus, and Schima species at 1200–2200 elevation.
Phenology: Flowering April–May, fruiting June–December.
Conservation status: The species is commonly harvested as medicinal herb by local people. We estimate that its population size has been reduced by at least 50% over the past 10 years. According to the IUCN (2012) red list categories and criteria, P. liinana should be assessment as vulnerable status (VU A1d).
Data Availability Statement
The datasets Generated for this study can be found in NCBI GenBank database, and the accession number of each sequence are showed in Table 1. Alignment of sequences are deposited in the online database Treebase (http://purl.org/phylo/treebase/phylows/study/TB2:S25503).
Author Contributions
YJ and J-BY designed the research. CL, JY, LJ, ZY, and J-BY collected and analyzed the data. YJ wrote the manuscript. J-BY revised the manuscript.
Funding
This study was supported by the National Natural Science Foundation of China (31872673), the NSFC-Joint Foundation of Yunnan Province (U1802287), a grant from the Large-scale Scientific Facilities of the Chinese Academy of Sciences (no. 2017-LSF-GBOWS-02), and the Major Science and Technology Projects of Yunnan Science and Technology Plan (2019ZF011-2).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We are grateful to Zhangming Wang, Guohua Zhou, Chengjin Yang, Tingzhou Zhao, Yuling Huang, Ren Zhao and Yulong Li for their help in collecting samples used in the study, and to Zhengshan He, Jing Yang, and Zhirong Zhang for their assistance in data analysis.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2020.00411/full#supplementary-material
Footnotes
References
Bankevich, A., Nurk, S., Antipov, D., Gurevich, A. A., Dvorkin, M., Kulikov, A. S., et al. (2012). SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comp. Biol. 19, 455–477. doi: 10.1089/cmb.2012.0021
Bell, D., Long, D. G., Forrest, A. D., Hollingsworth, M. L., Blom, H. H., and Hollingsworth, P. M. (2012). DNA barcoding of European Herbertus (Marchantiopsida, Herbertaceae) and the discovery and description of a new species. Mol. Ecol. Resour. 12, 36–47. doi: 10.1111/j.1755-0998.2011.03053.x
China Pharmacopoeia Commission (2015). Pharmacopoeia of the People’s Republic of China. Beijing: China Medica Science Press.
Cicin, N. V. (1954). Distant hybridization in plants. Moskva: Gosudarstvennoe Izdateljstvo Seljskohozjaistvennoi Literatury.
Coissac, E., Hollingsworth, P. M., Lavergne, S., and Taberlet, P. (2016). From barcodes to genomes: extending the concept of DNA barcoding. Mol. Ecol. 25, 1423–1428. doi: 10.1111/mec.13549
Doyle, J. J., and Doyle, J. L. (1987). A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 19, 11–15.
Farris, J. S., Källersjö, M., Kluge, A. C., and Bult, C. (1994). Testing significance of incongruence. Cladistics 10, 315–319. doi: 10.1111/j.1096-0031.1994.tb00181.x
Franchet, A. (1888). Monographie du Genere Paris. Paris: Memoire de la Societe Philomathique Centaire.
Hajibabaei, M., Janzen, D. H., Burns, J. M., Hallwachs, W., and Hebert, P. D. N. (2006). DNA barcodes distinguish species of tropical Lepidoptera. Proc. Natl. Acad. Sci. U.S.A. 103, 968–971. doi: 10.1073/pnas.0510466103
Handel-Mazzetti, H. R. E. (1936). Symbolae Sinicae: Botanische Ergebnisse der Expedition der Akademie der Wissenschaften in Wein nach Südwest-China. Wien: Springer.
Hara, H. (1969). Variation in Parispolyphylla Smith, with reference to other Asiatic species. J. Fac. Sci. 10, 141–180.
Hebert, P. D. N., Cywinska, A., Ball, S. L., and De-Waard, J. R. (2003). Biological identifications through DNA barcodes. Proc. R. Soc. B 270, 313–322. doi: 10.1098/rspb.2002.2218
Hollingsworth, P. M. (2011). Refining the DNA barcode for land plants. Proc. Natl. Acad. Sci. U.S.A. 108, 19451–19452. doi: 10.1073/pnas.1116812108
Hollingsworth, P. M., Forrest, L. L., Spouge, J. L., Hajibabaei, M., Ratnasingham, S., van der Bank, M., et al. (2009). A DNA barcode for land plants. Proc. Natl. Acad. Sci. U.S.A. 106, 12794–12797. doi: 10.1073/pnas.0905845106
Hollingsworth, P. M., Li, D. Z., Michelle, V. D. B., and Twyford, A. D. (2016). Telling plant species apart with DNA: from barcodes to genomes. Philos. Trans. R. Soc. B Biol. Sci. 371:20150338. doi: 10.1098/rstb.2015.0338
Huang, L. Q., Xiao, P. G., and Wang, Y. Y. (2012). Investigation on Resources of Rare and Endangered Medicinal Plants in China. Shanghai: Shanghai Science & Technology Press.
Huang, Y. L., Li, X. J., Yang, Z. Y., Yang, C. J., Yang, J. B., and Ji, Y. H. (2016). Analysis of complete chloroplast genome sequences improves phylogenetic resolution of Paris (Melanthiaceae). Front. Plant Sci. 7:1797. doi: 10.3389/fpls.2016.01797
Ji, Y. H., Fritsch, P. W., Li, H., Xiao, T., and Zhou, Z. (2006). Phylogeny and classification of Paris (Melanthiaceae) inferred from DNA sequence data. Ann. Bot. 98, 245–256. doi: 10.1093/aob/mcl095
Ji, Y. H., Liu, C. K., Yang, Z. Y., Yang, L. F., He, Z. H., Wang, H. C., et al. (2019a). Testing and using complete plastomes and ribosomal DNA sequences as the next generation DNA barcodes in Panax (Araliaceae). Mol. Ecol. Resour. 19, 1333–1345. doi: 10.1111/1755-0998.13050
Ji, Y. H., Yang, L. F., Chase, M. W., Liu, C. K., Yang, Z. Y., Yang, J., et al. (2019b). Plastome phylogenomics, biogeography, and clade diversification of Paris (Melanthiaceae). BMC Plant Biol. 19:543. doi: 10.1186/s12870-019-2147-6
Jin, J. J., Yu, W. B., Yang, J. B., Song, Y., Yi, T. S., and Li, D. Z. (2018). GetOrganelle: a simple and fast pipeline for de novo assembly of a complete circular chloroplast genome using genome skimming data. bioRxiv [Preprint]. doi: 10.1101/256479
Kane, N. C., and Cronk, Q. (2008). Botany without borders, barcoding in focus. Mol. Ecol. 17, 5175–5176. doi: 10.1111/j.1365-294x.2008.03972.x
Kane, N. S., Sveinsson, S., Dempewolf, H., Yang, J. Y., Zhang, D., Engels, J. M. M., et al. (2012). Ultra-barcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA. Am. J. Bot. 99, 320–329. doi: 10.3732/ajb.1100570
Katoh, K., and Standley, D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780. doi: 10.1093/molbev/mst010
Kearse, M., Moir, R., Wilson, A., Stones-Havas, S., Cheung, M., Sturrock, S., et al. (2012). Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28, 1647–1649. doi: 10.1093/bioinformatics/bts199
Kurtz, S., Phillippy, A., Delcher, A. L., Smoot, M., Shumway, M., Antonescu, C., et al. (2004). Versatile and open software for comparing large genomes. Genome Biol. 5:R12. doi: 10.1186/gb-2004-5-2-r12
Langmead, B., and Salzberg, S. L. (2012). Fast gapped-read alignment with Bowtie2. Nat. Methods 9, 357–359. doi: 10.1038/nmeth.1923
Li, D. Z., Gao, L. M., Li, H. T., Wang, H., Ge, X. J., Liu, J. Q., et al. (2011). Comparative analysis of a large dataset indicates that internal transcribed spacer (ITS) should be incorporated into the core barcode for seed plants. Proc. Natl. Acad. Sci. U.S.A. 108, 19641–19646. doi: 10.1073/pnas.1104551108
Li, L., Jiang, Y., Niu, Z., Xue, Q., Liu, W., and Ding, X. (2020). The large single-copy (LSC) region functions as a highly effective and efficient molecular marker for accurate authentication of medicinal Dendrobium species. Acta Pharm. Sin. B. [Preprint]. doi: 10.1016/j.apsb.2020.01.012
Li, X., Yang, Y., Henry, R. J., Rossetto, M., Wang, Y., and Chen, S. (2015). Plant DNA barcoding: from gene to genome. Biol. Rev. 90, 157–166. doi: 10.1111/brv.12104
Nass, L. L., Sigrist, M. S., Ribeiro, C. S. C., and Reifschneider, F. J. B. (2012). Genetic resources: the basis for sustainable and competitive plant breeding. Crop Breed. Appl. Biotechnol. Sci. 2, 75–86. doi: 10.1590/S1984-70332012000500009
Nock, C. J., Waters, D. L. E., Edwards, M. A., Bowen, S. G., Rice, N., Cordeiro, G. M., et al. (2011). Chloroplast genome sequences from total DNA for plant identification. Plant Biotechnol. J. 9, 328–333. doi: 10.1111/j.1467-7652.2010.00558.x
Patel, R. K., and Jain, M. (2012). NGS QC Toolkit: a toolkit for quality control of next generation sequencing data. PLoS One 7:e30619. doi: 10.1371/journal.pone.0030619
Pons, J., Barraclough, T. G., Gomez-Zurita, J., Cardoso, A., Duran, D. P., Hazell, S., et al. (2006). Sequence-based species delimitation for the DNA taxonomy of undescribed insects. Syst. Biol. 55, 595–609. doi: 10.1080/10635150600852011
Posada, D., and Buckley, T. R. (2004). Model selection and model averaging in phylogenetics: advantages of Akaike information criterion and Bayesian approaches over likelihood ratio tests. Syst. Biol. 53, 793–808. doi: 10.2307/4135365
Posada, D., and Crandall, K. A. (1998). MODELTEST: testing the model of DNA substitution. Bioinformatics 14, 817–818. doi: 10.1093/bioinformatics/14.9.817
Rieseberg, L. H., and Soltis, D. E. (1991). Phylogenetic consequences of cytoplasmic gene flow in plants. Am. J. Bot. 5, 65–84. doi: 10.1007/BF00021248
Rieseberg, L. H., and Wendel, J. F. (1993). “Introgression and its consequences in plants,” in Hybrid Zones and the Evolutionary Process, ed. R. G. Harrison, (New York, NY: Oxford University Press), 70–114.
Ronquist, F., and Huelsenbeck, J. P. (2003). MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574. doi: 10.1093/bioinformatics/btg180
Ruhsam, M., Rai, H. S., Mathews, S., Ross, T. G., Graham, S. W., Raubeson, L. A., et al. (2015). Does complete plastid genome sequencing improve species discrimination and phylogenetic resolution in Araucaria? Mol. Ecol. Resour. 15, 1067–1078. doi: 10.1111/1755-0998.12375
Schattner, P., Brooks, A. N., and Lowe, T. M. (2005). The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 33, 686–689. doi: 10.1093/nar/gki366
Stamatakis, A. (2006). RAxML-VI-HPC: maximum likelihood-based phylogenetic analysis with thousands of taxa and mixed models. Bioinformatics 22, 2688–2690. doi: 10.1093/bioinformatics/btl446
Straub, S. C. K., Parks, M., Weitemier, K., Fishbein, M., Cronn, R. C., and Liston, A. (2012). Navigating the tip of the genomic iceberg: next-generation sequencing for plant systematics. Am. J. Bot. 99, 349–364. doi: 10.3732/ajb.1100335
Swofford, D. L. (2002). PAUP: Phylogenetic Analysis Using Parsimony (and Other Methods), 4.0 Beta. Sunderland, MA: Sinauer Associates.
Takhtajan, A. (1983). A revision of Daiswa (Trilliaceae). Brittonia 35, 255–270. doi: 10.2307/2806025
Turner, B., Paun, O., Munzinger, J., Chase, M. W., and Samuel, R. (2016). Sequencing of whole plastid genomes and nuclear ribosomal DNA of Diospyros species (Ebenaceae) endemic to New Caledonia: many species, little divergence. Ann. Bot. 117, 1175–1185. doi: 10.1093/aob/mcw060
Ward, R. D., Zemlak, T. S., Innes, B. H., Last, P. R., and Hebert, P. D. N. (2005). DNA barcoding Australia’s fish species. Philos. Trans. R. Soc. B Biol. Sci. 360, 1847–1857. doi: 10.1098/rstb.2005.1716
Wendel, J. F., and Doyle, J. J. (1998). “Phylogenetic incongruence: window into genome history and speciation,” in Molecular Systematics of Plants, eds P. S. Soltis, D. E. Soltis, and J. J. Doyle, (New York, NY: Chapman and Hall), 265–296. doi: 10.1007/978-1-4615-5419-6_10
Whitney, K. D., Ahern, J. R., Campbell, L. G., Albert, L. P., and King, M. S. (2010). Patterns of hybridization in plants. Perspect. Plant Ecol. Evol. Syst. 12, 175–182. doi: 10.1016/j.ppees.2010.02.002
Wyman, S. K., Jansen, R. K., and Boore, J. L. (2004). Automatic annotation of organellar genomes with DOGMA. Bioinformatics 20, 3252–3255. doi: 10.1093/bioinformatics/bth352
Yang, L. F., Yang, Z. Y., Liu, C. K., He, Z. S., Zhang, Z. R., Yang, J., et al. (2019). Chloroplast phylogenomic analysis provides insights into the evolution of the largest eukaryotic genome holder, Paris japonica (Melanthiaceae). BMC Plant Biol. 19:293. doi: 10.1186/s12870-019-1879-7
Yang, Z. Y., Yang, L. F., Liu, C. K., Qin, X. J., Liu, H. Y., Chen, J. H., et al. (2019). Transcriptome analyses of Paris polyphylla var. chinensis, Ypsilandra thibetica, and Polygonatum kingianum characterize their steroidal saponin biosynthesis pathway. Fitoterapia 135, 52–63. doi: 10.1016/j.fitote.2019.04.008
Keywords: new species, plastomes, ribosomal DNA, species identification, DNA barcodes, Paris liiana, Melanthiaceae
Citation: Ji Y, Liu C, Yang J, Jin L, Yang Z and Yang J-B (2020) Ultra-Barcoding Discovers a Cryptic Species in Paris yunnanensis (Melanthiaceae), a Medicinally Important Plant. Front. Plant Sci. 11:411. doi: 10.3389/fpls.2020.00411
Received: 22 January 2020; Accepted: 23 March 2020;
Published: 22 April 2020.
Edited by:
Nina Rønsted, National Tropical Botanical Garden, United StatesReviewed by:
Panagiotis Madesis, Institute of Applied Biosciences (INAB), GreeceXiaoyu Ding, Nanjing Normal University, China
Copyright © 2020 Ji, Liu, Yang, Jin, Yang and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yunheng Ji, aml5aEBtYWlsLmtpYi5hYy5jbg==; Jun-Bo Yang, amJ5YW5nQG1haWwua2liLmFjLmNu