Phyllanthus emblica or Indian gooseberry, commonly known as amla, is an important medicinal horticultural plant used in traditional and modern medicines. It bears stone fruits with immense antioxidant properties due to being one of the richest natural sources of vitamin C and numerous flavonoids. This study presents the first genome sequencing of this species performed using 10x Genomics and Oxford Nanopore Technology. The draft genome assembly was 519 Mbp in size and consisted of 4,384 contigs, N50 of 597 Kbp, 98.4% BUSCO score, and 37,858 coding sequences. This study also reports the genome-wide phylogeny of this species with 26 other plant species that resolved the phylogenetic position of P. emblica. The presence of three ascorbate biosynthesis pathways including L-galactose, galacturonate, and myo-inositol pathways was confirmed in this genome. A comprehensive comparative evolutionary genomic analysis including gene family expansion/contraction and identification of multiple signatures of adaptive evolution provided evolutionary insights into ascorbate and flavonoid biosynthesis pathways and stone fruit formation through lignin biosynthesis. The availability of this genome will be beneficial for its horticultural, medicinal, dietary, and cosmetic applications and will also help in comparative genomics analysis studies.
The genus Neocinnamomum is considered to be one of the most enigmatic groups in Lauraceae, mainly distributed in tropical and subtropical regions of Southeast Asia. The genus contains valuable oilseed and medicinal tree species. However, there are few studies on the genus Neocinnamomum at present, and its interspecific relationship is still unclear. In order to explore the genetic structure and evolutionary characteristics of the Neocinnamomum chloroplast genome and to resolve the species relationships within the genus, comparative genomic and phylogenetic analyses were performed on the whole chloroplast genome sequences of 51 samples representing seven Neocinnamomum taxa. The whole Neocinnamomum chloroplast genome size ranged from 150,753-150,956 bp, with a GC content of 38.8%-38.9%. A total of 128 genes were annotated within the Neocinnamomum chloroplast genome, including 84 protein coding genes, 8 rRNA genes, and 36 tRNA genes. Between 71-82 SSRs were detected, among which A/T base repeats were the most common. The chloroplast genome contained a total of 31 preferred codons. Three highly variable regions, trnN-GUU-ndhF, petA-psbJ, and ccsA-ndhD, were identified with Pi values > 0.004. Based on the whole chloroplast genome phylogenetic tree, the phylogenetic relationships among the seven Neocinnamomum taxa were determined. N. delavayi and N. fargesii were the most closely related species, and N. lecomtei was identified as the most basal taxon. In this study, the characteristics and sequence variation of the chloroplast genomes of seven Neocinnamomum taxa were revealed, and the genetic relationship among the species was clarified. The results of this study will provide a reference for subsequent molecular marker development and phylogenetic research of Neocinnamomum.
Chia is an annual crop whose seeds have the highest content of α-linolenic acid (ALA) of any plant known to date. We generated a high-quality assembly of the chia genome using circular consensus sequencing (CCS) of PacBio. The assembled six chromosomes are composed of 21 contigs and have a total length of 361.7 Mb. Genome annotation revealed a 53.5% repeat content and 35,850 protein-coding genes. Chia shared a common ancestor with Salvia splendens ~6.1 million years ago. Utilizing the reference genome and two transcriptome datasets, we identified candidate fatty acid desaturases responsible for ALA biosynthesis during chia seed development. Because the seed of S. splendens contains significantly lower proportion of ALA but similar total contents of unsaturated fatty acids, we suggest that strong expression of two ShFAD3 genes are critical for the high ALA content of chia seeds. This genome assembly will serve as a valuable resource for breeding, comparative genomics, and functional genomics studies of chia.
Dipsacus asperoides is a traditional medicinal herb widely used in inflammation and fracture in Asia. Triterpenoid saponins from D. asperoides are the main composition with pharmacological activity. However, the biosynthesis pathway of triterpenoid saponins has not been completely resolved in D. asperoides. Here, the types and contents of triterpenoid saponins were discovered with different distributions in five tissues (root, leaf, flower, stem, and fibrous root tissue) from D. asperoides by UPLC-Q-TOF-MS analysis. The discrepancy between five tissues in D. asperoides at the transcriptional level was studied by combining single-molecule real-time sequencing and next- generation sequencing. Meanwhile, key genes involved in the biosynthesis of saponin were further verified by proteomics. In MEP and MVA pathways, 48 differentially expressed genes were identified through co-expression analysis of transcriptome and saponin contents, including two isopentenyl pyrophosphate isomerase and two 2,3-oxidosqualene β-amyrin cyclase, etc. In the analysis of WGCNA, 6 cytochrome P450s and 24 UDP- glycosyltransferases related to the biosynthesis of triterpenoid saponins were discovered with high transcriptome expression. This study will provide profound insights to demonstrate essential genes in the biosynthesis pathway of saponins in D. asperoides and support for the biosynthetic of natural active ingredients in the future.
tRNA-derived small RNAs (tsRNAs) represent a novel category of small non-coding RNAs and serve as a new regulator of gene expression at both transcriptional and post-transcriptional levels. Growing evidence indicates that tsRNAs can be induced by diverse stimuli and regulate stress-responsive target genes, allowing plants to adapt to unfavorable environments. Here, we discuss the latest developments about the biogenesis and classification of tsRNAs and highlight the expression regulation and potential function of tsRNAs in plant biotic and abiotic stress responses. Of note, we also collect useful bioinformatics tools and resources for tsRNAs study in plants. Finally, we propose current limitations and future directions for plant tsRNAs research. These recent discoveries have refined our understanding of whether and how tsRNAs enhance plant stress tolerance.