Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci.
Sec. Plant Systematics and Evolution
Volume 15 - 2024 | doi: 10.3389/fpls.2024.1456102

First De Novo Genome Assembly and Characterization of Gaultheria prostrata

Provisionally accepted
Yan-Jun Lin Yan-Jun Lin 1Xiao-Ya Ding Xiao-Ya Ding 1*Yi-Wei Huang Yi-Wei Huang 2,3*Lu Lu Lu Lu 1*
  • 1 School of Pharmaceutical Sciences, Yunnan Key Laboratory of Pharmacology for Natural Products, and Yunnan College of Modern Biomedical Industry, Kunming Medical University, Kunming, Yunnan, China
  • 2 Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
  • 3 University of Chinese Academy of Sciences, Beijing, China

The final, formatted version of the article will be published soon.

    Gaultheria Kalm ex L. (Ericaceae), a type of evergreen shrub, known as a natural source of methyl salicylate, possesses rich germplasm resources, strong habitat adaptability, significant ornamental value, and noteworthy pharmacological activities. However, due to the paucity of whole genomic information, genetically deep research in these areas remains limited. Consequently, we intend to obtain genomic-data through high-throughput sequencing, gene annotation, flow cytometry, transcription factors prediction and genetic marker analysis for a representative species of this genus, with Gaultheria prostrata selected for our study. In this study, we preliminarily obtained the genome of G. prostrata through next-generation sequencing methods. Utilizing 47.94 Gb of high-quality sequence data (108.95× coverage), assembled into 114,436 scaffolds, with an N50 length of 33,667 bp. The genome size assembled by SOAPdenovo, approximately 417 Mb, corresponded closely to predictions by flow cytometry (440 Mb) and k-mer analysis (447 Mb). The genome integrity was evaluated using BUSCO with 91%. The heterozygosity ratio was 0.159%, the GC content was 38.85%, and the repetitive regions encompassed over 34.6% of the genome. A total of 26,497 protein-coding genes have been predicted and annotated across Nr, Swissprot, GO, KEGG, and Pfam databases. Among these, 14,377 and 2,387 genes received functional annotation in Nr and Swissprot, respectively; 21,895, 24,424, and 22,330 genes were similarly annotated in GO, KEGG, and Pfam. Moreover, A total of 279,785 SSRs were identified and 345,270 primers for these SSRs were designed. Within the various nucleotide types of SSRs, AG/CT and AAG/CTT constituted the predominant dinucleotide and trinucleotides in G. prostrata. In addition, 1,395 transcription factors (TFs) from 75 TF families, 462 transcription regulators (TRs) from 33 TR families and 840 protein kinase (PKs) from 118 PK families were identified in this genome. We also performed phylogenetic analyses of species such as G. prostrata, including estimation of divergence times and expansion contraction analyses, followed by positive selection analyses of orthologous gene pairs of Gaultheria prostrata and its close relative Vaccinium corymbosum. These results provide a reference for in-depth study of genus Gaultheria, contributing to future functional and comparative genomics analyses and providing supporting data for the development of molecular markers.

    Keywords: whole genome sequencing, gene annotation, Evergreen shrubs, Flow Cytometry, microsatellite, SSR primers, Transcription Factors, phylogenetic relationships

    Received: 11 Jul 2024; Accepted: 04 Oct 2024.

    Copyright: © 2024 Lin, Ding, Huang and Lu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Xiao-Ya Ding, School of Pharmaceutical Sciences, Yunnan Key Laboratory of Pharmacology for Natural Products, and Yunnan College of Modern Biomedical Industry, Kunming Medical University, Kunming, Yunnan, China
    Yi-Wei Huang, Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
    Lu Lu, School of Pharmaceutical Sciences, Yunnan Key Laboratory of Pharmacology for Natural Products, and Yunnan College of Modern Biomedical Industry, Kunming Medical University, Kunming, Yunnan, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.