- 1Smart Farm Research Center, Korea Institute of Science and Technology (KIST), Gangneung, Gangwon, Republic of Korea
- 2Division of Bio-Medical Science & Technology, KIST School, University of Science and Technology (UST), Daejeon, Republic of Korea
- 3Department of Oriental Medicine Biotechnology, College of Life Sciences, Kyung Hee University, Yongin, Republic of Korea
- 4Department of Agricultural Biotechnology, Seoul National University, Seoul, Republic of Korea
- 5Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
Artemisia argyi (A. argyi) is a medicinal plant belonging to the Asteraceae family and Artemisia genus. Flavonoids abundant in A. argyi are associated with anti-inflammatory, anticancer, and antioxidative effects. Eupatilin and jaceosidin are representative polymethoxy flavonoids with medicinal properties significant enough to warrant the development of drugs using their components. However, the biosynthetic pathways and related genes of these compounds have not been fully explored in A. argyi. This study comprehensively analyzed the transcriptome data and flavonoids contents from four different tissues of A. argyi (young leaves, old leaves, trichomes collected from stems, and stems without trichomes) for the first time. We obtained 41,398 unigenes through the de-novo assembly of transcriptome data and mined promising candidate genes involved in the biosynthesis of eupatilin and jaceosidin using differentially expressed genes, hierarchical clustering, phylogenetic tree, and weighted gene co-expression analysis. Our analysis led to the identification of a total of 7,265 DEGs, among which 153 genes were annotated as flavonoid-related genes. In particular, we were able to identify eight putative flavone-6-hydroxylase (F6H) genes, which were responsible for providing a methyl group acceptor into flavone basic skeleton. Furthermore, five O-methyltransferases (OMTs) gene were identified, which were required for the site-specific O-methylation during the biosynthesis of eupatilin and jaceosidin. Although further validation would be necessary, our findings pave the way for the modification and mass-production of pharmacologically important polymethoxy flavonoids through genetic engineering and synthetic biological approaches.
1 Introduction
Artemisia argyi (A. argyi) is a perennial medicinal plant that is widely distributed in Asia, Europe, and North America (Lee et al., 2009; Mei et al., 2016; Song et al., 2019). It belongs to the Artemisia genus of the Asteraceae family, which comprises over 500 species and subspecies (Bora and Sharma, 2011), many of which contain various and abundant bioactive components (Yoshikawa et al., 1996; Han et al., 2017; Lv et al., 2018; Cui et al., 2022). These plants are used in various fields, such as the herbal, food, cosmetic, and pharmaceutical industries, and therefore have high economic value (Ivanescu et al., 2021; Kshirsagar and Rao, 2021). A major contributor to the discovery of Artemisinin, which is isolated from Artemisia annua, and recognized for its antimalarial effect, was awarded the Nobel Prize in Physiology or Medicine in 2015 (Su and Miller, 2015). A. argyi has various pharmacological effects, including anti-inflammatory (Yun et al., 2016), anticancer (Seo et al., 2003), antioxidative (Lan et al., 2010), and antimicrobial activities (Wang et al., 2006). These effects are attributed to its secondary metabolites, including flavonoids (Han et al., 2017), organic acids (Han et al., 2017), and terpenoids (Yoshikawa et al., 1996). Flavonoids, especially the polymethoxy flavones eupatilin and jaceosidin, are abundant in the aerial parts, especially leaves of A. argyi (Lv et al., 2018; Cui et al., 2022). These compounds have been shown to induce apoptosis in human promyelocytic leukemia cells (Seo and Surh, 2001) and inhibit cervical cancer cells (Lee et al., 2005). Furthermore, Stillen® (DA-9601, Dong-A ST Co., Seoul, Korea), a drug developed using eupatilin and jaceosidin has been approved and utilized as a phytomedicine for gastritis treatment (Seol et al., 2004).
Flavonoids are synthesized via the phenylpropanoid pathway, in which phenylalanine is converted into 4-coumaroyl-CoA (Falcone Ferreyra et al., 2012). Flavonoid biosynthesis begins with the condensation of three malonyl-CoA molecules with one 4-coumaroyl-CoA molecule by chalcone synthase (CHS). Naringenin chalcones can be further modified into various flavonoid subclasses, such as chalcones, flavones, flavonols, flavandiols, anthocyanins, and proanthocyanidins. The biosynthesis of flavones involves several steps. Chalcone isomerase (CHI) catalyzes stereospecific isomerization of naringenin chalcone to synthesize naringenin, resulting in the synthesis of eriodictyol by flavonoid 3′-hydroxylase (F3′H). The two flavanones, naringenin and eriodictyol, act as major substrates of flavone synthase (FNS) and are converted into the basic flavone skeletons, apigenin and luteolin, respectively. Although the downstream pathways of the polymethoxy flavones eupatilin and jaceosidin have not yet been clearly elucidated, they are known to be produced via a series of enzymatic reactions of hydroxylation and O-methylation. These reactions start from a flavone basic skeleton, such as apigenin, or luteolin. In plants, members of the cytochrome P450 monooxygenase (CYP450) and O-methyltransferases (OMT) families, catalyze these reactions, respectively (Tohge et al., 2018; Yonekura-Sakakibara et al., 2019).
Transcriptome sequencing coupled with integrated metabolite profiling is a promising strategy for the identification of biosynthetic genes of interest. Attempts have been made in A. argyi to identify the genes involved in terpenoid (Liu et al., 2018; Zhang et al., 2022; Chen et al., 2023) and flavonoid (Miao et al., 2022) biosynthesis. Variations in levels of gene expression and contents of target compounds among the sample groups allow comparative studies, thereby an appropriate sample selection plays a key role in the success of this approach. The aerial parts of A. argyi are covered with trichomes. Trichomes are epidermal outgrowths found on the surfaces of various plant organs, including leaves, stems, and reproductive structures (Schilmiller et al., 2008; Frerigmann et al., 2012; Xiao et al., 2016; Tissier et al., 2017). Trichomes can be divided into glandular and non-glandular trichomes based on their structure and function. Glandulartrichomes contain glandular cells that secrete and store various secondary metabolites, such as terpenoids, alkaloids, and flavonoids, which play important roles in plant defense against herbivores, pathogens, and abiotic stress. Non-glandular trichomes, on the other hand, are composed of one or more layers of elongated cells and provide physical protection against various environmental factors, such as UV radiation, desiccation, and insect damage. The preliminary tests conducted in our group indicated that eupatilin and jaceosidin were abundant in trichomes collected from the surface of stems, compared to trichome-removed stem tissues. This result was further supported by recent studies which showed significant variations in their contents depending on the types of trichomes on leaves of A. argyi (Cui et al., 2022; Miao et al., 2022). In this regard, we chose four different tissues based on the presence or absence of trichomes and differences in their densities, and explored the differences in levels of gene expression and contents of target compounds among the sample.
Overall, the biosynthesis of secondary metabolites in plants is a complex and dynamic process that is not yet fully understood. In this study, we aimed to shed new light on the biosynthesis pathways of eupatilin, jaceosidin, and their precursors in A. argyi, and to identify the key genes and enzymes involved in this process. Using a combination of liquid chromatography and RNA-seq analyses of different tissue types, we aimed to provide a comprehensive understanding of the biosynthetic pathways of these important secondary metabolites, which may have important implications for the development of novel therapeutics and industrial applications.
2 Materials and methods
2.1 Plant materials
A. argyi plants (accession number AA1801), were obtained from the National Institute of Horticultural and Herbal Science, Rural Development Administration, Republic of Korea, and propagated asexually via rhizomes. The propagules were grown in a growth chamber under 22°C/18°C (day/night), with a 16-h photoperiod, 200 μmol/m2/s light intensity, and 50% relative humidity for 8 weeks. For the flavonoid quantification and transcriptome analyses, we collected the 3rd to 5th and 15th to 17th leaves counted from the shoot apical meristem, which were designated as ‘young leaves’ and ‘old leaves’, respectively. After removing the 10th to 17th leaves along with their petioles, the remaining stem was immediately frozen using liquid nitrogen, and the trichomes attached to the epidermal layer were collected using a razor blade and designated as the ‘trichome’ sample. The remaining stem without trichomes was designated as the ‘stem’ sample. Histochemical analysis of the stem transverse section was conducted by light microscopy to ensure the successful removal of the trichomes attached to the stem surface (Supplementary Figure S1). Each sample was prepared from three biological replicates.
2.2 Microscopic analysis of trichomes
The overall morphological characteristics of young and old leaves were analyzed using a dissecting microscope system, consisting of a Leica M60 microscope and a Leica IC90E digital camera with LAS version 4.13 software (Leica Microsystems, Wetzlar, Germany). The adaxial and abaxial surfaces of young and old leaves, as well as the stem surface, were imaged using a scanning electron microscope (SEM) to analyze the distribution and density of trichomes. The samples were fixed with 2.5% glutaraldehyde, followed by 2% osmium tetroxide post-fixation, dehydration, and drying using a hexamethyldisilazane (HMDS) solvent. The samples were then examined by a field emission-scanning electron microscopy with FEI Teneo VolumeScope (FEI, Hillsborough, USA) at the Advanced Analysis Center, Korea Institute of Science and Technology (Seoul, Korea). The density of glandular trichomes was measured in triplicate from the same-sized images of the samples.
2.3 Liquid chromatography analysis of flavonoids
For the quantification of each flavonoid, samples from each tissue were freeze-dried with three biological replicates. 50 mg of a freeze-dried powder sample was mixed with 900 μL of methanol and 100 μL of quercetin (as an internal standard at 250 μg/mL in methanol) and then ultrasonic extraction was performed for 60 minutes at 30°C. Filtration was conducted using a 0.45 μm syringe filter, followed by HPLC analysis.
The HPLC analysis was performed using an Agilent (Palo Alto, CA) Series 1200 liquid chromatography system and a Phenomenex Gemini C18 column packed with 5 μm particles (4.6 x 250 mm) at 35°C column oven. Acetonitrile and water (containing formic acid 0.1%) were used as a mobile phase. The binary mobile phase consisted of Solvent A (0.1% formic acid in water) and Solvent B (0.1% formic acid in acetonitrile) and was applied with a gradient program under the following conditions: 0 minutes, 10% B; 60 minutes, 20% B; 65 minutes, 23% B (isocratic for 30 minutes at 23% B); 95 minutes, 23% B; 120 minutes, 30% B; 150 minutes, 65% B. The injection volume was 10 μL, and the flow rate was set at 1.0 mL/min. UV detection was performed at 330 nm for fingerprint analysis. Freshly prepared calibration standards for each flavonoid were analyzed, and the contents of each compound were determined by standard curve fitting.
2.4 Total RNA isolation, library construction, and RNA-seq
Total RNA was isolated from four different tissue samples, young leaves, old leaves, trichomes, and stem using TRIzol (Life Technologies, Carlsbad, USA) and RNeasy Plant RNA kit (Qiagen, Hilden, Germany), followed by DNase treatment and RNA integrity verification with Ambion DNA-free kit (Thermo Fisher Scientific, Vilnius, Lithuania) and Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, USA), respectively, according to the manufacturer’s instruction. For RNA-seq, 12 paired-end cDNA libraries, representing three biological replicates per four different tissue samples, were constructed using TrueSeq mRNA kit (Illumina, San Diego, USA), and high-throughput sequencing was performed using Illumina HiSeq X Ten (Illumina).
2.5 De novo assembly, read mapping, annotation, and differential expression analysis
The sequence reads were pre-processed with Cutadapt for trimming adapter and DynamicTrim and LengthSort from the SolexaQA package for trimming low-quality sequences (Cox et al., 2010; Martin, 2011). The cleaned reads were subjected to de novo assembly using the Trinity version 2.8.6, which recovers full-length transcripts and splicing isoforms based on the de Bruijn graph by single k-mer (Grabherr et al., 2011), and aligned to the assembled transcripts using Bowtie 2 version 2.1.0 (Langmead and Salzberg, 2012). In the Trinity program, multiple isoform (transcript) information corresponding to a single locus is obtained. Among these isoforms, we selected the representative sequence with a longer assembly length and higher expression level. For annotation, all unique transcripts were analyzed using InterProScan and BLASTX against the non-redundant NCBI database under the taxonomy restriction to Viridiplantae (NCBI-NR). DESeq package in R was used to normalize the number of reads and identify differentially expressed genes (DEGs) with thresholds of | log2(fold change) | ≥ 1 and an adjusted p-value (false discovery rate) < 0.01 (Anders and Huber, 2010). All candidate genes involved in phenylpropanoid and flavonoid biosynthesis were further identified by tBLASTn search against the NCBI-NR database. GO term enrichment analysis was conducted through the Blast2GO program (FDR p-value: 0.05), and KEGG enrichment analysis was performed using an in-house script.
2.6 Weighted gene co-expression network analysis
WGCNA was conducted with normalized read counts of the 7,265 DEGs using an R package (Zhang and Horvath, 2005; Langfelder and Horvath, 2008) with the following parameter. To ensure that the scale-free topology index is greater than 0.8, three biological replicates of four different samples were used separately. Topological overlap matrix (TOM)- signed, power β=14, minimal module size=30, reassign threshold=0, and branch merge cut height=0. Eigengene (the first principal component of a given module) value was calculated and used for association analysis between a module and measured metabolites from four different tissues.
Hierarchical clustering heatmap was generated by converting the normalized expression values into Z-scores using the formula Z=(Log2(normalized count+1)-mean)/standard deviation. For metabolites, Z-scores were calculated using the same formula, but the amount of metabolite was used instead of normalized count.
2.7 Phylogenetic tree analysis
The phylogeny was inferred by Maximum Likelihood method, Le_Gascuel_2008 mode, and JTT substitution model available in software MEGA X (Kumar et al., 2018).
2.8 Statistical analyses
Analysis of variance (ANOVA), Tukey’s honest significant difference (Tukey HSD) procedures, and unpaired t-test were conducted using R v4.2.1 (Team, 2023).
3 Results
3.1 Differences in trichomes density among different tissues of A argyi
Visual and light microscopic analysis of the aerial parts of A. argyi revealed that the surfaces of the leaves and stems were covered with whitish trichomes. In Figure 1, it can be seen that the young leaves and abaxial surface display a brighter green color than the old leaves and the adaxial sides of both young and old leaves. The brightest color was observed on the abaxial surface of young leaves, indicating a higher density of trichomes. Trichome density was higher in developing young leaves and abaxial surfaces than in physiologically older leaves and adaxial surfaces.
Figure 1 Morphological characteristics of Artemisia argyi. (A) Whole plant. (B) Stems. (C) The part where trichomes have been removed from the stem appears black, and the yellow triangle indicates the stripped trichome. (D-F). Adaxial leaf surfaces in young and old leaves. (G-I) Abaxial surfaces in young and old leaves. Scale bars; (A-D), G = 5 cm, (E, F, H, I) = 1 mm.
High-resolution SEM was used to further analyze the morphology and density of the trichomes. The surfaces of the leaves and stems had both non-glandular and glandular trichomes (Figures 2A, B). Non-glandular trichomes were denser and curlier on the abaxial side than on the adaxial side of the leaves (Figure 2A). Elliptical ball-shaped glandular trichomes were present on both the abaxial and adaxial surfaces of the leaves. The density of glandular trichomes was found to be 2.6-fold higher in young leaves (49 ± 5.4/mm2) compared to old leaves (19 ± 2.9/mm2) on the adaxial surface (Figure 2C). However, the density of glandular trichomes on the abaxial surfaces of leaves and stems could not be quantified owing to the hidden glandular trichomes underneath multiple dense layers of non-glandular trichomes (Figure 2A).
Figure 2 Micromorphology of trichomes of Artemisia argyi. (A) Distribution of trichomes on adaxial and abaxial surfaces of young and old leaves using a scanning electron microscope. Red triangle indicates glandular trichome and yellow triangle indicates non-glandular trichome. Scale bars: 500μm. (B) Distribution of trichomes in the stem. Scale bars: 500μm. (C) Glandular trichome (GT) density difference on adaxial surface between young and old leaves. Statistical differences were examined by two-tail method of unpaired Student’s t-test (n=3). Symbols denote statistical significance (** , <0.01).
3.2 Flavonoids quantification in different tissues of A argyi
The content of eupatilin, jaceosidin, and their putative intermediates was determined in four different tissues selected based on the differences in trichome distribution and density to identify their biosynthetic genes showing a correlation between the gene expression levels and the metabolite contents (Figure 3). The flavonoid content in the known biosynthetic pathway from naringenin chalcone to luteolin varied among tissues, with stem samples showing the lowest content (Figure 3B). In a newly proposed pathway from luteolin to eupatilin and jaceosidin via chrysoeriol and batatifolin, the putative intermediates (chrysoeriol and batatifolin) and final products (eupatilin and jaceosidin) showed significantly higher concentrations in young leaves and trichomes than in old leaves and trichome removed stems, respectively (Figure 3C). All of the flavonoids involved in the pathway from luteolin to jaceosidin and eupatilin showed the lowest content in the stem where trichomes were completely removed (Figure 3C).
Figure 3 Proposed biosynthetic pathway for eupatilin and jaceosidin. (A) Basic structure of flavone. (B) Previously established biosynthetic pathway towards flavone basic skeletons, apigenin and luteolin. CHI; Chalcone-flavanone isomerase, FNS; Flavone synthase, F3´H; Flavanone-3´-hydroxylase. (C) Proposed remaining steps for eupatilin and jaceosidin biosynthesis starting from luteolin. F3’OMT; Flavonoid-3´-O-methyltransferase, F6H; Flavone-6-hydroxylase, F6OMT; Flavonoid-6-O-methyltransferase, F6/4’OMT; Flavonoid- 6/4´-O-methyltransferase. Hydroxylation and O-methylation by corresponding enzymes are shown in blue. YL represents young leaves, OL represents old leaves, TR represents trichomes, and ST represents trichome-removed stem. Numbers in the heatmap boxes represent the measured amount of the metabolite in mg/g dry weight of sample. The heatmap represents the relative abundance of the metabolite in each tissue as a percentage, relative to its maximum amount detected in any tissue. Different letters within rows indicate significant difference based on one-way ANOVA and Tukey’s HSD test (n=3, p < 0.05).
3.3 RNA-seq, de novo assembly, and annotation
To identify the genes responsible for the varying flavonoid content in different parts of A. argyi, we obtained the transcriptomes of four different tissue samples using next-generation sequencing. Prior to de novo assembly, we pre-processed a total of 448,665,104 reads generated from 12 paired-end libraries, which had a combined length of 67.7 Gb. After DynamicTrim and LengthSort filtering, the average length and ratio of trimmed to raw reads were 129.91 bp and 82.31%, respectively, resulting in a total of 429,154,582 clean reads with a total length of 55.7 Gb. These reads were assembled into 110,940 transcripts, of which 41,398 were selected as representative transcripts (unigenes) with an average length of 1,258 bp and an average mapping rate of 95.24%.
The highest number of assembled transcripts was found in the range of 500–1,000 bp for both the total (47,453) and representative (21,417) transcripts, and this number decreased as the length of the assembled transcripts increased (Supplementary Table S1). Approximately 71.63% of the unigenes (29,654) were successfully annotated in two public databases: NCBI-NR (Viridiplantae DB) and InterProScan. A summary of the RNA-seq data processing performed in this study is presented in Table 1.
3.4 Analysis of differentially expressed genes among different tissues of A argyi
Considering the differences in flavonoid content and anatomical characteristics among the four tissue samples, we examined two comparison groups: young leaves vs. old leaves, and trichomes vs. stems, to select differentially expressed genes (DEGs). A total of 3,493 (2,064 upregulated and 1,429 downregulated) and 5,157 DEGs (2,335 upregulated and 2,822 downregulated) were identified in young leaves vs old leaves, and trichomes vs stems, respectively (Supplementary Figure S2). The two sets of DEGs were functionally categorized using KEGG pathways. For both sets, the seven major classifications were “carbohydrate metabolism,” “signal transduction,” “biosynthesis of other secondary metabolites,” “environmental adaptation,” “lipid metabolism,” “amino acid metabolism,” and “metabolism of terpenoids and polyketides” (Supplementary Figure S3A). As our primary goal was related to secondary metabolites, we further focused on the “biosynthesis of other secondary metabolites” category. We confirmed that most genes related to phenylpropanoid, flavonoid, flavone, and flavonol biosynthesis were significantly and differentially expressed in both comparisons (Supplementary Figure S3B).
To examine the functional relationships between the DEGs and their associated biological processes, we mapped the DEGs to GO terms (Supplementary Figure S4). In the GO analysis of the “biological process” category, the most enriched term in the up-regulated DEGs of the young leaves vs old leaves group was “cell wall organization or biogenesis.” Conversely, the top enriched GO terms in the down-regulated DEGs of the young leaves vs old leaves group were strongly related to the response to various endogenous and external stimuli, such as “response to chemical” and “response to external biotic stimulus” (Supplementary Figure S4A). For the GO analysis of the “biological process” category of trichome vs stem DEGs, the most enriched term in the up-regulation was “secondary metabolic process” (Supplementary Figure S4B). Its relevant terms derived from the general phenylpropanoid pathway were particularly enriched, including “flavonoid biosynthetic process,” “phenylpropanoid biosynthetic process,” and “anthocyanin-containing compound biosynthetic process.”
3.5 Identification of candidate genes involved in eupatilin and jaceosidin biosynthesis
To identify putative key genes encoding enzymes for eupatilin and jaceosidin production in A. argyi, we first identified flavonoid-related genes among the DEGs. Based on functional annotation, we identified 153 flavonoid pathway-related genes among the 7,265 DEGs. These 153 genes were divided into four gene families:121 genes for Cytochrome P450 (CYP450), 22 genes for O-methyltransferases (OMT), five genes for chalcone isomerase (CHI), and five genes for chalcone synthase (CHS). The 153 flavonoid-related genes and nine flavonoids analyzed in this study were subjected to hierarchical clustering analysis based on Z-scores across the four different tissues, resulting in their classification into nine clusters (Figure 4). All five flavonoids (eupatilin, jaceosidin, and their putative intermediates luteolin, chrysoeriol, and batatifolin) were grouped into ‘cluster 9’ with 17 flavonoid-related genes, including 10 CYP450, 5 OMT, and 2 CHS genes (Figure 4, Table 2).
Figure 4 Clustering analysis of 153 flavonoid-related genes and 9 metabolites. Hierarchical clustering heat map of 153 flavonoid-related gene expression and content of 9 metabolites from four different tissues based on the z-score of either gene: Log2(normalized count+1) or metabolite: Log2(amount of metabolite+1). Color classification for gene family and metabolite is as follows, CYP450; Cytochrome P450 (light green), OMT; O-methyltransferase (violet), Metabolite includes naringenin, chalcone, naringenin, apigenin, eriodictyol, luteolin, chrysoeriol, batatifolin, eupatilin and jaceosidin (amber), CHI; Chalcone-flavanone isomerase (yellow), CHS; Chalcone synthase (cyan). YL represents young leaves, OL represents old leaves, TR represents trichomes, and ST represents trichome-removed stem.
To infer the potential functions of the 10 genes annotated as CYP450 in ‘cluster 9’, and to pinpoint flavone-6-hydorxylase (F6H) genes, we conducted a phylogenetic analysis. For comparison, we included previously characterized flavonoid biosynthetic CYP450 (F3’H, F6H, and FNSII), 2-oxoglutarate-dependent dioxygenase (FNSI), and irrelevant CYP450 family member, CYP80 as outgroup (Figure 5A). As a result, eight genes (DN1889_c0_g1_i4, DN16156_c0_g1_i5, DN29835_c0_g1_i2, DN18125_c0_g1_i2, DN7805_c0_g2_i2, DN8530_c0_g1_i1, DN17144_c0_g1_i1, and DN29695_c0_g1_i5) were classified as F6H genes required for the transition from chrysoeriol to batatifolin. Two other genes, DN8695_c0_g1_i12 and DN11069_c0_g1_i1, grouped with FNS I and FNS II, respectively, to convert naringenin to apigenin and/or eriodictyol to luteolin.
Figure 5 Phylogenetic analysis of flavonoid-related genes identified in ‘cluster 9’. The support values were obtained after 1,000 tests and indicated near the nodes. Protein sequences of previously known (A) Cytochrome P450 and 2-oxoglutarate-dependent dioxygenase and (B) O-methyltransferase genes were obtained from the NCBI database. The protein ID and the derived species information are enclosed in parentheses.
In our proposed pathway, OMTs are the key enzymes for the biosynthesis of eupatilin and jaceosidin. F3’OMT is required for the conversion of luteolin to chrysoeriol, and F4’OMT, F6OMT, or F6/4’OMT are required for the further O-methylation of batatifolin towards eupatilin and jaceosidin production (Figure 3C). To reveal which OMTs found in ‘cluster 9’ might function as F3’OMT, F4’OMT, F6OMT, or F6/4’OMT, phylogenetic analysis was performed with known OMTs (F3’OMT, F4’OMT, F6OMT, F6/4’OMT, and F7OMT). As a result, four OMT genes (DN1088_c0_g1_i6, DN1088_c0_g2_i1, DN9271_c0_g1_i2, and DN18001_c0_g2_i3) were classified as F3’OMT, and one gene (DN6529_c0_g2_i1) was classified as F4’OMT and/or F6OMT (Figure 5B).
To further investigate the association between genes and metabolites in ‘cluster 9’, we used the correlation values among 7,265 DEGs and metabolites obtained from weighted gene co-expression network analysis (WGCNA). Of the 17 genes in ‘cluster 9’, 16 (except DN18001_c0_g2_i3) were significantly associated with at least one of the five metabolites involved in the proposed pathway from luteolin to eupatilin and jaceosidin production (Figure 6). On average, these 17 genes were significantly associated with 2.6 metabolites. Two CYP450 genes (DN11069_c0_g1_i1 and DN17144_c0_g1_i1) were significantly associated with all five metabolites, whereas four genes (CHS: DN6752_c0_g1_i1, CYP450: DN29695_c0_g1_i5, CYP450: DN29835_c0_g1_i2, and CHS: DN10373_c0_g1_i1) were significantly associated with all four metabolites.
Figure 6 Hierarchical clustering heat map and bubble plots of putative key genes for jaceosidin and eupatilin biosynthesis. (A) Hierarchical clustering heat map of 17 genes and 5 metabolites derived from ‘cluster 9’ of Figure 4. Phylo* indicates putative functions of genes inferred through phylogenetic tree analysis (Figure 5A, 5B). Z-score calculated based on Log2(normalized count +1) or Log2(amount of metabolite+1). (B) Bubble plots showing an association between gene and 9 metabolites derived from Weighted gene co-expression network analysis (WGCNA). YL represents young leaves, OL represents old leaves, TR represents trichomes, and ST represents trichome-removed stem. The bubble plot shows the strength of the correlation (r) between two variables, represented by the size of each bubble. Larger bubbles indicate stronger correlations. The color of each bubble indicates the associated p-value.
4 Discussion
4.1 Characteristics of trichomes in A argyi and their effect on metabolite content differences
In plants, trichomes can be divided into glandular and non-glandular types based on the presence or absence of glandular heads (Werker, 2000). Glandular trichomes are responsible for the synthesis, storage, and secretion of phytochemicals, and their distribution and density are linked to phytochemical production in various plants (Glas et al., 2012; Tissier, 2012; Cui et al., 2022). In contrast, non-glandular trichomes lack secretory mechanisms and mainly function in physical defense (Yamasaki and Murakami, 2014; Xiao et al., 2016).
Our measurements revealed that the eupatilin and jaceosidin contents were considerably higher in trichomes than in trichome-removed stems, indicating that these phytochemicals mainly accumulated in the trichomes of A. argyi. Additionally, eupatilin and jaceosidin were more abundant in young than in old leaves, likely owing to the higher density of glandular trichomes in developing young leaves (Figures 2, 3). This interpretation is supported by a previous study showing that glandular trichomes are major accumulation sites for flavonoids in A. argyi (Cui et al., 2022).
4.2 Proposed eupatilin and jaceosidin biosynthesis pathway and candidate genes
Despite the medicinal importance of eupatilin and jaceosidin, their biosynthetic pathways have not yet been clearly elucidated in A. argyi. Determining the biosynthetic pathways of these natural products is challenging. However, insights into the necessary biochemical and enzymatic reactions can be gained from the chemical structures of the compounds of interest in their biosynthetic grids (Berim et al., 2012). For the production of eupatilin and jaceosidin, at least two or one step(s) of hydroxylation, respectively, and two additional steps of O-methylation are required, starting from the basic flavone skeletons, apigenin, or luteolin. Recently, a biosynthetic pathway starting from apigenin was proposed, using scutellarein and hispidulin as intermediates (Miao et al., 2022). However, the biosynthesis of hispidulin from scutellarein might be biochemically unfavorable because 6-O-methylation hardly occurs in the non-methoxylated flavone scutellarein (Berim et al., 2012). We propose an alternative biosynthetic route that utilizes chrysoeriol and batatifolin as intermediates (Figure 3C) and excludes biochemically unfavorable reactions. Supporting this proposed pathway, luteolin has been found to be an active substrate for F3´OMT producing chrysoeriol in rice during the biosynthesis of another polymethoxy flavone, tricin (Lam et al., 2021). For the production of eupatilin and jaceosidin, 6-O-methylation catalyzed by F6OMT or bifunctional F6/4´OMT is also indispensable. Thus, chrysoeriol must be further converted into 6-hydroxy chrysoeriol (batatifolin) via F6H, which provides a methyl group acceptor for -OH at the C6 position (Figure 3C). While proposing this alternative biosynthetic pathway, further loss-of-function study in-planta would be necessary to fully elucidate and confirm the biosynthetic pathway for these polymethoxy flavones.
According to the proposed pathways, chrysoeriol, batatifolin, jaceosidin, and eupatilin were observed to have relatively high levels of content in the young leaves, where the density of trichomes was higher than in the old leaves (Figure 3C). Moreover, the content of these substances was significantly higher in the trichomes when compared to the stem without trichomes (Figure 3C). These observations suggest that the biosynthesis or accumulation of these substances is specifically localized in the trichomes, and genes that are highly expressed in the trichomes or young leaves in contrast to the stem without trichomes or old leaves, might play a crucial role in this process. The co-expression of genes involved in the same metabolic pathway is often observed during biosynthetic processes (Saito et al., 2008; Nett et al., 2018), and such associations can be identified using transcriptome and metabolite data (Lau and Sattely, 2015). In our study, we used this approach for association analysis, which resulted in the identification of candidate genes (Figure 4). Further, phylogenetic tree analysis of genes with known functions (Figure 5) allowed us to identify the key genes necessary for eupatilin and jaceosidin biosynthesis in A. argyi. Identifying biosynthetic genes leads to a better understanding of plant biochemistry and enables the mass production of valuable phytochemicals through crop improvement and synthetic biology (Nett et al., 2018). Researchers have successfully co-expressed identified genes in alternative plant hosts (Lau and Sattely, 2015; Nett et al., 2020) and strains (Lau and Sattely, 2015; Nett et al., 2020) for the mass production of natural products. In particular, the biosynthesis of artemisinic acid, a precursor of artemisinin identified in Artemisia annua, was successfully performed in this strain (Paddon et al., 2013).
5 Conclusion
Artemisia argyi is a medicinal perennial plant with historical usage that exhibits pharmacological effects such as anti-inflammatory, anticancer, and antimicrobial activities, which are attributed to various secondary metabolites. In this study, we analyzed the flavonoid content of four different tissues of A. argyi and proposed the biosynthetic pathways of eupatilin and jaceosidin based on metabolite analysis and chemical structures. Through the examination of the chemical structures of the analyzed metabolites, we were able to identify the eupatilin and jaceosidin biosynthetic pathways. We performed de-novo assembly of transcriptome data from these tissues, resulting in 41,398 unigenes. Through a variety of analyses including DEG analysis, KEGG and GO term enrichment analysis, clustering, phylogenetic tree construction, and WGCNA, we identified candidate genes involved in the biosynthesis of eupatilin and jaceosidin. Our work serves as a basis for the development of synthetic biological systems for the mass production of these compounds in the future.
Data availability statement
The data presented in the study are deposited in the NCBI repository, accession number PRJNA945494.
Author contributions
SL, HW, and SB collected and analyzed the data, and wrote the manuscript. YP, SK, HK, JC, and H-YK collected data. JL and JJ designed the study, analyzed the data and wrote the manuscript. All authors contributed to the article and approved the submitted version.
Funding
This research was supported by intramural grants (2Z06831) from the Korea Institute of Science and Technology (KIST).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be constructed as potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1186023/full#supplementary-material
References
Anders, S., Huber, W. (2010). Differential expression analysis for sequence count data. Genome Biol. 11, R106. doi: 10.1186/gb-2010-11-10-r106
Berim, A., Hyatt, D. C., Gang, D. R. (2012). A set of regioselective O-methyltransferases gives rise to the complex pattern of methoxylated flavones in sweet basil. Plant Physiol. 160, 1052–1069. doi: 10.1104/pp.112.204164
Bora, K. S., Sharma, A. (2011). The genus artemisia: a comprehensive review. Pharm. Biol. 49, 101–109. doi: 10.3109/13880209.2010.497815
Chen, H., Guo, M., Dong, S., Wu, X., Zhang, G., He, L., et al. (2023). A chromosome-scale genome assembly of artemisia argyi reveals unbiased subgenome evolution and key contributions of gene duplication to volatile terpenoid diversity. Plant Commun. 100516, 1–16. doi: 10.1016/j.xplc.2023.100516
Cox, M. P., Peterson, D. A., Biggs, P. J. (2010). SolexaQA: At-a-glance quality assessment of illumina second-generation sequencing data. BMC Bioinf. 11, 485. doi: 10.1186/1471-2105-11-485
Cui, Z., Li, M., Han, X., Liu, H., Li, C., Peng, H., et al. (2022). Morphogenesis, ultrastructure, and chemical profiling of trichomes in artemisia argyi h. lev. & vaniot (Asteraceae). Planta 255, 102. doi: 10.1007/s00425-022-03889-0
Falcone Ferreyra, M. L., Rius, S. P., Casati, P. (2012). Flavonoids: biosynthesis, biological functions, and biotechnological applications. Front. Plant Sci. 3, 222. doi: 10.3389/fpls.2012.00222
Frerigmann, H., Bottcher, C., Baatout, D., Gigolashvili, T. (2012). Glucosinolates are produced in trichomes of arabidopsis thaliana. Front. Plant Sci. 3, 242. doi: 10.3389/fpls.2012.00242
Glas, J. J., Schimmel, B. C., Alba, J. M., Escobar-Bravo, R., Schuurink, R. C., Kant, M. R. (2012). Plant glandular trichomes as targets for breeding or engineering of resistance to herbivores. Int. J. Mol. Sci. 13, 17077–17103. doi: 10.3390/ijms131217077
Grabherr, M. G., Haas, B. J., Yassour, M., Levin, J. Z., Thompson, D. A., Amit, I., et al. (2011). Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat. Biotechnol. 29, 644–652. doi: 10.1038/nbt.1883
Han, B., Xin, Z., Ma, S., Liu, W., Zhang, B., Ran, L., et al. (2017). Comprehensive characterization and identification of antioxidants in folium artemisiae argyi using high-resolution tandem mass spectrometry. J. Chromatogr B Analyt Technol. BioMed. Life Sci. 1063, 84–92. doi: 10.1016/j.jchromb.2017.08.021
Ivanescu, B., Burlec, A. F., Crivoi, F., Rosu, C., Corciova, A. (2021). Secondary metabolites from artemisia genus as biopesticides and innovative nano-based application strategies. Molecules 26, 1–42. doi: 10.3390/molecules26103061
Kshirsagar, S. G., Rao, R. V. (2021). Antiviral and immunomodulation effects of artemisia. Medicina (Kaunas) 57, 1–12. doi: 10.3390/medicina57030217
Kumar, S., Stecher, G., Li, M., Knyaz, C., Tamura, K. (2018). MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549. doi: 10.1093/molbev/msy096
Lam, P. Y., Lui, A. C. W., Wang, L., Liu, H., Umezawa, T., Tobimatsu, Y., et al. (2021). Tricin biosynthesis and bioengineering. Front. Plant Sci. 12, 733198. doi: 10.3389/fpls.2021.733198
Lan, M. B., Zhang, Y. H., Zheng, Y., Yuan, H. H., Zhao, H. L., Gao, F. (2010). Antioxidant and immunomodulatory activities of polysaccharides from moxa (Artemisia argyi) leaf. Food Sci. Biotechnol. 19, 1463–1469. doi: 10.1007/s10068-010-0209-5
Langfelder, P., Horvath, S. (2008). WGCNA: an r package for weighted correlation network analysis. BMC Bioinf. 9, 559. doi: 10.1186/1471-2105-9-559
Langmead, B., Salzberg, S. L. (2012). Fast gapped-read alignment with bowtie 2. Nat. Methods 9, 357–359. doi: 10.1038/nmeth.1923
Lau, W., Sattely, E. S. (2015). Six enzymes from mayapple that complete the biosynthetic pathway to the etoposide aglycone. Science 349, 1224–1228. doi: 10.1126/science.aac7202
Lee, J. H., Lee, J. W., Sung, J. S., Bang, K. H., Moon, S. G. (2009). Molecular authentication of 21 Korean artemisia species (Compositae) by polymerase chain reaction-restriction fragment length polymorphism based on trnL-f region of chloroplast DNA. Biol. Pharm. Bull. 32, 1912–1916. doi: 10.1248/bpb.32.1912
Lee, H. G., Yu, K. A., Oh, W. K., Baeg, T. W., Oh, H. C., Ahn, J. S., et al. (2005). Inhibitory effect of jaceosidin isolated from artemisiaargyi on the function of E6 and E7 oncoproteins of HPV 16. J. Ethnopharmacol 98, 339–343. doi: 10.1016/j.jep.2005.01.054
Liu, M., Zhu, J., Wu, S., Wang, C., Guo, X., Wu, J., et al. (2018). De novo assembly and analysis of the artemisia argyi transcriptome and identification of genes involved in terpenoid biosynthesis. Sci. Rep. 8, 1–10. doi: 10.1038/s41598-018-24201-9
Lv, J. L., Li, Z. Z., Zhang, L. B. (2018). Two new flavonoids from artemisia argyi with their anticoagulation activities. Nat. Prod Res. 32, 632–639. doi: 10.1080/14786419.2017.1332603
Martin, M. (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal 17, 1–3. doi: 10.14806/ej.17.1.200
Mei, Q., Chen, X., Xiang, L., Liu, Y., Su, Y., Gao, Y., et al. (2016). DNA Barcode for identifying folium artemisiae argyi from counterfeits. Biol. Pharm. Bull. 39, 1531–1537. doi: 10.1248/bpb.b16-00336
Miao, Y., Luo, D., Zhao, T., Du, H., Liu, Z., Xu, Z., et al. (2022). Genome sequencing reveals chromosome fusion and extensive expansion of genes related to secondary metabolism in artemisia argyi. Plant Biotechnol. J. 20, 1902–1915. doi: 10.1111/pbi.13870
Nett, R. S., Guan, X., Smith, K., Faust, A. M., Sattely, E. S., Fischer, C. R. (2018). D(2)O labeling to measure active biosynthesis of natural products in medicinal plants. AIChE J. 64, 4319–4330. doi: 10.1002/aic.16413
Nett, R. S., Lau, W., Sattely, E. S. (2020). Discovery and engineering of colchicine alkaloid biosynthesis. Nature 584, 148–153. doi: 10.1038/s41586-020-2546-8
Paddon, C. J., Westfall, P. J., Pitera, D. J., Benjamin, K., Fisher, K., Mcphee, D., et al. (2013). High-level semi-synthetic production of the potent antimalarial artemisinin. Nature 496, 528–532. doi: 10.1038/nature12051
Saito, K., Hirai, M. Y., Yonekura-Sakakibara, K. (2008). Decoding genes with coexpression networks and metabolomics - ‘majority report by precogs’. Trends Plant Sci. 13, 36–43. doi: 10.1016/j.tplants.2007.10.006
Schilmiller, A. L., Last, R. L., Pichersky, E. (2008). Harnessing plant trichome biochemistry for the production of useful compounds. Plant J. 54, 702–711. doi: 10.1111/j.1365-313X.2008.03432.x
Seo, J. M., Kang, H. M., Son, K. H., Kim, J. H., Lee, C. W., Kim, H. M., et al. (2003). Antitumor activity of flavones isolated from artemisia argyi. Planta Med. 69, 218–222. doi: 10.1055/s-2003-38486
Seo, H. J., Surh, Y. J. (2001). Eupatilin, a pharmacologically active flavone derived from artemisia plants, induces apoptosis in human promyelocytic leukemia cells. Mutat. Res. 496, 191–198. doi: 10.1016/S1383-5718(01)00234-0
Seol, S. Y., Kim, M. H., Ryu, J. S., Choi, M. G., Shin, D. W., Ahn, B. O. (2004). DA-9601 for erosive gastritis: results of a double-blind placebo-controlled phase III clinical trial. World J. Gastroenterol. 10, 2379–2382. doi: 10.3748/wjg.v10.i16.2379
Song, X. W., Wen, X., He, J. W., Zhao, H., Li, S. M., Wang, M. Y. (2019). Phytochemical components and biological activities of artemisia argyi. J. Funct. Foods 52, 648–662. doi: 10.1016/j.jff.2018.11.029
Su, X. Z., Miller, L. H. (2015). The discovery of artemisinin and the Nobel prize in physiology or medicine. Sci. China Life Sci. 58, 1175–1179. doi: 10.1007/s11427-015-4948-7
Team, R. C. (2023). R: a language and environment for statistical computing (Vienna, Austria: R Foundation for Statistical Computing).
Tissier, A. (2012). Glandular trichomes: what comes after expressed sequence tags? Plant J. 70, 51–68. doi: 10.1111/j.1365-313X.2012.04913.x
Tissier, A., Morgan, J. A., Dudareva, N. (2017). Plant volatiles: going ‘In’ but not ‘Out’ of trichome cavities. Trends Plant Sci. 22, 930–938. doi: 10.1016/j.tplants.2017.09.001
Tohge, T., Borghi, M., Fernie, A. R. (2018). The natural variance of the arabidopsis floral secondary metabolites. Sci. Data 5, 180051. doi: 10.1038/sdata.2018.51
Wang, W., Zhang, X.-K., Wu, N., Fu, Y.-J., Zu, Y.-G. (2006). Antimicrobial activities of essential oil from artemisiae argyi leaves. J. Forestry Res. 17, 332–334. doi: 10.1007/s11676-006-0077-2
Werker, E. (2000). Trichome diversity and development. Adv. Botanical Res. Incorporating Adv. Plant Pathology Vol 31 2000 31, 1–35. doi: 10.1016/S0065-2296(00)31005-9
Xiao, K., Mao, X., Lin, Y. (2016). Trichome, a functional diversity phenotype in plant. Mol. Biol. s1, 1–6. doi: 10.4172/2168-9547.1000183
Yamasaki, S., Murakami, Y. (2014). Continuous UV-b irradiation induces endoreduplication and trichome formation in cotyledons, and reduces epidermal cell division and expansion in the first leaves of pumpkin seedlings (Cucurbita maxima Duch.^|^times;C. moschata duch.). Environ. Control Biol. 52, 203–209. doi: 10.2525/ecb.52.203
Yonekura-Sakakibara, K., Higashi, Y., Nakabayashi, R. (2019). The origin and evolution of plant flavonoid metabolism. Front. Plant Sci. 10, 943. doi: 10.3389/fpls.2019.00943
Yoshikawa, M., Shimada, H., Matsuda, H., Yamahara, J., Murakami, N. (1996). Bioactive constituents of Chinese natural medicines. i. new sesquiterpene ketones with vasorelaxant effect from Chinese moxa, the processed leaves of artemisia argyi levl. et vant.: moxartenone and moxartenolide. Chem. Pharm. Bull. (Tokyo) 44, 1656–1662. doi: 10.1248/cpb.44.1656
Yun, C., Jung, Y., Chun, W., Yang, B., Ryu, J., Lim, C., et al. (2016). Anti-inflammatory effects of artemisia leaf extract in mice with contact dermatitis In vitro and In vivo. Mediators Inflamm. 2016, 8027537. doi: 10.1155/2016/8027537
Zhang, B., Horvath, S. (2005). A general framework for weighted gene co-expression network analysis. Stat. Appl. Genet. Mol. Biol. 4, Article17. doi: 10.2202/1544-6115.1128
Keywords: Artemisia argyi, flavonoid, eupatilin, jaceosidin, transcriptome, biosynthetic pathway
Citation: Lee S, Won HJ, Ban S, Park YJ, Kim SM, Kim HS, Choi J, Kim H-Y, Lee JH and Jung JH (2023) Integrative analysis of metabolite and transcriptome reveals biosynthetic pathway and candidate genes for eupatilin and jaceosidin biosynthesis in Artemisia argyi. Front. Plant Sci. 14:1186023. doi: 10.3389/fpls.2023.1186023
Received: 14 March 2023; Accepted: 10 April 2023;
Published: 25 April 2023.
Edited by:
Jong-Seong Jeon, Kyung Hee University, Republic of KoreaReviewed by:
Sun-Hyung Lim, Hankyong National University, Republic of KoreaSang-Ho Kang, National Institute of Agricultural Science, Republic of Korea
Copyright © 2023 Lee, Won, Ban, Park, Kim, Kim, Choi, Kim, Lee and Jung. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jae Hoon Lee, jaehoon@snu.ac.kr; Je Hyeong Jung, jhjung@kist.re.kr
†These authors have contributed equally to this work and share first authorship