Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 29 September 2021
Sec. Plant Physiology
This article is part of the Research Topic Light-Mediated Regulation of Plant Physiology View all 13 articles

Genome-Wide Analysis of Light-Regulated Alternative Splicing in Artemisia annua L.

\r\nTingyu Ma,&#x;Tingyu Ma1,2†Han Gao,&#x;Han Gao1,3†Dong Zhang,,&#x;Dong Zhang1,4,5†Wei SunWei Sun1Qinggang YinQinggang Yin1Lan WuLan Wu1Tianyuan ZhangTianyuan Zhang6Zhichao XuZhichao Xu2Jianhe WeiJianhe Wei7Yanyan SuYanyan Su8Yuhua ShiYuhua Shi1Dandan DingDandan Ding1Ling YuanLing Yuan9Gangqiang DongGangqiang Dong8Liang Leng*Liang Leng1*Li Xiang,*Li Xiang1,9*Shilin Chen*Shilin Chen1*
  • 1Key Laboratory of Beijing for Identification and Safety Evaluation of Chinese Medicine, Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
  • 2Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People’s Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
  • 3School of Life Sciences, Central China Normal University, Wuhan, China
  • 4College of Agriculture, South China Agricultural University, Guangzhou, China
  • 5Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou, China
  • 6State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, China
  • 7Hainan Provincial Key Laboratory of Resources Conservation and Development of Southern Medicine, Hainan Branch of the Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Haikou, China
  • 8Amway (China) Botanical R&D Center, Wuxi, China
  • 9Department of Plant and Soil Sciences, Kentucky Tobacco Research and Development Center, University of Kentucky, Lexington, KY, United States

Artemisinin is currently the most effective ingredient in the treatment of malaria, which is thus of great significance to study the genetic regulation of Artemisia annua. Alternative splicing (AS) is a regulatory process that increases the complexity of transcriptome and proteome. The most common mechanism of alternative splicing (AS) in plant is intron retention (IR). However, little is known about whether the IR isoforms produced by light play roles in regulating biosynthetic pathways. In this work we would explore how the level of AS in A. annua responds to light regulation. We obtained a new dataset of AS by analyzing full-length transcripts using both Illumina- and single molecule real-time (SMRT)-based RNA-seq as well as analyzing AS on various tissues. A total of 5,854 IR isoforms were identified, with IR accounting for the highest proportion (48.48%), affirming that IR is the most common mechanism of AS. We found that the number of up-regulated IR isoforms (1534/1378, blue and red light, respectively) was more than twice that of down-regulated (636/682) after treatment of blue or red light. In the artemisinin biosynthetic pathway, 10 genes produced 16 differentially expressed IR isoforms. This work demonstrated that the differential expression of IR isoforms induced by light has the potential to regulate sesquiterpenoid biosynthesis. This study also provides high accuracy full-length transcripts, which can be a valuable genetic resource for further research of A. annua, including areas of development, breeding, and biosynthesis of active compounds.

Introduction

The main natural source of Artemisinin, a sesquiterpene lactone with peroxide bridge, is the medicinal herb Artemisia annua (Miller and Su, 2011; Tu, 2011). Artemisinin is famous for its role in artemisinin combination therapies (ACT), the most effective malaria treatment currently available (Czechowski et al., 2016). Development of new uses of artemisinin and its derivatives, e.g., dihydroartemisinin in the treatment of systemic lupus erythematosus, is still ongoing. These efforts show some potential in anti-tumor, anti-parasitic, anti-fibrosis, and anti-arrhythmic effects (Bin and Hong, 2010). As a result, the demand for artemisinin and its derivatives will increase significantly (Peplow, 2013). Although biosynthesis and chemical synthesis of Artemisinin have seen significant progress in recent years5,6, the final steps in the conversion of dihydroartemisinic acid (DHAA) to artemisinin are still non-enzymatic and require extracellular treatments (Graham et al., 2010; Czechowski et al., 2016). For example, in vitro biosynthesis of artemisinin is still not possible because yeast cannot provide the special oil oxidization environment required for the synthesis of artemisinin (Paddon et al., 2013; Paddon and Keasling, 2014). Thus, the market supply of artemisinin is currently relied on extraction from A. annua (Paddon et al., 2013; Ma et al., 2018; Triemer et al., 2018). Great efforts, particularly in molecular analysis, are needed to understand the biology of A. annua in order to improve artemisinin content of the plant and to reduce production cost.

Transcriptional regulation of genes, such as terpenoid synthase, is central to artemisinin biosynthesis (Nagegowda et al., 2008; Nagegowda, 2010; Tan et al., 2015). Thus, research efforts have focused on the A. annua transcriptome. Alternative splicing (AS) plays a fundamental role in eukaryote gene regulation (Nilsen and Graveley, 2010; Reddy et al., 2013; Staiger and Brown, 2013). During the process of precursor mRNA maturation, AS produces multiple mature mRNA from the same gene, which then results in the production of different proteins. The alternative splicing of precursor RNA with multiple exons increases the diversity of proteins encoded by organisms and the complexity of regulating gene expression (Brett et al., 2002; Matlin et al., 2005; Keren et al., 2010; Nilsen and Graveley, 2010). In human, nearly all multi-exon genes are alternatively spliced (Pan et al., 2008; Wang et al., 2008), while in plants the corresponding proportion is over 60% (Reddy et al., 2013). AS can occur through different mechanisms, e.g., exon skipping, alternative 3′/5′ splice sites, or intron retention (IR) (Reddy et al., 2013). In contrast to human, in which exon skipping accounts for the largest proportion of alternative splicing products, plants tend to use IR as the main mechanism of AS (Kim and Yin, 2005; Keren et al., 2010; Reddy et al., 2013). For example, the IR rates in Arabidopsis and rice are as high as 64.1 and 55%, respectively (Zhang et al., 2016). AS also participates in the process of antibiotic stress response (Staiger and Brown, 2013; Filichkin et al., 2015). Plants produce secondary metabolites to defend against biological or abiotic stress, e.g., light, plant hormones, temperature, saline and drought stresses (Li et al., 2007; Yu Z. X. et al., 2012; Shen et al., 2016; Vashisth et al., 2018; Lopes et al., 2019). Specifically, the expression of isoforms produced by IR splicing can be regulated depending on source of stress (Filichkin et al., 2010; Abdel-Ghany et al., 2016). For example, light is an important environmental factor that regulates plant physiological adaptation and influences the synthesis of plant secondary metabolites in plant. Light regulation has been demonstrated to promote the accumulation of artemisinin, anthocyanins, ginsenoside and charantin (Cuong et al., 2017; Kochan et al., 2019; Lopes et al., 2019; Sun et al., 2020). Light exposure immediately induced IR, with many alternatively spliced transcripts expressed from genes with functions related to light signaling, suggesting a potential impact on pre-mRNA splicing and photomorphogenic gene regulation in response to light (Petrillo et al., 2014a, b; Godoy Herz et al., 2019).

In recent years high-throughput sequencing technology has proven to be a powerful strategy to study medicinal plant genome, full-length transcriptomes and AS (Wang et al., 2009; Martins et al., 2011; Ozsolak and Milos, 2011; Song et al., 2018; Liu et al., 2021). The main advantage of the most popular high throughput sequencing technology used for transcriptomics, i.e., Next Generation Sequencing, is the ability to study all expressed genes simultaneously. However, current NGS technology suffers from insufficient read length (100–300 bp currently), and the inability to accurately construct full-length splice variants (Pan et al., 2008; Au et al., 2013). The third generation sequencing (TGS) technology, e.g., SMRT sequencing from the PacBio platform (Pacific Biosciences of California, Inc.,1), produces a long sequence with much longer read length >8 kb (Au et al., 2010; Mason and Elemento, 2012; Rhoads and Au, 2015). A major drawback of TGS technology, compared with NGS technology, is the high error rate (Au et al., 2010; Koren et al., 2012; Jiao et al., 2013). Two alternatives may be helpful to overcome this problem. First, the length of raw pacbio read is several times longer than the length of most genes; thus, the circular consensus sequence (CCS) mode of pacbio technology could enable a single transcript (cDNA) to be sequenced by multiple passes, and sequencing error of the raw TGS read could be corrected since they were randomly generated (Sharon et al., 2013; Rhoads and Au, 2015). Second, hybrid sequencing, i.e., integrating of NGS sequencing and TGS sequencing technology, could also be applied to the correct the error of PacBio reads (Au et al., 2010, 2013). TGS sequencing technology is widely used in plant AS studies, e.g., Salvia miltiorrhiza (Xu et al., 2015), Zea mays (Thatcher et al., 2014), Fragaria ananassa (Li et al., 2018), Scutellaria baicalensis (Gao T. et al., 2019) and Andrographis paniculata (Gao H. et al., 2019).

Considering the importance of Artemisia annua biology and recent progress in TGS sequencing, we sequenced multiple tissues of Artemisia annua to investigate its alternative splicing landscape. Together with current genome assembly and annotation (Shen et al., 2018), in this work we aimed to supply a useful resource to the Artemisia annua community with a new genome wide isoform set. Previous studies have shown that different light conditions can affect artemisinin synthesis and accumulation. However, the mechanism by which light regulates artemisinin synthesis is unclear. Many transcription factors have previously been found to regulate artemisinin synthesis at the transcriptional regulatory level, but it is not clear whether AS can affect artemisinin synthesis. Herein, the classification and gene-expression of AS isoforms under different light conditions could provide novel insights in the molecular mechanism of light’s influence on artemisinin synthesis.

Materials and Methods

Plant Materials Preparation

The variety of A. annua used in this study is a wild type variety originally collected in Hainan (19°56′44″N, 110°13′23″W, 2015) and cultured in Guangxi “Seed cultivation and planting base of ecological of A. annua” (25°13′2″N, 109°24′39″W). The seeds were then transplanted to Beijing (40°20′17.8″N, 116°33′40″W) in March 2018 from Guangxi and preserved in the Artemisinin Research Center, China Academy of Chinese Medical Sciences. The seeds were cultured in white light at 25°C for 4 weeks. The light intensity of white light was 50 ± 5 μmol/m2s, and the photoperiod was 16/8 h (day/night). Then, the seedlings were transferred to four continuous light treatments for 2 days. After different light treatments, we collected the aboveground parts of 10 seedlings randomly as a sample. The light intensity was 50 ± 5 μmol/m2s in all the treatments, including light-emitting diode (LED) red light (wavelength 670 nm), LED blue light (wavelength 470 nm), and LED far-red light (wavelength 735 nm) (Zhang et al., 2017). For the purposes of full-length transcriptome and Illumina short reads analysis, samples of A. annua were obtained from four different organs of plant: young leaf, stem, open flower, and root. Each plant organs collected three biological samples for Illumina short reads analysis.

mRNA-seq Library Preparation and Sequencing

The total RNA from each sample for RNA-seq were extracted from seedlings using RNAprep Pure Plant Kit (Tiangen, Beijing, China) following the manufacturer’s instructions (Zhang et al., 2017). Total RNA in this paper was used for Illumina Hi-seq 2500 platform. We prepared total RNA to construct the sequencing libraries using NEBNext® UltraTM RNA Library Prep Kit for Illumina® (NEB, MA, United States) with 28S/18S RNA ratio ≥ 1.8, and a minimum integrity number (RIN) value of 8.5 and 250–300 bp insertion element. The integrity of RNA was assessed using the Agilent 2100 Bioanalyzer (Agilent Technologies, CA, United States). Detection of RNA samples, including 1% agarose electrophoresis, was performed to detect degradation and genomic DNA contamination; Nanodrop detects the purity of the RNA (OD260/280 ratio); Qubit quantifies RNA precisely.

In the reaction of synthesizing cDNA, the SMART primers of 3′ terminal oligo (dG) were added beforehand, and reverse transcribed from the RNA3′ terminal with oligo dT as primer. Touching the “hat structure” (methylated G) of eukaryote mRNA at the 5′ end of mRNA adds several (dC) to the end of cDNA. The oligo (dG) of the SMART primer is paired with several C protruding at the end of the synthesized cDNA to form an extension template of the cDNA. The reverse transcriptase automatically converts the template, and the SMART primer continues to extend the cDNA single strand until the end of the primer. In this way, the cDNA contains the starting primer (oligo dT) and SMART primers, which can be amplified by universal primer sequence. The full-length cDNA from pooled poly (A) RNA samples was normalized and subjected to SMRT sequencing using the PacBio RS platform (Novogene, Beijing, China). The full-length cDNA from polymerized (A) RNA samples was normalized and SMRT sequenced by PacBio RS platform (Beijing, China).

Preprocessing of Single Molecule Real-Time Long Reads

Three steps were conducted to obtain the final SMRT long reads for subsequent analysis. Firstly, circular consensus sequence (CCS) reads were generated from raw subreads using the Pacific Biosciences SMRT-Analysis pipeline version 2.2.0 (Berlin et al., 2015; Ardui et al., 2018). Secondly, full-length reads were identified from the CCS reads by running hmmer_wrapper.py. To define full-length reads, hmmer_wrapper.py requires both 5′ and 3′ primer sequences, as well as a poly (A) tail. The primer sequences and poly (A) tail sequence were then trimmed off prior to further analysis.2; finally, PE150 short reads, generated using an Illumina HiSeq 2500, were used to correct full-length reads (PBcR) (Koren et al., 2012). Since this correction step was resource-consuming, we randomly down-sampled illumina short RNA-seq reads to about 12 Gb per tissue for correction. The quality of final full-length reads was assessed by mapping reads to the reference genome using STAR version 2.5.2a (Dobin et al., 2013).

Isoform Detection and Prediction and Alternative Splicing Type Identification

The SpliceMap-isoform detection and prediction (IDP) pipeline (Au et al., 2013) was used to detect and predict isoforms in four tissues, separately. We initially used SpliceMap to detect the junctions of Illumina short reads obtained from stem, leaf, flower and root samples, and then used IDP to detect and predict isoforms by integrating both Illumina short reads and SMRT long reads. These Illumina RNA-seq reads were all used for SpliceMap analysis, and both full-length reads and corrected full-length reads were merged for IDP. The IDP results indicate the sites where isoforms are located on the genome. We followed our previous work to identify alternative splicing events (Gao H. et al., 2019). Briefly, we used the original annotated genes as reference isoform (hereinafter ref isoform). Original annotation files were downloaded from https://www.ncbi.nlm.nih.gov/genome/?term=Artemisia+annua (Shen et al., 2018). If a gene already has more than one transcript or isoform, we defined the longest isoform as the ref isoform and other isoforms as AS isoforms. We then compared the location information of ref isoforms and AS isoforms to identify the type of AS that might occur in a gene.

Isoform Quantification and Analysis of Differentially Expressed Isoforms

All ref and AS isoform sequences were combined into a new fasta-format file and quantified using Kallisto version 0.43 (Bray et al., 2016; Jin et al., 2017). A previously published light-treated RNA-seq data were re-calculated with this new A. annua isoform set (Zhang et al., 2018). Briefly, plant seedlings were treated with different lights, i.e., FR/far red, R/red, B/blue, WL/white and D/Dark. Each treatment comprised three biological replicates (including 10 seedlings per replicate). In total, 45 RNA-seq libraries were analyzed based on the new A. annua isoform set. We also conducted differentially expressed (DE) analysis on FR vs. WL, R vs. WL, B vs. WL as well as WL vs. D, FR vs. D, R vs. D and B vs. D. DE analysis was conducted using R package DEseq2 (Love et al., 2014; Maza, 2016). Genes with adjusted p-value being less than 0.05 were kept as differentially expressed ones.

Analysis of the Intron Retention Isoform Expression Levels of Genes Related to the Artemisinin Biosynthesis Pathway

According to the identified isoforms and their expression, genes involved in artemisinin biosynthesis were manually curated and alternatively spliced isoforms of these genes were selected (Graham et al., 2010; Czechowski et al., 2016, 2018; Ma et al., 2018; Shen et al., 2018). Pheatmap package was used to generate heat maps of isoforms of these genes and the expression patterns of the ref and IR isoforms were analyzed (Kolde, 2015). Enrichment analysis of gene ontology (GO) terms was conducted on differentially expressed IR isoforms using the clusterProfiler package (Yu G. et al., 2012). Functional categories of GO terms with adjusted p-value smaller than 0.05 were considered as significantly enriched.

Results

Summary of Single Molecule Real-Time Sequencing

To identify AS events in A. annua, we applied both Illumina and SMRT RNA-sequencing and analyzed these data in four tissues separately, i.e., leaf, flower, stem and root. We generated ∼15 Gb NGS data and more than 10 Gb TGS data for each tissue (Table 1). For TGS RNA-sequencing data, 7–10 million raw subreads were obtained in each tissue, with median lengths of raw subreads are 1,898 bp, 1,451 bp, 1,093 bp and 1,794 bp in leaf, flower, stem and root, respectively (Table 1). We further generated full length reads from raw subreads (Table 1 and Supplementary Table 1). The number of full-length reads ranges from 341,083 in flower to 412,999 in root, while the median lengths changed to 1,973, 1,956, 2,185, and 1,962 bp in leaf, flower, stem and root, respectively (Table 1 and Supplementary Table 1). We mapped the full length reads to the reference genome and accessed the quality (Supplementary Table 2). Overall, the quality of the reads is high; mapped fractions range from 97.4 to 99.4%, while match rates are around 98 percent. By examining unmapped or deficiently mapped reads manually, we found that a large fraction of these sequences have poly A/G/C/T structure and are therefore difficult to map. We thus corrected these full reads with Illumina short reads. After correction, the number of corrected full-length read ranges from 338,27 in flower to 411,076 in root (Table 1 and Supplementary Table 1). While mapped fraction and mismatch rates only improved by a small proportion, deletion/insertion rate, which may be the result of inaccurate poly-A/G/C/T, decreased (Supplementary Table 2).

TABLE 1
www.frontiersin.org

Table 1. Summary of the sequencing libraries statistics of next generation sequencing (NGS) and third generation transcriptome sequencing (TGS) data in leaf, flowers, stem and root in A. annua.

The New Artemisia annua Isoform Annotation Set

The A. annua genome was assembled and annotated previously (Shen et al., 2018). This version of annotation has 63,226 genes, of which 3,062 genes comprise more than one isoform (Table 2). However, only 159 AS genes involve intron retention (IR), which contrasts with previous observation showing that ∼30% of all AS events are IR (Keren et al., 2010). Thus, we made use of our newly generated full-length reads to update the current A. annua genome annotation and to generate a new A. annua isoform annotation set. We identified 13,328 AS events, 11,832 AS isoforms and 7,210 AS genes. Together with the previously published annotation set, here we generated a new A. annua annotation set, which consists of 63,226 genes and 75,058 transcripts in total.

TABLE 2
www.frontiersin.org

Table 2. Number of AS and IR gene.

We mined out all possible AS events separately from the four tissues, then combined newly identified events and AS events annotated in the current gene annotation to form a final isoform annotation set (Table 2 and Supplementary Table 3). Our sequencing in the four tissues jointly identified 5,155 genes which comprise at least one AS isoform. Together with the current gene annotation, the new A. annua isoform set comprises 7,210 AS genes (Figure 1A and Table 2). Among the 7,210 AS genes, 11,832 AS isoforms were identified (Supplementary Table 3). More than half of all AS genes have only two isoforms, i.e., a reference isoform and a AS isoform. The largest number of isoforms a gene comprises of is 12 (Figure 1B). We further classified all AS events into five types: AA, alternative 3′ splice site; AD, alternative 5′ splice site; ES, exon skipping; IR, intron retention; others. We found a total of 13,328 AS events from 11,832 AS isoforms, while 5,854 AS isoforms involve IR events (Supplementary Tables 3, 4 and Figure 1C). We defined IR genes as genes which bring at least one IR event. Some AS isoforms could consist of multiple AS events or types (Figure 1D); for example, 317 AS isoforms consist of both IR and AA events. Some AS isoforms were independently found in more than one tissue (Figure 1E). For example, 270 AS isoforms were found in all four tissues. The final isoform set consists of 7,210 AS genes, of which 3,895 are IR genes (Table 2).

FIGURE 1
www.frontiersin.org

Figure 1. Identification and classification of gene isoforms in Artemisia annua L. (A) The number and proportion of AS and non-AS gene. (B) The number distribution of alternatively spliced isoforms per gene. (C) Types of alternatively spliced isoforms: AA, alternative 3′ splice site; AD, alternative 5′ splice site; ES, exon skipping; IR, intron retention. (D) The number of isoforms produced by one or more splicing mechanisms. Venn diagram shows the combination of the four types of isoforms. (E) The number of new isoforms found in leaf, flower, stem and root. Venn diagram shows the common and specific isoforms in different tissues.

Differential Expression of Intron Retention Isoforms Under Light Treatments

In the following part we focused on IR genes and isoform expression under light treatments and of artemisinin synthesis pathway related genes. Five light treatments were conducted, each with three biological replicates (Supplementary Table 4). TPM being 1 was chosen as the cutoff to determine whether a gene is expressed or not. The median TPM of expressed IR isoforms ranges from 2.91 under white light treatment to 3.37 under far-red light treatment (Figure 2A), while that of expressed non-AS genes (which has one isoform per gene) ranges from 6.55 under blue to 6.96 under far red (Figure 2B). The fraction of expressed IR isoforms ranges from 30.2% (1,771) under dark to 34.4% (2,013) under red, while that of non-AS gene ranges from 34.7% (19,425) under dark to 39.3% (22,008) in red (Figures 2C,D and Supplementary Table 5).

FIGURE 2
www.frontiersin.org

Figure 2. Expression of intron retention (IR) isoforms and non-alternatively spliced (non-AS) genes after light treatment. Isoforms with a TPM > 1 were defined as expressed ones. (A,B) Box-plots using log (TPM + 1) as the ordinate show TPM distribution of expressed IR isoforms and non-AS genes under five treatments, respectively. (C,D) the number of expressed IR isoforms and non-AS genes, respectively.

We further performed differential expression analysis on seven combinations of light treatment: far red vs. white (FR-WL), red vs. white (R-WL), blue vs. white (B-WL), white vs. dark (WL-D), far red vs. dark (FR-D), red vs. dark (R-D), and blue with dark (B-D). In these comparisons, we observed more up-regulated isoforms than down-regulated ones, and this pattern is largely consistent between IR isoforms and non-AS genes apart from WL-D (Figures 3A,B). Moreover, the proportion of IR isoforms which were differentially expressed under different light treatments is higher than that of non-AS genes (Figure 3C and Supplementary Table 6). For instance, 37% of IR isoforms is differentially expressed between the B-D comparison, while the corresponding fraction of non-AS genes is 28% (p-value < 2.2e–16, Chi-squared test, Figure 3C).

FIGURE 3
www.frontiersin.org

Figure 3. Analysis of differential expression IR isoform expression pattern after light treatment. (A,B) Number of up- and down-regulated IR isoforms and non-AS genes after treatment, respectively. (C) Proportion of differentially expressed IR isoforms over all IR isoforms vs. the corresponding profortion of non-AS genes. B, blue light; R, red light; FR, far-red light; WL, white light; D, dark.

Gene Ontology Enrichment Analysis of Differentially Expressed Alternative Splicing and Intron Retention Isoforms

Since AS isoforms are more likely to be differentially expressed under light treatment, we used GO enrichment analysis to investigate gene function of differentially expressed AS (Supplementary Table 7) genes, especially IR genes (Table 3 and Supplementary Table 8). For all AS genes, the functional enrichment analysis revealed that transcription regulator activity usually enriches in differentially expressed AS genes since adjusted p-values of this GO term (GO: 0140110) are significant in six comparisons, i.e., B-D, B-WL, FR-WL, R-WL and WL-D. Besides, lipid binding (GO: 0008289) is also enriched in four comparisons, i.e., B-D, B-WL, R-D and R-WL. For IR genes, among seven comparisons we found a total of 23 enriched GO terms in six comparisons, belonging to 11 GO annotations (Table 3). No enrichment was found in the FR-D comparison due to the limited number of differentially expressed IR isoforms (Supplementary Table 6). Five GO annotations were enriched in more than one comparison, i.e., nucleotide binding (GO: 0000166) in R-D and R-WL, RNA-binding (GO: 0003723) in R-D and B-WL, lipid metabolic process (GO: 0006629) in B-D, WL-D and B-WL, cellular protein modification (GO: 0006464) in B-D, R-D, B-WL and R-WL, membrane (GO: 0016020) in B-D, R-D, WL-D, B-WL, FR-W and R-WL. These functional enrichments suggest the evolution of IR genes with specific functions in response to light conditions. For instance, we found that three overlapping genes (CTI12_AA107750, CTI12_AA262950 and CTI12_AA419320) regulating sugar transporters have responses to blue, red and far-red light (Supplementary Table 9). The ref isoform of the CTI12_AA107750 gene was down-regulated in blue, red and far-red treatment in contrast to dark treatment (green label). However, the IR isoform of the corresponding gene was up-regulated in blue, red and far red treatment (red label). The ref isoform and IR isoforms of the CTI12_AA262950 gene have a similar regulatory mechanism in blue and red-light treatments in contrast to dark. Similarly, the ref isoform and IR isoforms of the CTI12_AA419320 gene was down-regulated and up-regulated in far-red light treatment compared with dark.

TABLE 3
www.frontiersin.org

Table 3. Gene ontology (GO) functional enrichment of differentially expressed IR isoforms.

Analysis of Intron Retention Genes Related to Artemisinin Synthesis Pathway

We compared the AS genes we identified with genes predicted to be involved in the biosynthesis pathway for the general sesquiterpenes precursor FPP and artemisinin synthesis. Among 24 genes related to artemisinin synthesis pathway and its upstream MVA and MEP pathways (Supplementary Figure 3), fifteen genes with AS behaviors were identified. Nine AS genes in the MEP pathway were identified, belonging to the 1-Deoxy-D-xylulose 5-phosphate synthase (DXS), 1-deoxy-D-xylulose- 5-phosphate reductoisomerase (DXR), 4-(Cytidine 5-diphospho)-2-C-methyl-D-erythritol kinase (CMK), 2-C-methyl-D-erythritol-4-(cytidyl-5-diphosphate) transferase (MCT), 2-C-methyl-D-erythritol-2,4-cyclodiphosphate synthase (MCS), and R-linalool synthase (LS) gene families (Supplementary Table 10). Five genes in the MVA pathway were identified, belonging to the acetoacetyl-CoA thiolase (AACT), mevalonate kinase (MVK), and farnesyl diphosphate synthase (FDS) gene families (Supplementary Tables 10, 11). One gene in the artemisinin synthesis pathway, belonging to the cytochrome P450 reductase (CPR) gene family, was identified. These genes produced a total of 39 isoforms, including 16 IR isoforms. The CPR gene was most highly expressed is an AS isoform, and the remaining genes have highest expression of the ref isoform (Figure 4).

FIGURE 4
www.frontiersin.org

Figure 4. Expression heatmap of the isoforms involved in artemisinin synthesis. The suffix “ref” indicates the reference isoform. For every gene, the reference isoform present in genome annotation is signed with the suffix “ref” or “rna” plus a series of number, while other isoforms newly identified are signed with pure number in its suffix. For example, CTI12_AA584370.ref61643 is the reference isoform of AACT while CTI12_AA584370.47 is a newly identified AS isoform. AACT, Acetyl-CoA C-acetyltransferase; HMGS, 3-Hydroxy-3-methylglutaryl-CoA synthase; HMGR, 3-Hydroxy-3-methylglutaryl-CoA reductase; MK, MVA kinase; PMK, Phospho-MVA kinase; MDC, Diphospho-MVA decarboxylase; DXS, 1-Deoxy-D-xylulose-5-phosphate synthase; DXR, 1-Deoxy-D-xylulose 5-phosphate reductoisomerase; MCT, 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; CMK, 4-(Cytidine-5-diphospho)-2-C-methyl-D-erythritol kinase; MDS, 2-C-methyl-D-erythritol-2,4-cyclodiphosphate synthase; HDS, 4-Hydroxy-3-methylbut-2-enyl-diphosphate synthase; HDR, 4-Hydroxy-3-methylbut-2-enyl diphosphate reductase; GGPPS (E, E, E)-geranylgeranyl diphosphate synthase.

Ten genes were identified that produce IR type isoforms in the artemisinin pathway (Figure 4). The expression of ref isoforms produced by CTI12_AA095890 (DXS), CTI12_AA182630 (DXS), CTI12_AA123070 (DXS), CTI12_AA422860 (DXR), CTI12_AA094970 (LS), CTI12_AA348050 (MCT), CTI12_AA584370 (AACT), CTI12_AA079320 (MVK), CTI12_AA174930 (FDS), and CTI12_AA484860 (CPR) genes, were up-regulated (red label), compared with dark treatment (Supplementary Table 10). Several IR isoforms were alternatively expressed under different light treatments. For example, under white light treatment, expression of the IR isoforms produced by CTI12_AA095890 (DXS) and CTI12_AA094970.6 (LS) genes were up-regulated, while the expression of the IR isoforms produced by CTI12_AA422860 (DXS), CTI12_AA182630 (DXR) and CTI12_AA484860 (CPR) genes were down-regulated. In addition, more IR isoforms are up-regulated in blue light, such as CTI12_AA348050 (MCT), CTI12_AA463490 (MVK), CTI12_AA422860 (DXS), CTI12_AA484860 (CPR) and CTI12_AA123070 (LS), compared with white light (Supplementary Table 10).

Interestingly, the ref isoform of the CPR gene was up-regulated and the IR isoform down-regulated in all light treatments when compared with dark (Supplementary Tables 10, 11). However, the expression of IR isoform produced by CPR gene was up-regulated when contrasted with white light (Supplementary Table 10). CPR has been hypothesized to serve as a redox partner for CYP71AV1, which catalyzes the conversion of amorphadiene to more oxygenated products in A. annua (Paddon et al., 2013). Moreover, if a gene has multiple isoforms, i.e., more than one IR isoform, these IR isoforms may contribute in the same or opposite ways under different light conditions as in the CPR or LS cases (Supplementary Table 10). In the artemisinin pathway, we observed genes that produced IR isoforms, e.g., nine genes in the MEP pathway, five genes in the MEP pathway and one gene in the artemisinin synthesis pathway. More IR isoforms are produced in the upstream biosynthesis pathway of artemisinin. Compared with dark, we found the IR isoforms of all 10 genes are up-regulated under light treatment. More IR isoforms were up-regulated under blue and red-light treatments. However, only blue light treatment has more IR expression up-regulated when we compared with white light. One main consequence of IR is the occurrence of premature stop codon (PTC) in retained intron (Supplementary Figures 1, 2). We checked whether PTC exists in IR genes related to the artemisinin synthesis pathway. Seven genes were found to bear PTC in their IR isoforms. For instance, three IR isoforms of the LS (CTI12_AA094970) gene, i.e., CTI12_AA094970.6, CTI12_AA094970.7, and CTI12_AA094970.8, retain the second intron, leading to the PTC of these AS isoforms. Thus, all these three isoforms may produce a truncated protein which lacks the 3rd to 6th exons. PTC-present isoforms may also produce pre-mRNAs which will further subject to non-sense-mediated mRNA decay (NMD) and the expression of this gene thus could be appropriately regulated by these PTC-present isoforms induced decay (Isken and Maquat, 2008). These results imply that IR are involved in expression regulatory mechanism to control the metabolism of sesquiterpenes.

Discussion

The accumulation of artemisinin is unique in A. annua, and the yield of artemisinin largely determines the medicinal value of A. annua (Graham et al., 2010; Czechowski et al., 2016, 2018). Since light has a positive effect on the accumulation of secondary metabolites (Li et al., 2017; Yoo et al., 2019; Zhang et al., 2019), the study of the molecular mechanism of the effect of light on artemisinin accumulation is of great significance for basic science and practical applications. In this study, we used both NGS and SMRT-based RNA-seq to identify transcripts of A. annua under different light conditions including blue, red, far-red, white light and dark treatments (Zhang et al., 2017), and analyzed the overall transcriptional level of each gene. By combining Illumina and SMRT platforms for transcriptome sequencing of leaf, flower, stem, root in A. annua, we generated a new isoform set. Our study identified 11,832 AS events in total, of which 5,854 were IR isoforms (Figure 1 and Table 2). File of a total of 11,832 AS isoforms sequences is stored in Supplementary_AS_sequence.fasta. We found that IR isoforms accounted for 48.48% of all AS isoforms, representing the most frequent AS event is IR in A. annua. This result is consistent with previous findings in other plants (Filichkin et al., 2010; Chang et al., 2014; Gao H. et al., 2019). We found that IR isoform expression tends to be differentially expressed under light treatment. Comparing with the current annotation that records only 159 IR genes (Table 2), our new AS dataset is more reliable based on our knowledge of plant AS patterns and could sufficiently improve the A. annua genome annotation.

Former studies have shown that environmental stress can affect AS global pattern (Palusa et al., 2007; Filichkin et al., 2010, 2015; Duque, 2011; Reddy et al., 2013; Staiger and Brown, 2013; Ding et al., 2014). Light not only provides energy for plants, but also is a key environmental factor (Filichkin et al., 2010; Chang et al., 2014; Dietz, 2015; Gao H. et al., 2019). There are reported that the deletion of SAD1 which is a key gene involved in ABA signal pathway results in a genome-wide increase of AS (Cui et al., 2014). Our work also revealed that light possibly affect the production of AS particularly IR. Comparing with the non-AS isoforms, there is a greater proportion of AS gene expression regulated by light, which suggests that light can influence not only the production of AS but also its gene expression (Figure 3C). This finding could broaden our insight into how light regulates the behavior of plants. Ten genes were identified that produce IR type isoforms in the artemisinin pathway, but only one is in the artemisinin downstream pathway. This may be due to the upstream is the common pathway of many metabolites and can be affected by more environmental regulatory factors, so as to quick respond to the changes; while the downstream specific artemisinin synthesis pathway may not require too many regulatory changes. Moreover, we also found that the IR genes in the artemisinin pathway was basically high in the expression of reference isoform, while the expression of AS isoform was generally lower. The expression of AS reference and AS isoforms in the artemisinin pathway were higher in most light than in the dark, especially in red and blue light, which was consistent with the most obvious effect of blue and red light on the accumulation of artemisinin. This indicates that the regulation effect of AS on different transcripts is the same on the synthesis pathway of light, which is more conducive to the additive effect on the synthesis regulation of artemisinin and the rapid regulation of light on artemisinin.

Plants have distinct sets of photoreceptors for several parts of the light spectrum, ranging from near-UVB (280–315 nm) to far-red (FR) (∼750 nm) wavelengths. Plants mainly use red and blue light for photosynthesis. At the same time, red and blue light, as a signal, play an important role in many biological processes such as seed germination, de-etiolation, phototropism and flowering (Wit et al., 2016). In Figure 3A, red and blue light can induce more IR isoforms up-regulated than other light treatments, indicating that the transduction of red and blue light signals to plants may depend on the AS. Also, in Figure 3C, red and blue light treatments can induce more differentially expressed IR isoforms. Red and blue light can induce more physiological responses in plants and cause more changes in the expression of AS. Besides, our analysis show that lipid related functions such as lipid binding or lipid metabolic process are enriched in differentially expressed AS or IR genes (Supplementary Tables 7, 8). Lipids, which are major and essential constituents of all plant cells, not only provide structural integrity and energy for various metabolic processes but can also function as signal transduction mediators (Bigay and Antonny, 2012). There is a lot of lipid synthesis and metabolism in chloroplasts. Chloroplast is the site of photosynthesis in plants, and its growth and development are also regulated by light (Chen et al., 2018). Therefore, light changes can cause many lipids synthesis and metabolism gene expression changes. Also, PA, one of the central molecules in lipid biosynthesis, not only facilitates transport of lipids across membranes (Awai et al., 2006) but also plays an important role in biotic and abiotic stress responses. Light is also a stress to plants, especially ultraviolet light. It is possible that light also mediates stress responses in plants through lipid signaling, and therefore, light may affect the expression of many genes related to lipid metabolism. Light is one of the important factors that regulate plant growth and development. Light exposure immediately induced alternative splicing, with many alternatively spliced transcripts expressed from genes with functions related to light signaling, suggesting a potential impact on pre-mRNA splicing and photomorphogenic gene regulation in response to light (Yoshimura et al., 2011; Petrillo et al., 2014a, b; Godoy Herz et al., 2019). Light promotes RNA polymerase II (Pol II) elongation in the affected genes, whereas in darkness, elongation is lower (Godoy Herz et al., 2019). Therefore, Photoreceptors strictly regulate gene expression to control light morphological response. These results jointly suggest that AS may play an important role in the transduction of light signals.

Overall, the number of up-regulated IR isoforms was more than the total number of down-regulated IR isoforms after blue, red, and far-red light treatments. GO enrichment indicated that some genes with special functions can produce multiple isoforms. In the artemisinin biosynthetic pathway, a total of 10 genes underwent AS to produce 26 isoforms, including 16 IR isoforms, two AA isoforms, and eight other isoforms (Supplementary Tables 10, 11). Our research suggests that the expression of IR isoforms is regulated by blue, red, and far-red light, which may also be involved in the biosynthesis of artemisinin and other plant secondary metabolite synthesis (Chen et al., 2013; Liu et al., 2018). For example, genes such as AACT and DXS in the MVA and MEP pathways exhibited IR isoforms (Figure 4). A recent study reported that most of the IR isoforms in Arabidopsis are predicted to be targets of NMD to regulate mRNA stability as they are subjected to NMD to eliminate incomplete transcripts (Filichkin et al., 2010; Syed et al., 2012; Ge and Porse, 2014; Zhu et al., 2020). These eliminated IR isoforms may contain functional domains important for artemisinin biosynthesis by influencing specific steps of the pathway.

The analysis of AS in A. annua has generated valuable insights regarding plant transcriptomes, and these findings provide a foundation for further analyses. In future work, functional verification of IR isoforms will be conducted in the artemisinin pathway and regulatory genes. In conclusion, this study provides not only new insights into the regulation of AS in artemisinin biosynthesis, but also valuable genetic resources for further exploration of functional genomics in A. annua.

Data Availability Statement

The raw sequencing datasets of A. annua reported in this paper have been submitted to the Sequence Read Archive from the NCBI under the accession number PRJNA752933.

Author Contributions

SC, LX, and LL conceived the study. TM, HG, DZ, DD, LW, YYS, and GD performed the sample preparation of NGS and SMRT sequencing. HG, TM, DZ, LL, and TZ analyzed the data. WS, QY, LY, JW, ZX, and YHS contributed to discussion and language polishing. TM, HG, DZ, and LL wrote the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the National Natural Science Foundation of China and Karst Science Research Center of Guizhou Province (U1812403-1), the China Postdoctoral Science Foundation (2021M690464), the National Natural Science Foundation of China (81903758, 31900258, and 81641002), the National Major Science and Technology Projects (2017ZX09101002-003-001, 2019ZX09201005-006-001, and 2019ZX09731-002), and the CACMS Innovation Fund (CI2021A05103).

Conflict of Interest

YYS and GD were employed by the Amway (China) Botanical R&D Center.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We would like to thank Yujun Zhang from Institute of the China Academy of Chinese Medical Sciences for the helpful discussion.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2021.733505/full#supplementary-material

Supplementary Figure 1 | Sequence alignment of PTC-present isforms and their corresponding reference isoforms. Location of PTC is highlighted in red rectangle.

Supplementary Figure 2 | Gene structure of PTC-present isforms and their corresponding reference isoforms. Location of PTC is indicated by black arrow.

Supplementary Figure 3 | Expression heatmap of genes related to MVA, MEP and artemisinin synthesis pathways.

Supplementary Table 1 | Pre-processed of full-length reads and corrected full-length reads data in leaf, flowers, stem and root of A. annua.

Supplementary Table 2 | Summary of Illumina RNA-seq and single-molecule real-time (SMRT) full-length read and corrected full-length reads quality mapping to the reference genome as determined by STAR.

Supplementary Table 3 | The alternative splicing isoforms and splicing types identified by Isoform Detection and Prediction (IDP).

Supplementary Table 4 | The expression level of a total of 11,832 alternative splicing isoforms involved in the A. annua.

Supplementary Table 5 | Genome-wide quantitative analysis of alternative splicing isoforms produced by different light treatments. The table shows that blue, red, far red, white light and dark were used to treat A. annua seedlings for 48 h. There were three biological duplicates for each treatment. Kallisto was used to quantify 75,058 transcripts, and the mean value of TPM was taken as the expression of transcripts. Set TPM = 1 to the cut-off value, and the isoform (alternative splicing) of TPM > 1 is considered to be expressed.

Supplementary Table 6 | The total number of up-regulated and down-regulated IR isoforms and non-AS genes after treatment with blue, red, far-red, white light and dark. The table shows the total number of up- and down-regulated IR isoforms and non-AS genes after different light treatments, which were used to Supplementary Figure 3.

Supplementary Table 7 | Functional enrichment of differentially expressed AS genes. The table shows gene enrichment results corresponding to all differentially expressed AS isoforms. Gene Ratio: The ratio of the number of differentially expressed genes associated with this term to the total number of differentially expressed genes; which had GO annotations; Bg Ratio: The ratio of the number of background genes associated with this term to the total number of all background genes which had GO annotations; p.adjust: adjust p-values for multiple comparisons.

Supplementary Table 8 | Alternative splicing isoforms of up-regulated and down-regulated genes in Gene ontology (GO) functional enrichment results. The table shows gene enrichment results corresponding to all differentially expressed IR isoforms were either up-regulated or down-regulated after blue, red, far-red, white light and dark treatment.

Supplementary Table 9 | The overlapping genes in Gene ontology (GO) enrichment analysis result.

Supplementary Table 10 | The alternative splicing type and expression level of the genes involved in the A. annua (dark as a control).

Supplementary Table 11 | The alternative splicing type and expression level of the genes involved in the A. annua (white as a control).

Supplementary_AS_sequence.fasta | Fasta-format sequence file of a total of 11,832 alternative splicing isoforms involved in the A. annua.

Footnotes

  1. ^ http://www.pacificbiosciences.com/
  2. ^ https://github.com/Magdoll/PacBio-generic/blob/master/hmmer_wrapper.py

References

Abdel-Ghany, S. E., Hamilton, M., Jacobi, J. L., Ngam, P., Devitt, N., Schilkey, F., et al. (2016). A survey of the sorghum transcriptome using single-molecule long reads. Nat. Commun. 7:11706.

Google Scholar

Ardui, S., Ameur, A., Vermeesch, J. R., and Hestand, M. S. (2018). Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics. Nucleic Acids Res. 46, 2159–2168. doi: 10.1093/nar/gky066

PubMed Abstract | CrossRef Full Text | Google Scholar

Au, K. F., Jiang, H., Lin, L., Xing, Y., and Wong, W. H. (2010). Detection of splice junctions from paired-end RNA-seq data by SpliceMap. Nucleic Acids Res. 38, 4570–4578. doi: 10.1093/nar/gkq211

PubMed Abstract | CrossRef Full Text | Google Scholar

Au, K. F., Sebastiano, V., Afshar, P. T., Durruthy, J. D., Lee, L., Williams, B. A., et al. (2013). Characterization of the human ESC transcriptome by hybrid sequencing. Proc. Natl. Acad. Sci. U. S. A. 110, E4821–E4830.

Google Scholar

Awai, K., Xu, C., Tamot, B., and Benning, C. (2006). A phosphatidic acid–binding protein of the chloroplast inner envelope membrane involved in lipid trafficking. PNAS 103, 10817–10822. doi: 10.1073/pnas.0602754103

PubMed Abstract | CrossRef Full Text | Google Scholar

Berlin, K., Koren, S., Chin, C. S., Drake, J. P., Landolin, J. M., and Phillippy, A. M. (2015). Assembling large genomes with single-molecule sequencing and locality-sensitive hashing. Nat. Biotechnol. 33, 623–630. doi: 10.1038/nbt.3238

PubMed Abstract | CrossRef Full Text | Google Scholar

Bigay, J., and Antonny, B. (2012). Curvature, lipid packing, and electrostatics of membrane organelles: defining cellular territories in determining specificity. Dev. Cell 23, 886–895. doi: 10.1016/j.devcel.2012.10.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Bin, L., and Hong, Z. (2010). Advances on the study of the pharmacological effects of artemisinin and its derivatives. Chin. J. Clin. Pharmacol. Ther. 15, 572–576.

Google Scholar

Bray, N. L., Pimentel, H., Melsted, P., and Pachter, L. (2016). Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527. doi: 10.1038/nbt.3519

PubMed Abstract | CrossRef Full Text | Google Scholar

Brett, D., Pospisil, H., Valcarcel, J., Reich, J., and Bork, P. (2002). Alternative splicing and genome complexity. Nat. Genet. 30, 29–30.

Google Scholar

Chang, C. Y., Lin, W. D., and Tu, S. L. (2014). Genome-Wide Analysis of Heat-Sensitive Alternative Splicing in Physcomitrella patens. Plant Physiol. 165, 826–840. doi: 10.1104/pp.113.230540

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, J. F., Dong, X., Li, Q., Zhou, X., Gao, S. H., Chen, R. B., et al. (2013). Biosynthesis of the active compounds of isatis indigotica based on transcriptome sequencing and metabolites profiling. BMC Genomics 14:857. doi: 10.1186/1471-2164-14-857

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, Y., Zhou, B., Li, J., Tang, H., Tang, J., and Yang, Z. (2018). Formation and Change of Chloroplast-Located Plant Metabolites in Response to Light Conditions. Int. J. Mol. Sci. 19:654. doi: 10.3390/ijms19030654

PubMed Abstract | CrossRef Full Text | Google Scholar

Cui, P., Zhang, S., Ding, F., Ali, S., and Xiong, L. (2014). Dynamic regulation of genome-wide pre-mRNA splicing and stress tolerance by the Sm-like protein LSm5 in Arabidopsis. Genome Biol. 15:R1.

Google Scholar

Cuong, D. M., Jeon, J., Morgan, A. M. A., Kim, C., Kim, J. K., Lee, S. Y., et al. (2017). Accumulation of Charantin and Expression of Triterpenoid Biosynthesis Genes in Bitter Melon (Momordica charantia). J. Agric. Food Chem. 65, 7240–7249. doi: 10.1021/acs.jafc.7b01948

PubMed Abstract | CrossRef Full Text | Google Scholar

Czechowski, T., Larson, T. R., Catania, T. M., Harvey, D., Brown, G. D., and Graham, I. A. (2016). Artemisia annua mutant impaired in artemisinin synthesis demonstrates importance of nonenzymatic conversion in terpenoid metabolism. Proc. Natl. Acad. Sci. U. S. A. 113, 15150–15155. doi: 10.1073/pnas.1611567113

PubMed Abstract | CrossRef Full Text | Google Scholar

Czechowski, T., Larson, T. R., Catania, T. M., Harvey, D., Wei, C. X., Essome, M., et al. (2018). Detailed phytochemical analysis of highand low artemisinin-producing chemotypes of Artemisia annua. Front. Plant Sci. 9:641. doi: 10.3389/fpls.2018.00641

PubMed Abstract | CrossRef Full Text | Google Scholar

Dietz, K. J. (2015). Efficient high light acclimation involves rapid processes at multiple mechanistic levels. J. Exp. Bot. 66, 2401–2414.

Google Scholar

Ding, F., Cui, P., Wang, Z., Zhang, S., Ali, S., and Xiong, L. (2014). Genome-wide analysis of alternative splicing of pre-mRNA under salt stress in Arabidopsis. BMC Genomics 15:431. doi: 10.1186/1471-2164-15-431

PubMed Abstract | CrossRef Full Text | Google Scholar

Dobin, A., Davis, C. A., Schlesinger, F., Drenkow, J., Zaleski, C., Jha, S., et al. (2013). STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21. doi: 10.1093/bioinformatics/bts635

PubMed Abstract | CrossRef Full Text | Google Scholar

Duque, P. (2011). A role for SR proteins in plant stress responses. Plant Signal. Behav. 6, 49–54. doi: 10.4161/psb.6.1.14063

PubMed Abstract | CrossRef Full Text | Google Scholar

Filichkin, S. A., Cumbie, J. S., Dharmawardhana, P., Jaiswal, P., Chang, J. H., Palusa, S. G., et al. (2015). Environmental stresses modulate abundance and timing of alternatively spliced circadian transcripts in Arabidopsis. Mol. Plant 8, 207–227. doi: 10.1016/j.molp.2014.10.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Filichkin, S. A., Priest, H. D., Givan, S. A., Shen, R., Bryant, D. W., Fox, S. E., et al. (2010). Genome-wide mapping of alternative splicing in Arabidopsis thaliana. Genome Res. 20, 45–58. doi: 10.1101/gr.093302.109

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, H., Li, F., Xu, Z., Huang, C., Xiong, C., Jiang, C., et al. (2019). Genome-wide analysis of methyl jasmonate-regulated isoform expression in the medicinal plant Andrographis paniculata. Ind. Crops Prod. 135, 39–48. doi: 10.1016/j.indcrop.2019.04.023

CrossRef Full Text | Google Scholar

Gao, T., Xu, Z., Song, X., Huang, K., Li, Y., Wei, J., et al. (2019). Hybrid Sequencing of Full-Length cDNA Transcripts of the Medicinal Plant Scutellaria baicalensis. Int. J. Mol. Sci. 20:4426. doi: 10.3390/ijms20184426

PubMed Abstract | CrossRef Full Text | Google Scholar

Ge, Y., and Porse, B. T. (2014). The functional consequences of intron retention: alternative splicing coupled to NMD as a regulator of gene expression. Bioessays 36, 236–243. doi: 10.1002/bies.201300156

PubMed Abstract | CrossRef Full Text | Google Scholar

Godoy Herz, M. A., Kubaczka, M. G., Brzyzek, G., Servi, L., Krzyszton, M., Simpson, C., et al. (2019). Light Regulates Plant Alternative Splicing through the Control of Transcriptional Elongation. Mol. Cell 73, 1066–1074.e3.

Google Scholar

Graham, I. A., Besser, K., Blumer, S., Branigan, C. A., Czechowski, T., Elias, L., et al. (2010). The genetic map of Artemisia annua L. identifies loci affecting yield of the antimalarial drug artemisinin. Science 327, 328–331. doi: 10.1126/science.1182612

PubMed Abstract | CrossRef Full Text | Google Scholar

Isken, O., and Maquat, L. E. (2008). The multiple lives of NMD factors: balancing roles in gene and genome regulation. Nat. Rev. Genet. 9, 699–712. doi: 10.1038/nrg2402

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiao, X., Chang, J., Kilic, T., Tong, L., and Kiledjian, M. (2013). A mammalian pre-mrna 5’ end capping quality control mechanism and an unexpected link of capping to pre-mrna processing. Mol. Cell 50, 104–115. doi: 10.1016/j.molcel.2013.02.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, H., Wan, Y. W., and Liu, Z. (2017). Comprehensive evaluation of RNA-seq quantification methods for linearity. BMC Bioinformatics 18:117. doi: 10.1186/s12859-017-1526-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Keren, H., Lev-Maor, G., and Ast, G. (2010). Alternative splicing and evolution: diversification, exon definition and function. Nat. Rev. Genet. 11, 345–355. doi: 10.1038/nrg2776

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, H., and Yin, J. (2005). Effects of RNA splicing and post-transcriptional regulation on HIV-1 growth: a quantitative and integrated perspective. Syst. Biol. 152, 138–152. doi: 10.1049/ip-syb:20050004

PubMed Abstract | CrossRef Full Text | Google Scholar

Kochan, E., Balcerczak, E., Szymczyk, P., Sienkiewicz, M., Zielinska-Blizniewska, H., and Szymanska, G. (2019). Abscisic Acid Regulates the 3-Hydroxy-3-methylglutaryl CoA Reductase Gene Promoter and Ginsenoside Production in Panax quinquefolium Hairy Root Cultures. Int. J. Mol. Sci. 20:1310.

Google Scholar

Kolde, R. (2015). Pheatmap: Pretty Heatmaps. R package version 0.6 1: 2012.

Google Scholar

Koren, S., Schatz, M. C., Walenz, B. P., Martin, J., Howard, J. T., Ganapathy, G., et al. (2012). Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat. Biotechnol. 30, 693–700. doi: 10.1038/nbt.2280

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J., Ren, L., Gao, Z., Jiang, M., Liu, Y., Zhou, L., et al. (2017). Combined transcriptomic and proteomic analysis constructs a new model for light-induced anthocyanin biosynthesis in eggplant (Solanum melongena L.). Plant Cell Environ. 40, 3069–3087. doi: 10.1111/pce.13074

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., Wei, W., Feng, J., Luo, H., Pi, M., Liu, Z., et al. (2018). Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets. DNA Res. 25, 61–70. doi: 10.1093/dnares/dsx038

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y. Q., Yang, S. H., Yang, H. J., and Hua, J. (2007). The TIR-NB-LRR gene SNC1 is regulated at the transcript level by multiple factors. Mol. Plant Microbe Interact. 20, 1449–1456. doi: 10.1094/mpmi-20-11-1449

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, G. F., Liu, J. J., He, Z. R., Wang, F. M., Yang, H., Yan, Y. F., et al. (2018). Implementation of CsLIS/NES in linalool biosynthesis involves transcript splicing regulation in Camellia sinensis. Plant Cell Environ. 41, 176–186. doi: 10.1111/pce.13080

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Y. F., Wang, B., Shu, S. H., Li, Z., Song, C., and Liu, D. (2021). Analysis of the Coptis chinensis genome reveals the diversification of protoberberine-type alkaloids. Nat. Commun. 12:3276.

Google Scholar

Lopes, E. M., Guimaraes-Dias, F., Gama, T., Macedo, A. L., Valverde, A. L., de Moraes, M. C., et al. (2019). Artemisia annua L. and photoresponse: from artemisinin accumulation, volatile profile and anatomical modifications to gene expression. Plant Cell Rep. 39, 101–117. doi: 10.1007/s00299-019-02476-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Love, M. I., Huber, W., and Anders, S. (2014). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15:550.

Google Scholar

Ma, T. Y., Xiang, L., Zhang, D., Shi, Y. H., Ding, D. D., Shen, X. F., et al. (2018). Statues and research strategy of molecular breeding in Artemisia annua. Zhongguo Zhong Yao Za Zhi 43, 3041–3050.

Google Scholar

Martins, S. B., Rino, J., Carvalho, T., Carvalho, C., Yoshida, M., Klose, J. M., et al. (2011). Spliceosome assembly is coupled to RNA polymerase II dynamics at the 3’ end of human genes. Nat. Struct. Mol. Biol. 18, 1115–1123. doi: 10.1038/nsmb.2124

PubMed Abstract | CrossRef Full Text | Google Scholar

Mason, C. E., and Elemento, O. (2012). Faster sequencers, larger datasets, new challenges. Genome Biol. 13:314. doi: 10.1186/gb-2012-13-3-314

PubMed Abstract | CrossRef Full Text | Google Scholar

Matlin, A. J., Clark, F., and Smith, C. W. (2005). Understanding alternative splicing: towards a cellular code. Nat. Rev. Mol. Cell Biol. 6, 386–398. doi: 10.1038/nrm1645

PubMed Abstract | CrossRef Full Text | Google Scholar

Maza, E. (2016). In Papyro Comparison of TMM (edgeR), RLE (DESeq2), and MRN Normalization Methods for a Simple Two-Conditions-Without-Replicates RNA-Seq Experimental Design. Front. Genet. 7:164. doi: 10.3389/fgene.2016.00164

PubMed Abstract | CrossRef Full Text | Google Scholar

Miller, L. H., and Su, X. (2011). Artemisinin: discovery from the Chinese herbal garden. Cell 146, 855–858. doi: 10.1016/j.cell.2011.08.024

PubMed Abstract | CrossRef Full Text | Google Scholar

Nagegowda, D. A. (2010). Plant volatile terpenoid metabolism: biosynthetic genes, transcriptional regulation and subcellular compartmentation. FEBS Lett. 584, 2965–2973. doi: 10.1016/j.febslet.2010.05.045

PubMed Abstract | CrossRef Full Text | Google Scholar

Nagegowda, D. A., Gutensohn, M., Wilkerson, C. G., and Dudareva, N. (2008). Two nearly identical terpene synthases catalyze the formation of nerolidol and linalool in snapdragon flowers. Plant J. 55, 224–239. doi: 10.1111/j.1365-313x.2008.03496.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Nilsen, T. W., and Graveley, B. R. (2010). Expansion of the eukaryotic proteome by alternative splicing. Nature 463, 457–463. doi: 10.1038/nature08909

PubMed Abstract | CrossRef Full Text | Google Scholar

Ozsolak, F., and Milos, P. M. (2011). RNA sequencing: advances, challenges and opportunities. Nat. Rev. Genet. 12, 87–98.

Google Scholar

Paddon, C. J., and Keasling, J. D. (2014). Semi-synthetic artemisinin: a model for the use of synthetic biology in pharmaceutical development. Nat. Rev. Microbiol. 12, 355–367. doi: 10.1038/nrmicro3240

PubMed Abstract | CrossRef Full Text | Google Scholar

Paddon, C. J., Westfall, P. J., Pitera, D. J., Benjamin, K., Fisher, K., McPhee, D., et al. (2013). High-level semi-synthetic production of the potent antimalarial artemisinin. Nature 496, 528–532.

Google Scholar

Palusa, S. G., Ali, G. S., and Reddy, A. S. (2007). Alternative splicing of pre-mRNAs of Arabidopsis serine/arginine-rich proteins: regulation by hormones and stresses. Plant J. 49, 1091–1107. doi: 10.1111/j.1365-313x.2006.03020.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Pan, Q., Shai, O., Lee, L. J., Frey, B. J., and Blencowe, B. J. (2008). Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat. Genet. 40, 1413–1415. doi: 10.1038/ng.259

PubMed Abstract | CrossRef Full Text | Google Scholar

Peplow, M. (2013). Malaria drug made in yeast causes market ferment. Nature 494, 160–161.

Google Scholar

Petrillo, E., Godoy Herz, M. A., Barta, A., Kalyna, M., and Kornblihtt, A. R. (2014a). Let there be light: regulation of gene expression in plants. RNA Biol. 11, 1215–1220. doi: 10.4161/15476286.2014.972852

PubMed Abstract | CrossRef Full Text | Google Scholar

Petrillo, E., Godoy Herz, M. A., Fuchs, A., Reifer, D., Fuller, J., Yanovsky, M. J., et al. (2014b). A chloroplast retrograde signal regulates nuclear alternative splicing. Science 344, 427–430. doi: 10.1126/science.1250322

PubMed Abstract | CrossRef Full Text | Google Scholar

Reddy, A. S., Marquez, Y., Kalyna, M., and Barta, A. (2013). Complexity of the alternative splicing landscape in plants. Plant Cell 25, 3657–3683.

Google Scholar

Rhoads, A., and Au, K. F. (2015). PacBio Sequencing and Its Applications. Genomics Proteomics Bioinformatics 13, 278–289. doi: 10.1016/j.gpb.2015.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Sharon, I., Morowitz, M. J., Thomas, B. C., Costello, E. K., Relman, D. A., and Banfield, J. F. (2013). Time series community genomics analysis reveals rapid shifts in bacterial species, strains, and phage during infant gut colonization. Genome Res. 23, 111–120.

Google Scholar

Shen, Q., Lu, X., Yan, T., Fu, X., Lv, Z., Zhang, F., et al. (2016). The jasmonate-responsive AaMYC2 transcription factor positively regulates artemisinin biosynthesis in Artemisia annua. New Phytol. 210, 1269–1281. doi: 10.1111/nph.13874

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, Q., Zhang, L., Liao, Z., Wang, S., Yan, T., Shi, P., et al. (2018). The Genome of Artemisia annua Provides Insight into the Evolution of Asteraceae Family and Artemisinin Biosynthesis. Mol. Plant 11, 776–788. doi: 10.1016/j.molp.2018.03.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Song, C., Liu, Y. F., Song, A. P., Dong, G. Q., Zhao, H. B., and Sun, W. (2018). The chrysanthemum nankingense genome provides insights into the evolution and diversification of chrysanthemum flowers and medicinal traits. Mol. Plant 11, 1482–1491. doi: 10.1016/j.molp.2018.10.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Staiger, D., and Brown, J. W. S. (2013). Alternative Splicing at the Intersection of Biological Timing, Development, and Stress Responses. Plant Cell 25, 3640–3656. doi: 10.1105/tpc.113.113803

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, C., Deng, L., Du, M., Zhao, J., Chen, Q., Huang, T., et al. (2020). A Transcriptional Network Promotes Anthocyanin Biosynthesis in Tomato Flesh. Mol. Plant 13, 42–58. doi: 10.1016/j.molp.2019.10.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Syed, N. H., Kalyna, M., Marquez, Y., Barta, A., and Brown, J. W. (2012). Alternative splicing in plants–coming of age. Trends Plant Sci. 17, 616–623. doi: 10.1016/j.tplants.2012.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Tan, H. X., Ling, X., Gao, S. H., Li, Q., Chen, J. F., Xiao, Y., et al. (2015). Trichome and artemisinin regulator 1 is required for trichome development and artemisinin biosynthesis in artemisia annua. Mol. Plant 8, 1396–1411. doi: 10.1016/j.molp.2015.04.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Thatcher, S. R., Zhou, W., Leonard, A., Wang, B. B., Beatty, M., Zastrow-Hayes, G., et al. (2014). Genome-wide analysis of alternative splicing in Zea mays: landscape and genetic regulation. Plant Cell 26, 3472–3487. doi: 10.1105/tpc.114.130773

PubMed Abstract | CrossRef Full Text | Google Scholar

Triemer, S., Gilmore, K., Vu, G. T., Seeberger, P. H., and Seidel-Morgenstern, A. (2018). Literally Green Chemical Synthesis of Artemisinin from Plant Extracts. Angew. Chem. Int. Ed Engl. 57, 5525–5528. doi: 10.1002/anie.201801424

PubMed Abstract | CrossRef Full Text | Google Scholar

Tu, Y. (2011). The discovery of artemisinin (qinghaosu) and gifts from Chinese medicine. Nat. Med. 17, 1217–1220. doi: 10.1038/nm.2471

PubMed Abstract | CrossRef Full Text | Google Scholar

Vashisth, D., Kumar, R., Rastogi, S., Patel, V. K., Kalra, A., Gupta, M. M., et al. (2018). Transcriptome changes induced by abiotic stresses in Artemisia annua. Sci. Rep. 8:3423.

Google Scholar

Wang, E. T., Sandberg, R., Luo, S., Khrebtukova, I., Zhang, L., Mayr, C., et al. (2008). Alternative isoform regulation in human tissue transcriptomes. Nature 456, 470–476. doi: 10.1038/nature07509

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y., Cheong, C. G., Hall, T. M., and Wang, Z. (2009). Engineering splicing factors with designed specificities. Nat. Methods 6, 825–830. doi: 10.1038/nmeth.1379

PubMed Abstract | CrossRef Full Text | Google Scholar

Wit, M., Costa Galvão, V., and Fankhauser, C. (2016). Light-mediated hormonal regulation of plant growth and development. Annu. Rev. Plant Biol. 67, 513–537. doi: 10.1146/annurev-arplant-043015-112252

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, Z., Peters, R. J., Weirather, J., Luo, H., Liao, B., Zhang, X., et al. (2015). Full-length transcriptome sequences and splice variants obtained by a combination of sequencing platforms applied to different root tissues of Salvia miltiorrhiza and tanshinone biosynthesis. Plant J. 82, 951–961. doi: 10.1111/tpj.12865

PubMed Abstract | CrossRef Full Text | Google Scholar

Yoo, H. J., Kim, J. H., Park, K. S., Son, J. E., and Lee, J. M. (2019). Light-Controlled Fruit Pigmentation and Flavor Volatiles in Tomato and Bell Pepper. Antioxidants 9:14. doi: 10.3390/antiox9010014

PubMed Abstract | CrossRef Full Text | Google Scholar

Yoshimura, K., Mori, T., Yokoyama, K., Koike, Y., Tanabe, N., Sato, N., et al. (2011). Identification of alternative splicing events regulated by an Arabidopsis serine/arginine-like protein, atSR45a, in response to high-light stress using a tiling array. Plant Cell Physiol. 52, 1786–1805. doi: 10.1093/pcp/pcr115

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, G., Wang, L. G., Han, Y., and He, Q. Y. (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287. doi: 10.1089/omi.2011.0118

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, Z. X., Li, J. X., Yang, C. Q., Hu, W. L., Wang, L. J., and Chen, X. Y. (2012). The jasmonate-responsive AP2/ERF transcription factors AaERF1 and AaERF2 positively regulate artemisinin biosynthesis in Artemisia annua L. Mol. Plant 5, 353–365. doi: 10.1093/mp/ssr087

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, D., Jiang, C., Huang, C., Wen, D., Lu, J., Chen, S., et al. (2019). The light-induced transcription factor FtMYB116 promotes accumulation of rutin in Fagopyrum tataricum. Plant Cell Environ. 42, 1340–1351. doi: 10.1111/pce.13470

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, D., Sun, W., Shi, Y., Wu, L., Zhang, T., and Xiang, L. (2018). Red and Blue Light Promote the Accumulation of Artemisinin in Artemisia Annua L. Molecules 23:1329. doi: 10.3390/molecules23061329

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, T., Song, C., Song, L., Shang, Z., Yang, S., Zhang, D., et al. (2017). RNA Sequencing and Coexpression Analysis Reveal Key Genes Involved in alpha-Linolenic Acid Biosynthesis in Perilla frutescens Seed. Int. J. Mol. Sci. 18:2433. doi: 10.3390/ijms18112433

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X., Chen, M. H., Wu, X., Kodani, A., Fan, J., Doan, R., et al. (2016). Cell-Type-Specific Alternative Splicing Governs Cell Fate in the Developing Cerebral Cortex. Cell 166, 1147–1162.e15.

Google Scholar

Zhu, D., Mao, F., Tian, Y., Lin, X., Gu, L., Gu, H., et al. (2020). The Features and Regulation of Co-transcriptional Splicing in Arabidopsis. Mol. Plant 13, 278–294. doi: 10.1016/j.molp.2019.11.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Artemisia annua, single molecule real-time (SMRT) sequencing, alternative splicing, intron retention, light-regulated, artemisinin

Citation: Ma T, Gao H, Zhang D, Sun W, Yin Q, Wu L, Zhang T, Xu Z, Wei J, Su Y, Shi Y, Ding D, Yuan L, Dong G, Leng L, Xiang L and Chen S (2021) Genome-Wide Analysis of Light-Regulated Alternative Splicing in Artemisia annua L. Front. Plant Sci. 12:733505. doi: 10.3389/fpls.2021.733505

Received: 06 July 2021; Accepted: 03 September 2021;
Published: 29 September 2021.

Edited by:

Carlos Esteban Hernando, IIBBA-CONICET Leloir Institute Foundation, Argentina

Reviewed by:

Rohini Garg, Shiv Nadar University, India
Maximiliano Estravis-Barcala, Umeå University, Sweden

Copyright © 2021 Ma, Gao, Zhang, Sun, Yin, Wu, Zhang, Xu, Wei, Su, Shi, Ding, Yuan, Dong, Leng, Xiang and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Liang Leng, bGxlbmdAaWNtbS5hYy5jbg==; Li Xiang, bHhpYW5nQGljbW0uYWMuY24=; Shilin Chen, c2xjaGVuQGljbW0uYWMuY24=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.