Abstract
There is increasing evidence suggesting the contribution of non-coding RNAs (ncRNAs) to the phenotypic and physiological complexity of organisms. A novel ncRNA species has been identified near the transcription boundaries of protein-coding genes in eukaryotes, bacteria, and archaea. This review provides a detailed description of these transcription boundary-associated RNAs (TBARs), including their classification. Based on their genomic distribution, TBARs are divided into two major groups: promoter-associated RNAs (PARs) and terminus-associated RNAs (TARs). Depending on the sequence length, each group is further classified into long RNA species (>200 nt) and small RNA species (<200 nt). According to these rules of TBAR classification, divergent ncRNAs with confusing nomenclatures, such as promoter upstream transcripts (PROMPTs), upstream antisense RNAs (uaRNAs), stable unannotated transcripts (SUTs), cryptic unstable transcripts (CUTs), upstream non-coding transcripts (UNTs), transcription start site-associated RNAs (TSSaRNAs), transcription initiation RNAs (tiRNAs), and transcription termination site-associated RNAs (TTSaRNAs), were assigned to specific classes. Although the biogenesis pathways of PARs and TARs have not yet been clearly elucidated, previous studies indicate that some of the PARs have originated either through divergent transcription or via RNA polymerase pausing. Intriguing findings regarding the functional implications of the TBARs such as the long-range “gene looping” model, which explains their role in the transcriptional regulation of protein-coding genes, are also discussed. Altogether, this review provides a comprehensive overview of the current research status of TBARs, which will promote further investigations in this research area.
Introduction
In the initial phase of genome sequencing, annotated protein-coding genes were thought to be a major contributor to the development of phenotypic and physiological complexity of organisms. However, this viewpoint was challenged soon afterwards. The application of high-throughput sequencing (HTS) in transcriptomics studies uncovered the intriguing phenomenon in which transcription signals were detected across the entire genomes (Encode Project Consortium et al., 2007). Notably, most of these signals originated from non-coding loci but not from protein-coding genes. Subsequent studies have revealed a positive correlation between the diversity of non-coding RNAs (ncRNAs) and the complexity of organisms (Amaral and Mattick, 2008; Amaral et al., 2008). Considering the emerging biological importance of ncRNAs, continuous research efforts have led to the identification and characterization of ncRNAs in multiple organisms. Although a large proportion of ncRNAs resides within intergenic regions, some reside within protein-coding genes, such as antisense ncRNAs (He et al., 2008) and intronic ncRNAs (Brown et al., 2008; Meng et al., 2011; Meng and Shao, 2012). Additionally, some of the ncRNAs have been identified near the promoter regions or the transcription termini of protein-coding genes (Gingeras, 2007; Kapranov et al., 2007).
In this review, we provide a brief summary of the discovery of promoter-associated RNAs (PARs) in budding yeast (Saccharomyces cerevisiae) (Davis and Ares, 2006; Neil et al., 2009; Xu et al., 2009), fruit fly (Drosophila melanogaster), chicken (Gallus gallus) (Taft et al., 2009a), mouse (Mus musculus) (Kapranov et al., 2007; Seila et al., 2008; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009; Flynn et al., 2011), human (Homo sapiens) (Han et al., 2007; Kapranov et al., 2007; Preker et al., 2008; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009; Pickrell et al., 2010), Arabidopsis thaliana (Chekanova et al., 2007; Wang et al., 2011; Ma et al., 2017), bacteria (Yus et al., 2012), and archaea (Zaramela et al., 2014). Additionally, the discovery of terminus-associated RNAs (TARs) in human (Kapranov et al., 2007, 2010; Taft et al., 2009a; Yue et al., 2010; Valen et al., 2011; Schwalb et al., 2016), mouse (Kapranov et al., 2007), fission yeast (Schizosaccharomyces pombe), and Arabidopsis (Ma et al., 2017) has also been introduced. Since promoters and terminators define the transcription boundaries of protein-coding genes, PARs and TARs are referred to as transcription boundary-associated RNAs (TBARs). Additionally, we summarize the biogenesis pathways of some of the TBARs and a few mechanistic models of TBAR biogenesis including divergent transcription and RNA polymerase (Pol) pausing. We also summarize recent progress in the functional studies conducted on TBARs, followed by in-depth discussions. Interestingly, PARs have been functionally implicated in transcription termination, whereas TARs are potentially involved in transcription initiation. The interactive relationship between transcription initiation and termination has been depicted using long-range regulatory models, such as “gene looping” (Lykke-Andersen et al., 2011; Andersen et al., 2013). Finally, we propose a preliminary nomenclature system for TBARs according to specific classification criteria and emphasize the importance of a uniform annotation system for TBAR research.
PARs in Animals: Discovery, Biogenesis, and Functions
In eukaryotes, the transcription of protein-coding genes includes two primary steps: initiation and elongation. Each step is under strict surveillance. However, each time transcription is initiated from the transcription start site (TSS) of a protein-coding gene, the final output is not always a full-length coding sequence. This is because various kinds of genetic and epigenetic factors, such as binding sites of RNA Pol II or transcription factors (TFs) and chromatin modifications, interrupt transcription initiation and/or elongation (Li et al., 2007; Razin et al., 2011). Premature termination of transcription is a widespread phenomenon that results in aberrant transcripts. As a result of frequent transcription pausing, diverse ncRNAs have been discovered near the promoter regions. Although such ncRNAs have been referred to by different names, here they are referred to as PARs.
In a pioneering study, Core et al. (2008) employed global run-on sequencing (GRO-Seq) to investigate the density of transcriptionally engaged RNA polymerase enzymes on a genome-wide scale. This study identified a novel species of PARs in human lung fibroblasts. However, because this study used an older generation sequencing platform with short read-length (maximum 33 nt), data on the length of PARs was not provided. These PARs are enriched within -250 bp to +50 bp of the TSSs of certain protein-coding genes. Because GRO-Seq is capable of evaluating promoter-proximal pausing on all genes, it has been suggested that these PARs are derived from RNA Pol II-dependent divergent transcription and promoter-proximal pausing (Core et al., 2008). Subsequent studies have uncovered PARs in different species. Because of recent advances in sequencing platforms, more information is available on the sequence features of PARs, especially their length distribution. In this review, we propose that, according to their sequence length, PARs should to be divided into two classes: promoter-associated long RNAs (PALRs; >200 nt) and promoter-associated small RNAs (PASRs; <200 nt).
Promoter-associated small RNAs have been reported in human and mouse (Kapranov et al., 2007; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009). TSS-associated RNAs (TSSaRNAs) reported in mouse (Seila et al., 2008) and transcription initiation RNAs (tiRNAs) in human, mouse, chicken and Drosophila (Taft et al., 2009a) have also been categorized as PASRs. Based on these studies, it can be concluded that weak expression is a common feature of PASRs. A number of PARs are exosome-specific substrates with short half-lives (Wyers et al., 2005; Davis and Ares, 2006; Chekanova et al., 2007; Preker et al., 2008). The divergent distribution of many PARs near the TSSs is correlated with bidirectional transcription activity of RNA Pol II. Functional studies have connected some of the PARs with transcriptional activation or repression of their target genes (Li et al., 2006; Janowski et al., 2007; Morris et al., 2008; Turunen et al., 2009; Huang et al., 2010; Kassab et al., 2015; Li et al., 2016; Uesaka et al., 2017), indicating that transcription can be modulated not only during elongation but also at initiation (Saunders et al., 2006; Muse et al., 2007; Zeitlinger et al., 2007; Core et al., 2008). In the following sections, we describe the previously reported PARs in animals and their biogenesis pathways and biological functions.
PALRs in Animals
Different Classes of PALRs
As summarized in Table 1, different classes of PALRs have been reported in several organisms, including promoter-associated transcripts in human (Han et al., 2007) and yeast (Davis and Ares, 2006), promoter upstream transcripts (PROMPTs) in human (Preker et al., 2008, 2011), upstream antisense RNAs (uaRNAs) in mouse (Flynn et al., 2011), stable unannotated transcripts (SUTs) and cryptic unstable transcripts (CUTs) in yeast (Wyers et al., 2005; Neil et al., 2009; Xu et al., 2009), and other unnamed PALRs in human and mouse (Kapranov et al., 2007; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009). Till date, only a few studies have been conducted on the biogenesis pathways and modes of action of PALRs. Thus, it is unclear whether these PALRs, with distinct nomenclatures, belong to the same or different ncRNA species. It is well known that divergent transcription of active promoters is the major mechanism involved in the biogenesis of PALRs (Sigova et al., 2013). Among the above-mentioned PALRs, PROMPTs are relatively well studied, especially for their sequence characteristics, biogenesis and biological roles. Therefore, we describe the PROMPTs in detail in the sections below.
Table 1
List of the currently identified TBARs (transcription boundary-associated RNAs).
TBARs, transcription boundary-associated RNAs; PARs, promoter-associated RNAs; TARs, terminus-associated RNAs; PALRs, promoter-associated long RNAs; PASRs, promoter-associated small RNAs; TALRs, terminus-associated long RNAs; TASRs, terminus-associated small RNAs; paRNAs, promoter-associated RNAs; PROMPTs, promoter upstream transcripts; uaRNAs, upstream antisense RNAs; SUTs, stable unannotated transcripts; CUTs, cryptic unstable transcripts; UNTs, upstream non-coding transcripts; TSSaRNAs, transcription start site-associated RNAs; tiRNAs, transcription initiation RNAs; TTSaRNAs, transcription termination site-associated RNAs.
Discovery of PROMPTs
Promoter-associated RNAs have short half-lives, as they undergo rapid exosome-mediated degradation, which interferes with their detection. To overcome this limitation, Preker et al. (2008, 2011) used human HeLa cells without exosome activity for the identification of short-lived PARs using oligodT primers. Results of tiling microarray analysis revealed 5′ capped and 3′ adenylated transcripts upstream of the TSSs of transcriptionally active genes (Preker et al., 2008, 2011). These transcripts were several hundred nucleotides in length and were named as PROMPTs.
Unlike PARs discovered by Core et al. (2008), PROMPTs originate from the region within -2,500 bp to -500 bp upstream of the TSS. Moreover, the transcription of PROMPTs can proceed in both directions (Preker et al., 2008). It is widely accepted that divergent transcription is an inherent feature of most of the active promoters (Beck and Warren, 1988; Neil et al., 2009; Xu et al., 2009). Moreover, the production of PROMPTs is highly dependent on activity of promoters of neighboring genes (Preker et al., 2008). Thus, the bidirectional transcription of PROMPTs reflects the universality of divergent transcription of highly active genes, which links bidirectional transcription to gene activity (Core et al., 2008; Nechaev and Adelman, 2011).
Functions of PROMPTs
Functional studies suggest that bidirectional transcription plays an important role in improving the accessibility of chromatin regions for the binding of TFs (Gilchrist et al., 2008; Seila et al., 2008). Notably, some of the divergently transcribed PARs are involved in Argonaute (AGO)-dependent (Han et al., 2007) or -independent (Wang et al., 2008) gene silencing pathways, indicating a potential regulatory role of PROMPTs in gene transcription. The SR proteins facilitate the function of promoter-proximal nascent RNA in transcription pause release (Ji et al., 2013). Additionally, some of the PROMPTs are enriched within the promoter regions with a high CpG content, and these PROMPTs affect the DNA methylation density of the promoters (Preker et al., 2008). Sequence motif analysis has shown that 3′ poly(A) signals are more abundant in regions upstream of the promoter than in those downstream of the promoter. These functional poly(A) signals are involved in the rapid degradation of PROMPTs transcribed in the antisense direction upstream of the associated genes. Thus, the rapid decay of upstream PROMPTs enables efficient elongation of downstream transcripts, which enforces the promoter orientation of protein-coding genes (Ntini et al., 2013). Moreover, if the synthesis of PROMPTs is stalled very early within the TSS-proximal regions, small TSSaRNAs are produced. Small TSSaRNAs are included in the category of PASRs and are discussed below.
PASRs in Animals
Discovery of PASRs
Promoter-associated small RNAs (sRNAs) identified in human and mouse have been previously named as PASRs (Calabrese et al., 2007; Kapranov et al., 2007). These sRNAs are 20–200 nt in length and originate from the region within -400 bp to +400 bp of the TSS. TSSaRNAs are a type of PASRs that have been identified in a broad spectrum of organisms, including eukaryotes (Seila et al., 2008; Park et al., 2014), bacteria (Yus et al., 2012), and archaea (Zaramela et al., 2014). The length of TSSaRNAs differs between species. For example, in the archaea Halobacterium salinarum, the length of TSSaRNAs ranges from 16 to 146 nt, with a median size of 27 nt (Zaramela et al., 2014). In bacteria, the length of TSSaRNAs varies within a narrow range; in Escherichia coli and Mycoplasma pneumonia, the length of TSSaRNAs varies from 33 to 40 nt and 35 to 55 nt, respectively (Yus et al., 2012). In murine embryonic stem cells, TSSaRNAs vary in size from 20 to 90 nt. These TSSaRNAs are highly enriched in the region spanning from -250 bp to +50 bp of the TSS (Seila et al., 2008). Thus, sequence features of mouse TSSaRNAs are similar to those described above. Indeed, TSSaRNAs were occasionally confused with the PASRs (Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009). Another class of PASRs comprises tiRNAs; these are highly conserved in higher metazoans, such as human, chicken, and fruit fly. Similar to the above two PASR classes, tiRNAs are enriched within the region spanning from -60 bp to +120 bp relative to the TSS. However, additional features specific to the tiRNAs have been reported, including their size distribution and GC content (Taft et al., 2009a). The size of tiRNAs varies from 13 to 28 nt, and a significant proportion of these are 18nt in length. Additionally, tiRNAs are GC-rich and show a strong strand bias toward the neighboring TSSs for their biogenesis.
Biogenesis of PASRs
The biogenesis of the three types of PASRs described above is Dicer-independent (Calabrese et al., 2007; Seila et al., 2008; Taft et al., 2009a), indicating that the non-canonical sRNA processing pathway(s) might be responsible for their production. Two models have been proposed for the biogenesis of PASRs (Lenhard et al., 2012). The first model is called “backtracking and excision” (Shaevitz et al., 2003; Seila et al., 2009; Taft et al., 2009a,b, 2010; Nechaev et al., 2010). In this model, the elongating RNA Pol II is stalled, inducing it to backtrack toward the upstream TSS after encountering the downstream nucleosome, resulting in a nascent transcript with a short 3′ exposed region. Subsequently, the exposed region is cleaved by the transcription elongation factor SII, resulting in PASR biogenesis. However, in vitro experiments have shown that the pausing and backtracking of Pol II is used to generate 6–14 nt fragments (Lenhard et al., 2012); however, this length is much shorter than that of PASRs. Therefore, further investigation is needed to identify a plausible model. The second model proposed for PASR biogenesis is the “Pol II pausing” model, which is also described as “unsuccessful Pol II elongation followed by RNA degradation” (Buckley et al., 2014; Jonkers et al., 2014). In this model, the RNA elongation complex is stalled at the initiation stage. Pol II pausing results in a nascent transcript without the 5′ cap, thus rendering it susceptible to rapid decay starting from its 5′ end. Only a short RNA sequence physically covered by Pol II is protected from degradation (Valen et al., 2011). The size of this short RNA sequence varies from17 to 22 nt, which fits well within the median size of PASRs. Notably, the above two models are probably mutually non-exclusive, since different PASR classes may use distinct pathways for their biogenesis.
Functions of PASRs
Recent studies have revealed valuable functional implications of PASRs. Depending on the association with certain long non-coding RNAs (lncRNAs) and protein factors, some of the PASRs target specific promoter regions of genes for epigenetic modifications that affect gene transcription (Hamazaki et al., 2017). However, altered transcriptional activity of target genes, in turn, causes fluctuations in PASR biogenesis. Thus, a feedback regulatory loop is established between PASRs and their target genes (Yan and Ma, 2012).
TSSaRNAs have been shown to play an important role in the tissue-specific epigenetic regulation of transcription initiation of target genes (Seila et al., 2008; Taft et al., 2009a, 2011; Henriques et al., 2013). In bacteria, TSSaRNAs are involved in the assembly of functional complexes that enable full-length transcription of their target genes, thus avoiding immature transcription initiation (Yus et al., 2012). The tiRNAs are produced from the promoter regions of highly expressed genes. These sRNAs are associated with chromatin marks representative of active transcription (Taft et al., 2009a, 2011). Specifically, approximately 96% of tiRNAs discovered in human embryonic stem cells overlap with histone H3K4 methylation marks (Taft et al., 2009a). Notably, some of these chromatin marks are generated via the tiRNA-dependent pathway and play important biological roles. For example, in both human and mouse, some of the tiRNAs induce local epigenetic modifications that modulate the localization of TFs that bind to CCCTC motifs (Taft et al., 2011).
TARs in Animals: Discovery, Biogenesis, and Functions
In addition to the PARs, transcription signals have also been detected at the 3′ ends of protein-coding genes (Wei et al., 2011) (Table 1). In contrast to PARs, reports on TARs, especially terminus-associated long RNAs (TALRs), are limited. In one case of TALRs, Yue et al. (2010) discovered several sense ncRNAs of ∼3,100 nt at the 3′ end of a progesterone receptor gene. In another study using human K562 cells, Schwalb et al. (2016) detected transient RNAs downstream of the poly(A) sites using transient transcriptome sequencing (TT-Seq), although the length of these transient RNAs was not described in detail.
The terminus-associated small RNAs (TASRs) ranging from 22 to 200 nt in length are highly conserved in human and mouse (Kapranov et al., 2007, 2010). Traces of sRNAs at the 3′ ends of many animal genes have been reported (Taft et al., 2009a). In fission yeast, the sense TASR snR49 has been detected in the 3′ region of the ribosomal protein-coding gene RPL26 (Leng et al., 2014). Another type of TASRs has been referred to as transcription termination site-associated RNAs (TTSaRNAs) (Valen et al., 2011). The enrichment of AGO-associated sRNAs within the 3′ untranslated regions (UTRs) of protein-coding genes has been observed in human (Valen et al., 2011); these TTSaRNAs typically vary in size from 22 to 24 nt.
Like PASRs, the genomic distribution of TASRs does not indicate a strand-specific bias. Some of the TASRs with non-genomically encoded 5′ poly(U) tails have been reported in human cells (Kapranov et al., 2010). Because these TASRs are antisense to poly(A)-tailed transcripts and are close to the transcription termination sites, these are suggested to be synthesized via an as-yet-unidentified RNA-dependent RNA polymerase (RDR)-dependent pathway. Another mechanism of TASR biogenesis has been reported for the 3′ sense terminus-associated ncRNA snR49 (Leng et al., 2014). Results of scanning deletion analysis within the promoter region reveal that the transcription of snR49 is dependent on the promoter activity of the upstream gene RPL26. Similarly, the transcription of another 3′ sense terminus-associated ncRNA, snR93, relies on the promoter activity of the upstream RPL29 gene. These data suggest that this kind of transcriptional regulation is a conserved mechanism underlying TASR biogenesis (Leng et al., 2014).
To investigate the regulatory roles of TARs overlapping the 3′ termini of protein-coding genes in the human genome, Younger and Corey (2011) designed small duplex RNAs called “microRNA (miRNA) mimics,” which were perfectly complementary to the TARs. The targeting of TARs by “miRNA mimics” affects upstream gene transcription in a dose-dependent manner, highlighting a previously unappreciated role of gene termini and their associated TARs in transcriptional regulation (Younger and Corey, 2011). On the other hand, artificial constructs expressing 3′ UTRs containing miRNA-binding sites act as miRNA sponges (Thomson and Dinger, 2016), indicating that some of the TALRs might function as endogenous miRNA sponges. Another widely accepted model called “gene looping” elucidates TAR-mediated interaction between the 5′ promoters and 3′ terminators. This model provides an unconventional view of transcriptional regulation on a long-range scale (O’Sullivan et al., 2004; Tan-Wong et al., 2008; Tiwari et al., 2008; Yue et al., 2010). The long-range “gene looping” brings the two ends of a gene in close proximity. The sRNAs complementary to the 3′ sense TALRs recruit AGO2 and other protein factors to the 5′–3′ interaction region of the gene, thus modulating its transcription initiation (Yue et al., 2010). This “gene looping” also provides a long-range scaffold, thus enabling communication between endogenous TASRs and promoters of upstream genes (Younger and Corey, 2011).
TBARs: The Emerging RNA Species in Plants
Discovery of Plant TBARs
Transcription boundary-associated RNAs have also been discovered in plants, although most of the reports are related to PARs. In Arabidopsis, a striking association of UNTs with promoters of many protein-coding genes has been observed (Chekanova et al., 2007). In yeast, characteristics of UNTs are similar to those of CUTs (Wyers et al., 2005; Neil et al., 2009; Xu et al., 2009). For example, both UNTs and CUTs are weakly expressed, 3′ poly(A)-tailed, and degraded via the exosome-mediated pathway. Notably, the 5′ ends of UNTs are coincident with those of full-length mRNAs transcribed from the identical promoters.
A survey of PASRs in metazoa and Arabidopsis has shown that, unlike metazoans, PASR-like peak is not detectable within the promoter regions of Arabidopsis genes (Taft et al., 2009b). Data of this survey also suggest that 18-nt tiRNAs are absent in plants. There are two possible explanations for these observations. First, it is possible that the tiRNA biogenesis pathway either has been lost or never existed in Arabidopsis. Second, it is possible that once tiRNAs are produced, they are subjected to rapid degradation (Taft et al., 2009b). The presence of PARs and TARs in Arabidopsis has been revisited using many more HTS data sets (Wang et al., 2011). Unlike the previous study (Taft et al., 2009b), Wang et al. (2011) detected PASR peaks in regions surrounding the TSSs of non-transposable element (TE) genes. However, in these data sets, a total of 17,000 non-TE genes were treated as a whole group for PASR signal detection, and an in-depth investigation was not performed. Therefore, a detailed list of genes producing PASRs is not available. Recently, we identified hundreds of protein-coding genes in Arabidopsis with detectable PASR signals (Ma et al., 2017). Similar to the result of Taft et al. (2009a), the 18-nt tiRNA-like PASRs were also rarely detected in our study. Some of the PASRs accumulated in a tissue-specific manner. Additionally, we observed TASR peaks surrounding the transcription termini of many protein-coding genes (Ma et al., 2017). Both PASRs and TASRs vary in length from 23 to 24 nt and preferentially start with either an adenine (A) or a uracil (U) residue at their 5′ ends.
Biogenesis of Plant TBARs
In Arabidopsis, PASRs are located either in upstream or downstream regions of the TSSs (Wang et al., 2011; Ma et al., 2017). In animals, divergent transcription (Beck and Warren, 1988; Core et al., 2008; He et al., 2008; Seila et al., 2008, 2009; Flynn et al., 2011) and bidirectional promoters (Morris et al., 2008; Xu et al., 2009; Wei et al., 2011; Uesaka et al., 2014) have been proposed as the mechanisms of PASR biogenesis. Whether these models explain the biogenesis of PASRs in plants needs further investigation. There is no clear model for the biogenesis of TASRs in animals. It has been suggested that some of the TASRs, especially those that areantisense, are generated through a specific RDR-dependent pathway. Our study in Arabidopsis shows that the accumulation of some TBARs, including PASRs and TASRs, is highly dependent on the activities of RNA Pol IV, RDR2/6, and DCL2/3/4 (Ma et al., 2017).
According to the distribution patterns of TSSs in vertebrates, gene promoters are classified into two categories: sharp and broad. A sharp promoter always has a single predominant TSS, whereas a broad promoter usually has an array of almost equivalent TSSs (Valen et al., 2011; Danino et al., 2015). Interestingly, sharp promoters generate fairly narrow PASR peaks, whereas broad promoters generate PASRs with a much wider distribution range. Notably, in Arabidopsis, both sharp and broad PASR peaks are observed on different gene promoters (Ma et al., 2017). Thus, investigating the relationship between the promoter type and PASR peak shape in plants will be useful.
Functions of Plant TBARs
Few studies have been conducted on the regulatory roles of TBARs in plants. However, studies in animals provide valuable hints on the functions of TBARs in plants. Using the human PR gene as a model, Core et al. (2008) have shown that small duplex RNAs complementary to the ncRNAs originating from the promoter or terminus of the PR gene can efficiently regulate gene transcription. Theoretically, manually designed sRNAs recruit the AGO2 protein to the complementary target ncRNAs (i.e., PALRs or TALRs), which alters the transcription status of the PR gene via the “gene looping” mechanism (Janowski et al., 2005, 2006, 2007; Schwartz et al., 2008; Younger and Corey, 2009; Yue et al., 2010). This regulatory mechanism is conserved in animals (Janowski et al., 2006; Kim et al., 2006; Han et al., 2007; Schwartz et al., 2008; Napoli et al., 2009; Chu et al., 2010). Two research areas need further investigation: first, whether the plant endogenous PASRs and TASRs function in a similar manner as the manually designed sRNAs reported in animals; and second, whether PALRs and TALRs are recognized as targets of PASRs and TASRs, respectively, in plants. The involvement of TBARs in site-specific DNA methylation has been observed in both plants and animals (Mette et al., 2000; Hawkins et al., 2009; Swiezewski et al., 2009; Zheng et al., 2013; Ma et al., 2017), indicating that it is an important pathway for TBAR-mediated regulation of gene transcription.
Concluding Remarks and Perspectives
Need for a Uniform Annotation System for TBAR Research
Owing to the heterogeneity of the TBAR population, many aspects of TBARs including sequence characteristics, biogenesis pathways and biological functions are poorly understood. One of the most basic and pressing issues derives from their classification. Some of the TBARs such as PASRs, TSSa-RNAs, and tiRNAs exhibit similar genomic locations and sizes in animals. It is unclear whether these sRNAs belong to distinct classes or to the same class (Jacquier, 2009). It is also unclear whether their biogenesis pathways and action modes overlap. Although some of the TBARs are likely to represent “transcription noise,” increasing cases are being reported that emphasize the non-negligible roles of certain TBARs in gene expression regulation. Hence, a uniform annotation system is required to facilitate in-depth research on TBARs. Some criteria have been proposed previously to determine the classification of PARs, including size ranges and positions relative to the TSSs (Kaikkonen et al., 2011; Lenhard et al., 2012). Here, we propose a preliminary system for TBAR annotation (Figure 1). In this system, we first classified TBARs, based on their genomic position, as PARs and TARs. Then, based on their sequence length, both PARs and TARs were divided into long (>200 nt) and short (<200 nt) species. Using this annotation system, most of the recently reported TBARs could be assigned to one of the categories. For further classification, we suggest that additional important features of TBARs, such as biogenesis pathways, modes of action, and biological functions, should be taken into consideration.
FIGURE 1
Future Challenges in TBAR Research
To advance our understanding of TBARs, further research is needed in a few areas described below. First, it has been suggested that sRNAs mapped to the transcription boundaries of protein-coding genes are degraded remnants of mRNAs (Schwalb et al., 2016). However, considering the non-random enrichment of TBARs with specific sequence length, such as the animal tiRNAs (18 nt) (Taft et al., 2009a) and plant PASRs and TASRs (23–24 nt) (Ma et al., 2017), at least some of the TBARs are unlikely to be generated through random mRNA decay. Nonetheless, a large proportion of TBARs are subjected to rapid degradation after their maturation. According to previous studies (Davis and Ares, 2006; Chekanova et al., 2007; Preker et al., 2008), mutants of RNA decay pathways (e.g., exosome-depleted cell lines) represent promising options for efficient TBAR cloning.
Second, recent evidence shows that a genetic mutation within a protein-coding locus may affect not only the gene of interest but also the associated ncRNAs. Notably, some of these affected ncRNAs, in turn, alter the chromatin state through long-range interactions (Tufarelli et al., 2003; Wei et al., 2011). Thus, when investigating the biological consequences of a mutation within a specific genomic locus, it is necessary to consider the effects on the associated ncRNAs.
Third, as mentioned above, the high frequency of poly(A) signals on PROMPTs is related to their rapid degradation, which has been proposed as one of the mechanisms to ensure unidirectional transcription elongation (Ntini et al., 2013). Since most of the mammalian promoters are bidirectional (Core et al., 2008; Seila et al., 2008; Sigova et al., 2013), functional studies on the involvement of PARs in transcription determination are necessary. Till date, three key factors have been proposed as important players in transcription determination (Wei et al., 2011), including the nucleotide composition, chromatin modifications of promoters, and the “gene looping” mechanism. The nucleotide composition within the promoter region affects the directionality of the promoter (Engstrom et al., 2006). In yeast, the TATA element imposes a constraint on the direction of transcription initiation (Park et al., 2014). Additionally, chromatin modifications induced by the promoter-associated transcription of the ncRNAs may serve as codes for the orientation of transcription elongation. Moreover, the “gene looping” model suggests that long-range interactions link the promoter to its favored 3′ end, thus determining the orientation of transcription elongation (Miele and Dekker, 2008; Laine et al., 2009; Tan-Wong et al., 2009). However, detailed mechanisms of unidirectional transcription elongation require further investigation.
Fourth, in addition to the TBARs associated with protein-coding genes, whether TBARs exist on lncRNAs needs to be revisited. Conserved secondary structures are detected at the ends of certain lncRNAs (Ponting et al., 2009; Yu et al., 2017); however, whether these local structures serve as precursors of PASRs or TASRs remains elusive.
Overall, investigations into all of the above-mentioned challenging but intriguing research areas are needed. Results of these investigations are expected to increase our knowledge of TBARs.
Statements
Author contributions
DY, XM, HW, and YM: wrote the manuscript. ZZ, DY, and XM: prepared the figure and table. All authors read and approved the final manuscript.
Funding
This work was funded by the National Natural Science Foundation of China (31571349), (31601062), and (31771436). Zhejiang Provincial Natural Science Foundation of China (LY15C060006) and (LQ16C060003). Hangzhou Scientific and Technological Program (20170432B04).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
1
Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project (2009). Post-transcriptional processing generates a diversity of 5′-modified long and short RNAs.Nature4571028–1032. 10.1038/nature07759
2
AmaralP. P.DingerM. E.MercerT. R.MattickJ. S. (2008). The eukaryotic genome as an RNA machine.Science3191787–1789. 10.1126/science.1155472
3
AmaralP. P.MattickJ. S. (2008). Noncoding RNA in development.Mamm. Genome19454–492. 10.1007/s00335-008-9136-7
4
AndersenP. K.JensenT. H.Lykke-AndersenS. (2013). Making ends meet: coordination between RNA 3′-end processing and transcription initiation.Wiley Interdiscip. Rev. RNA4233–246. 10.1002/wrna.1156
5
BeckC. F.WarrenR. A. (1988). Divergent promoters, a common form of gene organization.Microbiol. Rev.52318–326.
6
BrownJ. W.MarshallD. F.EcheverriaM. (2008). Intronic noncoding RNAs and splicing.Trends Plant Sci.13335–342. 10.1016/j.tplants.2008.04.010
7
BuckleyM. S.KwakH.ZipfelW. R.LisJ. T. (2014). Kinetics of promoter Pol II on Hsp70 reveal stable pausing and key insights into its regulation.Genes Dev.2814–19. 10.1101/gad.231886.113
8
CalabreseJ. M.SeilaA. C.YeoG. W.SharpP. A. (2007). RNA sequence analysis defines Dicer’s role in mouse embryonic stem cells.Proc. Natl. Acad. Sci. U.S.A.10418097–18102. 10.1073/pnas.0709193104
9
ChekanovaJ. A.GregoryB. D.ReverdattoS. V.ChenH.KumarR.HookerT.et al (2007). Genome-wide high-resolution mapping of exosome substrates reveals hidden features in the Arabidopsis transcriptome.Cell1311340–1353. 10.1016/j.cell.2007.10.056
10
ChuY.YueX.YoungerS. T.JanowskiB. A.CoreyD. R. (2010). Involvement of argonaute proteins in gene silencing and activation by RNAs complementary to a non-coding transcript at the progesterone receptor promoter.Nucleic Acids Res.387736–7748. 10.1093/nar/gkq648
11
CoreL. J.WaterfallJ. J.LisJ. T. (2008). Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters.Science3221845–1848. 10.1126/science.1162228
12
DaninoY. M.EvenD.IdesesD.Juven-GershonT. (2015). The core promoter: at the heart of gene expression.Biochim. Biophys. Acta18491116–1131. 10.1016/j.bbagrm.2015.04.003
13
DavisC. A.AresM.Jr. (2006). Accumulation of unstable promoter-associated transcripts upon loss of the nuclear exosome subunit Rrp6p in Saccharomyces cerevisiae.Proc. Natl. Acad. Sci. U.S.A.1033262–3267. 10.1073/pnas.0507783103
14
Encode Project ConsortiumBirneyE.StamatoyannopoulosJ. A.DuttaA.GuigoR.GingerasT. R.et al (2007). Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.Nature447799–816. 10.1038/nature05874
15
EngstromP. G.SuzukiH.NinomiyaN.AkalinA.SessaL.LavorgnaG.et al (2006). Complex Loci in human and mouse genomes.PLoS Genet.2:e47. 10.1371/journal.pgen.0020047
16
FlynnR. A.AlmadaA. E.ZamudioJ. R.SharpP. A. (2011). Antisense RNA polymerase II divergent transcripts are P-TEFb dependent and substrates for the RNA exosome.Proc. Natl. Acad. Sci. U.S.A.10810460–10465. 10.1073/pnas.1106630108
17
GilchristD. A.NechaevS.LeeC.GhoshS. K.CollinsJ. B.LiL.et al (2008). NELF-mediated stalling of Pol II can enhance gene expression by blocking promoter-proximal nucleosome assembly.Genes Dev.221921–1933. 10.1101/gad.1643208
18
GingerasT. R. (2007). Origin of phenotypes: genes and transcripts.Genome Res.17682–690. 10.1101/gr.6525007
19
HamazakiN.NakashimaK.ImamuraT. (2017). Manipulation of promoter-associated noncoding RNAs in mouse early embryos for controlling sequence-specific epigenetic status.Methods Mol. Biol.1543271–282. 10.1007/978-1-4939-6716-2_16
20
HanJ.KimD.MorrisK. V. (2007). Promoter-associated RNA is required for RNA-directed transcriptional gene silencing in human cells.Proc. Natl. Acad. Sci. U.S.A.10412422–12427. 10.1073/pnas.0701635104
21
HawkinsP. G.SantosoS.AdamsC.AnestV.MorrisK. V. (2009). Promoter targeted small RNAs induce long-term transcriptional gene silencing in human cells.Nucleic Acids Res.372984–2995. 10.1093/nar/gkp127
22
HeY.VogelsteinB.VelculescuV. E.PapadopoulosN.KinzlerK. W. (2008). The antisense transcriptomes of human cells.Science3221855–1857. 10.1126/science.1163853
23
HenriquesT.GilchristD. A.NechaevS.BernM.MuseG. W.BurkholderA.et al (2013). Stable pausing by RNA polymerase II provides an opportunity to target and integrate regulatory signals.Mol. Cell.52517–528. 10.1016/j.molcel.2013.10.001
24
HuangV.QinY.WangJ.WangX.PlaceR. F.LinG.et al (2010). RNAa is conserved in mammalian cells.PLoS One5:e8848. 10.1371/journal.pone.0008848
25
JacquierA. (2009). The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs.Nat. Rev. Genet.10833–844. 10.1038/nrg2683
26
JanowskiB. A.HuffmanK. E.SchwartzJ. C.RamR.HardyD.ShamesD. S.et al (2005). Inhibiting gene expression at transcription start sites in chromosomal DNA with antigene RNAs.Nat. Chem. Biol.1216–222. 10.1038/nchembio725
27
JanowskiB. A.HuffmanK. E.SchwartzJ. C.RamR.NordsellR.ShamesD. S.et al (2006). Involvement of AGO1 and AGO2 in mammalian transcriptional silencing.Nat. Struct. Mol. Biol.13787–792. 10.1038/nsmb1140
28
JanowskiB. A.YoungerS. T.HardyD. B.RamR.HuffmanK. E.CoreyD. R. (2007). Activating gene expression in mammalian cells with promoter-targeted duplex RNAs.Nat. Chem. Biol.3166–173. 10.1038/nchembio860
29
JiX.ZhouY.PanditS.HuangJ.LiH.LinC. Y.et al (2013). SR proteins collaborate with 7SK and promoter-associated nascent RNA to release paused polymerase.Cell153855–868. 10.1016/j.cell.2013.04.028
30
JonkersI.KwakH.LisJ. T. (2014). Genome-wide dynamics of Pol II elongation and its interplay with promoter proximal pausing, chromatin, and exons.eLife3:e02407. 10.7554/eLife.02407
31
KaikkonenM. U.LamM. T.GlassC. K. (2011). Non-coding RNAs as regulators of gene expression and epigenetics.Cardiovasc. Res.90430–440. 10.1093/cvr/cvr097
32
KapranovP.ChengJ.DikeS.NixD. A.DuttaguptaR.WillinghamA. T.et al (2007). RNA maps reveal new RNA classes and a possible function for pervasive transcription.Science3161484–1488. 10.1126/science.1138341
33
KapranovP.OzsolakF.KimS. W.FoissacS.LipsonD.HartC.et al (2010). New class of gene-termini-associated human RNAs suggests a novel RNA copying mechanism.Nature466642–646. 10.1038/nature09190
34
KassabM. A.MudassirM.SinghA.MuthuramanN.BhagatM.PalanichamyJ. K.et al (2015). Gene silencing and activation of human papillomavirus 18 is modulated by sense promoter associated RNA in Bidirectionally transcribed long control region.PLoS One10:e0128416. 10.1371/journal.pone.0128416
35
KimD. H.VilleneuveL. M.MorrisK. V.RossiJ. J. (2006). Argonaute-1 directs siRNA-mediated transcriptional gene silencing in human cells.Nat. Struct. Mol. Biol.13793–797. 10.1038/nsmb1142
36
LaineJ. P.SinghB. N.KrishnamurthyS.HampseyM. (2009). A physiological role for gene loops in yeast.Genes Dev.232604–2609. 10.1101/gad.1823609
37
LengX. M.DiaoL. T.LiB.BiY. Z.ChenC. J.ZhouH.et al (2014). The ribosomal protein rpl26 promoter is required for its 3′ sense terminus ncRNA transcription in Schizosaccharomyces pombe, implicating a new transcriptional mechanism for ncRNAs.Biochem. Biophys. Res. Commun.44486–91. 10.1016/j.bbrc.2014.01.018
38
LenhardB.SandelinA.CarninciP. (2012). Metazoan promoters: emerging characteristics and insights into transcriptional regulation.Nat. Rev. Genet.13233–245. 10.1038/nrg3163
39
LiB.CareyM.WorkmanJ. L. (2007). The role of chromatin during transcription.Cell128707–719. 10.1016/j.cell.2007.01.015
40
LiL. C.OkinoS. T.ZhaoH.PookotD.PlaceR. F.UrakamiS.et al (2006). Small dsRNAs induce transcriptional activation in human cells.Proc. Natl. Acad. Sci. U.S.A.10317337–17342. 10.1073/pnas.0607015103
41
LiW.NotaniD.RosenfeldM. G. (2016). Enhancers as non-coding RNA transcription units: recent insights and future perspectives.Nat. Rev. Genet.17207–223. 10.1038/nrg.2016.4
42
Lykke-AndersenS.MapendanoC. K.JensenT. H. (2011). An ending is a new beginning: transcription termination supports re-initiation.Cell Cycle10863–865. 10.4161/cc.10.6.14931
43
MaX.HanN.ShaoC.MengY. (2017). Transcriptome-wide discovery of PASRs (promoter-associated small RNAs) and TASRs (terminus-associated small RNAs) in Arabidopsis thaliana.PLoS One12:e0169212. 10.1371/journal.pone.0169212
44
MengY.ShaoC. (2012). Large-scale identification of mirtrons in Arabidopsis and rice.PLoS One7:e31163. 10.1371/journal.pone.0031163
45
MengY.ShaoC.WangH.ChenM. (2011). The regulatory activities of plant microRNAs: a more dynamic perspective.Plant Physiol.1571583–1595. 10.1104/pp.111.187088
46
MetteM. F.AufsatzW.van der WindenJ.MatzkeM. A.MatzkeA. J. (2000). Transcriptional silencing and promoter methylation triggered by double-stranded RNA.EMBO J.195194–5201. 10.1093/emboj/19.19.5194
47
MieleA.DekkerJ. (2008). Long-range chromosomal interactions and gene regulation.Mol. Biosyst.41046–1057. 10.1039/b803580f
48
MorrisK. V.SantosoS.TurnerA. M.PastoriC.HawkinsP. G. (2008). Bidirectional transcription directs both transcriptional gene activation and suppression in human cells.PLoS Genet.4:e1000258. 10.1371/journal.pgen.1000258
49
MuseG. W.GilchristD. A.NechaevS.ShahR.ParkerJ. S.GrissomS. F.et al (2007). RNA polymerase is poised for activation across the genome.Nat. Genet.391507–1511. 10.1038/ng.2007.21
50
NapoliS.PastoriC.MagistriM.CarboneG. M.CatapanoC. V. (2009). Promoter-specific transcriptional interference and c-myc gene silencing by siRNAs in human cells.EMBO J.281708–1719. 10.1038/emboj.2009.139
51
NechaevS.AdelmanK. (2011). Pol II waiting in the starting gates: regulating the transition from transcription initiation into productive elongation.Biochim. Biophys. Acta180934–45. 10.1016/j.bbagrm.2010.11.001
52
NechaevS.FargoD. C.dos SantosG.LiuL.GaoY.AdelmanK. (2010). Global analysis of short RNAs reveals widespread promoter-proximal stalling and arrest of Pol II in Drosophila.Science327335–338. 10.1126/science.1181421
53
NeilH.MalabatC.d’Aubenton-CarafaY.XuZ.SteinmetzL. M.JacquierA. (2009). Widespread bidirectional promoters are the major source of cryptic transcripts in yeast.Nature4571038–1042. 10.1038/nature07747
54
NtiniE.JarvelinA. I.BornholdtJ.ChenY.BoydM.JorgensenM.et al (2013). Polyadenylation site-induced decay of upstream transcripts enforces promoter directionality.Nat. Struct. Mol. Biol.20923–928. 10.1038/nsmb.2640
55
O’SullivanJ. M.Tan-WongS. M.MorillonA.LeeB.ColesJ.MellorJ.et al (2004). Gene loops juxtapose promoters and terminators in yeast.Nat. Genet.361014–1018. 10.1038/ng1411
56
ParkD.MorrisA. R.BattenhouseA.IyerV. R. (2014). Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements.Nucleic Acids Res.423736–3749. 10.1093/nar/gkt1366
57
PickrellJ. K.MarioniJ. C.PaiA. A.DegnerJ. F.EngelhardtB. E.NkadoriE.et al (2010). Understanding mechanisms underlying human gene expression variation with RNA sequencing.Nature464768–772. 10.1038/nature08872
58
PontingC. P.OliverP. L.ReikW. (2009). Evolution and functions of long noncoding RNAs.Cell136629–641. 10.1016/j.cell.2009.02.006
59
PrekerP.AlmvigK.ChristensenM. S.ValenE.MapendanoC. K.SandelinA.et al (2011). PROMoter uPstream Transcripts share characteristics with mRNAs and are produced upstream of all three major types of mammalian promoters.Nucleic Acids Res.397179–7193. 10.1093/nar/gkr370
60
PrekerP.NielsenJ.KammlerS.Lykke-AndersenS.ChristensenM. S.MapendanoC. K.et al (2008). RNA exosome depletion reveals transcription upstream of active human promoters.Science3221851–1854. 10.1126/science.1164096
61
RazinS. V.GavrilovA. A.PichuginA.LipinskiM.IarovaiaO. V.VassetzkyY. S. (2011). Transcription factories in the context of the nuclear and genome organization.Nucleic Acids Res.399085–9092. 10.1093/nar/gkr683
62
SaundersA.CoreL. J.LisJ. T. (2006). Breaking barriers to transcription elongation.Nat. Rev. Mol. Cell Biol.7557–567. 10.1038/nrm1981
63
SchwalbB.MichelM.ZacherB.FruhaufK.DemelC.TreschA.et al (2016). TT-seq maps the human transient transcriptome.Science3521225–1228. 10.1126/science.aad9841
64
SchwartzJ. C.YoungerS. T.NguyenN. B.HardyD. B.MoniaB. P.CoreyD. R.et al (2008). Antisense transcripts are targets for activating small RNAs.Nat. Struct. Mol. Biol.15842–848. 10.1038/nsmb.1444
65
SeilaA. C.CalabreseJ. M.LevineS. S.YeoG. W.RahlP. B.FlynnR. A.et al (2008). Divergent transcription from active promoters.Science3221849–1851. 10.1126/science.1162253
66
SeilaA. C.CoreL. J.LisJ. T.SharpP. A. (2009). Divergent transcription: a new feature of active promoters.Cell Cycle82557–2564. 10.4161/cc.8.16.9305
67
ShaevitzJ. W.AbbondanzieriE. A.LandickR.BlockS. M. (2003). Backtracking by single RNA polymerase molecules observed at near-base-pair resolution.Nature426684–687. 10.1038/nature02191
68
SigovaA. A.MullenA. C.MolinieB.GuptaS.OrlandoD. A.GuentherM. G.et al (2013). Divergent transcription of long noncoding RNA/mRNA gene pairs in embryonic stem cells.Proc. Natl. Acad. Sci. U.S.A.1102876–2881. 10.1073/pnas.1221904110
69
SwiezewskiS.LiuF.MagusinA.DeanC. (2009). Cold-induced silencing by long antisense transcripts of an Arabidopsis Polycomb target.Nature462799–802. 10.1038/nature08618
70
TaftR. J.GlazovE. A.CloonanN.SimonsC.StephenS.FaulknerG. J.et al (2009a). Tiny RNAs associated with transcription start sites in animals.Nat. Genet.41572–578. 10.1038/ng.312
71
TaftR. J.KaplanC. D.SimonsC.MattickJ. S. (2009b). Evolution, biogenesis and function of promoter-associated RNAs.Cell Cycle82332–2338. 10.4161/cc.8.15.9154
72
TaftR. J.HawkinsP. G.MattickJ. S.MorrisK. V. (2011). The relationship between transcription initiation RNAs and CCCTC-binding factor (CTCF) localization.Epigenetics Chromatin4:13. 10.1186/1756-8935-4-13
73
TaftR. J.SimonsC.NahkuriS.OeyH.KorbieD. J.MercerT. R.et al (2010). Nuclear-localized tiny RNAs are associated with transcription initiation and splice sites in metazoans.Nat. Struct. Mol. Biol.171030–1034. 10.1038/nsmb.1841
74
Tan-WongS. M.FrenchJ. D.ProudfootN. J.BrownM. A. (2008). Dynamic interactions between the promoter and terminator regions of the mammalian BRCA1 gene.Proc. Natl. Acad. Sci. U.S.A.1055160–5165. 10.1073/pnas.0801048105
75
Tan-WongS. M.WijayatilakeH. D.ProudfootN. J. (2009). Gene loops function to maintain transcriptional memory through interaction with the nuclear pore complex.Genes Dev.232610–2624. 10.1101/gad.1823209
76
ThomsonD. W.DingerM. E. (2016). Endogenous microRNA sponges: evidence and controversy.Nat. Rev. Genet.17272–283. 10.1038/nrg.2016.20
77
TiwariV. K.McGarveyK. M.LicchesiJ. D.OhmJ. E.HermanJ. G.SchubelerD.et al (2008). PcG proteins, DNA methylation, and gene repression by chromatin looping.PLoS Biol.6:e306. 10.1371/journal.pbio.0060306
78
TufarelliC.StanleyJ. A.GarrickD.SharpeJ. A.AyyubH.WoodW. G.et al (2003). Transcription of antisense RNA leading to gene silencing and methylation as a novel cause of human genetic disease.Nat. Genet.34157–165. 10.1038/ng1157
79
TurunenM. P.LehtolaT.HeinonenS. E.AssefaG. S.KorpisaloP.GirnaryR.et al (2009). Efficient regulation of VEGF expression by promoter-targeted lentiviral shRNAs based on epigenetic mechanism: a novel example of epigenetherapy.Circ. Res.105604–609. 10.1161/CIRCRESAHA.109.200774
80
UesakaM.AgataK.OishiT.NakashimaK.ImamuraT. (2017). Evolutionary acquisition of promoter-associated non-coding RNA (pancRNA) repertoires diversifies species-dependent gene activation mechanisms in mammals.BMC Genomics18:285. 10.1186/s12864-017-3662-1
81
UesakaM.NishimuraO.GoY.NakashimaK.AgataK.ImamuraT. (2014). Bidirectional promoters are the major source of gene activation-associated non-coding RNAs in mammals.BMC Genomics15:35. 10.1186/1471-2164-15-35
82
ValenE.PrekerP.AndersenP. R.ZhaoX.ChenY.EnderC.et al (2011). Biogenic mechanisms and utilization of small RNAs derived from human protein-coding genes.Nat. Struct. Mol. Biol.181075–1082. 10.1038/nsmb.2091
83
WangX.AraiS.SongX.ReichartD.DuK.PascualG.et al (2008). Induced ncRNAs allosterically modify RNA-binding proteins in cis to inhibit transcription.Nature454126–130. 10.1038/nature06992
84
WangX.LaurieJ. D.LiuT.WentzJ.LiuX. S. (2011). Computational dissection of Arabidopsis smRNAome leads to discovery of novel microRNAs and short interfering RNAs associated with transcription start sites.Genomics97235–243. 10.1016/j.ygeno.2011.01.006
85
WeiW.PelechanoV.JarvelinA. I.SteinmetzL. M. (2011). Functional consequences of bidirectional promoters.Trends Genet.27267–276. 10.1016/j.tig.2011.04.002
86
WyersF.RougemailleM.BadisG.RousselleJ. C.DufourM. E.BoulayJ.et al (2005). Cryptic Pol II transcripts are degraded by a nuclear quality control pathway involving a new poly(A) polymerase.Cell121725–737. 10.1016/j.cell.2005.04.030
87
XuZ.WeiW.GagneurJ.PerocchiF.Clauder-MunsterS.CamblongJ.et al (2009). Bidirectional promoters generate pervasive transcription in yeast.Nature4571033–1037. 10.1038/nature07728
88
YanB. X.MaJ. X. (2012). Promoter-associated RNAs and promoter-targeted RNAs.Cell. Mol. Life Sci.692833–2842. 10.1007/s00018-012-0953-1
89
YoungerS. T.CoreyD. R. (2009). The puzzle of RNAs that target gene promoters.Chembiochem101135–1139. 10.1002/cbic.200900015
90
YoungerS. T.CoreyD. R. (2011). Transcriptional regulation by miRNA mimics that target sequences downstream of gene termini.Mol. Biosyst.72383–2388. 10.1039/c1mb05090g
91
YuD.MaX.ZuoZ.ShaoW.WangH.MengY. (2017). Bioinformatics resources for deciphering the biogenesis and action pathways of plant small RNAs.Rice10:38. 10.1186/s12284-017-0177-y
92
YueX.SchwartzJ. C.ChuY.YoungerS. T.GagnonK. T.ElbashirS.et al (2010). Transcriptional regulation by small RNAs at sequences downstream from 3′ gene termini.Nat. Chem. Biol.6621–629. 10.1038/nchembio.400
93
YusE.GuellM.VivancosA. P.ChenW. H.Lluch-SenarM.DelgadoJ.et al (2012). Transcription start site associated RNAs in bacteria.Mol. Syst. Biol.8:585. 10.1038/msb.2012.16
94
ZaramelaL. S.VencioR. Z.ten-CatenF.BaligaN. S.KoideT. (2014). Transcription start site associated RNAs (TSSaRNAs) are ubiquitous in all domains of life.PLoS One9:e107680. 10.1371/journal.pone.0107680
95
ZeitlingerJ.StarkA.KellisM.HongJ. W.NechaevS.AdelmanK.et al (2007). RNA polymerase stalling at developmental control genes in the Drosophila melanogaster embryo.Nat. Genet.391512–1516. 10.1038/ng.2007.26
96
ZhengQ.RowleyM. J.BohmdorferG.SandhuD.GregoryB. D.WierzbickiA. T. (2013). RNA polymerase V targets transcriptional silencing components to promoters of protein-coding genes.Plant J.73179–189. 10.1111/tpj.12034
Summary
Keywords
transcription boundary-associated RNAs (TBARs), promoter-associated RNAs (PARs), terminus-associated RNAs (TARs), divergent transcription, transcription start site (TSS), transcription termination site (TTS), polymerase (Pol) pausing, gene looping
Citation
Yu D, Ma X, Zuo Z, Wang H and Meng Y (2018) Classification of Transcription Boundary-Associated RNAs (TBARs) in Animals and Plants. Front. Genet. 9:168. doi: 10.3389/fgene.2018.00168
Received
29 January 2018
Accepted
26 April 2018
Published
14 May 2018
Volume
9 - 2018
Edited by
Florent Hubé, UMR7216 Epigénétique et Destin Cellulaire, France
Reviewed by
Francesca Aguilo, Umeå University, Sweden; Monica Ballarino, Sapienza Università di Roma, Italy
Updates
Copyright
© 2018 Yu, Ma, Zuo, Wang and Meng.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yijun Meng, mengyijun@zju.edu.cn
† These authors have contributed equally to this work.
This article was submitted to RNA, a section of the journal Frontiers in Genetics
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.