REVIEW article

Front. Genet., 14 May 2018

Sec. RNA

Volume 9 - 2018 | https://doi.org/10.3389/fgene.2018.00168

Classification of Transcription Boundary-Associated RNAs (TBARs) in Animals and Plants

  • College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, China

Abstract

There is increasing evidence suggesting the contribution of non-coding RNAs (ncRNAs) to the phenotypic and physiological complexity of organisms. A novel ncRNA species has been identified near the transcription boundaries of protein-coding genes in eukaryotes, bacteria, and archaea. This review provides a detailed description of these transcription boundary-associated RNAs (TBARs), including their classification. Based on their genomic distribution, TBARs are divided into two major groups: promoter-associated RNAs (PARs) and terminus-associated RNAs (TARs). Depending on the sequence length, each group is further classified into long RNA species (>200 nt) and small RNA species (<200 nt). According to these rules of TBAR classification, divergent ncRNAs with confusing nomenclatures, such as promoter upstream transcripts (PROMPTs), upstream antisense RNAs (uaRNAs), stable unannotated transcripts (SUTs), cryptic unstable transcripts (CUTs), upstream non-coding transcripts (UNTs), transcription start site-associated RNAs (TSSaRNAs), transcription initiation RNAs (tiRNAs), and transcription termination site-associated RNAs (TTSaRNAs), were assigned to specific classes. Although the biogenesis pathways of PARs and TARs have not yet been clearly elucidated, previous studies indicate that some of the PARs have originated either through divergent transcription or via RNA polymerase pausing. Intriguing findings regarding the functional implications of the TBARs such as the long-range “gene looping” model, which explains their role in the transcriptional regulation of protein-coding genes, are also discussed. Altogether, this review provides a comprehensive overview of the current research status of TBARs, which will promote further investigations in this research area.

Introduction

In the initial phase of genome sequencing, annotated protein-coding genes were thought to be a major contributor to the development of phenotypic and physiological complexity of organisms. However, this viewpoint was challenged soon afterwards. The application of high-throughput sequencing (HTS) in transcriptomics studies uncovered the intriguing phenomenon in which transcription signals were detected across the entire genomes (Encode Project Consortium et al., 2007). Notably, most of these signals originated from non-coding loci but not from protein-coding genes. Subsequent studies have revealed a positive correlation between the diversity of non-coding RNAs (ncRNAs) and the complexity of organisms (Amaral and Mattick, 2008; Amaral et al., 2008). Considering the emerging biological importance of ncRNAs, continuous research efforts have led to the identification and characterization of ncRNAs in multiple organisms. Although a large proportion of ncRNAs resides within intergenic regions, some reside within protein-coding genes, such as antisense ncRNAs (He et al., 2008) and intronic ncRNAs (Brown et al., 2008; Meng et al., 2011; Meng and Shao, 2012). Additionally, some of the ncRNAs have been identified near the promoter regions or the transcription termini of protein-coding genes (Gingeras, 2007; Kapranov et al., 2007).

In this review, we provide a brief summary of the discovery of promoter-associated RNAs (PARs) in budding yeast (Saccharomyces cerevisiae) (Davis and Ares, 2006; Neil et al., 2009; Xu et al., 2009), fruit fly (Drosophila melanogaster), chicken (Gallus gallus) (Taft et al., 2009a), mouse (Mus musculus) (Kapranov et al., 2007; Seila et al., 2008; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009; Flynn et al., 2011), human (Homo sapiens) (Han et al., 2007; Kapranov et al., 2007; Preker et al., 2008; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009; Pickrell et al., 2010), Arabidopsis thaliana (Chekanova et al., 2007; Wang et al., 2011; Ma et al., 2017), bacteria (Yus et al., 2012), and archaea (Zaramela et al., 2014). Additionally, the discovery of terminus-associated RNAs (TARs) in human (Kapranov et al., 2007, 2010; Taft et al., 2009a; Yue et al., 2010; Valen et al., 2011; Schwalb et al., 2016), mouse (Kapranov et al., 2007), fission yeast (Schizosaccharomyces pombe), and Arabidopsis (Ma et al., 2017) has also been introduced. Since promoters and terminators define the transcription boundaries of protein-coding genes, PARs and TARs are referred to as transcription boundary-associated RNAs (TBARs). Additionally, we summarize the biogenesis pathways of some of the TBARs and a few mechanistic models of TBAR biogenesis including divergent transcription and RNA polymerase (Pol) pausing. We also summarize recent progress in the functional studies conducted on TBARs, followed by in-depth discussions. Interestingly, PARs have been functionally implicated in transcription termination, whereas TARs are potentially involved in transcription initiation. The interactive relationship between transcription initiation and termination has been depicted using long-range regulatory models, such as “gene looping” (Lykke-Andersen et al., 2011; Andersen et al., 2013). Finally, we propose a preliminary nomenclature system for TBARs according to specific classification criteria and emphasize the importance of a uniform annotation system for TBAR research.

PARs in Animals: Discovery, Biogenesis, and Functions

In eukaryotes, the transcription of protein-coding genes includes two primary steps: initiation and elongation. Each step is under strict surveillance. However, each time transcription is initiated from the transcription start site (TSS) of a protein-coding gene, the final output is not always a full-length coding sequence. This is because various kinds of genetic and epigenetic factors, such as binding sites of RNA Pol II or transcription factors (TFs) and chromatin modifications, interrupt transcription initiation and/or elongation (Li et al., 2007; Razin et al., 2011). Premature termination of transcription is a widespread phenomenon that results in aberrant transcripts. As a result of frequent transcription pausing, diverse ncRNAs have been discovered near the promoter regions. Although such ncRNAs have been referred to by different names, here they are referred to as PARs.

In a pioneering study, Core et al. (2008) employed global run-on sequencing (GRO-Seq) to investigate the density of transcriptionally engaged RNA polymerase enzymes on a genome-wide scale. This study identified a novel species of PARs in human lung fibroblasts. However, because this study used an older generation sequencing platform with short read-length (maximum 33 nt), data on the length of PARs was not provided. These PARs are enriched within -250 bp to +50 bp of the TSSs of certain protein-coding genes. Because GRO-Seq is capable of evaluating promoter-proximal pausing on all genes, it has been suggested that these PARs are derived from RNA Pol II-dependent divergent transcription and promoter-proximal pausing (Core et al., 2008). Subsequent studies have uncovered PARs in different species. Because of recent advances in sequencing platforms, more information is available on the sequence features of PARs, especially their length distribution. In this review, we propose that, according to their sequence length, PARs should to be divided into two classes: promoter-associated long RNAs (PALRs; >200 nt) and promoter-associated small RNAs (PASRs; <200 nt).

Promoter-associated small RNAs have been reported in human and mouse (Kapranov et al., 2007; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009). TSS-associated RNAs (TSSaRNAs) reported in mouse (Seila et al., 2008) and transcription initiation RNAs (tiRNAs) in human, mouse, chicken and Drosophila (Taft et al., 2009a) have also been categorized as PASRs. Based on these studies, it can be concluded that weak expression is a common feature of PASRs. A number of PARs are exosome-specific substrates with short half-lives (Wyers et al., 2005; Davis and Ares, 2006; Chekanova et al., 2007; Preker et al., 2008). The divergent distribution of many PARs near the TSSs is correlated with bidirectional transcription activity of RNA Pol II. Functional studies have connected some of the PARs with transcriptional activation or repression of their target genes (Li et al., 2006; Janowski et al., 2007; Morris et al., 2008; Turunen et al., 2009; Huang et al., 2010; Kassab et al., 2015; Li et al., 2016; Uesaka et al., 2017), indicating that transcription can be modulated not only during elongation but also at initiation (Saunders et al., 2006; Muse et al., 2007; Zeitlinger et al., 2007; Core et al., 2008). In the following sections, we describe the previously reported PARs in animals and their biogenesis pathways and biological functions.

PALRs in Animals

Different Classes of PALRs

As summarized in Table 1, different classes of PALRs have been reported in several organisms, including promoter-associated transcripts in human (Han et al., 2007) and yeast (Davis and Ares, 2006), promoter upstream transcripts (PROMPTs) in human (Preker et al., 2008, 2011), upstream antisense RNAs (uaRNAs) in mouse (Flynn et al., 2011), stable unannotated transcripts (SUTs) and cryptic unstable transcripts (CUTs) in yeast (Wyers et al., 2005; Neil et al., 2009; Xu et al., 2009), and other unnamed PALRs in human and mouse (Kapranov et al., 2007; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009). Till date, only a few studies have been conducted on the biogenesis pathways and modes of action of PALRs. Thus, it is unclear whether these PALRs, with distinct nomenclatures, belong to the same or different ncRNA species. It is well known that divergent transcription of active promoters is the major mechanism involved in the biogenesis of PALRs (Sigova et al., 2013). Among the above-mentioned PALRs, PROMPTs are relatively well studied, especially for their sequence characteristics, biogenesis and biological roles. Therefore, we describe the PROMPTs in detail in the sections below.

Table 1

TBAR speciesDiscoveryLengthReference
PARsPALRsPaRNAsH. sapiens and S. cerevisiae200–500 ntDavis and Ares, 2006; Han et al., 2007
PROMPTsH. sapiensHundreds of ntPreker et al., 2008; Preker et al., 2011
UaRNAsM. musculus40–1,100 ntFlynn et al., 2011
SUTs and CUTsS. cerevisiae200–500 ntWyers et al., 2005; Neil et al., 2009; Xu et al., 2009
UNTsA. thalianaChekanova et al., 2007
PALRsH. sapiens and M. musculusKapranov et al., 2007; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009
PASRsPASRsH. sapiens, M. musculus and A. thaliana19–200 ntKapranov et al., 2007; Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009; Wang et al., 2011; Ma et al., 2017
TSSaRNAsEukaryotes, bacteria, and archaea16–146 ntSeila et al., 2008; Yus et al., 2012; Park et al., 2014; Zaramela et al., 2014
TiRNAsH. sapiens, M. musculus, G. gallus, and D. melanogaster13–28 ntTaft et al., 2009a
TARsTALRsTALRsH. sapiens∼3,100 ntYue et al., 2010
TASRsTASRsH. sapiens, M. musculus, S. pombe, and A. thaliana19–200 ntKapranov et al., 2007, 2010; Taft et al., 2009a; Leng et al., 2014; Schwalb et al., 2016; Ma et al., 2017
TTSaRNAsH. sapiens22–24 ntValen et al., 2011

List of the currently identified TBARs (transcription boundary-associated RNAs).

TBARs, transcription boundary-associated RNAs; PARs, promoter-associated RNAs; TARs, terminus-associated RNAs; PALRs, promoter-associated long RNAs; PASRs, promoter-associated small RNAs; TALRs, terminus-associated long RNAs; TASRs, terminus-associated small RNAs; paRNAs, promoter-associated RNAs; PROMPTs, promoter upstream transcripts; uaRNAs, upstream antisense RNAs; SUTs, stable unannotated transcripts; CUTs, cryptic unstable transcripts; UNTs, upstream non-coding transcripts; TSSaRNAs, transcription start site-associated RNAs; tiRNAs, transcription initiation RNAs; TTSaRNAs, transcription termination site-associated RNAs.

Discovery of PROMPTs

Promoter-associated RNAs have short half-lives, as they undergo rapid exosome-mediated degradation, which interferes with their detection. To overcome this limitation, Preker et al. (2008, 2011) used human HeLa cells without exosome activity for the identification of short-lived PARs using oligodT primers. Results of tiling microarray analysis revealed 5′ capped and 3′ adenylated transcripts upstream of the TSSs of transcriptionally active genes (Preker et al., 2008, 2011). These transcripts were several hundred nucleotides in length and were named as PROMPTs.

Unlike PARs discovered by Core et al. (2008), PROMPTs originate from the region within -2,500 bp to -500 bp upstream of the TSS. Moreover, the transcription of PROMPTs can proceed in both directions (Preker et al., 2008). It is widely accepted that divergent transcription is an inherent feature of most of the active promoters (Beck and Warren, 1988; Neil et al., 2009; Xu et al., 2009). Moreover, the production of PROMPTs is highly dependent on activity of promoters of neighboring genes (Preker et al., 2008). Thus, the bidirectional transcription of PROMPTs reflects the universality of divergent transcription of highly active genes, which links bidirectional transcription to gene activity (Core et al., 2008; Nechaev and Adelman, 2011).

Functions of PROMPTs

Functional studies suggest that bidirectional transcription plays an important role in improving the accessibility of chromatin regions for the binding of TFs (Gilchrist et al., 2008; Seila et al., 2008). Notably, some of the divergently transcribed PARs are involved in Argonaute (AGO)-dependent (Han et al., 2007) or -independent (Wang et al., 2008) gene silencing pathways, indicating a potential regulatory role of PROMPTs in gene transcription. The SR proteins facilitate the function of promoter-proximal nascent RNA in transcription pause release (Ji et al., 2013). Additionally, some of the PROMPTs are enriched within the promoter regions with a high CpG content, and these PROMPTs affect the DNA methylation density of the promoters (Preker et al., 2008). Sequence motif analysis has shown that 3′ poly(A) signals are more abundant in regions upstream of the promoter than in those downstream of the promoter. These functional poly(A) signals are involved in the rapid degradation of PROMPTs transcribed in the antisense direction upstream of the associated genes. Thus, the rapid decay of upstream PROMPTs enables efficient elongation of downstream transcripts, which enforces the promoter orientation of protein-coding genes (Ntini et al., 2013). Moreover, if the synthesis of PROMPTs is stalled very early within the TSS-proximal regions, small TSSaRNAs are produced. Small TSSaRNAs are included in the category of PASRs and are discussed below.

PASRs in Animals

Discovery of PASRs

Promoter-associated small RNAs (sRNAs) identified in human and mouse have been previously named as PASRs (Calabrese et al., 2007; Kapranov et al., 2007). These sRNAs are 20–200 nt in length and originate from the region within -400 bp to +400 bp of the TSS. TSSaRNAs are a type of PASRs that have been identified in a broad spectrum of organisms, including eukaryotes (Seila et al., 2008; Park et al., 2014), bacteria (Yus et al., 2012), and archaea (Zaramela et al., 2014). The length of TSSaRNAs differs between species. For example, in the archaea Halobacterium salinarum, the length of TSSaRNAs ranges from 16 to 146 nt, with a median size of 27 nt (Zaramela et al., 2014). In bacteria, the length of TSSaRNAs varies within a narrow range; in Escherichia coli and Mycoplasma pneumonia, the length of TSSaRNAs varies from 33 to 40 nt and 35 to 55 nt, respectively (Yus et al., 2012). In murine embryonic stem cells, TSSaRNAs vary in size from 20 to 90 nt. These TSSaRNAs are highly enriched in the region spanning from -250 bp to +50 bp of the TSS (Seila et al., 2008). Thus, sequence features of mouse TSSaRNAs are similar to those described above. Indeed, TSSaRNAs were occasionally confused with the PASRs (Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project, 2009). Another class of PASRs comprises tiRNAs; these are highly conserved in higher metazoans, such as human, chicken, and fruit fly. Similar to the above two PASR classes, tiRNAs are enriched within the region spanning from -60 bp to +120 bp relative to the TSS. However, additional features specific to the tiRNAs have been reported, including their size distribution and GC content (Taft et al., 2009a). The size of tiRNAs varies from 13 to 28 nt, and a significant proportion of these are 18nt in length. Additionally, tiRNAs are GC-rich and show a strong strand bias toward the neighboring TSSs for their biogenesis.

Biogenesis of PASRs

The biogenesis of the three types of PASRs described above is Dicer-independent (Calabrese et al., 2007; Seila et al., 2008; Taft et al., 2009a), indicating that the non-canonical sRNA processing pathway(s) might be responsible for their production. Two models have been proposed for the biogenesis of PASRs (Lenhard et al., 2012). The first model is called “backtracking and excision” (Shaevitz et al., 2003; Seila et al., 2009; Taft et al., 2009a,b, 2010; Nechaev et al., 2010). In this model, the elongating RNA Pol II is stalled, inducing it to backtrack toward the upstream TSS after encountering the downstream nucleosome, resulting in a nascent transcript with a short 3′ exposed region. Subsequently, the exposed region is cleaved by the transcription elongation factor SII, resulting in PASR biogenesis. However, in vitro experiments have shown that the pausing and backtracking of Pol II is used to generate 6–14 nt fragments (Lenhard et al., 2012); however, this length is much shorter than that of PASRs. Therefore, further investigation is needed to identify a plausible model. The second model proposed for PASR biogenesis is the “Pol II pausing” model, which is also described as “unsuccessful Pol II elongation followed by RNA degradation” (Buckley et al., 2014; Jonkers et al., 2014). In this model, the RNA elongation complex is stalled at the initiation stage. Pol II pausing results in a nascent transcript without the 5′ cap, thus rendering it susceptible to rapid decay starting from its 5′ end. Only a short RNA sequence physically covered by Pol II is protected from degradation (Valen et al., 2011). The size of this short RNA sequence varies from17 to 22 nt, which fits well within the median size of PASRs. Notably, the above two models are probably mutually non-exclusive, since different PASR classes may use distinct pathways for their biogenesis.

Functions of PASRs

Recent studies have revealed valuable functional implications of PASRs. Depending on the association with certain long non-coding RNAs (lncRNAs) and protein factors, some of the PASRs target specific promoter regions of genes for epigenetic modifications that affect gene transcription (Hamazaki et al., 2017). However, altered transcriptional activity of target genes, in turn, causes fluctuations in PASR biogenesis. Thus, a feedback regulatory loop is established between PASRs and their target genes (Yan and Ma, 2012).

TSSaRNAs have been shown to play an important role in the tissue-specific epigenetic regulation of transcription initiation of target genes (Seila et al., 2008; Taft et al., 2009a, 2011; Henriques et al., 2013). In bacteria, TSSaRNAs are involved in the assembly of functional complexes that enable full-length transcription of their target genes, thus avoiding immature transcription initiation (Yus et al., 2012). The tiRNAs are produced from the promoter regions of highly expressed genes. These sRNAs are associated with chromatin marks representative of active transcription (Taft et al., 2009a, 2011). Specifically, approximately 96% of tiRNAs discovered in human embryonic stem cells overlap with histone H3K4 methylation marks (Taft et al., 2009a). Notably, some of these chromatin marks are generated via the tiRNA-dependent pathway and play important biological roles. For example, in both human and mouse, some of the tiRNAs induce local epigenetic modifications that modulate the localization of TFs that bind to CCCTC motifs (Taft et al., 2011).

TARs in Animals: Discovery, Biogenesis, and Functions

In addition to the PARs, transcription signals have also been detected at the 3′ ends of protein-coding genes (Wei et al., 2011) (Table 1). In contrast to PARs, reports on TARs, especially terminus-associated long RNAs (TALRs), are limited. In one case of TALRs, Yue et al. (2010) discovered several sense ncRNAs of ∼3,100 nt at the 3′ end of a progesterone receptor gene. In another study using human K562 cells, Schwalb et al. (2016) detected transient RNAs downstream of the poly(A) sites using transient transcriptome sequencing (TT-Seq), although the length of these transient RNAs was not described in detail.

The terminus-associated small RNAs (TASRs) ranging from 22 to 200 nt in length are highly conserved in human and mouse (Kapranov et al., 2007, 2010). Traces of sRNAs at the 3′ ends of many animal genes have been reported (Taft et al., 2009a). In fission yeast, the sense TASR snR49 has been detected in the 3′ region of the ribosomal protein-coding gene RPL26 (Leng et al., 2014). Another type of TASRs has been referred to as transcription termination site-associated RNAs (TTSaRNAs) (Valen et al., 2011). The enrichment of AGO-associated sRNAs within the 3′ untranslated regions (UTRs) of protein-coding genes has been observed in human (Valen et al., 2011); these TTSaRNAs typically vary in size from 22 to 24 nt.

Like PASRs, the genomic distribution of TASRs does not indicate a strand-specific bias. Some of the TASRs with non-genomically encoded 5′ poly(U) tails have been reported in human cells (Kapranov et al., 2010). Because these TASRs are antisense to poly(A)-tailed transcripts and are close to the transcription termination sites, these are suggested to be synthesized via an as-yet-unidentified RNA-dependent RNA polymerase (RDR)-dependent pathway. Another mechanism of TASR biogenesis has been reported for the 3′ sense terminus-associated ncRNA snR49 (Leng et al., 2014). Results of scanning deletion analysis within the promoter region reveal that the transcription of snR49 is dependent on the promoter activity of the upstream gene RPL26. Similarly, the transcription of another 3′ sense terminus-associated ncRNA, snR93, relies on the promoter activity of the upstream RPL29 gene. These data suggest that this kind of transcriptional regulation is a conserved mechanism underlying TASR biogenesis (Leng et al., 2014).

To investigate the regulatory roles of TARs overlapping the 3′ termini of protein-coding genes in the human genome, Younger and Corey (2011) designed small duplex RNAs called “microRNA (miRNA) mimics,” which were perfectly complementary to the TARs. The targeting of TARs by “miRNA mimics” affects upstream gene transcription in a dose-dependent manner, highlighting a previously unappreciated role of gene termini and their associated TARs in transcriptional regulation (Younger and Corey, 2011). On the other hand, artificial constructs expressing 3′ UTRs containing miRNA-binding sites act as miRNA sponges (Thomson and Dinger, 2016), indicating that some of the TALRs might function as endogenous miRNA sponges. Another widely accepted model called “gene looping” elucidates TAR-mediated interaction between the 5′ promoters and 3′ terminators. This model provides an unconventional view of transcriptional regulation on a long-range scale (O’Sullivan et al., 2004; Tan-Wong et al., 2008; Tiwari et al., 2008; Yue et al., 2010). The long-range “gene looping” brings the two ends of a gene in close proximity. The sRNAs complementary to the 3′ sense TALRs recruit AGO2 and other protein factors to the 5′–3′ interaction region of the gene, thus modulating its transcription initiation (Yue et al., 2010). This “gene looping” also provides a long-range scaffold, thus enabling communication between endogenous TASRs and promoters of upstream genes (Younger and Corey, 2011).

TBARs: The Emerging RNA Species in Plants

Discovery of Plant TBARs

Transcription boundary-associated RNAs have also been discovered in plants, although most of the reports are related to PARs. In Arabidopsis, a striking association of UNTs with promoters of many protein-coding genes has been observed (Chekanova et al., 2007). In yeast, characteristics of UNTs are similar to those of CUTs (Wyers et al., 2005; Neil et al., 2009; Xu et al., 2009). For example, both UNTs and CUTs are weakly expressed, 3′ poly(A)-tailed, and degraded via the exosome-mediated pathway. Notably, the 5′ ends of UNTs are coincident with those of full-length mRNAs transcribed from the identical promoters.

A survey of PASRs in metazoa and Arabidopsis has shown that, unlike metazoans, PASR-like peak is not detectable within the promoter regions of Arabidopsis genes (Taft et al., 2009b). Data of this survey also suggest that 18-nt tiRNAs are absent in plants. There are two possible explanations for these observations. First, it is possible that the tiRNA biogenesis pathway either has been lost or never existed in Arabidopsis. Second, it is possible that once tiRNAs are produced, they are subjected to rapid degradation (Taft et al., 2009b). The presence of PARs and TARs in Arabidopsis has been revisited using many more HTS data sets (Wang et al., 2011). Unlike the previous study (Taft et al., 2009b), Wang et al. (2011) detected PASR peaks in regions surrounding the TSSs of non-transposable element (TE) genes. However, in these data sets, a total of 17,000 non-TE genes were treated as a whole group for PASR signal detection, and an in-depth investigation was not performed. Therefore, a detailed list of genes producing PASRs is not available. Recently, we identified hundreds of protein-coding genes in Arabidopsis with detectable PASR signals (Ma et al., 2017). Similar to the result of Taft et al. (2009a), the 18-nt tiRNA-like PASRs were also rarely detected in our study. Some of the PASRs accumulated in a tissue-specific manner. Additionally, we observed TASR peaks surrounding the transcription termini of many protein-coding genes (Ma et al., 2017). Both PASRs and TASRs vary in length from 23 to 24 nt and preferentially start with either an adenine (A) or a uracil (U) residue at their 5′ ends.

Biogenesis of Plant TBARs

In Arabidopsis, PASRs are located either in upstream or downstream regions of the TSSs (Wang et al., 2011; Ma et al., 2017). In animals, divergent transcription (Beck and Warren, 1988; Core et al., 2008; He et al., 2008; Seila et al., 2008, 2009; Flynn et al., 2011) and bidirectional promoters (Morris et al., 2008; Xu et al., 2009; Wei et al., 2011; Uesaka et al., 2014) have been proposed as the mechanisms of PASR biogenesis. Whether these models explain the biogenesis of PASRs in plants needs further investigation. There is no clear model for the biogenesis of TASRs in animals. It has been suggested that some of the TASRs, especially those that areantisense, are generated through a specific RDR-dependent pathway. Our study in Arabidopsis shows that the accumulation of some TBARs, including PASRs and TASRs, is highly dependent on the activities of RNA Pol IV, RDR2/6, and DCL2/3/4 (Ma et al., 2017).

According to the distribution patterns of TSSs in vertebrates, gene promoters are classified into two categories: sharp and broad. A sharp promoter always has a single predominant TSS, whereas a broad promoter usually has an array of almost equivalent TSSs (Valen et al., 2011; Danino et al., 2015). Interestingly, sharp promoters generate fairly narrow PASR peaks, whereas broad promoters generate PASRs with a much wider distribution range. Notably, in Arabidopsis, both sharp and broad PASR peaks are observed on different gene promoters (Ma et al., 2017). Thus, investigating the relationship between the promoter type and PASR peak shape in plants will be useful.

Functions of Plant TBARs

Few studies have been conducted on the regulatory roles of TBARs in plants. However, studies in animals provide valuable hints on the functions of TBARs in plants. Using the human PR gene as a model, Core et al. (2008) have shown that small duplex RNAs complementary to the ncRNAs originating from the promoter or terminus of the PR gene can efficiently regulate gene transcription. Theoretically, manually designed sRNAs recruit the AGO2 protein to the complementary target ncRNAs (i.e., PALRs or TALRs), which alters the transcription status of the PR gene via the “gene looping” mechanism (Janowski et al., 2005, 2006, 2007; Schwartz et al., 2008; Younger and Corey, 2009; Yue et al., 2010). This regulatory mechanism is conserved in animals (Janowski et al., 2006; Kim et al., 2006; Han et al., 2007; Schwartz et al., 2008; Napoli et al., 2009; Chu et al., 2010). Two research areas need further investigation: first, whether the plant endogenous PASRs and TASRs function in a similar manner as the manually designed sRNAs reported in animals; and second, whether PALRs and TALRs are recognized as targets of PASRs and TASRs, respectively, in plants. The involvement of TBARs in site-specific DNA methylation has been observed in both plants and animals (Mette et al., 2000; Hawkins et al., 2009; Swiezewski et al., 2009; Zheng et al., 2013; Ma et al., 2017), indicating that it is an important pathway for TBAR-mediated regulation of gene transcription.

Concluding Remarks and Perspectives

Need for a Uniform Annotation System for TBAR Research

Owing to the heterogeneity of the TBAR population, many aspects of TBARs including sequence characteristics, biogenesis pathways and biological functions are poorly understood. One of the most basic and pressing issues derives from their classification. Some of the TBARs such as PASRs, TSSa-RNAs, and tiRNAs exhibit similar genomic locations and sizes in animals. It is unclear whether these sRNAs belong to distinct classes or to the same class (Jacquier, 2009). It is also unclear whether their biogenesis pathways and action modes overlap. Although some of the TBARs are likely to represent “transcription noise,” increasing cases are being reported that emphasize the non-negligible roles of certain TBARs in gene expression regulation. Hence, a uniform annotation system is required to facilitate in-depth research on TBARs. Some criteria have been proposed previously to determine the classification of PARs, including size ranges and positions relative to the TSSs (Kaikkonen et al., 2011; Lenhard et al., 2012). Here, we propose a preliminary system for TBAR annotation (Figure 1). In this system, we first classified TBARs, based on their genomic position, as PARs and TARs. Then, based on their sequence length, both PARs and TARs were divided into long (>200 nt) and short (<200 nt) species. Using this annotation system, most of the recently reported TBARs could be assigned to one of the categories. For further classification, we suggest that additional important features of TBARs, such as biogenesis pathways, modes of action, and biological functions, should be taken into consideration.

FIGURE 1

Future Challenges in TBAR Research

To advance our understanding of TBARs, further research is needed in a few areas described below. First, it has been suggested that sRNAs mapped to the transcription boundaries of protein-coding genes are degraded remnants of mRNAs (Schwalb et al., 2016). However, considering the non-random enrichment of TBARs with specific sequence length, such as the animal tiRNAs (18 nt) (Taft et al., 2009a) and plant PASRs and TASRs (23–24 nt) (Ma et al., 2017), at least some of the TBARs are unlikely to be generated through random mRNA decay. Nonetheless, a large proportion of TBARs are subjected to rapid degradation after their maturation. According to previous studies (Davis and Ares, 2006; Chekanova et al., 2007; Preker et al., 2008), mutants of RNA decay pathways (e.g., exosome-depleted cell lines) represent promising options for efficient TBAR cloning.

Second, recent evidence shows that a genetic mutation within a protein-coding locus may affect not only the gene of interest but also the associated ncRNAs. Notably, some of these affected ncRNAs, in turn, alter the chromatin state through long-range interactions (Tufarelli et al., 2003; Wei et al., 2011). Thus, when investigating the biological consequences of a mutation within a specific genomic locus, it is necessary to consider the effects on the associated ncRNAs.

Third, as mentioned above, the high frequency of poly(A) signals on PROMPTs is related to their rapid degradation, which has been proposed as one of the mechanisms to ensure unidirectional transcription elongation (Ntini et al., 2013). Since most of the mammalian promoters are bidirectional (Core et al., 2008; Seila et al., 2008; Sigova et al., 2013), functional studies on the involvement of PARs in transcription determination are necessary. Till date, three key factors have been proposed as important players in transcription determination (Wei et al., 2011), including the nucleotide composition, chromatin modifications of promoters, and the “gene looping” mechanism. The nucleotide composition within the promoter region affects the directionality of the promoter (Engstrom et al., 2006). In yeast, the TATA element imposes a constraint on the direction of transcription initiation (Park et al., 2014). Additionally, chromatin modifications induced by the promoter-associated transcription of the ncRNAs may serve as codes for the orientation of transcription elongation. Moreover, the “gene looping” model suggests that long-range interactions link the promoter to its favored 3′ end, thus determining the orientation of transcription elongation (Miele and Dekker, 2008; Laine et al., 2009; Tan-Wong et al., 2009). However, detailed mechanisms of unidirectional transcription elongation require further investigation.

Fourth, in addition to the TBARs associated with protein-coding genes, whether TBARs exist on lncRNAs needs to be revisited. Conserved secondary structures are detected at the ends of certain lncRNAs (Ponting et al., 2009; Yu et al., 2017); however, whether these local structures serve as precursors of PASRs or TASRs remains elusive.

Overall, investigations into all of the above-mentioned challenging but intriguing research areas are needed. Results of these investigations are expected to increase our knowledge of TBARs.

Statements

Author contributions

DY, XM, HW, and YM: wrote the manuscript. ZZ, DY, and XM: prepared the figure and table. All authors read and approved the final manuscript.

Funding

This work was funded by the National Natural Science Foundation of China (31571349), (31601062), and (31771436). Zhejiang Provincial Natural Science Foundation of China (LY15C060006) and (LQ16C060003). Hangzhou Scientific and Technological Program (20170432B04).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    Affymetrix Encode Transcriptome Project and Cold Spring Harbor Laboratory Encode Transcriptome Project (2009). Post-transcriptional processing generates a diversity of 5′-modified long and short RNAs.Nature45710281032. 10.1038/nature07759

  • 2

    AmaralP. P.DingerM. E.MercerT. R.MattickJ. S. (2008). The eukaryotic genome as an RNA machine.Science31917871789. 10.1126/science.1155472

  • 3

    AmaralP. P.MattickJ. S. (2008). Noncoding RNA in development.Mamm. Genome19454492. 10.1007/s00335-008-9136-7

  • 4

    AndersenP. K.JensenT. H.Lykke-AndersenS. (2013). Making ends meet: coordination between RNA 3′-end processing and transcription initiation.Wiley Interdiscip. Rev. RNA4233246. 10.1002/wrna.1156

  • 5

    BeckC. F.WarrenR. A. (1988). Divergent promoters, a common form of gene organization.Microbiol. Rev.52318326.

  • 6

    BrownJ. W.MarshallD. F.EcheverriaM. (2008). Intronic noncoding RNAs and splicing.Trends Plant Sci.13335342. 10.1016/j.tplants.2008.04.010

  • 7

    BuckleyM. S.KwakH.ZipfelW. R.LisJ. T. (2014). Kinetics of promoter Pol II on Hsp70 reveal stable pausing and key insights into its regulation.Genes Dev.281419. 10.1101/gad.231886.113

  • 8

    CalabreseJ. M.SeilaA. C.YeoG. W.SharpP. A. (2007). RNA sequence analysis defines Dicer’s role in mouse embryonic stem cells.Proc. Natl. Acad. Sci. U.S.A.1041809718102. 10.1073/pnas.0709193104

  • 9

    ChekanovaJ. A.GregoryB. D.ReverdattoS. V.ChenH.KumarR.HookerT.et al (2007). Genome-wide high-resolution mapping of exosome substrates reveals hidden features in the Arabidopsis transcriptome.Cell13113401353. 10.1016/j.cell.2007.10.056

  • 10

    ChuY.YueX.YoungerS. T.JanowskiB. A.CoreyD. R. (2010). Involvement of argonaute proteins in gene silencing and activation by RNAs complementary to a non-coding transcript at the progesterone receptor promoter.Nucleic Acids Res.3877367748. 10.1093/nar/gkq648

  • 11

    CoreL. J.WaterfallJ. J.LisJ. T. (2008). Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters.Science32218451848. 10.1126/science.1162228

  • 12

    DaninoY. M.EvenD.IdesesD.Juven-GershonT. (2015). The core promoter: at the heart of gene expression.Biochim. Biophys. Acta184911161131. 10.1016/j.bbagrm.2015.04.003

  • 13

    DavisC. A.AresM.Jr. (2006). Accumulation of unstable promoter-associated transcripts upon loss of the nuclear exosome subunit Rrp6p in Saccharomyces cerevisiae.Proc. Natl. Acad. Sci. U.S.A.10332623267. 10.1073/pnas.0507783103

  • 14

    Encode Project ConsortiumBirneyE.StamatoyannopoulosJ. A.DuttaA.GuigoR.GingerasT. R.et al (2007). Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.Nature447799816. 10.1038/nature05874

  • 15

    EngstromP. G.SuzukiH.NinomiyaN.AkalinA.SessaL.LavorgnaG.et al (2006). Complex Loci in human and mouse genomes.PLoS Genet.2:e47. 10.1371/journal.pgen.0020047

  • 16

    FlynnR. A.AlmadaA. E.ZamudioJ. R.SharpP. A. (2011). Antisense RNA polymerase II divergent transcripts are P-TEFb dependent and substrates for the RNA exosome.Proc. Natl. Acad. Sci. U.S.A.1081046010465. 10.1073/pnas.1106630108

  • 17

    GilchristD. A.NechaevS.LeeC.GhoshS. K.CollinsJ. B.LiL.et al (2008). NELF-mediated stalling of Pol II can enhance gene expression by blocking promoter-proximal nucleosome assembly.Genes Dev.2219211933. 10.1101/gad.1643208

  • 18

    GingerasT. R. (2007). Origin of phenotypes: genes and transcripts.Genome Res.17682690. 10.1101/gr.6525007

  • 19

    HamazakiN.NakashimaK.ImamuraT. (2017). Manipulation of promoter-associated noncoding RNAs in mouse early embryos for controlling sequence-specific epigenetic status.Methods Mol. Biol.1543271282. 10.1007/978-1-4939-6716-2_16

  • 20

    HanJ.KimD.MorrisK. V. (2007). Promoter-associated RNA is required for RNA-directed transcriptional gene silencing in human cells.Proc. Natl. Acad. Sci. U.S.A.1041242212427. 10.1073/pnas.0701635104

  • 21

    HawkinsP. G.SantosoS.AdamsC.AnestV.MorrisK. V. (2009). Promoter targeted small RNAs induce long-term transcriptional gene silencing in human cells.Nucleic Acids Res.3729842995. 10.1093/nar/gkp127

  • 22

    HeY.VogelsteinB.VelculescuV. E.PapadopoulosN.KinzlerK. W. (2008). The antisense transcriptomes of human cells.Science32218551857. 10.1126/science.1163853

  • 23

    HenriquesT.GilchristD. A.NechaevS.BernM.MuseG. W.BurkholderA.et al (2013). Stable pausing by RNA polymerase II provides an opportunity to target and integrate regulatory signals.Mol. Cell.52517528. 10.1016/j.molcel.2013.10.001

  • 24

    HuangV.QinY.WangJ.WangX.PlaceR. F.LinG.et al (2010). RNAa is conserved in mammalian cells.PLoS One5:e8848. 10.1371/journal.pone.0008848

  • 25

    JacquierA. (2009). The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs.Nat. Rev. Genet.10833844. 10.1038/nrg2683

  • 26

    JanowskiB. A.HuffmanK. E.SchwartzJ. C.RamR.HardyD.ShamesD. S.et al (2005). Inhibiting gene expression at transcription start sites in chromosomal DNA with antigene RNAs.Nat. Chem. Biol.1216222. 10.1038/nchembio725

  • 27

    JanowskiB. A.HuffmanK. E.SchwartzJ. C.RamR.NordsellR.ShamesD. S.et al (2006). Involvement of AGO1 and AGO2 in mammalian transcriptional silencing.Nat. Struct. Mol. Biol.13787792. 10.1038/nsmb1140

  • 28

    JanowskiB. A.YoungerS. T.HardyD. B.RamR.HuffmanK. E.CoreyD. R. (2007). Activating gene expression in mammalian cells with promoter-targeted duplex RNAs.Nat. Chem. Biol.3166173. 10.1038/nchembio860

  • 29

    JiX.ZhouY.PanditS.HuangJ.LiH.LinC. Y.et al (2013). SR proteins collaborate with 7SK and promoter-associated nascent RNA to release paused polymerase.Cell153855868. 10.1016/j.cell.2013.04.028

  • 30

    JonkersI.KwakH.LisJ. T. (2014). Genome-wide dynamics of Pol II elongation and its interplay with promoter proximal pausing, chromatin, and exons.eLife3:e02407. 10.7554/eLife.02407

  • 31

    KaikkonenM. U.LamM. T.GlassC. K. (2011). Non-coding RNAs as regulators of gene expression and epigenetics.Cardiovasc. Res.90430440. 10.1093/cvr/cvr097

  • 32

    KapranovP.ChengJ.DikeS.NixD. A.DuttaguptaR.WillinghamA. T.et al (2007). RNA maps reveal new RNA classes and a possible function for pervasive transcription.Science31614841488. 10.1126/science.1138341

  • 33

    KapranovP.OzsolakF.KimS. W.FoissacS.LipsonD.HartC.et al (2010). New class of gene-termini-associated human RNAs suggests a novel RNA copying mechanism.Nature466642646. 10.1038/nature09190

  • 34

    KassabM. A.MudassirM.SinghA.MuthuramanN.BhagatM.PalanichamyJ. K.et al (2015). Gene silencing and activation of human papillomavirus 18 is modulated by sense promoter associated RNA in Bidirectionally transcribed long control region.PLoS One10:e0128416. 10.1371/journal.pone.0128416

  • 35

    KimD. H.VilleneuveL. M.MorrisK. V.RossiJ. J. (2006). Argonaute-1 directs siRNA-mediated transcriptional gene silencing in human cells.Nat. Struct. Mol. Biol.13793797. 10.1038/nsmb1142

  • 36

    LaineJ. P.SinghB. N.KrishnamurthyS.HampseyM. (2009). A physiological role for gene loops in yeast.Genes Dev.2326042609. 10.1101/gad.1823609

  • 37

    LengX. M.DiaoL. T.LiB.BiY. Z.ChenC. J.ZhouH.et al (2014). The ribosomal protein rpl26 promoter is required for its 3′ sense terminus ncRNA transcription in Schizosaccharomyces pombe, implicating a new transcriptional mechanism for ncRNAs.Biochem. Biophys. Res. Commun.4448691. 10.1016/j.bbrc.2014.01.018

  • 38

    LenhardB.SandelinA.CarninciP. (2012). Metazoan promoters: emerging characteristics and insights into transcriptional regulation.Nat. Rev. Genet.13233245. 10.1038/nrg3163

  • 39

    LiB.CareyM.WorkmanJ. L. (2007). The role of chromatin during transcription.Cell128707719. 10.1016/j.cell.2007.01.015

  • 40

    LiL. C.OkinoS. T.ZhaoH.PookotD.PlaceR. F.UrakamiS.et al (2006). Small dsRNAs induce transcriptional activation in human cells.Proc. Natl. Acad. Sci. U.S.A.1031733717342. 10.1073/pnas.0607015103

  • 41

    LiW.NotaniD.RosenfeldM. G. (2016). Enhancers as non-coding RNA transcription units: recent insights and future perspectives.Nat. Rev. Genet.17207223. 10.1038/nrg.2016.4

  • 42

    Lykke-AndersenS.MapendanoC. K.JensenT. H. (2011). An ending is a new beginning: transcription termination supports re-initiation.Cell Cycle10863865. 10.4161/cc.10.6.14931

  • 43

    MaX.HanN.ShaoC.MengY. (2017). Transcriptome-wide discovery of PASRs (promoter-associated small RNAs) and TASRs (terminus-associated small RNAs) in Arabidopsis thaliana.PLoS One12:e0169212. 10.1371/journal.pone.0169212

  • 44

    MengY.ShaoC. (2012). Large-scale identification of mirtrons in Arabidopsis and rice.PLoS One7:e31163. 10.1371/journal.pone.0031163

  • 45

    MengY.ShaoC.WangH.ChenM. (2011). The regulatory activities of plant microRNAs: a more dynamic perspective.Plant Physiol.15715831595. 10.1104/pp.111.187088

  • 46

    MetteM. F.AufsatzW.van der WindenJ.MatzkeM. A.MatzkeA. J. (2000). Transcriptional silencing and promoter methylation triggered by double-stranded RNA.EMBO J.1951945201. 10.1093/emboj/19.19.5194

  • 47

    MieleA.DekkerJ. (2008). Long-range chromosomal interactions and gene regulation.Mol. Biosyst.410461057. 10.1039/b803580f

  • 48

    MorrisK. V.SantosoS.TurnerA. M.PastoriC.HawkinsP. G. (2008). Bidirectional transcription directs both transcriptional gene activation and suppression in human cells.PLoS Genet.4:e1000258. 10.1371/journal.pgen.1000258

  • 49

    MuseG. W.GilchristD. A.NechaevS.ShahR.ParkerJ. S.GrissomS. F.et al (2007). RNA polymerase is poised for activation across the genome.Nat. Genet.3915071511. 10.1038/ng.2007.21

  • 50

    NapoliS.PastoriC.MagistriM.CarboneG. M.CatapanoC. V. (2009). Promoter-specific transcriptional interference and c-myc gene silencing by siRNAs in human cells.EMBO J.2817081719. 10.1038/emboj.2009.139

  • 51

    NechaevS.AdelmanK. (2011). Pol II waiting in the starting gates: regulating the transition from transcription initiation into productive elongation.Biochim. Biophys. Acta18093445. 10.1016/j.bbagrm.2010.11.001

  • 52

    NechaevS.FargoD. C.dos SantosG.LiuL.GaoY.AdelmanK. (2010). Global analysis of short RNAs reveals widespread promoter-proximal stalling and arrest of Pol II in Drosophila.Science327335338. 10.1126/science.1181421

  • 53

    NeilH.MalabatC.d’Aubenton-CarafaY.XuZ.SteinmetzL. M.JacquierA. (2009). Widespread bidirectional promoters are the major source of cryptic transcripts in yeast.Nature45710381042. 10.1038/nature07747

  • 54

    NtiniE.JarvelinA. I.BornholdtJ.ChenY.BoydM.JorgensenM.et al (2013). Polyadenylation site-induced decay of upstream transcripts enforces promoter directionality.Nat. Struct. Mol. Biol.20923928. 10.1038/nsmb.2640

  • 55

    O’SullivanJ. M.Tan-WongS. M.MorillonA.LeeB.ColesJ.MellorJ.et al (2004). Gene loops juxtapose promoters and terminators in yeast.Nat. Genet.3610141018. 10.1038/ng1411

  • 56

    ParkD.MorrisA. R.BattenhouseA.IyerV. R. (2014). Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements.Nucleic Acids Res.4237363749. 10.1093/nar/gkt1366

  • 57

    PickrellJ. K.MarioniJ. C.PaiA. A.DegnerJ. F.EngelhardtB. E.NkadoriE.et al (2010). Understanding mechanisms underlying human gene expression variation with RNA sequencing.Nature464768772. 10.1038/nature08872

  • 58

    PontingC. P.OliverP. L.ReikW. (2009). Evolution and functions of long noncoding RNAs.Cell136629641. 10.1016/j.cell.2009.02.006

  • 59

    PrekerP.AlmvigK.ChristensenM. S.ValenE.MapendanoC. K.SandelinA.et al (2011). PROMoter uPstream Transcripts share characteristics with mRNAs and are produced upstream of all three major types of mammalian promoters.Nucleic Acids Res.3971797193. 10.1093/nar/gkr370

  • 60

    PrekerP.NielsenJ.KammlerS.Lykke-AndersenS.ChristensenM. S.MapendanoC. K.et al (2008). RNA exosome depletion reveals transcription upstream of active human promoters.Science32218511854. 10.1126/science.1164096

  • 61

    RazinS. V.GavrilovA. A.PichuginA.LipinskiM.IarovaiaO. V.VassetzkyY. S. (2011). Transcription factories in the context of the nuclear and genome organization.Nucleic Acids Res.3990859092. 10.1093/nar/gkr683

  • 62

    SaundersA.CoreL. J.LisJ. T. (2006). Breaking barriers to transcription elongation.Nat. Rev. Mol. Cell Biol.7557567. 10.1038/nrm1981

  • 63

    SchwalbB.MichelM.ZacherB.FruhaufK.DemelC.TreschA.et al (2016). TT-seq maps the human transient transcriptome.Science35212251228. 10.1126/science.aad9841

  • 64

    SchwartzJ. C.YoungerS. T.NguyenN. B.HardyD. B.MoniaB. P.CoreyD. R.et al (2008). Antisense transcripts are targets for activating small RNAs.Nat. Struct. Mol. Biol.15842848. 10.1038/nsmb.1444

  • 65

    SeilaA. C.CalabreseJ. M.LevineS. S.YeoG. W.RahlP. B.FlynnR. A.et al (2008). Divergent transcription from active promoters.Science32218491851. 10.1126/science.1162253

  • 66

    SeilaA. C.CoreL. J.LisJ. T.SharpP. A. (2009). Divergent transcription: a new feature of active promoters.Cell Cycle825572564. 10.4161/cc.8.16.9305

  • 67

    ShaevitzJ. W.AbbondanzieriE. A.LandickR.BlockS. M. (2003). Backtracking by single RNA polymerase molecules observed at near-base-pair resolution.Nature426684687. 10.1038/nature02191

  • 68

    SigovaA. A.MullenA. C.MolinieB.GuptaS.OrlandoD. A.GuentherM. G.et al (2013). Divergent transcription of long noncoding RNA/mRNA gene pairs in embryonic stem cells.Proc. Natl. Acad. Sci. U.S.A.11028762881. 10.1073/pnas.1221904110

  • 69

    SwiezewskiS.LiuF.MagusinA.DeanC. (2009). Cold-induced silencing by long antisense transcripts of an Arabidopsis Polycomb target.Nature462799802. 10.1038/nature08618

  • 70

    TaftR. J.GlazovE. A.CloonanN.SimonsC.StephenS.FaulknerG. J.et al (2009a). Tiny RNAs associated with transcription start sites in animals.Nat. Genet.41572578. 10.1038/ng.312

  • 71

    TaftR. J.KaplanC. D.SimonsC.MattickJ. S. (2009b). Evolution, biogenesis and function of promoter-associated RNAs.Cell Cycle823322338. 10.4161/cc.8.15.9154

  • 72

    TaftR. J.HawkinsP. G.MattickJ. S.MorrisK. V. (2011). The relationship between transcription initiation RNAs and CCCTC-binding factor (CTCF) localization.Epigenetics Chromatin4:13. 10.1186/1756-8935-4-13

  • 73

    TaftR. J.SimonsC.NahkuriS.OeyH.KorbieD. J.MercerT. R.et al (2010). Nuclear-localized tiny RNAs are associated with transcription initiation and splice sites in metazoans.Nat. Struct. Mol. Biol.1710301034. 10.1038/nsmb.1841

  • 74

    Tan-WongS. M.FrenchJ. D.ProudfootN. J.BrownM. A. (2008). Dynamic interactions between the promoter and terminator regions of the mammalian BRCA1 gene.Proc. Natl. Acad. Sci. U.S.A.10551605165. 10.1073/pnas.0801048105

  • 75

    Tan-WongS. M.WijayatilakeH. D.ProudfootN. J. (2009). Gene loops function to maintain transcriptional memory through interaction with the nuclear pore complex.Genes Dev.2326102624. 10.1101/gad.1823209

  • 76

    ThomsonD. W.DingerM. E. (2016). Endogenous microRNA sponges: evidence and controversy.Nat. Rev. Genet.17272283. 10.1038/nrg.2016.20

  • 77

    TiwariV. K.McGarveyK. M.LicchesiJ. D.OhmJ. E.HermanJ. G.SchubelerD.et al (2008). PcG proteins, DNA methylation, and gene repression by chromatin looping.PLoS Biol.6:e306. 10.1371/journal.pbio.0060306

  • 78

    TufarelliC.StanleyJ. A.GarrickD.SharpeJ. A.AyyubH.WoodW. G.et al (2003). Transcription of antisense RNA leading to gene silencing and methylation as a novel cause of human genetic disease.Nat. Genet.34157165. 10.1038/ng1157

  • 79

    TurunenM. P.LehtolaT.HeinonenS. E.AssefaG. S.KorpisaloP.GirnaryR.et al (2009). Efficient regulation of VEGF expression by promoter-targeted lentiviral shRNAs based on epigenetic mechanism: a novel example of epigenetherapy.Circ. Res.105604609. 10.1161/CIRCRESAHA.109.200774

  • 80

    UesakaM.AgataK.OishiT.NakashimaK.ImamuraT. (2017). Evolutionary acquisition of promoter-associated non-coding RNA (pancRNA) repertoires diversifies species-dependent gene activation mechanisms in mammals.BMC Genomics18:285. 10.1186/s12864-017-3662-1

  • 81

    UesakaM.NishimuraO.GoY.NakashimaK.AgataK.ImamuraT. (2014). Bidirectional promoters are the major source of gene activation-associated non-coding RNAs in mammals.BMC Genomics15:35. 10.1186/1471-2164-15-35

  • 82

    ValenE.PrekerP.AndersenP. R.ZhaoX.ChenY.EnderC.et al (2011). Biogenic mechanisms and utilization of small RNAs derived from human protein-coding genes.Nat. Struct. Mol. Biol.1810751082. 10.1038/nsmb.2091

  • 83

    WangX.AraiS.SongX.ReichartD.DuK.PascualG.et al (2008). Induced ncRNAs allosterically modify RNA-binding proteins in cis to inhibit transcription.Nature454126130. 10.1038/nature06992

  • 84

    WangX.LaurieJ. D.LiuT.WentzJ.LiuX. S. (2011). Computational dissection of Arabidopsis smRNAome leads to discovery of novel microRNAs and short interfering RNAs associated with transcription start sites.Genomics97235243. 10.1016/j.ygeno.2011.01.006

  • 85

    WeiW.PelechanoV.JarvelinA. I.SteinmetzL. M. (2011). Functional consequences of bidirectional promoters.Trends Genet.27267276. 10.1016/j.tig.2011.04.002

  • 86

    WyersF.RougemailleM.BadisG.RousselleJ. C.DufourM. E.BoulayJ.et al (2005). Cryptic Pol II transcripts are degraded by a nuclear quality control pathway involving a new poly(A) polymerase.Cell121725737. 10.1016/j.cell.2005.04.030

  • 87

    XuZ.WeiW.GagneurJ.PerocchiF.Clauder-MunsterS.CamblongJ.et al (2009). Bidirectional promoters generate pervasive transcription in yeast.Nature45710331037. 10.1038/nature07728

  • 88

    YanB. X.MaJ. X. (2012). Promoter-associated RNAs and promoter-targeted RNAs.Cell. Mol. Life Sci.6928332842. 10.1007/s00018-012-0953-1

  • 89

    YoungerS. T.CoreyD. R. (2009). The puzzle of RNAs that target gene promoters.Chembiochem1011351139. 10.1002/cbic.200900015

  • 90

    YoungerS. T.CoreyD. R. (2011). Transcriptional regulation by miRNA mimics that target sequences downstream of gene termini.Mol. Biosyst.723832388. 10.1039/c1mb05090g

  • 91

    YuD.MaX.ZuoZ.ShaoW.WangH.MengY. (2017). Bioinformatics resources for deciphering the biogenesis and action pathways of plant small RNAs.Rice10:38. 10.1186/s12284-017-0177-y

  • 92

    YueX.SchwartzJ. C.ChuY.YoungerS. T.GagnonK. T.ElbashirS.et al (2010). Transcriptional regulation by small RNAs at sequences downstream from 3′ gene termini.Nat. Chem. Biol.6621629. 10.1038/nchembio.400

  • 93

    YusE.GuellM.VivancosA. P.ChenW. H.Lluch-SenarM.DelgadoJ.et al (2012). Transcription start site associated RNAs in bacteria.Mol. Syst. Biol.8:585. 10.1038/msb.2012.16

  • 94

    ZaramelaL. S.VencioR. Z.ten-CatenF.BaligaN. S.KoideT. (2014). Transcription start site associated RNAs (TSSaRNAs) are ubiquitous in all domains of life.PLoS One9:e107680. 10.1371/journal.pone.0107680

  • 95

    ZeitlingerJ.StarkA.KellisM.HongJ. W.NechaevS.AdelmanK.et al (2007). RNA polymerase stalling at developmental control genes in the Drosophila melanogaster embryo.Nat. Genet.3915121516. 10.1038/ng.2007.26

  • 96

    ZhengQ.RowleyM. J.BohmdorferG.SandhuD.GregoryB. D.WierzbickiA. T. (2013). RNA polymerase V targets transcriptional silencing components to promoters of protein-coding genes.Plant J.73179189. 10.1111/tpj.12034

Summary

Keywords

transcription boundary-associated RNAs (TBARs), promoter-associated RNAs (PARs), terminus-associated RNAs (TARs), divergent transcription, transcription start site (TSS), transcription termination site (TTS), polymerase (Pol) pausing, gene looping

Citation

Yu D, Ma X, Zuo Z, Wang H and Meng Y (2018) Classification of Transcription Boundary-Associated RNAs (TBARs) in Animals and Plants. Front. Genet. 9:168. doi: 10.3389/fgene.2018.00168

Received

29 January 2018

Accepted

26 April 2018

Published

14 May 2018

Volume

9 - 2018

Edited by

Florent Hubé, UMR7216 Epigénétique et Destin Cellulaire, France

Reviewed by

Francesca Aguilo, Umeå University, Sweden; Monica Ballarino, Sapienza Università di Roma, Italy

Updates

Copyright

*Correspondence: Yijun Meng,

These authors have contributed equally to this work.

This article was submitted to RNA, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics