MINI REVIEW article

Front. Genet., 30 November 2020

Sec. Genome Architecture and Epigenetic Memory

Volume 11 - 2020 | https://doi.org/10.3389/fgene.2020.589697

Evolution of Genome-Organizing Long Non-coding RNAs in Metazoans

  • 1. Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, México

  • 2. Unidad Irapuato, Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, México

Abstract

Long non-coding RNAs (lncRNAs) have important regulatory functions across eukarya. It is now clear that many of these functions are related to gene expression regulation through their capacity to recruit epigenetic modifiers and establish chromatin interactions. Several lncRNAs have been recently shown to participate in modulating chromatin within the spatial organization of the genome in the three-dimensional space of the nucleus. The identification of lncRNA candidates is challenging, as is their functional characterization. Conservation signatures of lncRNAs are different from those of protein-coding genes, making identifying lncRNAs under selection a difficult task, and the homology between lncRNAs may not be readily apparent. Here, we review the evidence for these higher-order genome organization functions of lncRNAs in animals and the evolutionary signatures they display.

Introduction

The three-dimensional (3D) organization of DNA in the cell nucleus has become a significant subject of study, particularly its influence on gene regulation. Recent advances in chromatin conformation capture (3C) techniques, computational, and modeling approaches have made its study feasible on a genome-wide scale, giving insight into the structure and the dynamics of chromatin folding in space and time. Nuclear 3D organization has multiple levels and varies between cell types and biological conditions. For instance, chromosomes are subdivided into topologically associating domains (TADs) within which chromatin loops bring together regulatory elements and target loci separated in the linear genome (Dixon et al., 2012). These chromatin interactions are crucial for precise gene expression regulation (reviewed in Furlong and Levine, 2018; Schoenfelder and Fraser, 2019; Ibrahim and Mundlos, 2020). Importantly, changes in transcriptional programs result in variation in chromatin interactions within TADs, while TAD boundaries delimiting these domains are preserved (Dixon et al., 2015). TADs segregate in the nuclear space into transcriptionally active (A) and inactive (B) compartments. A/B compartments correlate well with histone modifications characteristic of euchromatin and heterochromatin, respectively, and are described as cell type-specific, being able to undergo switches during cell differentiation and lineage commitment (Lieberman-Aiden et al., 2009; Rao et al., 2014; Dixon et al., 2015; Fortin and Hansen, 2015).

In addition to DNA and histones, RNA is a major component of the cell nucleus (Rinn and Chang, 2012). High-throughput sequencing methods have revealed the pervasive transcription of thousands of non-coding RNA (ncRNA) molecules in the genome. Among the latter, long non-coding RNAs (lncRNAs) have emerged as important gene regulators in eukaryotes. lncRNAs are broadly defined as transcripts longer than 200 nucleotides, with little to no protein-coding potential (Mercer et al., 2009; Wang and Chang, 2011; Derrien et al., 2012). lncRNAs are more lowly expressed (Hezroni et al., 2015), display more tissue-restricted expression patterns (Necsulea et al., 2014), have fewer exons, and are shorter than protein-coding genes (Hezroni et al., 2015). In animals, several lncRNAs are essential to phenomena such as gene silencing, activation, and chromatin remodeling, with significant roles in development, immunity, and cancer (Guttman et al., 2011; Schmitt and Chang, 2016; Delás et al., 2017). lncRNA functions may predate the origin of metazoans, as several unicellular holozoans possess lncRNAs that are distinct in terms of their histone marks as well as expression throughout their life cycle (Gaiti et al., 2017).

Signatures of Conservation in lncRNAs

There has been a long debate on whether most lncRNAs are functional or not (van Bakel et al., 2010; Clark et al., 2011; Lindsay et al., 2013). This discussion was, in part, sparked by the fact that the sequence of lncRNAs is generally poorly conserved across species, suggesting that they are not under purifying selection (Babak et al., 2005; Ponjavic et al., 2007; Marques and Ponting, 2009). There are several examples of orthologous RNAs that preserve their function, but whose sequence is so divergent, they can no longer be identified as orthologs by sequence similarity alone (Ponjavic et al., 2007; Ulitsky et al., 2011; Ulitsky, 2016). Thus, the detection of conservation beyond sequence is paramount to annotate candidate lncRNAs for further functional characterization.

The conservation signals in lncRNAs can differ from those typically found in protein-coding genes (Diederichs, 2014; Ulitsky, 2016). For instance, conventional conservation analyses applied to coding sequences, such as calculating the ratio of non-synonymous to synonymous substitution rates (dN/dS), are not suitable for these elements. Nevertheless, lncRNAs display some sequence conservation, generally in short sequence islands, potentially due to selective constraints on sequences necessary for interacting with other transcripts, proteins, or DNA (Kapusta and Feschotte, 2014; Quinn et al., 2016; Ulitsky, 2016). lncRNAs may also display constraints on the post-transcriptional processing of the transcript, leading to the conservation of splice sites across different species (Nitsche et al., 2015; Ulitsky, 2016). lncRNAs can also possess structural conservation – a constraint that may not be readily detectable at the sequence level (Smith et al., 2013; Tavares et al., 2019). Finally, lncRNAs can have positional conservation, and be expressed from syntenic loci despite having lost most or all sequence conservation. These modes of conservation are not mutually exclusive and may be present in a single lncRNA.

Beyond their apparent lack of conservation, many functionally characterized lncRNAs modulate the organization of higher-order chromatin structures in the nucleus (Saxena and Carninci, 2011; Marchese and Huarte, 2014). lncRNAs are involved in the formation of DNA loops and domains (Wang and Chang, 2011; Zhang et al., 2014), interchromosomal structures (Hacisuleyman et al., 2016), heterochromatic regions (Deng et al., 2009; Engreitz et al., 2013), subnuclear bodies (Mao et al., 2011), and the dynamic assembly of protein complexes (Tsai et al., 2010; Lin et al., 2014; Marín-Béjar et al., 2017). Several novel experimental methods allow the identification of lncRNAs binding to chromatin in vivo across the genome (Li et al., 2017; Sridhar et al., 2017; Bell et al., 2018; Bonetti et al., 2020; Gavrilov et al., 2020). Recruiting and binding to effector molecules is a prevalent mode of action of lncRNAs in both cis and trans activities.

Here, we summarize lncRNAs that affect, establish, or maintain three-dimensional chromatin organization in metazoans and the conservation signals that indicate they are under selection.

lncRNAs That Affect TAD Conformation and Their Conservation

Sequence Conservation

Sequence conservation in lncRNAs can range from very high to almost non-existent. Despite being generally presented as poorly conserved, a subset of lncRNAs can present significant sequence conservation across species (Necsulea et al., 2014; Hezroni et al., 2015). However, sequence conservation does not guarantee functional equivalence; a highly conserved lncRNA can be fundamental in one species while dispensable in others. For example, the lncRNA Metastasis Associated in Lung Adenocarcinoma Transcript 1 (MALAT1) is highly conserved from human to zebrafish (Figure 1A; Hutchinson et al., 2007; Lin et al., 2007). While the human MALAT1 functions in nuclear speckles, regulating alternative splicing (Hutchinson et al., 2007; Tripathi et al., 2010), cell-cycle associated genes (Yang et al., 2011), and cancer progression (Gutschner et al., 2013), the murine ortholog is neither essential for these functions nor mouse development (Eißmann et al., 2012; Nakagawa et al., 2012; Zhang et al., 2012).

However, it is more common for lncRNAs to have short conserved motifs or domains that are important for their association with DNA or proteins that regulate chromatin conformation. For example, lncRNAs that affect 3D genome topology and arise from highly conserved syntenic loci, such as the Hox clusters, display contrasting patterns of sequence conservation compared to their protein counterparts in the same cluster. Hox genes, organized in mammals in four clusters (HoxA to HoxD), encode transcription factors crucial for patterning along the anterior-posterior axis. Numerous ncRNAs are transcribed from the human HOX loci, and their expression relates to differential histone marks and transcriptional accessibility (Rinn et al., 2007).

The HOX antisense intergenic RNA (HOTAIR) lncRNA is transcribed from the boundary between domains with differential chromatin marks at the HOXC locus but acts in trans repressing transcription of coding and non-coding genes on the HOXD locus (Rinn et al., 2007). A chromatin loop established between the HOTAIR locus and the HOXC distal enhancer (HDE) located downstream of HOTAIR promotes transcription of the lncRNA. This loop is disrupted by the recruitment of hepatocyte nuclear factor 4-α (HNF4α), a master regulator of epithelial differentiation, to the HDE (Battistelli et al., 2019). HOTAIR exists across mammals, albeit poorly conserved in sequence; it is only highly conserved in primates (He et al., 2011). Notably, a highly conserved domain in exon 6, possibly the backbone of HOTAIR, appeared first in kangaroos, suggesting the ab initio generation of HOTAIR in marsupials (He et al., 2011). Despite its low sequence conservation across mammals, key secondary structural elements of HOTAIR contain protein-binding motifs and have significant conservation or covariation (He et al., 2011; Somarowthu et al., 2015). However, studies evaluating the functional conservation of murine HOTAIR (mHotair) present contradictory results. On the one hand, the deletion of the HoxC cluster, including mHotair, did not affect HoxD silencing in vivo (Schorderet and Duboule, 2011). In contrast, mice homozygous for mHotair KO presented homeotic spine transformation and malformation of metacarpal bones, and derived fibroblasts showed altered expression and levels of epigenetic marks at hundreds of genes, including HoxD genes (Li et al., 2013). Interestingly, human and mouse HOTAIR differ in number, arrangement, and degree of sequence conservation among their exons. The absence of exons with protein-binding motifs in mHotair may partially explain differences in their function.

Another lncRNA expressed from HOX clusters is HOXA transcript at the distal tip (HOTTIP), transcribed from the 5' end of the HOXA locus in mammals and conserved in avians (Wang et al., 2011). Chromosomal looping brings HOTTIP into spatial proximity to its target genes in cis, allowing HOTTIP to activate transcription by binding the WD repeat domain 5/mixed lineage leukemia (WDR5/MLL) complex, driving H3K4me3 (Wang et al., 2011). HOTTIP and its association with CCCTC-binding factor (CTCF), which delineates active and inactive TADs within the HOXA cluster, also influence the expression of HoxA genes (Narendra et al., 2015; Wang et al., 2018).

Long non-coding RNAs also enable the establishment of inter-chromosomal structures. The Functional intergenic repeating RNA element (Firre) is a lncRNA involved in pluripotency, hematopoiesis, and adipogenesis (Hacisuleyman et al., 2014; Lewandowski et al., 2019). Firre accumulates across a ~5 Mb domain around its transcription site on the X chromosome (Hacisuleyman et al., 2014), located between two TADs, and highly enriched in CTCF binding sites, required for Firre transcription (Barutcu et al., 2018). This domain colocalizes with five regions on different chromosomes that contain genes with roles in adipogenesis. The formation of this structure depends on the interaction of Firre with Heterogeneous Nuclear Ribonucleoprotein U (HNRNPU), through a 156-bp repeating RNA domain (RRD; Hacisuleyman et al., 2014). This RRD is unique to Firre, and functions as a lineage-specific nuclear retention signal in mice and humans. The RRD and other local repeats (LRs) are conserved to different extents across Firre orthologs in mammals. Firre is also required for the super-loop formation of the inactive X chromosome (Xi), H3K27me3 deposition, and the localization of the Xi to the perinuclear region (Yang et al., 2015; Barutcu et al., 2018).

The 3D architecture of TADs enables a group of multi-exonic lncRNAs, termed immune gene-priming lncRNAs (IPLs), to direct the active priming of the promoters of immune genes, necessary for a rapid and robust pro-inflammatory response as part of trained immunity (Fanucchi et al., 2019). Upon induction of transcription of immune genes by the tumor necrosis factor (TNF), chromatin contacts increase between TNF-induced genes and the lncRNA loci. IPLs are somewhat conserved between mouse and human; the majority possess an Alu element in their first intron and share putative transcription-factor binding motifs at their promoters.

The region comprising an IPL, Upstream master lncRNA of the inflammatory chemokine locus (UMLILO), engages in chromosomal contacts with CXCL chemokine genes belonging to the same TAD, but UMLILO does not have enhancer-RNA-like characteristics. In contrast to other IPLs, UMLILO is not conserved in mice and only partially conserved in pigs, suggesting that IPLs are not essential across species, but have a complementary role in ensuring robust gene expression. UMLILO has short conserved sequence motifs and interacts with WDR5 through its conserved exon 3, directing WDR5/MLL1 to chemokine gene promoters, mediating H3K4me3. Transcription of chemokines in UMLILO knockdown cells was restored by insertion of another WDR5-binding lncRNA, HOTTIP, under the control of the UMLILO promoter (Fanucchi et al., 2019). The ability of HOTTIP to rescue the loss of UMLILO is an example of convergent functional evolution, as they share minimal sequence similarity.

Another group of chromatin-modifying lncRNAs arises from the syntenic estrogen receptor 1 (ESR1) locus. ESR1 is strongly upregulated in cancerous cells undergoing estrogen deprivation. A cluster of ncRNAs, ESR1 locus enhancing and activating non-coding RNAs (Eleanors), are transcribed from introns in a large chromatin cluster within a TAD that contains the ESR1 locus (Tomita et al., 2015). These Eleanors form a chromatin-associated RNA cloud that delineates the TAD and cis-activate transcription. This TAD interacts with another active TAD that contains the apoptotic transcription factor forkhead Box O3 (FOXO3; Abdalla et al., 2019). Knockdown of a promoter-associated Eleanor, pa-Eleanor(S), induced repression of the rest of the Eleanors and the genes within the TAD, including ESR1 (Abdalla et al., 2019). The abundant and highly conserved Eleanor2 increases chromatin accessibility in the ESR1 upstream region by destabilizing nucleosomes, activating ESR1, and is required for the formation of the RNA cloud (Fujita et al., 2020).

Positional Conservation

Long non-coding RNAs may be expressed from syntenic loci, suggesting a common origin, but may have lost the majority of sequence conservation (Figure 1B). The functions of these lncRNAs are thought to rely primarily on their transcription (Diederichs, 2014; Ulitsky, 2016). Thus, the evolutionary signature would be expected to reside outside the transcribed region (Ulitsky, 2016). Indeed, many lncRNAs have a very conserved promoter but little to no conservation in their transcribed region (Guttman et al., 2009). A substantial difficulty in this classification is defining when sequence conservation is entirely lost. As outlined above, several lncRNAs only retain small patches of conservation considered negligible by some authors and meaningful by others.
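
As a rough illustration of how positional conservation might be screened for, the sketch below flags lncRNA pairs from two species whose flanking protein-coding genes are orthologous. The gene and lncRNA names, the `orthologs` mapping, and the function `syntenic_pairs` are hypothetical placeholders, not data or methods from the studies cited above.

```python
# Toy data: each lncRNA is described by its (upstream, downstream)
# protein-coding neighbours. All identifiers are invented for illustration.
species_a = {"lncA1": ("GENE1", "GENE2"), "lncA2": ("GENE5", "GENE6")}
species_b = {"lncB1": ("gene1", "gene2"), "lncB2": ("gene8", "gene9")}

# Hypothetical one-to-one orthology map from species A genes to species B genes.
orthologs = {"GENE1": "gene1", "GENE2": "gene2", "GENE5": "gene5", "GENE6": "gene6"}


def syntenic_pairs(loci_a, loci_b, ortho):
    """Return (lncRNA_a, lncRNA_b) pairs whose two flanking genes are orthologous.

    Neighbour order is ignored, so a locus inverted in one species
    still counts as syntenic.
    """
    pairs = []
    for name_a, flanks_a in loci_a.items():
        # Translate species A neighbours into species B gene names.
        mapped = {ortho.get(gene) for gene in flanks_a}
        for name_b, flanks_b in loci_b.items():
            if mapped == set(flanks_b):
                pairs.append((name_a, name_b))
    return pairs


print(syntenic_pairs(species_a, species_b, orthologs))
```

Candidates flagged this way share genomic context only; whether they retain any sequence or functional similarity would still need to be assessed separately.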

Figure 1

Examples of this conundrum are the roX dosage compensation lncRNAs in Drosophila melanogaster (Figure 1B). Detailed syntenic analysis of drosophilid genomes revealed 47 new roX orthologs, where only 19 had been identified by sequence similarity (Quinn et al., 2016). Importantly, it was shown that the roX RNA itself, and not only its transcription, is necessary for dosage compensation (Quinn et al., 2016). Furthermore, a roX RNA ortholog rescues the loss of roX between two distantly related species (D. melanogaster and Drosophila busckii) despite almost no sequence conservation outside an eight-nucleotide conserved patch of microhomology (Quinn et al., 2016).

A more traditional example of positional conservation is the lncRNA antisense to Igf2r RNA non-coding (Airn), required for paternal-specific silencing of imprinted genes in the insulin-like growth factor 2 receptor (Igf2r) cluster (Sleutels et al., 2002). The function of Airn is conserved between human and mouse despite them sharing little conserved sequence (Yotova et al., 2008). The Igf2r silencing function of Airn was shown to be dependent on transcriptional overlap and not on the transcribed RNAs themselves (Latos et al., 2012). However, recent evidence shows that this is only the case for nearby imprinted genes, as the murine Airn lncRNA itself is necessary for the recruitment of chromatin-modifying complexes to distant non-overlapping genes in the cluster (Andergassen et al., 2019).

Structural Conservation

Structural conservation is potentially the most telling signal of conservation in lncRNAs, yet the most difficult to identify. The basic premise is that structural domains may be preserved despite changes in the sequence, as long as complementary base pairs are maintained.
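
This premise can be made concrete with a small sketch. Assuming a toy alignment and a single proposed base pair between two alignment columns (both invented here), the function below counts how many sequences keep a Watson-Crick or G-U pair and how many distinct pair types occur. Several distinct pair types at a consistently paired position is the kind of compensatory change that suggests covariation. This is a simplified illustration, not the statistical test applied by dedicated tools.

```python
# Canonical Watson-Crick pairs plus the G-U wobble pair.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def pair_support(alignment, i, j):
    """Summarise base pairing between columns i and j (0-based) of an alignment.

    Returns (number of sequences with a valid pair, number of distinct pair types).
    """
    observed = [(seq[i], seq[j]) for seq in alignment]
    valid = [pair for pair in observed if pair in PAIRS]
    return len(valid), len(set(valid))


# Hypothetical aligned RNA fragments; columns 0 and 6 are the proposed pair.
toy_alignment = [
    "GAAAUAC",
    "CAAAUAG",
    "AAAAUAU",
    "GAAAUAU",
]

n_paired, n_types = pair_support(toy_alignment, 0, 6)
print(n_paired, n_types)
```

In this toy case every sequence pairs at the two columns even though the nucleotides differ, whereas the invariant columns in between carry no covariation information at all.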

The non-coding isoform of the steroid receptor RNA activator (SRA), ncSRA, has a four-domain secondary structure with varying levels of sequence conservation (Figure 1C). ncSRA functions as a coactivator of several human hormone receptors by modifying chromatin structure (Novikova et al., 2012). ncSRA associates with CTCF and the DEAD-BOX helicase 5 (DDX5), and this association is necessary for the insulator activity of CTCF in vivo (Yao et al., 2010). The functional RNA structure is conserved in all mammals, while its sequence is not. Furthermore, several of the varying positions in other species show changes predicted to help stabilize its structural elements (Novikova et al., 2012).

Dosage compensation lncRNAs (see next section) show patches of structural conservation of biological importance. The Repeat A (RepA) region of X-inactive specific transcript (Xist), essential to the establishment of X chromosome inactivation, interacts with proteins such as the polycomb repressive complex 2 (PRC2; Zhao et al., 2008), ATRX chromatin remodeler (Sarma et al., 2014), and SHARP repressor protein (McHugh et al., 2015). RepA was experimentally shown to have a complex structure that is preserved despite rapid changes across mammalian evolution, strongly suggesting that this structure is indispensable for Xist function (Liu et al., 2017). lncRNAs involved in dosage compensation in drosophilids, roX1 and roX2, have conserved boxes that correspond precisely with stems that are necessary for binding to the male-specific lethal (MSL) proteins. Domains outside these interaction zones are not conserved and lack structure (Ilik et al., 2013; Quinn et al., 2016).

HOTAIR has also been shown to have a complex secondary structure, with some evidence of conservation in mammals acquired from computational methods (Somarowthu et al., 2015). However, there is some debate as to whether there is enough evidence to suggest that HOTAIR’s structure is conserved in mammals (Rivas et al., 2017). Similarly, secondary-structure predictions on Firre indicated that the RRD is a highly structured domain (Nakagawa and Hirano, 2014), consistent with LRs representing potential binding platforms for the specific targeting of proteins to specific genomic regions by lncRNAs.

Functional Convergence: The Case of Dosage Compensation lncRNAs

The lncRNAs involved in the process of dosage compensation are extraordinary examples of de novo emergence of novel lncRNAs of unrelated evolutionary origins (Figure 1D). A prominent example is the Xist lncRNA, required for dosage compensation in the sex-chromosomes of eutherians (Penny et al., 1996). Random X-chromosome inactivation in females is necessary to balance the transcriptional output to that of males. Xist localizes at the X inactivation center (XIC) and is expressed exclusively from the inactivated X (Xi; Brown et al., 1991). During the onset of X inactivation, Xist accumulates at the XIC (Clemson et al., 1996), and then targets gene-rich regions that are spatially close to its transcription site (Engreitz et al., 2013; Simon et al., 2013), incorporating them into the Xist silencing domain and spreading further to cover the complete future Xi (Engreitz et al., 2013). Xist-mediated inactivation involves the transcriptional silencing of most genes on the Xi, and its compaction and recruitment to the nuclear lamina (Zhao et al., 2008; Hasegawa et al., 2010; Chu et al., 2015; McHugh et al., 2015; Minajigi et al., 2015).

While exonic sequences of Xist are well-conserved among eutherians, there are differences in the exon-intron structure, length, and sequence between species (Nesterova et al., 2001; Elisaphenko et al., 2008). This indicates that either Xist genes present a high adaptation level or that their sequence and structure are not essential (Elisaphenko et al., 2008). Xist is not present in non-eutherian vertebrates, including marsupials, despite common epigenetic features on the Xi, such as loss of active histone marks and exclusion of RNA polymerase II (Chaumeil et al., 2011). Homology of Xist with promoters and exonic sequences of the protein-coding gene ligand of numb-protein x 3 (Lnx3) found in marsupials, chicken, and fish suggests that Xist emerged through pseudogenization of Lnx3, possibly by the insertion of tandem repeats from transposable elements (Duret et al., 2006; Elisaphenko et al., 2008).

Interestingly, in marsupials, X-chromosome inactivation is imprinted, tissue-specific, and somewhat incomplete compared to eutherians, and thought to be achieved by female-specific expression of the lncRNA RNA on the silent X (Rsx), which is transcribed from and coats the paternal chromosome (Grant et al., 2012). The independent evolution of Xist and Rsx adds to the notion of dosage systems rapidly evolving from ancient silencing mechanisms common to all eukaryotes through the use of lncRNAs (Gendrel and Heard, 2014; Graves, 2016). The discoveries on the regulation of Xist by non-coding elements located at its own and the neighboring TAD and the impact of this 3D conformation on the regulatory landscape adds another layer of complexity to the mechanisms for dosage compensation (van Bemmel et al., 2019; Galupa et al., 2020).

lncRNAs are also the effectors of dosage compensation in drosophilids, but they differ in both origin and mechanism from those in mammals. Here, the roX1 and roX2 lncRNAs mediate the upregulation of genes on the single male X chromosome to match the expression of the two X chromosomes in females. roX1 and roX2 associate with the MSL proteins, forming the MSL complex that localizes to numerous specific sites along the male X (Franke and Baker, 1999), mediating histone acetylation and increasing transcription. The MSL complex does not alter the global architecture of the X chromosome, but it does spread via spatial proximity from high-affinity sites – enriched at TAD boundaries – to other regions (Ramírez et al., 2015). Contrary to Xist, whose activity is limited to the chromosome from which it is expressed (Wutz and Jaenisch, 2000), roX transgenes target the X chromosome in trans and rescue roX1 and roX2 mutant males (Meller and Rattner, 2002).

The independent origin of Xist in eutherians, Rsx in marsupials, and roX1 and roX2 in flies suggests that lncRNAs may be one of the fastest mechanisms to evolve novel epigenetic controls. As these lncRNAs participate in dosage compensation but have emerged independently in several lineages, they are extraordinarily difficult to identify as functionally convergent. Additional examples of functionally equivalent lncRNAs with no evolutionary relationship have likely gone undetected.

Discussion

lncRNAs have emerged as an additional layer of complexity involved in shaping the three-dimensional organization of the genome by interacting with and modifying the structure of chromatin. Several lncRNAs affect chromatin conformation and display a combination of conservation signals that may be difficult to identify solely by looking at traditional genomic conservation metrics (summarized in Table 1). These signatures could prove useful to identify and prioritize lncRNA candidates for experimental functional characterization. Sequence conservation can be identified using traditional computational sequence comparison methods. Recent examples have shown that conserved sequence stretches can be much shorter in lncRNAs than in protein-coding sequences, highlighting the need to look for tiny stretches of sequence conservation (microhomology; Quinn et al., 2016). Positional conservation of lncRNAs can be identified using multiple genome alignments complemented with transcriptomic data that support the existence of non-coding transcripts in multiple taxa. The detection of splice site conservation uses a similar approach but focuses on identifying splice sites via modeling or direct RNA-seq evidence, followed by comparison across taxa (Nitsche et al., 2015). In the case of structural conservation, covariation signatures in multiple sequence alignments may indicate the conservation of a structure (Nawrocki et al., 2009; Gruber et al., 2010; Will et al., 2012). One of the most significant limitations is the difficulty of distinguishing covariation from sequence conservation. Thus, these methods can better identify conserved structures in highly varying sequences in diverse and multiple taxa (Rivas et al., 2017, 2020).
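
As one illustration of the search for microhomology mentioned above, the sketch below lists every k-mer shared by two sequences. The sequences, the default k = 8 (chosen only to echo the eight-nucleotide roX patch), and the function name `shared_kmers` are assumptions made for illustration. A real analysis would also need to control for matches expected by chance.

```python
def shared_kmers(seq_a, seq_b, k=8):
    """Return the sorted k-mers that occur in both sequences."""
    kmers_a = {seq_a[i:i + k] for i in range(len(seq_a) - k + 1)}
    kmers_b = {seq_b[i:i + k] for i in range(len(seq_b) - k + 1)}
    return sorted(kmers_a & kmers_b)


# Invented sequences that share a single 8-nt patch (GUUAUACG)
# embedded in otherwise unrelated flanking sequence.
seq_1 = "ACCGCGUUAUACGCCUA"
seq_2 = "UUGGCAGUUAUACGAAC"

print(shared_kmers(seq_1, seq_2))
```

An exact-match k-mer scan is the simplest possible choice; it misses patches that tolerate a substitution, which is why alignment-based or motif-based searches are usually preferred in practice.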

Table 1

lncRNA | Function | Mode of action | Interacting proteins | Association with chromatin topology | Conservation | References
Xist | X chromosome inactivation in mammals | In cis | PRC1, PRC2, HNRNPU, RBM15, SHARP, WTAP, HNRNPK, LBR, and many others | The organization of the XIC into two topologically associating domains (TADs) ensures the proper interaction of Xist and its antisense lncRNA Tsix with regulatory elements. During X inactivation, Xist spreads along the chromosome exploiting the three-dimensional (3D) organization, resulting in compaction and recruitment to the nuclear lamina | Present only in eutherian mammals. Presence of common core exonic sequences, despite species-specific unique sequences, and variation in length and gene structure | Nesterova et al., 2001; Plath et al., 2003; Elisaphenko et al., 2008; Zhao et al., 2008; Hasegawa et al., 2010; Engreitz et al., 2013; Chu et al., 2015; McHugh et al., 2015; Minajigi et al., 2015; Moindrot et al., 2015; Chen et al., 2016; Pintacuda et al., 2017; van Bemmel et al., 2019; Galupa et al., 2020
HOTTIP | Gene control of HOXA locus for distal identity | In cis | WDR5/MLL and CTCF | A chromatin loop gets HOTTIP into spatial proximity to HOXA genes. Associates with CTCF to define functional TADs at HOXA cluster | Portions conserved in mammals and avians | Wang et al., 2011, 2018
Airn | Its transcription prevents overexpression of Igf2r locus in a paternal-specific manner | In cis | EHMT2 | Forms an RNA cloud, creating a repressive domain | Tandem direct repeats at the CpG island at 5' end are conserved in human and mouse at an organizational level but not by sequence | Lyle et al., 2000; Seidl et al., 2006; Nagano et al., 2008; Latos et al., 2009, 2012; Koerner et al., 2012; Santoro et al., 2013
Kcnq1ot1 | Silencing at imprinted Kcnq1 locus in a paternal-specific manner | In cis | EHMT2, PRC2, PRC1, and DNMT1 | Formation of repressive chromatin loop on the imprinting control region of the locus | Well-conserved motifs between human and mouse | Pandey et al., 2004, 2008; Mancini-Dinardo et al., 2006; Mohammad et al., 2008, 2010; Zhang et al., 2014
pRNA | Mediates silencing by CpG methylation of rRNA genes at nucleolus via DNA:RNA triplex formation | In cis | NoRC and DNMT3b | Establishment of nucleolar heterochromatin | Conserved across eutherians, various levels of sequence conservation, and highly conserved secondary-structure motifs | Mayer et al., 2006, 2008; Santoro et al., 2010; Schmitz et al., 2010; Guetg et al., 2012; Jacob et al., 2013; Savić et al., 2014; Wehner et al., 2014
LUNAR1 | IGF1 signaling, promotes cell proliferation in cancers | In cis | None reported | A chromatin loop that brings into contact the promoter of LUNAR1 and the enhancer of IGF1R is necessary for the expression of both genes, which reside in the same TAD | Not reported | Trimarchi et al., 2014; Peng and Feng, 2016
Khps1 | Activates transcription of its sense proto-oncogene SPHK1, via DNA:RNA triplex formation at SPHK1 enhancer | In cis | EP300 | Khps1 transcription leads to a transcriptionally active open chromatin state by recruitment of EP300/CBP, transcription of KHPS1 enhancer and eviction of CTCF | Conservation between humans and rodents | Imamura et al., 2004; Postepska-Igielska et al., 2015; Blank-Giwojna et al., 2019
UMLILO | Trained immune response on chemokine genes | In cis | WDR5/MLL | In chemokine TAD, chromosomal looping brings the super-enhancer region harboring UMLILO into contact with chemokine genes, allowing UMLILO RNA to guide WDR5/MLL to the promoters to facilitate H3K4me3 epigenetic priming | Partial conservation between human, chimpanzee, and pig, absent in mouse | Fanucchi et al., 2019
Eleanors | Activation of ESR1 locus, apoptosis resistance | In cis | None reported | Eleanors RNA cloud delineates ESR1 TAD and activates transcription | Varying levels of conservation for each Eleanor | Tomita et al., 2015; Abdalla et al., 2019; Fujita et al., 2020
HOTAIR | Represses expression in HOXD locus and other genes, including imprinted | In trans | PRC2, RCOR1, and AR | HOTAIR transcripts demarcate silent and active domains in HOXD locus | Poorly conserved by sequence, secondary structure motifs conserved between mouse and human | Rinn et al., 2007; Gupta et al., 2010; Tsai et al., 2010; Li et al., 2013; Somarowthu et al., 2015; Zhang et al., 2015; Portoso et al., 2017
ncSRA | Activation of steroid receptors (isoforms of SRA code for protein) | In trans | SRC-1, PRC2, TrxG, NANOG, CTCF, SHARP, DDX5, and others | Diverse, chromatin looping and modification as scaffold for proteins in both active and inactive domains | Significant sequence conservation and high structural conservation | Lanz et al., 1999; Shi et al., 2001; Zhao et al., 2007; Yao et al., 2010; Novikova et al., 2012; Wongtrakoongate et al., 2015
roX1 and roX2 | Dosage compensation in Drosophila | In trans | MSL proteins | The MSL complex (roX + MSL proteins) has high affinity sites on TAD borders | There are roX orthologs across drosophilids | Franke and Baker, 1999; Park et al., 2008; Ilik et al., 2013; Maenner et al., 2013; Ramírez et al., 2015; Quinn et al., 2016
IPW | Repression of maternally expressed genes. Possibly implicated in Prader-Willi syndrome | In trans | EHMT2 | Allele-specific formation of heterochromatin at DLK1–DIO3 region | Poorly conserved by sequence between human and mouse | Wevrick et al., 1994; Wevrick and Francke, 1997; Stelzer et al., 2014
Firre | Role in adipogenesis, nuclear architecture, inflammatory response (in vitro), and hematopoiesis (in vivo) | In trans, but occupies domain in cis | HNRNPU | Firre acts as a scaffold for the formation of an inter-chromosomal structure. Locates at border of TAD in a CTCF binding region. Required for super loop formation of inactive X | Conservation across mammals, high convergence of repeating domain in primates. Local Repeats are conserved between species of the same order | Hacisuleyman et al., 2014, 2016; Yang et al., 2015; Lu et al., 2017; Barutcu et al., 2018; Lewandowski et al., 2019
TERRA | Implicated in telomeric and subtelomeric heterochromatin formation, stability and maintenance | In cis (to telomeres) and in trans | Shelterin components (TERF1 and TERF2), ORC1, CBX5, NoRC, ATRX, POT1, and others | TERRA transcription depends on chromosome looping | Telomere transcription is conserved across vertebrates and Saccharomyces cerevisiae | Azzalin et al., 2007; Luke et al., 2008; Schoeftner and Blasco, 2008; Deng et al., 2009; Postepska-Igielska et al., 2013; Beishline et al., 2017

Characterized long non-coding RNAs (lncRNAs) that are involved in nuclear genome topology.

When studying novel lncRNAs, their unique conservation signatures, although more difficult to detect than those of protein-coding genes, are an excellent means of identifying potentially functional candidates and offer a first insight into their possible mechanisms of action. They can also help guide the search for homologous mechanisms in other species. Complementing in silico studies with experimental approaches in the context of spatiotemporal gene expression programs is crucial to further assess how these ncRNAs modulate genome architecture, including their specific contribution to the complexity and evolution of animal gene regulation.

Statements

Author contributions

All authors participated in writing and reviewing the manuscript and approved the final version for publication.

Funding

AR-C was funded by the Consejo Nacional de Ciencia y Tecnología (CONACYT) M.Sc. fellowship. KO and SF-V were funded by the Newton Advanced Fellowship (No. NAF\R1\180303) awarded to SF-V.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    Abdalla, M. O. A., Yamamoto, T., Maehara, K., Nogami, J., Ohkawa, Y., Miura, H., et al. (2019). The Eleanor ncRNAs activate the topological domain of the ESR1 locus to balance against apoptosis. Nat. Commun. 10:3778. doi: 10.1038/s41467-019-11378-4

  • 2

    Andergassen, D., Muckenhuber, M., Bammer, P. C., Kulinski, T. M., Theussl, H.-C., Shimizu, T., et al. (2019). The Airn lncRNA does not require any DNA elements within its locus to silence distant imprinted genes. PLoS Genet. 15:e1008268. doi: 10.1371/journal.pgen.1008268

  • 3

    Azzalin, C. M., Reichenbach, P., Khoriauli, L., Giulotto, E., and Lingner, J. (2007). Telomeric repeat containing RNA and RNA surveillance factors at mammalian chromosome ends. Science 318, 798–801. doi: 10.1126/science.1147182

  • 4

    Babak, T., Blencowe, B. J., and Hughes, T. R. (2005). A systematic search for new mammalian noncoding RNAs indicates little conserved intergenic transcription. BMC Genom. 6:104. doi: 10.1186/1471-2164-6-104

  • 5

    Barutcu, A. R., Maass, P. G., Lewandowski, J. P., Weiner, C. L., and Rinn, J. L. (2018). A TAD boundary is preserved upon deletion of the CTCF-rich Firre locus. Nat. Commun. 9:1444. doi: 10.1038/s41467-018-03614-0

  • 6

    Battistelli, C., Sabarese, G., Santangelo, L., Montaldo, C., Gonzalez, F. J., Tripodi, M., et al. (2019). The lncRNA HOTAIR transcription is controlled by HNF4α-induced chromatin topology modulation. Cell Death Differ. 26, 890–901. doi: 10.1038/s41418-018-0170-z

  • 7

    Beishline, K., Vladimirova, O., Tutton, S., Wang, Z., Deng, Z., and Lieberman, P. M. (2017). CTCF driven TERRA transcription facilitates completion of telomere DNA replication. Nat. Commun. 8:2114. doi: 10.1038/s41467-017-02212-w

  • 8

    Bell, J. C., Jukam, D., Teran, N. A., Risca, V. I., Smith, O. K., Johnson, W. L., et al. (2018). Chromatin-associated RNA sequencing (ChAR-seq) maps genome-wide RNA-to-DNA contacts. eLife 7:e27024. doi: 10.7554/eLife.27024

  • 9

    Blank-Giwojna, A., Postepska-Igielska, A., and Grummt, I. (2019). lncRNA KHPS1 activates a poised enhancer by triplex-dependent recruitment of epigenomic regulators. Cell Rep. 26, 2904.e4–2915.e4. doi: 10.1016/j.celrep.2019.02.059

  • 10

    Bonetti, A., Agostini, F., Suzuki, A. M., Hashimoto, K., Pascarella, G., Gimenez, J., et al. (2020). RADICL-seq identifies general and cell type–specific principles of genome-wide RNA-chromatin interactions. Nat. Commun. 11:1018. doi: 10.1038/s41467-020-14337-6

  • 11

    Brown, C. J., Ballabio, A., Rupert, J. L., Lafreniere, R. G., Grompe, M., Tonlorenzi, R., et al. (1991). A gene from the region of the human X inactivation centre is expressed exclusively from the inactive X chromosome. Nature 349, 38–44. doi: 10.1038/349038a0

  • 12

    Chaumeil, J., Waters, P. D., Koina, E., Gilbert, C., Robinson, T. J., and Graves, J. A. M. (2011). Evolution from XIST-independent to XIST-controlled X-chromosome inactivation: epigenetic modifications in distantly related mammals. PLoS One 6:e19040. doi: 10.1371/journal.pone.0019040

  • 13

    Chen, C.-K., Blanco, M., Jackson, C., Aznauryan, E., Ollikainen, N., Surka, C., et al. (2016). Xist recruits the X chromosome to the nuclear lamina to enable chromosome-wide silencing. Science 354, 468–472. doi: 10.1126/science.aae0047

  • 14

    Chu, C., Zhang, Q. C., da Rocha, S. T., Flynn, R. A., Bharadwaj, M., Calabrese, J. M., et al. (2015). Systematic discovery of Xist RNA binding proteins. Cell 161, 404–416. doi: 10.1016/j.cell.2015.03.025

  • 15

    Clark, M. B., Amaral, P. P., Schlesinger, F. J., Dinger, M. E., Taft, R. J., Rinn, J. L., et al. (2011). The reality of pervasive transcription. PLoS Biol. 9:e1000625. doi: 10.1371/journal.pbio.1000625

  • 16

    Clemson, C. M., McNeil, J. A., Willard, H. F., and Lawrence, J. B. (1996). XIST RNA paints the inactive X chromosome at interphase: evidence for a novel RNA involved in nuclear/chromosome structure. J. Cell Biol. 132, 259–275. doi: 10.1083/jcb.132.3.259

  • 17

    Delás, M. J., and Hannon, G. J. (2017). lncRNAs in development and disease: from functions to mechanisms. Open Biol. 7:170121. doi: 10.1098/rsob.170121

  • 18

    Deng, Z., Norseen, J., Wiedmer, A., Riethman, H., and Lieberman, P. M. (2009). TERRA RNA binding to TRF2 facilitates heterochromatin formation and ORC recruitment at telomeres. Mol. Cell 35, 403–413. doi: 10.1016/j.molcel.2009.06.025

  • 19

    Derrien, T., Johnson, R., Bussotti, G., Tanzer, A., Djebali, S., Tilgner, H., et al. (2012). The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res. 22, 1775–1789. doi: 10.1101/gr.132159.111

  • 20

    Diederichs, S. (2014). The four dimensions of noncoding RNA conservation. Trends Genet. 30, 121–123. doi: 10.1016/j.tig.2014.01.004

  • 21

    Dixon, J. R., Jung, I., Selvaraj, S., Shen, Y., Antosiewicz-Bourget, J. E., Lee, A. Y., et al. (2015). Chromatin architecture reorganization during stem cell differentiation. Nature 518, 331–336. doi: 10.1038/nature14222

  • 22

    Dixon, J. R., Selvaraj, S., Yue, F., Kim, A., Li, Y., Shen, Y., et al. (2012). Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380. doi: 10.1038/nature11082

  • 23

    Duret, L., Chureau, C., Samain, S., Weissenbach, J., and Avner, P. (2006). The Xist RNA gene evolved in eutherians by pseudogenization of a protein-coding gene. Science 312, 1653–1655. doi: 10.1126/science.1126316

  • 24

    Eißmann, M., Gutschner, T., Hämmerle, M., Günther, S., Caudron-Herger, M., Groß, M., et al. (2012). Loss of the abundant nuclear non-coding RNA MALAT1 is compatible with life and development. RNA Biol. 9, 1076–1087. doi: 10.4161/rna.21089

  • 25

    Elisaphenko, E. A., Kolesnikov, N. N., Shevchenko, A. I., Rogozin, I. B., Nesterova, T. B., Brockdorff, N., et al. (2008). A dual origin of the Xist gene from a protein-coding gene and a set of transposable elements. PLoS One 3:e2521. doi: 10.1371/journal.pone.0002521

  • 26

    Engreitz, J. M., Pandya-Jones, A., McDonel, P., Shishkin, A., Sirokman, K., Surka, C., et al. (2013). The Xist lncRNA exploits three-dimensional genome architecture to spread across the X chromosome. Science 341:1237973. doi: 10.1126/science.1237973

  • 27

    Fanucchi, S., Fok, E. T., Dalla, E., Shibayama, Y., Börner, K., Chang, E. Y., et al. (2019). Immune genes are primed for robust transcription by proximal long noncoding RNAs located in nuclear compartments. Nat. Genet. 51, 138–150. doi: 10.1038/s41588-018-0298-2

  • 28

    Fortin, J.-P., and Hansen, K. D. (2015). Reconstructing A/B compartments as revealed by Hi-C using long-range correlations in epigenetic data. Genome Biol. 16:180. doi: 10.1186/s13059-015-0741-y

  • 29

    Franke, A., and Baker, B. S. (1999). The rox1 and rox2 RNAs are essential components of the compensasome, which mediates dosage compensation in Drosophila. Mol. Cell 4, 117–122. doi: 10.1016/S1097-2765(00)80193-8

  • 30

    Fujita, R., Yamamoto, T., Arimura, Y., Fujiwara, S., Tachiwana, H., Ichikawa, Y., et al. (2020). Nucleosome destabilization by nuclear non-coding RNAs. Commun. Biol. 3:60. doi: 10.1038/s42003-020-0784-9

  • 31

    Furlong, E. E. M., and Levine, M. (2018). Developmental enhancers and chromosome topology. Science 361, 1341–1345. doi: 10.1126/science.aau0320

  • 32

    Gaiti, F., Calcino, A. D., Tanurdžić, M., and Degnan, B. M. (2017). Origin and evolution of the metazoan non-coding regulatory genome. Dev. Biol. 427, 193–202. doi: 10.1016/j.ydbio.2016.11.013

  • 33

    Galupa, R., Nora, E. P., Worsley-Hunt, R., Picard, C., Gard, C., van Bemmel, J. G., et al. (2020). A conserved noncoding locus regulates random monoallelic Xist expression across a topological boundary. Mol. Cell 77, 352.e8–367.e8. doi: 10.1016/j.molcel.2019.10.030

  • 34

    Gavrilov, A. A., Zharikova, A. A., Galitsyna, A. A., Luzhin, A. V., Rubanova, N. M., Golov, A. K., et al. (2020). Studying RNA–DNA interactome by Red-C identifies noncoding RNAs associated with various chromatin types and reveals transcription dynamics. Nucleic Acids Res. 48, 6699–6714. doi: 10.1093/nar/gkaa457

  • 35

    Gendrel, A.-V., and Heard, E. (2014). Noncoding RNAs and epigenetic mechanisms during X-chromosome inactivation. Annu. Rev. Cell Dev. Biol. 30, 561–580. doi: 10.1146/annurev-cellbio-101512-122415

  • 36

    Grant, J., Mahadevaiah, S. K., Khil, P., Sangrithi, M. N., Royo, H., Duckworth, J., et al. (2012). Rsx is a metatherian RNA with Xist-like properties in X-chromosome inactivation. Nature 487, 254–258. doi: 10.1038/nature11171

  • 37

    Graves, J. A. M. (2016). Evolution of vertebrate sex chromosomes and dosage compensation. Nat. Rev. Genet. 17, 33–46. doi: 10.1038/nrg.2015.2

  • 38

    Gruber, A. R., Findeiß, S., Washietl, S., Hofacker, I. L., and Stadler, P. F. (2010). RNAz 2.0: improved noncoding RNA detection. Pac. Symp. Biocomput. 2010, 69–79. doi: 10.1142/9789814295291_0009

  • 39

    Guetg, C., Scheifele, F., Rosenthal, F., Hottiger, M. O., and Santoro, R. (2012). Inheritance of silent rDNA chromatin is mediated by PARP1 via noncoding RNA. Mol. Cell 45, 790–800. doi: 10.1016/j.molcel.2012.01.024

  • 40

    Gupta, R. A., Shah, N., Wang, K. C., Kim, J., Horlings, H. M., Wong, D. J., et al. (2010). Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature 464, 1071–1076. doi: 10.1038/nature08975

  • 41

    Gutschner, T., Hämmerle, M., and Diederichs, S. (2013). MALAT1--a paradigm for long noncoding RNA function in cancer. J. Mol. Med. 91, 791–801. doi: 10.1007/s00109-013-1028-y

  • 42

    Guttman, M., Amit, I., Garber, M., French, C., Lin, M. F., Feldser, D., et al. (2009). Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature 458, 223–227. doi: 10.1038/nature07672

  • 43

    Guttman, M., Donaghey, J., Carey, B. W., Garber, M., Grenier, J. K., Munson, G., et al. (2011). lincRNAs act in the circuitry controlling pluripotency and differentiation. Nature 477, 295–300. doi: 10.1038/nature10398

  • 44

    Hacisuleyman, E., Goff, L. A., Trapnell, C., Williams, A., Henao-Mejia, J., Sun, L., et al. (2014). Topological organization of multichromosomal regions by the long intergenic noncoding RNA Firre. Nat. Struct. Mol. Biol. 21, 198–206. doi: 10.1038/nsmb.2764

  • 45

    Hacisuleyman, E., Shukla, C. J., Weiner, C. L., and Rinn, J. L. (2016). Function and evolution of local repeats in the Firre locus. Nat. Commun. 7:11021. doi: 10.1038/ncomms11021

  • 46

    Hasegawa, Y., Brockdorff, N., Kawano, S., Tsutui, K., Tsutui, K., and Nakagawa, S. (2010). The matrix protein hnRNP U is required for chromosomal localization of Xist RNA. Dev. Cell 19, 469–476. doi: 10.1016/j.devcel.2010.08.006

  • 47

    He, S., Liu, S., and Zhu, H. (2011). The sequence, structure and evolutionary features of HOTAIR in mammals. BMC Evol. Biol. 11:102. doi: 10.1186/1471-2148-11-102

  • 48

    Hezroni, H., Koppstein, D., Schwartz, M. G., Avrutin, A., Bartel, D. P., and Ulitsky, I. (2015). Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species. Cell Rep. 11, 1110–1122. doi: 10.1016/j.celrep.2015.04.023

  • 49

    Hutchinson, J. N., Ensminger, A. W., Clemson, C. M., Lynch, C. R., Lawrence, J. B., and Chess, A. (2007). A screen for nuclear transcripts identifies two linked noncoding RNAs associated with SC35 splicing domains. BMC Genom. 8:39. doi: 10.1186/1471-2164-8-39

  • 50

    Ibrahim, D. M., and Mundlos, S. (2020). The role of 3D chromatin domains in gene regulation: a multi-facetted view on genome organization. Curr. Opin. Genet. Dev. 61, 1–8. doi: 10.1016/j.gde.2020.02.015

  • 51

    Ilik, I. A., Quinn, J. J., Georgiev, P., Tavares-Cadete, F., Maticzka, D., Toscano, S., et al. (2013). Tandem stem-loops in roX RNAs act together to mediate X chromosome dosage compensation in Drosophila. Mol. Cell 51, 156–173. doi: 10.1016/j.molcel.2013.07.001

  • 52

    Imamura, T., Yamamoto, S., Ohgane, J., Hattori, N., Tanaka, S., and Shiota, K. (2004). Non-coding RNA directed DNA demethylation of Sphk1 CpG island. Biochem. Biophys. Res. Commun. 322, 593–600. doi: 10.1016/j.bbrc.2004.07.159

  • 53

    Jacob, M. D., Audas, T. E., Uniacke, J., Trinkle-Mulcahy, L., and Lee, S. (2013). Environmental cues induce a long noncoding RNA-dependent remodeling of the nucleolus. Mol. Biol. Cell 24, 2943–2953. doi: 10.1091/mbc.e13-04-0223

  • 54

    Kapusta, A., and Feschotte, C. (2014). Volatile evolution of long noncoding RNA repertoires: mechanisms and biological implications. Trends Genet. 30, 439–452. doi: 10.1016/J.TIG.2014.08.004

  • 55

    Koerner, M. V., Pauler, F. M., Hudson, Q. J., Santoro, F., Sawicka, A., Guenzl, P. M., et al. (2012). A downstream CpG island controls transcript initiation and elongation and the methylation state of the imprinted Airn macro ncRNA promoter. PLoS Genet. 8:e1002540. doi: 10.1371/journal.pgen.1002540

  • 56

    Lanz, R. B., McKenna, N. J., Onate, S. A., Albrecht, U., Wong, J., Tsai, S. Y., et al. (1999). A steroid receptor coactivator, SRA, functions as an RNA and is present in an SRC-1 complex. Cell 97, 17–27. doi: 10.1016/s0092-8674(00)80711-4

  • 57

    Latos, P. A., Pauler, F. M., Koerner, M. V., Şenergin, H. B., Hudson, Q. J., Stocsits, R. R., et al. (2012). Airn transcriptional overlap, but not its lncRNA products, induces imprinted Igf2r silencing. Science 338, 1469–1472. doi: 10.1126/science.1228110

  • 58

    Latos, P. A., Stricker, S. H., Steenpass, L., Pauler, F. M., Huang, R., Senergin, B. H., et al. (2009). An in vitro ES cell imprinting model shows that imprinted expression of the Igf2r gene arises from an allele-specific expression bias. Development 136, 437–448. doi: 10.1242/dev.032060

  • 59

    Lewandowski, J. P., Lee, J. C., Hwang, T., Sunwoo, H., Goldstein, J. M., Groff, A. F., et al. (2019). The Firre locus produces a trans-acting RNA molecule that functions in hematopoiesis. Nat. Commun. 10:5137. doi: 10.1038/s41467-019-12970-4

  • 60

    Li, L., Liu, B., Wapinski, O. L., Tsai, M.-C., Qu, K., Zhang, J., et al. (2013). Targeted disruption of Hotair leads to homeotic transformation and gene derepression. Cell Rep. 5, 3–12. doi: 10.1016/j.celrep.2013.09.003

  • 61

    Li, X., Zhou, B., Chen, L., Gou, L. T., Li, H., and Fu, X. D. (2017). GRID-seq reveals the global RNA–chromatin interactome. Nat. Biotechnol. 35, 940–950. doi: 10.1038/nbt.3968

  • 62

    Lieberman-Aiden, E., van Berkum, N. L., Williams, L., Imakaev, M., Ragoczy, T., Telling, A., et al. (2009). Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293. doi: 10.1126/SCIENCE.1181369

  • 63

    Lin, N., Chang, K.-Y., Li, Z., Gates, K., Rana, Z. A., Dang, J., et al. (2014). An evolutionarily conserved long noncoding RNA TUNA controls pluripotency and neural lineage commitment. Mol. Cell 53, 1005–1019. doi: 10.1016/j.molcel.2014.01.021

  • 64

    Lin, R., Maeda, S., Liu, C., Karin, M., and Edgington, T. S. (2007). A large noncoding RNA is a marker for murine hepatocellular carcinomas and a spectrum of human carcinomas. Oncogene 26, 851–858. doi: 10.1038/sj.onc.1209846

  • 65

    Lindsay, M. A., Griffiths-Jones, S., Clark, M. B., Choudhary, A., Smith, M. A., Taft, R. J., et al. (2013). The dark matter rises: the expanding world of regulatory RNAs. Essays Biochem. 54, 1–16. doi: 10.1042/bse0540001

  • 66

    Liu, F., Somarowthu, S., and Pyle, A. M. (2017). Visualizing the secondary and tertiary architectural domains of lncRNA RepA. Nat. Chem. Biol. 13, 282–289. doi: 10.1038/nchembio.2272

  • 67

    Lu, Y., Liu, X., Xie, M., Liu, M., Ye, M., Li, M., et al. (2017). The NF-κB-responsive long noncoding RNA FIRRE regulates posttranscriptional regulation of inflammatory gene expression through interacting with hnRNPU. J. Immunol. 199, 3571–3582. doi: 10.4049/jimmunol.1700091

  • 68

    Luke, B., Panza, A., Redon, S., Iglesias, N., Li, Z., and Lingner, J. (2008). The Rat1p 5' to 3' exonuclease degrades telomeric repeat-containing RNA and promotes telomere elongation in Saccharomyces cerevisiae. Mol. Cell 32, 465–477. doi: 10.1016/j.molcel.2008.10.019

  • 69

    Lyle, R., Watanabe, D., te Vruchte, D., Lerchner, W., Smrzka, O. W., Wutz, A., et al. (2000). The imprinted antisense RNA at the Igf2r locus overlaps but does not imprint Mas1. Nat. Genet. 25, 19–21. doi: 10.1038/75546

  • 70

    Maenner, S., Müller, M., Fröhlich, J., Langer, D., and Becker, P. B. (2013). ATP-dependent roX RNA remodeling by the helicase maleless enables specific association of MSL proteins. Mol. Cell 51, 174–184. doi: 10.1016/j.molcel.2013.06.011

  • 71

    Mancini-Dinardo, D., Steele, S. J. S., Levorse, J. M., Ingram, R. S., and Tilghman, S. M. (2006). Elongation of the Kcnq1ot1 transcript is required for genomic imprinting of neighboring genes. Genes Dev. 20, 1268–1282. doi: 10.1101/gad.1416906

  • 72

    Mao, Y. S., Sunwoo, H., Zhang, B., and Spector, D. L. (2011). Direct visualization of the co-transcriptional assembly of a nuclear body by noncoding RNAs. Nat. Cell Biol. 13, 95–101. doi: 10.1038/ncb2140

  • 73

    Marchese, F. P., and Huarte, M. (2014). Long non-coding RNAs and chromatin modifiers: their place in the epigenetic code. Epigenetics 9, 21–26. doi: 10.4161/epi.27472

  • 74

    Marín-Béjar, O., Mas, A. M., González, J., Martinez, D., Athie, A., Morales, X., et al. (2017). The human lncRNA LINC-PINT inhibits tumor cell invasion through a highly conserved sequence element. Genome Biol. 18:202. doi: 10.1186/s13059-017-1331-y

  • 75

    Marques, A. C., and Ponting, C. P. (2009). Catalogues of mammalian long noncoding RNAs: modest conservation and incompleteness. Genome Biol. 10:R124. doi: 10.1186/gb-2009-10-11-r124

  • 76

    Mayer, C., Neubert, M., and Grummt, I. (2008). The structure of NoRC-associated RNA is crucial for targeting the chromatin remodelling complex NoRC to the nucleolus. EMBO Rep. 9, 774–780. doi: 10.1038/embor.2008.109

  • 77

    Mayer, C., Schmitz, K.-M., Li, J., Grummt, I., and Santoro, R. (2006). Intergenic transcripts regulate the epigenetic state of rRNA genes. Mol. Cell 22, 351–361. doi: 10.1016/j.molcel.2006.03.028

  • 78

    McHugh, C. A., Chen, C.-K., Chow, A., Surka, C. F., Tran, C., McDonel, P., et al. (2015). The Xist lncRNA interacts directly with SHARP to silence transcription through HDAC3. Nature 521, 232–236. doi: 10.1038/nature14443

  • 79

    Meller, V. H., and Rattner, B. P. (2002). The roX genes encode redundant male-specific lethal transcripts required for targeting of the MSL complex. EMBO J. 21, 1084–1091. doi: 10.1093/emboj/21.5.1084

  • 80

    Mercer, T. R., Dinger, M. E., and Mattick, J. S. (2009). Long non-coding RNAs: insights into functions. Nat. Rev. Genet. 10, 155–159. doi: 10.1038/nrg2521

  • 81

    Minajigi, A., Froberg, J. E., Wei, C., Sunwoo, H., Kesner, B., Colognori, D., et al. (2015). Chromosomes. A comprehensive Xist interactome reveals cohesin repulsion and an RNA-directed chromosome conformation. Science 349:aab2276. doi: 10.1126/science.aab2276

  • 82

    Mohammad, F., Mondal, T., Guseva, N., Pandey, G. K., and Kanduri, C. (2010). Kcnq1ot1 noncoding RNA mediates transcriptional gene silencing by interacting with Dnmt1. Development 137, 2493–2499. doi: 10.1242/dev.048181

  • 83

    Mohammad, F., Pandey, R. R., Nagano, T., Chakalova, L., Mondal, T., Fraser, P., et al. (2008). Kcnq1ot1/Lit1 noncoding RNA mediates transcriptional silencing by targeting to the perinucleolar region. Mol. Cell. Biol. 28, 3713–3728. doi: 10.1128/mcb.02263-07

  • 84

    Moindrot, B., Cerase, A., Coker, H., Masui, O., Grijzenhout, A., Pintacuda, G., et al. (2015). A pooled shRNA screen identifies Rbm15, Spen, and Wtap as factors required for Xist RNA-mediated silencing. Cell Rep. 12, 562–572. doi: 10.1016/j.celrep.2015.06.053

  • 85

    Nagano, T., Mitchell, J. A., Sanz, L. A., Pauler, F. M., Ferguson-Smith, A. C., Feil, R., et al. (2008). The air noncoding RNA epigenetically silences transcription by targeting G9a to chromatin. Science 322, 1717–1720. doi: 10.1126/science.1163802

  • 86

    Nakagawa, S., and Hirano, T. (2014). Gathering around Firre. Nat. Struct. Mol. Biol. 21, 207–208. doi: 10.1038/nsmb.2782

  • 87

    Nakagawa, S., Ip, J. Y., Shioi, G., Tripathi, V., Zong, X., Hirose, T., et al. (2012). Malat1 is not an essential component of nuclear speckles in mice. RNA 18, 1487–1499. doi: 10.1261/rna.033217.112

  • 88

    Narendra, V., Rocha, P. P., An, D., Raviram, R., Skok, J. A., Mazzoni, E. O., et al. (2015). CTCF establishes discrete functional chromatin domains at the Hox clusters during differentiation. Science 347, 1017–1021. doi: 10.1126/science.1262088

  • 89

    Nawrocki, E. P., Kolbe, D. L., and Eddy, S. R. (2009). Infernal 1.0: inference of RNA alignments. Bioinformatics 25, 1335–1337. doi: 10.1093/bioinformatics/btp326

  • 90

    Necsulea, A., Soumillon, M., Warnefors, M., Liechti, A., Daish, T., Zeller, U., et al. (2014). The evolution of lncRNA repertoires and expression patterns in tetrapods. Nature 505, 635–640. doi: 10.1038/nature12943

  • 91

    Nesterova, T. B., Slobodyanyuk, S. Y., Elisaphenko, E. A., Shevchenko, A. I., Johnston, C., Pavlova, M. E., et al. (2001). Characterization of the genomic Xist locus in rodents reveals conservation of overall gene structure and tandem repeats but rapid evolution of unique sequence. Genome Res. 11, 833–849. doi: 10.1101/gr.174901

  • 92

    Nitsche, A., Rose, D., Fasold, M., Reiche, K., and Stadler, P. F. (2015). Comparison of splice sites reveals that long noncoding RNAs are evolutionarily well conserved. RNA 21, 801–812. doi: 10.1261/rna.046342.114

  • 93

    Novikova, I. V., Hennelly, S. P., and Sanbonmatsu, K. Y. (2012). Structural architecture of the human long non-coding RNA, steroid receptor RNA activator. Nucleic Acids Res. 40, 5034–5051. doi: 10.1093/nar/gks071

  • 94

    Pandey, R. R., Ceribelli, M., Singh, P. B., Ericsson, J., Mantovani, R., and Kanduri, C. (2004). NF-Y regulates the antisense promoter, bidirectional silencing, and differential epigenetic marks of the Kcnq1 imprinting control region. J. Biol. Chem. 279, 52685–52693. doi: 10.1074/jbc.M408084200

  • 95

    Pandey, R. R., Mondal, T., Mohammad, F., Enroth, S., Redrup, L., Komorowski, J., et al. (2008). Kcnq1ot1 antisense noncoding RNA mediates lineage-specific transcriptional silencing through chromatin-level regulation. Mol. Cell 32, 232–246. doi: 10.1016/j.molcel.2008.08.022

  • 96

    Park, S.-W., Kuroda, M. I., and Park, Y. (2008). Regulation of histone H4 Lys16 acetylation by predicted alternative secondary structures in roX noncoding RNAs. Mol. Cell. Biol. 28, 4952–4962. doi: 10.1128/mcb.00219-08

  • 97

    Peng, W., and Feng, J. (2016). Long noncoding RNA LUNAR1 associates with cell proliferation and predicts a poor prognosis in diffuse large B-cell lymphoma. Biomed. Pharmacother. 77, 65–71. doi: 10.1016/j.biopha.2015.12.001

  • 98

    Penny, G. D., Kay, G. F., Sheardown, S. A., Rastan, S., and Brockdorff, N. (1996). Requirement for Xist in X chromosome inactivation. Nature 379, 131–137. doi: 10.1038/379131a0

  • 99

    Pintacuda, G., Wei, G., Roustan, C., Kirmizitas, B. A., Solcan, N., Cerase, A., et al. (2017). hnRNPK recruits PCGF3/5-PRC1 to the Xist RNA B-repeat to establish polycomb-mediated chromosomal silencing. Mol. Cell 68, 955.e10–969.e10. doi: 10.1016/j.molcel.2017.11.013

  • 100

    Plath, K., Fang, J., Mlynarczyk-Evans, S. K., Cao, R., Worringer, K. A., Wang, H., et al. (2003). Role of histone H3 lysine 27 methylation in X inactivation. Science 300, 131–135. doi: 10.1126/science.1084274

  • 101

    Ponjavic, J., Ponting, C. P., and Lunter, G. (2007). Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. Genome Res. 17, 556–565. doi: 10.1101/gr.6036807

  • 102

    Portoso, M., Ragazzini, R., Brenčič, Ž., Moiani, A., Michaud, A., Vassilev, I., et al. (2017). PRC2 is dispensable for HOTAIR-mediated transcriptional repression. EMBO J. 36, 981–994. doi: 10.15252/embj.201695335

  • 103

    Postepska-Igielska, A., Giwojna, A., Gasri-Plotnitsky, L., Schmitt, N., Dold, A., Ginsberg, D., et al. (2015). LncRNA Khps1 regulates expression of the proto-oncogene SPHK1 via triplex-mediated changes in chromatin structure. Mol. Cell 60, 626–636. doi: 10.1016/j.molcel.2015.10.001

  • 104

    Postepska-Igielska, A., Krunic, D., Schmitt, N., Greulich-Bode, K. M., Boukamp, P., and Grummt, I. (2013). The chromatin remodelling complex NoRC safeguards genome stability by heterochromatin formation at telomeres and centromeres. EMBO Rep. 14, 704–710. doi: 10.1038/embor.2013.87

  • 105

    Quinn, J. J., Zhang, Q. C., Georgiev, P., Ilik, I. A., Akhtar, A., and Chang, H. Y. (2016). Rapid evolutionary turnover underlies conserved lncRNA-genome interactions. Genes Dev. 30, 191–207. doi: 10.1101/gad.272187.115

  • 106

    Ramírez, F., Lingg, T., Toscano, S., Lam, K. C., Georgiev, P., Chung, H.-R., et al. (2015). High-affinity sites form an interaction network to facilitate spreading of the MSL complex across the X chromosome in Drosophila. Mol. Cell 60, 146–162. doi: 10.1016/j.molcel.2015.08.024

  • 107

    Rao, S. S. P., Huntley, M. H., Durand, N. C., Stamenova, E. K., Bochkov, I. D., Robinson, J. T., et al. (2014). A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680. doi: 10.1016/j.cell.2014.11.021

  • 108

    Rinn, J. L., and Chang, H. Y. (2012). Genome regulation by long noncoding RNAs. Annu. Rev. Biochem. 81, 145–166. doi: 10.1146/annurev-biochem-051410-092902

  • 109

    Rinn, J. L., Kertesz, M., Wang, J. K., Squazzo, S. L., Xu, X., Brugmann, S. A., et al. (2007). Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs. Cell 129, 1311–1323. doi: 10.1016/j.cell.2007.05.022

  • 110

    Rivas, E., Clements, J., and Eddy, S. R. (2017). A statistical test for conserved RNA structure shows lack of evidence for structure in lncRNAs. Nat. Methods 14, 45–48. doi: 10.1038/nmeth.4066

  • 111

    Rivas, E., Clements, J., and Eddy, S. R. (2020). Estimating the power of sequence covariation for detecting conserved RNA structure. Bioinformatics 36, 3072–3076. doi: 10.1093/bioinformatics/btaa080

  • 112

    Santoro, F., Mayer, D., Klement, R. M., Warczok, K. E., Stukalov, A., Barlow, D. P., et al. (2013). Imprinted Igf2r silencing depends on continuous Airn lncRNA expression and is not restricted to a developmental window. Development 140, 1184–1195. doi: 10.1242/dev.088849

  • 113

    Santoro, R., Schmitz, K.-M., Sandoval, J., and Grummt, I. (2010). Intergenic transcripts originating from a subclass of ribosomal DNA repeats silence ribosomal RNA genes in trans. EMBO Rep. 11, 52–58. doi: 10.1038/embor.2009.254

  • 114

    Sarma, K., Cifuentes-Rojas, C., Ergun, A., Del Rosario, A., Jeon, Y., White, F., et al. (2014). ATRX directs binding of PRC2 to Xist RNA and Polycomb targets. Cell 159, 869–883. doi: 10.1016/j.cell.2014.10.019

  • 115

    Savić, N., Bär, D., Leone, S., Frommel, S. C., Weber, F. A., Vollenweider, E., et al. (2014). lncRNA maturation to initiate heterochromatin formation in the nucleolus is required for exit from pluripotency in ESCs. Cell Stem Cell 15, 720–734. doi: 10.1016/j.stem.2014.10.005

  • 116

    Saxena, A., and Carninci, P. (2011). Long non-coding RNA modifies chromatin: epigenetic silencing by long non-coding RNAs. Bioessays 33, 830–839. doi: 10.1002/bies.201100084

  • 117

    Schmitt, A. M., and Chang, H. Y. (2016). Long noncoding RNAs in cancer pathways. Cancer Cell 29, 452–463. doi: 10.1016/j.ccell.2016.03.010

  • 118

    Schmitz, K.-M., Mayer, C., Postepska, A., and Grummt, I. (2010). Interaction of noncoding RNA with the rDNA promoter mediates recruitment of DNMT3b and silencing of rRNA genes. Genes Dev. 24, 2264–2269. doi: 10.1101/gad.590910

  • 119

    Schoeftner, S., and Blasco, M. A. (2008). Developmentally regulated transcription of mammalian telomeres by DNA-dependent RNA polymerase II. Nat. Cell Biol. 10, 228–236. doi: 10.1038/ncb1685

  • 120

    Schoenfelder, S., and Fraser, P. (2019). Long-range enhancer-promoter contacts in gene expression control. Nat. Rev. Genet. 20, 437–455. doi: 10.1038/s41576-019-0128-0

  • 121

    Schorderet, P., and Duboule, D. (2011). Structural and functional differences in the long non-coding RNA hotair in mouse and human. PLoS Genet. 7:e1002071. doi: 10.1371/journal.pgen.1002071

  • 122

    Seidl, C. I. M., Stricker, S. H., and Barlow, D. P. (2006). The imprinted air ncRNA is an atypical RNAPII transcript that evades splicing and escapes nuclear export. EMBO J. 25, 3565–3575. doi: 10.1038/sj.emboj.7601245

  • 123

    Shi, Y., Downes, M., Xie, W., Kao, H. Y., Ordentlich, P., Tsai, C. C., et al. (2001). Sharp, an inducible cofactor that integrates nuclear receptor repression and activation. Genes Dev. 15, 1140–1151. doi: 10.1101/gad.871201

  • 124

    Simon, M. D., Pinter, S. F., Fang, R., Sarma, K., Rutenberg-Schoenberg, M., Bowman, S. K., et al. (2013). High-resolution Xist binding maps reveal two-step spreading during X-chromosome inactivation. Nature 504, 465–469. doi: 10.1038/nature12719

  • 125

    Sleutels, F., Zwart, R., and Barlow, D. P. (2002). The non-coding air RNA is required for silencing autosomal imprinted genes. Nature 415, 810–813. doi: 10.1038/415810a

  • 126

    Smith, M. A., Gesell, T., Stadler, P. F., and Mattick, J. S. (2013). Widespread purifying selection on RNA structure in mammals. Nucleic Acids Res. 41, 8220–8236. doi: 10.1093/nar/gkt596

  • 127

    Somarowthu, S., Legiewicz, M., Chillón, I., Marcia, M., Liu, F., and Pyle, A. M. (2015). HOTAIR forms an intricate and modular secondary structure. Mol. Cell 58, 353–361. doi: 10.1016/j.molcel.2015.03.006

  • 128

    Sridhar, B., Rivas-Astroza, M., Nguyen, T. C., Chen, W., Yan, Z., Cao, X., et al. (2017). Systematic mapping of RNA-chromatin interactions in vivo. Curr. Biol. 27, 602–609. doi: 10.1016/j.cub.2017.01.011

  • 129

    Stelzer, Y., Sagi, I., Yanuka, O., Eiges, R., and Benvenisty, N. (2014). The noncoding RNA IPW regulates the imprinted DLK1-DIO3 locus in an induced pluripotent stem cell model of Prader-Willi syndrome. Nat. Genet. 46, 551–557. doi: 10.1038/ng.2968

  • 130

    Tavares, R. C. A., Pyle, A. M., and Somarowthu, S. (2019). Phylogenetic analysis with improved parameters reveals conservation in lncRNA structures. J. Mol. Biol. 431, 1592–1603. doi: 10.1016/j.jmb.2019.03.012

  • 131

    Tomita, S., Abdalla, M. O. A., Fujiwara, S., Matsumori, H., Maehara, K., Ohkawa, Y., et al. (2015). A cluster of noncoding RNAs activates the ESR1 locus during breast cancer adaptation. Nat. Commun. 6:6966. doi: 10.1038/ncomms7966

  • 132

    Trimarchi, T., Bilal, E., Ntziachristos, P., Fabbri, G., Dalla-Favera, R., Tsirigos, A., et al. (2014). Genome-wide mapping and characterization of notch-regulated long noncoding RNAs in acute leukemia. Cell 158, 593–606. doi: 10.1016/j.cell.2014.05.049

  • 133

    Tripathi, V., Ellis, J. D., Shen, Z., Song, D. Y., Pan, Q., Watt, A. T., et al. (2010). The nuclear-retained noncoding RNA MALAT1 regulates alternative splicing by modulating SR splicing factor phosphorylation. Mol. Cell 39, 925–938. doi: 10.1016/j.molcel.2010.08.011

  • 134

    Tsai, M.-C., Manor, O., Wan, Y., Mosammaparast, N., Wang, J. K., Lan, F., et al. (2010). Long noncoding RNA as modular scaffold of histone modification complexes. Science 329, 689–693. doi: 10.1126/science.1192002

  • 135

    Ulitsky, I. (2016). Evolution to the rescue: using comparative genomics to understand long non-coding RNAs. Nat. Rev. Genet. 17, 601–614. doi: 10.1038/nrg.2016.85

  • 136

    Ulitsky, I., Shkumatava, A., Jan, C. H., Sive, H., and Bartel, D. P. (2011). Conserved function of lincRNAs in vertebrate embryonic development despite rapid sequence evolution. Cell 147, 1537–1550. doi: 10.1016/j.cell.2011.11.055

  • 137

    van Bakel, H., Nislow, C., Blencowe, B. J., and Hughes, T. R. (2010). Most “dark matter” transcripts are associated with known genes. PLoS Biol. 8:e1000371. doi: 10.1371/journal.pbio.1000371

  • 138

    van Bemmel, J. G., Galupa, R., Gard, C., Servant, N., Picard, C., Davies, J., et al. (2019). The bipartite TAD organization of the X-inactivation center ensures opposing developmental regulation of Tsix and Xist. Nat. Genet. 51, 1024–1034. doi: 10.1038/s41588-019-0412-0

  • 139

    Wang, K. C., and Chang, H. Y. (2011). Molecular mechanisms of long noncoding RNAs. Mol. Cell 43, 904–914. doi: 10.1016/j.molcel.2011.08.018

  • 140

    Wang, F., Tang, Z., Shao, H., Guo, J., Tan, T., Dong, Y., et al. (2018). Long noncoding RNA HOTTIP cooperates with CCCTC-binding factor to coordinate HOXA gene expression. Biochem. Biophys. Res. Commun. 500, 852–859. doi: 10.1016/j.bbrc.2018.04.173

  • 141

    Wang, K. C., Yang, Y. W., Liu, B., Sanyal, A., Corces-Zimmerman, R., Chen, Y., et al. (2011). A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature 472, 120–124. doi: 10.1038/nature09819

  • 142

    Wehner, S., Dörrich, A. K., Ciba, P., Wilde, A., and Marz, M. (2014). pRNA: NoRC-associated RNA of rRNA operons. RNA Biol. 11, 3-9. doi: 10.4161/rna.27448

  • 143

    Wevrick, R., and Francke, U. (1997). An imprinted mouse transcript homologous to the human imprinted in Prader-Willi syndrome (IPW) gene. Hum. Mol. Genet. 6, 325-332. doi: 10.1093/hmg/6.2.325

  • 144

    Wevrick, R., Kerns, J. A., and Francke, U. (1994). Identification of a novel paternally expressed gene in the Prader-Willi syndrome region. Hum. Mol. Genet. 3, 1877-1882. doi: 10.1093/hmg/3.10.1877

  • 145

    Will, S., Joshi, T., Hofacker, I. L., Stadler, P. F., and Backofen, R. (2012). LocARNA-P: accurate boundary prediction and improved detection of structural RNAs. RNA 18, 900-914. doi: 10.1261/rna.029041.111

  • 146

    Wongtrakoongate, P., Riddick, G., Fucharoen, S., and Felsenfeld, G. (2015). Association of the long non-coding RNA steroid receptor RNA activator (SRA) with TrxG and PRC2 complexes. PLoS Genet. 11:e1005615. doi: 10.1371/journal.pgen.1005615

  • 147

    Wutz, A., and Jaenisch, R. (2000). A shift from reversible to irreversible X inactivation is triggered during ES cell differentiation. Mol. Cell 5, 695-705. doi: 10.1016/S1097-2765(00)80248-8

  • 148

    Yang, F., Deng, X., Ma, W., Berletch, J. B., Rabaia, N., Wei, G., et al. (2015). The lncRNA Firre anchors the inactive X chromosome to the nucleolus by binding CTCF and maintains H3K27me3 methylation. Genome Biol. 16:52. doi: 10.1186/s13059-015-0618-0

  • 149

    Yang, L., Lin, C., Liu, W., Zhang, J., Ohgi, K. A., Grinstein, J. D., et al. (2011). ncRNA- and Pc2 methylation-dependent gene relocation between nuclear structures mediates gene activation programs. Cell 147, 773-788. doi: 10.1016/j.cell.2011.08.054

  • 150

    Yao, H., Brick, K., Evrard, Y., Xiao, T., Camerini-Otero, R. D., and Felsenfeld, G. (2010). Mediation of CTCF transcriptional insulation by DEAD-box RNA-binding protein p68 and steroid receptor RNA activator SRA. Genes Dev. 24, 2543-2555. doi: 10.1101/gad.1967810

  • 151

    Yotova, I. Y., Vlatkovic, I. M., Pauler, F. M., Warczok, K. E., Ambros, P. F., Oshimura, M., et al. (2008). Identification of the human homolog of the imprinted mouse Air non-coding RNA. Genomics 92, 464-473. doi: 10.1016/j.ygeno.2008.08.004

  • 152

    Zhang, B., Arun, G., Mao, Y. S., Lazar, Z., Hung, G., Bhattacharjee, G., et al. (2012). The lncRNA Malat1 is dispensable for mouse development but its transcription plays a cis-regulatory role in the adult. Cell Rep. 2, 111-123. doi: 10.1016/j.celrep.2012.06.003

  • 153

    Zhang, H., Zeitz, M. J., Wang, H., Niu, B., Ge, S., Li, W., et al. (2014). Long noncoding RNA-mediated intrachromosomal interactions promote imprinting at the Kcnq1 locus. J. Cell Biol. 204, 61-75. doi: 10.1083/jcb.201304152

  • 154

    Zhang, A., Zhao, J. C., Kim, J., Fong, K.-W., Yang, Y. A., Chakravarti, D., et al. (2015). LncRNA HOTAIR enhances the androgen-receptor-mediated transcriptional program and drives castration-resistant prostate cancer. Cell Rep. 13, 209-221. doi: 10.1016/j.celrep.2015.08.069

  • 155

    Zhao, X., Patton, J. R., Ghosh, S. K., Fischel-Ghodsian, N., Shen, L., and Spanjaard, R. A. (2007). Pus3p- and Pus1p-dependent pseudouridylation of steroid receptor RNA activator controls a functional switch that regulates nuclear receptor signaling. Mol. Endocrinol. 21, 686-699. doi: 10.1210/me.2006-0414

  • 156

    Zhao, J., Sun, B. K., Erwin, J. A., Song, J.-J., and Lee, J. T. (2008). Polycomb proteins targeted by a short repeat RNA to the mouse X chromosome. Science 322, 750-756. doi: 10.1126/science.1163045

Summary

Keywords

evolution, conservation, long-non-coding RNAs, chromatin conformation, three-dimensional chromatin conformation, genome topology, gene expression regulation

Citation

Ramírez-Colmenero A, Oktaba K and Fernandez-Valverde SL (2020) Evolution of Genome-Organizing Long Non-coding RNAs in Metazoans. Front. Genet. 11:589697. doi: 10.3389/fgene.2020.589697

Received

31 July 2020

Accepted

09 November 2020

Published

30 November 2020

Volume

11 - 2020

Edited by

Hehuang Xie, Virginia Tech, United States

Reviewed by

Daniel Vaiman, Institut National de la Santé et de la Recherche Médicale (INSERM), France; Sergey Razin, Institute of Gene Biology (RAS), Russia

Copyright

*Correspondence: Selene L. Fernandez-Valverde,

This article was submitted to Epigenomics and Epigenetics, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
