- 1Medical Research Center, Chongqing General Hospital, Chongqing, China
- 2Chongqing Academy of Medical Sciences, Chongqing, China
- 3Key Lab in Healthy Science and Technology of Shenzhen, Tsinghua Shenzhen International Graduate School, Shenzhen, China
- 4Department of Chinese Medical Gastrointestinal of China-Japan Friendship Hospital, Beijing, China
An effective loss-of-function study is necessary to investigate the biological function of long non-coding RNA (lncRNA). Various approaches are available, including RNA silencing, antisense oligos, and CRISPR-based genome editing. CRISPR-based genome editing is the most widely used for inactivating lncRNA function at the genomic level. Knocking out the lncRNA function can be achieved by removing the promoter and the first exon (PE1), introducing pre-termination poly(A) signals, or deleting the entire locus, unlike frameshift strategies used for messenger RNA (mRNA). However, the intricate genomic interplay between lncRNA and neighbor genes makes it challenging to interpret lncRNA function accurately. This article discusses the advantages and disadvantages of each lncRNA knockout method and envisions the potential future directions to facilitate lncRNA functional study.
Introduction
Long non-coding RNA (lncRNA) is a type of RNA transcript that is over 200 nucleotides long and lacks protein-coding potential. Like messenger RNA (mRNA), lncRNA is transcribed mainly by RNA polymerase II and undergoes post-transcriptional modifications (Nojima and Proudfoot, 2022). The use of high-throughput RNA transcript sequencing has allowed for identifying numerous lncRNAs. Despite being regarded as “junk” transcripts for a long time, the functions of lncRNAs are still not fully understood. However, many functional lncRNAs have been characterized in recent decades, demonstrating novel gene regulatory patterns. They have ushered in a new paradigm in the field (Rinn and Chang, 2012). Loss-of-function studies are a critical method for investigating the function of lncRNAs in vitro and in vivo. While deleting the promoter and first exon of lncRNA has been extensively utilized and demonstrated efficacy in deactivating lncRNA function, the intricate nature of lncRNA gene loci and their interactions with nearby or overlapping genes pose challenges in achieving precise lncRNA knockouts without affecting the expression of nearby or overlapping genes (Li Z. et al., 2021). In this article, we first introduce the biogenesis and central functional patterns of lncRNAs. We then summarize the prevailing techniques and strategies for knocking out or inactivating lncRNA function. Finally, we discuss the future directions for lncRNA functional inactivation and provide insights into practical approaches aimed at expediting lncRNA functional studies, from individual studies to high-throughput screening.
The biogenesis and function of lncRNA
Most lncRNAs are transcribed by RNA polymerase II, which undergoes post-transcriptional modifications such as 5′capping, 3′poly(A) tailing, and intron splicing (Rinn and Chang, 2012). LncRNAs can perform biological functions via various mechanisms, either by being exported to the cytoplasm or retained within the nucleus (Rinn and Chang, 2012). LncRNAs can be categorized into two classes based on their role: sequence-dependent and sequence-independent. The sequence-dependent role of lncRNAs, known as ceRNA (competing endogenous RNA), has been well-studied. LncRNAs can bind with microRNAs via Watson-Crick base-pairing, acting as a “miRNA sponge” to neutralize targeting miRNA carrying a specific “seed region.” (Conte et al., 2021; Zhang et al., 2022). As a result, the mRNA targeted by the endogenous miRNA is “released” from miRNA repression, leading to the recovery of translational levels (Tay et al., 2014). The contribution of ceRNA to gene regulation has been controversial due to the promiscuous miRNA-lncRNA-mRNA interplay, which makes quantitative analysis difficult (Tay et al., 2014). Another important sequence-dependent function of lncRNAs is serving as a “guide.” In this mechanism, a partial sequence on the lncRNA can bind to genomic DNA via sequence-specific pairing.
In contrast, another part of the lncRNA can form a specific tertiary structure and associate with a protein via electrostatic force. A well-known example of this category is the small guide RNA (sgRNA) and Cas proteins (Jinek et al., 2012; Cong et al., 2013), with derivatives such as prime editing guide RNA (pegRNA) and gene-modified prime editors (Anzalone et al., 2019; Anzalone et al., 2020). Another sequence-dependent function of lncRNAs is encoding small or micro peptides. For instance, Anderson et al. discovered that the LINC00948 lncRNA encodes a peptide called MLN, which interacts with SERCA to regulate the phenotype in skeletal muscle (Anderson et al., 2015).
Most characterized lncRNAs function sequence independently by forming tertiary structures and interacting with proteins as a scaffold (Yoon et al., 2013; Andric and Rougemaille, 2021). Dozens of protein partners of lncRNAs have been reported (Gerstberger et al., 2014), among which the PRC2 (Polycomb Repressive Complex 2) complex has been well-studied. The PRC2 core complex consists of EED, SUZ12, EZH1, EZH2, etc. (Blackledge and Klose, 2021). LncRNAs such as XIST, HOTAIR, ANRIL, etc., interact with PRC2 components in the nucleus, regulating H3K27 methylation and gene silencing. These lncRNAs play an essential role in dosage compensation, imprinting, and histone accessibility (Yoon et al., 2013). LncRNAs can also associate with proteins in the cytosol. For example, the vascular smooth muscle cell and endothelial cell-enriched lncRNA, SENCR, interacts with cytoskeletal-associated protein (CKAP4) and modulates the surface localization of cadherin 5 (CDH5) (Lyu et al., 2019). This interaction plays a critical role in maintaining homeostasis and integrity of the vascular endothelial monolayer (Lyu et al., 2019). Another noteworthy lncRNA functioning pattern is in-cis interacting lncRNA. The lncRNA Upperhand (Uph) and the protein-coding gene Hand2 localize on chromosome 8 in the mouse genome in a divergent direction (Anderson et al., 2016). There is merely a 150 bp gap between the transcription start site (TSS) between Uph and Hand2. Uph transcription processing establishes a permissive chromatin status and therefore benefits Hand2 transcription initiation. The blockade of Uph transcription, but not the knockdown of Uph transcripts, compromises Hand2 expression, suggesting that Uph plays an important role in Hand2 transcription regulation (Anderson et al., 2016).
The genomic complexity of lncRNA
Most lncRNAs are multi-exons and undergo maturation processes like mRNAs: transcribed by the RNA polymerase II, primary lncRNA transcripts are 5′capped, 3′polyadenylated, and introns are removed (St Laurent et al., 2015). Different from mRNAs exported to the cytoplasm for protein synthesis, a portion of lncRNAs retained in the nucleus and play essential roles in gene regulation (St Laurent et al., 2015). For example, the inactive X chromosome-specific transcript, XIST, was discovered in the cell nucleus that dominates the X-inactivation by silencing one of the two female X chromosomes and preventing the dosage effect (Chao et al., 2002; Lee, 2012). Here arises a puzzle about how lncRNAs retained in the cell nucleus, unlike their protein-coding counterparts. Yin et al. reported that U1 snRNP regulates the chromatin retention of non-coding RNAs and revealed that mutation of the 3′splicing site causes nuclear localization and performs cis-regulation on mRNA transcription (Yin et al., 2020). However, the regulatory function of nuclear lncRNAs is far from clear, partially attributed to the lack of understanding of RNA-binding proteins (RBPs) since RBPs mediate most lncRNA functions. It is estimated that there are ∼1,500 RBPs encoded by the human genome (Gerstberger et al., 2014), whereas the actual number could be significantly more. Lately, the genome-scale investigation of RNA-binding proteins unveiled the regulatory pattern of lncRNA-protein, suggesting a novel paradigm for lncRNA study (Xiao et al., 2019; Van Nostrand et al., 2020).
Bearing multiple introns makes lncRNA occupy a large piece of genomic DNA space for accommodating full-length lncRNA. The longer the lncRNA primary transcript is, the more likely it overlaps and intervenes with another mRNA, lncRNA, or snoRNA coding sequence. Thus, lncRNAs were categorized into long intergenic non-coding RNA, head-to-head overlap, head-to-head non-overlap, tail-to-tail overlap, tail-to-tail non-overlap, embedded, intronic, and complicated form lncRNA according to the relative genomic localization of lncRNA to the nearby protein-coding gene (Figure 1). Due to the complexity of the genomic locus of lncRNAs, it is a big challenge to inactive a target lncRNA without perturbing adjacent or overlapped transcript counterparts, hampering the functional study of lncRNAs.
The strategies for lncRNA functional knockout
Multiple strategies have been established to inactivate the lncRNA function via inheritable and uninheritable approaches, including but not limited to siRNA duplex silencing, antisense oligonucleotides (ASO) inhibition, and CRISPR-based genome editing (Miano et al., 2019). CRISPR-based genome editing is the primary technique resolution to create stable and transferrable lncRNA knockout cell lines or animals. Here, we summarized the overall knockout or knockdown strategies and discussed the advantages and disadvantages of each method.
1 Removing the promoter and the first exon of lncRNA
Like mRNA, most lncRNAs were transcribed by RNA polymerase II and regulated by the accessibility of the chromatin state and the transcriptional factors that directly bind to the DNA duplex to recruit the initiation complex (St Laurent et al., 2015). Thus, a widely used strategy to inactivate lncRNA expression is to remove the promoter region from the transcription start site (TSS) to 1,000–2000 base pair upstream (Figure 2A) (Zhang et al., 2012; Ho et al., 2015; Zhen et al., 2017). Provided that most lncRNAs are multi-exon, the first exon of the target lncRNA is usually excised to ensure the inhibitory effect (Aparicio-Prat et al., 2015; Li et al., 2021). In brief, two sgRNAs are designed using sequencing flanking (±100 bp) target areas (-1k upstream of TSS, the 5′splicing site of the first intron). The candidate sequences are analyzed and optimized by the on-targeting and off-targeting computational algorithms. The sgRNAs are synthesized as mature crRNA (Jinek et al., 2012) or inserted into the sgRNA expression cassette plasmids (Ran et al., 2013). The delivery of Cas9 protein can be achieved via various approaches, viral-based vectors, in vitro transcribed (IVT) RNA, and purified protein (Luther et al., 2018).
FIGURE 2. Schematic of lncRNA functional knockout strategies. (A) the schematic of promoter-exon1 deletion (PE) lncRNA knockout strategy; (B) the schematic of poly(A) signal insertion for lncRNA knockout; (C) the schematic of BESST lncRNA knockout method.
The genome editing method has been comprehensively described (Rosenlund et al., 2021), and we are not going through all the details here. The advantages of this strategy include 1) simple: only two sgRNAs and Cas9 to be delivered, the genome editing efficiency is good; 2) excellent inhibition: since the driver of the target transcript is wholly removed, the target lncRNA is very unlikely to be expressed; 3) friendly to genotyping: the editing generates an approximately a thousand base pair gap in the genomic DNA, which is convenient to be detected via genotyping PCR and agarose gel electrophoresis. However, the flaw of this method is also significant, the apparent genome disturbance. Many lncRNA loci are close, overlap, or included in the genomic loci of another transcript, typically protein-coding genes (Sigova et al., 2013). Thus, removing the promoter and the first exon of the lncRNA is likely to delete the protein-coding sequence of transcripts. It causes direct compromise of the protein-coding genes and transcriptome interference and phenotype change. The collateral intervention could mislead the researcher and mask the natural biological function of the target lncRNA.
2 The insertion of pre-termination poly(A) signal
Another widely-used method to inactivate the lncRNA function is to introduce a poly(A) signal downstream of the TSS, which causes the detachment of RNA polymerase II from the DNA strand and disassembly (Figure 2B) (Liu et al., 2017). Only one sgRNA is required for this approach to generate a double-strand break (DSB) by Cas9 protein. Notably, a repair template (single-strand DNA is preferred) is needed to serve as an HDR (homology-directed repair) template to introduce the poly(A) signal containing codons via homologous recombination (Powell et al., 2021). Notably, this strategy minimizes the perturbance of the genome structure: only a few hundred nucleotides are inserted (Liu et al., 2017; Lavalou et al., 2019; Powell et al., 2021). However, a few significant drawbacks are preventing it from the extensive application: 1) Low efficiency: insertion of poly(A) signal requires 3-component (sgRNA, Cas9, HDR template) CRISPR-based genome editing, compared to the 2-component (sgRNA, Cas9) strategy, the overall genome editing efficiency is limited by the recombination efficiency of the HDR template. 2) Alternative TSS: Like mRNAs, lncRNAs undergo alternative splicing and generate multiple splicing variants. If a lncRNA transcript has an alternative TSS, the knockout could fail if the poly(A) signal insertion site is upstream of the second alternative TSS. A truncated lncRNA transcript will be transcribed if the poly(A) signal insertion is too far from the TSS (McDonel and Guttman, 2019). Despite the low efficiency, poly(A) signal insertion is still an applicable alternative for terminating RNA transcript in vitro and in vivo.
3 The branchpoint to 3′splicing site targeting (BESST) knockout strategy
It is well-established that lncRNAs reside in intricate gene contexts within the mammalian genome. Numerous lncRNAs are in close proximity to neighboring protein-coding genes, which can be categorized into various natural antisense transcripts (NATs), including head-to-head, tail-to-tail, and entirely overlapped (Reis and Poirier, 2021). Therefore, it is highly probable that the conventional CRISPR-based gene knockout strategy, which involves removing the promoter and first exon of an RNA transcript, will affect the neighboring protein-coding gene on the same or opposite strand. Goyal et al. reported that nearly 60 percent of 15,929 lncRNAs were not in secure situations for CRISPR-based gene editing (Goyal et al., 2017), indicating that deletion of lncRNA almost certainly results in the inactivation of protein-coding genes. The BESST (branchpoint to 3′splicing site targeting) gene knockout strategy is a recently described method for inactivating lncRNA and protein-coding genes by removing the branchpoint to the 3′splicing site of the last intron within a compact 18–44 nt region (Figure 2C) (Zhang et al., 2023). The BESST removes the DNA segment from the branchpoint to the 3′splicing site in the last exon of the target lncRNA, resulting in the intron retention and nuclear retention of the target RNA. Further research revealed that the BESST gene knockout induces lncRNA nuclear degradation by decapping via NCBP1-dependent mechanism and poly(A) deadenylation via PABPN1-mediated and CNOT7/CNOT8-dependent mechanism. Notably, the BESST knockout was validated to also function in multi-exon protein-coding genes, providing an alternative method for studying gene function. Another important aspect of the BESST knockout method is that it could facilitate researchers to determine the underlying mechanism of lncRNA. LncRNA has multiple modes of functioning, including trans- and cis-action. The BESST knockout technique could distinguish between sequence-dependence and transcription-dependence in the mechanism of lncRNAs across their genomic loci (McDonel and Guttman, 2019; Zhang et al., 2023). However, since there is no intron to deal with, we were unable to disregard the BESST application’s limitations in single-exon RNA transcripts. In brief, the emergence of the BESST knockout strategy provides a convenient tool to inactivate multi-exon RNA transcripts with minimal genomic DNA perturbation, which could be applied to precisely identify gene function via high throughput screening. Notably, the efficiency and feasibility of BESST will be further enhanced by using optimized Cas proteins which allow less stringent PAM (protospacer-adjacent motif) sequence (Walton et al., 2020).
4 The uninheritable approach
Besides the CRISPR-based lncRNA knockout strategy, other methods cannot be inherited in vivo, including RNA silencing and antisense oligo (Stojic et al., 2018). RNA silencing is the knockdown gene approach using synthesized siRNA duplex or viral-based shRNA expression vector (Elbashir et al., 2001; Rao et al., 2009). RNA silencing is effective for knocking down protein-encoding genes, whereas it only works for a portion of lncRNA. The molecular principle of RNA silencing is the siRNA/shRNA duplex being loaded into the RISC complex and functions as a guide to associate with target mRNA with Watson-Crick pairing and initiating RNA degradation (Rao et al., 2009). However, RNA silencing does not work for nuclear lncRNA since there is no RISC complex distribution in the cell nucleus (Zeng and Cullen, 2002; Lennox and Behlke, 2016). Antisense oligonucleotide (ASO) is a short single-strand DNA that targets lncRNA in a sequence-specific manner and triggers RNase H activity for degradation (Bennett, 2019). However, ASO can only be delivered by transfection and not be loaded into a plasmid, establishing a stable expression line as shRNA. So, the animal application is limited even if the ASO showed superior efficiency in nuclear lncRNA knockdown. Cas13 is a newly emerged genetic tool that directly edits RNA transcript efficiently (Pickar-Oliver and Gersbach, 2019), whereas the uninheritable attribute of the method dramatically limits its application in animal study. The pros and cons of the aforementioned methodologies are summarized in Table 1.
Perspectives
LncRNAs are essential regulatory molecules in cells, but the majority of lncRNAs’ biological functions are unknown. This is largely due to the fact that current functional studies are restricted to analyzing individual lncRNAs, which is laborious and inefficient. In recent years, the CRISPR-based genome-scale knockout screening and transcriptional activation method for protein-coding genes were established (Fox et al., 2018). Subsequently, the CRISPR-based genome-scale deletion screening of human lncRNAs was also conducted (Zhu et al., 2016). Due to technical limitations, these early initiatives can only investigate intergenic lncRNAs and not natural antisense lncRNAs, since large-piece DNA deletion result in unintended protein-coding or non-coding RNA expression intervention and lead to unpredictable effects. The emergence of precise and genome-wide lncRNA functional knockout methodology would no doubt expedite the understanding of lncRNA more efficiently. It will assist researchers to comprehend the importance of non-coding RNAs in various biological and pathological processes and could also be utilized for developing potential therapeutical interventions.
Different from proteins, the biological function of lncRNAs is mainly exerted via interacting with proteins (Kopp and Mendell, 2018). Rinn et al. introduced diverse patterns of lncRNAs functioning with their protein partners, including decoy, scaffold, guide, and enhancer (Rinn and Chang, 2012). Nevertheless, the association between lncRNA and protein is promiscuous since the RNA-protein interaction is mediated by the non-covalent bond and depended on the RNA tertiary structure, which is flexible (Gruber et al., 2008; Ferre et al., 2016). The tertiary structure of RNA is more dynamic than that of proteins and is influenced by the transcriptome, interacting proteins, ion strength, and so on (Jaeger et al., 1989; Li and Aviran, 2018). It is difficult and pointless to determine the “true” RNA tertiary structure of a target lncRNA in the complex biophysical environment in living cells because different cell types exhibit distinct transcriptomes and proteomes (Li and Aviran, 2018; Sanchez de Groot et al., 2019; Wang et al., 2021). However, we can elucidate the biological function of RNA via the aspect of the interacting proteins, which can be accomplished by a variety of technical resolutions (McHugh et al., 2014; Ramanathan et al., 2019). For example, the PAR-CLIP (photoactivatable ribonucleoside enhanced crosslinked immunoprecipitation) is able to pinpoint the exact nucleotides on RNA transcripts that are directly interacting with binding proteins by in vivo labeling nascent RNAs with photoreactive nucleosides 4-thiouridine (4-SU) or 6-thioguanosine (6SG) (Hafner et al., 2010). The information on direct protein-binding nucleotides revealed by PAR-CLIP could also be utilized for calculating the potential RNA tertiary structure in the future.
The lncRNA expression is sophisticatedly coordinated in a spatiotemporal manner. There are two fundamental components of spatiotemporal regulation: where and when. LncRNAs are expressed in different cells, especially during development. For instance, Kim and colleagues reported that lncRNA Moshe is detected in the cardiac mesoderm stage (E8.5 to E9.5) along with the expression of Gata6, and continues to be expressed and increases in the atrioventricular septum (E12.5) (Kim et al., 2021). Knock-down of Moshe causes transcriptional suppression of Nkx2.5 and increases the expression of second heart field (SHF) lineage genes, which may play a critical role in ASD. Another level of spatial expression of lncRNAs is subcellular localization. LncRNAs exhibit distinguished biological functional and underlying mechanisms in different cell compartments. Localized lncRNAs in the cell nucleus are primarily involved in epigenetic and transcriptional regulation, whereas cytoplasmic lncRNAs are typically involved in post-transcriptional regulation, protein-protein interactions, etc. (Yao et al., 2019; Gao et al., 2020). Accordingly, a new technique, APEX-seq, was developed which can dissect the function of lncRNA in different subcellar localizations, enabling a comprehensive understanding of RNA spatiotemporal regulation (Fazal et al., 2019).
Last, but not least, the expanding range of RNA-binding proteins (RBPs) also broadens our view of lncRNA functioning. RBPs are proteins containing one or more RNA binding domains (RBDs) that can directly interact with RNA strands and alter their structure and function (Hentze et al., 2018). Several hundred RBPs have been identified over the years, but studies have found RBPs without conventional RBDs, suggesting the number of RBPs is underestimated (Hentze and Argos, 1991; Hentze, 1994). Notably, the unconventional RBPs introduce the subject of phase condensation or phase separation, another crucial phenomenon in cells. A growing body of research has shown that lncRNAs and unconventional RBPs, such as paraspeckles in the nucleus and processing bodies in the cytoplasm, are crucial for organizing nonmembrane organelles and controlling gene expression (Fox et al., 2018; Elguindy and Mendell, 2021; Hirose et al., 2023). The role of lncRNA in the formation of non-membrane organelles and how it supports specific biological processes remain largely unknown. Thus, a comprehensive multi-omics strategy is required to reveal the spatiotemporal function of lncRNA in various biological and pathological processes.
Author contributions
QL, SZ, and ZT drafted the manuscript, QL and ZZ supervised and revised the manuscript. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the National Natural Science Funding of China (#82070486).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Anderson, D. M., Anderson, K. M., Chang, C. L., Makarewich, C. A., Nelson, B. R., McAnally, J. R., et al. (2015). A micropeptide encoded by a putative long non-coding RNA regulates muscle performance. Cell 160 (4), 595–606. doi:10.1016/j.cell.2015.01.009
Anderson, K. M., Anderson, D. M., McAnally, J. R., Shelton, J. M., Bassel-Duby, R., and Olson, E. N. (2016). Transcription of the non-coding RNA upperhand controls Hand2 expression and heart development. Nature 539 (7629), 433–436. doi:10.1038/nature20128
Andric, V., and Rougemaille, M. (2021). Long non-coding RNAs in the control of gametogenesis: lessons from fission yeast. Non-coding RNA 7 (2), 34. doi:10.3390/ncrna7020034
Anzalone, A. V., Koblan, L. W., and Liu, D. R. (2020). Genome editing with CRISPR-Cas nucleases, base editors, transposases and prime editors. Nat. Biotechnol. 38 (7), 824–844. doi:10.1038/s41587-020-0561-9
Anzalone, A. V., Randolph, P. B., Davis, J. R., Sousa, A. A., Koblan, L. W., Levy, J. M., et al. (2019). Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576 (7785), 149–157. doi:10.1038/s41586-019-1711-4
Aparicio-Prat, E., Arnan, C., Sala, I., Bosch, N., Guigo, R., and Johnson, R. (2015). Decko: single-oligo, dual-CRISPR deletion of genomic elements including long non-coding RNAs. BMC Genomics 16, 846. doi:10.1186/s12864-015-2086-z
Bennett, C. F. (2019). Therapeutic antisense oligonucleotides are coming of age. Annu. Rev. Med. 70, 307–321. doi:10.1146/annurev-med-041217-010829
Blackledge, N. P., and Klose, R. J. (2021). The molecular principles of gene regulation by Polycomb repressive complexes. Nat. Rev. Mol. Cell Biol. 22 (12), 815–833. doi:10.1038/s41580-021-00398-y
Chao, W., Huynh, K. D., Spencer, R. J., Davidow, L. S., and Lee, J. T. (2002). CTCF, a candidate trans-acting factor for X-inactivation choice. Science 295 (5553), 345–347. doi:10.1126/science.1065982
Cong, L., Ran, F. A., Cox, D., Lin, S., Barretto, R., Habib, N., et al. (2013). Multiplex genome engineering using CRISPR/Cas systems. Science 339 (6121), 819–823. doi:10.1126/science.1231143
Conte, F., Fiscon, G., Sibilio, P., Licursi, V., and Paci, P. (2021). An overview of the computational models dealing with the regulatory ceRNA mechanism and ceRNA deregulation in cancer. Methods Mol. Biol. 2324, 149–164. doi:10.1007/978-1-0716-1503-4_10
Elbashir, S. M., Harborth, J., Lendeckel, W., Yalcin, A., Weber, K., and Tuschl, T. (2001). Duplexes of 21-nucleotide RNAs mediate RNA interference in cultured mammalian cells. Nature 411 (6836), 494–498. doi:10.1038/35078107
Elguindy, M. M., and Mendell, J. T. (2021). NORAD-induced Pumilio phase separation is required for genome stability. Nature 595 (7866), 303–308. doi:10.1038/s41586-021-03633-w
Fazal, F. M., Han, S., Parker, K. R., Kaewsapsak, P., Xu, J., Boettiger, A. N., et al. (2019). Atlas of subcellular RNA localization revealed by APEX-seq. Cell 178 (2), 473–490. doi:10.1016/j.cell.2019.05.027
Ferre, F., Colantoni, A., and Helmer-Citterich, M. (2016). Revealing protein-lncRNA interaction. Brief. Bioinform 17 (1), 106–116. doi:10.1093/bib/bbv031
Fox, A. H., Nakagawa, S., Hirose, T., and Bond, C. S. (2018). Paraspeckles: where long non-coding RNA meets phase separation. Trends Biochem. Sci. 43 (2), 124–135. doi:10.1016/j.tibs.2017.12.001
Gao, N., Li, Y., Li, J., Gao, Z., Yang, Z., Li, Y., et al. (2020). Long non-coding RNAs: the regulatory mechanisms, research strategies, and future directions in cancers. Front. Oncol. 10, 598817. doi:10.3389/fonc.2020.598817
Gerstberger, S., Hafner, M., and Tuschl, T. (2014). A census of human RNA-binding proteins. Nat. Rev. Genet. 15 (12), 829–845. doi:10.1038/nrg3813
Goyal, A., Myacheva, K., Gross, M., Klingenberg, M., Duran Arque, B., and Diederichs, S. (2017). Challenges of CRISPR/Cas9 applications for long non-coding RNA genes. Nucleic Acids Res. 45 (3), e12. doi:10.1093/nar/gkw883
Gruber, A. R., Lorenz, R., Bernhart, S. H., Neubock, R., and Hofacker, I. L. (2008). The Vienna RNA websuite. Nucleic Acids Res. 36, W70–W74. Web Server issue). doi:10.1093/nar/gkn188
Hafner, M., Landthaler, M., Burger, L., Khorshid, M., Hausser, J., Berninger, P., et al. (2010). Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell 141 (1), 129–141. doi:10.1016/j.cell.2010.03.009
Hentze, M. W., and Argos, P. (1991). Homology between IRE-BP, a regulatory RNA-binding protein, aconitase, and isopropylmalate isomerase. Nucleic Acids Res. 19 (8), 1739–1740. doi:10.1093/nar/19.8.1739
Hentze, M. W., Castello, A., Schwarzl, T., and Preiss, T. (2018). A brave new world of RNA-binding proteins. Nat. Rev. Mol. Cell Biol. 19 (5), 327–341. doi:10.1038/nrm.2017.130
Hentze, M. W. (1994). Enzymes as RNA-binding proteins: a role for (di)nucleotide-binding domains? Trends Biochem. Sci. 19 (3), 101–103. doi:10.1016/0968-0004(94)90198-8
Hirose, T., Ninomiya, K., Nakagawa, S., and Yamazaki, T. (2023). A guide to membraneless organelles and their various roles in gene regulation. Nat. Rev. Mol. Cell. Biol. 24 (4), 288–304. doi:10.1038/s41580-022-00558-8
Ho, T. T., Zhou, N., Huang, J., Koirala, P., Xu, M., Fung, R., et al. (2015). Targeting non-coding RNAs with the CRISPR/Cas9 system in human cell lines. Nucleic Acids Res. 43 (3), e17. doi:10.1093/nar/gku1198
Jaeger, J. A., Turner, D. H., and Zuker, M. (1989). Improved predictions of secondary structures for RNA. Proc. Natl. Acad. Sci. U. S. A. 86 (20), 7706–7710. doi:10.1073/pnas.86.20.7706
Jinek, M., Chylinski, K., Fonfara, I., Hauer, M., Doudna, J. A., and Charpentier, E. (2012). A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337 (6096), 816–821. doi:10.1126/science.1225829
Kim, N. J., Lee, K. H., Son, Y., Nam, A. R., Moon, E. H., Pyun, J. H., et al. (2021). Spatiotemporal expression of long non-coding RNA Moshe modulates heart cell lineage commitment. RNA Biol. 18 (2), 640–654. doi:10.1080/15476286.2021.1976549
Kopp, F., and Mendell, J. T. (2018). Functional classification and experimental dissection of long non-coding RNAs. Cell 172 (3), 393–407. doi:10.1016/j.cell.2018.01.011
Lavalou, P., Eckert, H., Damy, L., Constanty, F., Majello, S., Bitetti, A., et al. (2019). Strategies for genetic inactivation of long non-coding RNAs in zebrafish. RNA 25 (8), 897–904. doi:10.1261/rna.069484.118
Lee, J. T. (2012). Epigenetic regulation by long non-coding RNAs. Science 338 (6113), 1435–1439. doi:10.1126/science.1231776
Lennox, K. A., and Behlke, M. A. (2016). Cellular localization of long non-coding RNAs affects silencing by RNAi more than by antisense oligonucleotides. Nucleic Acids Res. 44 (2), 863–877. doi:10.1093/nar/gkv1206
Li, H., and Aviran, S. (2018). Statistical modeling of RNA structure profiling experiments enables parsimonious reconstruction of structure landscapes. Nat. Commun. 9 (1), 606. doi:10.1038/s41467-018-02923-8
Li, P. Y., Li, S. Q., Gao, S. G., and Dong, D. Y. (2021a). CRISPR/Cas9-mediated gene editing on Sox2ot promoter leads to its truncated expression and does not influence neural tube closure and embryonic development in mice. Biochem. Biophys. Res. Commun. 573, 107–111. doi:10.1016/j.bbrc.2021.08.029
Li, Z., Liu, L., Jiang, S., Li, Q., Feng, C., Du, Q., et al. (2021b). LncExpDB: an expression database of human long non-coding RNAs. Nucleic Acids Res. 49 (D1), D962–D968. doi:10.1093/nar/gkaa850
Liu, Y., Han, X., Yuan, J., Geng, T., Chen, S., Hu, X., et al. (2017). Biallelic insertion of a transcriptional terminator via the CRISPR/Cas9 system efficiently silences expression of protein-coding and non-coding RNA genes. J. Biol. Chem. 292 (14), 5624–5633. doi:10.1074/jbc.M116.769034
Luther, D. C., Lee, Y. W., Nagaraj, H., Scaletti, F., and Rotello, V. M. (2018). Delivery approaches for CRISPR/Cas9 therapeutics in vivo: advances and challenges. Expert Opin. Drug Deliv. 15 (9), 905–913. doi:10.1080/17425247.2018.1517746
Lyu, Q., Xu, S., Lyu, Y., Choi, M., Christie, C. K., Slivano, O. J., et al. (2019). SENCR stabilizes vascular endothelial cell adherens junctions through interaction with CKAP4. Proc. Natl. Acad. Sci. U. S. A. 116 (2), 546–555. doi:10.1073/pnas.1810729116
McDonel, P., and Guttman, M. (2019). Approaches for understanding the mechanisms of long non-coding RNA regulation of gene expression. Cold Spring Harb. Perspect. Biol. 11 (12), a032151. doi:10.1101/cshperspect.a032151
McHugh, C. A., Russell, P., and Guttman, M. (2014). Methods for comprehensive experimental identification of RNA-protein interactions. Genome Biol. 15 (1), 203. doi:10.1186/gb4152
Miano, J. M., Long, X., and Lyu, Q. (2019). CRISPR links to long non-coding RNA function in mice: a practical approach. Vasc. Pharmacol. 114, 1–12. doi:10.1016/j.vph.2019.02.004
Nojima, T., and Proudfoot, N. J. (2022). Mechanisms of lncRNA biogenesis as revealed by nascent transcriptomics. Nat. Rev. Mol. Cell Biol. 23 (6), 389–406. doi:10.1038/s41580-021-00447-6
Pickar-Oliver, A., and Gersbach, C. A. (2019). The next generation of CRISPR-Cas technologies and applications. Nat. Rev. Mol. Cell Biol. 20 (8), 490–507. doi:10.1038/s41580-019-0131-5
Powell, G., Pavlovic Djuranovic, S., and Djuranovic, S. (2021). Gene dosage effects of poly(A) track-engineered hypomorphs. Mol. Ther. Nucleic Acids 26, 865–878. doi:10.1016/j.omtn.2021.10.005
Ramanathan, M., Porter, D. F., and Khavari, P. A. (2019). Methods to study RNA-protein interactions. Nat. Methods 16 (3), 225–234. doi:10.1038/s41592-019-0330-1
Ran, F. A., Hsu, P. D., Wright, J., Agarwala, V., Scott, D. A., and Zhang, F. (2013). Genome engineering using the CRISPR-Cas9 system. Nat. Protoc. 8 (11), 2281–2308. doi:10.1038/nprot.2013.143
Rao, D. D., Vorhies, J. S., Senzer, N., and Nemunaitis, J. (2009). siRNA vs. shRNA: similarities and differences. Adv. Drug Deliv. Rev. 61 (9), 746–759. doi:10.1016/j.addr.2009.04.004
Reis, R. S., and Poirier, Y. (2021). Making sense of the natural antisense transcript puzzle. Trends Plant Sci. 26 (11), 1104–1115. doi:10.1016/j.tplants.2021.07.004
Rinn, J. L., and Chang, H. Y. (2012). Genome regulation by long non-coding RNAs. Annu. Rev. Biochem. 81, 145–166. doi:10.1146/annurev-biochem-051410-092902
Rosenlund, I. A., Calin, G. A., Dragomir, M. P., and Knutsen, E. (2021). CRISPR/Cas9 to silence long non-coding RNAs. Methods Mol. Biol. 2348, 175–187. doi:10.1007/978-1-0716-1581-2_12
Sanchez de Groot, N., Armaos, A., Grana-Montes, R., Alriquet, M., Calloni, G., Vabulas, R. M., et al. (2019). RNA structure drives interaction with proteins. Nat. Commun. 10 (1), 3246. doi:10.1038/s41467-019-10923-5
Sigova, A. A., Mullen, A. C., Molinie, B., Gupta, S., Orlando, D. A., Guenther, M. G., et al. (2013). Divergent transcription of long non-coding RNA/mRNA gene pairs in embryonic stem cells. Proc. Natl. Acad. Sci. U. S. A. 110 (8), 2876–2881. doi:10.1073/pnas.1221904110
St Laurent, G., Wahlestedt, C., and Kapranov, P. (2015). The Landscape of long non-coding RNA classification. Trends Genet. 31 (5), 239–251. doi:10.1016/j.tig.2015.03.007
Stojic, L., Lun, A. T. L., Mangei, J., Mascalchi, P., Quarantotti, V., Barr, A. R., et al. (2018). Specificity of RNAi, LNA and CRISPRi as loss-of-function methods in transcriptional analysis. Nucleic Acids Res. 46 (12), 5950–5966. doi:10.1093/nar/gky437
Tay, Y., Rinn, J., and Pandolfi, P. P. (2014). The multilayered complexity of ceRNA crosstalk and competition. Nature 505 (7483), 344–352. doi:10.1038/nature12986
Van Nostrand, E. L., Freese, P., Pratt, G. A., Wang, X., Wei, X., Xiao, R., et al. (2020). A large-scale binding and functional map of human RNA-binding proteins. Nature 583 (7818), 711–719. doi:10.1038/s41586-020-2077-3
Walton, R. T., Christie, K. A., Whittaker, M. N., and Kleinstiver, B. P. (2020). Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants. Science 368 (6488), 290–296. doi:10.1126/science.aba8853
Wang, X. W., Liu, C. X., Chen, L. L., and Zhang, Q. C. (2021). RNA structure probing uncovers RNA structure-dependent biological functions. Nat. Chem. Biol. 17 (7), 755–766. doi:10.1038/s41589-021-00805-7
Xiao, R., Chen, J. Y., Liang, Z., Luo, D., Chen, G., Lu, Z. J., et al. (2019). Pervasive chromatin-RNA binding protein interactions enable RNA-based regulation of transcription. Cell 178 (1), 107–121. doi:10.1016/j.cell.2019.06.001
Yao, R. W., Wang, Y., and Chen, L. L. (2019). Cellular functions of long non-coding RNAs. Nat. Cell Biol. 21 (5), 542–551. doi:10.1038/s41556-019-0311-8
Yin, Y., Lu, J. Y., Zhang, X., Shao, W., Xu, Y., Li, P., et al. (2020). U1 snRNP regulates chromatin retention of non-coding RNAs. Nature 580 (7801), 147–150. doi:10.1038/s41586-020-2105-3
Yoon, J. H., Abdelmohsen, K., and Gorospe, M. (2013). Posttranscriptional gene regulation by long non-coding RNA. J. Mol. Biol. 425 (19), 3723–3730. doi:10.1016/j.jmb.2012.11.024
Zeng, Y., and Cullen, B. R. (2002). RNA interference in human cells is restricted to the cytoplasm. RNA 8 (7), 855–860. doi:10.1017/s1355838202020071
Zhang, B., Arun, G., Mao, Y. S., Lazar, Z., Hung, G., Bhattacharjee, G., et al. (2012). The lncRNA Malat1 is dispensable for mouse development but its transcription plays a cis-regulatory role in the adult. Cell Rep. 2 (1), 111–123. doi:10.1016/j.celrep.2012.06.003
Zhang, J., Liu, L., Xu, T., Zhang, W., Li, J., Rao, N., et al. (2022). Time to infer miRNA sponge modules. Wiley Interdiscip. Rev. RNA 13 (2), e1686. doi:10.1002/wrna.1686
Zhang, S., Chen, Y., Dong, K., Zhao, Y., Wang, Y., Wang, S., et al. (2023). Besst: a novel LncRNA knockout strategy with less genome perturbance. Nucleic Acids Res. 51, e49. doi:10.1093/nar/gkad197
Zhen, S., Hua, L., Liu, Y. H., Sun, X. M., Jiang, M. M., Chen, W., et al. (2017). Inhibition of long non-coding RNA UCA1 by CRISPR/Cas9 attenuated malignant phenotypes of bladder cancer. Oncotarget 8 (6), 9634–9646. doi:10.18632/oncotarget.14176
Keywords: long non-coding RNA, CRISPR-Cas9, functional knockout, genome editing, methodology
Citation: Lyu QR, Zhang S, Zhang Z and Tang Z (2023) Functional knockout of long non-coding RNAs with genome editing. Front. Genet. 14:1242129. doi: 10.3389/fgene.2023.1242129
Received: 18 June 2023; Accepted: 16 August 2023;
Published: 29 August 2023.
Edited by:
Bodhisattwa Banerjee, University of Vermont, United StatesReviewed by:
Malay Chaklader, University of Texas Southwestern Medical Center, United StatesCopyright © 2023 Lyu, Zhang, Zhang and Tang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Qing Rex Lyu, bHZxaW5nQGNpZ2l0LmFjLmNu