Skip to main content

REVIEW article

Front. Genet., 08 December 2021
Sec. Computational Genomics
This article is part of the Research Topic Machine Learning Techniques on Gene Function Prediction Volume II View all 25 articles

Bioinformatics Analysis of Long Non-coding RNA and Related Diseases: An Overview

Yuxin Gong,,,Yuxin Gong1,2,3,4Wen ZhuWen Zhu1Meili Sun
Meili Sun5*Lei Shi
Lei Shi6*
  • 1School of Mathematics and Statistics, Hainan Normal University, Haikou, China
  • 2Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
  • 3Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China
  • 4Key Laboratory of Data Science and Smart Education, Hainan Normal University, Ministry of Education, Haikou, China
  • 5Beidahuang Industry Group General Hospital, Harbin, China
  • 6Department of Spine Surgery, Changzheng Hospital, Naval Medical University, Shanghai, China

Long non-coding RNAs (lncRNAs) are usually located in the nucleus and cytoplasm of cells. The transcripts of lncRNAs are >200 nucleotides in length and do not encode proteins. Compared with small RNAs, lncRNAs have longer sequences, more complex spatial structures, and more diverse and complex mechanisms involved in the regulation of gene expression. LncRNAs are widely involved in the biological processes of cells, and in the occurrence and development of many human diseases. Many studies have shown that lncRNAs can induce the occurrence of diseases, and some lncRNAs undergo specific changes in tumor cells. Research into the roles of lncRNAs has covered the diagnosis of, for example, cardiovascular, cerebrovascular, and central nervous system diseases. The bioinformatics of lncRNAs has gradually become a research hotspot and has led to the discovery of a large number of lncRNAs and associated biological functions, and lncRNA databases and recognition models have been developed. In this review, the research progress of lncRNAs is discussed, and lncRNA-related databases and the mechanisms and modes of action of lncRNAs are described. In addition, disease-related lncRNA methods and the relationships between lncRNAs and human lung adenocarcinoma, rectal cancer, colon cancer, heart disease, and diabetes are discussed. Finally, the significance and existing problems of lncRNA research are considered.

Introduction

A long transcription product was discovered and identified by Okazaki in 2002 (Okazaki et al., 2003) when sequencing a mouse cDNA library. The transcript was called long non-coding RNA (lncRNA). LncRNAs are >200 nucleotides long and similar in structure to messenger RNAs, but they lack an open reading frame. Mainly distributed in the nucleus and cytoplasm of cells, lncRNAs are the transcription products of RNA polymerase II. LncRNAs have been classified based on the lengths of the coded transcripts as lncRNA, long-intergenic non-coding RNA, very long-intergenic non-coding RNA, macroRNA, and promoter-associated long RNA. They have also been classified according to the position of the lncRNA in the genome relative to the target protein-coding gene as 1) antisense lncRNA, which is partially or completely complementary to the transcription product on the opposite strand; 2) enhancer lncRNA, which is produced from the enhancer region of a protein coding gene; 3) bidirectional lncRNA, which shares the same promoter with protein-coding genes, but the transcription direction is opposite; 4) intronic lncRNA, which is produced by introns of genes; and 5) large intergenic non-coding RNA, which is independently transcribed from sequences located between protein-coding genes (Figure 1).

FIGURE 1
www.frontiersin.org

FIGURE 1. Cellular localization and classification of long non-coding RNAs (lncRNA). (A) Cellular distribution of lncRNAs. (B) Positions of lncRNAs in the genome relative to the target protein-coding gene.

Because non-coding RNAs (ncRNAs) do not encode proteins, they were thought to have no biological function and were regarded as “transcriptional noise” for a long time after they were discovered. Although, research on short ncRNAs, including microRNAs, short interfering RNAs, small nucleolar RNAs, and Piwi-interacting RNAs, has flourished, lncRNAs have been neglected because of their long sequences and limitations of the research methods (Liu, 2021).

Initially, lncRNAs were considered to be the noise of genome transcription and a by-product of polymerase II transcription. However, after lncRNAs were found to have conserved secondary structures, spliced forms, and subcellular localizations, it was realized that they may be functional. Indeed, it is now recognized that lncRNAs have functions that are essential for many biological processes, including epigenetic regulation, cell signal transduction, immune response, and cell proliferation and differentiation (Heo et al., 2013; Zhao et al., 2021; Yang et al., 2020; Hu et al., 2021; Hu et al., 20216222), and the abnormal expression of lncRNAs can result in a variety of complex diseases. Moreover, some lncRNAs can act as precursors of some functional short ncRNAs to indirectly participate in the regulation of target genes. LncRNAs regulate gene expression to exert these functions, including cis-regulation and trans-regulation.

According to data from the Encyclopedia of DNA Elements (ENCODE) project (2015), approximately 15,941 lncRNA loci have been identified in the human genome (Jalali et al., 2016). The discovery and research of a large number of lncRNAs have greatly promoted the development of RNA biology research. Analysis of these lncRNAs showed that they had important regulatory functions at the epigenetic, transcription, and post-transcriptional levels (Mercer et al., 2009; Hu et al., 2020; Shen et al., 2021). Until now, research on the relationship between lncRNA and disease has been focused mainly on tumors; however, a small number of differentially expressed lncRNAs associated with obesity, diabetes, hypertension, and other diseases have been found, but the mechanism of action is still unclear. LncRNA research suffers from problems such as few available resources, relatively independent research results, and lack of systematicness (Losko et al., 2016). In-depth analysis of disease-related lncRNAs will help in finding disease biomarkers and provide insights into the diagnosis, treatment, prognosis, and prevention of diseases (Tang et al., 2018; Sun et al., 2021). In this review, bioinformatics approaches for the study of lncRNAs and diseases are summarized, including mainly databases related to lncRNAs, the mechanisms and modes of lncRNA action, methods for identifying disease-related lncRNAs, and the relationships between lncRNAs and human metabolic syndrome, cancer, leukemia, heart disease, and neurodegenerative diseases.

Databases Related to LncRNAs

A large number of biological datasets related to lncRNA have been generated and many lncRNA-oriented databases have been built to store, manage, and integrate comprehensive lncRNA functional information, lncRNA structure and genome mutations, lncRNA expression analysis data, and lncRNA–disease associations. Here, the commonly used lncRNA databases are briefly described.

1) ChIPBase (Zhou et al., 2016) provides comprehensive identification and annotation of lncRNA expression profiles and transcriptional regulation data. The lncRNA expression profiles obtained by RNA sequencing and transcription factor binding sites identified by ChIP-Seq (chromatin immunoprecipitation followed by sequencing) are included in this database (http://rna.sysu.edu.cn/chipbase3/index.php).

2) lncRNAdb (Quek et al., 2015) contains comprehensive annotations of lncRNAs with biological functions, including gene expression, functional evidence, disease-related lncRNAs, pathogen-induced or derived lncRNAs, and sequence information (http://www.lncrnadb.org/).

3) NRED (Dinger et al., 2009) provides the expression information of thousands of lncRNAs of humans and mice from microarray and in situ hybridization data, as well as auxiliary information such as secondary structure evidence, antisense relationships, evolutionary conservation, and genome-related text links (http://jsm-research.imb.uq.edu.au/nred/).

4) NONCODE (Fang et al., 2018) provides comprehensive lncRNA annotations, including expression information and functions predicted by the ncFANs software. NONCODE is widely used for ncRNA research (http://www.noncode.org).

5) LncRNADisease (Bao et al., 2019) contains annotations of disease-related lncRNAs reported in the literature. For the lncRNAs, basic information such as name, chromosome location, species, transcript number, and sequence is provided. For the related diseases, name, literature, and other information are given (http://www.rnanut.net/lncrnadisease/).

6) lncRNASNP2 (Miao et al., 2017) provides resources of single nucleotide polymorphisms (SNPs) in lncRNAs of humans and mice. The database has the following browse and search functions: the influence of SNPs in lncRNAs on their own genes, the binding of microRNAs (miRNAs) in lncRNAs and the influence of SNPs on binding, the mutation and expression of lncRNAs in The Cancer Genome Atlas, variations in lncRNAs in the COSMIC database (large databases of cancer-related somatic mutation sites), and the effect of mutations on lncRNAs (http://bioinfo.life.hust.edu.cn/lncRNASNP#!/).

7) StarBase v2.0 (Li et al., 2013) can search for lncRNA based on miRNA-lncRNA interaction, as well as competitive endogenous RNA regulatory molecules and binding proteins related to the specified lncRNAs (http://starbase.sysu.edu.cn/index.php).

Mechanisms and Modes of Action of LncRNAs Headings

Compared with small RNAs, lncRNAs are longer, have more complex spatial structure, and more diverse and complex mechanisms for expression regulation. LncRNAs are involved in the regulation of development, differentiation, and metabolism. LncRNAs can regulate gene expression at the epigenetic (Mercer and Mattick, 2013), transcriptional (Bonasio and Shiekhattar, 2014), and post-transcriptional (Yoon et al., 2013) levels. They also participate in important regulatory processes such as X chromosome silencing, genome imprinting, chromatin modification, transcription activation and inhibition, and nuclear transport. LncRNAs are also closely related with the occurrence, development, and prevention of human diseases. The mechanisms of lncRNA action discovered so far include X chromosome inactivation and genome imprinting, chromatin modification, cell cycle regulation and apoptosis, mRNA decay, and protein translation regulation (Zou et al., 2019).

The modes of lncRNA action are signal, decoy, guide, and scaffold (Wang and Chang, 2011) as shown in Figure 2. The molecular functions of lncRNAs can be explained based on the mode of action. In the signal mode, lncRNAs participate in gene imprinting processes. For example, the lncRNAs Kcnq1ot1 and Xist both function in the signal mode. The mode of action of Kcnq1ot1 and Xist are similar. Kcnq1ot1 binds to the chromosome and inhibits the expression of Kcnq1 by recruiting H3K9- and H3K27-specific histone methyltransferases and polycomb repressive complex 2 (PRC2) complexes. This function is hereditary (Pandey et al., 2008). In cells with two X chromosomes, one of the X chromosomes is suppressed. Xist regulates the process by which the X chromosome is selectively suppressed and maintains this phenotype to the next generation (Plath et al., 2002).

FIGURE 2
www.frontiersin.org

FIGURE 2. Modes of action long non-coding RNAs. (A) Signal, (B) Decoy, (C) Guide, and (D) Scaffold.

In the decoy mode, lncRNAs bind to proteins that have transcriptional regulatory functions (e.g., transcription factors and chromosome folding proteins), thereby regulating the transcriptional activation and inhibition of related genes by controlling the activity of molecules and signal pathways. LncRNA Gas5 functions in the decoy mode. It binds to the DNA-binding domain of the glucocorticoid receptor through the RNA motif to inhibit the physiological function of the receptor (Kino et al., 2010). LncRNA can also be used as a molecular decoy for miRNAs and splicing factors, inhibiting their functions. MiRNAs can promote the formation of protein complexes, hence playing an important role in gene regulation. LncRNA PTENP1 can inhibit human tumors. In the decoy mode of action, PTENP1 binds to a group of miRNAs that act on the PTEN 3′ untranslated region, thereby regulating the expression of PTEN (Poliseno et al., 2010).

In the guide mode, lncRNAs bind to proteins to guide the protein complex to a specific DNA sequence, thereby regulating the transcription of downstream molecules. The guiding can be either cis or trans. One cis-regulatory mechanism of lncRNA involves the inactive center of the X chromosome, which controls the silence of the maternal X chromosome. A trans-regulatory mechanism included reducing the expression of tumor-associated lncRNA HOTAIR and decreasing cell invasiveness.

In the scaffold mode, lncRNAs simultaneously bind multiple related transcription factors to provide a platform for interaction. For example, lncRNA HOTAIR splices and bridges between the PRC2 and LSD1 complexes. The 5′ end of HOTAIR combines with the PRC2 complex (acting on H3 and H27 to methylate them) to promote gene expression (Rinn et al., 2007), whereas the 3′ end of HOTAIR combines with LSD1 (acting on H3K4 to demethylate it) to antagonize gene expression activation (Tsai et al., 2010), thereby inducing the interaction between the PRC2 and LSD1 complexes.

Methods to Identify Disease-Related LncRNAs

LncRNAs by far outnumber protein-coding genes, and, unlike protein-coding genes, lncRNAs are usually not conserved in sequence fragments and secondary structures, which make lncRNA functional prediction difficult. Two main approaches have been used to identify disease-related lncRNAs, methods based on biological experiments and methods based on computational predictions. The former generally produces more reliable results, but the cost is high and the efficiency is low; the latter uses multi-source biological data, such as disease-related genomes, transcriptomes, and proteomes, to predict disease-related lncRNAs. Bioinformatics methods have been applied to predict disease-related lncRNAs based on the analysis of multi-source biological data (Cao et al., 2021).

Identification of Disease-Related lncRNA by Biological Experiments

Large-scale biological experimental research is often limited by ethical factors that govern the collection of experimental samples. However, the influence of interference factors on the experimental results can often be largely controlled, and therefore the results are likely to be objective and highly reliable. Two examples of lncRNA research using biological experiments are briefly described.

Nakagawa et al. (Nakagawa et al., 2014) found that approximately 50% of lncRNA Neat1 knockout mice with abnormal ovulation did not become pregnant, and this outcome seemed to be random. Subsequently, corpus luteum dysfunction and the accompanying low progesterone were found to contribute most to the decline in fertility. Unlike the weak expression of Neat1 observed in most adult tissues, in the infertile Neat1 knockout mice, Neat1 was highly expressed in the corpus luteum and the formation of the corpus luteum was severely impaired. These results indicated that Neat1 may be closely related to the formation of the corpus luteum and some forms of infertility in humans.

Zhang et al. (Zhang et al., 2018) screened and identified a new type of lncRNA, HOXC-AS3, using publicly available gastric cancer expression profile data and integrated bioinformatics analysis. They found that the expression of HOXC-AS3 was highly up-regulated in gastric cancer tissues and was related to clinicopathological factors such as histological grade, depth of tumor invasion, lymph node metastasis, and poor prognosis. They performed chromatin immunoprecipitation assays to explore the mechanisms involved in the high HOXC-AS3 expression, and found that HOXC-AS3 was partially activated by H3K4me3 and H3K27ac in cells and tissues. Overexpression and knockout of HOXC-AS3 were used to detect cell apoptosis and proliferation. They found that overexpression of HOXC-AS3 promoted the proliferation of cancer cells, and knockout of HOXC-AS3 induced apoptosis of cancer cells. To further explore the mechanism of action of HOXC-AS3, the transcription factor YBX1 was selected, and three independent RNA pull-down mass spectrometry analyses were performed. The results showed that HOXC-AS3 interacted with YBX1. This result combined with the results of the immunoprecipitation assays, confirmed that YBX1 was involved in HOXC-AS3-mediated gene transcription regulation in gastric cancer.

Clearly, traditional biological experiments are not only time-consuming but also expensive. Computational models are less time-consuming and less expensive, therefore they have attracted more and more attention as a solution that can predict lncRNA functions on a large scale. Models can be used to predict the possible functions of lncRNAs according to related priorities, and the predictions can be verified experimentally. This process effectively promotes the functional recognition of lncRNAs.

Identification of Disease-Related LncRNAs by Computational Prediction

Because biological experiments are costly and time-consuming, the use of bioinformatics calculations to predict disease-related lncRNAs has become the mainstream. In recent years, many lncRNA-disease association prediction (LDAP) models have been proposed, including models based on biological networks, models that do not rely on known lncRNA–disease associations, and models based on machine learning algorithms. The three types of LDAP models are briefly described.

LDAP Models Based on Biological Networks

These models integrate biological networks such as the disease similarity network, lncRNA similarity network, and lncRNA–disease association network to construct an LDAP model. In 2014, Sun et al. (Sun et al., 2014) proposed an LDAP model based on the functionally similar network of lncRNAs and random walks with restart. In 2015, Chen et al. (Weng et al., 2015) developed the KATZLDA prediction model by fusing lncRNA–disease association, lncRNA expression profile, lncRNA functional similarity, and disease semantic similarity data. In 2017, Yu et al. (Yu et al., 2017) developed the BRWLDA prediction model based on double random walks, and Gu et al. (Gu et al., 2017) developed a GRWLDA prediction model based on global network random walk. In 2018, Ping et al. (Ping et al., 2019) proposed a prediction model based on the known lncRNA–disease association network to infer potential lncRNA–disease associations. In 2019, Fan et al. (Fan et al., 2019) proposed an LDAP model based on multiple heterogeneous information networks and random walks with restart. Xie et al. (Xie et al., 2019) proposed an SFK-LDA prediction model based on similarity nuclear fusion. In 2020, Zhou et al. (Zhou et al., 2021) built a heterogeneous network by integrating various associations between diseases and miRNAs, lncRNAs, proteins, and drugs, and trained a LDAP model with the rotating forest classifier, and Zhang et al. (Zhang et al., 2020) proposed an LDAP model based on network feature similarity and gradient boosting. In 2021, Liu et al. (Liu et al., 2021) proposed an LDAP model based on the weighted graph regularized collaborative matrix factorization.

LDAP Models That do not Rely on Known LncRNA–Disease Associations

In these models, the expression and regulatory relationship between disease-related genes or miRNAs and lncRNAs are used to predict potential lncRNA–disease associations. In 2014, Liu et al. (Zhao et al., 2015) developed the first LDAP model that did not rely on known lncRNA–disease associations by integrating lncRNA expression profiles, gene expression profiles, and disease-related gene data. In 2015, Chen et al. (Chen, 2015) developed an LDAP model HGLDA based on hypergeometric distribution by integrating miRNA–disease associations and ncRNA–miRNA interactions. In 2016, Cheng et al. (Cheng et al., 2016) proposed the IntNetLncSim computing framework, which inferred the functional similarity of lncRNAs and predicted new lncRNA–disease associations by modeling the information flow in an integrated network that contained lncRNA transcription and post-transcription information. In 2017, Wang et al. (Wang et al., 2017) mapped lncRNAs to their functional genomic context based on the theory of competing endogenous RNAs to predict new lncRNA–disease associations. Fu et al. (Fu et al., 2018) proposed a matrix decomposition-based LDAP model MFLDA, which decomposed the data matrix of heterogeneous data sources into low-rank matrices through matrix decomposition to explore and use their internal and shared structure. In 2018, Ding et al. (Ding et al., 2018) proposed an LDAP model based on a lncRNA–disease–gene network that integrated gene–disease and lncRNA–disease associations. In 2020, Xiao et al. (Xiao et al., 2020) proposed an LDAP model that used both direct and indirect features of lncRNA–disease relationship pairs, and Tang et al. proposed a hierarchical extended LDAP model based on a Boolean matrix (Tang et al., 2020).

LDAP Models Based on Machine Learning Algorithms

These models integrate biological data and use various machine learning algorithms to predict disease-related lncRNAs. In 2013, Chen et al. (Chen and Yan, 2013) developed a semi-supervised learning framework LRLSLDA based on Laplace regularization least squares by integrating lncRNA expression profiles and known lncRNA–disease associations. In 2015, Liu et al. (Liu et al., 2015) developed an LDAP model based on the naive Bayes classifier to identify lncRNAs related to cancer by integrating genome, regulatory factors, and transcriptome data. In 2017, Lan et al. (Lan et al., 2017) proposed an LDAP model based on support vector machines. In 2018, Yu et al. (Yu et al., 2018) proposed the NBCLDA model based on the naive Bayes classifier. In 2019, Guo et al. (Guo et al., 2019) proposed two LDAP models, one based on rotating forest and neural network and another based on a random forest classifier. Sheng et al. (Sheng et al., 2021) proposed a series of LDAP models based on convolutional neural networks, including CNNLDA, as well as an attention multi-level representation coding model based on convolution and variance autoencoders. In 2020, Zeng et al. (Zeng et al., 2020) proposed SDLDA, an LDAP model based on singular value decomposition and deep learning. Fan et al. (Fan et al., 2020) proposed IDSSIM, a calculation model of lncRNA functional similarity based on improved disease semantic similarity. Tan et al. (Tan et al., 2020) proposed a multi-view consensus graph learning model to predict lncRNA–disease association. Wei et al. (Wei et al., 2021) proposed a convolutional neural network model fused with multiple biological characteristics to predict lncRNA–disease association.

LncRNAs and Related Diseases

LncRNAs induce the occurrence of disease by regulating disease-related protein coding genes, thus leading to improper expression of lncRNAs or altering the chromatin that contain disease-related gene polymorphisms and non-coding regions. Therefore, the expression of lncRNAs is important in the diagnosis, occurrence, development, and treatment of diseases. In recent years, the associations of lncRNAs with cancer, leukemia, cardiovascular and cerebrovascular diseases, diabetes, and other diseases have been a focus of study. Future clinical applications of disease-related lncRNAs are very likely.

The roles of lncRNAs in cancer: Li et al. (Li et al., 2016) found that up-regulation of lncRNA MALAT1 was related to tumor size and lymph node metastasis, and to the shorter overall survival of patients with lung adenocarcinoma. In vivo and in vitro experiments showed that MALAT1 promoted epithelial–mesenchymal transition and metastasis of lung adenocarcinoma cells. Numerous lncRNAs have been found to encode small proteins or micropeptides, some of which play roles in diseases. Meng et al. (Meng et al., 2020) found that lncRNA LOC90024 encodes a splicing regulatory small protein that induces the formation of Sp4 transcription factor splice variants, thereby promoting the occurrence and development of advanced rectal cancer tumors. Zhu et al. (Zhu et al., 2020) found that lncRNA LINC00266-1 encodes an RNA-binding regulatory peptide that, when highly expressed in patients with colon cancer, leads to a poor prognosis. The oncogenic peptide encoded by LINC00266-1 exerts its carcinogenic function by enhancing the recognition of N6-methyladenosine of RNA.

The roles of lncRNAs in leukemia: Garzon et al. (Garzon et al., 2014) developed a prognostic scoring system to determine if lncRNAs were associated with cytogenetically normal acute myeloid leukemia (CN-AML) clinical features and recurrent mutations in patients older than 60 years. First, 48 lncRNAs most relevant to prognosis were identified. Then, patients with CN-AML were divided into two groups, those with good prognostic scores and those with poor prognostic scores, based on the 48 lncRNAs. The prognostic scores were verified in an independent matched group of patients with CN-AML who received the same treatment. The comparative analysis showed that the lncRNA expression profile was closely related to the recurrent mutation and expression of AML, implying that some of 48 lncRNAs may have a functional role in the development of leukemia. These lncRNAs are good candidates as biomarkers for the prognosis of AML.

The roles of lncRNAs in heart disease: Han et al. (Han et al., 2014) developed a new lncRNA–chromatin mechanism to treat heart failure. A lncRNA transcript Mhrt779 from myosin heavy chain 7 loci was found to be specifically expressed in cardiomyocytes and to gradually increase with the development of embryos, especially after birth. Furthermore, the Brg1–Hdac–Parp chromatin inhibitory complex was activated by pathological stress and lncRNA Mhrt transcription was inhibited in the heart, thereby protecting the heart from hypertrophy and failure. These results show that there is a conserved lncRNA mechanism in human cardiomyopathy, and also establish a new paradigm for lncRNA–chromatin interactions.

The roles of lncRNAs in neurodegenerative diseases: Alzheimer’s disease is a progressively developing neurodegenerative disease with insidious onset that is believed to be caused by a large amount of amyloid β-protein (Aβ) expression, which leads to pathological changes in patients. Aβ is hydrolyzed from β-amyloid precursor protein, and excessive Aβ deposition can cause degenerative diseases related to neurons. BACE1 is a key enzyme in the production of Aβ. Faghihi et al. (Faghihi and Wahlestedt, 2009) found that lncRNA BACE1-AS (antisense transcript of BACE1) increased the stability of BACE1 mRNA through a mechanism that involved the formation of RNA duplexes, and this in turn facilitated the accumulation of Aβ in patients with Alzheimer’s disease.

The roles of lncRNAs in diabetes: LncRNAs in human pancreatic β-cells exhibit dynamic regulation during differentiation or when glucose concentrations change. Akerman et al. (Akerman et al., 2017) studied the functions of β-cell-specific lncRNAs and transcription factors using transcript knockdown and co-expression network analysis strategies. They found that lncRNAs and transcription factors acted synergistically to regulate the specific transcription network of β-cells. LncRNA PLUTO affected local three-dimensional chromatin structure and transcription of PDX1, which encodes a key β-cell transcription factor. PLUTO and PDX1 were both down-regulated in islets from donors with type 2 diabetes or impaired glucose tolerance. These results indicate the role of lncRNAs in β-cell gene regulation and diabetes.

Challenges and Research Prospects

Compared with protein-coding sequences and small RNA molecules, lncRNA-related research is insufficient and there are many problems still to be solved. The major ones are listed here.

1) No standardized naming of lncRNAs. Until now, lncRNAs have been named according to their functions, structural characteristics, or modes of action. Therefore, it is difficult to understand their roles and functions from the name.

2) Unannotated and unbalanced lncRNA data. Compared with other ncRNA databases, the annotation information in lncRNA databases is insufficient, and disease-related lncRNAs that are included in multi-source data have problems such as serious imbalances of information.

3) Lack of lncRNA-specific technologies. Because of the diverse types and functions of lncRNAs, more effective methods are needed for systematically studying the biological functions of lncRNAs and for identifying disease-related lncRNAs. The process of combining multi-data to predict disease-related lncRNAs has problems, such as high-dimensional feature space, high noise, and redundant feature interference, that seriously affect the accuracy of the predictions.

4) Research fields need to be expanded. Current research on lncRNAs has focused mainly on tumors, nerves, and development. However, the genetic characteristics of cancer-related lncRNAs and the mechanism of action of complex diseases are still unclear. More areas of disease research related to lncRNAs need to be developed.

Despite these problems, lncRNA research has continuously advanced the understanding of lncRNAs. lncRNAs not only exert their biological functions in a variety of mechanisms in different organisms, but their dysfunction can lead to the occurrence and development of many diseases. Undoubtedly, new technologies and new methods will be developed for use in lncRNA bioinformatics research. Such developments will help to further analyze the functions and regulation mechanisms of lncRNAs, as well as the pathological mechanisms associate with the development of diseases.

Discussion

LncRNAs are closely related to cell cycle and differentiation, aging and human diseases. Therefore, the research on their functions and mechanisms is also constantly deepening. This review summarized the following key points:1) according to the four modes of action of lncRNA, the corresponding molecular functions were described respectively. 2) The identification methods of lncRNA related to diseases were summarized into two parts: lncRNA identification research based on biological experiments and computational prediction. 3) The relationship between lncRNA and various human diseases was expounded.

Although the research technology of lncRNA is constantly developing, there are still a number of limitations:1) The low abundance of lncRNAs and lack of annotation information lead to inaccurate positioning. 2) The data set of lncRNAs in the database is not perfect. 3) The types of diseases associated with lncRNA are limited. Therefore, in the future, researchers need to continue to dig out the functional information of unknown lncRNAs and develop new lncRNA recognition models, which will help to enhance the scientific understanding of more human diseases.

Author Contributions

YG and WZ designed the study. WZ and LS collected and analyzed the literatures. YG and MS analyzed the literatures and made the figures. All authors participated in writing the draft and approved the final manuscript.

Funding

The work was supported by the National Key R&D Program of China (No.2020YFB2104400), and the Special Science Foundation of Quzhou (2021D004).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We thank Margaret Biswas, PhD, from Liwen Bianji (Edanz) (www.liwenbianji.cn/), for editing the English text of a draft of this manuscript.

References

Akerman, I., Tu, Z., Beucher, A., Rolando, D. M. Y., Sauty-Colace, C., Benazra, M., et al. (2017). Human Pancreatic β Cell lncRNAs Control Cell-specific Regulatory Networks. Cel Metab. 25, 400–411. doi:10.1016/j.cmet.2016.11.016

PubMed Abstract | CrossRef Full Text | Google Scholar

Bao, Z., Yang, Z., Huang, Z., Zhou, Y., Cui, Q., and Dong, D. (2019). LncRNADisease 2.0: an Updated Database of Long Non-coding RNA-Associated Diseases. Nucleic Acids Res. 47, D1034–d1037. doi:10.1093/nar/gky905

PubMed Abstract | CrossRef Full Text | Google Scholar

Bonasio, R., and Shiekhattar, R. (2014). Regulation of Transcription by Long Noncoding RNAs. Annu. Rev. Genet. 48, 433–455. doi:10.1146/annurev-genet-120213-092323

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao, C., Wang, J., Kwok, D., Cui, F., Zhang, Z., Zhao, D., et al. (2021). webTWAS: a Resource for Disease Candidate Susceptibility Genes Identified by Transcriptome-wide Association Study. Nucleic Acids Res., gkab957. doi:10.1093/nar/gkab957

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, X. (2015). Predicting lncRNA-Disease Associations and Constructing lncRNA Functional Similarity Network Based on the Information of miRNA. Sci. Rep. 5, 13186. doi:10.1038/srep13186

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, X., and Yan, G.-Y. (2013). Novel Human lncRNA-Disease Association Inference Based on lncRNA Expression Profiles. Bioinformatics 29, 2617–2624. doi:10.1093/bioinformatics/btt426

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, L., Shi, H., Wang, Z., Hu, Y., Yang, H., Zhou, C., et al. (2016). IntNetLncSim: an Integrative Network Analysis Method to Infer Human lncRNA Functional Similarity. Oncotarget 7, 47864–47874. doi:10.18632/oncotarget.10012

PubMed Abstract | CrossRef Full Text | Google Scholar

Ding, L., Wang, M., Sun, D., and Li, A. (2018). TPGLDA: Novel Prediction of Associations between lncRNAs and Diseases via lncRNA-Disease-Gene Tripartite Graph. Sci. Rep. 8, 1065. doi:10.1038/s41598-018-19357-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Dinger, M. E., Pang, K. C., Mercer, T. R., Crowe, M. L., Grimmond, S. M., and Mattick, J. S. (2009). NRED: a Database of Long Noncoding RNA Expression. Nucleic Acids Res. 37, D122–D126. doi:10.1093/nar/gkn617

PubMed Abstract | CrossRef Full Text | Google Scholar

Faghihi, M. A., and Wahlestedt, C. (2009). Regulatory Roles of Natural Antisense Transcripts. Nat. Rev. Mol. Cel Biol 10, 637–643. doi:10.1038/nrm2738

CrossRef Full Text | Google Scholar

Fan, W., Shang, J., Li, F., Sun, Y., Yuan, S., and Liu, J.-X. (2020). IDSSIM: an lncRNA Functional Similarity Calculation Model Based on an Improved Disease Semantic Similarity Method. BMC Bioinformatics 21, 339. doi:10.1186/s12859-020-03699-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Fan, X.-N., Zhang, S.-W., Zhang, S.-Y., Zhu, K., and Lu, S. (2019). Prediction of lncRNA-Disease Associations by Integrating Diverse Heterogeneous Information Sources with RWR Algorithm and Positive Pointwise Mutual Information. BMC Bioinformatics 20, 87. doi:10.1186/s12859-019-2675-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Fang, S., Zhang, L., Guo, J., Niu, Y., Wu, Y., Li, H., et al. (2018). NONCODEV5: a Comprehensive Annotation Database for Long Non-coding RNAs. Nucleic Acids Res. 46, D308–d314. doi:10.1093/nar/gkx1107

PubMed Abstract | CrossRef Full Text | Google Scholar

Fu, G., Wang, J., Domeniconi, C., and Yu, G. (2018). Matrix Factorization-Based Data Fusion for the Prediction of lncRNA-Disease Associations. Bioinformatics 34, 1529–1537. doi:10.1093/bioinformatics/btx794

PubMed Abstract | CrossRef Full Text | Google Scholar

Garzon, R., Volinia, S., Papaioannou, D., Nicolet, D., Kohlschmidt, J., Yan, P. S., et al. (2014). Expression and Prognostic Impact of lncRNAs in Acute Myeloid Leukemia. Proc. Natl. Acad. Sci. USA 111, 18679–18684. doi:10.1073/pnas.1422050112

PubMed Abstract | CrossRef Full Text | Google Scholar

Gu, C., Liao, B., Li, X., Cai, L., Li, Z., Li, K., et al. (2017). Global Network Random Walk for Predicting Potential Human lncRNA-Disease Associations. Sci. Rep. 7, 12442. doi:10.1038/s41598-017-12763-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, Z.-H., You, Z.-H., Wang, Y.-B., Yi, H.-C., and Chen, Z.-H. (2019). A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest. iScience 19, 786–795. doi:10.1016/j.isci.2019.08.030

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, P., Li, W., Lin, C.-H., Yang, J., Shang, C., Nurnberg, S. T., et al. (2014). A Long Noncoding RNA Protects the Heart from Pathological Hypertrophy. Nature 514, 102–106. doi:10.1038/nature13596

PubMed Abstract | CrossRef Full Text | Google Scholar

Heo, J. B., Lee, Y.-S., and Sung, S. (2013). Epigenetic Regulation by Long Noncoding RNAs in Plants. Chromosome Res. 21, 685–693. doi:10.1007/s10577-013-9392-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, Y., Qiu, S., and Cheng, L. (2021). Integration of Multiple-Omics Data to Analyze the Population-specific Differences for Coronary Artery Disease. Comput. Math. Methods Med. 2021, 1–11. doi:10.1155/2021/7036592

CrossRef Full Text | Google Scholar

Hu, Y., Sun, J.-y., Zhang, Y., Zhang, H., Gao, S., Wang, T., et al. (202162220). rs1990622 Variant Associates with Alzheimer's Disease and Regulates TMEM106B Expression in Human Brain Tissues. BMC Med. 19, 11. doi:10.1186/s12916-020-01883-5

CrossRef Full Text | Google Scholar

Hu, Y., Zhang, H., Liu, B., Gao, S., Wang, T., Han, Z., et al. (2020). rs34331204 Regulates TSPAN13 Expression and Contributes to Alzheimer's Disease with Sex Differences. Brain 143, e95. doi:10.1093/brain/awaa302

PubMed Abstract | CrossRef Full Text | Google Scholar

Jalali, S., Gandhi, S., and Scaria, V. (2016). Navigating the Dynamic Landscape of Long Noncoding RNA and Protein-Coding Gene Annotations in GENCODE. Hum. Genomics 10, 35. doi:10.1186/s40246-016-0090-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Kino, T., Hurt, D. E., Ichijo, T., Nader, N., and Chrousos, G. P. (2010). Noncoding RNA Gas5 Is a Growth Arrest- and Starvation-Associated Repressor of the Glucocorticoid Receptor. Sci. Signal. 3, ra8. doi:10.1126/scisignal.2000568

PubMed Abstract | CrossRef Full Text | Google Scholar

Lan, W., Li, M., Zhao, K., Liu, J., Wu, F.-X., Pan, Y., et al. (2017). LDAP: a Web Server for lncRNA-Disease Association Prediction. Bioinformatics 33, btw639–460. doi:10.1093/bioinformatics/btw639

CrossRef Full Text | Google Scholar

Li, J., Wang, J., Chen, Y., Li, S., Jin, M., Wang, H., et al. (2016). LncRNA MALAT1 Exerts Oncogenic Functions in Lung Adenocarcinoma by Targeting miR-204. Am. J. Cancer Res. 6, 1099–1107.

Google Scholar

Li, J.-H., Liu, S., Zhou, H., Qu, L.-H., and Yang, J.-H. (2013). starBase v2.0: Decoding miRNA-ceRNA, miRNA-ncRNA and Protein-RNA Interaction Networks from Large-Scale CLIP-Seq Data. Nucl. Acids Res. 42, D92–D97. doi:10.1093/nar/gkt1248

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J.-X., Cui, Z., Gao, Y.-L., and Kong, X.-Z. (2021). WGRCMF: A Weighted Graph Regularized Collaborative Matrix Factorization Method for Predicting Novel LncRNA-Disease Associations. IEEE J. Biomed. Health Inform. 25, 257–265. doi:10.1109/jbhi.2020.2985703

CrossRef Full Text | Google Scholar

Liu, Y., Zhang, R., Qiu, F., Li, K., Zhou, Y., Shang, D., et al. (2015). Construction of a lncRNA-PCG Bipartite Network and Identification of Cancer-Related lncRNAs: a Case Study in Prostate Cancer. Mol. Biosyst. 11, 384–393. doi:10.1039/c4mb00439f

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Z.-P. (2021). Predicting lncRNA-Protein Interactions by Machine Learning Methods: A Review. Cbio 15, 831–840. doi:10.2174/1574893615666200224095925

CrossRef Full Text | Google Scholar

Losko, M., Kotlinowski, J., and Jura, J. (2016). Long Noncoding RNAs in Metabolic Syndrome Related Disorders. Mediators Inflamm. 2016, 1–12. doi:10.1155/2016/5365209

PubMed Abstract | CrossRef Full Text | Google Scholar

Meng, N., Chen, M., Chen, D., Chen, X. H., Wang, J. Z., Zhu, S., et al. (2020). Small Protein Hidden in lncRNA LOC90024 Promotes "Cancerous" RNA Splicing and Tumorigenesis. Adv. Sci. 7, 1903233. doi:10.1002/advs.201903233

PubMed Abstract | CrossRef Full Text | Google Scholar

Mercer, T. R., Dinger, M. E., and Mattick, J. S. (2009). Long Non-coding RNAs: Insights into Functions. Nat. Rev. Genet. 10, 155–159. doi:10.1038/nrg2521

PubMed Abstract | CrossRef Full Text | Google Scholar

Mercer, T. R., and Mattick, J. S. (2013). Structure and Function of Long Noncoding RNAs in Epigenetic Regulation. Nat. Struct. Mol. Biol. 20, 300–307. doi:10.1038/nsmb.2480

PubMed Abstract | CrossRef Full Text | Google Scholar

Miao, Y.-R., Liu, W., Zhang, Q., and Guo, A.-Y. (2017). lncRNASNP2: an Updated Database of Functional SNPs and Mutations in Human and Mouse lncRNAs. Nucleic Acids Res. 46, D276–D280. doi:10.1093/nar/gkx1004

PubMed Abstract | CrossRef Full Text | Google Scholar

Nakagawa, S., Shimada, M., Yanaka, K., Mito, M., Arai, T., Takahashi, E., et al. (2014). The lncRNA Neat1 Is Required for Corpus Luteum Formation and the Establishment of Pregnancy in a Subpopulation of Mice. Development 141, 4618–4627. doi:10.1242/dev.110544

PubMed Abstract | CrossRef Full Text | Google Scholar

Okazaki, N., Kikuno, R., Ohara, R., Inamoto, S., Aizawa, H., Yuasa, S., et al. (2003). Prediction of the Coding Sequences of Mouse Homologues of KIAA Gene: II. The Complete Nucleotide Sequences of 400 Mouse KIAA-Homologous cDNAs Identified by Screening of Terminal Sequences of cDNA Clones Randomly Sampled from Size-Fractionated Libraries. DNA Res. 10, 35–48. doi:10.1093/dnares/10.1.35

PubMed Abstract | CrossRef Full Text | Google Scholar

Pandey, R. R., Mondal, T., Mohammad, F., Enroth, S., Redrup, L., Komorowski, J., et al. (2008). Kcnq1ot1 Antisense Noncoding RNA Mediates Lineage-specific Transcriptional Silencing through Chromatin-Level Regulation. Mol. Cel 32, 232–246. doi:10.1016/j.molcel.2008.08.022

PubMed Abstract | CrossRef Full Text | Google Scholar

Ping, P., Wang, L., Kuang, L., Ye, S., Iqbal, M. F. B., and Pei, T. (2019). A Novel Method for LncRNA-Disease Association Prediction Based on an lncRNA-Disease Association Network. Ieee/acm Trans. Comput. Biol. Bioinf. 16, 688–693. doi:10.1109/tcbb.2018.2827373

PubMed Abstract | CrossRef Full Text | Google Scholar

Plath, K., Mlynarczyk-Evans, S., Nusinow, D. A., and Panning, B. (2002). Xist RNA and the Mechanism of X Chromosome Inactivation. Annu. Rev. Genet. 36, 233–278. doi:10.1146/annurev.genet.36.042902.092433

PubMed Abstract | CrossRef Full Text | Google Scholar

Poliseno, L., Salmena, L., Zhang, J., Carver, B., Haveman, W. J., and Pandolfi, P. P. (2010). A Coding-independent Function of Gene and Pseudogene mRNAs Regulates Tumour Biology. Nature 465, 1033–1038. doi:10.1038/nature09144

PubMed Abstract | CrossRef Full Text | Google Scholar

Quek, X. C., Thomson, D. W., Maag, J. L. V., Bartonicek, N., Signal, B., Clark, M. B., et al. (2015). lncRNAdb v2.0: Expanding the Reference Database for Functional Long Noncoding RNAs. Nucleic Acids Res. 43, D168–D173. doi:10.1093/nar/gku988

PubMed Abstract | CrossRef Full Text | Google Scholar

Rinn, J. L., Kertesz, M., Wang, J. K., Squazzo, S. L., Xu, X., Brugmann, S. A., et al. (2007). Functional Demarcation of Active and Silent Chromatin Domains in Human HOX Loci by Noncoding RNAs. Cell 129, 1311–1323. doi:10.1016/j.cell.2007.05.022

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, L., Shen, G., Lu, X., Ding, G., and Hu, X. (2021). Co-expression Network Analysis Revealing the Potential Regulatory Roles of LncRNAs in Atrial Fibrillation. Cbio 15, 980–987. doi:10.2174/1574893614666191210142141

CrossRef Full Text | Google Scholar

Sheng, N., Cui, H., Zhang, T., and Xuan, P. (2021). Attentional Multi-Level Representation Encoding Based on Convolutional and Variance Autoencoders for lncRNA-Disease Association Prediction. Brief Bioinform 22, 22. doi:10.1093/bib/bbaa067

CrossRef Full Text | Google Scholar

Sun, J., Shi, H., Wang, Z., Zhang, C., Liu, L., Wang, L., et al. (2014). Inferring Novel lncRNA-Disease Associations Based on a Random Walk Model of a lncRNA Functional Similarity Network. Mol. Biosyst. 10, 2074–2081. doi:10.1039/c3mb70608g

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, Y., Chen, L., Zhang, Y., Zhang, J., and Tiwari, S. R. (2021). Genome-wide Identification of Differently Expressed lncRNAs, mRNAs, and circRNAs in Patients with Osteoarthritis. Cbio 15, 1222–1230. doi:10.2174/1574893615999200706002907

CrossRef Full Text | Google Scholar

Tan, H., Sun, Q., Li, G., Xiao, Q., Ding, P., Luo, J., et al. (2020). Multiview Consensus Graph Learning for lncRNA-Disease Association Prediction. Front. Genet. 11, 89. doi:10.3389/fgene.2020.00089

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, L., Liang, Y., Jin, X., Liu, L., and Zhou, W. (2020). Hierarchical Extension Based on the Boolean Matrix for LncRNA-Disease Association Prediction. Cmm 20, 452–460. doi:10.2174/1566524019666191119104212

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, W., Wan, S., Yang, Z., Teschendorff, A. E., and Zou, Q. (2018). Tumor Origin Detection with Tissue-specific miRNA and DNA Methylation Markers. Bioinformatics 34, 398–406. doi:10.1093/bioinformatics/btx622

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsai, M.-C., Manor, O., Wan, Y., Mosammaparast, N., Wang, J. K., Lan, F., et al. (2010). Long Noncoding RNA as Modular Scaffold of Histone Modification Complexes. Science 329, 689–693. doi:10.1126/science.1192002

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, K. C., and Chang, H. Y. (2011). Molecular Mechanisms of Long Noncoding RNAs. Mol. Cel 43, 904–914. doi:10.1016/j.molcel.2011.08.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, P., Guo, Q., Gao, Y., Zhi, H., Zhang, Y., Liu, Y., et al. (2017). Improved Method for Prioritization of Disease Associated lncRNAs Based on ceRNA Theory and Functional Genomics Data. Oncotarget 8, 4642–4655. doi:10.18632/oncotarget.13964

PubMed Abstract | CrossRef Full Text | Google Scholar

Wei, H., Liao, Q., and Liu, B. (2021). iLncRNAdis-FB: Identify lncRNA-Disease Associations by Fusing Biological Feature Blocks through Deep Neural Network. Ieee/acm Trans. Comput. Biol. Bioinf. 18, 1946–1957. doi:10.1109/tcbb.2020.2964221

CrossRef Full Text | Google Scholar

Weng, W., Lin, C., Zhu, S., Ma, Y., Zou, Q., and Chen, Z. (2015). Community Discovery Algorithm with the Label Propagation Based on the Initial Division. J. Xiamen Univ. Nat. Sci. 54, 107–113.

Google Scholar

Xiao, Y., Xiao, Z., Feng, X., Chen, Z., Kuang, L., and Wang, L. (2020). A Novel Computational Model for Predicting Potential LncRNA-Disease Associations Based on Both Direct and Indirect Features of LncRNA-Disease Pairs. BMC Bioinformatics 21, 555. doi:10.1186/s12859-020-03906-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Xie, G., Meng, T., Luo, Y., and Liu, Z. (2019). SKF-LDA: Similarity Kernel Fusion for Predicting lncRNA-Disease Association. Mol. Ther. - Nucleic Acids 18, 45–55. doi:10.1016/j.omtn.2019.07.022

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Q., Wu, J., Zhao, J., Xu, T., Han, P., and Song, X. (2020). The Expression Profiles of lncRNAs and Their Regulatory Network during Smek1/2 Knockout Mouse Neural Stem Cells Differentiation. Cbio 15, 77–88. doi:10.2174/1574893614666190308160507

CrossRef Full Text | Google Scholar

Yoon, J.-H., Abdelmohsen, K., and Gorospe, M. (2013). Posttranscriptional Gene Regulation by Long Noncoding RNA. J. Mol. Biol. 425, 3723–3730. doi:10.1016/j.jmb.2012.11.024

CrossRef Full Text | Google Scholar

Yu, G., Fu, G., Lu, C., Ren, Y., and Wang, J. (2017). BRWLDA: Bi-random Walks for Predicting lncRNA-Disease Associations. Oncotarget 8, 60429–60446. doi:10.18632/oncotarget.19588

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, J., Ping, P., Wang, L., Kuang, L., Li, X., and Wu, Z. (2018). A Novel Probability Model for LncRNA-Disease Association Prediction Based on the Naïve Bayesian Classifier. Genes 9, 345. doi:10.3390/genes9070345

PubMed Abstract | CrossRef Full Text | Google Scholar

Zeng, M., Lu, C., Zhang, F., Li, Y., Wu, F.-X., Li, Y., et al. (2020). SDLDA: lncRNA-Disease Association Prediction Based on Singular Value Decomposition and Deep Learning. Methods 179, 73–80. doi:10.1016/j.ymeth.2020.05.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, E., He, X., Zhang, C., Su, J., Lu, X., Si, X., et al. (2018). A Novel Long Noncoding RNA HOXC-AS3 Mediates Tumorigenesis of Gastric Cancer by Binding to YBX1. Genome Biol. 19, 154. doi:10.1186/s13059-018-1523-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y., Ye, F., Xiong, D., and Gao, X. (2020). LDNFSGB: Prediction of Long Non-coding Rna and Disease Association Using Network Feature Similarity and Gradient Boosting. BMC Bioinformatics 21, 377. doi:10.1186/s12859-020-03721-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, X., Zou, Q., Liu, B., and Liu, X. (2015). Exploratory Predicting Protein Folding Model with Random Forest and Hybrid Features. Cp 11, 289–299. doi:10.2174/157016461104150121115154

CrossRef Full Text | Google Scholar

Zhao, Z., Zhang, C., Li, M., Yu, X., Liu, H., Chen, Q., et al. (2021). Integrative Analysis of miRNA-Mediated Competing Endogenous RNA Network Reveals the lncRNAs-mRNAs Interaction in Glioblastoma Stem Cell Differentiation. Cbio 15, 1187–1196. doi:10.2174/1574893615999200511074226

CrossRef Full Text | Google Scholar

Zhou, J.-R., You, Z.-H., Cheng, L., and Ji, B.-Y. (2021). Prediction of lncRNA-Disease Associations via an Embedding Learning HOPE in Heterogeneous Information Networks. Mol. Ther. - Nucleic Acids 23, 277–285. doi:10.1016/j.omtn.2020.10.040

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, K.-R., Liu, S., Sun, W.-J., Zheng, L.-L., Zhou, H., Yang, J.-H., et al. (2016). ChIPBase v2.0: Decoding Transcriptional Regulatory Networks of Non-coding RNAs and Protein-Coding Genes from ChIP-Seq Data. Nucleic Acids Res. 45, D43–D50. doi:10.1093/nar/gkw965

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, S., Wang, J.-Z., Chen, D., He, Y.-T., Meng, N., Chen, M., et al. (2020). An Oncopeptide Regulates m6A Recognition by the m6A Reader IGF2BP1 and Tumorigenesis. Nat. Commun. 11, 1685. doi:10.1038/s41467-020-15403-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Zou, Q., Xing, P., Wei, L., and Liu, B. (2019). Gene2vec: Gene Subsequence Embedding for Prediction of Mammalian N6-Methyladenosine Sites from mRNA. RNA 25, 205–218. doi:10.1261/rna.069112.118

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: lncRNA, database, mechanism of action, recognition methods, disease

Citation: Gong Y, Zhu W, Sun M and Shi L (2021) Bioinformatics Analysis of Long Non-coding RNA and Related Diseases: An Overview. Front. Genet. 12:813873. doi: 10.3389/fgene.2021.813873

Received: 12 November 2021; Accepted: 26 November 2021;
Published: 08 December 2021.

Edited by:

Dariusz Mrozek, Silesian University of Technology, Poland

Reviewed by:

Wang-Ren Qiu, Jingdezhen Ceramic Institute, China
Yongchun Zuo, Inner Mongolia University, China

Copyright © 2021 Gong, Zhu, Sun and Shi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Meili Sun, MTcwMDI3MTYzOUBxcS5jb20=; Lei Shi, c2xzcGluZUAxNjMuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.