- 1Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
- 2Department of Mathematics, Changzhi Medical College, Changzhi, China
- 3Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China
- 4School of Health and Service Management, Shanxi University of Chinese Medicine, Taiyuan, China
Long non-coding RNAs (lncRNAs) play significant roles in the disease process. Understanding the pathological mechanisms of lncRNAs during the course of various diseases will help clinicians prevent and treat diseases. With the emergence of high-throughput techniques, many biological experiments have been developed to study lncRNA-disease associations. Because experimental methods are costly, slow, and laborious, a growing number of computational models have emerged. Here, we present a new approach using network consistency projection and bi-random walk (NCP-BiRW) to infer hidden lncRNA-disease associations. First, integrated similarity networks for lncRNAs and diseases were constructed by merging similarity information. Subsequently, network consistency projection was applied to calculate space projection scores for lncRNAs and diseases, which were then introduced into a bi-random walk method for association prediction. To test model performance, we employed 5- and 10-fold cross-validation, with the area under the receiver operating characteristic curve as the evaluation indicator. The computational results showed that our method outperformed the other five advanced algorithms. In addition, the novel method was applied to another dataset in the Mammalian ncRNA-Disease Repository (MNDR) database and showed excellent performance. Finally, case studies were carried out on atherosclerosis and leukemia to confirm the effectiveness of our method in practice. In conclusion, we could infer lncRNA-disease associations using the NCP-BiRW model, which may benefit biomedical studies in the future.
Introduction
Long non-coding RNAs (lncRNAs) were primitively considered noise in transcriptional regulation and thought to have no biological functions (Guttman et al., 2013; Li et al., 2019). In recent decades, however, lncRNAs have attracted growing attention from researchers worldwide owing to the discovery of their critical biological functions. Increasing numbers of lncRNAs have been identified in eukaryotes (Guttman et al., 2009) and abnormal lncRNA expression has been shown to cause many human diseases, including nervous system diseases (Qureshi and Mehler, 2013; Chen et al., 2021), cardiovascular diseases (Bhatti et al., 2021; Xie et al., 2021), various cancers (Amelio et al., 2021; Taniue and Akimitsu, 2021), autoimmune diseases (Lodde et al., 2020; Zeni and Mraz, 2021), and blood diseases (Wei et al., 2013; Kirtonia et al., 2021). Therefore, searching for possible lncRNA-disease associations may facilitate the elucidation of the molecular pathogenesis of human diseases and could be relevant in disease diagnosis, prognosis, prevention, and treatment in the clinical setting. At present, researchers mainly study potential lncRNA-disease associations through biological experiment verification and computational model prediction. However, biological experiments are often costly, time-consuming, and inconclusive (Chen et al., 2017). Thus, few lncRNA-disease associations have been verified experimentally, and the use of more advanced algorithms is essential.
LncRNA-disease association predictive models can be roughly classified into two types, the first of which is machine learning-based. Chen and Yan (2013) proposed the calculative model LRLSLDA, which integrates known lncRNA-disease interactions and lncRNA expression profiles and applies the Laplacian regularized least square method to predict disease-related lncRNAs. Subsequently, Chen et al. (2015) developed LRLSLDA-LNCSIM. Under the hypothesis that lncRNAs with similar functions tend to be related to similar diseases, two new functional similarity computational models, LNCSIM1 and LNCSIM2, were developed. Then, the two models were combined with the LRLSLDA model for the prediction of lncRNA-disease associations. Yang et al. (2014) constructed a binary network for genes and diseases, and applied a network propagation algorithm to find hidden lncRNA-disease interactions. On the basis of the naïve Bayesian classifier, Zhao et al. (2015) developed a novel method to identify cancer-related lncRNAs by integrating genome, transcriptome, and regulome data and identified 707 lncRNAs. Furthermore, Lu et al. (2018) proposed SIMCLDA, which first computed disease functional similarity and lncRNA Gaussian interaction profile kernel similarity and then used principal component analysis to extract the principal eigenvector of disease and lncRNA similarity. Finally, the inductive matrix completion technique was used for association prediction. In recent years, there have been many deep learning techniques developed in the field of bioinformatics. Zeng et al. (2020) developed the SDLDA model to predict lncRNA-disease interactions. SDLDA extracted the features of lncRNAs and diseases, including the linear features acquired by the singular value decomposition technique and the non-linear features obtained by the deep learning method. Zeng et al. (2021) proposed a deep matrix factorization model called DMFLDA. Based on the lncRNA-disease associations matrix, the non-linear hidden layers of DMFLDA were employed to learn the latent representation of lncRNAs and diseases, which could capture more complex and nonlinear lncRNA-disease associations. However, negative samples are required for these machine learning methods and are difficult to obtain.
The second type of predictive model is network-based. Sun et al. (2014) constructed the RWRlncD model, in which random walk with restart was used to compute lncRNA functional similarity, and the lncRNA functional similarity network was then combined with the lncRNA-disease and disease similarity networks to form a global network. Finally, the candidate lncRNAs of specific diseases of interest were sorted. Chen (2015) developed KATZLDA, which integrated lncRNA functional similarity, lncRNA expression profiles, disease semantic similarity, Gaussian interaction profile kernel similarity, and the known lncRNA-disease pairs, and then used the KATZ method to predict the potential lncRNA-disease interactions. Wen et al. (2018) developed Lap-BiRWRHLDA. First, Laplacian normalization was applied to compute lncRNA similarity matrix and disease similarity matrix. Then a heterogeneous network was constructed based on lncRNA similarity network, disease similarity network, and known lncRNA-disease associations. Finally, bi-random walk algorithm was applied on this heterogeneous network to predict lncRNA-disease associations. Hu et al. (2019) proposed the BiWalkLDA model, which applied bi-random walk method to predict hidden lncRNA-disease associations. It integrated gene ontology and interaction profiles to calculate disease similarity, and used interaction profiles data to calculate lncRNA similarity in which the cold-start problem was solved by using the local topological structure of a new lncRNA. Xie et al. (2019) proposed NCPHLDA, which calculated the comprehensive similarity for lncRNAs and diseases and then applied a network consistency projection method to infer the interactions between lncRNAs and diseases. The most significant advantage of the network consistency projection algorithm is that it has no parameters. The network consistency projection algorithm and the bi-random walk algorithm have the common characteristic that they both have the calculation process on the similarity networks of lncRNAs and diseases. Wang and Yan (2019) constructed the IDLDA model, which used an improved diffusion method to infer lncRNA-disease interactions based on a combined dataset. Recently, some hybrid computational models have emerged and showed good performance. Xie et al. (2021) designed the RWSF-BLP model to forecast lncRNA-disease interactions. The model first applied a random walk algorithm to fuse various similarity networks and then adopted bidirectional label propagation to make predictions. Yin et al. (2020) created the NCPLP model based on network consistency projection and label propagation to predict microbe-disease interactions. These biological network-based methods provide a fresh perspective and framework with which we can construct new computational models.
Here, we intend to construct a hybrid method consisting of two different methods. According to previous studies, we considered the following three factors in modeling: First, the two methods could be combined properly and reasonably. Second, it is better to have no more parameters, which is directly related to computational efficiency. Third, the combination of two methods should contribute more biological information to the final result. Accordingly, in this paper, we come up with a hybrid method consisting of network consistency projection and bi-random walk (NCP-BiRW) to infer lncRNA-disease interactions. We investigated comprehensive similarity networks for lncRNAs and diseases based on known lncRNA-disease relationships, disease semantic similarity, lncRNA functional similarity, and Gaussian interaction profile (GIP) kernel similarity for lncRNAs and diseases to apply more similarity information. Second, we constructed a heterogeneous network consisting of lncRNA similarity network, disease similarity network, and lncRNA-disease association network. The network consistency projection method was used to compute lncRNA network projection scores and disease network projection scores. Third, we added the results of the network consistency projection algorithm to the bi-random walk algorithm, and finally got the predicted scores of potential lncRNA-disease interactions. Five- and ten-fold cross-validation (CV) were adopted to verify the effectiveness of NCP-BiRW. Our Results demonstrated that our method outperformed the other five classical algorithms and we showed that the model was robust when applied to another dataset. Finally, case studies on atherosclerosis and leukemia were used to further verify the validity of our model.
Materials and Methods
Long non-coding RNA-Disease Associations Dataset
We downloaded known lncRNA-disease associations from the 2017-version LncRNADisease database (Chen et al., 2013) (http://www.cuilab.cn/lncrnadisease). After conducting data quality control and data cleaning, 701 known experimentally validated interactions between 157 diseases and 82 lncRNAs were acquired, as previously reported (Fan et al., 2020).
Gaussian Interaction Profile Kernel Similarity for Long non-coding RNAs and Diseases
Researchers have hypothesized that the more similar two lncRNAs are, the more likely they are to have similar interaction modes with similar diseases (van Laarhoven et al., 2011). Thus, GIP kernel similarity was used to measure the similarities of lncRNAs and diseases. Given lncRNA
where
Similarly, the GIP kernel similarity of disease is calculated as follows:
where
Disease Semantic Similarity
Directed acyclic graphs (DAGs) have been widely used to compute the semantic similarity between diseases when predicting potential lncRNA-disease interactions (Chen et al., 2017). Here, the disease semantic similarity was calculated as previously reported (Fan et al., 2020). First, the Medical Subject Headings (MeSH) descriptors of the diseases we needed were downloaded from the National Library of Medicine (http://www.nlm.nih.gov/) (Wang et al., 2010). We then constructed a DAG for each disease
where
where K is the diseases set in MeSH,
By accumulating the semantic contributions of all the diseases in
In general, the similarity between the two diseases is higher if the nodes sharing in their DAGs are higher. Therefore, we compute the semantic similarity of diseases
Long non-coding RNA Functional Similarity
We computed the functional similarities of lncRNAs according to the LNCSIM model (Chen et al., 2015). Let
In view of the hypothesis that functionally similar lncRNAs are usually related with similar diseases, the functional similarity between lncRNAs
where
Network Consistency Projection and Bi-Random Walk
We constructed a novel model NCP-BiRW involving network consistent projection (Xie et al., 2019) and bi-random walk (Hu et al., 2019) to forecast hidden lncRNA-disease interactions. We divided the model implementation process into three steps. Figure 1 shows the flowchart of the algorithm.
Step 1. construction of integrated similarity networks for lncRNAs and diseasesThe integrated technique was adopted to obtain more similarity information. On the basis of the lncRNA GIP kernel similarity matrix (KL) and the lncRNA functional similarity matrix (FL), the integrated similarity between lncRNAs
Similarly, based on the disease semantic similarity matrix (SV) and the disease GIP kernel similarity matrix (KD), the integrated similarity between diseases
Step 2. network consistency projection for lncRNA and disease spacesWe constructed a heterogeneous network consisting of the above integrated similarity networks and lncRNA-disease association network. The network consistency projection method was utilized to obtain more network topological information (Yin et al., 2020). Network consistency projection can be divided into lncRNA network consistency projection and disease network consistency projection (Li et al., 2019; Xie et al., 2019).The lncRNA network consistency projection fractions can be formulated as follows:
where
where
Step 3. bi-random walk in the integrated similarity networks of lncRNAs and diseasesFirst, the integrated similarity networks, LS and DS were normalized such that all the similarity values were between 0 and 1 (Hu et al., 2019). The formula of the normalized similarity of lncRNAs is as follows:
Similarly, the normalization of the disease similarity is as follows:
The association matrix
Then, we carried out the random walk method for both the lncRNA similarity network and the disease similarity network, called bi-random walk, a global process (Zhang et al., 2018). r1 and r2 are designated as the maximum number of iterations in the lncRNA and disease similarity networks, respectively. If r1 > r2, the lncRNA similarity is considered more important in the predicted process (Hu et al., 2019). On the basis of the results of the network consistency projection, the iteration processes are as follows:
where
Results
Performance Evaluation
We used k-fold CV to evaluate the model performance. In k-fold CV, known lncRNA-disease pairs are divided into k subparts, with k-1 parts as the training set and the remaining part as the testing set. Here, we chose k = 5 (5-fold CV) and k = 10 (10-fold CV). All unknown associations were regarded as candidate samples. The predicted score of each lncRNA-disease pair was obtained using NCP-BiRW. The predicted scores of the test and candidate samples were sorted together. The receiver operating characteristic (ROC) curve was drawn according to the false positive rate (FPR) and the true positive rate (TPR) under different thresholds. The area under the ROC curve (AUC) was employed as a metric to assess the overall performance of our method. For AUC
Effects of Parameters
In this research, there were three parameters:
Comparison With Other Methods
In order to prove the excellent model performance, we compared NCP-BiRW with five other popular algorithms: KATZLDA (Chen, 2015), Lap-BiRWRHLDA (Wen et al., 2018), BiWalkLDA (Hu et al., 2019), NCPHLDA (Xie et al., 2019), and IDLDA (Wang and Yan, 2019). We chose the parameter values for each model in the original reference. First, we conducted 5-fold CV, as shown in Figure 3, and the AUC of NCP-BiRW was 0.8982, which was better than the AUC values of the other five methods (KATZLDA: 0.8622, Lap-BiRWRHLDA: 0.8642, BiWalkLDA: 0.8702, NCPHLDA: 0.8338, and IDLDA: 0.8424). Then, we conducted 10-fold CV, and the AUC of NCP-BiRW was 0.9050 (Figure 3), which had the best performance (KATZLDA: 0.8646, Lap-BiRWRHLDA: 0.8666, BiWalkLDA: 0.8706, NCPHLDA: 0.8862, and IDLDA: 0.8413). In addition, we considered the following two models: 1) NCP, i.e., NCP-BiRW without bi-random walk; and 2) BiRW, i.e., NCP-BiRW without network consistent projection. Then, we compared the two models with NCP-BiRW, as shown in Figure 4. The results showed that our hybrid method was better than every single method. In summary, NCP-BiRW achieved the best performance for predicting lncRNA-disease interactions using the dataset from the LncRNADisease database.
Robustness of Evaluation Using Another Dataset
We then applied NCP-BiRW to another dataset to determine whether our method could still achieve outstanding performance. We chose the Mammalian ncRNA-Disease Repository (MNDR) database (Cui et al., 2018), from which the known lncRNA-disease interactions were downloaded. After data cleaning, 1,680 known interactions between 190 diseases and 89 lncRNAs were selected (Fan et al., 2020). We performed the same experiment as above, and Figure 5 shows the final computational results. In 5-fold CV, the AUC of NCP-BiRW was 0.9556, which was better than those of KATZLDA (0.9450), Lap-BiRWRHLDA (0.9374), BiWalkLDA (0.9412), NCPHLDA (0.9355), and IDLDA (0.9452). In 10-fold CV, NCP-BiRW also performed the best. The AUCs of KATZLDA, Lap-BiRWRHLDA, BiWalkLDA, NCPHLDA, IDLDA and NCP-BiRW were 0.9466, 0.9380, 0.9420, 0.9539, 0.9466 and 0.9591, respectively. The excellent performance of NCP-BiRW using the MNDR database demonstrated the robustness of our model.
Case Studies
Next, we chose atherosclerosis (AS) and leukemia as model diseases, and conducted case studies using these two diseases to further confirm the predictive effects of NCP-BiRW. The top 10 candidate lncRNAs predicted by our method for the two diseases are listed in Tables 1, 2. Eventually, lncRNAs in the tables were verified using the MNDR database (Ning et al., 2021) and the Lnc2Cancer database (Gao et al., 2021) (http://bio-bigdata.hrbmu.edu.cn/lnc2cancer).
AS is a chronic inflammatory disease characterized by lipid-rich plaques in the artery wall (Vigario et al., 2020). AS is the primary cause of most cardiovascular diseases, including acute myocardial infarction and stroke (Li et al., 2020). Many lncRNAs have been shown to function in AS, the central underlying pathology of cardiovascular diseases (Josefs and Boon, 2020). In this study, we next predicted the top 10 lncRNAs associated with AS (Table 1). Seven of these top 10 lncRNAs were verified using the MNDR database. For example, MALAT1 (ranked first) inhibits AS through miR-155 and SOCS1. Specifically, MALAT1 inhibits the release of inflammatory cytokines and blocks apoptosis by sponging miR-155 and enhancing SOCS1 expression to suppress the Janus kinase/signal transducer and activator of the transcription pathway (Li et al., 2018). Additionally, MEG3 (ranked second), an endothelial-enriched lncRNA, acts as a competing endogenous RNA against miR-223, which may explain the anti-AS functions of melatonin (Zhang et al., 2018). HOTAIR (ranked third), is related to the progression of various cancers; however, its functions in AS are still unclear. Notably, HOTAIR has been shown to control AS progression by sponging miR-330-5p in THP-1 cells (Liu et al., 2019).
Leukemia, a type of blood or bone marrow cancer, involves excessive production of white cells (Luo et al., 2015). There are four main types of leukemia: acute lymphocytic leukemia, acute myeloid leukemia (AML), chronic lymphocytic leukemia, and chronic myeloid leukemia (CML) (Siegel et al., 2021). In 2020, over 31,000 people died of leukemia worldwide (Siegel et al., 2021). Recent studies have demonstrated the relationships among lncRNAs and the pathophysiology of leukemia (Gao et al., 2020). The top 10 predicted leukemia-related lncRNAs are listed in Table 2. All 10 were validated using the Lnc2Cancer database and the MNDR database. MALAT1 (ranked third) promotes the survival of CML cells, stimulates the cell cycle and imatinib resistance by sponging miR-328, highlighting the vital roles of MALAT1 as a microRNA sponge in CML and supporting the application of lncRNA-targeted therapies in the treatment of CML (Wen et al., 2018). Additionally, TUG1 (ranked eighth) promotes the progression of AML through the miR-370-3p/mitogen-activated protein kinase 1 (MAPK1)/extracellular signal-regulated kinase (ERK) signaling pathway. The MAPK1/ERK signaling pathway inhibits the epithelial-mesenchymal transition and thus blocks the migration and invasion of AML cells (Li et al., 2019). Studies have shown that MIAT (ranked ninth) is highly expressed in various solid tumors in humans and promotes AML progression by negatively regulating miR-495, which may be a promising therapeutic target in patients with AML (Wang et al., 2019).
Discussion
According to a substantial body of evidence, lncRNAs are critical for disease research. Identification of hidden lncRNA-disease pairs may provide insights into the pathological mechanisms of diseases, disease prevention, diagnosis, and treatment. Experimental techniques have been used to identify unknown lncRNA-disease interactions; however, these approaches are slow and costly. Therefore, computing methods have been developed as alternative approaches. Here, we constructed a new algorithm, NCP-BiRW, based on network consistency projection and bi-random walk. First, we integrated two similarity networks, i.e., one for diseases combining disease GIP kernel similarity and disease semantic similarity, and the other for lncRNAs combining lncRNA functional similarity and lncRNA GIP kernel similarity. Then, we used NCP-BiRW to forecast lncRNA-disease interactions on the LncRNADisease database. To validate its superiority, NCP-BiRW was compared with five classical models: KATZLDA, Lap-BiRWRHLDA, BiWalkLDA, NCPHLDA, and IDLDA based on 5- and 10-fold CV frameworks. The AUCs of NCP-BiRW were 0.8982 and 0.9050 for the two frameworks, respectively. To further test the stability of NCP-BiRW, we applied six methods on the MNDR database. After the same experimental process, the performance of NCP-BiRW was found to be optimal. Furthermore, case studies on AS and leukemia were used to validate the predictive performance of our algorithm in practice, and the prediction accuracy of the top 10 lncRNAs in AS and leukemia were 70% and 100%, respectively.
The reasons for the outstanding performance of our model are as follows. First, a considerable amount of biological information about lncRNAs and diseases was applied. Indeed, we used disease semantic similarity, GIP kernel similarity, and lncRNA functional similarity to construct similarity networks. Second, we did not use negative samples. Third, for making full use of network topological information, network consistency projection was applied. Moreover, no parameters were necessary for this step, so the computational efficiency was improved. Finally, the model added the results of network consistency projection into the bi-random walk, so more network topological information was added to the initial association matrix in the computing process of the bi-random walk method. By conducting random walks on two similarity networks, the similarity of lncRNAs and diseases are used reasonably and fully. Based on the above, the performance of the algorithm has been improved. In the future, our model can be used for other association predictions, such as miRNA-disease, gene-disease, drug-disease associations.
Despite these advantages, there are still some limitations of the NCP-BiRW framework. First, the proportion of known lncRNA-disease interactions in the LncRNADisease database is only 5.4%, and the original association matrix is thus very sparse; this could influence various calculations, including GIP kernel similarity, network consistency projection, and bi-random walk. Second, in this study, we only considered two factors: lncRNAs and diseases, and more biological information on different factors (such as genes, protein, and other types of RNAs) may provide more evidence for the prediction of lncRNA-disease interactions. Therefore, more valuable biological information is necessary for the future. Finally, NCP-BiRW is a network-based method. With the emergence of new methods in different fields, developing more algorithms for integration of various fields is essential. In our future studies, we will plan to apply multiple types of data with more biological information to association prediction models in order to yield more accurate predictive effects.
Data Availability Statement
Publicly available datasets were analyzed in this study. This data can be found here: https://github.com/CDMB-lab/IDSSIM.
Author Contributions
YL conceived the study, designed the study protocol, and wrote the code and the manuscript. HY, CZ, KW, JY, and HC participated in the data analysis. YZ came up with the original concept for the study, oversaw the data analysis, and revised the paper. All authors contributed to the article and approved the final manuscript.
Funding
This research was supported by the National Natural Science Foundation of China (Grant No. 82173631) and Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment (Grant No. 201805D111006).
Conflict of Interest
The authors declare that the research was conducted without any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We would like to thank Editage (www.editage.cn) for English language editing.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.862272/full#supplementary-material
References
Amelio, I., Bernassola, F., and Candi, E. (2021). Emerging Roles of Long Non-coding RNAs in Breast Cancer Biology and Management. Semin. Cancer Biol. 72, 36–45. doi:10.1016/j.semcancer.2020.06.019
Bao, W., Jiang, Z., and Huang, D.-S. (2017). Novel Human Microbe-Disease Association Prediction Using Network Consistency Projection. BMC Bioinformatics 18 (Suppl. 16), 543. doi:10.1186/s12859-017-1968-2
Bhatti, G. K., Khullar, N., Sidhu, I. S., Navik, U. S., Reddy, A. P., Reddy, P. H., et al. (2021). Emerging Role of Non‐coding RNA in Health and Disease. Metab. Brain Dis. 36 (6), 1119–1134. doi:10.1007/s11011-021-00739-y
Chen, G., Wang, Z., Wang, D., Qiu, C., Liu, M., Chen, X., et al. (2013). LncRNADisease: a Database for Long-Non-Coding RNA-Associated Diseases. Nucleic Acids Res. 41 (D1), D983–D986. doi:10.1093/nar/gks1099
Chen, M., Lai, X., Wang, X., Ying, J., Zhang, L., Zhou, B., et al. (2021). Long Non-coding RNAs and Circular RNAs: Insights into Microglia and Astrocyte Mediated Neurological Diseases. Front. Mol. Neurosci. 14, 745066. doi:10.3389/fnmol.2021.745066
Chen, X., Clarence Yan, C., Luo, C., Ji, W., Zhang, Y., and Dai, Q. (2015). Constructing lncRNA Functional Similarity Network Based on lncRNA-Disease Associations and Disease Semantic Similarity. Sci. Rep. 5 (1), 11338. doi:10.1038/srep11338
Chen, X. (2015). KATZLDA: KATZ Measure for the lncRNA-Disease Association Prediction. Sci. Rep. 5 (1), 16840. doi:10.1038/srep16840
Chen, X., Yan, C. C., Zhang, X., and You, Z.-H. (2017). Long Non-coding RNAs and Complex Diseases: from Experimental Results to Computational Models. Brief Bioinform 18 (4), bbw060–576. doi:10.1093/bib/bbw060
Chen, X., and Yan, G.-Y. (2013). Novel Human lncRNA-Disease Association Inference Based on lncRNA Expression Profiles. Bioinformatics 29 (20), 2617–2624. doi:10.1093/bioinformatics/btt426
Cui, T., Zhang, L., Huang, Y., Yi, Y., Tan, P., Zhao, Y., et al. (2018). MNDR v2.0: an Updated Resource of ncRNA-Disease Associations in Mammals. Nucleic Acids Res. 46 (D1), D371–D374. doi:10.1093/nar/gkx1025
Fan, W., Shang, J., Li, F., Sun, Y., Yuan, S., and Liu, J.-X. (2020). IDSSIM: an lncRNA Functional Similarity Calculation Model Based on an Improved Disease Semantic Similarity Method. BMC Bioinformatics 21 (1), 339. doi:10.1186/s12859-020-03699-9
Gao, J., Wang, F., Wu, P., Chen, Y., and Jia, Y. (2020). Aberrant LncRNA Expression in Leukemia. J. Cancer 11 (14), 4284–4296. doi:10.7150/jca.42093
Gao, Y., Shang, S., Guo, S., Li, X., Zhou, H., Liu, H., et al. (2021). Lnc2Cancer 3.0: an Updated Resource for Experimentally Supported lncRNA/circRNA Cancer Associations and Web Tools Based on RNA-Seq and scRNA-Seq Data. Nucleic Acids Res. 49 (D1), D1251–D1258. doi:10.1093/nar/gkaa1006
Guttman, M., Amit, I., Garber, M., French, C., Lin, M. F., Feldser, D., et al. (2009). Chromatin Signature Reveals over a Thousand Highly Conserved Large Non-coding RNAs in Mammals. Nature 458 (7235), 223–227. doi:10.1038/nature07672
Guttman, M., Russell, P., Ingolia, N. T., Weissman, J. S., and Lander, E. S. (2013). Ribosome Profiling Provides Evidence that Large Noncoding RNAs Do Not Encode Proteins. Cell 154 (1), 240–251. doi:10.1016/j.cell.2013.06.009
Hu, J., Gao, Y., Li, J., Zheng, Y., Wang, J., and Shang, X. (2019). A Novel Algorithm Based on Bi-random Walks to Identify Disease-Related lncRNAs. BMC Bioinformatics 20 (Suppl. 18), 569. doi:10.1186/s12859-019-3128-3
Josefs, T., and Boon, R. A. (2020). The Long Non-coding Road to Atherosclerosis. Curr. Atheroscler. Rep. 22 (10), 55. doi:10.1007/s11883-020-00872-6
Kirtonia, A., Ashrafizadeh, M., Zarrabi, A., Hushmandi, K., Zabolian, A., Bejandi, A. K., et al. (2021). Long Noncoding RNAs: A Novel Insight in the Leukemogenesis and Drug Resistance in Acute Myeloid Leukemia. J. Cell Physiol. 237 (1), 450–465. doi:10.1002/jcp.30590
Li, C., Dou, Y., Chen, Y., Qi, Y., Li, L., Han, S., et al. (2020). Site‐Specific MicroRNA‐33 Antagonism by pH‐Responsive Nanotherapies for Treatment of Atherosclerosis via Regulating Cholesterol Efflux and Adaptive Immunity. Adv. Funct. Mater. 30 (42), 2002131. doi:10.1002/adfm.202002131
Li, G., Luo, J., Liang, C., Xiao, Q., Ding, P., and Zhang, Y. (2019). Prediction of LncRNA-Disease Associations Based on Network Consistency Projection. IEEE Access 7, 58849–58856. doi:10.1109/access.2019.2914533
Li, G., Zheng, P., Wang, H., Ai, Y., and Mao, X. (2019). Long Non-coding RNA TUG1 Modulates Proliferation, Migration, and Invasion of Acute Myeloid Leukemia Cells via Regulating miR-370-3p/MAPK1/ERK. Ott 12, 10375–10388. doi:10.2147/OTT.S217795
Li, S., Sun, Y., Zhong, L., Xiao, Z., Yang, M., Chen, M., et al. (2018). The Suppression of Ox-LDL-Induced Inflammatory Cytokine Release and Apoptosis of HCAECs by Long Non-coding RNA-MALAT1 via Regulating microRNA-155/SOCS1 Pathway. Nutr. Metab. Cardiovasc. Dis. 28 (11), 1175–1187. doi:10.1016/j.numecd.2018.06.017
Li, Y., Li, J., and Bian, N. (2019). DNILMF-LDA: Prediction of lncRNA-Disease Associations by Dual-Network Integrated Logistic Matrix Factorization and Bayesian Optimization. Genes 10 (8), 608. doi:10.3390/genes10080608
Liu, J., Huang, G. Q., and Ke, Z. P. (2019). Silence of Long Intergenic Noncoding RNA HOTAIR Ameliorates Oxidative Stress and Inflammation Response in ox‐LDL‐treated Human Macrophages by Upregulating miR‐330‐5p. J. Cel Physiol 234 (4), 5134–5142. doi:10.1002/jcp.27317
Lodde, V., Murgia, G., Simula, E. R., Steri, M., Floris, M., and Idda, M. L. (2020). Long Noncoding RNAs and Circular RNAs in Autoimmune Diseases. Biomolecules 10 (7), 1044. doi:10.3390/biom10071044
Lu, C., Yang, M., Luo, F., Wu, F.-X., Li, M., Pan, Y., et al. (2018). Prediction of lncRNA-Disease Associations Based on Inductive Matrix Completion. Bioinformatics 34 (19), 3357–3364. doi:10.1093/bioinformatics/bty327
Luo, N., Zhao, L.-C., Shi, Q.-Q., Feng, Z.-Q., Chen, D.-L., and Li, J. (2015). Induction of Apoptosis in Human Leukemic Cell Lines by Diallyl Disulfide via Modulation of EGFR/ERK/PKM2 Signaling Pathways. Asian Pac. J. Cancer Prev. 16 (8), 3509–3515. doi:10.7314/apjcp.2015.16.8.3509
Ning, L., Cui, T., Zheng, B., Wang, N., Luo, J., Yang, B., et al. (2021). MNDR v3.0: Mammal ncRNA-Disease Repository with Increased Coverage and Annotation. Nucleic Acids Res. 49 (D1), D160–D164. doi:10.1093/nar/gkaa707
Qureshi, I. A., and Mehler, M. F. (2013). Long Non-coding RNAs: Novel Targets for Nervous System Disease Diagnosis and Therapy. Neurotherapeutics 10 (4), 632–646. doi:10.1007/s13311-013-0199-0
Siegel, R. L., Miller, K. D., Fuchs, H. E., and Jemal, A. (2021). Cancer Statistics, 2021. CA A. Cancer J. Clin. 71 (1), 7–33. doi:10.3322/caac.21654
Sun, J., Shi, H., Wang, Z., Zhang, C., Liu, L., Wang, L., et al. (2014). Inferring Novel lncRNA-Disease Associations Based on a Random Walk Model of a lncRNA Functional Similarity Network. Mol. Biosyst. 10 (8), 2074–2081. doi:10.1039/c3mb70608g
Taniue, K., and Akimitsu, N. (2021). The Functions and Unique Features of LncRNAs in Cancer Development and Tumorigenesis. Ijms 22 (2), 632. doi:10.3390/ijms22020632
van Laarhoven, T., Nabuurs, S. B., and Marchiori, E. (2011). Gaussian Interaction Profile Kernels for Predicting Drug-Target Interaction. Bioinformatics 27 (21), 3036–3043. doi:10.1093/bioinformatics/btr500
Vigario, F. L., Kuiper, J., and Slütter, B. (2020). Tolerogenic Vaccines for the Treatment of Cardiovascular Diseases. EBioMedicine 57, 102827. doi:10.1016/j.ebiom.2020.102827
Wang, D., Wang, J., Lu, M., Song, F., and Cui, Q. (2010). Inferring the Human microRNA Functional Similarity and Functional Network Based on microRNA-Associated Diseases. Bioinformatics 26 (13), 1644–1650. doi:10.1093/bioinformatics/btq241
Wang, G., Li, X., Song, L., Pan, H., Jiang, J., and Sun, L. (2019). Long Noncoding RNA MIAT Promotes the Progression of Acute Myeloid Leukemia by Negatively Regulating miR-495. Leuk. Res. 87, 106265. doi:10.1016/j.leukres.2019.106265
Wang, Q., and Yan, G. (2019). IDLDA: An Improved Diffusion Model for Predicting LncRNA-Disease Associations. Front. Genet. 10, 1259. doi:10.3389/fgene.2019.01259
Wei, P., Han, B., and Chen, Y. (2013). Role of Long Non-coding RNAs in normal and Malignant Hematopoiesis. Sci. China Life Sci. 56 (10), 867–875. doi:10.1007/s11427-013-4550-9
Wen, F., Cao, Y.-X., Luo, Z.-Y., Liao, P., and Lu, Z.-W. (2018). LncRNA MALAT1 Promotes Cell Proliferation and Imatinib Resistance by Sponging miR-328 in Chronic Myelogenous Leukemia. Biochem. Biophysical Res. Commun. 507 (1-4), 1–8. doi:10.1016/j.bbrc.2018.09.034
Wen, Y., Han, G., and Anh, V. V. (2018). Laplacian Normalization and Bi-random Walks on Heterogeneous Networks for Predicting lncRNA-Disease Associations. BMC Syst. Biol. 12 (Suppl. 9), 122. doi:10.1186/s12918-018-0660-0
Xie, G., Huang, B., Sun, Y., Wu, C., and Han, Y. (2021). RWSF-BLP: a Novel lncRNA-Disease Association Prediction Model Using Random Walk-Based Multi-Similarity Fusion and Bidirectional Label Propagation. Mol. Genet. Genomics 296 (3), 473–483. doi:10.1007/s00438-021-01764-3
Xie, G., Huang, Z., Liu, Z., Lin, Z., and Ma, L. (2019). NCPHLDA: a Novel Method for Human lncRNA-Disease Association Prediction Based on Network Consistency Projection. Mol. Omics 15 (6), 442–450. doi:10.1039/c9mo00092e
Xie, L., Zhang, Q., Mao, J., Zhang, J., and Li, L. (2021). The Roles of lncRNA in Myocardial Infarction: Molecular Mechanisms, Diagnosis Biomarkers, and Therapeutic Perspectives. Front. Cel Dev. Biol. 9, 680713. doi:10.3389/fcell.2021.680713
Yang, X., Gao, L., Guo, X., Shi, X., Wu, H., Song, F., et al. (2014). A Network Based Method for Analysis of lncRNA-Disease Associations and Prediction of lncRNAs Implicated in Diseases. PLoS One 9 (1), e87797. doi:10.1371/journal.pone.0087797
Yin, M.-M., Liu, J.-X., Gao, Y.-L., Kong, X.-Z., and Zheng, C.-H. (2020). NCPLP: A Novel Approach for Predicting Microbe-Associated Diseases with Network Consistency Projection and Label Propagation. IEEE Trans. Cybern. 99, 1–9. doi:10.1109/TCYB.2020.3026652
Zeng, M., Lu, C., Fei, Z., Wu, F.-X., Li, Y., Wang, J., et al. (2021). DMFLDA: A Deep Learning Framework for Predicting lncRNA-Disease Associations. Ieee/acm Trans. Comput. Biol. Bioinf. 18 (6), 2353–2363. doi:10.1109/TCBB.2020.2983958
Zeng, M., Lu, C., Zhang, F., Li, Y., Wu, F.-X., Li, Y., et al. (2020). SDLDA: lncRNA-Disease Association Prediction Based on Singular Value Decomposition and Deep Learning. Methods 179, 73–80. doi:10.1016/j.ymeth.2020.05.002
Zeni, P. F., and Mraz, M. (2021). LncRNAs in Adaptive Immunity: Role in Physiological and Pathological Conditions. RNA Biol. 18 (5), 619–632. doi:10.1080/15476286.2020.1838783
Zhang, J., Zou, S., and Deng, L. (2018). Gene Ontology-Based Function Prediction of Long Non-coding RNAs Using Bi-random Walk. BMC Med. Genomics 11 (Suppl. 5), 99. doi:10.1186/s12920-018-0414-2
Zhang, Y., Liu, X., Bai, X., Lin, Y., Li, Z., Fu, J., et al. (2018). Melatonin Prevents Endothelial Cell Pyroptosis via Regulation of Long Noncoding RNA MEG3/miR-223/NLRP3 axis. J. Pineal Res. 64 (2), e12449. doi:10.1111/jpi.12449
Keywords: lncRNA-disease association prediction, integrated similarity, network consistency projection, normalization, bi-random walk
Citation: Liu Y, Yang H, Zheng C, Wang K, Yan J, Cao H and Zhang Y (2022) NCP-BiRW: A Hybrid Approach for Predicting Long Noncoding RNA-Disease Associations by Network Consistency Projection and Bi-Random Walk. Front. Genet. 13:862272. doi: 10.3389/fgene.2022.862272
Received: 25 January 2022; Accepted: 21 March 2022;
Published: 13 April 2022.
Edited by:
Yichuan Liu, Children’s Hospital of Philadelphia (CHOP), United StatesCopyright © 2022 Liu, Yang, Zheng, Wang, Yan, Cao and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence:Yanbo Zhang, c3htdXp5YkAxMjYuY29t