- 1School of Cyberspace Security, Qufu Normal University, Qufu, China
- 2School of Computer Science and Technology, Anhui University, Hefei, China
An increasing number of experiments had verified that miRNA expression is related to human diseases. The miRNA expression profile may be an indicator of clinical diagnosis and provides a new direction for the prevention and treatment of complex diseases. In this work, we present a weighted voting-based model for predicting miRNA–disease association (WVMDA). To reasonably build a network of similarity, we established credibility similarity based on the reliability of known associations and used it to improve the original incomplete similarity. To eliminate noise interference as much as possible while maintaining more reliable similarity information, we developed a filter. More importantly, to ensure the fairness and efficiency of weighted voting, we focus on the design of weighting. Finally, cross-validation experiments and case studies are undertaken to verify the efficacy of the proposed model. The results showed that WVMDA could efficiently identify miRNAs associated with the disease.
Introduction
MicroRNA (miRNA) is a class of non-coding single-stranded RNA with a length of approximately 22 nucleotides, which play a huge role in cell differentiation, biological development, and disease attack (Ambros, 2001; Lee and Ambros, 2001; Ambros, 2004; Bartel, 2004). By comparing the expression profiles of different miRNAs in cancer cells and normal cells, the researchers found that some miRNAs can inhibit the occurrence and development of malignant tumors (Esquela-Kerscher and Slack, 2006; Huang et al., 2008; Wang et al., 2013), such as breast cancer (Blenkiron et al., 1981) and prostate cancer (Garzon et al., 2006). Therefore, the discovery of disease-related miRNAs is of great significance to prevent and treat human diseases. However, it is expensive and time-consuming to look for miRNAs associated with the disease through biological experiments. Therefore, a large number of calculation methods have been developed over the past several years (Ji et al., 2020; Chen et al., 2019; Zhao et al., 2019), and some relevant datasets have also been constructed (Jiang et al., 2009; Yang et al., 2014; Yang et al., 2016; Huang et al., 2018).
In the past, a large number of methods based on measuring biological information have been established, and this idea has been the main theme of miRNA–disease association prediction. Based on the hypothesis that miRNAs with similar functions are related to the same diseases (Chen et al., 2021), Jiang et al. (2010) developed a model that uses hypergeometric distribution to determine the association between diseases and miRNAs. Since then, most researchers have based their predictions on this assumption. Yang et al. (2018) proposed a new algorithm, MiRGOFS, to measure semantic similarity and miRNA similarity based on GO terms. Chen et al. (2016) predicted potential disease-associated miRNAs by integrating miRNA functional similarity (Chen et al., 2015), disease semantic similarity, and Gaussian interaction profile kernel similarity, which is a calculation method that integrates a variety of biological information and greatly reduces the time and expenditure of biological experiments. In addition, introducing associated biological information also serves as a supplementary reference for predictive goals. Chen et al. (2018) introduced lncRNA into miRNA–disease association prediction. They constructed a miRNA–lncRNA–disease heterogeneous network and applied label propagation to identify disease-related miRNAs. Analogous to introduce other types of data, Ji et al. (2020) integrated the association between miRNA and protein and the association between protein and disease to build a tripartite network. Zheng et al. (2020) first introduced incremental learning into the field of biological association prediction. This method can distinguish the associations of previous training when adjusting new data, which strengthens the ability of acquisition, adjustment, and transfer to learning the interaction mode of miRNA and disease.
In addition to integrated biological information as a research subject, researchers also put forward a variety of colorful models, providing inspiration for follow-up research. Chen et al. (2020) presented KBMFMDA to estimate the association network by project miRNA and disease into a unified subspace. This method combines kernel-based nonlinear dimensionality reduction, matrix factorization, and binary classification. Zhao et al. presented the ABMDA to infer potential associations of miRNA–disease, which utilized a random sampling way to balance the positive and negative samples. Besides, ABMDA applied the decision tree to serve as weak classifiers that were integrated to improve the accuracy of the provided learning method. (Toprak and Eryilmaz, 2020) used weighted known nearest neighbor and network congruence projection techniques to predict new miRNA–disease relationships after integrating multiple similarity degrees. The model NCMCMDA (Chen et al., 2020) combined neighborhood constraint with matrix completion and provided a new way to predict potential associations with similarity information. After the task of recovering missing associations was transformed into an optimization problem, the model solved it with a fast iterative shrinkage threshold algorithm. SMALF (Liu et al., 2021) uses XGBoost as the final prediction model and stacked automatic encoders learn miRNA potential features and disease potential features from the original miRNA–disease association matrix, which helps to improve the sparsity and incompleteness of existing datasets.
It is worth mentioning separately that the application of neural networks provides a novel idea for predicting disease-related miRNAs. DBNMDA (Chen et al., 2020) constructed feature vectors for all miRNA–disease pairs to pretrain restricted Boltzmann machines and put the same amount of positive and negative samples into the deep-belief network to get the final prediction results. Li et al. (2021) proposed GAEMDA to identify potential miRNA–disease associations in an end-to-end manner. In multilayer perception machine learning of diverse dimensions of semantic information, the introduction of a graph neural network serves to aggregate the neighborhood information of nodes. The model NIMCGCN (Li et al., 2020) put miRNA similarity and disease similarity into the graph convolutional neural network to learn the potential feature representation of miRNA and disease, and then these features were input into the new neural induction matrix completion model to train its parameters in a supervised manner. Finally, the trained model is used to recover the unknown association.
The above methods provide us with important references, while label propagation (Chen et al., 2018; Yu et al., 2018) and the weighted voting method (Tong and Kain, 1988; Campbell and Kelly, 2010) directly give us great inspiration. In the label propagation algorithm, the elements in the adjacency matrix are iterated with the similarity matrix as a reference until the adjacency matrix converges, and the converged adjacency matrix is used to infer potential associations. Weighted voting is a method of apportioning an unequal number of votes to members of a special proportion. Referring to the algorithm of label propagation, and considering the weighted voting method, we combine the two to get the weighted voting-based model for predicting miRNA–disease association (WVMDA), which does not require iteration.
WVMDA is also a recommendation algorithm in essence, and the main factor that assesses its performance is the construction of voting weight. Its purpose is tantamount to control the weight to get the voting result as fair as possible, and not to let the members of a certain class control the whole situation, and also not to let some classes have no sense of existence. In addition to the design prediction model, we also handled similarity. First, we construct the credibility similarity and use it to complete the existing dataset. Second, we design a filtering method to extract more reliable similarity information while eliminating noise interference as much as possible. In the experimental part, we visualized the processing of similarity to observe its effect. The five-fold cross-validation (5CV) and global leave-one-out cross-validation (LOOCV) were used to measure the performance of our method, and AUC values of 0.9537 and 0.9683 were obtained, respectively. In addition, we performed case studies on human prostate tumors and looked for the top predictor miRNAs in other datasets, and the results showed that our method identified the majority of disease-related miRNAs. In conclusion, WVMDA effectively optimizes the similarity and has certain reliability in predicting miRNA–disease association.
Materials and Methods
Human miRNA–Disease Associations
In this paper, we downloaded the validated association of miRNA–disease from the HMDD v2.0 database. We defined an adjacency matrix
miRNA Functional Similarity
The functional similarity of miRNAs was calculated based on the basic assumption that functionally similar miRNAs tended to be associated with similar diseases (Cui, 2010). We can load miRNA functional similarity data from http://www.cuilab.cn/files/images/cuilab/misim.zip. From these data, we constructed
Disease Semantic Similarity
The MeSH database includes many disease descriptions (Lipscomb, 2000). Directed acyclic graphs (DAG) are used to calculate disease semantic similarity. For node D, we define D(D) = [T(D), E(D)], where T(D) and E(D) are the nodes set and edges set, respectively. (D) includes node D and its ancestor nodes, and (D) represents the direct connection between parent nodes and child nodes. The contribution value of disease
Where
Thus, the semantic similarity of disease
Where
Credibility Similarity
In order to solve the problem of the incompleteness of the existing dataset, we established a novel similarity network based on the association network in this section. The building principle is that if two miRNAs are alike in expression for the same disease, then we believe that the two miRNAs are more analogous.
Unlike previous methods for establishing similarity, the known association matrix was first addressed. Compared with the unknown association, we thought the known association had a higher credibility. Consequently, the credibility of the known association was designed to be
The similarity of
The range of the similarity matrix calculated according to Eq. 5 is too wide, so it is necessary to reduce it to between 0 and 1. Since our weighted voting model only uses the diagonal elements of the similarity matrix when the voters vote for themselves, its definition does not affect the result of the weighted voting, so we do not need to calculate the diagonal elements of the matrix and set them to 0. Since the elements of the principal diagonal are very large, which affects the scaling of other elements, it is also essential to set the principal diagonal element to 0 before the operation of Eq. 6. With the same method, the credibility similarity
Based on miRNA functional similarity and credibility similarity of our constructs, the integrated miRNA similarity
WVMDA
To infer potential associations from known miRNA–disease associations, we proposed a weighted voting method called WVMDA (Figure 2). In WVMDA, the elements of the matrix composed of diseases and miRNAs are regarded as members to be voted, where known associations are regarded as members with voting rights, and these voting members vote for other members according to the designed weight. The final voting result will serve as the prediction result, and members with higher votes are more likely to be potential associations.
FIGURE 2. WVMDA can be divided into three steps: (A) we construct the credibility similarity network through known associations, (B) we filtered them after combining both similarity, and (C) weighted voting is used as the predictive model.
Voting Method
Assuming that there is a known association
When voting for all candidates in group
where
When voting for all candidates in group
According to this idea, the final voting result can be achieved by repeating the operation on all members with voting rights. Our design of voting method is so simple, but planning its weight is the highest priority, which directly determines the rationality and effectiveness of our method.
Assuming the weight of the vote is 1, there will be some unreasonable problems. If there is only a single voter
Furthermore, when there are several voters whose right to vote is 1, the number of votes obtained by the members of the group with the most voters will be significant. On the contrary, members of smaller groups will receive very few votes. Even though candidates with more voters are more likely to be potential association, we do not wish to see such an extreme imbalance. In this situation of extreme imbalance, some groups control the whole situation, whereas others have no meaning of existence.
Basic Voting Weight
With regard to Figure 4, the vote is extremely imbalanced. Obviously, several members have the right to vote in row 3, whereas a single member has the right to vote in row 2. As a result, members of the third row will receive more votes than the second row. For example,
where
FIGURE 4. The dark-colored elements represent the voters, and the remaining elements are the candidates. The number in the figure represents the number of times the current member has been voted and does not represent the final score.
The basic voting weight is determined by the number of voters linked to the current candidate. This design ensures that the overall number of votes for each candidate is 1. Because the basic voting weight reduces the voting rights of voters from related groups, each group will not be dominant or trivial. However, this kind of operation will make it impossible to reflect the differences of all potential connections, so we need to further plan the uniqueness of each vote.
Group Weight
Following the above ideas, we have successfully assigned the weight of each vote that guarantees that the total number of votes for each candidate is 1. But common sense is that the bigger the group, the bigger their voice, meaning that the current candidate should be more credible if recommended by more voters. Therefore, we must consider extending the influence of larger groups a little more. We design weights according to the influence of the group to improve the voice of the group:
where
Candidate’s Weight
In order to account for the different status between the voter and the candidate, as well as the difference between the different candidates, we need a reference to control the weight of the acceptance of the candidate of the vote.
Both miRNA similarity and disease similarity are generally set between 0 and 1. The higher the value, the more similar the two diseases or two miRNAs. The similarity between voters and candidates is equivalent to the efficiency of the candidate in accepting votes. Due to the difference in similarity, the efficiency of different candidates for voting is also different, which reflects the uniqueness of different candidates to a certain extent. Since the main diagonal element of the similarity matrix is 1, the absolute status of voters can also be guaranteed.
Since there is a great amount of noise in the similarity network, which affects the predictive performance of the model, we designed a filter to retain the more reliable information. Taking a row of the matrix as an example, our goal is to find the smallest valuable element in the sequence. If the sequence is arranged in descending order, the above problem is approximately to find the range that falls faster and is relatively early in the sequence. Considering the particularity of some sequences and for easier implementation of operations, we preset a hyperparameter to represent the hypothetical position and find the most reasonable element with the same level of element size as that of the hypothetical position.
Figure 5 displays the distribution of a row in the miRNA similarity matrix. Due to the different number of miRNA sequences and disease sequences, it is not reasonable to use a fixed number to represent the hypothetical position. It is a better choice to use the ratio of the hypothetical position to the sequence length. In the processing of miRNA similarity, the hypothetical position was defined as
Elements before the imaginary position are preserved, and elements after that are set to 0 (Figure 6). Filter each row of the matrix according to the above principles, thereby retaining an appropriate amount of reliable information depending on their different distribution.
FIGURE 6. According to the
This weight maintains the status of voters and reflects the differences between different candidates. The more scattered the elements in the similarity matrix are, the more obvious this difference is. The more reasonable the similarity matrix is constructed, the better the effect of the model.
In conclusion, we finally determined the voting weight:
The association prediction score between disease
Results
In this section, we conducted a number of different experiments to observe and evaluate the effectiveness of this approach, including visualization of similarity processing, adjustment of important hyperparameters, comparison with other existing methods, and analysis based on disease cases.
Visualization of Similarity Processing
Taking miRNA similarity as an example, we firstly fused functional similarity and credibility similarity, and then put it into the similarity filter. To observe the situation of similarity matrix more intuitively, we choose the heat map to express them. The process of their change is shown in Figure 7 below:
As shown in the figure, the similarity network after fusion is denser, and the similarity after filtering retains more reliable information, which provides a guarantee for the subsequent prediction.
Performance Evaluation
In this part, we evaluate the performance of the model. The main measurement methods are 5CV and global LOOCV. 5CV uses four-fifths of the positive samples as the training set, the remaining one-fifth of the positive samples and all the negative samples as the test set, and measures the effect of the model according to the accuracy of the test. When describing its accuracy, we mainly use AUC as the measurement index. It is worth mentioning that AUC is not sensitive to whether the sample category is balanced, which is also a reason why AUC is usually used to evaluate the performance of classifier for unbalanced samples. AUC is defined as the area below the ROC curve. Among them, the ROC curve is plotted with the true positive rate (TPR) as the vertical axis and the false positive rate (FPR) as the horizontal axis. By adjusting the threshold, the probability is converted to the category, so that the TPR and the FPR are plotted as points, and the ROC curve is obtained. The calculation methods of FPR and TPR are as follows:
Among them, TP are samples truly positive and predicted to be positive, and FN are samples truly positive and predicted to be negative. Where FP are samples truly negative but predicted to be positive, TN are the samples truly positive but predicted to be negative. The ROC curve is generally above
According to the aforementioned indicators, we first commissioned the parameters in the model, including credibility and hypothetical position (Figure 8).
Credibility has a very significant meaning for the structure of similarity. For any two sequences
The hypothetical position roughly determines the number of elements to be extracted from each similarity sequence. Due to the uniqueness of different similarity sequences, the number of optimal elements is also different. Our approach does not necessarily guarantee that the most suitable elements are extracted for each sequence, but we can approach the optimal result by adjusting the hypothetical ratio
Comparisons With Existing Work
In recent years, researchers have proposed many miRNA–disease association prediction methods. However, the datasets or evaluation methods used in the existing methods are not consistent. Therefore, we mainly conduct comparative experiments based on five-fold cross-validation and leave-one-out cross-validation. To confirm the validity of the WVMDA prediction results, we compared our model with the previous three models: SVAEMDA (Ji et al., 2021), ICFMDA (Jiang et al., 2018), AEMDA (Ji et al., 2020), SACMDA (Shao et al., 2018), and GRL_2, 1-NMF (Gao et al., 2020). All models were cross-validated to calculate TPR and FPR, draw the ROC curve, and calculate AUC (Figure 9). The better the performance of the model, the farther its ROC curve is from the straight line
It can be seen from the figure that our method is more responsive and has achieved higher AUC values, which indicates that our method has good performance.
Case Study
In this paper, the prediction results of WVMDA were verified by taking prostate neoplasms as an example. We use the HMDD V2.0 database as the training database, and the dbDEMC 2.0 database and miR2Disease database as the validation databases, respectively, to measure the validity of the prediction model. The candidate miRNAs obtained from WVMDA were ranked according to the predicted score. If the miRNAs with the highest scores were found in the other two datasets, it would indicate that our model was effective. Finally, 28 of the top 30 miRNAs were verified in the other two datasets (Table 2). The results demonstrate the effectiveness of WVMDA in predicting unknown interactions between miRNA and disease.
Prediction of Unknown Disease
Predicting potential miRNAs associated with unknown diseases is a huge challenge. For the convenience of experiment and verification, we selected a disease and cleared its association nodes with all miRNAs, so as to make the disease as an unknown disease. Subsequently, we put all the remaining associations into the WVMDA and observe whether its prediction results can restore the miRNAs associated with this disease. We took breast neoplasms as the case and HMDD2.0 as the database for the experiment to find out whether the 30 miRNAs with the highest prediction rank were true associations. If not, we found out whether such associations existed in other databases.
The experimental results show that 28 associations were found in the HMDD dataset, and the remaining two associations were also found in the dbDEMC dataset (Table 3).
Discussion
The study of the possible relationship between miRNA and disease is helpful to understand the pathogenesis of disease and provide the basis for the prevention and treatment of disease. Therefore, we constructed a new miRNA–disease association prediction model based on weighted voting (WVMDA). By proposing credibility, we construct credibility similarity and use it to fill in the inadequacy of existing datasets. By designing a similarity filter, we filter the similarity to retain the reliable data and eliminate the noise. In the final weighted voting model, we mainly regulate the rationality and performance of the model based on three kinds of voting weights. It is worth mentioning that our method only needs positive samples to complete the prediction, which is very convenient for model construction and also reduces the requirements on datasets.
Under the framework of 5CV and global LOOCV, the AUC of WVMDA is 0.9537 and 0.9683, respectively, which is higher than the other methods. Furthermore, a case study on prostate neoplasm was implemented to evaluate the WVMDA model. Therefore, WVMDA can be used as a reliable biological tool for predicting potential disease-related miRNAs, and it can contribute to the discovery, prevention, and diagnosis of complex diseases. What is more, the WVMDA model still has room for improvement, and integrating more effective datasets will certainly bring great progress to future research.
Data Availability Statement
Publicly available datasets were analyzed in this study. This data can be found here: http://www.cuilab.cn/files/images/cuilab/misim.zip.
Author Contributions
SQ and YW conceived the project. ZZ provided methodological support. ZG and LL collected the data. ZZ performed the computational analysis and wrote the manuscript. CZ thoroughly revised the manuscript. All authors discussed the results and contributed to the final manuscript.
Funding
This work was supported by the National Natural Science Foundation of China (Nos. 61873001, U19A2064, 11701318), Natural Science Foundation of Shandong Province (grant number ZR2020KC022), and the Open Project of Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, Anhui University (No. MMC202006).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Blenkiron, C., Goldstein, L. D., Thorne, N. P., Spiteri, I., Chin, S.-F., Dunning, M. J., et al. (2007). Microrna expression profiling of human breast cancer identifies new markers of tumor subtype. Genome Biol. 8 (1), R214–R216. doi:10.1186/gb-2007-8-10-r214
Campbell, D. E., and Kelly, J. S. (2010). Strategy-proofness and weighted voting. Math. Soc. Sci. 60 (1), 15–23. doi:10.1016/j.mathsocsci.2010.02.001
Chen, X., Clarence Yan, C., Luo, C., Ji, W., Zhang, Y., and Dai, Q. (2015). Constructing lncRNA functional similarity network based on lncRNA-disease associations and disease semantic similarity. Sci. Rep. 5, 11338. doi:10.1038/srep11338
Chen, X., Li, S.-X., Yin, J., and Wang, C.-C. (2020). Potential miRNA-disease association prediction based on kernelized Bayesian matrix factorization. Genomics 112 (1), 809–819. doi:10.1016/j.ygeno.2019.05.021
Chen, X., Li, T.-H., Zhao, Y., Wang, C.-C., and Zhu, C.-C. (2020). Deep-belief network for predicting potential miRNA-disease associations. Brief. Bioinform. 22 (3), 10. doi:10.1093/bib/bbaa186
Chen, X., Sun, L.-G., and Zhao, Y. (2020). NCMCMDA: miRNA-disease association prediction through neighborhood constraint matrix completion. Brief. Bioinform. 22 (1), 485–496. doi:10.1093/bib/bbz159
Chen, X., Xie, D., Zhao, Q., and You, Z.-H. (2019). MicroRNAs and complex diseases: from experimental results to computational models. Brief. Bioinform. 20 (2), 515–539. doi:10.1093/bib/bbx130
Chen, X., Yan, C. C., Zhang, X., You, Z.-H., Huang, Y.-A., and Yan, G.-Y. (2016). HGIMDA: Heterogeneous graph inference for miRNA-disease association prediction. Oncotarget 7 (40), 65257–65269. doi:10.18632/oncotarget.11251
Chen, X., Zhang, D.-H., and You, Z.-H. (2018). A heterogeneous label propagation approach to explore the potential associations between miRNA and disease. J. Transl Med. 16, 348. doi:10.1186/s12967-018-1722-1
Chen, X., Zhou, C., Wang, C.-C., and Zhao, Y. (2021). Predicting potential small molecule-miRNA associations based on bounded nuclear norm regularization. Brief. Bioinform doi:10.1093/bib/bbab328
Esquela-Kerscher, A., and Slack, F. J. (2006). Oncomirs - microRNAs with a role in cancer. Nat. Rev. Cancer 6 (4), 259–269. doi:10.1038/nrc1840
Gao, Z., Wang, Y.-T., Wu, Q.-W., Ni, J.-C., and Zheng, C.-H. (2020). Graph regularized L2,1-nonnegative matrix factorization for miRNA-disease association prediction. BMC Bioinformatics 61 (1), 13. doi:10.1186/s12859-020-3409-x
Garzon, R., Fabbri, M., Cimmino, A., Calin, G. A., and Croce, C. M. (2006). MicroRNA expression and function in cancer. Trends Molecular Medicine 12, 580–587. doi:10.1016/j.molmed.2006.10.006
Huang, Q., Gumireddy, K., Schrier, M., le Sage, C., Nagel, R., Nair, S., et al. (2008). The microRNAs miR-373 and miR-520c promote tumour invasion and metastasis. Nat. Cel Biol 10 (2), 202–210. doi:10.1038/ncb1681
Huang, Z., Shi, J., Gao, Y., Cui, C., Zhang, S., Li, J., et al. (2018). HMDD v3.0: a database for experimentally supported human microRNA-disease associations. Nucleic Acids Res. 47, D1013–D1017. doi:10.1093/nar/gky1010
Ji, B.-Y., You, Z.-H., Chen, Z.-H., Wong, L., and Yi, H.-C. (2020). NEMPD: A network embedding-based miRNA-protein-disease network method for the miRNA-disease association prediction. Bioinformatics 21 (1), 401. doi:10.21203/rs.3.rs-28280/v2
Ji, C., Gao, Z., Ma, X., Wu, Q., Ni, J., and Zheng, C. (2020). AEMDA: Inferring miRNA-disease associations based on deep autoencoder. Bioinformatics 37 (1), 66–72. doi:10.1093/bioinformatics/btaa670
Ji, C., Wang, Y.-T., Gao, Z., Li, L., Ni, J.-C., and Zheng, C.-H. (2021). A semi-supervised learning method for miRNA-disease association prediction based on variational autoencoder. Ieee/acm Trans. Comput. Biol. Bioinf. (99), 1. doi:10.1109/TCBB.2021.3067338
Jiang, Q., Hao, Y., Wang, G., Juan, L., Zhang, T., Teng, M., et al. (2010). Prioritization of disease microRNAs through a human phenome-microRNAome network. BMC Syst. Biol. 4, S2. doi:10.1186/1752-0509-4-S1-S2
Jiang, Q., Wang, Y., Hao, Y., Juan, L., Teng, M., Zhang, X., et al. (2009). miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res. 37, D98–D104. doi:10.1093/nar/gkn714
Jiang, Y., Liu, B., Yu, L., Yan, C., and Bian, H. (2018). Predict MiRNA-Disease Association with Collaborative Filtering. Neuroinform 16 (3-4), 363–372. doi:10.1007/s12021-018-9386-9
Lee, R. C., and Ambros, V. (2001). An Extensive Class of Small RNAs in Caenorhabditis elegans. Science 294 (5543), 862–864. doi:10.1126/science.1065329
Li, J., Zhang, S., Liu, T., Ning, C., Zhang, Z., and Zhou, W. (2020). Neural inductive matrix completion with graph convolutional networks for miRNA-disease association prediction. Bioinformatics 36 (8), 2538–2546. doi:10.1093/bioinformatics/btz965
Li, Y., Qiu, C., Tu, J., Geng, B., Yang, J., Jiang, T., et al. (2014). HMDD v2.0: a database for experimentally supported human microRNA and disease associations. Nucl. Acids Res. 42, D1070–D1074. doi:10.1093/nar/gkt1023
Li, Z., Li, J., Nie, R., You, Z.-H., and Bao, W. (2021). A graph auto-encoder model for miRNA-disease associations prediction. Brief. Bioinform. 22, 25–34. doi:10.1093/bib/bbaa240
Liu, D., Huang, Y., Nie, W., Zhang, J., and Deng, L. (2021). SMALF: miRNA-disease associations prediction based on stacked autoencoder and XGBoost. BMC Bioinformatics 219 (1), 18. doi:10.1186/s12859-021-04135-2
Shao, B., Liu, B., and Yan, C. (2018). SACMDA: MiRNA-Disease Association Prediction with Short Acyclic Connections in Heterogeneous Graph. Neuroinform 16 (3-4), 373–382. doi:10.1007/s12021-018-9373-1
Tong, Z., and Kain, R. Y. (1988). Vote assignments in weighted voting mechanisms. Proc.symp.reliable Distributed Syst. October 40 (5), 664–667. doi:10.1023/A:1007024709087
Toprak, A., and Eryilmaz, E. (2020). Prediction of miRNA-disease associations based on weighted k-nearest known neighbors and network consistency projection. J. Bioinform. Comput. Biol. 19 (1), 2050041. doi:10.1142/S0219720020500419
Wang, D., Wang, J., Lu, M., Song, F., and Cui, Q. (2010). Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases. Bioinformatics 26, 1644–1650. doi:10.1093/bioinformatics/btq241
Wang, H., Zhu, Y., Zhao, M., Wu, C., Zhang, P., Tang, L., et al. (2013). miRNA-29c Suppresses Lung Cancer Cell Adhesion to Extracellular Matrix and Metastasis by Targeting Integrin β1 and Matrix Metalloproteinase2 (MMP2). PLOS ONE 8 (8), e70192. doi:10.1371/journal.pone.0070192
Yang, Y., Fu, X., Qu, W., Xiao, Y., and Shen, H.-B. (2018). MiRGOFS: a GO-based functional similarity measurement for miRNAs, with applications to the prediction of miRNA subcellular localization and miRNA-disease association. Bioinformatics 34 (20), 3547–3556. doi:10.1093/bioinformatics/bty343
Yang, Z., Wu, L., Wang, A., Tang, W., Zhao, Y., Zhao, H., et al. (2016). dbDEMC 2.0: updated database of differentially expressed miRNAs in human cancers. Nucleic Acids Res. 45, D812–D818. doi:10.1093/nar/gkw1079
Yu, S.-P., Liang, C., Xiao, Q., Li, G.-H., Ding, P.-J., and Luo, J.-W. (2018). MCLPMDA: A novel method for miRNA-disease association prediction based on matrix completion and label propagation. J. Cel Mol Med 23 (2), 1427–1438. doi:10.1111/jcmm.14048
Zhao, Y., Chen, X., and Yin, J. (2019). Adaptive boosting-based computational model for predicting potential miRNA-disease associations. Bioinformatics 35 (22), 4730–4738. doi:10.1093/bioinformatics/btz297
Keywords: miRNA-disease association, credibility similarity, weighted voting, miRNA, disease
Citation: Zhang Z-W, Gao Z, Zheng C-H, Li L, Qi S-M and Wang Y-T (2021) WVMDA: Predicting miRNA–Disease Association Based on Weighted Voting. Front. Genet. 12:742992. doi: 10.3389/fgene.2021.742992
Received: 17 July 2021; Accepted: 09 September 2021;
Published: 29 September 2021.
Edited by:
Yufei Huang, University of Pittsburgh Medical Center, United StatesReviewed by:
Xing Chen, China University of Mining and Technology, ChinaMin Wu, Institute for Infocomm Research (A∗STAR), Singapore
Copyright © 2021 Zhang, Gao, Zheng, Li, Qi and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Su-Min Qi, cWl4bUBxZm51LmVkdS5jbg==; Yu-Tian Wang, d3l0ZnV0dXJlQDE2My5jb20=