- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
Drug-target interaction (DTI) is the basis of drug discovery. However, it is time-consuming and costly to determine DTIs experimentally. Over the past decade, various computational methods were proposed to predict potential DTIs with high efficiency and low costs. These methods can be roughly divided into several categories, such as molecular docking-based, pharmacophore-based, similarity-based, machine learning-based, and network-based methods. Among them, network-based methods, which do not rely on three-dimensional structures of targets and negative samples, have shown great advantages over the others. In this article, we focused on network-based methods for DTI prediction, in particular our network-based inference (NBI) methods that were derived from recommendation algorithms. We first introduced the methodologies and evaluation of network-based methods, and then the emphasis was put on their applications in a wide range of fields, including target prediction and elucidation of molecular mechanisms of therapeutic effects or safety problems. Finally, limitations and perspectives of network-based methods were discussed. In a word, network-based methods provide alternative tools for studies in drug repurposing, new drug discovery, systems pharmacology and systems toxicology.
Introduction
As the rapid development of systems biology and network pharmacology, the drug discovery paradigm has changed from the linear mode “one drug → one target → one disease" to the network mode “multi-drugs → multi-targets → multi-diseases" (Hopkins, 2008; Medina-Franco et al., 2013; Anighoro et al., 2014). The new paradigm means that a single drug might act on multiple targets in vivo, rather than selectively bind to one target, which is more consistent with those observed in reality (Roth et al., 2004; Paolini et al., 2006; Yildirim et al., 2007). For a drug, its polypharmacological profile (i.e., on-target and off-target effects) could lead to both desired therapeutic effects and undesired safety problems (Roth et al., 2004; Keiser et al., 2009; Besnard et al., 2012; Lounkine et al., 2012; Anighoro et al., 2014; Zhang et al., 2015). Hence, systematic identification of drug-target interactions (DTIs) is essential in drug discovery, which could help maximize therapeutic effects while minimizing safety problems.
The traditional way to identify DTIs is via biological experiments, for example, to determine the inhibition constant (Ki), dissociation constant (Kd), half-maximal inhibitory concentration (IC50) or half-maximal effective concentration (EC50) values between drugs (e.g., approved drugs, drug candidates in clinical trials, drugs withdrawn from the market and drug-like new chemical entities) and target proteins by in vitro or in vivo assays. However, it is time-consuming and costly to determine all possible DTIs experimentally and systematically. Over the past decade, the development of various computational methods has provided valuable strategies for the systematic prediction of potential DTIs due to their high efficiency and low costs (Zheng et al., 2013; Chen et al., 2016; Lavecchia and Cerchia, 2016). On one hand, according to the type of prediction results, these methods can be divided into two categories, namely qualitative (i.e., classification) and quantitative (i.e., regression) methods. On the other hand, these methods can be roughly divided into several categories, including molecular docking-based, pharmacophore-based, similarity-based, machine learning-based, and network-based methods, although their concepts could overlap each other.
Molecular docking-based methods are traditional approaches based on the three-dimensional (3D) structures of targets, which have been widely used in DTI prediction (Rognan, 2010; Waszkowycz et al., 2011; Ma et al., 2013). These methods use scoring functions to evaluate DTIs, which can provide quantitative docking scores correlated with binding affinities (Li et al., 2014; Liu et al., 2017). For one or a few given targets such as estrogen receptors (Shen et al., 2010) or HIV-1 integrase (Hu et al., 2012b), potential active compounds can be prioritized by molecular docking. Reversely, for a given drug or new chemical entity, reverse docking (also known as inverse docking) can be used to predict potential targets for it (Chen and Zhi, 2001; Tang et al., 2006; Rognan, 2010). Several web applications, such as TarFisDock (Li et al., 2006) and DRAR-CPI (Luo et al., 2011), were built for docking-based target fishing.
Pharmacophore-based methods can be further divided into two subtypes, namely structure-based and ligand-based pharmacophore mapping. Both of them can be used in DTI prediction (Rognan, 2010; Yang, 2010). For example, PharmMapper is a web server which can predict potential targets for a submitted compound via structure-based pharmacophore mapping (Liu et al., 2010). In practical application, to improve the predictive accuracy, a strategy combining pharmacophore mapping and molecular docking is often used to predict potential DTIs, such as to find new ligands for a given receptor (Chen et al., 2014).
Similarity searching is also a traditional approach for DTI prediction (Willett et al., 1998), based on a hypothesis that similar drugs share similar targets and vice versa. Various types of similarity such as two-dimensional (2D) fingerprint-based similarity (Willett, 2006), 3D shape similarity (Hu et al., 2012a), and phenotypic similarity (Campillos et al., 2008) can be employed in similarity-based methods. For example, two web applications named similarity ensemble approach (SEA) (Keiser et al., 2007) and ChemMapper (Gong et al., 2013) use 2D and 3D similarity, respectively, to predict potential DTIs.
Machine learning is a general approach used in DTI prediction, which has been developing rapidly in these years (Ding et al., 2014; Chen et al., 2016). For example, we proposed two machine learning-based methods, namely multitarget-QSAR (mt-QSAR) and computational chemogenomic methods (Cheng et al., 2012c). Based on mt-QSAR, a web application named CPI-Predictor was developed for free use (Cheng et al., 2012c). Besides traditional machine learning techniques, deep learning techniques have been applied in DTI prediction recently (Tian et al., 2016; Wen et al., 2017). Currently, although 3D structure data of targets can be used in building machine learning models (Hwang et al., 2017), molecular descriptors and protein sequence descriptors are much more commonly used (Cheng et al., 2012c; Yu et al., 2012; Ding et al., 2014).
The above-mentioned methods have shown high accuracies and robustness in DTI prediction (Zheng et al., 2013; Chen et al., 2016; Lavecchia and Cerchia, 2016). However, there are still several pitfalls among them. Structure-based methods such as molecular docking and structure-based pharmacophore mapping rely on 3D structures of targets (Tang et al., 2006; Rognan, 2010). Hence, they are often limited by lack of high-quality 3D structures. For example, G protein-coupled receptors (GPCRs) are the largest protein family consisting of more than 800 members (Stevens et al., 2013), but only approximately 30 of them have resolved crystal structures yet (Xiang et al., 2016). Ligand-based pharmacophore mapping relies on the proper selection of training set compounds for building pharmacophore models, a problem often confused by users, even experienced ones (Yang, 2010). Similarity-based methods rely on similarity and also limited by the similarity. For instance, the methods based on chemical structure similarity could be difficult to find active compounds with novel scaffolds. The methods based on phenotypic similarity could be limited by lack of enough phenotypic data. In general, building machine learning models (especially supervised learning models) for DTI prediction requires both positive samples and negative samples (i.e., active and inactive DTIs validated by experiments) (Cheng et al., 2012c; Yu et al., 2012; Ding et al., 2014; Chen et al., 2016). However, it is always difficult to find enough number of experimentally validated negative samples with gold standard from publicly available database and literature (Cheng et al., 2012c; Yu et al., 2012). Although strategies such as “one versus the rest” (Cheng et al., 2012c) can be used to generate enough negative samples, the model performance is often influenced by the low-quality negative samples.
Compared to these methods, network-based methods have demonstrated great advantages. At first, network-based methods do not rely on 3D structures of targets or negative samples. These methods are derived from recommendation algorithms used in recommender systems (Lu et al., 2012) and link prediction algorithms in complex networks (Lu and Zhou, 2011). For example, more than ten years ago, Zhou et al. (2007) proposed a recommendation algorithm named network-based inference (NBI), also known as probabilistic spreading (ProbS) (Zhou et al., 2010), to recommend possible future likes (called objects) for users based on the known preference data of the users. By treating drugs and targets as users and objects, respectively, this algorithm was grafted into the research area of DTI prediction (Cheng et al., 2012b). As one of the simplest network-based methods, NBI can predict potential DTIs only using the known DTI network (i.e., positive samples), without any additional information such as chemical structures, protein structures or sequences. In the next few years, several new network-based methods were developed based on NBI (Cheng et al., 2012d; Wu et al., 2016, 2017) and other recommendation algorithms or link prediction algorithms (Cheng et al., 2013c; Duran et al., 2017). These methods all have the advantages that are independence on 3D structures of targets and negative samples, which enable them to cover much larger target space. Secondly, network-based methods are simple and fast. These methods predict potential DTIs only by performing simple physical processes such as resource diffusion (Cheng et al., 2012b,d; Wu et al., 2016, 2017), collaborative filtering (Cheng et al., 2012b, 2013c), and random walk (Chen et al., 2012) on networks. Considering networks can be represented by matrices, these processes can be described by simple matrix operations such as matrix multiplication mathematically (Chen et al., 2012; Cheng et al., 2012b; Wu et al., 2017). Compared with some complex structure-based and machine learning-based methods, the calculation procedures are simple and can be parallelized easily. Hence, network-based methods often run fast on computers.
In this article, we focused on network-based methods for DTI prediction. We first introduced the data sources for network construction and the methodologies of several representative network-based methods, especially a series of methods developed on the basis of NBI (we called them “NBI series methods”) (Cheng et al., 2012b,d; Wu et al., 2016, 2017). Subsequently, the evaluation approaches and indicators were introduced briefly. Then, the emphasis was put on their applications in a wide range of fields, including target prediction and elucidation of molecular mechanisms of therapeutic effects or safety problems. These applications suggest that network-based methods provide alternative tools for studies in drug repurposing, new drug discovery, systems pharmacology and systems toxicology.
Data Sources for Network Construction
Networks, especially DTI networks, are one of the most important bases of network-based models. To construct reliable networks, it is necessary to have sufficient amount of high-quality data. Fortunately, we are living in an era of big data (Schadt, 2012; Ma’ayan et al., 2014). There are a large number of data available freely online, from small molecules to biomacromolecules, from structures to properties, from raw data to organized data in different topics (Chen et al., 2016). Herein, we introduced several well-known data sources for network construction.
There are different ways to construct DTI networks. For example, we can download prepared DTI data from public databases such as DrugBank (Wishart et al., 2018) and Therapeutic Target Database (Li et al., 2018). The downloaded DTI pairs can be used to construct DTI networks directly. However, DTI data from these databases are not quantitative, because the experimentally determined activity values of the DTIs are not provided. It may cause problems in merging DTI data from different sources. By contrast, as shown in Table 1, there are many databases can provide experimentally determined DTI data with quantitative activity values such as Ki, Kd, IC50, and EC50 values, including BindingDB (Gilson et al., 2016), Binding MOAD (Ahmed et al., 2015), ChEMBL (Gaulton et al., 2017), DrugCentral (Ursu et al., 2017), IUPHAR/BPS Guide to PHARMACOLOGY (Harding et al., 2018), PDBbind-CN (Liu et al., 2015), PDSP Ki Database (Roth et al., 2000), PubChem BioAssay (Wang et al., 2017), RCSB Protein Data Bank (Rose et al., 2017), SuperTarget (Hecker et al., 2012), STITCH (Szklarczyk et al., 2016), TDR Targets (Magarinos et al., 2012), Thomson Reuters Integrity, etc. After collecting quantitative DTI data from these databases, we can use the same criteria for data filtering and merging. Then, one or more DTI networks can be constructed based on the prepared DTI pairs as below. If a drug and a target are validated to interact with each other by experiments (e.g., Ki, Kd, IC50 or EC50 ≤ 10 μM), the node representing the drug and the node representing the target are linked by an edge.
TABLE 1. Several representative databases containing experimentally determined DTI data with quantitative activity values.
In addition to known DTIs, we can also use other types of data to aid the DTI prediction. For example, chemical substructures can be generated for drugs using chemoinformatics software such as Open Babel (O’Boyle et al., 2011) and PaDEL-Descriptor (Yap, 2011). Anatomical Therapeutic Chemical classification (ATC) codes of drugs can be obtained from databases such as DrugBank (Wishart et al., 2018), DrugCentral (Ursu et al., 2017), and KEGG DRUG (Kanehisa et al., 2017). Side effects of drugs can be collected from Comparative Toxicogenomics Database (CTD) (Davis et al., 2017), SIDER (Kuhn et al., 2016), and OFFSIDES (Tatonetti et al., 2012). Sequences of target proteins can be downloaded from UniProt knowledgebase (Bateman et al., 2017). Using these data, we can generate more types of data. For example, chemical similarity of drug-drug pairs can be calculated using their substructures (Chen et al., 2012; Cheng et al., 2012b). Therapeutic and side-effect similarity networks of drug-drug pairs can be calculated using their ATC codes and side effects, respectively (Cheng et al., 2013c). Protein sequence similarity of target-target pairs can be calculated using their sequences (Chen et al., 2012; Cheng et al., 2012b). Using these different similarity data, various similarity networks can be constructed, which may be used in network-based methods together with DTI networks.
Methodologies of Network-Based Methods
As described in the INTRODUCTION section, network-based methods are derived from recommendation algorithms (Lu et al., 2012) and link prediction algorithms (Lu and Zhou, 2011). Previous reviews have suggested that these methods are different from machine learning-based methods and similarity-based methods (Ding et al., 2014; Chen et al., 2016). Although many recommendation algorithms and link prediction algorithms have been proposed up to date (Clauset et al., 2008; Guimera and Sales-Pardo, 2009; Lu and Zhou, 2011; Lu et al., 2012; Pan et al., 2016), few of them were applied in DTI prediction. In this section, we introduced the methodologies of several representative network-based methods, as listed in Table 2.
NBI Series Methods
NBI
NBI performs resource-diffusion processes on the known DTI network to prioritize potential DTIs (Cheng et al., 2012b). As shown in Figure 1A, if we want to predict potential targets for an example drug in the known DTI network (symbolized as Di), the following steps can be performed. Initially, one unit of resource is allocated to each of the neighbor nodes of Di (i.e., the target nodes linked with Di). Then, in the first resource-diffusion process, each target node equally spreads its resource to its neighbor nodes (i.e., the drug nodes linked with the target node). In the second resource-diffusion process, each drug node equally spreads its resource to its neighbor nodes (i.e., the target nodes linked with the drug node). After the two resource-diffusion processes, for each target in the known DTI network (symbolized as Tj), the amount of the resource located in Tj can be recognized as the predictive score of the interaction between Di and Tj. Higher score means higher probability that Di can interact with Tj. Using the same way, we can systematically predict potential DTIs for all drugs in the known DTI network. Although the resource-diffusion processes can be repeated continuously, we usually perform only two resource-diffusion processes.
FIGURE 1. Examples of predicting potential targets for an example drug highlighted with yellow shadow via three network-based methods: (A) network-based inference (NBI), (B) substructure-drug-target network-based inference (SDTNBI), (C) balanced substructure-drug-target network-based inference (bSDTNBI). Drugs, targets and substructures were represented as capsules, ribbons and structural fragments, respectively.
As one of the simplest network-based methods, NBI only uses a known DTI network as input. Although this characteristic makes NBI run fast, it limits the application domain of NBI enormously. NBI can only predict DTIs for the drugs and targets within the known DTI network. It cannot predict potential targets for new chemical entities without known targets (e.g., newly extracted natural products and newly synthesized compounds) and targets without known ligands (e.g., orphan receptors), owing to the fact that they cannot be interlinked with the known DTI network. Moreover, in the design of NBI, DTIs do not have any attributes, such as interaction types or binding affinities. Hence, it is necessary to further improve the NBI method.
EWNBI and NWNBI
To investigate whether introduction of edge or node weights would improve the performance of the original NBI method, we further developed two weighted NBI methods, namely edge-weighted network-based inference (EWNBI) and node-weighted network-based inference (NWNBI) (Cheng et al., 2012d). The two methods use different strategies to improve the predictive accuracy. EWNBI assigns weighted values to all edges (i.e., DTIs) in the known DTI network according to their Ki or IC50 values, whereas NWNBI introduces a tunable parameter to adjust the influence of hub nodes. The designs are reasonable. However, the performance of EWNBI was marginally worse than NBI. The performance of NWNBI was only marginally better than NBI after parameter optimization. These results suggested that adding node or edge weights directly into the original NBI method is not an appropriate strategy for performance improvement. Basically, EWNBI and NWNBI did not make a breakthrough in the old framework of NBI. All the defects of NBI were not repaired.
SDTNBI
In order to overcome one of the aforementioned pitfalls that NBI cannot predict targets for new chemical entities, we proposed a new network-based method entitled substructure-drug-target network-based inference (SDTNBI) (Wu et al., 2017). SDTNBI employs chemical substructures to bridge the gap between known DTI network and the new chemical entities. Specifically, after generating chemical substructures for both drugs in the known DTI network and new chemical entities, the known DTI network and the new chemical entities can be integrated in a structure-drug-target network via linking drugs and new chemical entities by their substructures. As shown in Figure 1B, resource-diffusion processes can be performed on the substructure-drug-target network to prioritize possible targets for known drugs and new chemical entities. The prediction steps for an example drug (symbolized as Di) can be described as below, which are similar to those of NBI. Because new chemical entities can be seen as special drugs without known targets, both drugs in the known DTI network and new chemical entities are called “drugs” to facilitate the description.
Initially, the number of resource-diffusion processes (symbolized as k, as shown in Figure 1) = 0, one unit of resource is located in each of the neighbor nodes of Di. Then, in the first resource-diffusion process (k = 1), each substructure equally spreads its resource to its neighbor nodes, meanwhile each target node also equally spreads its resource to its neighbor nodes. In the second resource-diffusion process (k = 2), each drug node equally spreads its resources to its neighbor nodes. The resource-diffusion processes can be repeated continuously. When the k value is an even number, for each target Tj, the amount of the resource located in Tj can be recognized as the predictive score of the interaction between Di and Tj. Using the same way, we can predict potential DTIs for all drugs and new chemical entities systematically and efficiently.
The development of SDTNBI overcame one of the pitfalls of NBI successfully. However, its performance is often worse than NBI (Wu et al., 2017). Considering that the substructure-drug-target network contains different types of nodes and different types of edges, the possible reason for this underperformance is that potential unbalance exists in the resource-diffusion processes of SDTNBI.
bSDTNBI
Recently, we made an improvement on SDTNBI by introducing three tunable parameters, namely α, β, and γ, into it, which led to the balanced substructure-drug-target network-based inference (bSDTNBI) (Wu et al., 2016). As shown in Figure 1C, the first parameter α ∈ [0,1] is used to adjust which type of nodes will obtain more amount of resource in the initial resource allocation. The second parameter β ∈ [0,1] is used to adjust which type of edges will have larger weighted values in the resource-diffusion processes. The third parameter γ ∈ (-∞,+∞) is used to adjust the influence of hub nodes in the resource-diffusion processes, where γ > 0 and γ < 0 mean strengthen and weaken the influence of hub nodes, respectively. A detailed mathematical description of NBI, SDTNBI, and bSDTNBI was put in the Supplementary data of our previous study (Wu et al., 2018). To compare the performance of bSDTNBI with those of aforementioned NBI and SDTNBI, we performed systematic evaluation and then found that bSDTNBI outperformed SDTNBI and was comparable to NBI when the three parameters were optimized (Wu et al., 2016).
Although aforementioned SDTNBI and bSDTNBI have the ability to prioritize possible targets for various types of compounds, they still cannot predict potential ligands for those targets without known ligands. Moreover, the interaction type and binding affinity of DTIs are not considered yet. Hence, it is still necessary to make further improvements on NBI series methods.
Similarity Inference Methods
Similarity inference methods are derived from collaborative-filtering algorithms in recommender systems (Huang et al., 2007). These methods have characteristics of both network-based methods and similarity-based methods, which use both topology information of the known DTI network and similarity information to predict potential DTIs (Cheng et al., 2012b, 2013c).
For example, in the previous study of the NBI method (Cheng et al., 2012b), we also proposed two similarity inference methods named drug-based similarity inference (DBSI) and target-based similarity inference (TBSI). DBSI is based on the hypothesis that drugs with similar 2D chemical structures tend to act on similar targets. TBSI is based on the hypothesis that proteins with similar sequences tend to be targeted by similar drugs. However, both DBSI and TBSI underperformed NBI in a systematic evaluation. This underperformance may be caused by the redundancy in the drug similarity and target similarity.
Subsequently, we developed two network-based methods named drug side-effect similarity inference (DSESI) and drug therapeutic similarity inference (DTSI) (Cheng et al., 2013c). DSESI and DTSI are based on the hypothesis that drugs with similar side effects and ATC codes tend to act on similar targets, respectively. We compared DSESI and DTSI with aforementioned DBSI and found that the performance of the three methods is DTSI > DBSI > DSESI.
Currently, similarity inference methods also have several pitfalls. Their application domains are often limited by lack of similarity data. For example, DSESI and DTSI cannot be used to predict potential DTIs for new chemical entities due to lack of side-effect data or ATC codes. Moreover, all these type of methods did not consider the interaction types and binding affinities.
Random Walk-Based Methods
Random walk is a classical concept which has been employed in various research areas, such as recommender systems (Liu and Lu, 2010) and prediction of gene-disease associations (Kohler et al., 2008). They can also be used for DTI prediction (Chen et al., 2012).
The most representative example of random walk-based method is Network-based Random Walk with Restart on the Heterogeneous network (NRWRH) (Chen et al., 2012). NRWRH predicts potential DTIs by performing random walk with restart on the heterogeneous network, which is constructed by integrating the known DTI network with chemical similarity of drugs and protein sequence similarity of targets. Based on systematic evaluation, NRWRH outperforms several machine learning-based methods (Chen et al., 2012). After that, NRWRH was improved by employing new types of similarity calculated by extended connectivity fingerprints, 2D pharmacophore fingerprints and ROCS program (Seal et al., 2015).
Mathematically, the descriptions of random walk-based methods are similar to those of the aforementioned NBI series methods. They use matrix multiplication to describe the random-walk and resource-diffusion processes, respectively. Owing to the similarity between these two types of methods, random walk-based methods have similar defects that NBI series methods have, such as cannot predict interaction types and binding affinities.
Local-Community-Paradigm Methods
The local-community-paradigm (LCP) theory was developed for link prediction in monopartite networks such as brain connectomes and protein interactomes (Cannistraci et al., 2013). Then, this theory was extended to bipartite networks (Daminelli et al., 2015). Recently, five LCP methods, including Cannistraci-Alanis-Ravasi (CAR), Cannistraci-Jaccard (CJC), Cannistraci preferential attachment (CPA), Cannistraci-Adamic-Adar (CAA), and Cannistraci resource allocation (CRA), were applied in prediction of potential DTIs and showed high performance (Duran et al., 2017). Currently, these LCP-based methods only rely on the known DTI network. Hence, they would meet the challenges that NBI has met, namely predicting potential DTIs for new chemical entities and targets outside of the known DTI network, as well as considering interaction types and binding affinities.
Other Network-Based Methods
In addition, there are some other types of network-based methods. For instance, DASPfind uses simple paths of particular lengths on the heterogeneous network for DTI prediction (Ba-Alawi et al., 2016). Similar to the aforementioned NRWRH (Chen et al., 2012), the heterogeneous network is also constructed by integrating DTI network with drug similarity and target similarity. In performance evaluation, DASPfind outperformed several previously published network-based methods such as NRWRH (Chen et al., 2012) and DT-Hybrid (Alaimo et al., 2013).
Evaluation of Network-Based Methods
Cross Validation
The 10-fold cross validation is an approach commonly used to evaluate the robustness of the models built via network-based methods (Cheng et al., 2012b,d; Wu et al., 2016, 2017, 2018; Fang et al., 2017b). In a 10-fold cross validation process, 10% of the DTIs are randomly extracted from the known DTI network as the test set in turn, while the remnant are used as the training set. Hence, 10 pairs of training set and test set are generated. Using these pairs, several evaluation indicators can be calculated. Generally, for a network-based model, the 10-fold cross validation process will be repeated several (e.g., 10) times to reduce the randomness. Finally, based on the results from all 10-fold cross validation processes, the evaluation indicators can be expressed as mean ± standard deviation (SD) or mean ± standard error of the mean (SEM).
External Validation
The external validation is an approach commonly used to evaluate the generalization ability of the models built via network-based methods (Cheng et al., 2012d; Wu et al., 2016, 2017; Fang et al., 2017b). Before validation, we need to collect a lot of extra experimentally validated DTIs which do not existed in the known DTI network. Then, the whole known DTI network are used as the training set, while the collected extra DTIs are used as the test set. In external validation, the test set is commonly known as external validation set. Using this pair of training set and test set, evaluation indicators can be calculated.
Evaluation Indicators
Currently, to our knowledge, there are several types of evaluation indicators. For network-based methods, a popular type of evaluation indicators is from recommender systems (Zhou et al., 2010; Lu et al., 2012), such as precision (P), recall (R), precision enhancement (eP), and recall enhancement (eR). Compared with those widely used in evaluating machine learning models for prediction of ADMET properties (Cheng et al., 2012a, 2013a) and DTIs (Cheng et al., 2012c), such as sensitivity and specificity, the evaluation indicators from recommender systems are more personalized and thus suitable for the network-based models that were derived from recommendation algorithms (Cheng et al., 2012b,d; Wu et al., 2016, 2017, 2018; Fang et al., 2017b). Although several evaluation indicators (e.g., P and R) have the same name as those used in evaluating machine learning models, their definitions are different. Herein, we would briefly described how to calculate the personalized evaluation indicators using a pair of training set and test set, referred as to our previous studies (Cheng et al., 2012b,d; Wu et al., 2016, 2017, 2018; Fang et al., 2017b).
At first, for the pair of training set and test set, nodes which lost all its edges in the training set are removed from both the training set and the test set. After predicting all potential DTIs using the training set and required additional information [e.g., drug-substructure associations for SDTNBI (Wu et al., 2017) and bSDTNBI (Wu et al., 2016)], evaluation indicators can be calculated by comparing the predicted DTIs with the known DTIs in the test set. In general, drugs without known DTIs in the test set do not participate in calculation to avoid invalid values such as infinite. For each drug Di participated in calculation, the newly predicted DTIs of Di are sorted by their predictive scores. Then, under a user-given threshold such as L = 20, the DTIs ranked in the top-L places are considered as positive, whereas the others were considered as negative. By comparing those newly predicted DTIs of Di that are considered as positive or negative with the known DTIs of Di in the test set, the numbers of true positives TPi(L), false positives FPi(L), true negatives TNi(L), and false negatives FNi(L) were counted. After counting the four numbers for each drug participated in calculation, four evaluation indicators P, R, eP, and eR can be calculated as:
In these formulas, M and N are the number of drugs and targets participated in calculation, respectively, and Xi is the number of known DTIs of Di in the test set.
However, all the four evaluation indicators depend on the threshold L, whose value may be hard to choose by users. Hence, an evaluation indicator independent of the L value, namely area under receiver operating characteristic curve (AUC), is usually employed (Wu et al., 2016, 2017, 2018; Fang et al., 2017b). The calculation of the AUC value can also be described using the aforementioned TPi(L), FPi(L), TNi(L), FNi(L). For a given L value, a true positive rate (TPR) and a false positive rate (FPR) can be calculated as:
By setting different L values, for example, varying L from 0 to N, a series of TPRs and FPRs can be obtained. Then, a receiver operating characteristic (ROC) curve can be generated by plotting the TPRs against the FPRs. The AUC is the area under the ROC curve.
For a model, higher P, R, eP, eR, and AUC values obtained in cross validation or external validation indicate higher performance, generally.
Applications in Target Prediction
A major application area of the network-based methods for DTI prediction is target prediction. By combining target prediction and experimental validation, we may exploit new uses for approved drugs (Cheng et al., 2012b) and new chemical entities (Wu et al., 2016, 2018), and hence facilitate drug repurposing (also known as drug repositioning) and new drug discovery.
Target Prediction for Approved Drugs
Drug repurposing usually has lower costs and higher successful rate in contrast to new drug discovery (Ashburn and Thor, 2004; Chong and Sullivan, 2007). Over the past decade, various computational methods were developed for drug repurposing (Vanhaelen et al., 2017). A classical strategy is to predict new indications for old drugs directly. For example, a method named MANTRA predicts therapeutic effects for drugs by indentifying network communities in a drug-drug network constructed by calculating the gene expression profile similarity (Iorio et al., 2010). Another method named PREDICT employs multiple drug similarity and disease similarity for large-scale prediction of drug indications (Gottlieb et al., 2011). Recently, bi-directional random walk was also employed to predict drug-disease associations (Luo et al., 2016).
However, targets were not included in the framework of these methods. It may be difficult to understand molecular mechanisms of the new indications. Network-based methods can solve this pitfall and be fruitfully applied in this area. Via an indirect strategy, we may predict potential targets for approved drugs, and hence discover potential new indications of the drugs. For instance, in a previous study (Cheng et al., 2012b), we performed the NBI method on a global DTI network. Nine and 31 approved drugs predicted to target dipeptidyl peptidase IV (DPP4) and estrogen receptors (ERs) with high predictive scores were purchased for experimental assays, respectively. Among the 40 purchased approved drugs, montelukast on DPP4 as well as diclofenac, simvastatin, ketoconazole, and itraconazole on ERs, were validated by in vitro bioassays with IC50 or EC50 values less than 10 μM (Table 3). Furthermore, simvastatin and ketoconazole showed anti-proliferative activities on human MDA-MB-231 breast cancer cell line in MTT assays with IC50 values less than 10 μM, suggesting that these antifungal agents may have therapeutic effects on breast cancer.
Target Prediction for New Chemical Entities
For new chemical entities, we would briefly describe two examples of finding new active compounds on nuclear receptors (Wu et al., 2016) and GPCRs (Wu et al., 2018), respectively.
In a previous study (Wu et al., 2016), we screened potential ligands for a nuclear receptor named ERα, which served as a potential target for ERα-positive breast cancer (Nilsson et al., 2011), from the Enamine database1 via a strategy combining 2D chemical similarity searching and the bSDTNBI method. From the prediction results, 56 commercially available compounds predicted to target ERα were purchased for in vitro assays. 27 of them showed potential activities on ERα with IC50 or EC50 less than 10 μM, suggesting the high performance of our bSDTNBI method (Wu et al., 2016). These new ERα ligands may provide lead compounds for the targeted therapy of ERα-positive breast cancer.
Recently, to investigate polypharmacology of GPCR ligands, we constructed global network-based models for human GPCRs via the bSDTNBI method and three types of molecular fingerprints (Wu et al., 2018). The global network-based model with the best performance in cross validation was employed to predict potential new GPCR targets for known GPCR ligands. 20 compounds predicted to target a GPCR named prostaglandin E2 receptor EP4 subtype were purchased for in vitro assays. Among these purchased compounds, AM966 and Ki16425, two known antagonists for lysophosphatidic acid receptors, showed potential antagonistic activities on EP4 in both calcium flux assay and cAMP assay with IC50 values less than 10 μM (Table 3), providing potential lead compounds for the therapy of colon cancer, lung cancer, osteoporosis and rheumatoid arthritis (Wu et al., 2018).
Applications in Elucidation of Molecular Mechanisms
Another major application area of the network-based methods for DTI prediction is to elucidate potential molecular mechanisms of therapeutic effects or safety problems (e.g., toxicity and side effects), which may facilitate systems pharmacology or systems toxicology.
A commonly used approach of deciphering molecular mechanisms is to construct and analyze drug-gene-disease networks (Cheng et al., 2013c,d; Wu et al., 2016, 2017, 2018; Fang et al., 2017b). Specifically, for a class of drugs or a type of disease of interests, a drug-gene-disease network can be constructed by integrating the known and predicted DTIs with gene-disease associations. The gene-disease associations are usually collected from databases such as CTD (Davis et al., 2017), HuGE Navigator (Yu et al., 2008), Online Mendelian Inheritance in Man (Amberger et al., 2015), and PharmGKB (Hewett et al., 2002). After construction of the drug-gene-disease network, network visualization tools such as Cytoscape (Smoot et al., 2011) can be used to show the network visually. In addition to network visualization, various bioinformatics enrichment tools (Huang et al., 2009), such as gene set enrichment analysis (Subramanian et al., 2005), can be employed to analyze the functions of the genes in the network. Based on the systematic analysis results from different angles as well as previously published data and literature in pharmacology and clinics, we may understand molecular mechanisms of the drugs in the drug-gene-disease network.
In this section, we provided several examples of elucidating molecular mechanisms of therapeutic effects or safety problems for approved drugs (Cheng et al., 2013c; Wu et al., 2016, 2017, 2018), natural products (Fang et al., 2017b) and xenobiotics (Cheng et al., 2013d) via network-based methods.
Elucidation of Molecular Mechanisms of Therapeutic Effects
Recent studies have shown that the use of NSAIDs is associated with lower risk of cancer (Nan et al., 2015). However, molecular mechanisms of the chemoprevention by NSAIDs are still not well understood. In a previous case study of the SDTNBI method (Wu et al., 2017), we used the SDTNBI method to predict potential DTIs for NSAIDs. A drug-gene-disease network containing 21 NSAIDs and 29 cancer types or subtypes were constructed by integrating the known and predicted DTIs with gene-disease associations. Several newly predicted DTIs were validated by previously reported literature, suggesting the high performance of our SDTNBI method. After performing systematic analysis on the drug-gene-disease network using previously published pharmacological experiments and co-crystal structure data, we found that NSAIDs may exert anticancer effect by inhibiting their targets associated with cancer, such as prostaglandin G/H synthase 2 (PTGS2, also known as cyclooxygenase-2), aldo-keto reductase family 1 member C3 (AKR1C3), carbonic anhydrase 9 (CA9), carbonic anhydrase 12 (CA12) and cyclin-dependent kinase 2 (CDK2).
Subsequently, in a case study of the bSDTNBI method (Wu et al., 2016), we investigated molecular mechanism of anticancer effects of approved drugs in a larger scale. After predicting potential DTIs for approved drugs via the bSDTNBI method, a global drug-gene-disease network containing 666 approved drugs and 15 cancer types or subtypes were built by integrating the known and predicted DTIs with the gene-disease associations. Based on systematic analysis, we demonstrated that tricyclic anti-depressant drugs and anti-diabetic drugs may exert anticancer effects by targeting serotonin receptors and cancer cell metabolism, respectively. These two case studies showed the practical application of network-based methods in elucidating therapeutic effects of approved drugs.
Besides approved drugs, network-based methods can also be used to elucidate the therapeutic effects of natural products. For example, in a recent study (Fang et al., 2017b), we utilized the bSDTNBI method to build global network-based models for natural compounds from traditional chinese medicine (TCM) databases. The best bSDTNBI model in cross validation was used to predict potential DTIs for natural products. Then, potential anticancer indications of the natural products were further predicted via gene-disease associations for cancer collected from public available databases and a statistical approach based on permutation test (Fang et al., 2017b). Taking three natural products, namely kaempferol, resveratrol, and genistein, as examples, drug-gene-disease networks were constructed. After performing systematic analysis using the networks and previously published evidence, we found that the three natural products may exert anticancer effects by inhibiting different cancer-associated proteins and pathways. Moreover, the similar workflow was also used to investigate molecular mechanism of anti-aging effects of natural products (Fang et al., 2017a).
Elucidation of Molecular Mechanisms of Safety Problems
In addition to therapeutic effects, network-based models can also be applied in elucidating molecular mechanisms of safety problems, such as side effects (Cheng et al., 2013c) and toxicity (Cheng et al., 2013d).
For side effects, in a recent study (Wu et al., 2018), the bSDTNBI method was used to predict potential targets for GPCR drugs. Then, drug-gene-disease networks were constructed for two example GPCR drugs named clemastine and dobutamine, by integrating the known and predicted DTIs as well as side-effect data from MetaADEDB (Cheng et al., 2013b) and Lounkine’s study (Lounkine et al., 2012). Via systematic analysis on the networks, we identified that the cardiovascular complications of GPCR drugs were associated with their off-target effects on α-adrenergic receptor and muscarinic acetylcholine receptors.
For toxicity, in a previous study (Cheng et al., 2013d), we proposed a computational systems toxicology framework based on the NBI method, named predictive toxicogenomics-derived models (PTDMs), to help understand how the xenobiotics (e.g., drugs, industrial chemicals and pesticides) influence human health and the environment. At first, three networks were constructed for chemical-gene interactions (CGIs), chemical-disease associations (CDAs) and gene-disease associations (GDAs). Herein, different from the above studies, the CGIs included both direct chemical-protein interactions and indirect chemical-gene associations. Then, new potential CGIs, CDAs, and GDAs were prioritized via performing the NBI method on the three networks, respectively. Based on the known and predicted chemical-gene-disease association data, we systematically investigate the toxicological mechanisms of an endocrine disrupter named bisphenol A (BPA). Some predicted associations for BPA were in agreement with previously published data, suggesting the potential application of network-based methods in elucidating the toxicological mechanisms of xenobiotics.
All these examples of elucidating molecular mechanism of therapeutic effects or safety problems illustrate the potential applications of network-based methods in systems pharmacology and systems toxicology. However, to date, most of the studies are just pure computational studies. Further biological assays and clinical studies are needed to validate the predictive results in the future.
Summary and Perspectives
Since the new century, as the rapid development of systems biology and network pharmacology, various computational methods were proposed for DTI prediction with high efficiency and low costs (Zheng et al., 2013; Chen et al., 2016; Lavecchia and Cerchia, 2016). Among these methods, network-based methods have shown obvious advantages (Cheng et al., 2012b,d; Wu et al., 2016, 2017). As mentioned above, this category of methods relies on neither 3D structures of targets nor negative samples, which can cover much larger target space. Although network-based methods only perform simple mathematical operations such as matrix multiplication in prediction, high performance has shown not only in theory but also in potential applications, including target prediction (Cheng et al., 2012b; Wu et al., 2016, 2018) as well as elucidation of molecular mechanisms of both therapeutic effects (Wu et al., 2016, 2017; Fang et al., 2017b) and safety problems (Cheng et al., 2013c,d; Wu et al., 2018).
Despite the success of currently available network-based methods, there are still several pitfalls. First, the application domain of these methods still needs to be extended. Although several recently proposed network-based methods can be used to predict potential DTIs for both approved drugs and new chemical entities (Wu et al., 2016, 2017), they cannot predict potential DTIs for those targets without known ligands. Second, network-based methods are still non-quantitative. They only provide a predictive score for each potential DTI, where a higher score means a higher probability of occurrence (Cheng et al., 2012b,d; Wu et al., 2016, 2017). The binding affinities of the predicted DTIs are unknown. Moreover, the interaction type is not considered yet. In the real world, there are different types of DTIs. For example, receptors have agonists, antagonists and inverse antagonists, while enzymes have activators and inhibitors. However, to our knowledge, no network-based methods have considered the interaction type yet—DTIs were simply seen as indirect edges without any additional attributes. By contrast, efforts have been made in other types of computational methods in these years. In the aspect of quantitative prediction, as described in the INTRODUCTION section, many structure-based methods have the ability to predict binding affinities of DTIs (Aldeghi et al., 2017; Liu et al., 2017). In the aspect of prediction of interaction types, Wang and Zeng (2013) have proposed the first machine learning-based method to predict the interaction type. To avoid losing competitiveness, it is urgently needed to develop novel network-based methods.
In our views, there are several possible strategies to further improve the network-based methods for DTI prediction. At first, we can try to introduce new link prediction algorithms into our research area, such as hierarchical structure (Clauset et al., 2008), stochastic block (Guimera and Sales-Pardo, 2009), and likelihood analysis (Pan et al., 2016). Secondly, we can integrate multi-scale biomedical data, including drug-side effect associations (Tatonetti et al., 2012; Kuhn et al., 2016), drug-indication associations (Brown and Patel, 2017), drug-induced gene expression profiles (Lamb et al., 2006; Subramanian et al., 2017), protein-protein interactions (Li et al., 2017), ADMET properties (Cheng et al., 2012a), clinical data (Zarin et al., 2011), etc. In addition, we can learn something from structure-based methods and make full use of 3D structures of targets. Although many targets do not have 3D structures yet, the already resolved 3D structures are valuable information. For example, as a simplest way, we can use docking scores to improve the predictive scores of network-based methods via a consensus approach. These may help us move toward structural systems pharmacology (Xie et al., 2014).
In summary, although the network-based for DTI prediction still have limitations, they provide alternative tools for studies in drug repurposing, new drug discovery, systems pharmacology and systems toxicology. We hope they would play a greater role in the future.
Author Contributions
YT conceived and directed the project. YT and ZW designed the review and wrote the manuscript. WL and GL provided useful advice. All authors read and approved the final version of the manuscript.
Funding
This work was supported by the National Key Research and Development Program of China (Grant 2016YFA0502304), the National Natural Science Foundation of China (Grants 81673356 and U1603122), and the 111 Project (Grant B07023).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Footnotes
References
Ahmed, A., Smith, R. D., Clark, J. J., Dunbar, J. B., and Carlson, H. A. (2015). Recent improvements to Binding MOAD: a resource for protein-ligand binding affinities and structures. Nucleic Acids Res. 43, D465–D469. doi: 10.1093/nar/gku1088
Alaimo, S., Pulvirenti, A., Giugno, R., and Ferro, A. (2013). Drug-target interaction prediction through domain-tuned network-based inference. Bioinformatics 29, 2004–2008. doi: 10.1093/bioinformatics/btt307
Aldeghi, M., Heifetz, A., Bodkin, M. J., Knapp, S., and Biggin, P. C. (2017). Predictions of ligand selectivity from absolute binding free energy calculations. J. Am. Chem. Soc. 139, 946–957. doi: 10.1021/jacs.6b11467
Amberger, J. S., Bocchini, C. A., Schiettecatte, F., Scott, A. F., and Hamosh, A. (2015). OMIM.org: online Mendelian Inheritance in Man (OMIM (R)), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 43, D789–D798. doi: 10.1093/nar/gku1205
Anighoro, A., Bajorath, J., and Rastelli, G. (2014). Polypharmacology: challenges and opportunities in drug discovery. J. Med. Chem. 57, 7874–7887. doi: 10.1021/jm5006463
Ashburn, T. T., and Thor, K. B. (2004). Drug repositioning: identifying and developing new uses for existing drugs. Nat. Rev. Drug Discov. 3, 673–683. doi: 10.1038/nrd1468
Ba-Alawi, W., Soufan, O., Essack, M., Kalnis, P., and Bajic, V. B. (2016). DASPfind: new efficient method to predict drug-target interactions. J. Cheminform. 8:15. doi: 10.1186/s13321-016-0128-4
Bateman, A., Martin, M. J., O’Donovan, C., Magrane, M., Alpi, E., Antunes, R., et al. (2017). UniProt: the universal protein knowledgebase. Nucleic Acids Res. 45, D158–D169. doi: 10.1093/nar/gkw1099
Besnard, J., Ruda, G. F., Setola, V., Abecassis, K., Rodriguiz, R. M., Huang, X. P., et al. (2012). Automated design of ligands to polypharmacological profiles. Nature 492, 215–220. doi: 10.1038/nature11691
Brown, A. S., and Patel, C. J. (2017). A standard database for drug repositioning. Sci. Data 4:170029. doi: 10.1038/sdata.2017.29
Campillos, M., Kuhn, M., Gavin, A. C., Jensen, L. J., and Bork, P. (2008). Drug target identification using side-effect similarity. Science 321, 263–266. doi: 10.1126/science.1158140
Cannistraci, C. V., Alanis-Lobato, G., and Ravasi, T. (2013). From link-prediction in brain connectomes and protein interactomes to the local-community-paradigm in complex networks. Sci. Rep. 3:1613. doi: 10.1038/srep01613
Chen, L., Wu, D., Bian, H. P., Kuang, G. L., Jiang, J., Li, W. H., et al. (2014). Selective ligands of estrogen receptor β discovered using pharmacophore mapping and structure-based virtual screening. Acta Pharmacol. Sin. 35, 1333–1341. doi: 10.1038/aps.2014.69
Chen, X., Liu, M. X., and Yan, G. Y. (2012). Drug-target interaction prediction by random walk on the heterogeneous network. Mol. Biosyst. 8, 1970–1978. doi: 10.1039/c2mb00002d
Chen, X., Yan, C. C., Zhang, X. T., Zhang, X., Dai, F., Yin, J., et al. (2016). Drug-target interaction prediction: databases, web servers and computational models. Brief Bioinform. 17, 696–712. doi: 10.1093/bib/bbv066
Chen, Y. Z., and Zhi, D. G. (2001). Ligand-protein inverse docking and its potential use in the computer search of protein targets of a small molecule. Proteins 43, 217–226. doi: 10.1002/1097-0134(20010501)43:2<217::AID-PROT1032>3.0.CO;2-G
Cheng, F. X., Li, W. H., Liu, G. X., and Tang, Y. (2013a). In silico ADMET prediction: recent advances, current challenges and future trends. Curr. Top. Med. Chem. 13, 1273–1289. doi: 10.2174/15680266113139990033
Cheng, F. X., Li, W. H., Wang, X. C., Zhou, Y. D., Wu, Z. R., Shen, J., et al. (2013b). Adverse drug events: database construction and in silico prediction. J. Chem. Inf. Model. 53, 744–752. doi: 10.1021/ci4000079
Cheng, F. X., Li, W. H., Wu, Z. R., Wang, X. C., Zhang, C., Li, J., et al. (2013c). Prediction of polypharmacological profiles of drugs by the integration of chemical, side effect, and therapeutic space. J. Chem. Inf. Model. 53, 753–762. doi: 10.1021/ci400010x
Cheng, F. X., Li, W. H., Zhou, Y. D., Li, J., Shen, J., Lee, P. W., et al. (2013d). Prediction of human genes and diseases targeted by xenobiotics using predictive toxicogenomic-derived models (PTDMs). Mol. Biosyst. 9, 1316–1325. doi: 10.1039/c3mb25309k
Cheng, F. X., Li, W. H., Zhou, Y. D., Shen, J., Wu, Z. R., Liu, G. X., et al. (2012a). admetSAR: A comprehensive source and free tool for assessment of chemical ADMET properties. J. Chem. Inf. Model. 52, 3099–3105. doi: 10.1021/ci300367a
Cheng, F. X., Liu, C., Jiang, J., Lu, W. Q., Li, W. H., Liu, G. X., et al. (2012b). Prediction of drug-target interactions and drug repositioning via network-based inference. PLoS Comput. Biol. 8:e1002503. doi: 10.1371/journal.pcbi.1002503
Cheng, F. X., Zhou, Y. D., Li, J., Li, W. H., Liu, G. X., and Tang, Y. (2012c). Prediction of chemical-protein interactions: multitarget-QSAR versus computational chemogenomic methods. Mol. Biosyst. 8, 2373–2384. doi: 10.1039/c2mb25110h
Cheng, F. X., Zhou, Y. D., Li, W. H., Liu, G. X., and Tang, Y. (2012d). Prediction of chemical-protein interactions network with weighted network-based inference method. PLoS One 7:e41064. doi: 10.1371/journal.pone.0041064
Chong, C. R., and Sullivan, D. J. (2007). New uses for old drugs. Nature 448, 645–646. doi: 10.1038/448645a
Clauset, A., Moore, C., and Newman, M. E. J. (2008). Hierarchical structure and the prediction of missing links in networks. Nature 453, 98–101. doi: 10.1038/nature06830
Daminelli, S., Thomas, J. M., Duran, C., and Cannistraci, C. V. (2015). Common neighbours and the local-community-paradigm for topological link prediction in bipartite networks. New J. Phys. 17:113037. doi: 10.1088/1367-2630/17/11/113037
Davis, A. P., Grondin, C. J., Johnson, R. J., Sciaky, D., King, B. L., McMorran, R., et al. (2017). The comparative toxicogenomics database: update 2017. Nucleic Acids Res. 45, D972–D978. doi: 10.1093/nar/gkw838
Ding, H., Takigawa, I., Mamitsuka, H., and Zhu, S. F. (2014). Similarity-based machine learning methods for predicting drug-target interactions: a brief review. Brief. Bioinform. 15, 734–747. doi: 10.1093/bib/bbt056
Duran, C., Daminelli, S., Thomas, J. M., Haupt, V. J., Schroeder, M., and Cannistraci, C. V. (2017). Pioneering topological methods for network-based drug-target prediction by exploiting a brain-network self-organization theory. Brief. Bioinform. doi: 10.1093/bib/bbx041
Fang, J. S., Gao, L., Ma, H. L., Wu, Q. H., Wu, T., Wu, J., et al. (2017a). Quantitative and systems pharmacology 3. Network-based identification of new targets for natural products enables potential uses in aging-associated disorders. Front. Pharmacol. 8:747. doi: 10.3389/fphar.2017.00747
Fang, J. S., Wu, Z. R., Cai, C. P., Wang, Q., Tang, Y., and Cheng, F. X. (2017b). Quantitative and Systems Pharmacology. 1. In silico prediction of drug-target interaction of natural products enables new targeted cancer therapy. J. Chem. Inf. Model. 57, 2657–2671. doi: 10.1021/acs.jcim.7b00216
Gaulton, A., Hersey, A., Nowotka, M., Bento, A. P., Chambers, J., Mendez, D., et al. (2017). The ChEMBL database in 2017. Nucleic Acids Res. 45, D945–D954. doi: 10.1093/nar/gkw1074
Gilson, M. K., Liu, T. Q., Baitaluk, M., Nicola, G., Hwang, L., and Chong, J. (2016). BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res. 44, D1045–D1053. doi: 10.1093/nar/gkv1072
Gong, J. Y., Cai, C. Q., Liu, X. F., Ku, X., Jiang, H. L., Gao, D. Q., et al. (2013). ChemMapper: a versatile web server for exploring pharmacology and chemical structure association based on molecular 3D similarity method. Bioinformatics 29, 1827–1829. doi: 10.1093/bioinformatics/btt270
Gottlieb, A., Stein, G. Y., Ruppin, E., and Sharan, R. (2011). PREDICT: a method for inferring novel drug indications with application to personalized medicine. Mol. Syst. Biol. 7:496. doi: 10.1038/msb.2011.26
Guimera, R., and Sales-Pardo, M. (2009). Missing and spurious interactions and the reconstruction of complex networks. Proc. Natl. Acad. Sci. U.S.A. 106, 22073–22078. doi: 10.1073/pnas.0908366106
Harding, S. D., Sharman, J. L., Faccenda, E., Southan, C., Pawson, A. J., Ireland, S., et al. (2018). The IUPHAR/BPS guide to pharmacology in 2018: updates and expansion to encompass the new guide to immunopharmacology. Nucleic Acids Res. 46, D1091–D1106. doi: 10.1093/nar/gkx1121
Hecker, N., Ahmed, J., Eichborn, J., Dunkel, M., Macha, K., Eckert, A., et al. (2012). SuperTarget goes quantitative: update on drug-target interactions. Nucleic Acids Res. 40, D1113–D1117. doi: 10.1093/nar/gkr912
Hewett, M., Oliver, D. E., Rubin, D. L., Easton, K. L., Stuart, J. M., Altman, R. B., et al. (2002). PharmGKB: the pharmacogenetics knowledge base. Nucleic Acids Res. 30, 163–165. doi: 10.1093/nar/30.1.163
Hopkins, A. L. (2008). Network pharmacology: the next paradigm in drug discovery. Nat. Chem. Biol. 4, 682–690. doi: 10.1038/nchembio.118
Hu, G. P., Kuang, G. L., Xiao, W., Li, W. H., Liu, G. X., and Tang, Y. (2012a). Performance evaluation of 2D fingerprint and 3D shape similarity methods in virtual screening. J. Chem. Inf. Model. 52, 1103–1113. doi: 10.1021/ci300030u
Hu, G. P., Li, X., Zhang, X., Li, Y. Z., Ma, L., Yang, L. M., et al. (2012b). Discovery of inhibitors to block interactions of HIV-1 integrase with human LEDGF/p75 via structure-based virtual screening and bioassays. J. Med. Chem. 55, 10108–10117. doi: 10.1021/jm301226a
Huang, D. W., Sherman, B. T., and Lempicki, R. A. (2009). Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37, 1–13. doi: 10.1093/nar/gkn923
Huang, Z., Zeng, D., and Chen, H. C. (2007). A comparison of collaborative-filtering recommendation algorithms for e-commerce. IEEE Intell. Syst. 22, 68–78. doi: 10.1109/MIS.2007.4338497
Hwang, H., Dey, F., Petrey, D., and Honig, B. (2017). Structure-based prediction of ligand-protein interactions on a genome-wide scale. Proc. Natl. Acad. Sci. U.S.A. 114, 13685–13690. doi: 10.1073/pnas.1705381114
Iorio, F., Bosotti, R., Scacheri, E., Belcastro, V., Mithbaokar, P., Ferriero, R., et al. (2010). Discovery of drug mode of action and drug repositioning from transcriptional responses. Proc. Natl. Acad. Sci. U.S.A. 107, 14621–14626. doi: 10.1073/pnas.1000138107
Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y., and Morishima, K. (2017). KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45, D353–D361. doi: 10.1093/nar/gkw1092
Keiser, M. J., Roth, B. L., Armbruster, B. N., Ernsberger, P., Irwin, J. J., and Shoichet, B. K. (2007). Relating protein pharmacology by ligand chemistry. Nat. Biotechnol. 25, 197–206. doi: 10.1038/nbt1284
Keiser, M. J., Setola, V., Irwin, J. J., Laggner, C., Abbas, A. I., Hufeisen, S. J., et al. (2009). Predicting new molecular targets for known drugs. Nature 462, 175–181. doi: 10.1038/nature08506
Kohler, S., Bauer, S., Horn, D., and Robinson, P. N. (2008). Walking the interactome for prioritization of candidate disease genes. Am. J. Hum. Genet. 82, 949–958. doi: 10.1016/j.ajhg.2008.02.013
Kuhn, M., Letunic, I., Jensen, L. J., and Bork, P. (2016). The SIDER database of drugs and side effects. Nucleic Acids Res. 44, D1075–D1079. doi: 10.1093/nar/gkv1075
Lamb, J., Crawford, E. D., Peck, D., Modell, J. W., Blat, I. C., Wrobel, M. J., et al. (2006). The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313, 1929–1935. doi: 10.1126/science.1132939
Lavecchia, A., and Cerchia, C. (2016). In silico methods to address polypharmacology: current status, applications and future perspectives. Drug Discov. Today 21, 288–298. doi: 10.1016/j.drudis.2015.12.007
Li, H. L., Gao, Z. T., Kang, L., Zhang, H. L., Yang, K., Yu, K. Q., et al. (2006). TarFisDock: a web server for identifying drug targets with docking approach. Nucleic Acids Res. 34, W219–W224. doi: 10.1093/nar/gkl114
Li, T. B., Wernersson, R., Hansen, R. B., Horn, H., Mercer, J., Slodkowicz, G., et al. (2017). A scored human protein-protein interaction network to catalyze genomic interpretation. Nat. Methods 14, 61–64. doi: 10.1038/nmeth.4083
Li, Y., Han, L., Liu, Z. H., and Wang, R. X. (2014). Comparative assessment of scoring functions on an updated benchmark: 2. Evaluation methods and general results. J. Chem. Inf. Model. 54, 1717–1736. doi: 10.1021/ci500081m
Li, Y. H., Yu, C. Y., Li, X. X., Zhang, P., Tang, J., Yang, Q. X., et al. (2018). Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics. Nucleic Acids Res. 46,D1121–D1127. doi: 10.1093/nar/gkx1076
Liu, W. P., and Lu, L. Y. (2010). Link prediction based on local random walk. EPL 89:58007. doi: 10.1038/s41598-017-12586-y
Liu, X. F., Ouyang, S. S., Yu, B. A., Liu, Y. B., Huang, K., Gong, J. Y., et al. (2010). PharmMapper server: a web server for potential drug target identification using pharmacophore mapping approach. Nucleic Acids Res. 38, W609–W614. doi: 10.1093/nar/gkq300
Liu, Z. H., Li, Y., Han, L., Li, J., Liu, J., Zhao, Z. X., et al. (2015). PDB-wide collection of binding data: current status of the PDBbind database. Bioinformatics 31, 405–412. doi: 10.1093/bioinformatics/btu626
Liu, Z. H., Su, M. Y., Han, L., Liu, J., Yang, Q. F., Li, Y., et al. (2017). Forging the basis for developing protein-ligand interaction scoring functions. Accounts Chem. Res. 50, 302–309. doi: 10.1021/acs.accounts.6b00491
Lounkine, E., Keiser, M. J., Whitebread, S., Mikhailov, D., Hamon, J., Jenkins, J. L., et al. (2012). Large-scale prediction and testing of drug activity on side-effect targets. Nature 486, 361–367. doi: 10.1038/nature11159
Lu, L. Y., Medo, M., Yeung, C. H., Zhang, Y. C., Zhang, Z. K., and Zhou, T. (2012). Recommender systems. Phys. Rep. 519, 1–49. doi: 10.1016/j.physrep.2012.02.006
Lu, L. Y., and Zhou, T. (2011). Link prediction in complex networks: a survey. Phys. A 390, 1150–1170. doi: 10.1016/j.physa.2010.11.027
Luo, H., Chen, J., Shi, L. M., Mikailov, M., Zhu, H., Wang, K. J., et al. (2011). DRAR-CPI: a server for identifying drug repositioning potential and adverse drug reactions via the chemical-protein interactome. Nucleic Acids Res. 39, W492–W498. doi: 10.1093/nar/gkr299
Luo, H. M., Wang, J. X., Li, M., Luo, J. W., Peng, X. Q., Wu, F. X., et al. (2016). Drug repositioning based on comprehensive similarity measures and Bi-Random walk algorithm. Bioinformatics 32, 2664–2671. doi: 10.1093/bioinformatics/btw228
Ma, D. L., Chan, D. S. H., and Leung, C. H. (2013). Drug repositioning by structure-based virtual screening. Chem. Soc. Rev. 42, 2130–2141. doi: 10.1039/c2cs35357a
Ma’ayan, A., Rouillard, A. D., Clark, N. R., Wang, Z. C., Duan, Q. N., and Kou, Y. (2014). Lean big data integration in systems biology and systems pharmacology. Trends Pharmacol. Sci. 35, 450–460. doi: 10.1016/j.tips.2014.07.001
Magarinos, M. P., Carmona, S. J., Crowther, G. J., Ralph, S. A., Roos, D. S., Shanmugam, D., et al. (2012). TDR Targets: a chemogenomics resource for neglected diseases. Nucleic Acids Res. 40, D1118–D1127. doi: 10.1093/nar/gkr1053
Medina-Franco, J. L., Giulianotti, M. A., Welmaker, G. S., and Houghten, R. A. (2013). Shifting from the single to the multitarget paradigm in drug discovery. Drug Discov. Today 18, 495–501. doi: 10.1016/j.drudis.2013.01.008
Nan, H. M., Hutter, C. M., Lin, Y., Jacobs, E. J., Ulrich, C. M., White, E., et al. (2015). Association of aspirin and NSAID use with risk of colorectal cancer according to genetic variants. J. Am. Med. Assoc. 313, 1133–1142. doi: 10.1001/jama.2015.1815
Nilsson, S., Koehler, K. F., and Gustafsson, J. A. (2011). Development of subtype-selective oestrogen receptor-based therapeutics. Nat. Rev. Drug Discov. 10, 778–792. doi: 10.1038/nrd3551
O’Boyle, N. M., Banck, M., James, C. A., Morley, C., Vandermeersch, T., and Hutchison, G. R. (2011). Open Babel: an open chemical toolbox. J. Cheminform. 3:33. doi: 10.1186/1758-2946-3-33
Pan, L. M., Zhou, T., Lu, L. Y., and Hu, C. K. (2016). Predicting missing links and identifying spurious links via likelihood analysis. Sci. Rep. 6:22955. doi: 10.1038/srep22955
Paolini, G. V., Shapland, R. H. B., van Hoorn, W. P., Mason, J. S., and Hopkins, A. L. (2006). Global mapping of pharmacological space. Nat. Biotechnol. 24, 805–815. doi: 10.1038/nbt1228
Rognan, D. (2010). Structure-based approaches to target fishing and ligand profiling. Mol. Inform. 29, 176–187. doi: 10.1002/minf.200900081
Rose, P. W., Prlic, A., Altunkaya, A., Bi, C. X., Bradley, A. R., Christie, C. H., et al. (2017). The RCSB protein data bank: integrative view of protein, gene and 3D structural information. Nucleic Acids Res. 45, D271–D281. doi: 10.1093/nar/gkw1000
Roth, B. L., Lopez, E., Patel, S., and Kroeze, W. K. (2000). The multiplicity of serotonin receptors: uselessly diverse molecules or an embarrassment of riches? Neuroscientist 6, 252–262. doi: 10.1177/107385840000600408
Roth, B. L., Sheffler, D. J., and Kroeze, W. K. (2004). Magic shotguns versus magic bullets: selectively non-selective drugs for mood disorders and schizophrenia. Nat. Rev. Drug Discov. 3, 353–359. doi: 10.1038/nrd1346
Schadt, E. E. (2012). The changing privacy landscape in the era of big data. Mol. Syst. Biol. 8:612. doi: 10.1038/msb.2012.47
Seal, A., Ahn, Y. Y., and Wild, D. J. (2015). Optimizing drug-target interaction prediction based on random walk on heterogeneous networks. J. Cheminform. 7:40. doi: 10.1186/s13321-015-0089-z
Shen, J., Tan, C. F., Zhang, Y. Y., Li, X., Li, W. H., Huang, J., et al. (2010). Discovery of potent ligands for estrogen receptor β by structure-based virtual screening. J. Med. Chem. 53, 5361–5365. doi: 10.1021/jm100369g
Smoot, M. E., Ono, K., Ruscheinski, J., Wang, P. L., and Ideker, T. (2011). Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27, 431–432. doi: 10.1093/bioinformatics/btq675
Stevens, R. C., Cherezov, V., Katritch, V., Abagyan, R., Kuhn, P., Rosen, H., et al. (2013). The GPCR network: a large-scale collaboration to determine human GPCR structure and function. Nat. Rev. Drug Discov. 12, 25–34. doi: 10.1038/nrd3859
Subramanian, A., Narayan, R., Corsello, S. M., Peck, D. D., Natoli, T. E., Lu, X. D., et al. (2017). A next generation Connectivity Map: L1000 platform and the first 1,000,000 profiles. Cell 171, 1437.e17–1452.e17. doi: 10.1016/j.cell.2017.10.049
Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., et al. (2005). Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U.S.A. 102, 15545–15550. doi: 10.1073/pnas.0506580102
Szklarczyk, D., Santos, A., von Mering, C., Jensen, L. J., Bork, P., and Kuhn, M. (2016). STITCH 5: augmenting protein-chemical interaction networks with tissue and affinity data. Nucleic Acids Res. 44, D380–D384. doi: 10.1093/nar/gkv1277
Tang, Y., Zhu, W. L., Chen, K. X., and Jiang, H. L. (2006). New technologies in computer-aided drug design: toward target identification and new chemical entity discovery. Drug Discov. Today 3, 307–313. doi: 10.1016/j.ddtec.2006.09.004
Tatonetti, N. P., Ye, P. P., Daneshjou, R., and Altman, R. B. (2012). Data-driven prediction of drug effects and interactions. Sci. Transl. Med. 4:125ra31. doi: 10.1126/scitranslmed.3003377
Tian, K., Shao, M., Wang, Y., Guan, J., and Zhou, S. (2016). Boosting compound-protein interaction prediction by deep learning. Methods 110, 64–72. doi: 10.1016/j.ymeth.2016.06.024
Ursu, O., Holmes, J., Knockel, J., Bologa, C. G., Yang, J. J., Mathias, S. L., et al. (2017). DrugCentral: online drug compendium. Nucleic Acids Res. 45,D932–D939. doi: 10.1093/nar/gkw993
Vanhaelen, Q., Mamoshina, P., Aliper, A. M., Artemov, A., Lezhnina, K., Ozerov, I., et al. (2017). Design of efficient computational workflows for in silico drug repurposing. Drug Discov. Today 22, 210–222. doi: 10.1016/j.drudis.2016.09.019
Wang, Y. H., and Zeng, J. Y. (2013). Predicting drug-target interactions using restricted Boltzmann machines. Bioinformatics 29, 126–134. doi: 10.1093/bioinformatics/btt234
Wang, Y. L., Bryant, S. H., Cheng, T. J., Wang, J. Y., Gindulyte, A., Shoemaker, B. A., et al. (2017). PubChem BioAssay: 2017 update. Nucleic Acids Res. 45, D955–D963. doi: 10.1093/nar/gkw1118
Waszkowycz, B., Clark, D. E., and Gancia, E. (2011). Outstanding challenges in protein-ligand docking and structure-based virtual screening. Wiley Interdiscip. Rev. Comput. Mol. Sci. 1, 229–259. doi: 10.1021/acs.jmedchem.6b00716
Wen, M., Zhang, Z. M., Niu, S. Y., Sha, H. Z., Yang, R. H., Yun, Y. H., et al. (2017). Deep-learning-based drug-target interaction prediction. J. Proteome Res. 16, 1401–1409. doi: 10.1021/acs.jproteome.6b00618
Willett, P. (2006). Similarity-based virtual screening using 2D fingerprints. Drug Discov. Today 11, 1046–1053. doi: 10.1016/j.drudis.2006.10.005
Willett, P., Barnard, J. M., and Downs, G. M. (1998). Chemical similarity searching. J. Chem. Inf. Model. 38, 983–996. doi: 10.1021/ci9800211
Wishart, D. S., Feunang, Y. D., Guo, A. C., Lo, E. J., Marcu, A., Grant, J. R., et al. (2018). DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 46, D1074–D1082. doi: 10.1093/nar/gkx1037
Wu, Z. R., Cheng, F. X., Li, J., Li, W. H., Liu, G. X., and Tang, Y. (2017). SDTNBI: an integrated network and chemoinformatics tool for systematic prediction of drug-target interactions and drug repositioning. Brief. Bioinform. 18, 333–347. doi: 10.1093/bib/bbw012
Wu, Z. R., Lu, W. Q., Wu, D., Luo, A. Q., Bian, H. P., Li, J., et al. (2016). In silico prediction of chemical mechanism of action via an improved network-based inference method. Br. J. Pharmacol. 173, 3372–3385. doi: 10.1111/bph.13629
Wu, Z. R., Lu, W. Q., Yu, W. W., Wang, T. D. Y., Li, W. H., Liu, G. X., et al. (2018). Quantitative and systems pharmacology 2. In silico polypharmacology of G protein-coupled receptor ligands via network-based approaches. Pharmacol. Res. 129, 400–413. doi: 10.1016/j.phrs.2017.11.005
Xiang, J., Chun, E., Liu, C., Jing, L., Al-Sahouri, Z., Zhu, L., et al. (2016). Successful strategies to determine high-resolution structures of GPCRs. Trends Pharmacol. Sci. 37, 1055–1069. doi: 10.1016/j.tips.2016.09.009
Xie, L., Ge, X. X., Tan, H. P., Xie, L., Zhang, Y. L., Hart, T., et al. (2014). Towards structural systems pharmacology to study complex diseases and personalized medicine. PLoS Comput. Biol. 10:e1003554. doi: 10.1371/journal.pcbi.1003554
Yang, S. Y. (2010). Pharmacophore modeling and applications in drug discovery: challenges and recent advances. Drug Discov. Today 15, 444–450. doi: 10.1016/j.drudis.2010.03.013
Yap, C. W. (2011). PaDEL-Descriptor: an open source software to calculate molecular descriptors and fingerprints. J. Comput. Chem. 32, 1466–1474. doi: 10.1002/jcc.21707
Yildirim, M. A., Goh, K. I., Cusick, M. E., Barabasi, A. L., and Vidal, M. (2007). Drug-target network. Nat. Biotechnol. 25, 1119–1126. doi: 10.1038/nbt1338
Yu, H., Chen, J. X., Xu, X., Li, Y., Zhao, H. H., Fang, Y. P., et al. (2012). A systematic prediction of multiple drug-target interactions from chemical, genomic, and pharmacological data. PLoS One 7:e37608. doi: 10.1371/journal.pone.0037608
Yu, W., Gwinn, M., Clyne, M., Yesupriya, A., and Khoury, M. J. (2008). A navigator for human genome epidemiology. Nat. Genet. 40, 124–125. doi: 10.1038/ng0208-124
Zarin, D. A., Tse, T., Williams, R. J., Califf, R. M., and Ide, N. C. (2011). The ClinicalTrials.gov results database - update and key issues. N. Engl. J. Med. 364, 852–860. doi: 10.1056/NEJMsa1012065
Zhang, C., Hong, H. X., Mendrick, D. L., Tang, Y., and Cheng, F. X. (2015). Biomarker-based drug safety assessment in the age of systems pharmacology: from foundational to regulatory science. Biomark. Med. 9, 1241–1252. doi: 10.2217/bmm.15.81
Zheng, M. Y., Liu, X., Xu, Y., Li, H. L., Luo, C., and Jiang, H. L. (2013). Computational methods for drug design and discovery: focus on China. Trends Pharmacol. Sci. 34, 549–559. doi: 10.1016/j.tips.2013.08.004
Zhou, T., Kuscsik, Z., Liu, J. G., Medo, M., Wakeling, J. R., and Zhang, Y. C. (2010). Solving the apparent diversity-accuracy dilemma of recommender systems. Proc. Natl. Acad. Sci. U.S.A. 107, 4511–4515. doi: 10.1073/pnas.1000488107
Keywords: drug-target interaction, network-based method, target prediction, systems pharmacology, systems toxicology, drug repurposing
Citation: Wu Z, Li W, Liu G and Tang Y (2018) Network-Based Methods for Prediction of Drug-Target Interactions. Front. Pharmacol. 9:1134. doi: 10.3389/fphar.2018.01134
Received: 01 April 2018; Accepted: 18 September 2018;
Published: 09 October 2018.
Edited by:
Jianfeng Pei, Peking University, ChinaCopyright © 2018 Wu, Li, Liu and Tang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yun Tang, eXRhbmcyMzRAZWN1c3QuZWR1LmNu