| Literature DB >> 35704883 |
T J Rintala1, Arindam Ghosh1, V Fortino1.
Abstract
The network approach is quickly becoming a fundamental building block of computational methods aiming at elucidating the mechanism of action (MoA) and therapeutic effect of drugs. By modeling the effect of drugs and diseases on different biological networks, it is possible to better explain the interplay between disease perturbations and drug targets as well as how drug compounds induce favorable biological responses and/or adverse effects. Omics technologies have been extensively used to generate the data needed to study the mechanisms of action of drugs and diseases. These data are often exploited to define condition-specific networks and to study whether drugs can reverse disease perturbations. In this review, we describe network data mining algorithms that are commonly used to study drug's MoA and to improve our understanding of the basis of chronic diseases. These methods can support fundamental stages of the drug development process, including the identification of putative drug targets, the in silico screening of drug compounds and drug combinations for the treatment of diseases. We also discuss recent studies using biological and omics-driven networks to search for possible repurposed FDA-approved drug treatments for SARS-CoV-2 infections (COVID-19).Entities:
Keywords: biological networks; disease modeling; drug’s MOA; network analysis
Mesh:
Year: 2022 PMID: 35704883 PMCID: PMC9294412 DOI: 10.1093/bib/bbac229
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 13.994
Figure 1Different types of networks. (A) An undirected-unweighted network provide information only regarding the possible connections between the nodes. No information is provided regarding the type of interaction or its strength. (B) Directed network on the other hand provides information about the direction of interaction, (C) while weighted network tell about the strength of the interaction often denoted by edge. (D) Multiplex networks are formed by obtaining interaction information from different sources for the same set of nodes. Each layer on a multiplex network refers to interaction from different sources. As the nodes within each layer is of similar type, the network within each layer could be referred to as homogenous network. (E) In contrast to homogenous network, heterogenous network involves interaction between different types of nodes.
A list of databases to build knowledge-driven networks. The rows indicate the databases, while the columns indicate the types of connections that can be implemented
| Protein–Protein | Protein-Pathway | Gene-Variant | miRNA-Gene target | Drug-Target/ Gene-Chemical | Drug-Disease/ Chemical-Disease | Drug–Drug/ Chemical–Chemical | Drug-ADR | Chemical-Variant | Drug-Pathway/ Chemical-Pathway | Disease-Gene | Disease-miRNA | Disease-Disease | Disease-Pathway | Disease-Variant | ADR-Protein | ADR-Variant | References | Last updated | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ADReCS | √ | [ | 2021 | ||||||||||||||||
| ADReCS-Target | √ | √ | √ | [ | 2017 | ||||||||||||||
| Agile Protein Interactomes DataServer (APID) | √ | [ | 2021 | ||||||||||||||||
| BioGRID | √ | √ | [ | 2022 | |||||||||||||||
| Comparative Toxicogenomics Database (CTD) | √ | √ | √ | √ | √ | [ | 2022 | ||||||||||||
| DisGeNet | √ | √ | √ | [ | 2020 | ||||||||||||||
| DrugBank | √ | √ | √ | √ | [ | 2022 | |||||||||||||
| Human microRNA Disease Database (HMDD) | √ | [ | 2019 | ||||||||||||||||
| Kyoto Encyclopedia of Genes and Genomes (KEGG) | √ | [ | 2022 | ||||||||||||||||
| miRTarBase | √ | [ | 2021 | ||||||||||||||||
| Online Mendelian Inheritance in Man (OMIM) | √ | [ | 2022 | ||||||||||||||||
| OpenTarget | √ | [ | 2022 | ||||||||||||||||
| PharmGKB | √ | √ | √ | √ | √ | [ | 2022 | ||||||||||||
| REACTOME | √ | √ | √ | [ | 2022 | ||||||||||||||
| Small Molecule Pathway Database (SMPDB) | √ | √ | [ | 2013 | |||||||||||||||
| STITCH | √ | √ | [ | 2015 | |||||||||||||||
| STRING | √ | [ | 2021 | ||||||||||||||||
| Therapeutic Target Database (TTD) | √ | √ | √ | √ | [ | 2022 |
A list of databases to build data-driven networks. The rows indicate the databases, while the columns indicate omics data types
| Chemical structure | Genomic | Transcriptomic | Proteomic | Metabolomic | Epigenomic | Drug sensitivity | References | |
|---|---|---|---|---|---|---|---|---|
| Chemical Entities of Biological Interest (ChEBI) | √ | [ | ||||||
| Human Metabolome Database (HMDB) | √ | √ | [ | |||||
| The Cancer Genome Atlas Program (TCGA) | √ | √ | √ | √ | [ | |||
| Genotype-Tissue Expression (GTEx) | √ | √ | [ | |||||
| Clinical Proteomic Tumor Analysis Consortium (CPTAC) | √ | [ | ||||||
| DrugMatrix (data available via GEO) | √ | [ | ||||||
| Gene Expression Omnibus (GEO) | √ | [ | ||||||
| Open TG-GATEs | √ | [ | ||||||
| Library of Integrated Network-Based Cellular Signatures (LINCS) (data available via GEO) | √ | [ | ||||||
| Genomics of Drug Sensitivity in Cancer | √ | [ | ||||||
| European Nucleotide Archive (ENA) | √ | √ | √ | [ | ||||
| PRoteomics IDEntifications (PRIDE) Archive | √ | [ | ||||||
| Catalogue Of Somatic Mutations In Cancer (COSMIC) | √ | [ |
Figure 2Basic algorithms in network data mining. (A) Examples of graph search algorithms. (B) Examples of in- and out-degree centrality. (C) Examples of nodes highlighting the main characteristic of different centrality scores. ‘1’ has a high degree; ‘2’ has a high betweenness score and ‘3’ has a high closeness score. (D) Network proximity with SP, MST and RW. (E) Example of a Steiner tree. (F) Example of CD in graph. The edges of the SP, MST or ST are presented in bold.
Application of network data-mining algorithms for studying diseases, drugs and their associations
| Algorithm | Target | Application | Description | Reference |
|---|---|---|---|---|
|
| Explore disease and drug effects | Network-based prioritization of gene-disease associations | BFS and DFS were used in combination to compute a low-dimensional vector representation for all nodes in a network. These vectors are then used for gene prioritization. | [ |
|
| Explore disease and drug effects | Identify key and master regulatory genes in disease-specific gene network | BFS was applied to find a minimum set of connected genes such that every other gene in the network is one hop connected with a gene in this set (a minimum connected dominating set of genes). | [ |
|
| Explore disease and drug effects | Identify connected subnetworks in disease-specific biological networks | DFS was extended to identify active gene (or protein)-based modules across multilayer networks (i.e. networks composed of different layers, where every layer is an independent network). | [ |
|
| Explore disease and drug effects | Mine associations between disease and drug targets | Shortest paths were used to compute network proximity of drug-disease pairs. These proximity scores are then evaluated for the prediction of effective drug combinations or adverse effects. | [ |
|
| Explore disease effects | Mine novel disease-associated genes | Shortest paths were utilized to detect genes that are closely related to known disease-associated genes. | [ |
|
| Explore disease variants | Divergence analysis of disease variants | A phylogenetic network analysis of 160 complete human severe acute respiratory syndrome coronavirus 2 (SARS-Cov-2) genomes uses minimum spanning trees to build a phylogenetic tree and measure differences SARS-Cov-2 variants. | [ |
|
| Explore disease variants | Identify patient-driven dysregulated networks | A modified Kruskal minimum spanning tree search strategy was implemented to determine the maximum dysregulated subnetwork for drug treatment in a cohort of cancer patients. | [ |
|
| Explore drug effects | Predict drug-target interaction prediction by using multiple networks | The RWR algorithm was used to infer the cascading effect triggered by perturbed drug targets. The so-called ‘diffusion state’ is then used to compute prediction scores of drug target interactions. | [ |
|
| Explore disease-drug relationships | Identify new indications for existing drugs | An integrated heterogeneous network was constructed by combining multiple sources including drugs, drug targets, diseases and disease genes data. Then, RWR was applied to rank diseases starting from known drug targets. | [ |
|
| Explore disease effects | Identifying gene-disease associations using centrality on gene networks | Text mining and network analysis were used in combination to build disease-specific gene-interaction networks and mine gene-disease associations on the basis of four node centrality scores: degree, eigenvector, betweenness and closeness centrality. | [ |
|
| Explore drug-disease relationships | Explore the SARS-CoV-2 virus-host-drug interactome for drug repurposing | A network was built to present viral-host protein interactions, host-protein interactions and drug-protein interactions. Then, betweenness and closeness centrality are used to rank and select known drugs targeting an optimized set including key viral and host proteins. | [ |
|
| Explore drug-disease relationships | Identify disease-gene and drug-gene associations | A network including disease-drug interactions is built based on known disease-gene associations and drug targets. Then, modularity-based community detection algorithms were used to identify clusters of diseases and drugs that could suggest novel drug repositioning candidates | [ |
|
| Explore disease effects | Identify phenotype-driven modules in gene networks | The Louvain algorithm is utilized to identify biologically relevant gene modules that change under different environmental conditions and biological states. | [ |
Comparison of NDM-based approaches used in drug discovery. The methods are grouped based on four categories: basic NDM algorithms network propagation and random-walk based methods (which do not rely on graph embeddings), matrix factorization and graph neural network-based methods
| Software | Algorithms | Objectives | Input data | Key properties | Limitations |
|---|---|---|---|---|---|
|
| |||||
| Guney’s toolbox | Network proximity between drug targets and disease modules |
| PPIs, DTIs, disease gene sets |
Estimates drug efficacy and safety in gene interaction network context. Uses network-based proximity measures for the discovery of drug combinations. Low computational complexity after computing the shortest paths between all proteins. Estimation for each drug or drug combination is independent of each other and hence it should be highly scalable. Time complexity is not specified, but it should be bounded by | To yield accurate results the set of interactions and disease associated genes should be accurate and complete. |
| GPSnet | Network proximity, gene module detection based on greedy algorithm with multiple random initializations |
| Patient mutation and gene-expression data for disease modules, DTIs (combined from 6 sources), PPIs (combined from 15 sources), drug-induced transcriptome data for GSEA |
Uses cancer-type-specific omics data for disease module detection and cancer specific efficacy estimation based on network proximity. Uses GSEA and cell line expression data to confirm whether disease genes are up-/downregulated by drugs Identifies omics-driven disease gene modules (or pathways). Has low computational complexity and high scalability. | To yield accurate results the set of interactions and disease associated genes should be accurate and complete. |
| SAveRUNNER | Network proximity, drug-disease module detection |
| PPIs, DTIs, disease gene sets |
Estimates drug efficacy using gene interaction networks and network proximity. Prioritizes associations between drugs and diseases located in the same network neighborhoods. Identifies off-label of drugs to be repositioned. Time complexity is not specified, but the module detection algorithms runs in near linear time, High scalability. | To yield accurate results the set of interactions and disease associated genes should be accurate and complete. |
| ThETA | Dijkstra’s algorithm, node centrality based on degree, clustering coefficients and betweenness | Target prioritization | PPIs, tissue-specific gene-expression, disease gene sets |
Drug target efficacy and safety estimation with network topology measures. Computes tissue-specific efficacy estimates. Time complexity is bounded by
| It depends on the accuracy of known disease-gene associations. |
| MeTeOR community detection | Recursive Louvain method | Find disease- and drug-specific pathways | MeSH term co-occurrence in literature |
Identifies novel disease and drug specific biological pathways from genes contained in the same communities. Efficient module detection via Recursive Louvain (RL) method. Requires computationally expensive text mining to produce the network in the first place. Might scale poorly with larger networks due to recursiveness. Time complexity is not specified. | The power of the method for drug discovery depends on the amount of literature that is already published on the topic of interest. |
|
| |||||
| MBiRW | Bi-random walk with restart | Drug repurposing | Drug structure for molecular fingerprints, MeSH terms of diseases, known drug-disease indications |
Simultaneous random walks on drug similarity network and disease similarity network. Known associations are systematically used to adjust similarities and as seed node sets for random walk with restart. Applies clustering analysis to further increase similarity of drugs and diseases belonging to the same cluster. Time complexity is not specified, but it should approximately be cubic in the number of nodes | Repeated adjacency matrix multiplication scales poorly when the number of drugs and diseases becomes very large. |
| DrugNet | Network propagation | Drug repurposing | Drug annotations, disease ontology, known drug-disease indications, DTIs, PPIs, disease gene sets |
Able to integrate data from complex networks involving a wide range of types of elements and interactions The algorithm can work with an arbitrary number of heterogeneous data networks. Computational time relying on network propagation within a network— Time complexity should approximately be bounded by | Network completeness significantly affects the quality predictions. |
|
| |||||
| SCMFDD | Similarity constrained matrix factorization | Drug repurposing | Drug-disease associations, drug molecular structure, drug target proteins and enzymes, drug pathways, drug–drug interactions, disease MeSH DAGs (used for semantic similarity) |
Predicts drug-disease associations by using matrix factorization (MF). Improves MF by using similarity constraints to account for the biological context. Simple and efficient algorithm (should scale better for larger networks than MBiRW) Time complexity is not specified, but it should be approximately bounded by | Three hyper-parameters need to be tuned. Drug similarities were not integrated, but rather tested separately. |
| NMF-DR | Non-negative matrix factorization, similarity network normalization and fusion | Drug repurposing | Known drug-disease indications, drug similarities and disease similarities collected from four previous studies based on: drug clinical annotations, drug structure, drug targets, drug pathways, drug–drug interactions, disease MeSH DAGs, disease ontology, disease gene sets |
Novel approach for constructing a heterogeneous network of drugs and diseases. Aggregates and normalizes similarities between drugs and diseases from different data sources by using similarity network normalization and fusion (SNNF). Provides an improved non-negative matrix factorization (NMF) method which is based on a rank selection method and singular value decomposition (SVD) methods. Could be applied to other bi-partite networks with any number of similarities. Time complexity is not specified. | The matrix initialization cost (using SVD) was identified as a bottleneck by the authors. The similarity fusion step can lose some complementary information from different similarities even though it showed an improvement over the individual similarities that were tested separately. |
| MSBMF | Matrix factorization | Drug repurposing | Drug-disease associations, drug clinical annotations, drug structure, drug side effects, drug–drug interactions, drug targets, disease MeSH terms, disease ontology |
Uses matrix factorization to decompose the drug-disease association matrix into a drug-feature matrix and a disease-feature matrix. Multiple drug- and disease-based similarity constraints are used without applying a fusion step. Latent features are then extracted to infer missing drug-disease associations. Could be applied to other bi-partite networks with any number of similarities. Time complexity is not specified. | It utilizes non-convex optimization which can get stuck in local optima. |
| DTINet | RWR, Diffusion Component Analysis (DCA), inductive matrix completion | Drug-target prediction, drug repurposing | DTIs, drug–drug interactions, drug-disease associations, drug-side effect associations, disease gene sets, PPIs, drug molecular structures, protein sequences |
Integrates diverse information from heterogeneous networks. Uses diffusion component analysis (DCA) to embed nodes of several networks between drugs, targets, side-effects and diseases to extract topological drug and target features. Uses inductive matrix completion to find projection of low-dimensional drug and disease representation such that known drug-disease pairs are geometrically closer in the mapped space. Can be applied to other bi-partite networks with any similarity or association related to drugs and proteins Time complexity is approximately bounded by | Using network propagation for graph embedding is no longer state-of-the-art. A set of negative training examples is required for the matrix completion method used and in practice they are sampled from the set of unknown predictions, which can cause bias in the predictions. |
|
| |||||
| NeoDTI | Deep learning, graph convolutional networks | Drug-target network completion, drug repurposing | DTIs, drug–drug interactions, drug-disease associations, drug-side-effect associations, disease gene sets, PPIs, drug molecular structures, protein sequences |
Predicts drug-target interactions by using several heterogeneous networks between drugs, side effects, proteins and diseases. Embeds nodes in heterogeneous networks by using neighborhood aggregation. Implements end-to-end DTI prediction based on deep learning. Uses multi-objective optimization aiming to preserve topology of input data via reconstruction. Comparatively high predictive performance (improves DTINet). Could be applied to other bi-partite networks with any similarity or association related to drugs and proteins. It can handle large-scale data. Time complexity is not specified. | Computational cost of deep learning can be high, although generally it scales well with more data. Needs to sample negative examples randomly from unknown associations. |
| DeepDTnet | Deep learning, stacked denoising autoencoder, positive-unlabeled matrix completion | Drug-target network completion, drug repurposing | DTIs (assembled from 6 sources), drug molecular structure, PPIs (15 sources), drug–drug interactions (1 source), known drug-disease indications (3 sources), drug side effects (4 sources), protein sequences, tissue gene-expression, gene ontology, drug annotations, disease gene sets (3 sources) |
Predicts drug-target interactions by using multiple heterogeneous networks between drugs, side effects, proteins and diseases. Node embeddings used as drug and target features are extracted by DNGR which uses random surfing, positive pointwise mutual information and stacked denoising autoencoders. Positive unlabeled matrix completion addresses sparsity of associations and lack of negative examples in drug-target graph. Could be used to integrate any weighted networks related to DTI or other bi-partite link prediction. Higher predictive performance compared to DTINet. Time complexity is not specified. | Computational cost can be high for learning multiple deep autoencoders. The authors indicate that their classification of DTI based on a weak binding affinity cut-off might risk false positives. Predictions were slightly biased against drugs with lower average similarity to other drugs, but still very good. |
| HeTDR | Deep learning, similarity network fusion (SNF), sparse autoencoders, text mining, representation learning for attributed multiplex heterogeneous network | Drug repurposing | Drug-disease associations, drug–drug interactions, DTIs (3 sources), drug-side effect association (4 sources), drug molecular structure, drug clinical annotations, protein sequence similarity, gene ontology (GO), drug similarities (structure, clinical annotation semantics, target similarity, 3 GO-based semantics), text-mining-based disease features |
Utilizes a combination of autoencoder, text-mining and deep learning-based link prediction. Sparse autoencoder extracts drug attributes from multiple similarity measures fused together with SNF. Text mining is used to extract rich disease attributes. Link prediction method (GATNE-I) utilizes attributed heterogeneous network graph embedding. GATNE-I could perhaps be used to integrate multiple relations between drugs, diseases and other entities. Text mining could be applied to other corpora. Significantly outperformed other methods in comparison. Time complexity is not specified. | Uses three separately trained deep learning methods with significant computational cost. The final network features only drugs and proteins and could perhaps be improved by adding relevant biomolecules such as proteins. |
| MGRL | Deep learning, graph convolutional networks, node2vec, random forest classifier | Drug repurposing | Drug-disease associations, drug molecular structure, DTIs, disease MeSH DAGs (used for semantic similarity) |
Combines neighborhood attribute aggregation and non-attributed graph node embedding. Uses graph convolutional networks for feature extraction by embedding drugs and diseases based on their immediate neighborhood and attributes. Uses node2vec to embed drugs and diseases in the global network context. Uses random forest for predictions. Time complexity is not specified. | It uses a simplified version of graph convolution that may not learn rich representations of the nodes. Nevertheless, the computational costs of deep learning are typically higher than other methods. The prediction performance was only slightly higher than previous deep learning methods. |
| GRLMN | Deep learning, stacked autoencoders, Large-scale Information Network Embedding (LINE), random forest classifier | Drug repurposing | Drug-disease associations, drug molecular structure, drug target proteins and enzymes, drug pathways, drug–drug interactions, disease MeSH DAGs (used for semantic similarity), disease and drug associations for miRNA, lncRNA and proteins, DTIs, disease gene sets, PPIs |
Integrates association networks consisting of multiple biomolecules. Uses heterogeneous network node embedding for drugs and diseases as well as a drug fingerprint autoencoder for feature extraction. Uses random forest for predictions based on node embeddings and attributes. Could be used to incorporate relations among any number of different entities. When testing different embedding algorithms, LINE outperformed node2vec slightly. Time complexity not specified, but LINE is approximately bounded by | When updating the input network, the whole model needs to be retrained, although the computational cost is not very high according to the authors. Low improvement over SCMFDD prediction performance. |
| Decagon | Deep learning, graph convolutional networks, tensor factorization | Prediction of side effects for drug combinations | PPIs (assembled from 4 sources), DTIs, drug side effects (2 sources), drug–drug combination side-effects |
Predicts polypharmacy side effects. Represents side effects for drug combinations as different drug–drug relations which represent different side effects. Uses end-to-end deep learning combining a graph convolutional network as an encoder of drug–drug and drug-protein relations as well as tensor factorization as a decoder to predict drug combination side effects. Significantly outperformed other multi-relational link prediction methods in comparison. Time complexity is not specified. | It samples random missing links for negative examples in training. Side effects can depend on the dose and patient, which cannot be accounted for by using this method. |
|
| |||||
| CDCN | Network model | Drug sensitivity prediction | Drug response data (IC50) in different cancer cell lines, cell line gene expression at steady state, drug molecular structures |
Constructs a heterogenous network of drugs and cell lines, where drug-cell line edges are weighted by drug response, and drug–drug and cell line-cell line edges are weighted by their similarity. Predicts drug response for new drug on new cell line by using a network model. The predictions are optimized by evaluating a network model over a range of decay parameter values. The optimization algorithm’s time complexity is bounded by the square of the number of drug-cell line pairs which could become an issue with larger datasets. | The suggested strategy for predicting responses in patients requires that there exists a correlated gene-expression profile for which drug response data is known at least for some drugs; however, in practice drug response is typically analyzed in cell lines which often do not exhibit strong correlation with patient profiles. |
| Pouryahya’s method | Network model, clustering, random forest regressor | Drug sensitivity prediction | Drug response data (IC50) in different cancer cell lines, cell line gene expression at steady state, PPIs, cancer associated gene set, drug molecular structures |
Uses a stochastic message-passing process to define an invariant measure of gene expression in a PPI network. Uses Wasserstein distance and optimal mass transport to calculate distance between cell line gene expression profiles. Predictions were made with random forests based on gene expression and drug molecular features. Cell lines and drugs were clustered and separate regression models were trained on different cluster pairs. Time complexity is not specified. | Predictions depend on clustering information, assigning drugs and cell lines to their nearest cluster works as long as they are close to existing clusters, but might cause issues otherwise. Limiting analysis known cancer genes may discard useful information. Need to learn one model per drug-cell line cluster pair. |
| PaccMann | Network propagation, deep learning, multimodal attention-based neural networks, recurrent neural networks, convolutional neural networks | Drug sensitivity prediction | Drug response data (IC50) in different cancer cell lines, cell line gene expression at steady state, drug molecular structures, PPIs |
Uses network propagation to extract subset of informative genes that is used as input in an end-to-end neural network to predict drug sensitivity. Attention-based neural networks offer better interpretability compared to other deep learning approaches. SMILES-based feature extraction outperformed molecular fingerprints. Time complexity is not specified. | The authors mention that adding gene expression profiles from healthy cell lines could improve the extraction of cancer specific features. The method uses PPI for selecting genes, but otherwise does not integrate biological networks. |
| DeepCDR | Deep learning, graph convolutional networks, convolutional neural networks, feed-forward neural networks | Drug sensitivity prediction | Drug response data (IC50) in different cancer cell lines, drug molecular structure, cell line multi-omic profiles (mutation, transcriptomic, epigenomic), patient multi-omic profiles and clinical annotations (used for validation only), known cancer associated gene set |
Uses a uniform graph convolution network (UGCN) to extract rich drug representations from 2D drug graph structure. Integrates multi-omic data to characterize cell lines. Based on ablation analysis epigenomic profiles were particularly useful. Significantly outperformed other baseline machine learning models. Time complexity is not specified. | Does not integrate biological networks. UGCN uses fixed size inputs by utilizing a complementary drug structure graph that is used to complete the adjacency matrix. |
| Y Guan’s method | Network propagation, random forest regressor | Drug synergy prediction | Monotherapy and drug combination response data (IC50) and synergy scores in different cancer cell lines, DTIs, multi-omic cell line profiles (mutations, transcriptome, epigenome), gene network |
Uses network propagation to simulate post treatment molecular profiles for drug combinations in each cell line. The simulated profiles were used as features for random forest-based regression. Trains ensemble of models restricted to subsets of all, one or two drugs for training. The monotherapy data was found more informative than simulated post combination treatment features, but still provided additional value for predictions. Time complexity is not specified. | The local synergy model trains one classifier for each drug combination, which does not scale well for a large number of drugs, although could still be reasonable by choosing an efficient method or appropriate hyper-parameters. |
| PRODeep Syn | Deep learning, graph convolutional network, matrix completion | Synergistic drug combination prediction | Drug combination response data (IC50) and synergy scores in different cancer cell lines, drug molecular structures, cell line gene expression and mutation profiles, PPIs, gene sets used to define node attributes (position, motif, immunological signature) |
Merges PPIs, gene annotations and cell line gene expression to represent cell lines. Performs attributed graph node embedding with GCN to extract features for predicting cell line-specific gene expression profiles from PPI and gene annotations. The learned cell line hidden state and drug features are used in a fully connected deep neural network to predict drug synergy. Time complexity is not specified. | The authors noted that the predicted synergies tended to be lower than observed for drug combinations with high synergy scores due to potential issues with low synergy scores being very common in the training data. The performance improvement over other deep learning methods was quite low. |
| Dcombo Net | Heterogenous network RWR | Synergistic drug combination prediction | Known synergistic drug combinations, drug annotations, drug molecular structure, drug side effects, DTIs, drug-related pathways, PPIs, cancer pathways, baseline and drug exposed cell line gene expression profiles |
Predicts anticancer drug combinations. Uses random walk in a heterogeneous network of drugs, genes and pathways. The walker jumps between subnetworks depending on jumping probability and the subnetwork. Can predict sample-specific drug combinations by modifying the network based on gene expression. Time complexity is not specified, but the main random walk algorithm is approximately bounded by O(n3) where n is the number of nodes in the heterogeneous network. | Three different jumping probabilities need to be optimized. Performance was evaluated by classifying drugs into two or three groups (combinable, uncombinable, intermediate). Drugs with unique mechanisms that are relatively disconnected from the rest of the network are less likely to be predicted well. |
Drug-repurposing results for COVID-19 from various authors applying network proximity algorithms. Drugs in bold indicate drugs which have FDA emergency use authorization for COVID-19 treatment as listed on https://www.fda.gov/ (accessed 9 February 2022). Curly brackets {} indicate drug combinations
| Objective | Network-based approach | Selected drugs | Validation | Ref |
|---|---|---|---|---|
| Host interactome exploration and drug (target) identification | Various NMD algorithms applied to combined virus-host PPI including: a Steiner tree algorithm, closeness centrality and network proximity (average closest shortest-path-distance between gene sets) |
| Experimental validation data not available. Web application: | [ |
| Network medicine drug-repurposing | Ensemble-based approach which is based on three different graph-based approaches: ML combined with graph embeddings, diffusion-based algorithms and proximity-based algorithms. | Auranofin, Azelastine, Vinblastine, Fluvastatin, Methotrexate and Digoxin. | Experimental validation data: | [ |
| Identify drug-disease associations and repurposing drugs | SAveRUNNER aims to quantify the vicinity between the drug targets and the disease-associated proteins in the human interactome via a novel network-based similarity measure that rewards associations between drugs and diseases located in the same network neighborhoods by applying community detection. | SARS & COVID-19: chloroquine, hydroxychloroquine, tocilizumab, heparin COVID-19: {lopinavir, ritonavir, remdesivir, chloroquine, hydroxychloroquine}, dabigatran, adalimumab | Experimental validation data not available. Software: | [ |
| Identify drug-disease associations in virus-host PPI. | Network proximity (average closest shortest-path-distance between gene sets) in host-virus PPI. | Cefdinir, Toremifene, Irbesartan, Melatonin, Carvedilol. | Experimental validation data not available. Software not available. | [ |
| Repurposing single and drug combinations in virus-host PPI. | Network proximity (average closest shortest-path-distance between gene sets) in host-only PPI, complementary exposure for combinations. | Mesalazine, Toremifene, Eplerenone, Paroxetine, Sirolimus, Dactinomycin, irbesartan, mercaptopurine, melatonin, quinacrine, carvedilol, colchicine, camphor, equilin, oxymetholone, emodin {sirolimus, dactinomycin}, {toremifene, emodin}, {mercaptopurine, melatonin}. | Experimental validation data not available. Software not available. | [ |