| Literature DB >> 31945078 |
Aanchal Mongia1, Angshul Majumdar2.
Abstract
The identification of potential interactions between drugs and target proteins is crucial in pharmaceutical sciences. The experimental validation of interactions in genomic drug discovery is laborious and expensive; hence, there is a need for efficient and accurate in-silico techniques which can predict potential drug-target interactions to narrow down the search space for experimental verification. In this work, we propose a new framework, namely, Multi-Graph Regularized Nuclear Norm Minimization, which predicts the interactions between drugs and target proteins from three inputs: known drug-target interaction network, similarities over drugs and those over targets. The proposed method focuses on finding a low-rank interaction matrix that is structured by the proximities of drugs and targets encoded by graphs. Previous works on Drug Target Interaction (DTI) prediction have shown that incorporating drug and target similarities helps in learning the data manifold better by preserving the local geometries of the original data. But, there is no clear consensus on which kind and what combination of similarities would best assist the prediction task. Hence, we propose to use various multiple drug-drug similarities and target-target similarities as multiple graph Laplacian (over drugs/targets) regularization terms to capture the proximities exhaustively. Extensive cross-validation experiments on four benchmark datasets using standard evaluation metrics (AUPR and AUC) show that the proposed algorithm improves the predictive performance and outperforms recent state-of-the-art computational methods by a large margin. Software is publicly available at https://github.com/aanchalMongia/MGRNNMforDTI.Entities:
Year: 2020 PMID: 31945078 PMCID: PMC6964976 DOI: 10.1371/journal.pone.0226484
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Drugs, targets and interactions in each dataset used for validation.
| Datasets | Nuclear Receptor (NR) | G-Protein Coupled Receptor (GPCR) | Ion channel (IC) | Enzyme (E) |
|---|---|---|---|---|
| # Interactions | 90 | 635 | 1476 | 2926 |
| # Drugs | 54 | 223 | 210 | 445 |
| # Targets | 26 | 95 | 204 | 664 |
Fig 1Converge plot of the MGRNNM algorithm for NR dataset with cross validation setting CVS1 (drug-target pair prediction).
AUPR results for interaction prediction under validation setting CVS1.
| AUPR | MGRNNM | standard | MC | MCG | WGRMF | RLS_WNN | CMF | NRLMF | TMF |
|---|---|---|---|---|---|---|---|---|---|
| E | 0.9014 | 0.7882 | 0.7621 | 0.8768 | 0.8093 | 0.8837 | 0.8749 | 0.7886 | |
| IC | 0.9298 | 0.8868 | 0.8346 | 0.9225 | 0.8459 | 0.9373 | 0.8674 | 0.8654 | |
| GPCR | 0.7483 | 0.6481 | 0.5956 | 0.7370 | 0.6933 | 0.7543 | 0.7115 | 0.6600 | |
| NR | 0.6408 | 0.3950 | 0.4558 | 0.6016 | 0.7072 | 0.6383 | 0.7390 | 0.4248 | |
| Average | 0.8051 | 0.6795 | 0.6620 | 0.7845 | 0.7639 | 0.8034 | 0.7982 | 0.6847 |
AUC results for interaction prediction under validation setting CVS1.
| AUC | MGRNNM | standard | MC | MCG | WGRMF | RLS_WNN | CMF | NRLMF | TMF |
|---|---|---|---|---|---|---|---|---|---|
| E | 0.9798 | 0.8753 | 0.9596 | 0.9647 | 0.9635 | 0.9705 | 0.9761 | 0.8943 | |
| IC | 0.9829 | 0.9415 | 0.9539 | 0.9747 | 0.9786 | 0.9832 | 0.9838 | 0.9433 | |
| GPCR | 0.9531 | 0.8110 | 0.8977 | 0.9432 | 0.9458 | 0.9493 | 0.9620 | 0.8373 | |
| NR | 0.9083 | 0.5882 | 0.8315 | 0.8892 | 0.9329 | 0.8679 | 0.9479 | 0.5496 | |
| Average | 0.9560 | 0.8040 | 0.9107 | 0.9429 | 0.9552 | 0.9427 | 0.9674 | 0.8061 |
AUPR results for interaction prediction under validation setting CVS2.
| AUPR | MGRNNM | standard | MC | MCG | WGRMF | RLS_WNN | CMF | NRLMF | TMF |
|---|---|---|---|---|---|---|---|---|---|
| E | 0.4089 | 0.0114 | 0.0457 | 0.4019 | 0.2409 | 0.3848 | 0.3582 | 0.3748 | |
| IC | 0.3650 | 0.0473 | 0.0925 | 0.3666 | 0.3090 | 0.3538 | 0.3414 | 0.3371 | |
| GPCR | 0.4175 | 0.0404 | 0.1091 | 0.4247 | 0.3463 | 0.4059 | 0.3671 | 0.3866 | |
| NR | 0.5620 | 0.1120 | 0.2404 | 0.5695 | 0.5373 | 0.5203 | 0.5296 | 0.4912 | |
| Average | 0.8735 | 0.4384 | 0.0528 | 0.1219 | 0.4407 | 0.3584 | 0.4162 | 0.3990 | 0.3974 |
AUC results for interaction prediction under validation setting CVS2.
| AUC | MGRNNM | standard | MC | MCG | WGRMF | RLS_WNN | CMF | NRLMF | TMF |
|---|---|---|---|---|---|---|---|---|---|
| E | 0.8260 | 0.5060 | 0.7413 | 0.7982 | 0.7755 | 0.7952 | 0.8151 | 0.8204 | |
| IC | 0.7913 | 0.5512 | 0.7196 | 0.7902 | 0.7669 | 0.7576 | 0.7881 | 0.8030 | |
| GPCR | 0.8805 | 0.5855 | 0.7745 | 0.8800 | 0.8524 | 0.8067 | 0.8841 | 0.8452 | |
| NR | 0.8452 | 0.5294 | 0.6992 | 0.8615 | 0.8390 | 0.8124 | 0.8804 | 0.8435 | |
| Average | 0.9568 | 0.8357 | 0.5430 | 0.7337 | 0.8325 | 0.8085 | 0.7930 | 0.8419 | 0.8280 |
AUPR results for interaction prediction under validation setting CVS3.
| AUPR | MGRNNM | standard | MC | MCG | WGRMF | RLS_WNN | CMF | NRLMF | TMF |
|---|---|---|---|---|---|---|---|---|---|
| E | 0.8087 | 0.0124 | 0.0691 | 0.8070 | 0.5465 | 0.7808 | 0.8112 | 0.8000 | |
| IC | 0.8079 | 0.0421 | 0.2256 | 0.8128 | 0.7437 | 0.7786 | 0.7753 | 0.7893 | |
| GPCR | 0.5963 | 0.0549 | 0.1061 | 0.6093 | 0.5397 | 0.5989 | 0.5515 | 0.6001 | |
| NR | 0.4356 | 0.0850 | 0.2669 | 0.4643 | 0.4907 | 0.4774 | 0.5207 | 0.4709 | |
| Average | 0.7679 | 0.6621 | 0.0486 | 0.1669 | 0.6734 | 0.5801 | 0.6589 | 0.6646 | 0.6650 |
AUC results for interaction prediction under validation setting CVS3.
| AUC | MGRNNM | standard | MC | MCG | WGRMF | RLS_WNN | CMF | NRLMF | TMF |
|---|---|---|---|---|---|---|---|---|---|
| E | 0.9246 | 0.5234 | 0.8065 | 0.9338 | 0.9067 | 0.9272 | 0.9465 | 0.9436 | |
| IC | 0.9541 | 0.9346 | 0.4724 | 0.7871 | 0.9460 | 0.9286 | 0.9368 | 0.9476 | |
| GPCR | 0.8975 | 0.8798 | 0.5683 | 0.6289 | 0.8892 | 0.8694 | 0.8966 | 0.8735 | |
| NR | 0.7502 | 0.7263 | 0.3767 | 0.6522 | 0.7967 | 0.8124 | 0.8373 | 0.8407 | |
| Average | 0.8909 | 0.8618 | 0.4575 | 0.7486 | 0.8922 | 0.8826 | 0.9004 | 0.9217 | 0.9013 |
Fig 2Three-dimensional mesh depicting the variation of AUPR with the parameters μ1 and μ2 for drug-target interaction prediction using MGRNNM.
Fig 3Bar plots depicting that incorporating all the similarities for drugs and targets for prediction task yields best results for every dataset (a) E (b) IC (c) GPCR and (d) NR under the three cross-validation settings in comparison to the cases where each type of similarity was considered separately.
Here, “standard” represents the case when only the chemical structure similarity for drugs and genomic sequence similarity for targets were taken into account and “COMBINED” refers to the use case where all the similarity matrices (standard similarity, Cosine similarity, Correlation, Hamming similarity and Jaccard similarity) were considered.