| Literature DB >> 35625468 |
Lei Wang1,2, Leon Wong1, Zhan-Heng Chen3, Jing Hu2, Xiao-Fei Sun2, Yang Li4, Zhu-Hong You1,5.
Abstract
The key to new drug discovery and development is first and foremost the search for molecular targets of drugs, thus advancing drug discovery and drug repositioning. However, traditional drug-target interactions (DTIs) is a costly, lengthy, high-risk, and low-success-rate system project. Therefore, more and more pharmaceutical companies are trying to use computational technologies to screen existing drug molecules and mine new drugs, leading to accelerating new drug development. In the current study, we designed a deep learning computational model MSPEDTI based on Molecular Structure and Protein Evolutionary to predict the potential DTIs. The model first fuses protein evolutionary information and drug structure information, then a deep learning convolutional neural network (CNN) to mine its hidden features, and finally accurately predicts the associated DTIs by extreme learning machine (ELM). In cross-validation experiments, MSPEDTI achieved 94.19%, 90.95%, 87.95%, and 86.11% prediction accuracy in the gold-standard datasets enzymes, ion channels, G-protein-coupled receptors (GPCRs), and nuclear receptors, respectively. MSPEDTI showed its competitive ability in ablation experiments and comparison with previous excellent methods. Additionally, 7 of 10 potential DTIs predicted by MSPEDTI were substantiated by the classical database. These excellent outcomes demonstrate the ability of MSPEDTI to provide reliable drug candidate targets and strongly facilitate the development of drug repositioning and drug development.Entities:
Keywords: convolutional neural network; deep learning; drug–target interactions; extreme learning machine
Year: 2022 PMID: 35625468 PMCID: PMC9138588 DOI: 10.3390/biology11050740
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Statistical information for the four gold-standard datasets: the number of target proteins, drugs, and interaction pairs. Sparsity is the ratio of positive DTIs to all possible interactions.
| Dataset | Target Proteins | Drugs | Interactions | Sparsity |
|---|---|---|---|---|
| Enzymes | 664 | 445 | 2926 | 0.0099 |
| Ion Channels | 204 | 210 | 1467 | 0.0344 |
| GPCRs | 95 | 223 | 635 | 0.0299 |
| Nuclear Receptors | 26 | 54 | 90 | 0.0641 |
Figure 1Schematic diagram of the structure of CNN.
MSPEDTI outcomes for 5CV on enzyme dataset.
| Test Set | Accu. (%) | Sen. (%) | Prec. (%) | MCC (%) | AUC (%) |
|---|---|---|---|---|---|
| 1 | 94.87 | 91.23 | 98.75 | 90.04 | 95.12 |
| 2 | 94.27 | 93.14 | 95.26 | 88.57 | 94.77 |
| 3 | 93.85 | 89.80 | 97.78 | 87.99 | 94.32 |
| 4 | 94.02 | 93.07 | 94.71 | 88.04 | 93.98 |
| 5 | 93.94 | 92.33 | 95.15 | 87.91 | 93.68 |
| Average |
|
|
|
|
|
MSPEDTI outcomes for 5CV on ion channel dataset.
| Test Set | Accu. (%) | Sen. (%) | Prec. (%) | MCC (%) | AUC (%) |
|---|---|---|---|---|---|
| 1 | 90.17 | 88.44 | 91.55 | 80.38 | 89.99 |
| 2 | 89.83 | 90.70 | 89.51 | 79.65 | 90.14 |
| 3 | 92.20 | 90.26 | 94.56 | 84.50 | 91.66 |
| 4 | 90.51 | 91.86 | 89.44 | 81.05 | 90.46 |
| 5 | 92.06 | 90.27 | 93.73 | 84.18 | 92.15 |
| Average |
|
|
|
|
|
MSPEDTI outcomes for 5CV on GPCR dataset.
| Test Set | Accu. (%) | Sen. (%) | Prec. (%) | MCC (%) | AUC (%) |
|---|---|---|---|---|---|
| 1 | 86.61 | 92.68 | 82.01 | 73.89 | 85.37 |
| 2 | 89.76 | 95.74 | 87.10 | 79.53 | 91.90 |
| 3 | 88.98 | 95.58 | 82.44 | 78.82 | 88.46 |
| 4 | 88.19 | 92.86 | 84.78 | 76.74 | 89.39 |
| 5 | 86.22 | 93.94 | 82.12 | 73.07 | 85.00 |
| Average |
|
|
|
|
|
MSPEDTI outcomes for 5CV on nuclear receptor dataset.
| Test Set | Accu. (%) | Sen. (%) | Prec. (%) | MCC (%) | AUC (%) |
|---|---|---|---|---|---|
| 1 | 91.67 | 86.96 | 100.00 | 84.05 | 94.98 |
| 2 | 80.56 | 85.71 | 70.59 | 61.51 | 84.74 |
| 3 | 88.89 | 85.00 | 94.44 | 78.26 | 85.63 |
| 4 | 83.33 | 83.33 | 83.33 | 66.67 | 83.02 |
| 5 | 86.11 | 86.67 | 81.25 | 71.81 | 84.76 |
| Average |
|
|
|
|
|
Figure 2ROC of 5CV mapped by MSPEDTI on enzyme dataset.
Figure 3ROC of 5CV mapped by MSPEDTI on ion channel dataset.
Figure 4ROC of 5CV mapped by MSPEDTI on GPCR dataset.
Figure 5ROC of 5CV mapped by MSPEDTI on nuclear receptor dataset.
Comparison results of the 2DPCA descriptor model and MSPEDTI on ion channel.
| Test Set | Accu. (%) | Sen. (%) | Prec. (%) | MCC (%) | AUC (%) |
|---|---|---|---|---|---|
| 1 | 84.75 | 84.90 | 84.90 | 69.49 | 86.41 |
| 2 | 82.03 | 82.31 | 80.00 | 64.02 | 81.24 |
| 3 | 82.37 | 82.84 | 82.84 | 64.72 | 83.35 |
| 4 | 80.68 | 84.23 | 78.93 | 61.47 | 81.22 |
| 5 | 82.77 | 82.00 | 83.67 | 65.56 | 83.12 |
| Average |
|
|
|
|
|
| MSPEDTI |
|
|
|
|
|
Figure 6ROC curves plotted by the 2DPCA descriptor model on ion channel.
Comparison outcomes of SVM model and MSPEDTI on ion channel.
| Test Set | Accu. (%) | Sen. (%) | Prec. (%) | MCC (%) | AUC (%) |
|---|---|---|---|---|---|
| 1 | 85.76 | 90.14 | 81.42 | 71.81 | 85.08 |
| 2 | 85.93 | 89.04 | 82.70 | 71.94 | 87.90 |
| 3 | 85.76 | 87.34 | 84.04 | 71.46 | 84.80 |
| 4 | 86.61 | 89.49 | 83.73 | 73.34 | 87.10 |
| 5 | 88.34 | 89.26 | 87.41 | 76.70 | 88.33 |
| Average |
|
|
|
|
|
| MSPEDTI |
|
|
|
|
|
Figure 7ROC curves plotted by the SVM classifier model on ion channel.
Comparison of AUC with previous methods in the gold-standard dataset.
| Method | Enzymes | Ion Channels | GPCRs | Nuclear Receptors |
|---|---|---|---|---|
| SIMCOMP | 86.30 | 77.60 | 86.70 | 85.60 |
| NLCS | 83.70 | 75.30 | 85.30 | 81.50 |
| Temerinac-Ott | 83.20 | 79.90 | 85.70 | 82.40 |
| Yamanishi | 82.10 | 69.20 | 81.10 | 81.40 |
| KBMF2K | 83.20 | 79.90 | 85.70 | 82.40 |
| WNN-GIP | 86.10 | 77.50 | 87.20 | 83.90 |
| DBSI | 80.75 | 80.29 | 80.22 | 75.78 |
| NetCBP | 82.51 | 80.34 | 82.35 | 83.94 |
| MSPEDTI |
|
|
|
|
Top 10 DTI pairs predicted by MSPEDTI.
| Drug ID | Drug Name | Taregt Protein ID | Target Protein Name | Validation Source |
|---|---|---|---|---|
| D00951 | Medroxyprogesteroneacetate | hsa2099 | ESR1_HUMAN | SuperTarget |
| D00542 | Bromochlorotrifluoroethane | hsa1571 | CP2E1_HUMAN | SuperTarget |
| D03365 | Transdermal Nicotine | hsa1137 | ACHA4_HUMAN | SuperTarget |
| D00049 | Nikotinsaeure | hsa 8843 | G109B_HUMAN | SuperTarget |
| D00160 | Epsilcapramine | hsa7298 | TYSY_HUMAN | unconfirmed |
| D00771 | Chlorzoxazone | hsa1374 | CPT1A_HUMAN | unconfirmed |
| D00139 | Xanthotoxine | hsa1543 | CP1A1_HUMAN | SuperTarget |
| D00964 | Letrozole | hsa1215 | CMA1_HUMAN | unconfirmed |
| D00585 | Mifepristone | hsa2099 | ESR1_HUMAN | SuperTarget |
| D00437 | Nifedipine Monohydrochloride | hsa1559 | CP2C9_HUMAN | SuperTarget |