| Literature DB >> 31541145 |
Narjes Rohani1, Changiz Eslahchi2,3.
Abstract
Drug-Drug Interaction (DDI) prediction is one of the most critical issues in drug development and health. Proposing appropriate computational methods for predicting unknown DDI with high precision is challenging. We proposed "NDD: Neural network-based method for drug-drug interaction prediction" for predicting unknown DDIs using various information about drugs. Multiple drug similarities based on drug substructure, target, side effect, off-label side effect, pathway, transporter, and indication data are calculated. At first, NDD uses a heuristic similarity selection process and then integrates the selected similarities with a nonlinear similarity fusion method to achieve high-level features. Afterward, it uses a neural network for interaction prediction. The similarity selection and similarity integration parts of NDD have been proposed in previous studies of other problems. Our novelty is to combine these parts with new neural network architecture and apply these approaches in the context of DDI prediction. We compared NDD with six machine learning classifiers and six state-of-the-art graph-based methods on three benchmark datasets. NDD achieved superior performance in cross-validation with AUPR ranging from 0.830 to 0.947, AUC from 0.954 to 0.994 and F-measure from 0.772 to 0.902. Moreover, cumulative evidence in case studies on numerous drug pairs, further confirm the ability of NDD to predict unknown DDIs. The evaluations corroborate that NDD is an efficient method for predicting unknown DDIs. The data and implementation of NDD are available at https://github.com/nrohani/NDD .Entities:
Mesh:
Year: 2019 PMID: 31541145 PMCID: PMC6754439 DOI: 10.1038/s41598-019-50121-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Performance comparison of all methods on DS1.
| Method | AUC | AUPR | F-measure | Recall | Precision |
|---|---|---|---|---|---|
| Substructure-based label propagation model | 0.937 | 0.901 | 0.804 | 0.797 | 0.811 |
| Side-effect-based label propagation model | 0.936 | 0.903 | 0.806 | 0.793 | 0.820 |
| Offside-effect-based label propagation model | 0.937 | 0.904 | 0.809 | 0.795 | 0.823 |
| Vilar’s substructure-based model | 0.936 | 0.902 | 0.804 | 0.797 | 0.812 |
| Classifier ensemble method |
|
|
| 0.827 |
|
| Weighted average ensemble method | 0.948 | 0.919 | 0.831 | 0.835 | 0.826 |
| NDD | 0.954 | 0.922 | 0.835 | 0.836 | 0.833 |
| RF | 0.830 | 0.693 | 0.666 | 0.738 | 0.607 |
| LR | 0.941 | 0.905 | 0.812 | 0.810 | 0.818 |
| Adaptive boosting | 0.722 | 0.587 | 0.558 | 0.572 | 0.546 |
| LDA | 0.935 | 0.898 | 0.801 | 0.800 | 0.803 |
| QDA | 0.857 | 0.802 | 0.751 |
| 0.638 |
| KNN | 0.730 | 0.134 | 0.080 | 0.062 | 0.098 |
The best value of each criterion is shown in bold.
Comparison of NDD performance on different similarity types.
| Method | Similarity | AUC | AUPR | F-measure | Recall | Precision |
|---|---|---|---|---|---|---|
| NDD | chemical | 0.631 | 0.455 | 0.527 | 0.899 | 0.373 |
| NDD | target | 0.787 | 0.642 | 0.617 | 0.721 | 0.540 |
| NDD | transporter | 0.682 | 0.568 | 0.519 | 0.945 | 0.358 |
| NDD | enzyme | 0.734 | 0.599 | 0.552 | 0.579 | 0.529 |
| NDD | pathway | 0.767 | 0.623 | 0.587 | 0.650 | 0.536 |
| NDD | indication |
|
|
| 0.740 |
|
| NDD | side effect | 0.778 | 0.601 | 0.619 | 0.748 | 0.528 |
| NDD | offside effect | 0.782 | 0.606 | 0.617 |
| 0.517 |
The best value of each criterion is shown in bold.
Performance comparison of all methods on DS2.
| Method | AUC | AUPR | F-measure | Recall | Precision |
|---|---|---|---|---|---|
| Substructure-based label propagation model | 0.788 | 0.208 | 0.294 | 0.537 | 0.197 |
| Vilar’s substructure-based model | 0.810 | 0.244 | 0.312 | 0.479 | 0.232 |
| Classifier ensemble method | 0.936 | 0.487 | 0.553 | 0.689 | 0.462 |
| Weighted average ensemble method | 0.646 | 0.440 | 0.15 | 0.226 | 0.118 |
| NDD |
|
|
|
|
|
| RF | 0.982 | 0.812 | 0.747 | 0.713 | 0.785 |
| LR | 0.911 | 0.251 | 0.318 | 0.397 | 0.268 |
| Adaptive boosting | 0.904 | 0.185 | 0.266 | 0.359 | 0.211 |
| LDA | 0.894 | 0.215 | 0.295 | 0.407 | 0.231 |
| QDA | 0.926 | 0.466 | 0.174 | 0.875 | 0.096 |
| KNN | 0.927 | 0.785 | 0.602 | 0.445 | 0.932 |
The best value of each criterion is shown in bold.
Performance comparison of all methods on CYP interactions of DS3.
| Method | AUC | AUPR | F-measure | Recall | Precision |
|---|---|---|---|---|---|
| Substructure-based label propagation model | 0.952 | 0.126 | 0.206 | 0.278 | 0.161 |
| Side-effect-based label propagation model | 0.953 | 0.120 | 0.199 | 0.278 | 0.156 |
| Vilar’s substructure-based model | 0.953 | 0.126 | 0.196 | 0.279 | 0.152 |
| Classifier ensemble method | 0.990 | 0.541 | 0.553 | 0.566 | 0.546 |
| Weighted average ensemble method | 0.695 | 0.484 | 0.198 | 0.201 | 0.201 |
| NDD |
|
|
|
|
|
| RF | 0.737 | 0.092 | 0.161 | 0.216 | 0.132 |
| LR | 0.977 | 0.487 | 0.524 | 0.589 | 0.475 |
| Adaptive boosting | 0.830 | 0.143 | 0.215 | 0.259 | 0.185 |
| LDA | 0.953 | 0.327 | 0.388 | 0.363 | 0.425 |
| QDA | 0.709 | 0.317 | 0.259 | 0.446 | 0.184 |
| KNN | 0.590 | 0.064 | 0.039 | 0.008 | 0.190 |
The best value of each criterion is shown in bold.
Performance comparison of all methods on NCYP interactions of DS3.
| Method | AUC | AUPR | F-measure | Recall | Precision |
|---|---|---|---|---|---|
| Substructure-based label propagation model | 0.890 | 0.159 | 0.216 | 0.379 | 0.153 |
| Side-effect-based label propagation model | 0.895 | 0.181 | 0.234 | 0.285 | 0.208 |
| Vilar’s substructure-based model | 0.904 | 0.295 | 0.248 | 0.383 | 0.183 |
| Classifier ensemble method | 0.986 | 0.756 | 0.708 | 0.702 | 0.714 |
| Weighted average ensemble method | 0.974 | 0.599 | 0.587 | 0.584 | 0.591 |
| NDD |
|
|
|
|
|
| RF | 0.889 | 0.167 | 0.242 | 0.411 | 0.168 |
| LR | 0.916 | 0.472 | 0.506 | 0.571 | 0.454 |
| Adaptive boosting | 0.709 | 0.150 | 0.193 | 0.358 | 0.141 |
| LDA | 0.889 | 0.414 | 0.456 | 0.501 | 0.419 |
| QDA | 0.536 | 0.260 | 0.132 | 0.080 | 0.387 |
| KNN | 0.603 | 0.235 | 0.134 | 0.229 | 0.094 |
The best value of each criterion is shown in bold.
Top ten predicted interactions (confirmed interactions by DrugBank database is shown in bold).
| Rank | ID1 | ID2 | Drug Name 1 | Drug Name 2 |
|---|---|---|---|---|
| 1 |
|
| Pemetrexed | Cefoxitin |
| 2 |
|
| Pemetrexed | Amoxicillin |
| 3 |
|
| Dexmedetomidine | Naloxone |
| 4 | DB00633 | DB00361 | Dexmedetomidine | Vinorelbine |
| 5 | DB00535 | DB00373 | Cefdinir | Timolol |
| 6 |
|
| Sevoflurane | Ursodeoxycholic acid |
| 7 | DB01236 | DB00415 | Sevoflurane | Ampicillin |
| 8 |
|
| Mannitol | Gemcitabine |
| 9 |
|
| Nizatidine | Methamphetamine |
| 10 | DB01136 | DB00952 | Carvedilol | Naratriptan |
Details of Benchmarks.
| Benchmark Name | Reference | Number of Drugs | Number of Pairs | Number of Interactions | Number of Non-interactions | Number of Similarities | Similarity Types |
|---|---|---|---|---|---|---|---|
| DS1 | Zhang | 548 | 300304 | 97168 | 203136 | 8 | Chemical, Target, Transporter, Enzyme, Pathway, Indication, Side effects, Offside effect |
| DS2 | Wan | 707 | 499849 | 34412 | 465437 | 1 | Chemical |
| DS3: CYP | Gottlieb | 807 | 651249 | 10078 | 641171 | 7 | GO, Target, Ligand, Chemical, PPI Distance, Side effect, ATC |
| DS3: NCYP | Gottlieb | 807 | 651249 | 40904 | 610345 | 7 | GO, Target, Ligand, Chemical, PPI Distance, Side effect, ATC |
Figure 1The scheme of NDD workflow. (a) Selecting the best subset of similarity matrices. (b) Applying SNF, a fusion method, to integrate all selected similarities into an m*m matrix where m is the number of drugs. (c) Every row in the integrated matrix, is the feature vector of its corresponding drug. (d) For each pair of the drugs, their feature vectors are concatenated in a vector and is considered as the input of a neural network. (e) The neural network is applied to the input vector to calculate the probability of interaction between input drug pair.