| Literature DB >> 28056782 |
Wen Zhang1,2, Yanlin Chen3, Feng Liu4, Fei Luo5,6, Gang Tian5,6, Xiaohong Li5,6.
Abstract
BACKGROUND: Drug-drug interactions (DDIs) are one of the major concerns in drug discovery. Accurate prediction of potential DDIs can help to reduce unexpected interactions in the entire lifecycle of drugs, and are important for the drug safety surveillance.Entities:
Keywords: Drug-drug interaction; Ensemble learning; Missing link prediction; Random walk
Mesh:
Year: 2017 PMID: 28056782 PMCID: PMC5217341 DOI: 10.1186/s12859-016-1415-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The descriptions about multi-source drug data
| Data type | Data | Data Source | Description |
|---|---|---|---|
| Chemical | Substructures | PubChem | 881 substructure types |
| Biological | Targets | DrugBank | 780 target types |
| Biological | Transporters | DrugBank | 78 transporter types |
| Biological | Enzymes | DrugBank | 129 enzyme types |
| Biological | Pathways | KEGG | 253 pathway types |
| Phenotypic | Indications | SIDER | 4897 indication types |
| Phenotypic | Side effects | SIDER | 4897 side effect types |
| Phenotypic | Off side effects | OFFSIDES | 9496 off side effects types |
| Network | Drug-drug interaction network | TWOSIDES | 548 drugs and 48,584 DDIs |
Fig. 1The scheme of integrating multi-source data for DDI prediction
Performances of different models evaluated by 20 runs of 5-CV
| Method | Similarity | Index | AUC | AUPR | Recall | Precision | Accuracy | F |
|---|---|---|---|---|---|---|---|---|
| Neighbor recommender | Substructure | 1 | 0.936 | 0.759 | 0.765 | 0.617 | 0.950 | 0.683 |
| Target | 2 | 0.820 | 0.365 | 0.338 | 0.548 | 0.867 | 0.418 | |
| Transporter | 3 | 0.714 | 0.329 | 0.290 | 0.389 | 0.862 | 0.331 | |
| Enzyme | 4 | 0.756 | 0.377 | 0.471 | 0.346 | 0.909 | 0.399 | |
| Pathway | 5 | 0.812 | 0.571 | 0.657 | 0.474 | 0.932 | 0.550 | |
| Indication | 6 | 0.912 | 0.599 | 0.555 | 0.591 | 0.923 | 0.572 | |
| Label | 7 | 0.936 | 0.754 | 0.750 | 0.618 | 0.949 | 0.678 | |
| Off label | 8 | 0.940 | 0.768 | 0.765 | 0.629 | 0.951 | 0.691 | |
| CN | 9 | 0.941 | 0.767 | 0.745 | 0.635 | 0.949 | 0.685 | |
| AA | 10 | 0.941 | 0.767 | 0.747 | 0.634 | 0.949 | 0.686 | |
| RA | 11 | 0.943 | 0.770 | 0.752 | 0.634 | 0.950 | 0.688 | |
| Katz | 12 | 0.937 | 0.735 | 0.707 | 0.608 | 0.944 | 0.653 | |
| ACT | 13 | 0.931 | 0.752 | 0.723 | 0.618 | 0.947 | 0.667 | |
| RWR | 14 | 0.941 | 0.766 | 0.746 | 0.634 | 0.949 | 0.685 | |
| Random walk | Substructure | 15 | 0.936 | 0.758 | 0.763 | 0.616 | 0.950 | 0.681 |
| Target | 16 | 0.852 | 0.559 | 0.596 | 0.501 | 0.927 | 0.544 | |
| Transporter | 17 | 0.713 | 0.363 | 0.297 | 0.381 | 0.864 | 0.329 | |
| Enzyme | 18 | 0.760 | 0.470 | 0.657 | 0.344 | 0.927 | 0.451 | |
| Pathway | 19 | 0.811 | 0.594 | 0.709 | 0.479 | 0.937 | 0.572 | |
| Indication | 20 | 0.941 | 0.777 | 0.768 | 0.641 | 0.952 | 0.699 | |
| Label | 21 | 0.936 | 0.760 | 0.764 | 0.621 | 0.950 | 0.685 | |
| Off label | 22 | 0.937 | 0.763 | 0.761 | 0.627 | 0.950 | 0.688 | |
| CN | 23 | 0.938 | 0.757 | 0.736 | 0.625 | 0.948 | 0.676 | |
| AA | 24 | 0.938 | 0.755 | 0.734 | 0.624 | 0.947 | 0.675 | |
| RA | 25 | 0.937 | 0.748 | 0.729 | 0.616 | 0.946 | 0.667 | |
| Katz | 26 | 0.937 | 0.750 | 0.730 | 0.619 | 0.946 | 0.669 | |
| ACT | 27 | 0.930 | 0.748 | 0.727 | 0.632 | 0.938 | 0.671 | |
| RWR | 28 | 0.939 | 0.764 | 0.742 | 0.635 | 0.949 | 0.684 | |
| Matrix perturbation method | 29 | 0.948 | 0.782 | 0.755 | 0.666 | 0.952 | 0.707 |
Performances of different models evaluated by 20 runs of 3-CV
| Method | Similarity | Index | AUC | AUPR | Recall | Precision | Accuracy | F |
|---|---|---|---|---|---|---|---|---|
| Neighbor recommender | Substructure | 1 | 0.935 | 0.808 | 0.772 | 0.669 | 0.927 | 0.717 |
| Target | 2 | 0.806 | 0.425 | 0.420 | 0.579 | 0.831 | 0.486 | |
| Transporter | 3 | 0.714 | 0.405 | 0.344 | 0.495 | 0.800 | 0.406 | |
| Enzyme | 4 | 0.753 | 0.437 | 0.466 | 0.424 | 0.853 | 0.443 | |
| Pathway | 5 | 0.810 | 0.624 | 0.674 | 0.510 | 0.898 | 0.581 | |
| Indication | 6 | 0.903 | 0.640 | 0.584 | 0.658 | 0.888 | 0.618 | |
| Label | 7 | 0.935 | 0.803 | 0.758 | 0.673 | 0.925 | 0.713 | |
| Off label | 8 | 0.939 | 0.815 | 0.771 | 0.684 | 0.928 | 0.725 | |
| CN | 9 | 0.940 | 0.816 | 0.761 | 0.691 | 0.927 | 0.724 | |
| AA | 10 | 0.941 | 0.816 | 0.761 | 0.690 | 0.927 | 0.724 | |
| RA | 11 | 0.942 | 0.819 | 0.763 | 0.691 | 0.928 | 0.725 | |
| Katz | 12 | 0.933 | 0.782 | 0.715 | 0.666 | 0.917 | 0.689 | |
| ACT | 13 | 0.866 | 0.721 | 0.629 | 0.574 | 0.915 | 0.600 | |
| RWR | 14 | 0.940 | 0.814 | 0.760 | 0.688 | 0.927 | 0.722 | |
| Random walk | Substructure | 15 | 0.935 | 0.807 | 0.768 | 0.670 | 0.927 | 0.716 |
| Target | 16 | 0.844 | 0.608 | 0.601 | 0.555 | 0.888 | 0.576 | |
| Transporter | 17 | 0.713 | 0.437 | 0.339 | 0.504 | 0.795 | 0.404 | |
| Enzyme | 18 | 0.760 | 0.533 | 0.655 | 0.374 | 0.886 | 0.476 | |
| Pathway | 19 | 0.810 | 0.648 | 0.724 | 0.515 | 0.906 | 0.601 | |
| Indication | 20 | 0.939 | 0.820 | 0.773 | 0.693 | 0.930 | 0.731 | |
| Label | 21 | 0.936 | 0.809 | 0.771 | 0.674 | 0.927 | 0.719 | |
| Off label | 22 | 0.937 | 0.811 | 0.771 | 0.680 | 0.928 | 0.722 | |
| CN | 23 | 0.937 | 0.807 | 0.748 | 0.685 | 0.925 | 0.715 | |
| AA | 24 | 0.937 | 0.806 | 0.747 | 0.683 | 0.924 | 0.714 | |
| RA | 25 | 0.936 | 0.799 | 0.741 | 0.675 | 0.923 | 0.706 | |
| Katz | 26 | 0.936 | 0.801 | 0.743 | 0.677 | 0.923 | 0.708 | |
| ACT | 27 | 0.866 | 0.706 | 0.658 | 0.699 | 0.834 | 0.643 | |
| RWR | 28 | 0.938 | 0.813 | 0.759 | 0.690 | 0.927 | 0.723 | |
| Matrix perturbation method | 29 | 0.941 | 0.813 | 0.755 | 0.709 | 0.928 | 0.731 |
Performances of ensemble model evaluated by 20 runs of 3-CV and 5-CV
| Evluation | Method | AUC | AUPR | Precision | Recall | Accuracy | F-measure |
|---|---|---|---|---|---|---|---|
| 3-CV evaluation | Weighted average ensemble method | 0.947 | 0.832 | 0.782 | 0.703 | 0.932 | 0.740 |
| Classifier ensemble method(L1) | 0.954 | 0.841 | 0.788 | 0.717 | 0.934 | 0.751 | |
| Classifier ensemble method(L2) | 0.952 | 0.839 | 0.784 | 0.712 | 0.933 | 0.746 | |
| 5-CV evaluation | Weighted average ensemble method | 0.951 | 0.795 | 0.775 | 0.659 | 0.953 | 0.712 |
| Classifier ensemble method(L1) | 0.957 | 0.807 | 0.785 | 0.670 | 0.955 | 0.723 | |
| Classifier ensemble method(L2) | 0.956 | 0.806 | 0.783 | 0.665 | 0.955 | 0.719 |
Fig. 2Weights for base predictors in the weighted average ensemble models (a) 3-CV experiments; (b) 5-CV experiments
Performances of the ensemble method and benchmark methods evaluated by 20 runs of 3-CV and 5-CV
| Evluation | Method | AUC | AUPR | Precision | Recall | Accuracy | F-measure |
|---|---|---|---|---|---|---|---|
| 3-CV evaluation | Vilar’s substructure-based model | 0.670 | 0.273 | 0.145 | 0.535 | 0.684 | 0.229 |
| Vilar’s CN index-based model | 0.872 | 0.413 | 0.377 | 0.553 | 0.880 | 0.447 | |
| Substructure-based label propagation model | 0.935 | 0.807 | 0.768 | 0.670 | 0.927 | 0.716 | |
| Side effect-based Label propagation model | 0.936 | 0.809 | 0.771 | 0.674 | 0.927 | 0.719 | |
| Off side effect-based label propagation model | 0.937 | 0.811 | 0.771 | 0.680 | 0.928 | 0.722 | |
| Weighted average ensemble method | 0.947 | 0.832 | 0.782 | 0.703 | 0.932 | 0.740 | |
| Classifier ensemble method (L1) | 0.954 | 0.841 | 0.788 | 0.717 | 0.934 | 0.751 | |
| Classifier ensemble method (L2) | 0.952 | 0.839 | 0.784 | 0.712 | 0.933 | 0.746 | |
| 5-CV evaluation | Vilar’s substructure-based model | 0.670 | 0.273 | 0.145 | 0.535 | 0.684 | 0.229 |
| Vilar’s CN index-based model | 0.872 | 0.413 | 0.377 | 0.553 | 0.880 | 0.447 | |
| Substructure-based label propagation model | 0.936 | 0.758 | 0.763 | 0.616 | 0.950 | 0.681 | |
| Side effect-based Label propagation model | 0.936 | 0.760 | 0.764 | 0.621 | 0.950 | 0.685 | |
| Off side effect-based label propagation model | 0.937 | 0.763 | 0.761 | 0.627 | 0.950 | 0.688 | |
| Weighted average ensemble method | 0.951 | 0.795 | 0.775 | 0.659 | 0.953 | 0.712 | |
| Classifier ensemble method (L1) | 0.957 | 0.807 | 0.785 | 0.670 | 0.955 | 0.723 | |
| Classifier ensemble method (L2) | 0.956 | 0.806 | 0.783 | 0.665 | 0.955 | 0.719 |
The statistical significance of performance improvements achieved by our ensemble methods
| Evaluation | Methods | Weighted average ensemble method | Classifier ensemble method(L1) | Classifier ensemble method(L2) |
|---|---|---|---|---|
| 3-CV | Vilar’s substructure-based model | 1.05E-94 | 2.67E-78 | 1.18E-86 |
| Vilar’s CN index-based model | 4.12E-74 | 7.32E-67 | 1.14E-71 | |
| Substructure-based label propagation model | 1.02E-45 | 8.30E-34 | 2.96E-41 | |
| Side effect-based Label propagation model | 1.61E-44 | 8.86E-33 | 3.28E-40 | |
| Off side effect-based label propagation model | 3.32E-42 | 1.94E-31 | 1.17E-38 | |
| 5-CV | Vilar’s substructure-based model | 4.76E-52 | 3.12E-48 | 5.42E-54 |
| Vilar’s CN index-based model | 2.27E-48 | 2.34E-44 | 1.14E-48 | |
| Substructure-based label propagation model | 1.68E-31 | 1.71E-29 | 1.28E-36 | |
| Side effect-based Label propagation model | 1.27E-30 | 6.71E-29 | 3.04E-36 | |
| Off side effect-based label propagation model | 4.03E-30 | 2.43E-28 | 1.67E-35 |
Fig. 3The number of identified testing interactions (a) top 10,000 predictions; (b) top 15, 000 predictions. 1: Vilar’s substructure-based model (6626, 7527); 2: Vilar’s CN index-based model (6667, 7639); 3: Substructure-based label propagation model (6597, 7515); 4: Side effect-based Label propagation model (6641, 7573); 5: Off side effect-based label propagation model (6693,7591); 6: Weighted average ensemble method (6923, 7842); 7: L1 Classifier ensemble method (7027, 7972); 8: L2 Classifier ensemble method (6980, 7942)
Top 20 novel interactions predicted by our method (confirmed interactions shown in bold)
| Rank | ID1 | ID2 | Drug name 1 | Drug name 2 |
|---|---|---|---|---|
| 1 |
|
|
|
|
| 2 |
|
|
|
|
| 3 | DB00945 | DB01033 | Acetylsalicylic acid | Mercaptopurine |
| 4 | DB01059 | DB00448 | Norfloxacin | Lansoprazole |
| 5 | DB00990 | DB00635 | Exemestane | Prednisone |
| 6 | DB00213 | DB00310 | Pantoprazole | Chlorthalidone |
| 7 |
|
|
|
|
| 8 | DB00658 | DB00331 | Sevelamer | Metformin |
| 9 |
|
|
|
|
| 10 | DB00537 | DB00869 | Ciprofloxacin | Dorzolamide |
| 11 |
|
|
|
|
| 12 | DB00346 | DB00630 | Alfuzosin | Alendronic acid |
| 13 | DB00535 | DB00813 | Cefdinir | Fentanyl |
| 14 | DB00334 | DB00795 | Olanzapine | Sulfasalazine |
| 15 | DB00749 | DB01142 | Etodolac | Doxepin |
| 16 |
|
|
|
|
| 17 | DB00472 | DB00214 | Fluoxetine | Torasemide |
| 18 | DB00862 | DB00407 | Vardenafil | Ardeparin |
| 19 | DB00275 | DB00959 | Olmesartan | Methylprednisolone |
| 20 |
|
|
|
|