| Literature DB >> 31870276 |
Yi Zheng1, Hui Peng1, Xiaocai Zhang1, Zhixun Zhao1, Xiaoying Gao2, Jinyan Li3.
Abstract
BACKGROUND: Drug-drug interactions (DDIs) are a major concern in patients' medication. It's unfeasible to identify all potential DDIs using experimental methods which are time-consuming and expensive. Computational methods provide an effective strategy, however, facing challenges due to the lack of experimentally verified negative samples.Entities:
Keywords: Drug interaction prediction; Positive-unlabeled learning; Reliable negative samples
Mesh:
Year: 2019 PMID: 31870276 PMCID: PMC6929327 DOI: 10.1186/s12859-019-3214-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1F1-scores of DDI-PULearn with different PCNs. The x-axis is the PCA component number and the y-axis is the F1-score. Panel (a) shows the F1-scores for PCN between 1 and 2000, and Panel (b) is an amplification of the range [20,150] (amplification ratio = 5)
Fig. 2Prediction results using different combinations of drug features. BDPs refer to the basic drug properties namely drug chemical substructures, drug targets, and drug indications
Prediction performance comparison with the two baseline methods, namely all-negatives and random-negatives
| Method | Precision | Recall | F1-score |
|---|---|---|---|
| DDI-PULearn | 0.906 | 0.828 | 0.865 |
| Random-negatives | 0.765 | 0.676 | 0.718 |
| All-negatives | 0.709 | 0.449 | 0.550 |
Performances of DDI-PULearn and the benchmark methods evaluated by 20 runs of 3-fold cross-validation and 5-fold cross-validation
| Evaluation | Method | Precision | Recall | F1-score |
|---|---|---|---|---|
| 3-fold CV | Vilar’s substructure-based method | 0.145 | 0.535 | 0.229 |
| Vilar’s interaction-fingerprint-based method | 0.377 | 0.553 | 0.447 | |
| Zhang’s weighted average ensemble method | 0.782 | 0.703 | 0.740 | |
| Zhang’s L1 classifier ensemble method | 0.788 | 0.717 | 0.751 | |
| Zhang’s L2 classifier ensemble method | 0.784 | 0.712 | 0.746 | |
| DDI-PULearn | ||||
| 5-fold CV | Vilar’s substructure-based method | 0.145 | 0.535 | 0.229 |
| Vilar’s interaction-fingerprint-based method | 0.377 | 0.553 | 0.447 | |
| Zhang’s weighted average ensemble method | 0.775 | 0.659 | 0.712 | |
| Zhang’s L1 classifier ensemble method | 0.785 | 0.670 | 0.723 | |
| Zhang’s L2 classifier ensemble method | 0.783 | 0.665 | 0.719 | |
| DDI-PULearn |
Performance assessment of DDI-PULearn-Top and Hameed’s approaches using 10 training set and 5-fold cross-validation
| Evaluation | DDI-PULearn-Top | Hameed’s GSOM-based PUL | |
|---|---|---|---|
| SFR1 | SFR2 | ||
| Precision | 0.944 | 0.951 | 0.974 |
| Recall | 0.934 | 0.861 | 0.975 |
| F1-score | 0.939 | 0.904 | 0.974 |
Top 25 novel DDIs predicted by the proposed method DDI-PULearn
| Rank | Drug1 | Drug1 Name | Drug2 | Drug2 Name | Score |
|---|---|---|---|---|---|
| 2 | DB00512 | Vancomycin | DB00704 | Naltrexone | 1 |
| 3 | DB00704 | Naltrexone | DB00783 | Estradiol | 1 |
| 4 | DB00773 | Etoposide | DB00783 | Estradiol | 1 |
| 5 | DB01137 | Levofloxacin | DB00704 | Naltrexone | 1 |
| 6 | DB00635 | Prednisone | DB00704 | Naltrexone | 1 |
| 10 | DB00213 | Pantoprazole | DB01189 | Desflurane | 0.975 |
| 11 | DB00586 | Diclofenac | DB01037 | Selegiline | 0.975 |
| 12 | DB00398 | Sorafenib | DB00445 | Epirubicin | 0.975 |
| 13 | DB00724 | Imiquimod | DB00331 | Metformin | 0.975 |
| 16 | DB00635 | Prednisone | DB00324 | Fluorometholone | 0.975 |
| 21 | DB00959 | Methylprednisolone | DB00550 | Propylthiouracil | 0.95 |
| 23 | DB00295 | Morphine | DB00674 | Galantamine | 0.95 |
| 24 | DB01137 | Levofloxacin | DB00323 | Tolcapone | 0.95 |
| 25 | DB00591 | Fluocinolone Acetonide | DB00641 | Simvastatin | 0.95 |
(DDIs which are confirmed in DrugBank are highlighted in bold font.)
Fig. 3The framework of the proposed method. It consists of the following five components: reliable negative sample identification, feature vector representation for DDIs, PCA compression, DDI prediction, and performance evaluation. RN: reliable negative samples; PCA: principal component analysis; DDI: drug-drug interaction
Fig. 4The flow chart for the identification of reliable negative samples. OCSVM: one-class support vector machine; KNN: k-nearest neighbor; RNS: reliable negative samples; RU: remaining unlabeled