| Literature DB >> 35910223 |
Yong-Jian Guan1, Chang-Qing Yu1, Li-Ping Li1,2, Zhu-Hong You3, Zhong-Hao Ren1, Jie Pan4, Yue-Chao Li1.
Abstract
As a novel target in pharmacy, microRNA (miRNA) can regulate gene expression under specific disease conditions to produce specific proteins. To date, many researchers leveraged miRNA to reveal drug efficacy and pathogenesis at the molecular level. As we all know that conventional wet experiments suffer from many problems, including time-consuming, labor-intensity, and high cost. Thus, there is an urgent need to develop a novel computational model to facilitate the identification of miRNA-drug interactions (MDIs). In this work, we propose a novel bipartite network embedding-based method called BNEMDI to predict MDIs. First, the Bipartite Network Embedding (BiNE) algorithm is employed to learn the topological features from the network. Then, the inherent attributes of drugs and miRNAs are expressed as attribute features by MACCS fingerprints and k-mers. Finally, we feed these features into deep neural network (DNN) for training the prediction model. To validate the prediction ability of the BNEMDI model, we apply it to five different benchmark datasets under five-fold cross-validation, and the proposed model obtained excellent AUC values of 0.9568, 0.9420, 0.8489, 0.8774, and 0.9005 in ncDR, RNAInter, SM2miR1, SM2miR2, and SM2miR MDI datasets, respectively. To further verify the prediction performance of the BNEMDI model, we compare it with some existing powerful methods. We also compare the BiNE algorithm with several different network embedding methods. Furthermore, we carry out a case study on a common drug named 5-fluorouracil. Among the top 50 miRNAs predicted by the proposed model, there were 38 verified by the experimental literature. The comprehensive experiment results demonstrated that our method is effective and robust for predicting MDIs. In the future work, we hope that the BNEMDI model can be a reliable supplement method for the development of pharmacology and miRNA therapeutics.Entities:
Keywords: BiNE; MACCS fingerprint; deep neural network; k-mer; miRNA–drug interaction
Year: 2022 PMID: 35910223 PMCID: PMC9334674 DOI: 10.3389/fgene.2022.919264
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Flowchart of the BNEMDI model for predicting potential MDIs.
Statistics of miRNAs, drugs, and miRNA–drug interactions in five datasets.
| Dataset | ncDR | RNAInter | SM2miR1 | SM2miR2 | SM2miR3 |
|---|---|---|---|---|---|
| Drug | 95 | 281 | 86 | 113 | 142 |
| miRNA | 624 | 1,009 | 358 | 536 | 645 |
| Interaction | 4,457 | 5,739 | 1,110 | 1,697 | 1,940 |
FIGURE 2Diagram of k-mer for extracting attribute sequences from miRNA sequences.
FIGURE 3Diagram of MACCS fingerprint-represented drug substructures.
Performance of the proposed method on five datasets.
| Fold | AUC | AUPR (%) | Acc (%) | Sen (%) | Spec (%) | Prec (%) | MCC (%) |
|---|---|---|---|---|---|---|---|
| ncDR | 0.9568 ± 0.0010 | 95.65 ± 0.13 | 88.75 ± 0.10 | 89.13 ± 0.19 | 88.39 ± 0.13 | 88.47 ± 0.11 | 77.51 ± 0.20 |
| RNAInter | 0.9420 ± 0.0016 | 93.88 ± 0.16 | 87.23 ± 0.34 | 88.99 ± 1.05 | 85.47 ± 1.30 | 85.98 ± 0.94 | 74.52 ± 0.65 |
| SM2miR1 | 0.8489 ± 0.0021 | 84.61 ± 0.25 | 77.24 ± 0.39 | 80.82 ± 0.33 | 73.66 ± 0.68 | 75.42 ± 0.49 | 54.62 ± 0.78 |
| SM2miR2 | 0.8774 ± 0.0023 | 87.04 ± 0.17 | 79.92 ± 0.21 | 81.12 ± 0.33 | 78.73 ± 0.23 | 79.22 ± 0.19 | 59.86 ± 0.42 |
| SM2miR3 | 0.9005 ± 0.0026 | 89.34 ± 0.20 | 81.86 ± 0.65 | 79.47 ± 1.42 | 84.24 ± 2.05 | 83.49 ± 1.62 | 63.81 ± 1.37 |
FIGURE 4Prediction performance of BNEMDI based on ROC and PR curves. (A) Five-fold cross-validation ROC and PR curves on the ncDR dataset. (B) Five-fold cross-validation ROC and PR curves on the RNAInter dataset. (C) Five-fold cross-validation ROC and PR curves on the SM2miR1 dataset. (D) Five-fold cross-validation ROC and PR curves on the SM2miR2 dataset. (E) Five-fold cross-validation ROC and PR curves on the SM2miR3 dataset.
Comparison of the prediction performance based on the ncDR dataset (N/A means not available).
| Method | ncDR | RNAInter | SM2miR1 | SM2miR2 | SM2miR3 |
|---|---|---|---|---|---|
| GCMDR | 0.9359 ± 0.0006 | N/A | N/A | N/A | N/A |
| EPLMI | 0.8971 ± 0.0009 | N/A | N/A | N/A | N/A |
| Neighbor-based CF | 0.8644 ± 0.0009 | 0.8532 ± 0.0007 | 0.6289 ± 0.0017 | 0.7346 ± 0.0027 | 0.8654 ± 0.0015 |
| Drug-based CF | 0.7313 ± 0.0008 | 0.7120 ± 0.0010 | 0.6982 ± 0.0026 | 0.6993 ± 0.0013 | 0.7030 ± 0.0016 |
| miRNA-based CF | 0.8235 ± 0.0015 | 0.8364 ± 0.0022 | 0.6325 ± 0.0019 | 0.6534 ± 0.0014 | 0.7644 ± 0.0009 |
| SVD-based MF | 0.6007 ± 0.0052 | 0.6189 ± 0.0044 | 0.5978 ± 0.0050 | 0.6039 ± 0.0051 | 0.6045 ± 0.0045 |
| BNEMDI | 0.9568 ± 0.0010 | 0.9420 ± 0.0016 | 0.8489 ± 0.0021 | 0.8774 ± 0.0023 | 0.9005 ± 0.0026 |
FIGURE 5Prediction performance of different features on different datasets.
FIGURE 6AUC and AUPR of four network embedding methods in different dimensions.
Average performance of the different classifiers on ncDR datasets.
| Classifier | AUC | AUPR (%) | Acc (%) | Sen (%) | Spec (%) | Prec (%) | MCC (%) |
|---|---|---|---|---|---|---|---|
| NB | 0.9166 ± 0.0035 | 90.16 ± 0.53 | 86.49 ± 0.54 | 81.69 ± 1.22 | 91.30 ± 0.87 | 90.38 ± 0.81 | 73.34 ± 1.05 |
| SVM | 0.9415 ± 0.0033 | 92.93 ± 0.57 | 86.84 ± 0.63 | 85.91 ± 0.38 | 87.77 ± 1.21 | 87.55 ± 1.08 | 73.70 ± 1.28 |
| LR | 0.9473 ± 0.0029 | 94.27 ± 0.48 | 87.56 ± 0.77 | 86.56 ± 0.32 | 88.56 ± 1.45 | 88.34 ± 1.32 | 75.14 ± 1.56 |
| RF | 0.9502 ± 0.0036 | 94.42 ± 0.56 | 88.38 ± 0.87 | 88.18 ± 0.88 | 88.58 ± 1.79 | 88.56 ± 1.58 | 76.77 ± 1.76 |
| BNEMDI | 0.9573 ± 0.0009 | 95.65 ± 0.13 | 88.75 ± 0.10 | 89.13 ± 0.19 | 88.39 ± 0.13 | 88.47 ± 1.11 | 77.51 ± 0.20 |
FIGURE 7Comparison of BNEMDI with different classifiers under five-fold cross-validation. (A) ROC curve on the ncDR MDI dataset. (B) PR curve on the ncDR MDI dataset.
Top 30 potential MDIs predicted by BNEMDI.
| Drug (CID) | miRNA | Evidence | Drug (CID) | miRNA | Evidence |
|---|---|---|---|---|---|
| 60750 | hsa-miR-24-3p | Unconfirmed | 60750 | hsa-miR-29c-3p | 29807360 |
| 60750 | hsa-miR-205-5p | 31602229 | 2767 | hsa-miR-1236-3p | 30805558 |
| 2767 | hsa-miR-193b-3p | 27918099 | 5310940 | hsa-miR-660-5p | Unconfirmed |
| 3385 | hsa-miR-10a-5p | Unconfirmed | 60750 | hsa-miR-532-5p | Unconfirmed |
| 3385 | hsa-miR-33b-5p | Unconfirmed | 31703 | hsa-miR-18a-5p | Unconfirmed |
| 3385 | hsa-miR-376a-3p | Unconfirmed | 3385 | hsa-miR-431-5p | Unconfirmed |
| 2520 | hsa-miR-126-3p | Unconfirmed | 5310940 | hsa-miR-196a-5p | Unconfirmed |
| 60750 | hsa-miR-124-3p | 35127724 | 5310940 | hsa-miR-101-3p | 31934027 |
| 3385 | hsa-miR-93-5p | 30573973 | 60750 | hsa-miR-1908-5p | Unconfirmed |
| 31703 | hsa-miR-19b-3p | 30343695 | 6857599 | hsa-miR-200c-3p | 25757925 |
| 3385 | hsa-miR-32-5p | 29530052 | 36314 | hsa-miR-141-3p | 26025631 |
| 2767 | hsa-miR-363-3p | 25416050 | 119307 | hsa-miR-181d-5p | Unconfirmed |
| 5310940 | hsa-miR-373-3p | Unconfirmed | 3385 | hsa-miR-620 | Unconfirmed |
| 3385 | hsa-miR-576-5p | Unconfirmed | 3385 | hsa-miR-9-3p | Unconfirmed |
| 36462 | hsa-miR-21-5p | 23834154 | 5310940 | hsa-miR-128-3p | 30890168 |
| 60750 | hsa-miR-24-3p | Unconfirmed | 60750 | hsa-miR-29c-3p | 29807360 |
| 60750 | hsa-miR-205-5p | 31602229 | 2767 | hsa-miR-1236-3p | 30805558 |
| 2767 | hsa-miR-193b-3p | 27918099 | 5310940 | hsa-miR-660-5p | Unconfirmed |
| 3385 | hsa-miR-10a-5p | Unconfirmed | 60750 | hsa-miR-532-5p | Unconfirmed |
| 3385 | hsa-miR-33b-5p | Unconfirmed | 31703 | hsa-miR-18a-5p | Unconfirmed |
| 3385 | hsa-miR-376a-3p | Unconfirmed | 3385 | hsa-miR-431-5p | Unconfirmed |
| 2520 | hsa-miR-126-3p | Unconfirmed | 5310940 | hsa-miR-196a-5p | Unconfirmed |
The CID of PubChem is used to indicate known MDIs in the RNAInter dataset. The first column records the top 1–25 MDIs. The second column records the top 26–50 MDIs. The evidence is indicated by the PubMed ID of the experimental literature.
Top 50 associated miRNA of drug 5-FU predicted by BNEMDI.
| Drug (CID) | miRNA | Evidence | Drug (CID) | miRNA | Evidence |
|---|---|---|---|---|---|
| 3385 | hsa-miR-21-5p | 31918721 | 3385 | hsa-miR-181b-5p | Unconfirmed |
| 3385 | hsa-miR-221-3p | 27726102 | 3385 | hsa-miR-26b-5p | 30662808 |
| 3385 | hsa-miR-126-3p | Unconfirmed | 3385 | hsa-miR-194-5p | 30451820 |
| 3385 | hsa-miR-200c-3p | 28411308 | 3385 | hsa-miR-103a-3p | 27247088 |
| 3385 | hsa-miR-222-3p | 19956872 | 3385 | hsa-miR-208a-3p | Unconfirmed |
| 3385 | hsa-let-7c-5p | 33051247 | 3385 | hsa-miR-18a-5p | 32884453 |
| 3385 | hsa-miR-214-3p | Unconfirmed | 3385 | hsa-miR-20b-5p | 27878272 |
| 3385 | hsa-miR-155-5p | 30741544 | 3385 | hsa-miR-663a | confirmed |
| 3385 | hsa-miR-93-5p | 32426273 | 3385 | hsa-miR-145-5p | 32801865 |
| 3385 | hsa-miR-18b-5p | 25990502 | 3385 | hsa-miR-24-3p | 31646794 |
| 3385 | hsa-miR-143-3p | 19843160 | 3385 | hsa-miR-19a-3p | 24460313 |
| 3385 | hsa-miR-181a-3p | 29795190 | 3385 | hsa-let-7a-5p | 35071455 |
| 3385 | hsa-miR-16-5p | 18449891 | 3385 | hsa-miR-4661-3p | Unconfirmed |
| 3385 | hsa-miR-27b-3p | 24401318 | 3385 | hsa-miR-27a-3p | 24401318 |
| 3385 | hsa-miR-107 | 26636340 | 3385 | hsa-miR-200b-3p | 32714549 |
| 3385 | hsa-miR-34c-5p | Unconfirmed | 3385 | hsa-miR-9-5p | Unconfirmed |
| 3385 | hsa-miR-17-5p | 32426273 | 3385 | hsa-miR-101-3p | 34086111 |
| 3385 | hsa-miR-34a-5p | 31802650 | 3385 | hsa-miR-196a-5p | Unconfirmed |
| 3385 | hsa-miR-125b-5p | 28176874 | 3385 | hsa-miR-200a-3p | 28496200 |
| 3385 | hsa-miR-497-5p | 26673620 | 3385 | hsa-miR-802 | Unconfirmed |
| 3385 | hsa-miR-29b-3p | 34155879 | 3385 | hsa-miR-197-3p | 26055341 |
| 3385 | hsa-miR-20a-5p | 31760170 | 3385 | hsa-miR-30b-5p | miR-30b |
| 3385 | hsa-miR-1915-3p | Unconfirmed | 3385 | hsa-miR-181b-2-3p | Unconfirmed |
| 3385 | hsa-miR-210-3p | 31468617 | 3385 | hsa-miR-100-5p | Unconfirmed |
| 3385 | hsa-miR-25-3p | 35014676 | 3385 | hsa-miR-153-3p | Unconfirmed |
The CID of PubChem is used to indicate known MDIs in the RNAInter dataset. The first column records the top 1–25 MDIs. The second column records the top 26–50 MDIs. The evidence is indicated by the PubMed ID of the experimental literature.