| Literature DB >> 32715187 |
Leon Wong1,2,3, Zhu-Hong You1,2,3, Zhen-Hao Guo1,2,3, Hai-Cheng Yi1,2,3, Zhan-Heng Chen1,2,3, Mei-Yuan Cao4.
Abstract
Analysis of miRNA-target mRNA interaction (MTI) is of crucial significance in discovering new target candidates for miRNAs. However, the biological experiments for identifying MTIs have a high false positive rate and are high-priced, time-consuming, and arduous. It is an urgent task to develop effective computational approaches to enhance the investigation of miRNA-target mRNA relationships. In this study, a novel method called MIPDH is developed for miRNA-mRNA interaction prediction by using DeepWalk on a heterogeneous network. More specifically, MIPDH extracts two kinds of features, in which a biological behavior feature is learned using a network embedding algorithm on a constructed heterogeneous network derived from 17 kinds of associations among drug, disease, and 6 kinds of biomolecules, and the attribute feature is learned using the k-mer method on sequences of miRNAs and target mRNAs. Then, a random forest classifier is trained on the features combined with the biological behavior feature and attribute feature. When implementing a 5-fold cross-validation experiment, MIPDH achieved an average accuracy, sensitivity, specificity and AUC of 75.85, 74.37, 77.33%, and 0.8044, respectively. To further evaluate the performance of MIPDH, other classifiers and feature descriptors are conducted for comparisons. MIPDH can achieve a better performance. Additionally, case studies on hsa-miR-106b-5p, hsa-let-7d-5p, and hsa-let-7e-5p are also implemented. As a result, 14, 9, and 9 out of the top 15 targets that interacted with these miRNAs were verified using the experimental literature or other databases. All these prediction results indicate that MIPDH is an effective method for predicting miRNA-target mRNA interactions.Entities:
Year: 2020 PMID: 32715187 PMCID: PMC7376568 DOI: 10.1021/acsomega.9b04195
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Figure 1Quantity distribution of biological molecule associations.
Figure 2Flowchart of the computational process of MIPDH based on the biological behavior and attribute.
5-fold Cross Validation Results Performed by RF Classifier on Integrated Features of Attribute and Behavior
| test set | accuracy (%) | sensitivity (%) | precision (%) | MCC | specificity (%) | AUC |
|---|---|---|---|---|---|---|
| 1 | 75.22 | 74.74 | 75.46 | 0.6272 | 75.70 | 0.8014 |
| 2 | 75.70 | 74.83 | 76.15 | 0.6320 | 76.56 | 0.8012 |
| 3 | 75.98 | 72.81 | 77.74 | 0.6343 | 79.15 | 0.8100 |
| 4 | 75.36 | 74.16 | 75.98 | 0.6285 | 76.56 | 0.7933 |
| 5 | 77.00 | 75.31 | 77.94 | 0.6456 | 78.69 | 0.8159 |
| average | 75.85 ± 0.63 | 74.37 ± 0.86 | 76.66 ± 1.00 | 0.6335 ± 0.0066 | 77.33 ± 1.34 | 0.8044 ± 0.0078 |
Figure 3ROC curves performed by the RF classifier based on integrated features of attribute and behavior.
5-Fold Cross Validation Results Performed by SVM Classifier on Integrated Features of Attribute and Behavior
| test set | accuracy (%) | sensitivity (%) | precision (%) | MCC | specificity (%) | AUC |
|---|---|---|---|---|---|---|
| 1 | 74.50 | 74.74 | 74.38 | 0.6200 | 74.26 | 0.7961 |
| 2 | 76.03 | 75.60 | 76.26 | 0.6355 | 76.46 | 0.8041 |
| 3 | 74.78 | 73.78 | 75.29 | 0.6228 | 75.79 | 0.8079 |
| 4 | 74.21 | 73.58 | 74.51 | 0.6172 | 74.83 | 0.7940 |
| 5 | 75.60 | 74.83 | 76.00 | 0.6311 | 76.37 | 0.8084 |
| average | 75.02 ± 0.77 | 74.51 ± 0.83 | 75.29 ± 0.85 | 0.6253 ± 0.0077 | 75.54 ± 0.97 | 0.8021 ± 0.0067 |
Figure 4ROC curves performed by the SVM classifier based on integrated features of attribute and behavior.
Figure 5ROC curves performed by the LR classifier based on integrated features of attribute and behavior.
Figure 6Performance comparison among RF, SVM, and LR models in terms of ROC curves and AUCs based on integrated features of attribute and behavior.
5-Fold Cross Validation Results Performed by the RF Classifier on Attribute Features
| test set | accuracy (%) | sensitivity (%) | precision (%) | MCC | specificity (%) | AUC |
|---|---|---|---|---|---|---|
| 1 | 73.78 | 73.29 | 74.01 | 0.6130 | 74.26 | 0.7921 |
| 2 | 74.45 | 73.58 | 74.88 | 0.6195 | 75.31 | 0.7953 |
| 3 | 74.40 | 71.95 | 75.66 | 0.6186 | 76.85 | 0.7968 |
| 4 | 74.69 | 73.49 | 75.30 | 0.6218 | 75.89 | 0.8021 |
| 5 | 75.02 | 73.10 | 76.03 | 0.6250 | 76.95 | 0.8097 |
| average | 74.47 ± 0.46 | 73.08 ± 0.66 | 75.17 ± 0.78 | 0.6196 ± 0.0044 | 75.85 ± 1.12 | 0.7992 ± 0.0069 |
5-Fold Cross Validation Results Performed by the RF Classifier on Behavior Features
| test set | accuracy (%) | sensitivity (%) | precision (%) | MCC | specificity (%) | AUC |
|---|---|---|---|---|---|---|
| 1 | 74.59 | 74.45 | 74.66 | 0.6209 | 74.74 | 0.7872 |
| 2 | 74.83 | 73.97 | 75.27 | 0.6233 | 75.70 | 0.7920 |
| 3 | 75.55 | 73.58 | 76.60 | 0.6303 | 77.52 | 0.7975 |
| 4 | 74.54 | 72.62 | 75.52 | 0.6202 | 76.46 | 0.7816 |
| 5 | 75.84 | 75.31 | 76.12 | 0.6336 | 76.37 | 0.8028 |
| average | 75.07 ± 0.59 | 73.99 ± 1.00 | 75.64 ± 0.75 | 0.6257 ± 0.0060 | 76.16 ± 1.03 | 0.7922 ± 0.0083 |
Figure 7ROC curves performed by the RF classifier based on attribute features.
Figure 8ROC curves performed by the RF classifier based on behavior features.
Figure 9Performance comparison among behavior features, attribute features, and integrated features in terms of ROC curves and AUCs based on the RF classifier.
Top 15 mRNA Related to hsa-let-7d-5p Predicted by MIPDH
| rank | mRNA | evidence | Rank | mRNA | evidence |
|---|---|---|---|---|---|
| 1 | TIMP3 | unconfirmed | 9 | FBN1 | PubMed TargetScan miRDB |
| 2 | CD44 | unconfirmed | 10 | ITGB3 | TargetScan miRDB |
| 3 | PTEN | unconfirmed | 11 | SMN1 | PubMed TargetScan |
| 4 | NCAM1 | unconfirmed | 12 | IL6R | PubMed TargetScan miRDB |
| 5 | AFTPH | unconfirmed | 13 | BACH1 | TargetScan |
| 6 | ADAM9 | unconfirmed | 14 | FAIM | TargetScan |
| 7 | BCL2L1 | TargetScan | 15 | CCNE1 | miRDB |
| 8 | MAP4K3 | TargetScan miRDB |
Top 15 mRNA Related to hsa-miR-106b-5p Predicted by MIPDH
| rank | mRNA | evidence | rank | mRNA | evidence |
|---|---|---|---|---|---|
| 1 | PPP2R5C | unconfirmed | 9 | NTRK2 | miRDB |
| 2 | FXN | miRDB | 10 | ATAT1 | PubMed |
| 3 | SLC6A4 | PubMed | 11 | FLT1 | TargetScan miRDB |
| 4 | FAS | PubMed miRDB | 12 | NLN | PubMed TargetScan miRDB |
| 5 | GPD2 | TargetScan | 13 | PBX3 | PubMed TargetScan miRDB |
| 6 | MCL1 | PubMed TargetScan miRDB | 14 | PGR | PubMed TargetScan miRDB |
| 7 | EGLN1 | TargetScan miRDB | 15 | RASA1 | PubMed TargetScan |
| 8 | PAX6 | miRDB |
Top 15 mRNA Related to hsa-let-7e-5p Predicted by MIPDH
| rank | mRNA | evidence | Rank | mRNA | evidence |
|---|---|---|---|---|---|
| 1 | CDK4 | unconfirmed | 9 | TIMP3 | PubMed |
| 2 | CALN1 | TargetScan | 10 | TRIM71 | PubMed TargetScan miRDB |
| 3 | ZBTB7A | unconfirmed | 11 | BCL2L1 | TargetScan miRDB |
| 4 | VDR | unconfirmed | 12 | TGFBR3 | PubMed TargetScan miRDB |
| 5 | IGFBP5 | unconfirmed | 13 | MDM4 | PubMed TargetScan miRDB |
| 6 | GRM3 | unconfirmed | 14 | KLF9 | TargetScan miRDB |
| 7 | ALDH5A1 | unconfirmed | 15 | PAPPA | TargetScan miRDB |
| 8 | MYC | PubMed |
5-Fold Cross Validation Results Performed by the LR Classifier on Integrated Features of Attribute and Behavior
| test set | accuracy (%) | sensitivity (%) | precision (%) | MCC | specificity (%) | AUC |
|---|---|---|---|---|---|---|
| 1 | 68.30 | 69.45 | 67.89 | 0.5669 | 67.15 | 0.7315 |
| 2 | 67.77 | 70.12 | 66.97 | 0.5627 | 65.42 | 0.7408 |
| 3 | 69.07 | 70.22 | 68.64 | 0.5726 | 67.92 | 0.7443 |
| 4 | 67.29 | 67.72 | 67.14 | 0.5598 | 66.86 | 0.7293 |
| 5 | 68.76 | 68.18 | 68.98 | 0.5703 | 69.33 | 0.7433 |
| average | 68.23 ± 0.72 | 69.13 ± 1.14 | 67.92 ± 0.89 | 0.5665 ± 0.0053 | 67.33 ± 1.44 | 0.7378 ± 0.0070 |