| Literature DB >> 30255799 |
Abstract
BACKGROUND: In silico investigations on the integration of multiple datasets are in need of higher statistical power methods to unveil secondary findings that were hidden from the initial analyses. We present here a novel method for the network analysis of messenger RNA post-translational regulation by microRNA molecules. The method integrates expression data and sequence binding predictions through a set of sound machine learning techniques, forwarding all results to an ensemble graph of regulations.Entities:
Keywords: Alzheimer’s disease; Bayesian network classifiers; Ensemble graphs; Feature stability; Post-transcriptional regulation
Mesh:
Substances:
Year: 2018 PMID: 30255799 PMCID: PMC6157163 DOI: 10.1186/s12864-018-5025-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Algorithm for computing the miRNA-gene target prediction score b
| |
| |
| Extract |
| |
| |
| |
| |
Fig. 1Probabilistic graphical model of a conditional dependency. Conditional dependence structure between a microRNA and a gene, both depending on the class separation
Stability scores of weighted Spearman correlation of a pool of 1000 differentially expressed rankings computed from class-balanced 1000 bootstrapped mRNA databases
| Min. | 1 | Median | Mean | 3 | Max. | |
|---|---|---|---|---|---|---|
| T-test | -0.8050 | -0.2540 | 0.0683 | 0.0044 | 0.2860 | 0.6570 |
| LIMMA | -0.7960 | -0.2160 | 0.1030 | 0.0346 | 0.3140 | 0.6690 |
| SAM | -0.7910 | -0.1740 |
|
|
|
|
| Permutation | -0.7990 | -0.2700 | 0.0508 | -0.0117 | 0.2610 | 0.6340 |
| Wilcoxon |
|
| 0.0077 | -0.0360 | 0.2350 | 0.6660 |
Values of -1 or 1 correspond to the highest concordance and 0 to the lowest. Highlighted in bold are the highest absolute values
Statistics of microRNA-gene binding site predictions
| Predictions | Unique | Gene | MicroRNA | |
|---|---|---|---|---|
| TargetScan | 30,703 | 22,173 | GPR26 (101) | miR-520d-5p (403) |
| doRiNA | – | 1,395 | LCOR (16) | miR-137 (73) |
| DIANA | 47,114 | 27,967 | PSD3 (97) | miR-495 (438) |
| miRanda | 41,155 | 24,437 | UHRF2 (89) | miR-186 (419) |
| PITA | 86,996 | 41,600 | LONRF2 (130) | miR-186 (517) |
Column Predictions shows the total number of predicted interactions, with repetitions removed in column Unique. Gene and MicroRNA columns include which gene and microRNA received the highest number of interactions
Top ten mRNA-gene edges ranked based on structural and functional scores
| Structural | Functional | ||||
|---|---|---|---|---|---|
| MicroRNA | Gene |
| MicroRNA | Gene |
|
| miR-184 | PPP1CC | 1.0000 | miR-106a | KIF1B | 0.0206 |
| miR-215 | PABPC4 | 1.0000 | miR-106a | ZCCHC2 | 0.0203 |
| miR-504 | GRM3 | 1.0000 | miR-106a | EFHA2 | 0.0201 |
| miR-142-3p | GNB2 | 1.0000 | miR-106a | EPB41L1 | 0.0196 |
| miR-142-3p | PSRC1 | 1.0000 | miR-106a | THRB | 0.0193 |
| miR-328 | ZNF423 | 1.0000 | miR-106a | WASL | 0.0192 |
| let-7c | ABCC10 | 1.0000 | miR-106a | ACPL2 | 0.0191 |
| miR-142-3p | XPO1 | 1.0000 | miR-106a | N4BP1 | 0.0191 |
| miR-504 | CEP170 | 1.0000 | miR-106a | KLHDC5 | 0.0190 |
| let-7c | CYP46A1 | 1.0000 | miR-106a | CAP2 | 0.0189 |
Fig. 2Performance estimation in classification. Average validation metrics for all ensemble Bayesian network classifiers up to 100 edges. From left to right and top to bottom: AUC, Accuracy, BIC, and Log-likelihood a Area under ROC curve b Accuracy c BIC score d Log-likelihood
Fig. 3Optimal ensemble structure of post-transcriptional microRNA–mRNA regulations. Optimal ensemble classification structure comprised by 13 edges, connecting 7 miRNAs and 13 genes. Labels over edges include the pair of structural and functional weights (b,d) for each dependence