| Literature DB >> 35679295 |
Mert Cihan1, Miguel A Andrade-Navarro1.
Abstract
Gene activity is controlled by multiple molecular mechanisms, for instance through transcription factors or by microRNAs (miRNAs), among others. Established bioinformatics tools for the prediction of miRNA target genes face the challenge of ensuring accuracy, due to high false positive rates. Further, these tools present poor overlap. However, we demonstrated that it is possible to filter good predictions of miRNA targets from the bulk of all predictions by using information from the gene regulatory network. Here, we take advantage of this strategy that selects a large subset of predicted microRNA binding sites as more likely to possess less false-positives because of their over-representation in RE1 silencing transcription factor (REST)-regulated genes from the background of TargetScanHuman 7.2 predictions to identify useful features for the prediction of microRNA targets. These enriched miRNA families would have silencing activity for neural transcripts overlapping the repressive activity on neural genes of REST. We analyze properties of associated microRNA binding sites and contrast the outcome to the background. We found that the selected subset presents significant differences respect to the background: (i) lower GC-content in the vicinity of the predicted miRNA binding site, (ii) more target genes with multiple identical microRNA binding sites and (iii) a higher density of predicted microRNA binding sites close to the 3' terminal end of the 3'-UTR. These results suggest that network selection of miRNA-mRNA pairs could provide useful features to improve microRNA target prediction.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35679295 PMCID: PMC9182691 DOI: 10.1371/journal.pone.0269731
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Descriptive statistics and p-values for parsed properties of REST-associated miRNA-target gene pairs (REST) and TargetScanHuman miRNA-target gene pairs (TSH) or TargetScanHuman miRNA-target gene pairs in REST 3’-UTRs (TSH-REST) (see Materials and Methods for details).
| Analysis | Dataset | N | Mean | Std. deviation | p-value |
|---|---|---|---|---|---|
| 3’-UTR length | REST | 2781 | 4635 nt | 3377 nt | <0.001 |
| TSH | 12989 | 2482 nt | 2098 nt | ||
| Distance from 3’-UTR start to miRNA binding site | REST | 17325 | 1979 nt | 2480 nt | <0.001 |
| TSH | 103467 | 1648 nt | 1926 nt | ||
| Distance from miRNA binding site to 3’-UTR end | REST | 17325 | 2650 nt | 2713 nt | <0.001 |
| TSH | 103467 | 2226 nt | 2261 nt | ||
| Position of miRNA binding site (relative) | REST | 17325 | 0.437 | 0.325 | 0.012 |
| TSH | 103467 | 0.444 | 0.323 | ||
| GC-content of 3’-UTRs | REST | 2781 | 0.390 | 0.069 | <0.001 |
| TSH | 12989 | 0.441 | 0.095 | ||
| Distance between multiple miRNA binding sites | REST | 2689 | 1886 nt | 2978 nt | <0.001 |
| TSH (REST 3’-UTRs) | 25946 | 1932 nt | 2594 nt | <0.001 | |
| TSH | 11127 | 1451 nt | 2014 nt | ||
| GC-content between multiple miRNA binding sites | REST | 2689 | 0.363 | 0.087 | <0.001 |
| TSH (REST 3’-UTRs) | 25946 | 0.391 | 0.094 | <0.001 | |
| TSH | 11127 | 0.402 | 0.108 |
Fig 1Properties of REST-associated miRNA-target gene pairs (REST) and TargetScanHuman miRNA-target gene pairs (TSH) or TargetScanHuman miRNA-target gene pairs in REST 3’-UTRs (TSH-REST).
(A, B) Distribution of 3’-UTR length. (C, D) Distance from 3’-UTR start to miRNA binding site. (E, F) Distance from miRNA binding site to 3’-UTR end. (G, H) Relative position of miRNA binding site to the 3’-UTR length. Left side: the box plots indicate median, second and third quartile, mean (white dot) and standard deviation (whiskers). Right side: kde = kernel density of the corresponding distribution.
Fig 2GC-content in the vicinity of predicted miRNA binding for subsets of miRNA-mRNA pairs.
(A) GC-content relative to the distance around the miRNA binding site for REST pairs, TSH pairs and TSH pairs in the 3’-UTRs of REST regulated genes (TSH pairs in REST 3’-UTRs). (B) Background GC content for REST 3’-UTRs and TSH 3’-UTRs (see Methods for details). The error bars indicate the 99% confidence interval. (C) Kernel density estimation for the distribution of GC-content values between pairs of miRNA binding sites for the same miRNA family. Values obtained from TSH pairs, TSH pairs in REST 3’-UTRs and REST pairs.
Fig 3Illustration of p-value calculation by conducting 10,000 random tests for the statistical comparison of two sample means.