| Literature DB >> 17524134 |
Davide Corà1, Ferdinando Di Cunto, Michele Caselle, Paolo Provero.
Abstract
BACKGROUND: 3' untranslated regions (3' UTRs) contain binding sites for many regulatory elements, and in particular for microRNAs (miRNAs). The importance of miRNA-mediated post-transcriptional regulation has become increasingly clear in the last few years.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17524134 PMCID: PMC1904458 DOI: 10.1186/1471-2105-8-174
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Flow-chart of the proposed methodology. Flow-chart of the proposed methodology: from a list of human 3' UTR exons to a list of putative 3' UTR regulatory elements.
Nucleotide composition of 3' UTR regions.
| Human | Mouse | |
| A | 0.2683 | 0.2638 |
| C | 0.2199 | 0.2237 |
| G | 0.2210 | 0.2254 |
| T | 0.2908 | 0.2871 |
The base frequencies of 3' UTR regions in human and mouse, excluding the masked repeats.
Figure 2Distribution of strand-asymmetry . The distribution of the absolute value of z, defined in the text as a measure of strand asymmetry among all possible 7-mers in 3' UTR regions(red) and in 3000 bp upstream of the TSS (grey).
Comparison between the results of our computational approach and seed regions of known miRNAs.
| Method | 7-mers identified | corresponding to known miRNA | expected by chance | P-value |
| 465 | 74 | 28.9 | 5.0 · 10-14 | |
| 214 | 41 | 13.3 | 7.9 · 10-11 | |
| 113 | 19 | 7.01 | 6.3 · 10-5 | |
| 59 | 14 | 3.66 | 1.1 · 10-5 | |
| 601 | 94 | 37.3 | 4.8 · 10-17 |
The rows indicate the computational methods as described in the text: CO = conserved overrepresentation; SA = strand asymmetry; CSA = conserved strand asymmetry (oligos displaying strand asymmetry in both human and mouse); CO∩CSA oligos identified by both CO and CSA;CO∪SA: oligos identified by either conserved overrepresentation or strand asymmetry (human) or both. The columns are (1) Number of oligos identified computationally; (2) Number of these matching the seed region of known human miRNAs; (3) number of such matches expected by chance; (4) number of different known human miRNAs putatively binding the oligos (many 7-oligos occur in the seed region of more than one miRNA) (5) P-value from exact Fisher test, taking into account that there are 16384 possible 7-mers 1017 of which are seed regions of known miRNAs.
7-mers identified by all methods matching the seed region of known human miRNAs
| AGCACAA | hsa-miR-218 |
| CTTTGTA1 | hsa-miR-524* hsa-miR-520d* |
| GCACTTT | hsa-miR-520d hsa-miR-93 hsa-miR-106a hsa-miR-520h |
| hsa-miR-520a hsa-miR-520e hsa-miR-519b hsa-miR-20a | |
| hsa-miR-106b hsa-miR-372 hsa-miR-520b hsa-miR-17-5p | |
| hsa-miR-520g hsa-miR-519c hsa-miR-519d hsa-miR-20b | |
| hsa-miR-519a hsa-miR-520c hsa-miR-526b* hsa-miR-519e | |
| GGTGCTA | hsa-miR-29c hsa-miR-29b hsa-miR-29a |
| GTGCAAT | hsa-miR-92b hsa-miR-367 hsa-miR-92 hsa-miR-363 hsa-miR-32 |
| hsa-miR-25 | |
| GTGCCTT | hsa-miR-506 hsa-miR-124a |
| GTTTACA | hsa-miR-30a-5p hsa-miR-30b hsa-miR-30d hsa-miR-30e-5p hsa-miR-30c |
| TACTGTA | hsa-miR-101 hsa-miR-199a* hsa-miR-144 |
| TGCAATA | hsa-miR-92b hsa-miR-92 hsa-miR-32 |
| TGCCTTA | hsa-miR-506 hsa-miR-124a |
| TGTTTAC | hsa-miR-30a-5p hsa-miR-30b hsa-miR-30d hsa-miR-30e-5p |
| hsa-miR-30c | |
| TTATATT | hsa-miR-410 |
| TTGTATA1 | hsa-miR-381 |
| TTTGCAC | hsa-miR-19b hsa-miR-19a |
(1) also matches CFIm binding site
7-mers identified by all methods matching other known regulatory elements
| AATAAAC | |
| AATAAAG | |
| ATAAAAG | |
| ATAAAGG | |
| ATAAAGT | |
| ATAAATG | |
| ATTAAAG | PolyA |
| CAATAAA | |
| CCAATAA | |
| CTAATAA | |
| GAATAAA | |
| GCAATAA | |
| TCAATAA | |
| ATTTAAG | |
| ATTTATA | ARE |
| TATTTAT | |
| GTAAATA | |
| GTACATA | PUF |
| TGTAAAT | |
| TGTACAT | |
| TGTATAT | |
| TTGTAAA | |
| ATTGTAA | |
| TTTGTAA | PUF, CFIm |
| TTTGTAT | |
| TTGTATT | CFIm |
| TTTTGTA | CPE, CFIm |
| TTTTATA | |
| TTTTGTT | |
| TTTATAA | CPE |
| TTTATAT | |
| TTTTTAT | |
| ATATTTT | |
| CTATTTT | |
| GTATTTT | CstF |
| TATTTTG | |
| TATTTTT | |
| TTTTTAA | |
Comparison between the results of our computational approach and seed regions of putative miRNAs listed in miRNAMap [45].
| Method | 7-mers identified | 7-mers corresponding to putative miRNAs | expected by chance | number of putative miRNAs | P-value |
| 465 | 76 | 26.7 | 101 | 6.7 · 10-17 | |
| 214 | 46 | 12.3 | 79 | 3.9 · 10-15 | |
| 113 | 23 | 6.48 | 42 | 8.8 · 10-8 | |
| 59 | 15 | 3.38 | 31 | 7.8 · 10-7 | |
| 601 | 98 | 34.4 | 135 | 1.8 · 10-21 |
Same as Table 2 for putative miRNAs. The total number of 7-mers which are seed regions of putative miRNA is 939.
Comparison between the results of our computational approach and seed regions of known and putative miRNAs
| Method | 7-mers identified | 7-mers corresponding to known or putative miRNAs | expected by chance | number of known or putative miRNAs | P-value |
| 465 | 114 | 50.3 | 217 | 1.5 · 10-17 | |
| 214 | 62 | 23.1 | 180 | 1.6 · 10-13 | |
| 113 | 30 | 12.2 | 96 | 2.2 · 10-6 | |
| 59 | 19 | 6.38 | 77 | 8.0 · 10-6 | |
| 601 | 149 | 65.0 | 285 | 3.4 · 10-23 |
r Same as Table 2 for known and putative miRNAs together. The total number of 7-mers which are seed regions of known or putative miRNA is 1772.
7-mers identified by all methods and not matching experimentally known regulatory elements or miRNA seed regions
| AAACTTG |
| AATCATG |
| GACCAAA |
| GTTATTT |
| TATATGT |
| TGTGAAT |
| TTGCCTT |