| Literature DB >> 15453917 |
Neil R Smalheiser1, Vetle I Torvik.
Abstract
BACKGROUND: MicroRNAs are approximately 17-24 nt. noncoding RNAs found in all eukaryotes that degrade messenger RNAs via RNA interference (if they bind in a perfect or near-perfect complementarity to the target mRNA), or arrest translation (if the binding is imperfect). Several microRNA targets have been identified in lower organisms, but only one mammalian microRNA target has yet been validated experimentally.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15453917 PMCID: PMC523849 DOI: 10.1186/1471-2105-5-139
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1microRNAs and their scrambled counterparts interact differently with the population of human mRNAs. Shown are all exact hits ≥ 10 bases long (not counting G:U matches) produced on human RefSeq mRNAs by the set of nonredundant microRNAs, vs. the average of 10 replications of scrambled control sequences. Shown is the number of hits as a function of exact hit length. Only the longest hit was counted: e.g., for a hit of length 18, the two subsets of length 17 in the same hit position were not counted.
Figure 2Distribution of gapped-BLAST scores in hits made by microRNAs and scrambled counterparts. Without permitting G:U matches in the extension phase, the microRNAs had better average gapped-BLAST scores than scrambled counterparts across all mRNAs in the "10+ set" (153.00 ± 0.03 vs. 150.98 ± 0.01, mean ± s.e.m., p < 0.0001). With permitting G:U matches in the extension phase, the microRNA set showed significantly fewer G:U matches overall relative to scrambled counterparts, even when holding constant the length of the exact hit (2.891 ± 0.004 vs. 2.939 ± 0.001, p < 0.0001).
Figure 3Number of distinct mRNA sequences which received hits from two or more distinct microRNAs, as a function of the minimum distance between hits. Distance of 0 or 1 was excluded because this might be produced by partial overlap of microRNA sequences.
The Outlier mRNA Set (Candidate Target List)
| homeo box B8 (HOXB8) |
| E2F transcription factor 6 (E2F6) |
| transcription factor 20 (AR1) (TCF20) |
| DEAD (Asp-Glu-Ala-Asp) box polypeptide 51 (DDX51) |
| similar to ATP-dependent RNA helicase DDX24 (DEAD-box protein 24) (LOC221311) |
| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog); translocated to, 1 (MLLT1) |
| high mobility group AT-hook 2 (HMGA2) |
| polymerase (DNA directed), theta (POLQ) |
| strand-exchange protein 1 (SEP1) |
| hypothetical protein FLJ12994 – RFX DNA-binding domain |
| similar to LINE-1 reverse transcriptase homolog (LOC285907) |
| similar to hypothetical protein (L1H 3 region) – related to reverse transcriptase |
| similar to putative p150 (LOC282945) – related to reverse transcriptase |
| similar to reverse transcriptase related protein (LOC222252) |
| similar to RTl1 (LOC376283) – related to reverse transcriptase |
| fyn-related kinase (FRK) |
| WNK kinase, lysine deficient 3 (PRKWNK3) |
| protein phosphatase 2, regulatory subunit B (B56), epsilon isoform (PPP2R5E) |
| EphA5 receptor (EPHA5) |
| killer cell lectin-like receptor subfamily A, member 1 (KLRA1) |
| polycystin and REJ (sperm receptor for egg jelly homolog, sea urchin)-like (PKDREJ) |
| integrin, alpha X (antigen CD11C (p150), alpha polypeptide) (ITGAX) |
| inositol 1,4,5-triphosphate receptor, type 1 (ITPR1) |
| hypothetical protein FLJ32810 – RhoGAP domain, SH3 domain |
| hypothetical protein FLJ00058 – G protein gamma 7 |
| Rho GTPase-activating protein (RICS) |
| hypothetical protein FLJ30899 – probable ras GAP |
| similar to ADP-ribosylation factor-like membrane-associated protein (LOC132946) – ARF-like small GTPase domain, Sar1p-like member of the Ras-family |
| Laminin, beta 4 (LAMB4) |
| laminin, gamma 2 (LAMC2) |
| fibronectin 1 (FN1) |
| collagen, type IV, alpha 5 (Alport syndrome) (COL4A5) |
| collagen, type XIX, alpha 1 (COL19A1) |
| similar to Voltage-dependent anion-selective channel protein 1 (VDAC-1) |
| ATPase, Na+/K+ transporting, alpha 2 (+) polypeptide (ATP1A2) |
| complement component 1, q subcomponent, beta polypeptide (C1QB) |
| hypothetical protein FLJ20506 – transmembrane protein |
| MAM domain containing glycosylphosphatidylinositol anchor 1 (MDGA1) – Ig, MAM domains |
| similar to TCAM-1 (LOC284171) |
| myosin heavy chain Myr 8 (MYR8) |
| ankyrin repeat domain 17 (ANKRD17) |
| KIAA1817 protein – intermediate filament, ATPase, PDZ, Band 4.1, FERM domains |
| chromosome 10 open reading frame 39 (C10orf39) – homologous to myosin, plectin |
| oxysterol binding protein 2 (OSBP2) – pleckstrin homology domain |
| KIAA1202 protein – PDZ, ATPase domains |
| hypothetical protein FLJ23529 – homolgous to dynein heavy chain |
| cell cycle progression 2 protein (CPR2) |
| olfactomedin 3 (OLFM3) |
| histidine rich calcium binding protein (HRC) |
| interferon-related developmental regulator 1 (IFRD1) |
| KIAA1301 protein – NEDD4-related E3 ubiquitin ligase NEDL2 |
| KIAA1203 protein – ubiquitin C-terminal hydrolase |
| hydroxyprostaglandin dehydrogenase 15-(NAD) (HPGD) |
| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 (B3GNT1) |
| KIAA1854 protein – leucine rich repeat C-terminal domains |
| testis specific, 14 (TSGA14) |
| chromosome 4 open reading frame 1 (C4orf1)-membrane AND nuclear protein |
| hypothetical protein FLJ33069 |
| hypothetical protein FLJ38464 |
| hypothetical protein LOC285431 |
| hypothetical protein LOC284107 |
| similar to agCP1362 [Anopheles gambiae str. PEST] (LOC344751) |
| KIAA1632 protein |
| similar to hypothetical protein D11Ertd497e (LOC343360) |
| LOC138724 |
| LOC343460 |
| LOC340963 |
| LOC343220 |
| LOC285842 |
| LOC352767 |
| LOC350293 |
Figure 4Individual microRNAs hit multiple targets on the candidate list, more often than expected by chance.