| Literature DB >> 18366608 |
Guohua Wang1, Xin Wang, Yadong Wang, Jack Y Yang, Lang Li, Kenneth P Nephew, Howard J Edenberg, Feng C Zhou, Yunlong Liu.
Abstract
This is a first report, using our MotifModeler informatics program, to simultaneously identify transcription factor (TF) and microRNA (miRNA) binding sites from gene expression microarray data. Based on the assumption that gene expression is controlled by combinatorial effects of transcription factors binding in the 5'-upstream regulatory region and miRNAs binding in the 3'-untranslated region (3'-UTR), we developed a model for (1) predicting the most influential cis-acting elements under a given biological condition, and (2) estimating the effects of those elements on gene expression levels. The regulatory regions, TF and miRNA, which mediate the differential genes expression in fetal alcohol syndrome were unknown; microarray data from alcohol exposure paradigm was used. The model predicted strong inhibitory effects of 5' cis-acting elements and stimulatory effects of 3'-UTR under alcohol treatment. Current predictive model derived a key hypothesis for the first time a novel role of miRNAs in gene expression changes associated with abnormal mouse embryo development after alcohol exposure. This suggests that disturbance of miRNA functions may contribute to the alcohol-induced developmental deficiencies.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18366608 PMCID: PMC2386061 DOI: 10.1186/1471-2164-9-S1-S19
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1TCS and Xm scores for 5'-regulatory region (promoter) and 3'-untranslated region. (A) Histogram of TCS scores of the 2,080 6-bp motifs, and predicted functional levels (Xm) of top 30 motif candidates. (B) Histogram of TCS scores of the 15,870 7-bp motifs, and predicted functional levels (Xm) of top 30 motif candidates.
Transcriptional contribution scores (TCS) and estimated functional levels (Xm) of top 30 selected TFBS and MBS
| TFBS | TCS(×10-6) | MBS | TCS(×10-6) | |||
| CTCCAA | 4.744 | -0.186 | TGGTGTC | 5.073 | 0.576 | |
| GCGTTA | 4.684 | -0.546 | GCTGTGC | 4.936 | 0.550 | |
| CACATA | 4.640 | -0.175 | CTGGTGT | 4.914 | 0.526 | |
| AACTGA | 4.615 | -0.207 | TGACCAG | 4.830 | 0.531 | |
| CCTACC | 4.613 | -0.232 | CTCCCAA | 4.824 | 0.642 | |
| GACACA | 4.608 | -0.171 | GTGTTGC | 4.782 | 0.747 | |
| AACACA | 4.606 | -0.168 | TATGTAG | 4.756 | 0.694 | |
| CACAAC | 4.595 | -0.185 | ACCTGGC | 4.724 | 0.554 | |
| ACAAGT | 4.582 | -0.229 | TGGAGCA | 4.724 | 0.560 | |
| CCACAA | 4.552 | -0.187 | ACAACAG | 4.713 | 0.735 | |
| AAGTTA | 4.548 | -0.207 | GCTGTTT | 4.698 | 0.359 | |
| GCATGC | 4.533 | 0.285 | AAGACAA | 4.692 | 0.460 | |
| ACACAC | 4.530 | -0.074 | TGTCTCG | 4.687 | 0.872 | |
| AGGGGA | 4.527 | -0.134 | GTGCCTC | 4.687 | 0.438 | |
| CAAGGG | 4.525 | -0.146 | TTGAGGT | 4.687 | 0.821 | |
| AAAATA | 4.523 | -0.113 | CTGTGCT | 4.676 | 0.322 | |
| CAGCCC | 4.522 | -0.111 | GCTCCCT | 4.671 | 0.484 | |
| ATCTTG | 4.519 | 0.195 | GGTCCTG | 4.655 | 0.499 | |
| AAGTAA | 4.512 | -0.176 | CAGGACT | 4.655 | 0.421 | |
| ACGCCC | 4.494 | -0.171 | GGGGCCA | 4.655 | 0.442 | |
| CTCCGA | 4.492 | -0.221 | AGGGAAC | 4.645 | 0.759 | |
| TCACAA | 4.491 | -0.158 | GTGTATC | 4.639 | 0.658 | |
| GCCGGC | 4.487 | 0.213 | TTCTAGA | 4.639 | 0.450 | |
| AGGGCA | 4.484 | -0.130 | GTGCTCA | 4.639 | 0.449 | |
| ACGCTA | 4.484 | -0.362 | GGTGTCT | 4.629 | 0.526 | |
| AAGCAC | 4.484 | -0.184 | AGACAAC | 4.613 | 0.673 | |
| AGGACT | 4.479 | -0.152 | GAATTGC | 4.613 | 1.065 | |
| CTGCAG | 4.477 | -0.156 | GGTACTG | 4.613 | 0.694 | |
| CCCATA | 4.473 | -0.238 | CTCTTCC | 4.608 | 0.311 | |
| GGCACA | 4.469 | -0.159 | ACAGCCT | 4.602 | 0.407 | |
Figure 2Permutation analysis. Histograms of TCS scores of (A) 6-bp motifs in the 5'-proximal regulatory region; and (B) 7-bp motifs in the 3'-untranslated regions. TCS scores were calculated based on gene expression levels with randomized orders (red histograms) and original orders (blue histograms).
Correspondence of 27 negative TFBS to the TRANSFAC database
| M00152 | 13.86 | SRF | 3(14/18) | Serum response factor |
| M00255 | 13.62 | GC box | 3(13/14) | GC box elements |
| M00245 | 13.32 | Egr-3 | 2(12/12) | Early growth response gene 3 product |
| M00246 | 12.65 | Egr-2 | 2(12/12) | Egr-2/Krox-20 early growth response gene product |
| M00982 | 12.55 | KROX | 3(12/14) | |
| M00731 | 12.98 | Osf2 | 3(7/8) | Runt-related transcription factor 2 (Runx2) |
| M00154 | 12.15 | STRE | 2(8/8) | Stress-response element |
| M00933 | 12.07 | Sp-1 | 3(8/10) | Stimulating protein 1 |
| M00932 | 10.55 | Sp-1 | 3(8/13) | Stimulating protein 1 |
| M00931 | 8.83 | Sp-1 | 2(6/10) | Stimulating protein 1 |
| M00008 | 8.47 | Sp1 | 2(6/10) | Stimulating protein 1 |
| M00456 | 11.26 | FAC1 | 3(10/14) | Fetal Alz-50 clone 1 |
| M00360 | 10.29 | Pax-3 | 3(9/13) | Pax-3 binding sites |
| M00134 | 10.19 | HNF-4 | 3(13/19) | Hepatic nuclear factor 4 |
| M00411 | 9.68 | HNF4α1 | 2(12/15) | Hepatocyte nuclear factor 4 |
| M00316 | 10.04 | Hogness BOX | 3(15/30) | Imperfect Hogness/Goldberg Box |
| M00467 | 9.83 | Roaz | 2(8/14) | Rat Olf-1/EBF-associated zinc finger protein |
| M00977 | 9.78 | EBF | 2(7/11) | Early B-cell factor |
| M00119 | 9.28 | Max | 2(12/14) | Max |
| M00118 | 9.20 | c-Myc:Max | 2(12/14) | c-Myc:Max heterodimer |
| M00953 | 9.13 | AR | 3(13/27) | High affinity binding sites for androgen receptor |
| M00332 | 8.78 | Whn | 2(6/11) | Winged-helix factor nude |
| M00330 | 8.64 | Major T-antigen | 2(12/19) | Major T-antigen binding sites |
| M00763 | 8.37 | PPAR | 2(9/13) | Peroxisome proliferator-activated receptor |
| M00778 | 8.33 | AhR | 2(6/11) | Aryl hydrocarbon / dioxin receptor |
| M00769 | 8.12 | AML | 2(7/15) | Leukemia |
| M00256 | 8.11 | NRSF | 2(12/21) | Neuron-restrictive silencer factor |
Correspondences of 3 positive TFBS to the TRANSFAC database
| M00078 | 9.49 | 2 (11/16) | Ectopic viral integration site 1 encoded factor | |
| M00034 | 9.15 | 2 (12/20) | Tumor suppressor p53 | |
| M00652 | 7.53 | 1 (6/10) | Nuclear respiratory factor 1 |
MiRNAs that match perfectly with predicted 7-bp motifs.
| miRNA | 7-bp motif | MBS index | Match position | Mature miRNA sequence* |
| miR-489 | TGGTGTC | 1 | 4-10 | aat |
| miR-666† | GCTGTGC | 2 | 6-12 | agcgg |
| miR-292-3p† | ACCTGGC | 8 | 8-14 | aagtgcc |
| miR-196b† | ACAACAG | 10 | 12-18 | taggtagtttc |
| miR-685† | GTGCCTC | 14 | 15-21 | tcaatggctgaggt |
| miR-195 | CTGTGCT | 16 | 5-11 | tagc |
| miR-330† | CTGTGCT | 16 | 5-11 | gcaa |
| miR-424 | GAATTGC | 27 | 6-12 | cagca |
| miR-219 | GAATTGC | 27 | 14-20 | tgattgtccaaac |
| miR-488*† | ACAGCCT | 30 | 6-12 | ttgaa |
MiRNAs that match with two predicted 7-bp motifs (one G-U pair allowed)
| miRNA | 7-bp motif | MBS index | Match position | Mature miRNA sequence* |
| miR-195 | GTGTTGC | 6 | 3-9 | ta |
| CTGTGCT | 16 | 5-11 | tagc | |
| miR-196b | ACAACAG | 10 | 12-18 | taggtagtttc |
| AGGGAAC | 21 | 7-13 | taggta | |
| miR-330† | CTGTGCT | 16 | 5-11 | gcaa |
| GGTCCTG | 18 | 9-15 | gcaaagca | |
| miR-666† | GCTGTGC | 2 | 6-12 | agcgg |
| CTGTGCT | 16 | 5-11 | agcg | |
| miR-710† | CAGGACT | 19 | 4-10 | cca |
| CTCTTCC | 29 | 11-17 | ccaagtcttg |