| Literature DB >> 19846595 |
Federico Bolognani1, Tania Contente-Cuomo, Nora I Perrone-Bizzozero.
Abstract
HuD is a neuronal ELAV-like RNA-binding protein (RBP) involved in nervous system development, regeneration, and learning and memory. This protein stabilizes mRNAs by binding to AU-rich instability elements (AREs) in their 3' unstranslated regions (3' UTR). To isolate its in vivo targets, messenger ribonucleoprotein (mRNP) complexes containing HuD were first immunoprecipitated from brain extracts and directly bound mRNAs identified by subsequent GST-HuD pull downs and microarray assays. Using the 3' UTR sequences of the most enriched targets and the known sequence restrictions of the HuD ARE-binding site, we discovered three novel recognition motifs. Motifs 2 and 3 are U-rich whereas motif 1 is C-rich. In vitro binding assays indicated that HuD binds motif 3 with the highest affinity, followed by motifs 2 and 1, with less affinity. These motifs were found to be over-represented in brain mRNAs that are upregulated in HuD overexpressor mice, supporting the biological function of these sequences. Gene ontology analyses revealed that HuD targets are enriched in signaling pathways involved in neuronal differentiation and that many of these mRNAs encode other RBPs, translation factors and actin-binding proteins. These findings provide further insights into the post-transcriptional mechanisms by which HuD promotes neural development and synaptic plasticity.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19846595 PMCID: PMC2800223 DOI: 10.1093/nar/gkp863
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 3.Identification of the HuD-binding motifs. (A) Experimental strategy implemented to identify HuD-binding motifs. (B) WebLogo representations of the three identified HuD-binding sequences. (C) Venn diagram showing the distribution of the three new HuD-binding motifs in the HuD targets. (D) Frequency of motif 1 in the 5′ UTR, coding region (CR), and 3′ UTR of mouse genome (blue bars), Affymetrix 430 2.0 chip genes (red bars), forebrain transcripts (yellow bars) and HuD targets (green bars). (E) Percentage of presence of motif 2 in the same groups as Panel D. (F) Same analysis for motif 3. ***P < 0.0001 of HuD targets versus each of the three reference datasets. (a) P < 0.05 HuD targets versus genome and forebrain, (b) P < 0.05 versus genome, P < 0.01 versus chip and P < 0.0001 versus forebrain, (c) P < 0.05 versus chip and versus forebrain and (d) P < 0.05 versus forebrain.
Figure 1.Identification of HuD target mRNAs. Experimental strategy implemented for analyzing HuD target mRNAs at genomic scale. The flow chart shows the two sequential steps used for the identification of HuD targets.
Partial list of HuD targets
| Probe set ID | Fold enrichment | Gene symbol | Description | Ensembl |
|---|---|---|---|---|
| 1438554_x_at | 19.43 | Eif4h | Eukaryotic translation initiation factor 4H | ENSMUSG00000040731 |
| 1452833_at | 19.42 | Rapgef2 | Rap guanine nucleotide exchange factor (GEF) 2 | ENSMUSG00000062232 |
| 1424358_at | 19.16 | Ube2e2 | Ubiquitin-conjugating enzyme E2E 2 (UBC4/5 homolog, yeast) | ENSMUSG00000058317 |
| 1434277_a_at | 19.08 | Ypel2 | Yippee-like 2 (Drosophila) | ENSMUSG00000018427 |
| 1416313_at | 18.85 | Mllt11 | Myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog, Drosophila); translocated to, 11 | ENSMUSG00000053192 |
| 1434310_at | 18.78 | Bmpr2 | Bone morphogenic protein receptor, type II (serine/threonine kinase) | – |
| 1429839_a_at | 18.63 | Yaf2 | YY1 associated factor 2 | ENSMUSG00000022634 |
| 1435521_at | 18.5 | Msi2 | Musashi homolog 2 (Drosophila) | ENSMUSG00000069769 |
| 1457248_x_at | 18.47 | Hsd17b7 | Hydroxysteroid (17-beta) dehydrogenase 7 | ENSMUSG00000026675 |
| 1435807_at | 18.46 | Cdc42 | Cell division cycle 42 homolog ( | ENSMUSG00000006699 |
| 1416767_a_at | 18.45 | RIKEN cDNA 1110003E01 gene | ENSMUSG00000037822 | |
| 1433519_at | 18.43 | Nucks1 | Nuclear casein kinase and cyclin-dependent kinase substrate 1 | – |
| 1450021_at | 18.38 | Ubqln2 | Ubiquilin 2 | ENSMUSG00000050148 |
| 1437457_a_at | 18.38 | Mtpn | Myotrophin | ENSMUSG00000029840 |
| 1416082_at | 18.32 | Rab1 | RAB1, member RAS oncogene family | ENSMUSG00000020149 |
| 1448100_at | 18.31 | RIKEN cDNA 4833439L19 gene | ENSMUSG00000025871 | |
| 1447776_x_at | 18.29 | Rab6 | RAB6, member RAS oncogene family | ENSMUSG00000030704 |
| 1432198_at | 18.29 | RIKEN cDNA A230083H22 gene | ENSMUSG00000039126 | |
| 1448504_a_at | 18.28 | Cbx3 | Chromobox homolog 3 (Drosophila HP1 gamma) | ENSMUSG00000029836 ENSMUSG00000059647 |
| 1415971_at | 18.23 | Marcks | Myristoylated alanine rich protein kinase C substrate | ENSMUSG00000069662 |
| 1426776_at | 18.21 | Wasl | Wiskott–Aldrich syndrome-like (human) | ENSMUSG00000029684 |
| 1433540_x_at | 18.16 | Ppp1cb | Protein phosphatase 1, catalytic subunit, beta isoform | ENSMUSG00000014956 |
| 1426401_at | 18.15 | Ppp3ca | Protein phosphatase 3, catalytic subunit, alpha isoform | ENSMUSG00000028161 |
| 1419112_at | 18.12 | Nlk | Nemo like kinase | ENSMUSG00000017376 |
| 1423895_a_at | 18.11 | Cugbp2 | CUG triplet repeat, RNA-binding protein 2 | ENSMUSG00000002107 |
| 1423220_at | 18.07 | Eif4e | Eukaryotic translation initiation factor 4E | ENSMUSG00000028156 |
| 1458351_s_at | 18.06 | Klhl2 | Kelch-like 2, Mayven (Drosophila) | ENSMUSG00000031605 |
| 1440270_at | 18.05 | Fgf12 | Fibroblast growth factor 12 | ENSMUSG00000022523 |
| 1434082_at | 18.04 | Pctk2 | Serine/threonine-protein kinase PCTAIRE-2 | ENSMUSG00000020015 |
| 1434232_a_at | 18.04 | RIKEN cDNA 2610030H06 gene | ENSMUSG00000073131 | |
| 1428473_at | 18.03 | Ppp3cb | Protein phosphatase 3, catalytic subunit, beta isoform | ENSMUSG00000021816 |
| 1448184_at | 18.03 | Fkbp1a | FK506-binding protein 1a | ENSMUSG00000032966 |
| 1447669_s_at | 18.02 | Gng4 | Guanine nucleotide binding protein (G protein), gamma 4 subunit | ENSMUSG00000021303 |
| 1428970_at | 18.01 | Nat13 | N-acetyltransferase 13 | ENSMUSG00000022698 |
| 1437585_x_at | 17.99 | Zfp161 | Zinc finger protein 161 | ENSMUSG00000049672 |
| 1419246_s_at | 17.99 | Rab14 | RAB14, member RAS oncogene family | ENSMUSG00000026878 |
| 1438007_at | 17.94 | Expressed sequence AI851790 | ENSMUSG00000044071 | |
| 1424852_at | 17.87 | Mef2c | Myocyte enhancer factor 2C | – |
| 1436452_x_at | 17.82 | Tmed2 | Transmembrane emp24 domain trafficking protein 2 | ENSMUSG00000029390 ENSMUSG00000074460 |
| 1428416_at | 17.79 | RIKEN cDNA 3110050N22 gene | ENSMUSG00000043542 | |
| 1417377_at | 17.78 | Cadm1 | Cell adhesion molecule 1 | ENSMUSG00000032076 |
| 1433521_at | 17.75 | Ankrd13c | Ankyrin repeat domain 13c | ENSMUSG00000039988 |
| 1433751_at | 17.73 | Slc39a10 | Solute carrier family 39 (zinc transporter), member 10 | ENSMUSG00000025986 |
| 1424215_at | 17.72 | Fundc1 | FUN14 domain containing 1 | ENSMUSG00000025040 |
| 1434620_s_at | 17.72 | RIKEN cDNA 2610024E20 gene | ENSMUSG00000036501 | |
| 1421323_a_at | 17.71 | G3bp2 | GTPase activating protein (SH3 domain) binding protein 2 | ENSMUSG00000029405 |
| 1423309_at | 17.7 | Tgoln1 | ENSMUSG00000056429 | |
| 1426864_a_at | 17.7 | Ncam1 | Neural cell adhesion molecule 1 | ENSMUSG00000039542 |
| 1424683_at | 17.67 | RIKEN cDNA 1810015C04 gene | ENSMUSG00000022270 | |
| 1418067_at | 17.65 | Cfl2 | Cofilin 2, muscle | ENSMUSG00000062929 |
| 1437801_at | 17.62 | Morf4l1 | Mortality factor 4 like 1 | ENSMUSG00000062270 |
| 1428537_at | 17.62 | Csnk1a1 | Casein kinase 1, alpha 1 | ENSMUSG00000024576 |
| 1419971_s_at | 17.61 | Slc35a5 | Solute carrier family 35, member A5 | ENSMUSG00000022664 |
| 1417410_s_at | 17.61 | Prkci | Protein kinase C, iota | ENSMUSG00000037643 |
| 1417411_at | 17.61 | Nap1l5 | Nucleosome assembly protein 1-like 5 | ENSMUSG00000029804 |
| 1418436_at | 17.6 | Stx7 | Syntaxin 7 | ENSMUSG00000019998 |
| 1422748_at | 17.58 | Zeb2 | Zinc finger E-box binding homeobox 2 | ENSMUSG00000026872 |
| 1434820_s_at | 17.57 | Pkig | Protein kinase inhibitor, gamma | ENSMUSG00000035268 |
| 1415911_at | 17.57 | Impact | Imprinted and ancient | ENSMUSG00000024423 |
| 1454976_at | 17.55 | Sod2 | Superoxide dismutase 2, mitochondrial | ENSMUSG00000006818 |
| 1433986_at | 17.52 | cDNA sequence BC024659 | – | |
| 1437016_x_at | 17.46 | Rap2c | RAP2C, member of RAS oncogene family | ENSMUSG00000050029 |
| 1434106_at | 17.46 | Epm2aip1 | EPM2A (laforin) interacting protein 1 | ENSMUSG00000046785 |
| 1416008_at | 17.45 | Satb1 | Special AT-rich sequence binding protein 1 | ENSMUSG00000023927 |
| 1456177_x_at | 17.44 | Zfp706 | Zinc finger protein 706 | ENSMUSG00000062397 |
| 1416501_at | 17.44 | Pdpk1 | 3-phosphoinositide dependent protein kinase-1 | ENSMUSG00000024122 |
| 1423684_at | 17.44 | Hnrpk | Heterogeneous nuclear ribonucleoprotein K | ENSMUSG00000021546 |
| 1426924_at | 17.4 | Rc3h2 | Ring finger and CCCH-type zinc finger domains 2 | ENSMUSG00000075376 |
| 1437288_at | 17.38 | Impad1 | Inositol monophosphatase domain containing 1 | ENSMUSG00000066324 |
| 1429579_at | 17.37 | RIKEN cDNA 6330407I18 gene | – |
The top 70 most enriched mRNAs of the 700 putative HuD targets. The complete dataset is shown in Supplementary Table S1.
Predicted location of HuD-binding motifs in selected known and new HuD targets
| mRNA | Motif 1 | Motif 2 | Motif 3 |
|---|---|---|---|
| Known HuD targets | |||
| GAP-43 | 128 332 | – | – |
| MARCKS | 518 522 526 530 534 538 542 546 | 151 158 275 276 277 278 279 | 111 559 560 |
| Neuroserpin | 1801 1803 1805 1807 1809 1811 1813 1815 1817 1819 1821 1823 1825 1827 1829 1831 1833 1835 1837 1839 1841 1843 | 2073 | 615 |
| N-myc | – | – | 244 245 369 |
| p21 waf1 | 133 1211 | – | – |
| tau | – | – | – |
| VEGF | 939 1261 | – | 397 |
| p27 5UTR | 225 233 306 | 436 | – |
| p27 3UTR | – | 878 879 880 881 882 883 884 885 886 887 888 889 | 475 597 891 |
| New HuD targets | |||
| eIF4e | – | 563 657 | – |
| HuR | – | 1190 1199 | – |
| HuB | – | 456 | – |
| Fkbp1a | 976 | – | – |
| PPP1cb | 985 | – | – |
| PPP3ca | – | 15 | – |
| CaMkinIIα | 1475 1483 1485 1487 1491 1497 1512 3032 3034 3036 3038 3040 3042 3044 3046 3048 3050 3052 3054 3056 3058 3060 3062 3064 3066 3068 3070 | 3294 3298 3302 3306 3312 | – |
| Homer1 | – | – | 754 |
| Rab1 | 891 1247 1249 | – | – |
| N-CAM1 | 75 77 79 81 83 | 744 1108 1109 1317 1334 1351 1352 1353 1354 1355 1356 1357 1358 1359 2833 2837 | 2705 |
| Musashi 2 | 274 276 280 | 253 416 421 422 426 456 461 1558 1563 1570 1575 1764 1765 2258 2263 2760 3632 3638 3889 4248 4253 | 1141 1580 1767 1768 1871 1872 2955 2956 3642 4082 4083 |
The predicted location of each motif in the 3′ UTR of selected HuD targets. As described in the ‘Materials and Methods’ section, Pearl scripts were used to search for the location of each motif, allowing one mismatch in each consensus sequence.
Characteristics of the datasets used in the present studies
| 5′ UTR | CR | 3′ UTR | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Genome | Chip | Forebrain | HuD targets | Genome | Chip | Forebrain | HuD targets | Genome | Chip | Forebrain | HuD Targets | |
| Number of sequences | 12 397 | 11 036 | 8200 | 421 | 23 679 | 19 029 | 8688 | 608 | 17 918 | 16 359 | 8363 | 572 |
| Length | 380 | 367 | 274 | 359 | 1469 | 1605 | 1739 | 1593 | 1121 | 1142 | 1325 | 1,909 |
| A | 20.48 | 19.92 | 17.97 | 18.3 | 25.52 | 25.5 | 25.97 | 27.63 | 26.6 | 26.45 | 26.39 | 27.86 |
| U | 20.12 | 19.7 | 17.93 | 18.85 | 22.8 | 21.95 | 21.66 | 22.68 | 28.76 | 28.74 | 29.28 | 32.95 |
| C | 29.23 | 29.67 | 30.77 | 30.9 | 25.73 | 25.97 | 25.52 | 23.92 | 22.55 | 22.65 | 22.27 | 19.11 |
| G | 30.18 | 30.71 | 33.34 | 31.95 | 25.95 | 26.58 | 26.85 | 25.77 | 22.08 | 22.16 | 22.06 | 20.08 |
3′ UTR sequences were retrieved from Ensembl BioMart and datasets were made non-redundant by Gene ID. The datasets were analyzed by scripts written in Perl v5.8.8 and using BioPerl modules.
Figure 2.ARE frequency and U content in HuD target genes. (A) The presence of the different AREs was analyzed in 3′ UTR sequences of the entire mouse genome (white bars), the genes present in the Affymetrix 430 2.0 chip (light gray bars), a set of forebrain expressed transcripts (dark gray bars) and in the HuD target dataset (black bars). Inset shows the same data normalized to the frequency in the genome. (B) Analysis of the percentage of Us in 20 nt long stretches in the 3′ UTRs of the mouse genome, Affymetrix 430 2.0 chip, and HuD targets. Inset shows the difference between HuD targets and mouse genome. (C) Percentage of genes with segments of 70% Us as a function of stretch length. Inset shows the difference between HuD targets and mouse genome. (D) Frequency of genes with 20 nt long fragments in the 3′ UTR containing at least 70% Us in all the datasets described in panel A. ***P < 0.0001 after a Chi-square test.
Figure 4.Binding affinities of recombinant HuD for the three motifs. Panels show binding curves of motif 1 (A), motif 2 (B) and motif 3 (C). In vitro binding assays were performed with increasing amounts of radiolabeled oligoribonucleotides corresponding to the most likely sequence in each motif (CCCUCCCUCUCUC for motif 1, UUUUGUUUUGUUU for motif 2 and UUUUUUUUUUAAA for motif 3) and fixed amounts GST-HuD as indicated in ‘Materials and Methods’ section. Non-specific binding was measured using a 100 molar excess of cold oligoribonucleotides. Binding curves of total (solid line) and non-specific (dashed lines) used to calculate the Kd for each motifs.
Figure 5.Increased frequency of HuD-binding motifs in mRNAs upregulated in the brains of HuD overexpressor mice. (A) Frequency of HuD-binding motif 1 in the 5′ UTR, CR and 3′ UTR of mouse genome (white bars), Affymetrix 430 2.0 chip genes (light grey bars), forebrain transcripts (dark grey bars) and mRNAs upregulated in HuD-Tg mice (black bars). (B) Same analysis for HuD-binding motif 2. (C) Frequency of motif 3 in the same gene datasets. ***P < 0.0001 HuD-Tg mice versus each of the three reference sets (genome, chip and forebrain), (a) P < 0.02 versus genome and chip, (b) P < 0.05 versus genome and chip and (c) P < 0.05 versus all three reference sets.
Figure 6.Gene Ontology analysis of HuD targets. The graph shows the number of HuD target genes observed in each gene ontology category (black bars) and the expected number in the genome (white bars). All the categories presented in the graph have P-values < 0.01.