| Literature DB >> 19305498 |
Noëlle Guillon1, Franck Tirode, Valentina Boeva, Andrei Zynovyev, Emmanuel Barillot, Olivier Delattre.
Abstract
The fusion between EWS and ETS family members is a key oncogenic event in Ewing tumors and important EWS-FLI1 target genes have been identified. However, until now, the search for EWS-FLI1 targets has been limited to promoter regions and no genome-wide comprehensive analysis of in vivo EWS-FLI1 binding sites has been undertaken. Using a ChIP-Seq approach to investigate EWS-FLI1-bound DNA sequences in two Ewing cell lines, we show that this chimeric transcription factor preferentially binds two types of sequences including consensus ETS motifs and microsatellite sequences. Most bound sites are found outside promoter regions. Microsatellites containing more than 9 GGAA repeats are very significantly enriched in EWS-FLI1 immunoprecipitates. Moreover, in reporter gene experiments, the transcription activation is highly dependent upon the number of repeats that are included in the construct. Importantly, in vivo EWS-FLI1-bound microsatellites are significantly associated with EWS-FLI1-driven gene activation. Put together, these results point out the likely contribution of microsatellite elements to long-distance transcription regulation and to oncogenesis.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19305498 PMCID: PMC2654724 DOI: 10.1371/journal.pone.0004932
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Number of reads and corresponding mapped sequences per Chip-Seq experiments.
| Reads | SK-N-MC | A673 (1) | A673 (2) | MON (control) |
| Total sequenced | 2,961,880 | 1,888,878 | 3,466,371 | 2,473,927 |
| Total uniquely mapped | 2,577,613 | 1,656,023 | 3,004,601 | 1,982,019 |
Figure 1EWS-FLI1 binds GGAA microsatellites or GGAA-rich sequences.
A. Enrichment of GGAA motifs in EWS-FLI1-bound sequences. Frequencies of each of 4096 possible 6mer nucleotides found for the 246 identified EWS-FLI1 specific regions (black circle) and for regions identified in the control experiment (white circle) are represented along the Y axis whereas frequency of the same 6mers in the genome is represented on the X axis. B. GGAA repeat enrichment is a common feature of Ewing cell lines. Number of sequences found in A673 (grey circle) and SK-N-MC (white circle) for each type of binding site. C. Consensus motif assessed with MEME algorithm (E-value = 4.1×10−46) in regions other than GGAA microsatellites.
Transcription factor consensus sites enrichment in regions other than GGAA microsatellites.
| TF Matrices | Over representation (1) | Z-Score (1) | Number of Matches |
| V$ELK1.02 | 10.4 | 41.82 | 207 |
| V$CETS1P54.01 | 6.83 | 35.63 | 256 |
| V$ETS1.01 | 5.76 | 29.28 | 219 |
| V$ETS2.01 | 4.14 | 26.97 | 306 |
| V$ELK1.01 | 5.62 | 26.63 | 188 |
| V$FLI.01 | 5.86 | 26.39 | 174 |
| V$ELF2.01 | 4.31 | 24.5 | 237 |
(1) Compared to the genomic representation.
Transcription factor modules containing an ETSF binding site in regions other than GGAA microsatellites.
| Modules with V$ETSF | Over representation (1) | Z-Score (1) | Number of Matches |
| V$ETSF-V$ETSF | 5.43 | 41.08 | 468 |
| V$ETSF-V$GREF | 5.22 | 30.55 | 275 |
| V$ETSF-V$HOXF | 2.18 | 17.92 | 504 |
| V$CREB-V$ETSF | 2.74 | 16.68 | 254 |
| V$ETSF-V$NKXH | 2.18 | 14.67 | 338 |
| V$ETSF-V$NFKB | 3.5 | 14.26 | 115 |
| V$AP4R-V$ETSF | 4.98 | 13.32 | 57 |
| V$ETSF-V$NOLF | 4.13 | 13.03 | 73 |
| V$E2FF-V$ETSF | 2.94 | 12.54 | 124 |
| V$ETSF-V$OCT1 | 2.09 | 12.5 | 277 |
| V$ETSF-V$ZBPF | 2.78 | 12.48 | 138 |
| V$ETSF-V$PAX1 | 6.85 | 12.42 | 32 |
| V$ETSF-V$HAND | 2.5 | 12.17 | 166 |
| V$ETSF-V$NR2F | 2.23 | 11.79 | 206 |
| V$ETSF-V$MOKF | 3.22 | 11.72 | 91 |
| V$ETSF-V$SORY | 1.95 | 11.08 | 265 |
| V$ETSF-V$PARF | 2.05 | 10.98 | 227 |
| V$ETSF-V$HEAT | 2.37 | 10.81 | 149 |
| V$ETSF-V$MEF3 | 6.01 | 10.78 | 29 |
| V$ETSF-V$PTF1 | 4.02 | 10.17 | 47 |
| V$BTBF-V$ETSF | 4.52 | 10.03 | 38 |
(1) Compared to the genomic representation.
Figure 2EWS-FLI1 microsatellite length preferences.
A. Ratio of the number of GGAA repeats in EWS-FLI1-bound regions to the number of repeat in 50000 randomly picked regions. B. Ability of EWS-FLI1 to modulate transcription of a reporter gene depending upon the number of GGAA repeats. Firefly relative to Renilla luciferase activity is shown. Control experiments with the empty pGL3-promoter vector were set to 1.
Figure 3Long distance EWS-FLI1 binding on GGAA microsatellites results in significant gene expression activation.
A. Proportion of EWS-FLI1-bound regions, as compared to the proportion of random regions, around EWS-FLI1 regulated genes. The proportion of EWS-FLI1-bound regions as a function of the distance to the transcription start sites of EWS-FLI1-up or -down regulated genes (solid lines) is shown. As a control, a similar function is indicated for 1500 randomly chosen regions (dashed line). B. Gene Set Enrichment Analysis (GSEA) of genes flanking EWS-FLI1-bound microsatellites. The 94 genes flanking the 80 microsatellites>9R regions (upper panel) as well as the 144 genes flanking the non-microsatellites regions (lower panel) were used as gene set. The expression dataset resulted from previously described EWS-FLI1 inhibition experiments of A673 and SK-N-MC Ewing cell lines [37], [40], ranked using the signal-to-noise algorithm. A strong enrichment of genes flanking EWS-FLI1 bound GGAA microsatellites among EWS-FLI1 up-regulated genes is observed (upper panel). C–F. Regions upstream of EWS-FLI1 up-regulated genes are enriched in GGAA-microsatellites. The number of microsatellites with either 3 to 9 GGAA repeats (grey line) or more than 9 repeats (black line) was calculated for each 1 Kb window from 1 Kb to 1 Mb upstream of the transcription start sites. The numbers of GGAA repeats along DNA are shown for (C) 17000 known genes (control distribution), (D) 582 EWS-FLI1-up-regulated genes, (E) 558 EWS-FLI1-down-regulated genes and (F) 561 genes that are expressed in A673 and SK-N-MC cell lines but not regulated by EWS-FLI1. The control distribution shown in C is also indicated on part D, E and F.