| Literature DB >> 24771338 |
Stefan Enroth1, Robin Andersson1, Madhusudhan Bysani2, Ola Wallerman2, Stefan Termén3, Brian B Tuch4, Francisco M De La Vega4, Carl-Henrik Heldin3, Aristidis Moustakas5, Jan Komorowski6, Claes Wadelius7.
Abstract
Nucleosomes play important roles in a cell beyond their basal functionality in chromatin compaction. Their placement affects all steps in transcriptional regulation, from transcription factor (TF) binding to messenger ribonucleic acid (mRNA) synthesis. Careful profiling of their locations and dynamics in response to stimuli is important to further our understanding of transcriptional regulation by the state of chromatin. We measured nucleosome occupancy in human hepatic cells before and after treatment with transforming growth factor beta 1 (TGFβ1), using massively parallel sequencing. With a newly developed method, SuMMIt, for precise positioning of nucleosomes we inferred dynamics of the nucleosomal landscape. Distinct nucleosome positioning has previously been described at transcription start site and flanking TF binding sites. We found that the average pattern is present at very few sites and, in case of TF binding, the double peak surrounding the sites is just an artifact of averaging over many loci. We systematically searched for depleted nucleosomes in stimulated cells compared to unstimulated cells and identified 24 318 loci. Depending on genomic annotation, 44-78% of them were over-represented in binding motifs for TFs. Changes in binding affinity were verified for HNF4α by qPCR. Strikingly many of these loci were associated with expression changes, as measured by RNA sequencing.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24771338 PMCID: PMC4066760 DOI: 10.1093/nar/gku326
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 8.(A) Coverage plots of RNA-seq data over the SMAD7 gene. The TGFβ stimulated (TGFB+) data have been normalized to the same sequencing depth as the unstimulated (TGFB−). (B) Scatter plot of RPKM-counts over genes for TGFB+ and TGFB−. The genes selected for Taqman validation are marked with black triangles. (C) Taqman validation results for 25 selected genes. The RNA-seq is represented as log2 of sequence depth normalized fold change between TGFB+ and TGFB−, and the Taqman values are also log2 of fold change between TGFB+ and TGFB−.
Figure 1.(A) Log-odds values around the TSS of gene HEY2 for nucleosome mid-positions in HepG2 unstimulated (labeled TGFB−) cells (Nucl Interior log-odds TGFβ−) and TGFβ stimulated (labeled TGFB+) cells (Nucl Interior log-odds TGFβ+). For comparison, counts of reads extended to the average fragment length of sequenced DNA, combined signal, are shown (Nucleosome TGFβ− and Nucleosome TGFβ+). Locations of inferred nucleosome interior regions in TGFβ− cells (Nucleosome interiors TGFβ−) and TGFβ+ cells (Nucleosome interiors TGFβ+) as well as locations of inferred nucleosome depletions in TGFβ+ cells (Nucleosome depletion TGFβ+) are depicted in the bottom panels. Data were uploaded as custom tracks to the UCSC Genome Browser where the graphics were produced. (B) and (C) Average signal footprints of log-odds (black lines, left vertical axes), counts of strand-directed fragment-length extended reads (gray lines, right vertical axes) and DNaseI hypersensitivity (blue dashed lines, scaled to fit) around TSSs of the top 5000 high-expressed protein-coding genes (B) and 25 651 JUND binding sites (C) .
Figure 2.Heat maps of log-odds values (left column) and counts of fragment-length extended reads, combined signal, (right column) in 2-kb windows around TSS of highly expressed genes (top 5000). Their average values are also shown (middle column) in 10 K-means clusters (rows) inferred from discretized (0 or 1) log-odds data. The clusters were inferred using the data within the gray-marked regions in the bottom panel, leftmost figure. The middle figure in the bottom panel shows the combined average of all 10 clusters. Black lines (left vertical axes) and gray lines (right vertical axes) in the middle column depict the average of log-odds values and the combined signals, respectively, within each cluster. Yellow colors indicate high values while blue colors indicate low values.
Figure 3.Heat maps of log-odds values (left column) and counts of fragment-length extended reads, combined signal, (right column) in 2-kb windows around 25 651 JunD binding sites. Their average values are also shown (middle column) in 10 K-means clusters (rows) inferred from discretized (0 or 1) log-odds data. The clusters were inferred using the data within the gray-marked regions in the bottom panel, leftmost figure. The middle figure in the bottom panel shows the combined average of all 10 clusters. within the gray-marked regions in the bottom panel. Black lines (left vertical axes) and gray lines (right vertical axes) in the middle column depict the average of log-odds values and the combined signals, respectively, within each cluster. Yellow colors indicate high values while blue colors indicate low values.
Figure 4.Annotations of nucleosomes in HepG2 unstimulated cells. Left pie chart shows the distribution of nucleosomes in exonic, intronic and intergenic regions. For comparison (right pie chart), the genomic sequence coverage of these regions is shown.
Figure 5.(A) Distribution of phased, intermediate and fuzzy nucleosomes in unstimulated cells in exonic, intronic and intergenic regions. (B) The fraction of phased, intermediate and fuzzy nucleosomes in exons of lengths within intervals ranging between 0 and 1000 bp. Length intervals of 20 bp in size were used.
Figure 6.Average signal footprints of log-odds values [(A) and (B)] and counts of reads extended to the average fragment length of sequenced DNA [(C) and (D)] around loci of nucleosome depletion in TGFβ stimulated cells using data from unstimulated [(A) and (C)] and TGFβ stimulated cells [(B) and (D)]. Combined signals refer to the summation of signals from both ends of sequenced fragments (Start and End for log-odds and Sense and Antisense for counts of extended reads).
Figure 7.(A) Distribution of loci with nucleosome depletion in TGFβ stimulated cells according to genomic annotation. (B) qPCR validation in unstimulated (TGFB−) and TGFβ stimulated (TGFB+) cells of HNF4α binding in regions with ejected nucleosomes.
Genomic distribution of inferred loci with nucleosomal depletion in TGFβ stimulated cells and their association with genes and expression changes
| Exonic | Intronic proximal | Intronic distal | Intergenic proximal | Intergenic distal | |
|---|---|---|---|---|---|
| Number of loci | 454 | 5157 | 4290 | 1725 | 12692 |
| Number of associated genes | 437 | 4158 | 2235 | 1811 | |
| Number of loci associated with gene expression change | 68 | 599 | 976 | 242 | |
| Number of associated genes with expression change | 108 | 1737 | 397 | 239 | |
| Number of associated exons | 553 | 16883 | |||
| Number of loci associated with exon expression change | 84 | 1772 | |||
| Number of associated exons with expression change | 99 | 2659 |
Genomic distribution of inferred loci with nucleosomal depletion in TGFβ stimulated cells with associated over-represented TF binding motifs in defined categories according to distance from exons and genes and their association with genes and expression changes
| Exonic | Intronic proximal | Intronic distal | Intergenic proximal | Intergenic distal | |
|---|---|---|---|---|---|
| Number of loci | 199 | 2802 | 2469 | 245 | 9859 |
| Percentage of total loci | 43.8 | 54.3 | 57.6 | 14.2 | 77.7 |
| TFs | SPI1, ELF5, FEV, CTCF, ETS1, Hltf | Hand1::Tcfe2a, SPI1, NFE2L2, SOX10, NFATC2, FEV, Mafb, EBF1, CTCF | MZF1_1–4, PLAG1, MZF1_5–13, SPI1, RUNX1, NFATC2, EBF1, SOX10, FEV, BRCA1 | SPI1, FEV | NFATC2, EBF1, Myf, SPIB, SOX10, PPARG::RXRA, Hand1::Tcfe2a, NR2F1, Gfi, INSM1, SPI1, FEV, MZF1_1–4, Mafb, HNF4A, RREB1, En1, BRCA1 |
| Number of associated genes | 182 | 2503 | 1539 | 279 | |
| Number of loci associated with gene expression change | 45 | 607 | 1173 | 56 | |
| TFs associated with gene expression change | SPI1, Hltf, ELF5, FEV, CTCF, ETS1 | FEV, SPI1, Hand1::Tcfe2a, Mafb, EBF1, NFE2L2, SOX10, NFATC2, CTCF | MZF1_5–13, MZF1_1–4, PLAG1, EBF1, SPI1, FEV, NFATC2, RUNX1, SOX10, BRCA1 | SPI1, FEV | |
| Number of associated genes with expression change | 21 | 290 | 306 | 38 | |
| Number of associated exons | 225 | 9229 | |||
| Number of loci associated with exon expression change | 64 | 1711 | |||
| TFs associated with exon expression change | SPI1, ELF5, ETS1, CTCF, FEV, Hltf | NFE2L2, FEV, SOX10, Hand1::Tcfe2a, SPI1, Mafb, EBF1, CTCF, NFATC2 | |||
| Number of associated exons with expression change | 30 | 1498 |
| Exonic | Overlapping with an exon |
| Intronic proximal | Intronic and no more than 5 kb away from an exon |
| Intergenic proximal | Intergenic and no more than 5 kb away from an exon |
| Intronic distal | Intronic and more than 5 kb away from an exon |
| Intergenic distal | Intergenic and more than 5 kb away from an exon |