| Literature DB >> 26271925 |
Vuthy Ea1, Tom Sexton2, Thierry Gostan3, Laurie Herviou4, Marie-Odile Baudement5, Yunzhe Zhang6, Soizik Berlivet7, Marie-Noëlle Le Lay-Taha8, Guy Cathala9,10, Annick Lesne11,12,13, Jean-Marc Victor14,15,16, Yuhong Fan17, Giacomo Cavalli18,19, Thierry Forné20,21.
Abstract
BACKGROUND: In higher eukaryotes, the genome is partitioned into large "Topologically Associating Domains" (TADs) in which the chromatin displays favoured long-range contacts. While a crumpled/fractal globule organization has received experimental supports at higher-order levels, the organization principles that govern chromatin dynamics within these TADs remain unclear. Using simple polymer models, we previously showed that, in mouse liver cells, gene-rich domains tend to adopt a statistical helix shape when no significant locus-specific interaction takes place.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26271925 PMCID: PMC4536789 DOI: 10.1186/s12864-015-1786-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Fitting globule models to contact frequencies quantified in mESC. Experimental 3C-qPCR data obtained for wt mESC in gene-rich TADs (Fig. 2) have been displayed into a Log-Log plot and globule models were fitted to the following power-law: X(s) = k*s (adapted from Eq. 6 and Eq. 9 from ref. [20]), where X(s) is the cross-linking frequency, s (in kb) is the site separation along the genome, K is representing the efficiency of cross-linking and the exponent α is the slope associated to this power-law. Best-fits (using the nls object of the R software) show that the slope associated to our experimental data (red line) is approximately α = −1/2 (−0.52) with a correlation coefficient R = 0.47, while correlation coefficients associated to the equilibrium (α = −3/2) (black line) or crumpled globules (α = −1) (green line) are much lower
Fig. 2Fitting the statistical helix model to contact frequencies quantified in mESC. Quantitative 3C data were obtained from wild-type mouse ESC in five gene-rich TADs (a), two gene-poor TADs (b) and one gene-desert TAD (c) (see genomic maps in Additional file 1). For each type of TAD, data obtained from all the anchor primers used for each locus (Additional file 7) were compiled in a single graph (each locus is represented by a specific color). Error bars are standard error of the mean of three independent quantitative 3C assays each quantified at least in triplicate. Dashed lines delimit supranucleosomal domains that encompass separation distances where contact frequencies are alternatively lower and higher (see Methods). The graphs show the best fit analyses obtained with the unconstrained chromatin model [eqs. 1 and 2] (black curves) or the statistical helix model [eqs. 1 and 3] (red curves). Correlation coefficients (R 2) are indicated on the graphs. Best fit parameters, and the genomic distance contained within one statistical helix turn (Sh in kb), are given in the upper part of Table 1. For each supranucleosomal domains, the mean contact frequencies and the number (n) of experimental points are indicated on the graphs. p-values (Mann–Whitney U-test) account for the significance of the differences observed between the experimental means of two adjacent domains (double asterisks indicate a p-value < 0.05 and > 0.01 and triple asterisks a p-value < 0.01)
Fitting the statistical helix model to the relative contact frequencies observed in wild-type (upper part, rows 2–4) and triple KO (lower part, rows 5–7) mouse ES cells (mESCs)
| WT vs H1 TKO mESC |
|
| < | < |
| |
|---|---|---|---|---|---|---|
| 1 | mouse liver gene-rich | 890 ± 70 | 2.7 ± 0.1 | 287 ± 5 | 160 ± 9 | 97 ± 1 |
| 2 | WT mESC gene-rich (Fig. | 1,070 ± 80 | 2.7 ± 0.1 |
|
|
|
| 3 | WT mESC gene-poor (Fig. | 1,880 ± 360 |
| 262 ± 18 | 213 ± 31 | 87 ± 5 |
| 4 | WT mESC gene-desert (Fig. | 1,380 ± 370 |
| 208 ± 44 | 264 ± 77 |
|
| 5 | H1TKO gene-rich (Fig. | 1,810 ± 140 |
| 268 ± 8 | 230 ± 14 | 83 ± 2 |
| 6 | H1TKO gene-poor (Fig. | 2,270 ± 430 | 3.7 ± 0.3 | 269 ± 19 | 224 ± 30 | 83 ± 5 |
| 7 | H1TKO gene-desert (Fig. | 2,620 ± 840 | 3.9 ± 0.4 | 264 ± 105 | 380 ± 162 | 86 ± 28 |
Parameters obtained for mouse liver cells [4] are indicated for comparisons (row 1). Remarkable values are indicated in bold (see text)
Fig. 3Fitting the statistical helix model to contact frequencies quantified in mouse H1 TKO ESC. Quantitative 3C data were obtained from mouse ESC that are Triple Knock-Out for Histone H1 genes (H1 TKO), for five gene-rich TADs (a), two gene-poor TADs (b) and one gene-desert TAD (c). The graphs show the best-fit analyses obtained with the unconstrained chromatin model [eqs. 1 and 2] (black curves) or the statistical helix model [eqs. 1 and 3] (red curves). The data (see Additional file 8) were analyzed and are depicted as described in the legend of Fig. 2. Best-fit parameters, and the genomic distance contained within one statistical helix turn (Sh in kb), are given in the lower part of Table 1
Fig. 4Epigenetic landscapes and chromatin dynamics of the Drosophila chromosome 2 L. “Virtual 3C”, obtained from Hi-C experiments in the Drosophila, were classified according to the four previously defined epigenetic domains (D1 to D4) [6]: D1 (“red chromatin”) corresponds to domains with “active” epigenetic marks, D2 (“black chromatin”) displays no specific epigenetic modifications, D3 (“blue chromatin”) is PcG associated chromatin and D4 (“green chromatin”) is HP1/heterochromatin. The unconstrained chromatin model [eqs.1 and 2] was then fitted and the three best-fit parameters (K = crosslinking efficiency; L = compaction; S = flexibility) were recovered from each “virtual 3C”. Statistical analyses of best-fit parameters were performed separately according to the epigenetic domains. Box-plots show the results obtained for each type of domains on the chromosome 2 L. Stars indicate statistically significant differences: single asterisk indicates a p-value < 0.05 and > 0.01, a double asterisk a p-value < 0.01 and > 0.001 and a triple asterisk a p-value < 0.001 (all p-values are given in Additional file 5). The number of best-fits (n) performed in each domain is as follows: D1: n = 990; D2: n = 2481; D3: n = 624; D4: n = 239). The results obtained from the other Drosophila chromosomes are given in Additional file 3
Fitting the unconstrained model on Drosophila Hi-C dataa
| Chromosome | Parameters | Active D1 | Black D2 | PcG D3 | Centromeric D4 |
|---|---|---|---|---|---|
| K *109 | 0.852 | 1.487 | 1.340 | 2.405 | |
| Chr2L | S (kb) | 4.150 | 4.918 | 4.849 | 5.296 |
| L (nm/kb) | 10.81 | 10.56 | 10.6 | 10,32 | |
| K *109 | 0.847 | 1.471 | 1.122 | 2.699 | |
| Chr2R | S (kb) | 4.147 | 4.925 | 4.552 | 5.292 |
| L (nm/kb) | 10.76 | 10.57 | 10.71 | 9.99 | |
| K *109 | 0.808 | 1.383 | 1.232 | 2.684 | |
| Chr3L | S (kb) | 4.090 | 4.881 | 4.873 | 5.382 |
| L (nm/kb) | 10.80 | 10.55 | 10.62 | 10.06 | |
| K *109 | 0.857 | 1.548 | 1.324 | ||
| Chr3R | S (kb) | 4.132 | 4.95 | 4.773 | |
| L (nm/kb) | 10.80 | 10.59 | 10.64 | ||
| K *109 | 1.303 | ||||
| Chr4 | S (kb) | 4.55 | |||
| L (nm/kb) | 10.67 |
Median values of 3 best-fit parameters obtained on the autosomal chromosomes in each type of TADs
aHi-C samples were prepared from unsexed flies, and therefore the X chromosome was not analysed since, in males, this chromosome undergoes dosage compensation that largely affects its epigenetic features