| Literature DB >> 28035242 |
Tobias A Knoch1, Malte Wachsmuth2, Nick Kepper3,4, Michael Lesnussa3, Anis Abuseiris3, A M Ali Imam3,5, Petros Kolovos3,5, Jessica Zuin6, Christel E M Kockx7, Rutger W W Brouwer7, Harmen J G van de Werken5, Wilfred F J van IJcken7, Kerstin S Wendt6, Frank G Grosveld5.
Abstract
BACKGROUND: The dynamic three-dimensional chromatin architecture of genomes and its co-evolutionary connection to its function-the storage, expression, and replication of genetic information-is still one of the central issues in biology. Here, we describe the much debated 3D architecture of the human and mouse genomes from the nucleosomal to the megabase pair level by a novel approach combining selective high-throughput high-resolution chromosomal interaction capture (T2C), polymer simulations, and scaling analysis of the 3D architecture and the DNA sequence.Entities:
Keywords: Cell nucleus architecture; Chromatin fibre; Chromatin loops; Chromatin rosettes; DNA sequence organization; Genome organization; Nucleosome; Polymer physics simulation; Targeted chromatin capture
Year: 2016 PMID: 28035242 PMCID: PMC5192698 DOI: 10.1186/s13072-016-0089-x
Source DB: PubMed Journal: Epigenetics Chromatin ISSN: 1756-8935 Impact factor: 4.954
Fig. 1T2C description, interaction mapping, and direct determination of the chromatin quasi-fibre and the aggregated loop/rosette 3D architecture of the human and mouse genomes: a Cell nuclei in a population of cells (transmission light and fluorescence microscopy, [89]) have an underlying chromatin architecture (simulated cell nucleus containing 1.2 million polymer segments; resolution 5.2 kbp, i.e. ~50 nucleosomes; Multi-Loop-Subcompartment (MLS) rosette model with 126 kbp loops and linkers; [5]). After crosslinking the DNA is restricted within the nucleus by a 1st restriction enzyme, before the crosslinked fragments are extracted and diluted such that intra-fragment re-ligation is favoured. After de-crosslinking, the re-ligated material is shortened by a 2nd restriction enzyme or sonication and purified by a capture array with oligos designed next to the 1st restriction enzyme, before paired-end-sequencing over the ligation position. After alignment to the reference genome, this results in interactions frequency matrices (b–d) and scaling curves (Fig. 2). b, c Interaction matrices (logarithmic and colour-coded scale; left and right) of the human IGF/H19 11p 15.5-15.4 region (b) in HB2, HEK293T TEV (intact cohesin) and HEK293T HRV (cleaved cohesin) as well as the mouse β-globin 7qE3-F1 region (c) for fetal brain (inactive β-globin) and liver cells (active β-globin) show the formation of subchromosomal domains separated by a linker (borders: pink lines, right; D1s, D1e: start and end of domains), which consist of loops (red lines; 8L: number of loops), representing due to the grid-like pattern loop aggregates/rosettes. A grid-like pattern is also visible in the interactions between the domains and corresponds to the interactions of loops and loop bases of interacting domains. Near the diagonal the aggregation into a chromatin quasi-fibre and loop internal structures are visible (zooming in and out the images can make this clearer). Between different cell types and functional states only some local differences are visible resulting in a consensus architecture and allowing simulation of the 3D architecture (middle; resolution <~1 kbp). Note that the simulation is driven by the dominant consensus architecture. d The interaction matrix of a 380 kbp subchromosomal domain in the mouse 12qF1–F2 region at high resolution clearly shows the regular rosette-like picture with a detailed structure of the loop base with in- and outgoing loop fibre stretches as seen in simulations (e, f). e Simulated Multi-Loop-Subcompartment (MLS) model with an averaged spatial distance map for exact spatial distances 〈R 〉 (left) and on the diagonal normalized interaction maps for interaction radii 〈d 〉 of 50 nm, 70 nm, and 150 nm (right), for an MLS model with 126 kbp loops and linkers [16 Mbp upper and 1.2 Mbp zoom-in (z) lower row), showing clearly the formation of domains connected by a linker, their interaction, and the underlying loop aggregates/rosette architecture, with (anti-)parallel fibre stretches at the loop base. The dependence on the interaction radii corresponding to different crosslink probabilities is also clearly visible. f Sketch of the different structures visible on different scales in the experimental and simulated interaction matrices (spatial distance matrix: left; simulated interaction matrix: upper) (from e): On the smallest scale, near the diagonal the compaction of nucleosomes into the quasi-fibre (yellow line) and the fibre regime (dark blue line) can be found. On the largest scale the domains are clearly bordered (pink lines) and connected by a linker. On medium scales the loop aggregate/rosette like structure is characterized by the loop bases (red circles: within domains, blue circles: between domains) as well as the loop interactions (green triangles). The fine structure of the loops representing the (anti-)parallel loop stretches at the base (red crosses) and within loops (green stretches near diagonal)
Fig. 2Scaling analysis of experiments, simulations, and the DNA sequence showing the formation of a chromatin quasi-fibre and the loop aggregate/rosette genome architecture: a The fine-structured multi-scaling resulting from the T2C interaction frequency as a function of the genomic separation for the human IGF/H19 11p 15.5–15.4 region and the mouse β-globin locus 7qE3–F1 (3 bp average (1–200 bp) and thereafter a grouping with a 1 % resolution per order of magnitude which for clarity is smoothed by a running window average for >103 bp; see also Additional file 27: Figure S14; the values <10 bp are due to the algorithm used and for transparency not discarded since they nevertheless show the extrapolation from values >10 bp), shows: (i) The structure of the nucleosome, (ii) the formation of a plateau from 195 to ~1000 bp, indicating the formation of a chromatin quasi-fibre with a density of 5 ± 1 nucleosomes per 11 nm, (iii) the chromatin quasi-fibre regime, (iv) a mixed chromatin fibre/loop regime with a slightly higher interaction decrease, (v) the plateau indicating the loop aggregate/rosette state, and (vi) in principle the linker regime (not visible in a but in d). c, d The fine-structured multi-scaling is even clearer for the average of 15 loci covering in total ~99 Mbp in mouse MEL cells with subnucleosomal fragment resolution: After an initial increase a plateau is reached from ~50 to ~100 bp, followed by a sharp peak from ~110 to 195 bp (width at plateau level ~85 bp), followed by a second ~10 % decreasing plateau up to 1.0–1.2 kbp, which after a sharp decent until ~104 bp transits to the known multi-scaling behaviour (d, compare with a). With this resolution the fine structure visible (Additional file 28: Figure S15), can be associated with the binding of the DNA double helix to the nucleosome, since up to ~195 bp many of the small peaks (the most prominent at 145 bp) can be associated with the fine structure in the fine-structured multi-scaling behaviour of DNA sequence correlations (e; Additional file 28: Figure S15, Additional file 29: Figure S16). Whereas the structure of the nucleosome vanishes using “secured” interactions (c, pink and light blue), above 195 bp the plateau and multi-scaling behaviour remain. Again the values <10 bp are due to the algorithm used and for transparency not discarded since they nevertheless show the extrapolation from values >10 bp. b The interaction scaling of a simulated Multi-Loop-Subcompartment model with 126 kbp loops and linkers as well as a Random-Walk/Giant-Loop model with 1 Mbp loops and 126 kbp linkers consistently shows for different interaction radii a multi-scaling behaviour. The MLS model shows the characteristic rosette plateau, followed by the random scaling regime of the linker conducting a random-walk. The peaked fine structure originates from the loops forming the rosettes. In contrast, the RWGL model is characterized by random-walk regime and only one major fine structure attributable to the single loops. At greater scales the limit of the entire chromosome is seen in the cut-off. The MLS model agrees in detail with experiments (a, c–d) and the DNA sequence organization (e). e The fine-structured multi-scaling long-range correlation behaviour of each of two human and mouse strains shows clearly again the architectural features: a general increase until a plateaued maximum (including the 145 bp peak), a first plateau area until ~1200 bp, transition to a sharper decrease at ~3.6 kbp (the sweet point used in the calculation of the persistence length) until a minimum ~10–20 kbp and a second statistically significant maximum at ~100 kbp, followed by a random regime and a final cut-off. The first maximum and plateau are characteristic for the nucleosome and formation of the quasi-fibre (c; Additional file 28: Figure S15, Additional file 29: Figure S16) which then transits to chromatin loops and their clustering into loop aggregates/rosettes which are connected by a random-walk behaving linker. Thus, due to the higher statistics here, the architectural features and their tight representation within the DNA sequence organization are even clearer