| Literature DB >> 30414923 |
Aleksandra Pękowska1, Bernd Klaus2, Wanqing Xiang3, Jacqueline Severino2, Nathalie Daigle4, Felix A Klein2, Małgorzata Oleś2, Rafael Casellas5, Jan Ellenberg3, Lars M Steinmetz6, Paul Bertone7, Wolfgang Huber8.
Abstract
The genome of pluripotent stem cells adopts a unique three-dimensional architecture featuring weakly condensed heterochromatin and large nucleosome-free regions. Yet, it is unknown whether structural loops and contact domains display characteristics that distinguish embryonic stem cells (ESCs) from differentiated cell types. We used genome-wide chromosome conformation capture and super-resolution imaging to determine nuclear organization in mouse ESC and neural stem cell (NSC) derivatives. We found that loss of pluripotency is accompanied by widespread gain of structural loops. This general architectural change correlates with enhanced binding of CTCF and cohesins and more pronounced insulation of contacts across chromatin boundaries in lineage-committed cells. Reprogramming NSCs to pluripotency restores the unique features of ESC domain topology. Domains defined by the anchors of loops established upon differentiation are enriched for developmental genes. Chromatin loop formation is a pervasive structural alteration to the genome that accompanies exit from pluripotency and delineates the spatial segregation of developmentally regulated genes. Published by Elsevier Inc.Entities:
Keywords: CTCF; CTCF loops; chromatin architecture; chromatin loops; chromatin structure; differentiation; pluripotency; topologically associating domains
Mesh:
Substances:
Year: 2018 PMID: 30414923 PMCID: PMC6327227 DOI: 10.1016/j.cels.2018.09.003
Source DB: PubMed Journal: Cell Syst ISSN: 2405-4712 Impact factor: 10.304
Figure 1Differentiation Elicits Formation of Long-Range Chromatin Loops
(A) Examples of chromatin loops (arrows) in ESCs and NSCs (lower and upper triangles, respectively). Heatmaps show normalized counts of in situ Hi-C reads between pairs of genomic loci (STAR Methods).
(B) Composite profile of in situ Hi-C signal (similar to implementation of APA [Rao et al., 2014]) from reduced (top) and induced (bottom) loops in ESCs (left) and NSCs (right). Statistical significance of loop signal was assessed by a Wald test (FDR = 0.05 and FC > 1.5; STAR Methods).
(C) Examples of dynamic and stable loops.
(D) Length distributions of NSC-specific, common, and ESC-specific loops.
Figure 2Compactness of Euchromatin Remains Unchanged upon Differentiation
(A) Experimental approach.
(B) SRI identification of RD in ESCs and Nestin+ NSCs. Cells were labeled with anti-Nestin antibody prior to SRI, and Nestin− and Nestin+ fractions were analyzed in ESC and post-neural induction cultures, respectively (Nestin signal not shown). RDs imaged by conventional microscopy (first panel column), GSDIM (pixel size 10 nm; second and third panel columns), and RD detection (fourth panel column) by automated image analysis.
(C) Nearest neighbor distance (NND) distributions in ESCs (red) and NSCs (blue) (sample sizes: nES = 24, nNS = 19; RDs: nESC = 2,410, nNSC = 2,576; pixel size = 10 nm).
Figure 3Loop Formation Is Associated with Gains in CTCF and Cohesin Binding
(A) Example of concomitant loop gain (in situ Hi-C) and increased CTCF ChIP-seq signal.
(B) Anchors of induced loops primarily overlap CTCF peaks that gain CTCF and Rad21 signal upon neural induction of ESCs. The union of CTCF peaks identified in ESCs and NSCs (PCTCF) was considered. ChIP-seq reads were counted inside each PCTCF interval, and differences were assessed with the DESeq2 method. PCTCF with padj. < 0.1, for which NSC/ESC > 1 were also considered gained. Top: loops for which both anchors overlapped at least one CTCF peak. Bottom: loops with a single CTCF peak at each anchor (n = 479, 20% of loops, consistent with Rao et al. [2014]).
(C) Loop induction correlates with a gain of CTCF peaks located primarily at loop anchors and facing the interior of the loop. Increased sites were those where the normalized ChIP-seq ratio of NSC/ESC was > 1 and padj. < 0.1 (DESeq2 method). CTCF peaks were further stratified based on the orientation of the CTCF motif (forward and reverse groups). Each domain, defined by the anchors of an induced loop, was divided into 250 intervals (x axis; ten intervals were appended to the starts and ends of the loop domains), and the overlap with CTCF peaks was assessed therein. The percentage of domains intersecting a CTCF peak group is shown along the y axis.
Figure 4Pluripotent Stem Cell Chromatin Features Weak Chromatin Domain Boundaries
(A) Schema of the definition of the insulation score at a boundary between two domains (gray) as the log2 of the ratio of “inside” to “between” interactions. The score is positive for strong insulators and negative for weak insulators.
(B) Insulatory strength of CTCF sites at contact domain boundaries is correlated with loop formation. Bins overlapping a CTCF peak and at domain boundaries were stratified based on whether they overlapped with a loop anchor (with/without loop; p values: two-sided t test, NSCs, in situ Hi-C data).
(C) Difference of insulation scores (NSC minus ESC) at anchors of reduced, common, and induced loops (p < 2.2 × 10−16, two-sided t test; induced versus reduced loops, in situ Hi-C data).
(D) Boundaries of contact domains display overall lower insulation score in ESCs relative to differentiated cells.
(E) CTCF and Rad21 binding more frequently increases at boundaries of contact domains than at other genomic locations (p < 2.2 × 10−16, two-sided t test), which preferentially lose CTCF and Rad21 signals, consistent with the detection of greater numbers of peaks in ESCs.
(F) Reprogramming-induced depletion of loops; average of the Hi-C profiles (data from Krijger et al., 2016) at induced loops (in situ Hi-C data, n = 2,454) in NSCs and reprogrammed derivatives.
(G) Insulation scores at contact domain boundaries are diminished upon reversion of NSCs to iPSCs (two-sided t test).
Figure 5Chromatin Topology Is Established Progressively during Differentiation
(A) Experimental design: in vitro conditions to obtain uniform cultures of ground-state pluripotent cells (ESCs maintained in 2i/LIF) and primed pluripotent stem cells (post-implantation epiblast stem cells [EpiSCs]).
(B) Composite profile of TCC signal at loops identified as stronger in ESCs (top) or EpiSCs (bottom).
(C) Length distribution of loops specific to ESCs and EpiSCs.
(D) Composite profile of loops displaying a significant alteration of TCC signal between ESCs (2i/LIF) and NSCs. Loops identified in either or both conditions were considered (TCC data).
(E) Loops are gained in a stepwise manner following loss of naive pluripotency. Loops identified as induced in NSCs relative to ESCs (2i/LIF) were considered (TCC data). Induced loops were grouped into three classes according to genomic span. For each class, ratios of the loop signal between ESCs or NSCs to the signal in EpiSCs are displayed. Loop strength in EpiSCs is between that of ESCs and NSCs (two-sided t test).
(F) Interactions across anchors of NSC-specific loops are gradually lost. The two panels display the ratios between composite profiles of the TCC signal around anchors of induced loops (ESCs [2i/LIF] versus NSCs; TCC data) at 10-kb resolution. Left: ratio of ESC (2i/LIF) to NSC TCC signal; right plot: ratio of ESCs to EpiSCs.
Figure 6Loop Dynamics and the Regulation of Gene Expression
(A) Loop domains are genomic intervals defined by the end of the left anchor (+10 kb) and the start of the right anchor (−10 kb).
(B) Induced loops (in situ Hi-C; n = 2,454) preferentially connect active regulatory elements. Enrichment relative to random pairs of loci separated by a similar genomic distance is indicated above each bar. Inset: the number of up- and downregulated genes (DESeq method; FC > 1.5; adjusted p < 0.1) among loci with promoters forming a loop with enhancers in NSCs only.
(C) Example of an upregulated locus (Lhx2) inside an induced loop domain.
(D) Induced loop domains are formed around activated enhancers and upregulated genes. The x axis plots the fraction of induced loop domains overlapping induced and repressed enhancers (top) and transcriptionally up- and downregulated genes (bottom).
(E) Loop changes correlate with the dynamics of intra-loop-domain promoter-enhancer contacts measured by in situ Hi-C (two-sided t test).
(F) Genes and enhancers active in adult neuronal tissues are found more frequently inside induced than reduced loop domains (Fisher’s exact test).
Findings from This Work and Related Studies
| Feature | This study | ||
|---|---|---|---|
| Embryonic stem cells (ESCs) (conventional and ground-state cultures), post-implantation epiblast stem cells (EpiSCs), induced pluripotent stem cells (iPSCs), neural stem cells (NSCs). | ESCs (conventional), neural progenitor cells (NPCs) and post-mitotic cortical neurons (CNs). NPCs and CNs were derived | Reprogramming of B cells to iPSCs. | |
| CTCF-anchored loops | Stepwise genome-wide induction of long-range loops upon exit from naive pluripotency. | Genome-wide gain of contacts between domain boundaries and convergent CTCF binding sites. | Dissolution of B cell-specific loops after reprogramming. iPSC-specific loops identified as long range. |
| Domain type | Contact domains; topologically associated domains (TADs); replication domains | TADs | TADs |
| Domain boundaries in cell differentiation | Progressive strengthening following loss of naive pluripotency and differentiation to EpiSCs and more specialized NSCs. | Strengthening upon differentiation of ESCs to NPCs. Pronounced strengthening of domain boundaries was not observed in terminally differentiated cells. | N/A |
| Domain boundaries in reprogramming | Restores weak domain boundaries. | N/A | Gain of domain boundaries is more frequent than loss after reprogramming. |
| Chromatin compaction | Not coupled with genome-wide gain of loops. | N/A | N/A |
| Profiles of CTCF and cohesins | More peaks in ESCs than NSCs. Neural induction results in quantitative gain of CTCF, and cohesin binding at loop anchors and domain boundaries. Binding of the two factors is diminished at other genomic loci. | Presence/absence of CTCF peaks does not account for genome-wide gain of loops and domain boundary strength. | CTCF binding correlates with insulatory strength of TAD boundaries. No correlation between gain or loss of TAD boundaries and CTCF recruitment. |
| Relationship between gene expression, chromatin domains, domain boundaries, and structural loops | Positive correlation between loop formation and gene expression. Few loops connect active promoters and enhancers. Induced loop domains span enhancers and developmentally regulated genes. | Positive correlation between loop formation and gene expression. Transcriptional activation frequently coincides with formation of boundaries but is insufficient to elicit boundary formation. | Positive correlation between loop formation and gene expression. No correlation between formation of TAD boundaries and transcriptional regulation. Changes in TAD structure precede gene activation. |
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Anti-CTCF | Milipore | Cat# 07-729; RRID: |
| Anti-Nestin | Developmental Studies Hybridoma Bank | Cat# Rat-401; RRID: |
| Anti-Oct3/4 | Santa Cruz Biotechnology | Cat# sc-5279; RRID: |
| Anti-Rad21 | Abcam | Cat# ab-992; AB_2176601 |
| Anti-mouse Alexa Fluor 532 | Invitrogen | Cat# A11002; RRID: |
| Phycoerythrin-labelled secondary antibody | Santa Cruz Biotechnology | Cat# sc-3761; RRID: |
| DMEM/F-12 | Thermo Fisher | 31331028 |
| GMEM | Invitrogen | 11710035 |
| N2 supplement | Thermo Fisher | 17502-048 |
| B27 supplement | Thermo Fisher | 17504-04 |
| Non-essential amino acids | Thermo Fisher | 11140050 |
| L-glutamine | Thermo Fisher | 25030081 |
| Bovine serum albumin (BSA, fraction V) | Thermo Fisher | 15260037 |
| CHIR99021 (GSK3ß inhibitor) | Trevigen (Reagents Direct) | 27-H76 |
| PD0325901 (MEK inhibitor) | Trevigen (Reagents Direct) | PD0325901 |
| Leukemia Inhibitory Factor (LIF) | This study | This study |
| Epidermal Growth Factor (EGF) | This study | This study |
| Basic Fibroblast Growth Factor (bFGF) | This study | This study |
| Activin A | Sigma | A4941 |
| Fibronectin | Sigma | F1141 |
| Laminin | Sigma | L2020 |
| Accutase | Sigma | A6964-100ML |
| T1 paramagnetic beads | Thermo Fisher | 65602 |
| C1 paramagnetic beads | Thermo Fisher | 65002 |
| Ampure XP paramagnetic beads | Beckman Coulter | A63881 |
| Protein A Dynabeads | Invitrogen | 10002D |
| Complete Mini EDTA-free proteinase inhibitor | Roche | 11836170001 |
| Catalase | Sigma | C3155 |
| Glucose oxidase | Sigma | G0543 |
| HindIII | New England Biolabs | R0104S |
| MboI | New England Biolabs | R0147S |
| Quick Ligation kit | New England Biolabs | M2200S |
| T4 DNA ligase | New England Biolabs | M0202S |
| Exonuclease III ( | New England Biolabs | M0206S |
| Klenow Fragment (3’→5’ exo-) | New England Biolabs | M0212S |
| DNA Polymerase I, Large (Klenow) Fragment | New England Biolabs | M0210S |
| T4 Polynucleotide Kinase | New England Biolabs | M0201S |
| T4 DNA polymerase | New England Biolabs | M0203S |
| Phusion 2x master mix | New England Biolabs | M0536S |
| EZlink Iodoacetyl-PEG2-Biotin | Thermo Fisher | 21334 |
| NEXTFlex DNA Barcodes | Bioo Scientific | NOVA-514102 |
| 2′-Deoxyguanosine-5′-O-(1-thiotriphosphate) sodium salt, Sp-isomer | Axxora | BLG-D031-05 |
| EdU | Molecular Probes | C10340 |
| Alexa Fluor 647 | Molecular Probes | C10340 |
| Slide-A-lyzer | Thermo Fisher | 66003 |
| RNase A | Thermo Fisher | EN0531 |
| Proteinase K | New England Biolabs | P8107S |
| Ovation Ultralow library system V2 | NuGEN | 034432 |
| TRIzol reagent | Thermo Fisher | 15596026 |
| TURBO DNase | Ambion | AM2238 |
| Agilent 2100 Bioanalyzer RNA 6000 Nano | Agilent | 5067-1511 |
| TruSeq Stranded mRNA Sample Prep Kit | Illumina | 20020594 |
| TCC | ||
| In-situ Hi-C | ||
| ChIP-seq | ||
| RNA-seq | ||
| Sox1-GFP mouse embryonic stem cells (46C) | Austin Smith | 12524553 |
| Neural stem cells | This study | This study |
| Post-implantation epiblast stem cells (EpiSC) | This study | This study |
| ESC line with stably inserted FUCCI system | Matthias Lutolf | 23193167 |
| Mus musculus 129P2/Ola | NA | NA |
| Leica SR GSD Wizard | NA | |
| MATLAB 2012b | NA | |
| Juicer | NA | |
| Juicebox v1.0 | NA | |
| SAMtools 0.1.19 | NA | |
| BEDtools | NA | |
| Bowtie 2 | NA | |
| TopHat 2 | NA | |
| MACS 1.4 | NA | |
| MACS 2 | ||
| R 3.2.2 | NA | |
| edgeR_3.12.1 | NA | |
| DESeq2_1.10.1 | NA | |
| DESeq_1.22.1 | NA | |
| ggplot2_2.1.0 | NA | |
| Matrix_1.2-6 | NA | |
| IRanges_2.4.8 | NA | |
| GenomicRanges v1.22.4 | NA | |
| Analysis code and vignette | This study | NA |
| H3K4me1 (ESCs) | SRP000230 | |
| H3K4me1 (NP cells) | SRP000230 | |
| H3K4me3 (ESCs) | SRP000230 | |
| H3K4me3 (NP cells) | SRP000230 | |
| H3K27ac (ESCs) | SRP003638 | |
| H3K27ac (NP cells) | SRP003638 | |
| GNF Mouse GeneAtlas V3 | ||
| Expression data from murine NP cells | ||
| Genomic coordinates of enhancer elements identified in brain tissues | NA | |
| Hi-C data (NS and NSC derived iPS cells [passage 20]) | ||