Literature DB >> 32783593

Transcription factors: building hubs in the 3D space.

Dafne Campigli Di Giammartino1, Alexander Polyzos1, Effie Apostolou1.   

Abstract

The hierarchical three-dimensional folding of the mammalian genome constitutes an important regulatory layer of gene expression and cell fate control during processes such as development and tumorigenesis. Accumulating evidence supports the existence of complex topological assemblies in which multiple genes and regulatory elements are frequently interacting with each other in the 3D nucleus. Here, we will discuss the nature, organizational principles, and potential function of such assemblies, including the recently reported enhancer "hubs," "cliques," and FIREs (frequently interacting regions) as well as multi-contact hubs. We will also review recent studies that investigate the role of transcription factors (TFs) in driving the topological genome reorganization and hub formation in the context of cell fate transitions and cancer. Finally, we will highlight technological advances that enabled these studies, current limitations, and future directions necessary to advance our understating in the field.

Entities:  

Keywords:  3D promoter–enhancer hubs; Chromatin looping; transcription factors

Year:  2020        PMID: 32783593      PMCID: PMC7553511          DOI: 10.1080/15384101.2020.1805238

Source DB:  PubMed          Journal:  Cell Cycle        ISSN: 1551-4005            Impact factor:   4.534


Introduction

Gene expression during physiological and pathological cell fate changes is controlled in part by the spatiotemporal regulation of enhancers and their target promoters. Despite progress in the genome-wide mapping of putative enhancers based on chromatin accessibility, transcription factor (or cofactor) binding or select histone marks (e.g. H3K27ac and H3K4me1), uncovering functional enhancers and assigning them to target genes remains extremely challenging. The emergence of 3D chromatin organization as an important regulatory layer of gene expression and cell identity [1-7] highlighted the fact that understanding enhancer specificity and activity requires knowledge of its 3D neighborhood [8-12]. Glossary for 3D topology terms used in this review. All different hierarchical levels of 3D chromatin organization, ranging from megabase-scale compartments to Topologically Associating Domains (TADs) and fine-scale chromatin loops contribute to shape the regulatory enhancer landscape [13-15] (see Table 1). At the level of chromosome compartments, active or poised enhancers and genes are expected to be located within the so-called A compartments, which represent more euchromatic, open, and active genomic regions, while repressed and methylated enhancers will often reside within the heterochromatic and transcriptionally inactive B compartments [16]. Interestingly, during processes such as differentiation and somatic cell reprogramming A-to-B and B-to-A compartment switches can occur, consistent with enhancer silencing or de novo activation, respectively, and a global 3D rewiring [17-19]. Although enhancers can function over large distances, their activity is largely restricted within TADs. TADs are chromatin domains with high degree of self-association that are insulated from neighboring genomic regions by boundary elements [5]. Albeit TAD boundaries are largely conserved across cell types and species and are thought to spatially confine transcriptional regulatory units [20], boundary insulation and intra-TAD enhancer–promoter contacts are dynamically changing during cell fate transitions instructing cell-type-specific transcriptional programs [21]. Here, we will focus on the most recent 3D chromatin studies reporting complex and dynamic enhancer and promoter contacts and discuss the nature, organizational principles, and potential function of such intricate interactions.

Enhancer–promoter networks and 3D hubs

Over the last two decades, hundreds of studies using either DNA fluorescence in situ hybridization (FISH) or chromosome conformation capture (3C)-based approaches have described pairwise enhancer–promoter interactions in various cellular contexts [21-23]. These studies were instrumental assigning enhancers to the right target genes and revealed cases in which this communication may happen over very large distances (over a Megabase) and by skipping one or more intervening genes. Intriguingly, in addition to the typical one-to-one enhancer–promoter contacts, increasing evidence from multiple studies and experimental systems support the existence of complex and highly dynamic enhancer–promoter networks. The first observations of multi-loci connections were based on 4C-based approaches around selected viewpoints in multiple different cellular contexts and conditions, including antiviral response [24], reprogramming [3,4,25], olfactory neurons [26,27], erythroid cell differentiation [28] and other developmental processes [29]. These studies, often combined with single-cell 3D-DNA FISH validations, revealed contacts among multiple genes and regulatory elements both in cis and in trans at variable frequencies providing important insights into the dynamic nature of 3D chromatin organization and the factors that may drive formation of multiconnected hubs. In addition, although limited to few select loci, these studies established the first functional links between long-range interactions and transcriptional regulation or co-regulation of implicated genes. The development of Hi-C technology which allows genome-wide mapping of regions in physical proximity further improved our understanding of the organizational principles of 3D chromatin topology and the association with gene expression [30,31]. However, the ability of Hi-C to detect fine-scale long-range enhancer–promoter contacts is limited by the resolution and requires very high sequencing depth. Moreover, findings from several groups hint to the fact that, even at high resolution, Hi-C is mostly able to detect structural loops (such as those mediated by CTCF) but promoter–enhancer contacts are underrepresented [32,33]. Recent technological advances that combine Hi-C with antibody-based or oligo-based enrichment steps, such as ChIA-PET [34], PLAC-seq [35], HiChIP [36] and Capture Hi-C [37] have drastically improved the resolution and the discovery rate of promoter–enhancer interactionss. A number of recent studies that utilize these technologies in different cellular systems and combined with antibodies against transcriptional and architectural factors (e.g. RNA PolII or Smc1) or characteristic “active” histone modifications (H3K4me3 or H3K27ac) have provided unique insights into the frequency, nature, dynamics, and roles of enhancer–promoter contacts and multiconnected hubs [8,38-41]. By performing HiChIP with an antibody against histone H3 lysine 27 acetylation (H3K27ac), we have recently generated 3D maps of active enhancer and promoter contacts in murine somatic and pluripotent stem cells (PSCs) and described a drastic rewiring during somatic cell reprogramming [33]. This assay captured thousands of high-confidence and statistically significant enhancer–enhancer (41%), enhancer–promoter (41%), and promoter–promoter (18%) contacts. Although the majority of cis-regulatory elements were engaged in only one interaction, we found few hundreds of genomic regions communicating with 10 or more (up to 33) distant genes and/or enhancers at high frequencies. Importantly, the degree of interactivity (number of distinct contacts per anchor), which was validated by H3K27ac-independent approaches such as 4C-seq and Hi-C, was strongly associated with the transcriptional levels of connected genes. Among the highest connected HiChIP anchors in pluripotent stem cells were many of the previously described super-enhancers (SE) [42] as well as known stem cell regulators, including Mycn, Esrrb, and Mir290. In agreement with these findings, an independent study, which applied Hi-C in 21 primary human tissues, identified tissue-specific SEs as hotspots of local chromatin interactions termed FIREs (frequently interacting regions) [43]. FIREs showed high levels of transcriptional activity in the respective cell types and included many cell-identity genes. The increased 3D interactivity of SEs has been further supported by the detection of SEs within recently described Hi-C stripes [44] (described later in this review) and the reported inter-connectivity amongSEs in cis and trans [45]. Together, these results suggest that a high degree of connectivity might be an inherent characteristic of regulatory elements that control high transcriptional levels of cell-type-associated genes, such as SEs or stretch enhancers [46,47]. An important consideration when evaluating results from these classical pairwise 3C-based studies is that they represent the “average” 3D chromatin conformation of cell populations and cannot distinguish between co-occurring interactions in the same cells and mutually exclusive, or competitive, interactions that occur in different cells. Recently developed technologies, such as multi-contact 4C and Tri-C, enabled the detection of simultaneous multi-loci interactions within a single allele [48-52]. These studies identified high-order complexes formed by multi-way chromatin interactions between enhancers and promoters at selected mouse gene loci, including the globin locus [48,49] and Pcdh locus [50] as well as antigen receptor genes [52], confirming the existence of “regulatory hubs” within single alleles in which enhancer elements can simultaneously contact and potentially regulate multiple genes. Overall, these results confirm the presence of multi-way contacts in single cells but are restricted to a very small number of tested loci. Single-cell Hi-C could potentially overcome this issue by providing a genome-wide view of multi-way contacts in single cells [53-55]; however, this method is limited by the sparsity of data that is inherent to current single-cell assays and cannot achieve high enough resolution to be informative at the level of individual regulatory elements. Other approaches that do not rely on pairwise proximity ligation have recently emerged to study chromatin conformation [56], including genome architecture mapping [57] (GAM), split-pool recognition of interactions by tag extension [58] (SPRITE), transposase-mediated analysis of chromatin looping [59] (Trac-looping), optical reconstruction of chromatin architecture (ORCA) [60] and chromatin-interaction analysis via droplet-based and barcode-linked sequencing [61] (ChIA-Drop). Using these techniques, the authors were able to identify multi-loci chromatin interactions involving three or more genomic regions, albeit at low frequencies. For example, ChIA-Drop in Drosophila cells revealed that the vast majority (80%) of RNA polymerase II-mediated complexes included only one promoter, supporting that most promoters are not spatially interconnected in individual cells and alleles. Nonetheless, the same study captured 2,700 complexes that simultaneously connect 3 or more promoters, indicating that rare multi-way interactions do occur in single molecules . This is in agreement with recent high-resolution FISH-based approaches [62]. These results suggest caution in the interpretation of high 3D interactivity from population-averaged 3C-based data and support that highly connected hubs likely represent different pairwise (instead of multi-way) contacts that occur within the population. This could either reflect population heterogeneity (different stable loops in cell subpopulations) and/or dynamic pairwise contacts across the population (see also Table I).
Table 1.

Glossary for 3D topology terms used in this review.

Features of chromatin topology
CompartmentsHigher-order megabase structures formed by segregation of two mutually exclusive types of chromatin, A or B [16]. Active or poised enhancers and genes are expected to be located within the so-called A compartments, which represent more euchromatic, open and active genomic regions, while repressed and methylated enhancers will often reside within the heterochromatic and transcriptionally inactive B compartment.
TADs (Topological Associated Domains)(Sub)megabase chromatin domains characterized by a high degree of self-association, and demarcated by boundaries that insulate them from neighboring genomic regions [5].
LoopsChromosomal structures formed by chromatin extrusion through the cohesin ring(s) that bring distal genomic regions in physical proximity [30].
Chromatin contactPhysical proximity between distal genomic sites in the 3D nucleus [136], not necessarily mediated by looping (can be intra-chromosomal [22,52] or inter-chromosomal [24,26,27]).
HubsHighly interacting regions that form contacts with 2 or more chromatin loci. They can represent either highly dynamic or heterogenous pairwise interactions within a cell population or -more rarely- multiway contacts on the same cell and allele. Hubs can either form through loop extrusion or through segregation and (micro)phase separation [33,47–51,57,58].
Transcription factoriesNuclear foci of concentrated RNA polymerase II and basic transcriptional machinery where multiple genes and regulatory elements physically merge to become actively transcribed [100].
Possible mechanisms driving chromatin topology
Loop ExtrusionA mechanism of loop formation whereby cis-acting loop-extruding factors (e.g. cohesin) form progressively larger DNA loops until they encounter boundary elements (e.g. convergently oriented CTCF binding sites) [13,30,72,73].
Segregation/Self-organizationThe differential affinities among various histone modifications and chromatin-bound factors promote self-assembly and/or separation within the 3D nucleus resulting in the formation of subnuclear structures, such as A and B compartments [16].
Phase separationProcess by which molecules separate in the absence of membranes into discrete liquid condensates with distinct compositions. Phase separation is formed by weak and highly multivalent interactions between their protein and/or RNA components [86].

Functional implications of hubs

As discussed above, high 3D chromatin interactivity strongly correlates with increased regulatory and transcriptional activity. To understand the degree to which multiple contacts have functional impact on the expression of involved genes, we will consider two types of multi-way interactions (Figure 1): (i) promoter-centric hubs, in which one promoter can contact multiple enhancers and (ii) enhancer-centric hubs, in which a single enhancer can interact with multiple genes.
Figure 1.

Different types of 3D hubs and their biological relevance. Schematic diagram of different types of dynamic multi-way interactions. (a) Promoter-centric hubs, where a single promoter can interact with multiple enhancers. (b) Enhancer-centric hubs, where one enhancer is in contact with multiple promoters. (c) Multi-hubs represent networks of interconnected enhancer and promoter hubs. The potential biological relevance of each type of 3D hub is indicated below each diagram.

Different types of 3D hubs and their biological relevance. Schematic diagram of different types of dynamic multi-way interactions. (a) Promoter-centric hubs, where a single promoter can interact with multiple enhancers. (b) Enhancer-centric hubs, where one enhancer is in contact with multiple promoters. (c) Multi-hubs represent networks of interconnected enhancer and promoter hubs. The potential biological relevance of each type of 3D hub is indicated below each diagram. Promoter-centric hubs have been extensively described in various contexts and have been suggested to confer phenotypic robustness through enhancer redundancy (Figure 1(a)). For example, deleting individual enhancers in loci required for mammalian limb formation did not affect limb development and morphology, however, deleting pairs of enhancers resulted in discernible phenotypes [63]. Another example is the alpha-globin gene, where no single enhancer element is critical for the activation of alpha-globin transcription [64]. In this case too, it seems that one enhancer can compensate for the loss of a different enhancer, in a redundant fashion. Our integrative analysis of RNA-seq data and H3K27ac HiChIP data in MEFs (mouse embryonic fibroblast) and PSCs [33] indicated that genes with higher number of enhancer contacts have higher levels of cell type-specific expression, suggesting that different enhancers may function synergistically to ensure tight and robust target gene activation. Future systematic interrogation of individual enhancers within promoter hubs in established PSCs or in the context of reprogramming is needed to directly test this hypothesis and further dissect potential functional hierarchies for the initial activation and maintenance of the transcriptional status of the target genes. While enhancers are typically assigned to the single most proximal gene, 3C-based assays have revealed cases where enhancers contact multiple different genes over distance [36,38,47] (Figure 1(b)), expanding the repertoire of candidate target genes per enhancer and raising the possibility of gene co-regulation by the same enhancer. Our H3K27ac HiChIP analysis in PSCs [33] identified a few hundred 3D enhancer hubs, where enhancers or SEs formed significant interactions with two or more genes (up to 10 in cis). Interestingly, when tested in the dynamic context of reprogramming, gene pairs within such hubs showed significantly higher percentage of transcriptional co-activation, compared to gene pairs of similar linear distance or residing in the same TADs, suggesting that hubs are responsible for co-regulation of spatially connected genes. Functional validation experiments further supported this notion, since CRISPR/(d)Cas9 targeting of select enhancer hubs, e.g. the Tbx3 and Zic2/Zic5 hubs, induced the concerted downregulation of multiple hub-associated genes without affecting neighboring non-hub genes. In agreement with these results, a parallel study in human pancreatic islets applied promoter Capture-C technology and found that 42% of enhancers had high-confidence interactions with more than one gene, forming 3D enhancer hubs [46,47]. Genes within hubs enriched for islet-related functions, including insulin secretion and diabetes, and accordingly showed coordinated transcriptional changes upon perturbation of glucose levels. CRISPR-mediated activation or inactivation experiments further supported that enhancers hubs are responsible for the co-regulation of multiple hub-connected genes, while non-hub transcripts were only moderately affected [47]. Importantly, both studies report cases where only some of the hub-connected genes respond to the enhancer perturbations, while others remain unaffected. We envision two possible scenarios that might explain this outcome. The first would suggest the existence of compensatory mechanisms due to the connection of non-responsive genes to additional enhancers that confer redundancy, as described in the “promoter-centric” hubs. An alternative possibility is that only some of the detected long-range contacts play direct regulatory roles, stressing the need to establish robust criteria for the prediction and identification of such interactions and their distinction from structural or bystander contacts. Predicting the regulatory effects of chromatin loops has been an area of intense investigation and several parameters have been proposed to dictate the regulatory effects of enhancer–promoter contacts [65,66]. The relative strengths and frequencies of different enhancer–promoter contacts, the linear distance between them and the presence of intervening insulators /boundaries are some of them. The local chromatin structure (accessibility, histone marks, and transcription factor binding) of enhancers and candidate target promoters might also play important role in defining the specificity of these interactions. Utilizing combinations of some these parameters, various predictive models have been proposed with variable degrees of success when independently validated for specific loci [32,67-69]. Moreover, integration of experimentally validated enhancers as a training set could help in the construction of models with high predictive value. A recent landmark study performed a systematic interrogation of thousands of putative enhancers around 30 target genes providing unprecedented insights into the principles of enhancer–promoter functionality [70]. By using a novel technology called CRISPRi-FlowFISH in human erythroleukemia cells, the authors performed a large-scale enhancer perturbation screen measuring the effects on the expression of endogenous candidate target genes through FISH [70]. Comparing their experimental results to the outcomes of various predictive models, they show that the recently proposed “activity-by-contact” (ABC) model significantly outperforms both in accuracy and sensitivity previous predictive models based either only on linear distance or on 3D genomic features (Hi-C contact or domain) or on chromatin states using machine learning approaches. The ABC model calculates the relative contribution of a regulatory element on target gene activity by using only two factors: the strength of the enhancer (as measured by H3K27ac ChIP-seq) weighted by the frequency of 3D contact with the target promoter (by using Hi-C, PolII ChIA-PET, or H3K27ac HiChIP information) divided by the total effect of all regulatory elements. Importantly, the perturbation-based maps built in this study uncovered complex connections where individual enhancers regulated up to 5 genes and individual genes were regulated by up to 14 distal elements, further supporting the functional role of highly connected hubs in living cells. Promoter-centric or enhancer-centric hubs represent a simplistic view of potential promoter–enhancer networks. An unbiased analysis focusing on the highly interacting hubs in both murine somatic and pluripotent cells revealed that the majority of enhancer and promoter hubs are inter-connected to each other forming complex high-order networks, as depicted in Figure 1(c). Based on the H3K27ac HiChIP [33] we were able to estimate that more than 80% of enhancer hubs contained genes that were also parts of promoter-hubs, suggesting an extensive “cross-talk” and interconnectivity (unpublished data). Together the abovementioned studies demonstrate the complexity of the 3D regulatory networks and highlight the challenges to accurately predict enhancer efficacy and specificity. Addressing these challenges will enable better understanding of the function of non-coding regulatory elements in health and disease and will ultimately enable identifying central nodes in the regulatory networks that control cell-type-specific or pathogenic programs.

Mechanisms of loop and hub formation

One of the most well-described mechanisms that has been proposed to shape the 3D genome is the loop extrusion model (see Table 1). Loop extrusion nicely explains the formation of TADs [71-73] and relies on progressive extrusion of chromatin through the cohesin ring until it reaches convergent CTCF-bound sites. Loop extrusion is also the underlying mechanism of the so-called “architectural stripes” (Figure 2(a)), which are formed by a cluster of unidirectional CTCF binding sites that blocks cohesin-mediated loop extrusion from progressing in that direction (30, 44, 73, 74). In the other direction, the DNA is being extruded and each segment forms a contact with the anchor, resulting in a stripe. This model can explain how one regulatory region (at the stripe anchor) could contact several other genomic loci, forming a hub. Architectural stripes are associated with both poised and active chromatin landscapes and have been shown to facilitate cognate promoter–enhancer interactions [44,74]. Intriguingly, upon targeted degradation of either CTCF or Cohesin subunits, long-range chromatin loops, TADs and stripes are drastically eliminated, while compartments and cis and trans contacts around super-enhancers remain unaffected or become even stronger [45,75-78], suggesting additional or competing forces of 3D chromatin organization. Despite the global effects on chromatin architecture, the effects on gene expression are less severe and selective, suggesting the presence of additional mechanisms that preserve transcriptional regulation and gene-enhancer communication. In agreement, recent work from several groups supports the existence of CTCF/cohesin-dependent and independent chromatin contacts [79-82] and speculated on their different function and organizational principles.
Figure 2.

Mechanisms of hub formation. Two main forces have been reported to contribute to hub formation involving promoter–enhancer contacts. (a) Architectural stripes are formed by cohesin-mediated loop extrusion, when a cluster of unidirectional CTCF binding sites on one boundary blocks extrusion in that direction, while in the other direction, the DNA is being extruded and each segment forms a contact with the anchor, resulting in a stripe. (b) phase separation is based on weak, low affinity, multivalent interactions mediated by large intrinsically disordered regions (IDRs) within proteins, that lead to the formation of subnuclear biomolecular condensates. These membrane-less structures are enriched, among others, in transcription factors, epigenetic writers and readers, nascent RNAs and activating histone marks that facilitate hub chromatin contacts.

Mechanisms of hub formation. Two main forces have been reported to contribute to hub formation involving promoter–enhancer contacts. (a) Architectural stripes are formed by cohesin-mediated loop extrusion, when a cluster of unidirectional CTCF binding sites on one boundary blocks extrusion in that direction, while in the other direction, the DNA is being extruded and each segment forms a contact with the anchor, resulting in a stripe. (b) phase separation is based on weak, low affinity, multivalent interactions mediated by large intrinsically disordered regions (IDRs) within proteins, that lead to the formation of subnuclear biomolecular condensates. These membrane-less structures are enriched, among others, in transcription factors, epigenetic writers and readers, nascent RNAs and activating histone marks that facilitate hub chromatin contacts. Other mechanisms have been proposed to explain the formation of chromatin loops such as the handcuff mechanism [83] and the depletion/attraction mechanism [84]. However, it is becoming commonly accepted that the chromatin 3D architecture is shaped through an interplay between loop extrusion and segregation/self-organization [13-15,23]. The latter describes the process of “self-attraction” among loci with similar transcriptional and chromatin states [16], such as epigenetic modifications (e.g. histone marks) and protein composition (e.g. transcription factors and cofactors) and exclusion of regions with different chromatin characteristics. This process nicely explains the formation of chromosome compartments and subnuclear membrane-less structures, including nucleoli and speckle bodies. One of the proposed mechanisms for self-organization is through phase separation [85-88] (Figure 2(b)), a process based on weak, low affinity, multivalent interactions mediated by large intrinsically disordered regions (IDRs) within proteins, that lead to the formation of subnuclear biomolecular condensates (see Table 1). It is plausible that phase separation could drive the formation of chromatin contacts as well as 3D hubs independently of cohesin-mediated extrusion by bringing multiple distal regions in physical proximity. In support to this notion, a large number of transcription factors [85,88,89], epigenetic modulators [87,90], (BRD4 PRC2), transcriptional cofactors [91,92] (Mediator, PolII) [91], long non-coding RNAs [93] and histone modifications [94] (H3K9me3, etc.) have been reported to promote nuclear condensation and form large nuclear clusters, which are particularly sensitive to concentration changes. Many of these factors are found preferentially enriched and at high density on multiconnected hubs [3333], FIREs [43] and super-enhancers [42], further supporting the hypothesis that 3D hubs are formed through biomolecular condensation, promoted by the high density of bound TFs and recruited cofactors. The high levels of histone acetylation and transcriptional activity that characterize 3D enhancer hubs and interacting genes may further facilitate multivalent interactions and self-organization. The relative contribution of each hub-associated component (transcriptional factors, protein cofactors, histone marks, and nascent transcripts) in hub organization and regulation has not been systematically investigated. However, recent studies that perturbed the activity, binding, or concentration of individual factors either by genetic interference or chemical inhibition have started shedding light into their roles in enhancer–promoter communication and gene regulation. For example, it has been shown that inhibition of BRD4 [87] binding or general perturbation of protein condensation by 1,6-hexanediol [91] treatment result in dissolution of large Med1, BRD4, and PolII condensates in mouse PSCs, preferential loss of SE activity and downregulation of many associated genes. Interestingly, some recent studies demonstrated that although these treatments have drastic effects on enhancer activity and gene expression, they do not affect 3D chromatin organization and enhancer–promoter contacts [95], arguing against the role of BRD4 and/or protein condensation in long-range interactions. Similarly, inhibition of transcription per se seems to have very little impact on promoter–enhancer contacts [95,96]. These data together argue that the transcriptional activity of a gene and the physical interaction with enhancer(s) are separable events.

Transcription, transcription factors, and hubs

The role of transcription and transcription-associated factors in 3D chromatin organization becomes increasingly appreciated over the years [97-99]. Back in 1993 Jackson et al. [100] had identified the so-called “transcription factories,” a concept that has been defined and re-defined over the years and which, in the broader sense, indicates nuclear foci of concentrated RNA polymerase II and basic transcriptional machinery at which nascent transcription occurs (see Table 1). Active genes and their regulatory elements have been shown to coalesce at transcription factories, and the dynamic relocation of genes toward and away from transcription factories correlates with their transcriptional “on” and “off” cycles. Transcription factories may thus represent the first report of a multiprotein nuclear body containing at least two different transcription units at any given time (P-P, E-E, P-E), resulting in the formation of chromatin 3D hubs. The potential architectural function of TFs was initially based on integrative analyses of 4C-seq or Hi-C datasets with ChIP-seq results [19,101]. More recent functional studies in various cellular contexts provided important experimental support either by directly depleting selected TFs or modulating TF binding sites and quantifying the negative effects on enhancer–promoter looping and gene expression [33,38,96,102-104]. For example, we have recently showed [33] that Cas9-mediated disruption of individual KLF4 binding sites from PSC-specific enhancers resulted in a reduced expression of multiple genes within the enhancer hubs, partly by reducing contact frequency (Figure 3(a)). In addition, inducible CRISPR-Cas9-mediated KO of KLF4 in stem cells, together with KLF2 and KLF5 KO (two closely related family members with overlapping binding targets) caused an overall decrease of hub and super-enhancer connectivity, supporting that KLF factors play a critical role in the maintenance of the 3D enhancer network in PSCs (Figure 3(a)). Other examples of TFs mediating enhancer–promoter contacts include Nanog [105] and Sox2 [103] in pluripotent stem cells, GATA1 [106] and Klf1 [22] in erythrocytes, Pax3 [104] and MyoD [96] during myogenesis, Pax5 [102] in B cell differentiation, Notch [38] in the context of cancer and YY1 in various cell types [39,107].
Figure 3.

Roles of KLF4 in the organization of 3D hubs. (a) KLF4 plays a role in the maintenance of 3D enhancer hubs. Depletion of KLF4 protein or disruption of its binding site within enhancers was shown to have a major effect on the maintenance of promoter–enhancer chromatin contacts within the hub and expression of the linked genes [1]. (b) KLF4 HiChIP in pluripotent stem cells showed that KLF4 is involved both in active and repressive/poised loops and hubs around pluripotency or developmental genes, respectively. For each category the potential KLF4 cofactors and example genes are shown. O/S/N: Oct4, Sox2, and Nanog proteins, PRC: polycomb repressive complex.

Roles of KLF4 in the organization of 3D hubs. (a) KLF4 plays a role in the maintenance of 3D enhancer hubs. Depletion of KLF4 protein or disruption of its binding site within enhancers was shown to have a major effect on the maintenance of promoter–enhancer chromatin contacts within the hub and expression of the linked genes [1]. (b) KLF4 HiChIP in pluripotent stem cells showed that KLF4 is involved both in active and repressive/poised loops and hubs around pluripotency or developmental genes, respectively. For each category the potential KLF4 cofactors and example genes are shown. O/S/N: Oct4, Sox2, and Nanog proteins, PRC: polycomb repressive complex. Mechanistically, TFs can mediate promoter–enhancer contacts in a variety of ways, which have been extensively described in several recent reviews [21,97,98]. In addition to the phase separation model mentioned above, these mechanisms include direct protein homo-dimerization (e.g. YY1 [39]), recruitment of co-factor proteins that in turn form oligomers (e.g. Ldb1 [104,108,109]), interaction with loop extruders (such as cohesin [25,105,110,111] and CTCF [74-76,107,112]) and recruitment of chromatin modifiers, which can either methylate DNA or modify histone tails [113,114], therefore leading to recruitment of other loop-mediating factors. To gain a better mechanistic understanding on how TFs shape the 3D genome organization, further experiments would be required in order to (i) identify all interacting proteins and nucleo-protein complexes in a locus-specific manner (e.g. by applying proximity labeling technologies around highly-interacting hubs), (ii) systematically perturb TF-mediated interactions either by mutating specific protein domains or block specific interfaces. Using a combination of 3D genomics, functional and proteomics assays we recently demonstrated the role of KLF4 in mediating long-range chromatin interactions during reprogramming and in established PSCs and its potential architectural co-factors [33]. By performing KLF4 HiChIP, we identified several thousands of KLF4-enriched chromatin contacts in ESC, which could be clustered into two categories (Figure 3(b)): (A) loops that overlap with enhancer–promoter contacts as detected by H3K27ac HiChIP and are enriched in active marks and transcribed genes and (B) loops that contained mostly poised developmental genes and are devoid of active histone marks. Bioinformatics and proteomics analyses revealed that the first class of KLF4-bound contacts were also co-occupied by Mediator, BRD4, cohesin and other pluripotency TFs (e.g. Nanog, Oct4), whereas the second class were bound by components of the Polycomb Repressive Complexes 1 and 2. These results suggest that TFs might collaborate with different cofactors to establish activating or repressive chromatin contacts. Given that many of these cofactors have been shown to form condensates and multi-connected hubs [87,110,115], it is plausible that TFs might nucleate the formation of distinct types of topological assemblies (activating or repressive hubs) by a process of self-organization.

Dynamics of hub establishment and maintenance

Most of the studies that aimed to address the role of 3D chromatin organization in gene regulation and the involved architectural factors have mostly focused on transcriptional and topological effects in steady-state conditions and in established cell lines. Therefore, the mechanisms that control establishment rather than maintenance of long-range chromatin interactions and the dynamic interplay between protein factor binding, chromatin looping, and transcriptional activation remain largely elusive. Few recent studies have started shedding light into these dynamic processes in the context of inducible gene expression [116], directed differentiation [96] or reprogramming [3,19,33,117,118] as well as cell cycle [119-123]. For example, it was recently shown that in contrast with the moderate and localized effects of cohesin depletion on gene expression, inducible activation of genes and enhancers in macrophages as response to microbial signals was directly dependent on cohesin [116]. Similarly, reduced cohesin levels or mutations can compromise proper activation of lineage-specific genes and perturb the balance between self-renewal and differentiation [116,124], supporting an instructive role of cohesin in the establishment of lineage-specific programs. To gain insights into these dynamic processes and dissect the order of molecular events that dictate 3D enhancer rewiring and transcriptional reprogramming of somatic cells toward pluripotency, we recently combined KLF4 HiChIP datasets with ChIP-seq and RNA-seq at different time points during iPSC reprogramming [33]. We found that KLF4 binding was strongly associated with de novo establishment of enhancer–promoter contacts, and often preceded chromatin looping and 3D hub organization. On the other hand, looping formation and enhancer/gene activation were closely linked and usually coincided (Figure 4). These data would suggest that KLF4 binding is important, but not always sufficient to induce long-range chromatin interactions and/or transcriptional activation. The temporal delay between TF binding and subsequent loop/hub formation suggests the existence of additional barriers that need to be removed (e.g. local chromatin remodeling, switch of chromatin state or compartment) or the absence of critical architectural and regulatory cofactors (e.g. proteins or potentially ncRNAs that are not yet expressed at early time-points during reprogramming).
Figure 4.

Kinetics of KLF4 binding, hub formation and transcription activation during somatic cell reprogramming to Pluripotent Stem Cells (PSC). KLF4 bound hubs as detected by both KLF4 and H3K27ac HiChIP are split into three categories depending on the relative timing of KLF4 binding and looping [1]. The median expression of genes within these hubs during reprogramming is plotted on the right. Overall, looping appears to coincide with gene activation, while KLF4 binding often precedes looping and expression, suggesting the requirement of additional cofactors.

Kinetics of KLF4 binding, hub formation and transcription activation during somatic cell reprogramming to Pluripotent Stem Cells (PSC). KLF4 bound hubs as detected by both KLF4 and H3K27ac HiChIP are split into three categories depending on the relative timing of KLF4 binding and looping [1]. The median expression of genes within these hubs during reprogramming is plotted on the right. Overall, looping appears to coincide with gene activation, while KLF4 binding often precedes looping and expression, suggesting the requirement of additional cofactors. The abovementioned examples highlight the need for future studies to carefully dissect the different mechanisms that control the formation or stability of 3D chromatin loops. For example, it will be interesting to test whether inducible degradation of hub-associated factors or perturbation of protein condensation, which appeared to have minimal effects on the maintenance of enhancer–promoter contacts, will have a strong impact in the de novo establishment of such loops during dynamic processes, such as cell cycle or activation/differentiation.

Conclusions and future directions

Genetic or epigenetic alterations that either perturb enhancer activity or enhancer specificity by enabling communication with aberrant target genes have been described in various disorders as the potential molecular basis of pathogenic transcriptional programs [125-131]. These findings highlight that studying the nature and mechanisms of enhancer rewiring in disease has a considerable potential for understanding and treating disorders, such as cancer. Using 3D genomics assays several groups started to assign non-coding genetic variants, associated with disease, and putative enhancers to their interacting target genes in the contexts of cancer [132], neuronal disorders [133,134], autoimmune and cardiovascular diseases [36]. However, the computational and biological interpretation of these results remain very challenging. Predicting the actual regulatory impact of a long-range interaction and the potential synergies or redundancies with other interacting elements are very active topics of research both by applying mathematical modeling and through large-scale functional screens in various cellular contexts. Generating high-quality 4D genomics datasets in the course of dynamic biological processes, such as differentiation or disease progression, and developing statistically rigorous and standardized analyses pipelines will be essential steps toward this direction. Moreover, building 3D regulatory networks and applying advanced machine learning approaches may enable a better prediction of central 3D regulatory nodes that ca be targeted to efficiently modulate cell-type-specific or disease-specific transcriptional programs. Finally, implementation of existing CRISPR-Cas9 technologies or development of novel assays that enable large-scale interrogation of putative enhancers in the context of dynamic cell fate transitions invivo or ex vivo would be extremely informative. The existence of highly interacting hubs and their association with highly active genes and enhancers that are critical for cell identity highlights the need for additional studies in order to dissect the organizational principles of such hubs and their functional roles in controlling cell fate decisions. Development or improvement of technologies that enable detection of multiway interactions and quantifying the frequency of such events at the single-cell level will be very relevant. In addition, orthogonal validations with imaging or recently developed approaches that do not depend on cross-linking or proximity ligation will help to reevaluate the hub connectivity, dynamics, and heterogeneity overcoming potential technical biases. Another challenge that needs to be addressed in the following years is to unravel the mechanisms that promote or stabilize long-range enhancer–promoter interactions and hubs. As discussed above, important insights have been gained for the contribution of loop extrusion mechanisms versus self-organization through phase separation. However, the relative synergy or competition between these forces and the contribution of individual factors remain elusive. Systematic identification of all involved protein or RNA factors using technologies such as APEX2 [135], which allow proximity labeling around target proteins or target loci, followed by mass-spectrometry, will be a critical first step to find top candidate genome organizers, that are either “universal” or hub-specific. Additional studies where top candidate factors are systematically perturbed either by inducible depletion/degradation or by blocking their binding and activities will allow to appreciate regulatory hierarchies and uncouple effects on transcriptional activity versus chromatin topology. It is also important to keep in mind that different mechanisms may mediate the establishment or the maintenance of 3D contacts. For example, it is possible that one protein factor is essential for the initial activation of a gene or the establishment of an enhancer–promoter interaction, but is not relevant for steady-state activity due to compensation by additional factors and/or enhancers. Therefore, the choice of treatment (e.g. protein knock-out versus reversible degradation), the duration (few hours versus days) and the study system (e.g. cycling asynchronous cells versus postmitotic or synchronized at specific cell cycle stages) need to be carefully designed and selected for proper interpretation of the results.
  125 in total

1.  Proximity among distant regulatory elements at the beta-globin locus requires GATA-1 and FOG-1.

Authors:  Christopher R Vakoc; Danielle L Letting; Nele Gheldof; Tomoyuki Sawado; M A Bender; Mark Groudine; Mitchell J Weiss; Job Dekker; Gerd A Blobel
Journal:  Mol Cell       Date:  2005-02-04       Impact factor: 17.970

2.  Chromatin stretch enhancer states drive cell-specific gene regulation and harbor human disease risk variants.

Authors:  Stephen C J Parker; Michael L Stitzel; D Leland Taylor; Jose Miguel Orozco; Michael R Erdos; Jennifer A Akiyama; Kelly Lammerts van Bueren; Peter S Chines; Narisu Narisu; Brian L Black; Axel Visel; Len A Pennacchio; Francis S Collins
Journal:  Proc Natl Acad Sci U S A       Date:  2013-10-14       Impact factor: 11.205

3.  Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cells.

Authors:  Stefan Schoenfelder; Tom Sexton; Lyubomira Chakalova; Nathan F Cope; Alice Horton; Simon Andrews; Sreenivasulu Kurukuti; Jennifer A Mitchell; David Umlauf; Daniela S Dimitrova; Christopher H Eskiw; Yanquan Luo; Chia-Lin Wei; Yijun Ruan; James J Bieker; Peter Fraser
Journal:  Nat Genet       Date:  2009-12-13       Impact factor: 38.330

4.  Oncogenic transcription factors as master regulators of chromatin topology: a new role for ERG in prostate cancer.

Authors:  Olivier Elemento; Mark A Rubin; David S Rickman
Journal:  Cell Cycle       Date:  2012-08-23       Impact factor: 4.534

5.  Comprehensive mapping of long-range interactions reveals folding principles of the human genome.

Authors:  Erez Lieberman-Aiden; Nynke L van Berkum; Louise Williams; Maxim Imakaev; Tobias Ragoczy; Agnes Telling; Ido Amit; Bryan R Lajoie; Peter J Sabo; Michael O Dorschner; Richard Sandstrom; Bradley Bernstein; M A Bender; Mark Groudine; Andreas Gnirke; John Stamatoyannopoulos; Leonid A Mirny; Eric S Lander; Job Dekker
Journal:  Science       Date:  2009-10-09       Impact factor: 47.728

6.  Mediator and RNA polymerase II clusters associate in transcription-dependent condensates.

Authors:  Won-Ki Cho; Jan-Hendrik Spille; Micca Hecht; Choongman Lee; Charles Li; Valentin Grube; Ibrahim I Cisse
Journal:  Science       Date:  2018-06-21       Impact factor: 47.728

7.  RAD21 cooperates with pluripotency transcription factors in the maintenance of embryonic stem cell identity.

Authors:  Anja Nitzsche; Maciej Paszkowski-Rogacz; Filomena Matarese; Eva M Janssen-Megens; Nina C Hubner; Herbert Schulz; Ingrid de Vries; Li Ding; Norbert Huebner; Matthias Mann; Hendrik G Stunnenberg; Frank Buchholz
Journal:  PLoS One       Date:  2011-05-12       Impact factor: 3.240

8.  Multiplexed analysis of chromosome conformation at vastly improved sensitivity.

Authors:  James O J Davies; Jelena M Telenius; Simon J McGowan; Nigel A Roberts; Stephen Taylor; Douglas R Higgs; Jim R Hughes
Journal:  Nat Methods       Date:  2015-11-23       Impact factor: 28.547

Review 9.  Demystifying the secret mission of enhancers: linking distal regulatory elements to target genes.

Authors:  Lijing Yao; Benjamin P Berman; Peggy J Farnham
Journal:  Crit Rev Biochem Mol Biol       Date:  2015-10-08       Impact factor: 8.250

10.  Enhancer variants associated with Alzheimer's disease affect gene expression via chromatin looping.

Authors:  Masataka Kikuchi; Norikazu Hara; Mai Hasegawa; Akinori Miyashita; Ryozo Kuwano; Takeshi Ikeuchi; Akihiro Nakaya
Journal:  BMC Med Genomics       Date:  2019-09-09       Impact factor: 3.063

View more
  12 in total

Review 1.  Seeing the forest through the trees: prioritising potentially functional interactions from Hi-C.

Authors:  Ning Liu; Wai Yee Low; Hamid Alinejad-Rokny; Stephen Pederson; Timothy Sadlon; Simon Barry; James Breen
Journal:  Epigenetics Chromatin       Date:  2021-08-28       Impact factor: 4.954

2.  Assessing Specific Networks of Chromatin Interactions with HiChIP.

Authors:  Dafne Campigli Di Giammartino; Alexander Polyzos; Effie Apostolou
Journal:  Methods Mol Biol       Date:  2022

3.  H3K27ac bookmarking promotes rapid post-mitotic activation of the pluripotent stem cell program without impacting 3D chromatin reorganization.

Authors:  Bobbie Pelham-Webb; Alexander Polyzos; Luke Wojenski; Andreas Kloetgen; Jiexi Li; Dafne Campigli Di Giammartino; Theodore Sakellaropoulos; Aristotelis Tsirigos; Leighton Core; Effie Apostolou
Journal:  Mol Cell       Date:  2021-03-16       Impact factor: 17.970

Review 4.  Igh Locus Polymorphism May Dictate Topological Chromatin Conformation and V Gene Usage in the Ig Repertoire.

Authors:  Amy L Kenter; Corey T Watson; Jan-Hendrik Spille
Journal:  Front Immunol       Date:  2021-05-18       Impact factor: 7.561

5.  Analysis of long and short enhancers in melanoma cell states.

Authors:  David Mauduit; Ibrahim Ihsan Taskiran; Liesbeth Minnoye; Maxime de Waegeneer; Valerie Christiaens; Gert Hulselmans; Jonas Demeulemeester; Jasper Wouters; Stein Aerts
Journal:  Elife       Date:  2021-12-07       Impact factor: 8.140

6.  BET inhibition disrupts transcription but retains enhancer-promoter contact.

Authors:  Nicholas T Crump; Erica Ballabio; Laura Godfrey; Ross Thorne; Emmanouela Repapi; Jon Kerry; Marta Tapia; Peng Hua; Christoffer Lagerholm; Panagis Filippakopoulos; James O J Davies; Thomas A Milne
Journal:  Nat Commun       Date:  2021-01-11       Impact factor: 14.919

Review 7.  Genetic Variants in Transcription Factor Binding Sites in Humans: Triggered by Natural Selection and Triggers of Diseases.

Authors:  Chia-Chun Tseng; Man-Chun Wong; Wei-Ting Liao; Chung-Jen Chen; Su-Chen Lee; Jeng-Hsien Yen; Shun-Jen Chang
Journal:  Int J Mol Sci       Date:  2021-04-18       Impact factor: 5.923

8.  Cis-regulatory hubs: a new 3D model of complex disease genetics with an application to schizophrenia.

Authors:  Loïc Mangnier; Charles Joly-Beauparlant; Arnaud Droit; Steve Bilodeau; Alexandre Bureau
Journal:  Life Sci Alliance       Date:  2022-01-27

Review 9.  Deciphering the Complexity of 3D Chromatin Organization Driving Lymphopoiesis and Lymphoid Malignancies.

Authors:  Laurianne Scourzic; Eralda Salataj; Effie Apostolou
Journal:  Front Immunol       Date:  2021-05-14       Impact factor: 7.561

Review 10.  Dynamic 3D Chromatin Reorganization during Establishment and Maintenance of Pluripotency.

Authors:  Bobbie Pelham-Webb; Dylan Murphy; Effie Apostolou
Journal:  Stem Cell Reports       Date:  2020-11-25       Impact factor: 7.765

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.