| Literature DB >> 30268153 |
Yao Fu1, Kandice L Tessneer1, Chuang Li2, Patrick M Gaffney3.
Abstract
Genome-wide association studies (GWAS) and fine mapping studies in autoimmune diseases have identified thousands of genetic variants, the majority of which are located in non-protein-coding enhancer regions. Enhancers function within the context of the three-dimensional (3D) genome to form long-range DNA looping events with target gene promoters that spatially and temporally regulate gene expression. Investigating the functional significance of GWAS variants in the context of the 3D genome is essential for mechanistic understanding of these variants and how they influence disease pathology by altering DNA looping between enhancers and the target gene promoters they regulate. In this review, we discuss the functional complexity of the 3D genome and the technological approaches used to characterize DNA looping events. We then highlight examples from the literature that illustrate how functional mapping of the 3D genome can assist in defining mechanisms that influence pathogenic gene expression. We conclude by highlighting future advances necessary to fully integrate 3D genome analyses into the functional workup of GWAS variants in the continuing effort to improve the health of patients with autoimmune diseases.Entities:
Keywords: 3D genome; Autoimmune disease; Chromatin conformation; Complex genetic disease; DNA looping; Enhancer; Functional genomics; GWAS; Promoter
Mesh:
Substances:
Year: 2018 PMID: 30268153 PMCID: PMC6162955 DOI: 10.1186/s13075-018-1721-x
Source DB: PubMed Journal: Arthritis Res Ther ISSN: 1478-6354 Impact factor: 5.156
Fig. 1Predicting enhancer–promoter interactions using linear proximity versus 3D proximity. a Traditional modeling of enhancer function in the context of a linear genome where an enhancer (green triangle) is predicted to modulate the function of the promoter in closest linear proximity (gene 2 (blue rectangle) or gene 3 (yellow rectangle)). b Modeling in the context of the 3D genome where an enhancer (green triangle) often regulates distant gene expression through long-range DNA looping to the gene promoter (gene 1 (green rectangle)). Due to spatial proximity, the enhancer “skips” gene 2 (blue rectangle). Enhancer function is restricted within the insulated loop structure formed by a CTCF-CTCF (arrows)–cohesion (red ring) complex, and therefore cannot activate gene 3 (yellow rectangle) or gene 4 (red rectangle) despite close linear proximity
Fig. 23D genome organization. a Each chromosome tends to occupy a particular region in the nucleus, defined as chromosome territories. Within a chromosome, there are regions with relatively high interaction frequencies, defined as topologically associating domains (TADs), and regions with relatively low interaction frequencies called TAD boundaries. Nested within each TAD are several sub-TAD domains, such as insulated neighborhoods, defined as DNA loops formed by CTCF homodimer (orange arrows), co-bound with cohesin (red ring), and containing at least one gene. b Extrusion/sliding model for TAD and sub-TAD loop formation: cohesin ring (red ring) facilitates the “sliding” of DNA through the ring structure to form a small loop. When bound CTCF (orange arrow) encounters cohesin, the DNA stops sliding on that side. The opposing side continues to slide through until a convergently oriented CTCF anchor motif is recognized and the insulator CTCF-CTCF–cohesin complex forms. Loops are less likely to form if two CTCF binding motifs are of tandem or divergent orientation
Fig. 3Proximity ligation. Chromatin are crosslinked to preserve interactions between proximal regions of DNA and associated proteins. Crosslinked chromatin are digested using restriction enzymes (scissors) to create two short DNA fragments complexed with associated proteins. “Sticky ends” of the two DNA fragments originally in close 3D proximity are then ligated using DNA ligase to create a chimeric strand of DNA. After de-crosslinking, the chimeric DNA can be used in downstream applications to identify and characterize loop formation
Advantages and disadvantages of current 3D genome technologies
| Technique | Assay name/description | Target size | Assay platform | Cell input | Advantages | Disadvantages | References | |
|---|---|---|---|---|---|---|---|---|
| Targeted | ||||||||
| 3C | Chromosome conformation capture | One target | Quantitative PCR | > 100 M | • Quantitative measurement of long-range interactions between two targeted loci | • Low throughput | [ | |
| 4C | Circular chromosome conformation capture or chromosome conformation capture-on-chip | Multiple targets | Microarray | > 100 M | • Identification of multiple DNA regions that interact with a target locus | • Relatively low throughput | [ | |
| 5C | Chromosome conformation carbon copy | Multiple targets | Microarray or sequencing | > 100 M | • Multiplexed conformation capture | • Not all sites are compatible to 5C primer design | [ | |
| Capture-C | 3C with specific oligonucleotides capture | Multiple targets | Sequencing | 10-20 M | • Unbiased capture of all regions interacting with a specific target sequence | • Interaction detection depends on the design of the target “bait” | [ | |
| Genome-wide | ||||||||
| Non-protein-mediated | Hi-C | Chromosome conformation capture by high-throughput sequencing | All interactions | Sequencing | 20-25 M | • High throughput | • High background due to random ligations | [ |
| Protein-mediated | ChIA-PET | Chromatin interaction analysis by paired-end tag sequencing | All interactions | Sequencing | > 100 M | • Identify specific protein-mediated DNA loop structures | • Long processing time (> 6 days) | [ |
| HiChIP/ | In situ Hi-C with protein-centric ChIP/proximity ligation-assisted ChIP-seq | All interactions | Sequencing | 1-10 M | • Faster protocol (2 days) | • Requires high efficiency ChIP-grade antibodies | [ | |
Fig. 4Altering the 3D genome architecture disrupts gene expression regulation. a, b An enhancer (green triangle) can modulate gene expression by interacting with and delivering transcription factors to its target gene promoter (yellow rectangle) through long-range enhancer–promoter interactions. A causal mutation (red bar) in the enhancer can alter gene expression by modulating the frequency of this interaction. Impairing the frequency of the long-range interaction reduces delivery of transcription factors to the promoter, thus hindering gene expression (a). Enhancing interactions between the enhancer and promoter facilitates gene expression (b). c, d Insulated neighborhoods can regulate gene expression by restricting interactions between active enhancers (green triangle) and target gene promoters (blue rectangle) within an insulated loop boundary. Causal mutations (red bar) that disrupt CTCF anchor motifs can modify (c) or disrupt (d) existing loops, allowing the once-restricted enhancer (green triangle) to now interact with gene promoters (yellow rectangle) outside of the original insulated neighborhood