| Literature DB >> 25473421 |
Maria Tsompana1, Michael J Buck2.
Abstract
Transcriptional activation throughout the eukaryotic lineage has been tightly linked with disruption of nucleosome organization at promoters, enhancers, silencers, insulators and locus control regions due to transcription factor binding. Regulatory DNA thus coincides with open or accessible genomic sites of remodeled chromatin. Current chromatin accessibility assays are used to separate the genome by enzymatic or chemical means and isolate either the accessible or protected locations. The isolated DNA is then quantified using a next-generation sequencing platform. Wide application of these assays has recently focused on the identification of the instrumental epigenetic changes responsible for differential gene expression, cell proliferation, functional diversification and disease development. Here we discuss the limitations and advantages of current genome-wide chromatin accessibility assays with especial attention on experimental precautions and sequence data analysis. We conclude with our perspective on future improvements necessary for moving the field of chromatin profiling forward.Entities:
Keywords: ATAC; Chromatin; DNase; Epigenome; FAIRE; Histone; Library; MNase; Nucleosome; Sequencing
Year: 2014 PMID: 25473421 PMCID: PMC4253006 DOI: 10.1186/1756-8935-7-33
Source DB: PubMed Journal: Epigenetics Chromatin ISSN: 1756-8935 Impact factor: 4.954
Current genome-wide high-throughput chromatin accessibility assays
| Cell type/Number | Sequencing type | Traditional approach | Genomic target | Experimental considerations | Key references | |
|---|---|---|---|---|---|---|
|
| Any cell type 1 to 10 million cells | Paired-end or Single-end | MNase digests unprotected DNA | Maps the total nucleosome population in a qualitative and quantitative manner | 1. Requires many cells. | [ |
| 2. Laborious enzyme titrations. | ||||||
| 3. Probes total nucleosomal population, not active regulatory regions only. | ||||||
| 4. Degrades active regulatory regions, making their detection possible only | ||||||
| 5. Requires 150 to 200 million reads for standard accessibility studies of the human genome. | ||||||
|
| Any cell type 1 to 10 million cells | Paired-end or Single-end | DNase I cuts within unprotected DNA | Maps open chromatin | 1. Requires many cells. | [ |
| 2. Time-consuming and complicated sample preparations. | ||||||
| 3. Laborious enzyme titrations. | ||||||
| 4. Requires 20 to 50 million reads for standard accessibility studies of the human genome. | ||||||
|
| Any cell type 100,000 to 10 million cells | Paired-end or Single-end | Based on the phenol-chloroform separation of nucleosome-bound and free sonicated areas of a genome, in the interphase and aqueous phase respectively | Maps open chromatin | 1. Low signal-to-noise ratio, making computational data interpretation very difficult. | [ |
| 2. Results depend highly on fixation efficiency. | ||||||
| 3. Requires 20 to 50 million reads for standard accessibility studies of the human genome. | ||||||
|
| 500 to 50,000 freshly isolated cells | Paired-end | Unfixed nuclei are tagged | Maps open chromatin, TF and nucleosome occupancy | 1. Contamination of generated data with mitochondrial DNA. | [ |
| 2. Immature data analysis tools. | ||||||
| 3. Requires 60 to 100 million reads for standard accessibility studies of the human genome. |
ATAC: assay for transposase-accessible chromatin; DNase I: deoxyribonuclease I; FAIRE: formaldehyde-assisted isolation of regulatory elements; MNase: micrococcal nuclease.
Figure 1Schematic diagram of current chromatin accessibility assays performed with typical experimental conditions. Representative DNA fragments generated by each assay are shown, with end locations within chromatin defined by colored arrows. Bar diagrams represent data signal obtained from each assay across the entire region. The footprint created by a transcription factor (TF) is shown for ATAC-seq and DNase-seq experiments.
Figure 2Chromatin accessibility high-throughput data analysis workflow. Chromatin accessibility data analysis involves a number of stages with progressively increased level of difficulty and advanced requirements for computational and genomics expertise. All major steps of analyses, from sequence tags to data annotation/integration are shown in a comprehensive workflow format (read text for additional details).
Chromatin accessibility high-throughput sequence data analysis
| Detection of enriched regions | Estimation of nucleosome organization and TF occupancy metrics | |
|---|---|---|
|
| 1. GeneTrack [ | 1. Nucleosome positioning algorithms [ |
| 2. Template filtering algorithm [ | 2. Nucleosome occupancy algorithms [ | |
| 3. DANPOS [ | 3. V-plots for TF occupancy [ | |
| 4. iNPS [ | ||
|
| 1. F-Seq [ | 1. Digital genomic footprinting algorithms [ |
| 2. Hotspot, DNase2Hotspots [ | 2. Nucleosome and TF occupancy algorithms [ | |
| 3. ZINBA [ | 3. CENTIPEDE [ | |
| 4. MACS [ | ||
|
| 1. MACS2; | Not available |
| 2. ZINBA [ | ||
|
| 1. ZINBA [ | 1. Digital genomic footprinting algorithms [ |
| 2. MACS2; | 2. CENTIPEDE [ | |
| 3. Hotspot, DNase2Hotspots [ |