| Literature DB >> 29577419 |
David Segorbe1, Derek Wilkinson1, Alexandru Mizeranschi1, Timothy Hughes2, Ragnhild Aaløkken2, Libuše Váchová3, Zdena Palková1, Gregor D Gilfillan2.
Abstract
We report an optimized low-input FAIRE-seq (Formaldehyde-Assisted Isolation of Regulatory Elements-sequencing) procedure to assay chromatin accessibility from limited amounts of yeast cells. We demonstrate that the method performs well on as little as 4 mg of cells scraped directly from a few colonies. Sensitivity, specificity and reproducibility of the scaled-down method are comparable with those of regular, higher input amounts, and allow the use of 100-fold fewer cells than existing procedures. The method enables epigenetic analysis of chromatin structure without the need for cell multiplication of exponentially growing cells in liquid culture, thus opening the possibility of studying colony cell subpopulations, or those that can be isolated directly from environmental samples.Entities:
Keywords: HTS; NGS; Saccharomyces cerevisiae; chromatin-accessibility; epigenetics
Mesh:
Substances:
Year: 2018 PMID: 29577419 PMCID: PMC6099244 DOI: 10.1002/yea.3316
Source DB: PubMed Journal: Yeast ISSN: 0749-503X Impact factor: 3.239
Figure 1Genomic mapping of sequence reads. The proportions of unmapped reads (red), those mapping to single genomic positions (green), and those mapping to multiple locations (repeats, in blue) are shown. Reads mapped to single genomic positions are divided into those present as a unique copy and those present in two or more identical copies (duplicates). Results shown are the mean of three replicates for each method, using 10 million raw reads per replicate. Error bars show the standard deviation from the mean [Colour figure can be viewed at http://wileyonlinelibrary.com]
Figure 2Genomic locations of FAIRE peaks. y‐Axis shows percentage peaks for each given starting amount of yeast cells. Promoter regions were defined as overlapping or within 1 kb upstream of the transcription start site. Downstream regions are those within 1 kb downstream of the gene but not overlapping the 3′ end of the gene. Intergenic regions were those that were > 1 kb removed from transcription start sites or gene 3′ ends. No significant differences were detected between the different starting amounts (two‐tailed Student's t‐test)
Peak calling, sensitivity and specificity. Peaks were called using MACS2, using 3 million uniquely mapping non‐duplicate reads per sample
| Input amount (mg) | Mean number of peaks called | Mean number of peaks overlapping 500 mg dataset peaks | Mean number of peaks not found in 500 mg dataset | Mean sensitivity (percentage reference peaks detected) | Mean specificity (percentage method peaks found in reference dataset) |
|---|---|---|---|---|---|
| 500 |
|
|
| 100 | 90 |
| 100 |
|
|
| 82 | 75 |
| 20 |
|
|
| 94 | 68 |
| 4 |
|
|
| 86 | 81 |
Peak regions present in all three 500 mg datasets (n = 4574) were used as the reference dataset to which all other samples were compared to measure sensitivity and specificity. Data presented are mean ± standard deviation of three replicates.
Figure 3FAIRE‐seq and input data in a 5 kb genomic region visualized in the Integrative Genomics Viewer. (a) A single example of FAIRE‐seq and corresponding input sample is shown for each input amount. Peaks called by MACS2 are shown as black bars. Genes are shown at the bottom of the figure. (b) Data for all replicates shown as heat maps of the same genomic region. y‐Axis scale in all cases is read depth 0–100
Figure 4Reproducibility of peak calls. (a) Venn diagrams showing the overlapping peak calls in the triplicate samples within each input amount. (b) Venn diagram of inter‐group peak overlaps, using only peaks common to all three replicates at each input amount. (c) Reproducibility measured by the irreproducible discovery rate (IDR) at different numbers of selected peaks, plotted at various IDR cutoffs. High reproducibility produces a curve with a late transition to high IDR values