| Literature DB >> 35382117 |
Daniela Strenkert1, Matthew Mingay2, Stefan Schmollinger1, Cindy Chen2, Ronan C O'Malley2, Sabeeha S Merchant3,4,5.
Abstract
The eukaryotic green alga Chromochloris zofingiensis is a reference organism for studying carbon partitioning and a promising candidate for the production of biofuel precursors. Recent transcriptome profiling transformed our understanding of its biology and generally algal biology, but epigenetic regulation remains understudied and represents a fundamental gap in our understanding of algal gene expression. Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) is a powerful tool for the discovery of such mechanisms, by identifying genome-wide histone modification patterns and transcription factor-binding sites alike. Here, we established a ChIP-Seq framework for Chr. zofingiensis yielding over 20 million high-quality reads per sample. The most critical steps in a ChIP experiment were optimized, including DNA shearing to obtain an average DNA fragment size of 250 bp and assessment of the recommended formaldehyde concentration for optimal DNA-protein cross-linking. We used this ChIP-Seq framework to generate a genome-wide map of the H3K4me3 distribution pattern and to integrate these data with matching RNA-Seq data. In line with observations from other organisms, H3K4me3 marks predominantly transcription start sites of genes. Our H3K4me3 ChIP-Seq data will pave the way for improved genome structural annotation in the emerging reference alga Chr. zofingiensis.Entities:
Keywords: ChIP‐Sequencing; Chromochloris; epigenetics; formaldehyde crosslinking; green algae; histone lysine methylation
Year: 2022 PMID: 35382117 PMCID: PMC8961045 DOI: 10.1002/pld3.392
Source DB: PubMed Journal: Plant Direct ISSN: 2475-4455
FIGURE 1Antibody test, optimization of sonication conditions to yield ~250‐bp chromatin fragments and concentrations for efficient cross‐linking. (a) 2 × 107 cells of Chromochloris (Chr.) zofingiensis were sonicated 2, 6, and 20 times for 10 s. DNA was extracted with phenol/chloroform/isoamyl alcohol (25/24/1 v/v), separated on a 1.5% agarose gel and stained with SYBR gold. (b) Whole‐cell proteins corresponding to 2 × 105 cells from Chr. zofingiensis were separated on an SDS‐polyacrylamide (15% monomer) gel and analyzed by immunodetection using antibodies against histone H3 or histone H3K4me3 as indicated. (c) 2 × 107 cells were incubated for 10 min with formaldehyde (HCHO) at the concentrations indicated. The reaction was quenched by the addition of 125‐mM glycine. Antibodies against histone H3 were used for ChIP (with no antibody as a background control). Chromatin‐immunoprecipitated DNA was extracted and amplified using primers targeting the RBCS promoter region. Shown are averages and StDEV of three amplification reactions
FIGURE 2H3K4me3 peak calling and genomic distribution in Chromochloris. (a) Total number of narrow H3K4me3 peaks in each replicate and the overlap between both replications. (b) Relative fold enrichment of H3K4me3 at different genomic regions as indicated. Each individual sample represents one independent replicate. The fold enrichment represents the number of peaks that overlap with a region divided by the number of peaks that would overlap with the respective region if they were randomly distributed across the genome
H3K4me3 ChIP‐Seq mapping statistics
| Total reads | Mapped reads | Mapped reads (%) | Unique deduplicate reads | |
|---|---|---|---|---|
| H3K4me3 ChIP Rep1 | 137,855,434 | 136,896,574 | 99.3 | 76,945,677 |
| H3K4me3 ChIP Rep2 | 125,105,126 | 123,333,772 | 98.6 | 21,906,729 |
| Input DNA | 188,843,106 | 186,436,803 | 98.8 | 163,415,936 |
FIGURE 3Genome‐wide distribution pattern of H3K4me3 in Chromochloris. Shown is the average normalized H3K4me3 enrichment for both replicates (blue and red) and input genomic DNA (black) at the TSSs (gray line) and ±2500 bps from TSSs. Values were normalized by scaling based on the number of uniquely mapped, nonduplicate reads
FIGURE 4H3K4me3 enrichment ranked by gene expression. Shown are H3K4me3 values visualized as heat map. H3K4me3 values are sorted based on the respective expression values (as transcripts per million [TPMs]) for both replicates at the TSSs (dashed line) and ±2000 bp from TSSs
FIGURE 5Integrated analyses of H3K4me3 enrichment and gene expression. Transcript abundance values (in transcripts per million [TPMs]) from genes that harbor a H3K4me3 peak at their TSSs (blue) versus all others (gray) are visualized using boxplots