Literature DB >> 35382117

An optimized ChIP-Seq framework for profiling histone modifications in Chromochloris zofingiensis.

Daniela Strenkert¹, Matthew Mingay², Stefan Schmollinger¹, Cindy Chen², Ronan C O'Malley², Sabeeha S Merchant^3,4,5.

Abstract

The eukaryotic green alga Chromochloris zofingiensis is a reference organism for studying carbon partitioning and a promising candidate for the production of biofuel precursors. Recent transcriptome profiling transformed our understanding of its biology and generally algal biology, but epigenetic regulation remains understudied and represents a fundamental gap in our understanding of algal gene expression. Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) is a powerful tool for the discovery of such mechanisms, by identifying genome-wide histone modification patterns and transcription factor-binding sites alike. Here, we established a ChIP-Seq framework for Chr. zofingiensis yielding over 20 million high-quality reads per sample. The most critical steps in a ChIP experiment were optimized, including DNA shearing to obtain an average DNA fragment size of 250 bp and assessment of the recommended formaldehyde concentration for optimal DNA-protein cross-linking. We used this ChIP-Seq framework to generate a genome-wide map of the H3K4me3 distribution pattern and to integrate these data with matching RNA-Seq data. In line with observations from other organisms, H3K4me3 marks predominantly transcription start sites of genes. Our H3K4me3 ChIP-Seq data will pave the way for improved genome structural annotation in the emerging reference alga Chr. zofingiensis.

Entities: Chemical

Keywords: ChIP‐Sequencing; Chromochloris; epigenetics; formaldehyde crosslinking; green algae; histone lysine methylation

Year: 2022 PMID： 35382117 PMCID： PMC8961045 DOI： 10.1002/pld3.392

Source DB: PubMed Journal: Plant Direct ISSN： 2475-4455

INTRODUCTION

Chromatin is the higher order structure in which eukaryotic DNA is organized, facilitating condensed packaging of genomic DNA into the nucleus. Within chromatin, nucleosomes are considered to be the major structural subunit, in which approximately 146 bp of DNA are wrapped around dimers of core histones H2A, H2B, H3, and H4 (Luger, 2003). In addition to its packaging function, histones are hotspots for a variety of different posttranslational modifications (PTMs) that directly affect gene expression and thus contribute to the role of chromatin towards gene regulation (Li et al., 2007; Workman & Kingston, 1998). Modified N‐terminal tails of histones emerge from the nucleosome core and are thus accessible to chromatin effectors, enzymes that convert them into regulatory function. Each organism's specific histone code consists of various different histone modifications (acetylation, monomethylation/dimethylation/trimethylation, phosphorylation, sumoylation, or ubiquitination) that can be combined to form distinct chromatin states, although the number of possible combinations is typically limited. In Arabidopsis, for instance, the combinatorial complexity is reduced to just nine distinct chromatin states (Sequeira‐Mendes et al., 2014). Accordingly, the histone code provides an additional regulatory layer extending the information content of the genetic code (Luger, 2003). Previous studies have demonstrated that histone H3 lysine 4 trimethylation plays an important role during active transcription; for reviews, see Martin and Zhang (2005); Ruthenburg et al. (2007). In Saccharomyces cerevisiae, trimethylation of lysine 4 at histone H3 (H3K4me3) is a signature motif marking transcription start sites (TSSs) of genes that are either actively transcribed or that are poised for transcription (Bernstein et al., 2002; Ng et al., 2003; Santos‐Rosa et al., 2002; Schneider et al., 2004). In addition, H3K4me3 was implied in mediating epigenetic memory (Ng et al., 2003). Notably, enrichment of H3K4me3 at TSSs seems to be a highly conserved feature in many taxa. In human cell lines, mouse, and fly, TSSs of actively transcribed genes are also marked with H3K4me3 (Barski et al., 2007; Bernstein et al., 2005; Guenther et al., 2007; Schübeler et al., 2004). The same is true in land plants: In Arabidopsis, rice, and maize, trimethylation at lysine 4 of histone H3 occurred predominantly at gene promoters (He et al., 2010; Oh et al., 2008; Wang et al., 2009; Zhang et al., 2009; Zong et al., 2013). Chromatin immunoprecipitation (ChIP) followed by deep sequencing (ChIP‐Seq) is the method of choice for studying histone modification dynamics. ChIP is based on the recovery of DNA by immunoprecipitation using a specific antibody against the DNA‐bound protein of interest, such as a modified histone. ChIP'ed DNA may be quantified using (q)PCR, microarrays or deep sequencing. Detailed and optimized ChIP protocols have been published for Tetrahymena (Dedon et al., 1991), Drosophila (Orlando et al., 1997), yeast (Hecht & Grunstein, 1999), mammalian cell lines (Das et al., 2004), Chlamydomonas (Strenkert et al., 2011), and land plants like Arabidopsis (Bowler et al., 2004), maize (Haring et al., 2007), and tomato (Ricardi et al., 2010). Here, we describe an optimized ChIP protocol for the green alga Chromochloris. As illustrated by ChIP‐Seq with an antibody targeting H3K4me3, we show that our optimized protocol is suited for ChIP‐Seq analyses of histone modifications with proper resolution and high coverage yielding excellent signal to noise during subsequent data analyses.

MATERIALS AND METHODS

Cultures and growth conditions

Chromochloris zofingiensis SAG 211‐14 culture was grown in TAP medium with KNO3 replacing NH4Cl as nitrogen source and with modified trace elements from (Kropat et al., 2011) instead of Hutner's trace elements. Cells were grown at 90 μmol photons m−2 s−1, 140 rpm to a cell density of 2 × 106 cells per milliliter for all experiments.

Cross‐reactivity and specificity of antibodies used for ChIP

All antibodies used in this study were evaluated on total cell lysate of Chromochloris SAG 211‐14 cultures; 2 × 107 cells were collected by centrifugation at 4°C, 1650 and 2 min. The cell pellet was resuspended in 500‐μl MilliQ water and 2× protein lysis buffer was added (125‐mM Tris‐Cl, pH 6.8, 20% glycerol, 4% SDS, 10% β‐mercaptoethanol and 0.005% bromophenol blue). Samples were sonicated for 5 s to break the cell wall (Sonic Dismembrator System with a 1/2 in. probe, Model 505; 117 V, 50/60 Hz [Fisher Scientific Company, No. FB505110], settings: 1 s ON/1 s OFF, amplitude 50%). Twenty‐microliter cell lysate corresponding to 4 × 105 cells was loaded on each lane. Proteins were separated by 15% SDS‐PAGE and transferred to nitrocellulose membranes (Amersham Protran 0.1 NC). The membrane was blocked for 30 min with 3% dried nonfat milk in phosphate‐buffered saline (PBS) solution containing 0.1% (w/v) Tween 20 and then incubated in primary antiserum. The PBS solution was used as the diluent for both primary and secondary antibodies. The membranes were washed in PBS containing 0.1% (w/v) Tween 20. Antibodies directed against histone H3 (1:1000) or trimethylated lysine 4 of histone H3 (1:1000) were used. The secondary antibody, used at 1:4000, was goat antirabbit conjugated to alkaline phosphatase and processed according to the manufacturer's instructions.

Establishment of DNA shearing

A culture volume corresponding to 2 × 108 was collected by centrifugation at 4°C, 1650 for 2 min. Cell pellet was resuspended in 1‐ml ChIP lysis buffer (1% SDS, 10‐mM EDTA, 50‐mM Tris‐Cl, pH 8.0, and 0.25× protease inhibitor cocktail [Roche]) and transferred to Beckman Coulter 4‐ml Polycarbonate Thick Wall Centrifuge Tubes (13 × 64 mm). For shearing of genomic DNA, we used a Sonic Dismembrator System with a 1/2 in. probe, Model 505; 117 V, 50/60 Hz (Fisher Scientific Company, No. FB505110), settings: 1 s ON/1 s OFF, amplitude 50%). To test conditions needed for efficient fragmentation of Chromochloris genomic DNA, we used 2, 6, and 10 s of sonication. After sonication, 100 μl of each cell lysate (corresponding to 2 × 107 cells) was transferred to a new 1.5‐ml Eppendorf tube. Three hundred microliters of ChIP lysis buffer was added. DNA was extracted using phenol/chloroform extraction as follows: 500 μl of phenol/chloroform/isoamyl alcohol (25:24:1) was added to the cell lysates, and samples were mixed and centrifuged for 10 min at 4°C and 16,200 . The aqueous phase was transferred to 500‐μl chloroform/isoamyl alcohol (24:1). Samples were mixed and centrifuged for 10 min at 4°C and 16,200 , and this step was repeated once. After centrifugation, the aqueous phase containing the DNA was added to a new Eppendorf tube containing 50 μl of 3‐M Na acetate (pH 5.3), and 1 ml of 100% ethanol was added. Samples were mixed and kept at −20°C for at least 2 h. DNA was pelleted by a 15‐min centrifugation at 4°C at 16,200 . DNA pellets were washed once with 70% ethanol, and the DNA pellet was air dried for at least 15 min at room temperature. DNA was resuspended in 50‐μl TE supplemented with 10‐μg/μl DNase‐free RNAse A. Ten microliters of each sample was supplemented with 6× DNA loading dye and loaded on a 1.2% agarose gel supplemented with SYBR gold.

Optimizing formaldehyde cross‐linking

A culture volume corresponding to 2 × 108 was collected by centrifugation at 4°C, 1650 for 2 min. Cell pellets were washed with 50‐ml KH buffer (20‐mM K‐HEPES, pH 7.4, 40‐mM KCl). This is necessary because residual Tris from the growth medium would quench formaldehyde and lead to irreproducible results. After washing, cells were collected by centrifugation for 2 min at 4°C and 1650 . Cell pellets were resuspended in 10‐ml cross‐linking buffer (20‐mM K‐HEPES, 40‐mM KCl supplemented with no or 0.1%, 0.3%, 0.7%, and 1% formaldehyde). This solution was always made fresh because formaldehyde is unstable. Optimal formaldehyde concentration for cross‐linking DNA to chromatin in Chromochloris was determined based on efficient ChIP using an antibody cross‐reactive against an unmodified N‐terminus of histone H3. We used the promoter region of RBCS as target genomic region during optimization. qPCR was performed on a Bio‐Rad CFX96 Touch Real‐Time PCR Detection System using iTAQ Mastermix according to the manufacturer's instructions. Primers used were as follows: RBCSpromfor: CAATGCAAGCAGTTCGCATG and RBCSpromrev: ACGGAGGACTTGGCAATGAC.

Optimized ChIP protocol

A culture volume corresponding to 2 × 108 cells was collected by centrifugation at 4°C, 1650 for 2 min. Cell pellets were washed with 50‐ml KH buffer. After washing, cells were collected by centrifugation for 2 min at 4°C and 1650 . Cell pellets were resuspended in 10‐ml cross‐linking buffer (20‐mM HEPES‐KOH, 40‐mM KCL supplemented with 0.7% formaldehyde). The cell pellet was resuspended in 1‐ml ChIP lysis buffer (1% SDS, 10‐mM EDTA, 50‐mM Tris‐Cl, pH 8.0, and 0.25× protease inhibitor cocktail [Roche]) and transferred to Beckman Coulter 4‐ml Polycarbonate Thick Wall Centrifuge Tubes (13 × 64 mm). For shearing of genomic DNA, we used a Sonic Dismembrator System with a 1/2 in. probe, Model 505; 117 V, 50/60 Hz (Fisher Scientific Company, No. FB505110) at settings: 1 s ON/1 s OFF, amplitude 50%). After sonication for 10 s, cell lysate was centrifuged for 10 min at 4°C and 16,200 (this is to clear the lysate from starch and cell debris). We generated aliquots of 100‐μl input chromatin in 1.5‐ml tubes and flash froze them in liquid nitrogen (each aliquot corresponds to chromatin from ~2 × 107 cells). The input chromatin can be stored for several months at −80°C, but avoid multiple freeze/thaw cycles. We used 50 mg of Protein‐A‐Sepharose beads (Sigma, P3391‐1G), resuspended beads in 1‐ml ChIP buffer (1.1% Triton X‐100, 1.2‐mM EDTA, 167‐mM NaCl, and 16.7‐mM Tris‐HCl, pH 8) and incubated them for 30 min at 4°C. Swollen beads were washed two times with 500‐μl ChIP buffer. After the last wash, supernatant was discarded, and we added 500‐μl ChIP buffer (gives ~750‐μl suspension with swollen beads). We did not add sonicated lambda DNA, because this would interfere with DNA library generation. For ChIP, 5 μl of anti‐H3 or 10 μl of anti‐H3K4me3 antibodies was premixed with 10‐μl BSA solution (10 mg/ml) and incubated for at least 30 min on ice. For each antibody employed, two 100‐μl aliquots of chromatin solution were thawed on ice (one for a control without antibody and one for the antibody to be tested). Nine hundred microliters of ChIP buffer was added to each aliquot (from this step on, we consequently used filter tips to avoid DNA contaminations). Samples were centrifuged for 20 s at 16,100 and 4°C, and supernatant was transferred to microcentrifuge tubes containing the prepared antibody solutions. Samples were mixed on a rotation wheel for 1 h at 4°C. After 1 h, samples were centrifuged for 20 s at 16,200 and 4°C, and supernatants were transferred to new microcentrifuge tubes containing 60‐μl Sepharose beads. Samples were mixed on a rotation wheel for 2 h at 4°C. After incubation, samples were centrifuged for 20 s at 16,200 and 4°C and supernatants discarded. Sepharose beads were washed once with washing buffer 1 (0.1% SDS, 1% Triton X‐100, and 2‐mM EDTA, pH 8) containing 150‐mM NaCl, once with washing buffer 1 containing 500‐mM NaCl, once with washing buffer 2 (250‐mM LiCl, 1% Nonidet P‐40, 1% Na deoxycholate, 1‐mM EDTA, and 10‐mM Tris‐HCl, pH 8), and twice with TE (1‐mM EDTA and 10‐mM Tris‐HCl, pH 8). Cross‐links were reverted by an overnight incubation at 65°C after addition of NaCl to a final concentration of 0.5 M, 1% SDS, and 0.1‐M NaHCO3 (made fresh) for 15 min at 65°C. This step was repeated once, and eluates were pooled. At this point, another aliquot of input chromatin was also thawed, and 400 μl of ChIP lysis buffer was added and incubated overnight at 65°C (input DNA control). In order to remove proteins in the precipitates, we did add 10 μl of 0.5‐M EDTA (pH 8.0), 20 μl of 1‐M Tris‐HCl (pH 8.0), and 2.1 μl of proteinase K (10 mg/ml) and incubated the samples for 1 h at 55°C. DNA extraction was performed by one extraction with 500‐μl phenol/chloroform/isoamyl alcohol and one extraction using 500‐μl chloroform/isoamyl alcohol. DNA was precipitated by adding 50 μl of 3‐M Na acetate (pH 5.2), 2.5 μl of glycogen (2.5 μg/μl) (this will give an otherwise invisible pellet), and 1 ml of 100% EtOH and incubating for at least 2 h at −20°C. Samples were centrifuged for 20 min at 16,200 and 4°C. DNA pellet was dried and resuspended in 50 μl of 10‐mM Tris‐HCl buffer (pH 8.0).

ChIP library preparation and sequencing

Five nanogram of ChIP'ed DNA was treated with end‐repair, A‐tailing, and ligation of Illumina compatible adapters (IDT) using the KAPA‐HyperPrep kit (KAPA Biosystems). The ligated products were enriched with 8 cycles of PCR (KAPA Biosystems) and size selected to 200–500 bp with Total Pure NGS beads (Omega Bio‐Tek). The prepared libraries were quantified using KAPA Biosystems' next‐generation sequencing library qPCR kit and run on a Roche LightCycler 480 real‐time PCR instrument. Sequencing of the flow cell was performed on the Illumina NextSeq500 sequencer using NextSeq500 NextSeq HO kits, v2, following a 2 × 151 indexed run recipe.

ChIP read preprocessing and filtering

2 × 151 sequence data were generated at the DOE Joint Genome Institute (JGI) using the Illumina NextSeq platform. BBDuk version 38.87 (https://bbtools.jgi.doe.go) was used to remove contaminants, trim reads that contained adapter sequence and homopolymers of G's of size 5 or more at the ends of the reads and right quality trim reads where quality drops below 6. BBDuk was used to remove reads that contained 1 or more “N” bases and had an average quality score across the read less than 10 or had minimum length ≤49 bp. Reads mapped with BBMap (v. 38.87) to masked human, cat, dog, mouse, and common microbial contaminant references at 93% identity were removed.

ChIP read alignment and peak calling

Filtered reads were aligned to the Chromochloris zofingiensis reference genome (Roth et al., 2017) using BWA mem (version 0.7.17). Only uniquely mapped reads were retained. Duplicated reads were removed by Picard (v. 2.22.9) (http://broadinstitute.github.io/picard/; Broad Institute) MarkDuplicates tool. Finally, the remaining reads were used for peak calling by MACS2 (v. 2.1.1) (Zhang et al., 2008) with parameters “‐‐call‐summits ‐‐nomodel ‐‐extsize 147 ‐c.” Input control libraries were generated and used for peak calling and downstream analysis. To visualize and plot data, bigwig files were created using bedGraphToBigWig (v. 4) (Kent et al., 2010) and Deeptools (v. 3.1.3) (Ramirez et al., 2016) was used to generate summary signal plots and heatmaps.

RNA extraction, sequencing, and transcriptome analyses

A culture volume corresponding to 5 × 107 cells was collected by centrifugation at 4°C, 3500 rpm for 2 min. Cell pellet was resuspended in 0.2‐ml RLC buffer (Qiagen), flash frozen in liquid nitrogen, and ground to a fine powder using mortar and pestle. Sample was added to 700‐μl TRIzol and mixed overhead before the addition of 200‐μl chloroform/isoamyl alcohol. Samples were vigorously shaken, then centrifuged for 10 min at 4°C and 13,200 rpm. Supernatant was added to 700‐μl isopropanol. RNA was precipitated at −20°C overnight and washed with 70% ethanol, air dried, and resuspended in 40‐μl RNAse‐free water. DNase I digest and cleanup was performed using Zymo RNA Clean & Concentrator Kit (RCC) according to the manufacturer's instructions. RNA was converted into cDNA and made into sequence ready libraries with the KAPA RNA‐Seq Kit (KAPA Biosystems). RNA‐Seq libraries were sequenced with 150‐bp single‐end reads on a HiSeq 2500. Data were aligned to the ChrZofV5 release of the C. zofingiensis genome with RNA STAR. Determination of counts per gene and transcript abundance in transcripts per million (TPMs) were made with DESeq2.

Data availability

Data are available from the US NCBI Short Read Archive (SRA) (https://www.ncbi.nlm.nih.gov/sra) using the following accession numbers: SRP354587,SRP354588, SRP354586 (ChIP‐Seq data) and SRP355099, SRP355100 (RNA‐Seq data).

RESULTS

An optimized ChIP‐Seq frame work for Chromochloris—An overview of the most critical parameters

ChIP involves cross‐linking of the chromatin‐bound proteins by formaldehyde, followed by sonication to obtain small DNA fragments. Immunoprecipitation of cross‐linked, fragmented material is then carried out using specific antibodies against the DNA‐binding protein of interest. As pointed out by us and others, some parameters are crucial for a successful ChIP experiment and need to be individually adjusted for each respective organism (Das et al., 2004). Accordingly, the optimization of the most critical steps for a ChIP protocol in Chromochloris is outlined in more detail below. In order to be able to identify the location of the histone‐bound DNA sequence of interest with proper resolution, it is critical to break the DNA to an average fragment size of around 250 bp (approximate average nucleosome spacing). The hydrodynamic shearing method using sonication is commonly used for this purpose (Orlando et al., 1997; Spencer et al., 2003). A total of three sonication times were tested on Chromochloris samples: 2, 6, and 10 × 10 s. While intact genomic DNA was still visible after a sonication time of 2 × 10 s, sonication for 10 × 10 s yielded the desired average DNA fragment size of ~250 (Figure 1a).

FIGURE 1

Antibody test, optimization of sonication conditions to yield ~250‐bp chromatin fragments and concentrations for efficient cross‐linking. (a) 2 × 107 cells of Chromochloris (Chr.) zofingiensis were sonicated 2, 6, and 20 times for 10 s. DNA was extracted with phenol/chloroform/isoamyl alcohol (25/24/1 v/v), separated on a 1.5% agarose gel and stained with SYBR gold. (b) Whole‐cell proteins corresponding to 2 × 105 cells from Chr. zofingiensis were separated on an SDS‐polyacrylamide (15% monomer) gel and analyzed by immunodetection using antibodies against histone H3 or histone H3K4me3 as indicated. (c) 2 × 107 cells were incubated for 10 min with formaldehyde (HCHO) at the concentrations indicated. The reaction was quenched by the addition of 125‐mM glycine. Antibodies against histone H3 were used for ChIP (with no antibody as a background control). Chromatin‐immunoprecipitated DNA was extracted and amplified using primers targeting the RBCS promoter region. Shown are averages and StDEV of three amplification reactions A second important factor for a successful ChIP experiment is the choice of the antibody against the protein of interest. Commercially available antibodies are usually raised against human (modified) histones and are most likely cross‐reactive with other species, as histones are highly conserved proteins. Unsurprisingly, Chromochloris histone H3 shares high sequence similarity with human histone H3, making it likely that commercial antibodies will be functional in our applications (Figure S1). We aimed to use two different antibodies in this study, one cross‐reactive against unmodified histone H3 (for optimization purposes) and the other one against trimethylated H3K4. Cross‐reactivity and specificity of both antibodies was tested by immunoblotting using Chromochloris cell lysates for both of which we saw a single, clear band at the expected molecular weight (Figure 1b). Cross‐linking of the starting material prior to ChIP is used to ensure that the chromatin structure is preserved during sample collection and during the remainder of the ChIP procedure (Solomon et al., 1988; Solomon & Varshavsky, 1985). Formaldehyde is the most commonly used cross‐linking agent in ChIP experiments as it penetrates most cell walls and because cross‐links are reversible by incubation at higher temperatures; commonly 65°C is used for the purpose of decross‐linking (Orlando et al., 1997). In addition, formaldehyde forms bonds that span a distance of approximately 2 Å (Dedon et al., 1991), resulting in DNA–protein, RNA–protein, and protein–protein cross‐linking. While one needs sufficient amounts of formaldehyde for efficient DNA–protein cross‐linking, it is also crucial not to overcross‐link the sample to avoid the loss of antibody recognition sites and thus loss of ChIP'ed DNA. Many published ChIP protocols use a standard amount of 1% formaldehyde, but we have seen in the past that different concentrations can give better signal to noise ratios in ChIP experiments of algae (Strenkert et al., 2011). For the determination of the optimal cross‐linker concentrations, ChIP was performed using an antibody targeting unmodified histone H3 (as opposed to a certain histone mark) as we expected reasonable nucleosome occupancy at the whole‐genome level, including at a promoter region of the gene encoding the small subunit of Rubisco, RBCS. We tested formaldehyde at concentrations of 0%, 0.1%, 0.3%, 0.7%, and 1 percent. Samples without an antibody (noAB) were used as negative controls, which allowed us to estimate DNA enrichment over background contamination if histone–DNA cross‐linking was successful. While all formaldehyde concentrations tested led to significant enrichment of ChIP'ed DNA over background (Figure 2c), the use of 0.7% formaldehyde yielded the largest amounts of ChIP'ed RBCS promoter DNA and the highest signal to noise (Figure 2c). Lower formaldehyde concentrations gave insufficient cross‐linking, and higher concentrations resulted in overcross‐linking, both of which reduced ChIP efficiency. Therefore, we recommend the use of 0.7% formaldehyde.

FIGURE 2

H3K4me3 peak calling and genomic distribution in Chromochloris. (a) Total number of narrow H3K4me3 peaks in each replicate and the overlap between both replications. (b) Relative fold enrichment of H3K4me3 at different genomic regions as indicated. Each individual sample represents one independent replicate. The fold enrichment represents the number of peaks that overlap with a region divided by the number of peaks that would overlap with the respective region if they were randomly distributed across the genome

A survey of the genome‐wide H3K4me3 distribution pattern in Chromochloris

To further test suitability of our optimized ChIP framework for deep sequencing applications, we determined H3K4me3 enrichment at a genome‐wide level. H3K4me3 is a so‐called narrow‐peak histone mark that seems to be specifically enriched at TSSs of protein coding genes in a variety of different organisms including yeast, mammals, and land plants (Barski et al., 2007; Bernstein et al., 2005; Pokholok et al., 2005; Santos‐Rosa et al., 2002; Zhang et al., 2009). Accordingly, assessment of data quality will be straightforward. In our ChIP experiments, one possible negative control, ChIPs performed without an antibody (noAB controls) did not yield sufficient amounts of DNA for library preparation and subsequent deep sequencing. It is certainly possible to identify DNA‐binding sites based on relative enrichment of different chromosomal regions even without a negative control; however, some chromosomal regions can be significantly enriched even when sequencing genomic DNA alone (Rozowsky et al., 2009). Accordingly, non‐ChIP'ed, genomic DNA is often analyzed in parallel as negative control for ChIP‐Seq experiments to account for this issue. To this end, we included non‐ChIP'ed genomic DNA as our control. We interrogated H3K4 trimethylation from duplicate samples in Chromochloris cultures. mRNA from the same cultures was obtained in parallel and used for RNA‐Seq analysis, with the intention to later analyze it alongside the ChIP‐Seq data. By aligning H3K4me3 ChIP‐Seq reads to the reference genome, we noted that the majority of reads (>98%) could be mapped to the nuclear genome, which suggested that the sequencing data were of a high quality (Table 1). Model‐based analysis using ChIP‐Seq (MACS) software (Zhang et al., 2008) was used for peak calling. We identified a total of 8945 H3K4me3 consensus peaks in both replicates, with the duplicate samples showing excellent overlap between each other (Figure 2a).

TABLE 1

H3K4me3 ChIP‐Seq mapping statistics

	Total reads	Mapped reads	Mapped reads (%)	Unique deduplicate reads
H3K4me3 ChIP Rep1	137,855,434	136,896,574	99.3	76,945,677
H3K4me3 ChIP Rep2	125,105,126	123,333,772	98.6	21,906,729
Input DNA	188,843,106	186,436,803	98.8	163,415,936

H3K4me3 ChIP‐Seq mapping statistics The distribution of H3K4me3 peaks along the Chromochloris genome was further characterized using 11 groups that included distinct genic regions and observed predominant H3K4me3 enrichment at TSSs of genes (Figure 2b). Genome‐wide profiling of H3K4me3 revealed an average H3K4me3 peak width of ±500 bp around the TSSs in the majority of genes (Figure 3).

FIGURE 3

Genome‐wide distribution pattern of H3K4me3 in Chromochloris. Shown is the average normalized H3K4me3 enrichment for both replicates (blue and red) and input genomic DNA (black) at the TSSs (gray line) and ±2500 bps from TSSs. Values were normalized by scaling based on the number of uniquely mapped, nonduplicate reads

H3K4me3 and gene expression

To get an integrated view on H3K4me3 enrichment and its relationship to gene expression, we compared transcript abundances of individual genes with H3K4me3 enrichment at their respective TSSs. An increase in transcript abundance seems to correlate with the chances of a gene harboring a H3K4me3 at its TSS (Figure 4). When we further interrogated transcript abundances of genes that harbor a H3K4me3 peak at their TSS and compared it with those that do not, again, we saw a clear, positive correlation between active transcription and the likelihood of a gene being marked with H3K4me3 (Figure 5).

FIGURE 4

FIGURE 5

Integrated analyses of H3K4me3 enrichment and gene expression. Transcript abundance values (in transcripts per million [TPMs]) from genes that harbor a H3K4me3 peak at their TSSs (blue) versus all others (gray) are visualized using boxplots

H3K4me3 enrichment ranked by gene expression. Shown are H3K4me3 values visualized as heat map. H3K4me3 values are sorted based on the respective expression values (as transcripts per million [TPMs]) for both replicates at the TSSs (dashed line) and ±2000 bp from TSSs Integrated analyses of H3K4me3 enrichment and gene expression. Transcript abundance values (in transcripts per million [TPMs]) from genes that harbor a H3K4me3 peak at their TSSs (blue) versus all others (gray) are visualized using boxplots

DISCUSSION

Epigenetic regulation is a prerequisite for mediating transcriptional responses in general and for the fine tuning of genetically programmed changes in particular. ChIP‐Seq is a powerful tool to assess changes in histone mark patterns and to identify transcription factor‐binding sites. Most importantly, some critical parameters in the procedure need to be optimized in each organism individually to ensure proper data quality and reproducibility between samples. Given the conserved, genome‐wide distribution pattern of H3K4me3 in many diverse taxa, ChIP‐Seq using H3K4me3 was facilitated as a proof of principle application to illustrate desired data quality of our optimized ChIP‐Seq frame work. H3K4me3 was shown to be a narrow‐peak histone mark that is enriched at TSSs of genes in yeast, fly, human, algae, and plants (Barski et al., 2007; Bernstein et al., 2005; Pokholok et al., 2005; Santos‐Rosa et al., 2002; Zhang et al., 2009). Unsurprisingly, and in line with a conserved role for H3K4me3 in gene activation, Chromochloris genes are also marked with H3K4me3 at their TSSs. Notably, the likelihood of a gene being marked by H3K4me3 was higher, if the underlying gene was transcribed. The optimized ChIP‐Seq framework established herein will pave the way for studying epigenetic changes and transcription factor‐binding sites in the emerging reference alga Chromochloris. Moreover, the coverage achieved by our ChIP‐Seq frame work will be sufficient for quantitative recovery of narrow‐ and broad‐peak histone marks alike. This work represents a critical first step in establishing chromatin state dynamics in Chromochloris.

CONFLICT OF INTEREST

The authors have no conflicts of interest to declare. Figure S1. Histone H3 is a highly conserved protein between Chromochloris and Human. Multiple sequence alignment was performed using Clustal Omega. Multiple sequence alignment of Histone H3: Chromochloris zofingiensis (Cz05g04150), (CAB02546.1). Under the alignment, asterisks indicate identical residues between both organisms. Click here for additional data file.

37 in total

1. Mapping DNA interaction sites of chromosomal proteins using immunoprecipitation and polymerase chain reaction.

Authors: A Hecht; M Grunstein
Journal: Methods Enzymol Date: 1999 Impact factor: 1.600

2. Targeted recruitment of Set1 histone methylase by elongating Pol II provides a localized mark and memory of recent transcriptional activity.

Authors: Huck Hui Ng; François Robert; Richard A Young; Kevin Struhl
Journal: Mol Cell Date: 2003-03 Impact factor: 17.970

3. Chromatin techniques for plant cells.

Authors: Chris Bowler; Giovanna Benvenuto; Pierre Laflamme; Diana Molino; Aline V Probst; Muhammad Tariq; Jerzy Paszkowski
Journal: Plant J Date: 2004-09 Impact factor: 6.417

Review 4. Chromatin immunoprecipitation assay.

Authors: Partha M Das; Kavitha Ramachandran; Jane vanWert; Rakesh Singal
Journal: Biotechniques Date: 2004-12 Impact factor: 1.993

5. A simplified formaldehyde fixation and immunoprecipitation technique for studying protein-DNA interactions.

Authors: P C Dedon; J A Soults; C D Allis; M A Gorovsky
Journal: Anal Biochem Date: 1991-08-15 Impact factor: 3.365

Review 6. Methylation of lysine 4 on histone H3: intricacy of writing and reading a single epigenetic mark.

Authors: Alexander J Ruthenburg; C David Allis; Joanna Wysocka
Journal: Mol Cell Date: 2007-01-12 Impact factor: 17.970

7. BigWig and BigBed: enabling browsing of large distributed datasets.

Authors: W J Kent; A S Zweig; G Barber; A S Hinrichs; D Karolchik
Journal: Bioinformatics Date: 2010-07-17 Impact factor: 6.937

8. Genome-wide and organ-specific landscapes of epigenetic modifications and their relationships to mRNA and small RNA transcriptomes in maize.

Authors: Xiangfeng Wang; Axel A Elling; Xueyong Li; Ning Li; Zhiyu Peng; Guangming He; Hui Sun; Yijun Qi; X Shirley Liu; Xing Wang Deng
Journal: Plant Cell Date: 2009-04-17 Impact factor: 11.277

9. PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls.

Authors: Joel Rozowsky; Ghia Euskirchen; Raymond K Auerbach; Zhengdong D Zhang; Theodore Gibson; Robert Bjornson; Nicholas Carriero; Michael Snyder; Mark B Gerstein
Journal: Nat Biotechnol Date: 2009-01-04 Impact factor: 54.908

10. Mapping protein-DNA interactions in vivo with formaldehyde: evidence that histone H4 is retained on a highly transcribed gene.

Authors: M J Solomon; P L Larsen; A Varshavsky
Journal: Cell Date: 1988-06-17 Impact factor: 41.582