| Literature DB >> 19159457 |
Philippe Lefrançois1, Ghia M Euskirchen, Raymond K Auerbach, Joel Rozowsky, Theodore Gibson, Christopher M Yellman, Mark Gerstein, Michael Snyder.
Abstract
BACKGROUND: Short-read high-throughput DNA sequencing technologies provide new tools to answer biological questions. However, high cost and low throughput limit their widespread use, particularly in organisms with smaller genomes such as S. cerevisiae. Although ChIP-Seq in mammalian cell lines is replacing array-based ChIP-chip as the standard for transcription factor binding studies, ChIP-Seq in yeast is still underutilized compared to ChIP-chip. We developed a multiplex barcoding system that allows simultaneous sequencing and analysis of multiple samples using Illumina's platform. We applied this method to analyze the chromosomal distributions of three yeast DNA binding proteins (Ste12, Cse4 and RNA PolII) and a reference sample (input DNA) in a single experiment and demonstrate its utility for rapid and accurate results at reduced costs.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19159457 PMCID: PMC2656530 DOI: 10.1186/1471-2164-10-37
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Scheme for yeast barcoded ChIP-Seq. (a) Barcoded ChIP-Seq workflow. Ovals depict yeast cells and squares depict proteins. An aliquot of sheared cell lysate is not immunoprecipitated but is otherwise processed normally (green). This DNA, termed input DNA, is a reference sample for ChIP-Seq. Illumina DNA libraries are generated from both ChIP and input DNA samples. In multiplex ChIP-Seq, a barcoded adapter is ligated to an individual DNA sample. The barcode has 3 unique bases followed by a 'T' to anneal with the end-repaired DNA. Four libraries are then pooled together and applied to a single flowcell lane. After sequencing on the Genome Analyzer, reads are separated according to the first four bases and aligned to the yeast genome. Reads are stacked to generate a signal profile and scored against a pool of input DNA to determine significant transcription factor binding sites. (b) The barcode (orange) is located between Illumina adapter sequences (purple) and ChIP or input DNA inserts (black). The sequencing primer (pink) anneals to the adapter sequences and short reads start with the four bases of the barcode (orange) followed by DNA inserts (black). For the sequencing primer and Illumina adapter, oligonucleotide sequences were given by the manufacturer © 2006 Illumina, Inc. All rights reserved.
Oligonucleotide sequences for generation of the four barcoded adapters
| MPLEXA1F | GTAT | None | ACACTCTTTCCCTACACGACGCTCTTCCGATCTGTAT |
| MPLEXA1R | GTAT | 5' phosphate | TACAGATCGGAAGAGCTCGTATGCCGTCTTCTGCTTG |
| MPLEXA6F | CATT | None | ACACTCTTTCCCTACACGACGCTCTTCCGATCTCAT |
| MPLEXA6R | CATT | 5' phosphate | ATGAGATCGGAAGAGCTCGTATGCCGTCTTCTGCTTG |
| MPLEXA8F | ACGT | None | ACACTCTTTCCCTACACGACGCTCTTCCGATCTACG |
| MPLEXA8R | ACGT | 5' phosphate | CGTAGATCGGAAGAGCTCGTATGCCGTCTTCTGCTTG |
| MPLEXA9F | TGCT | None | ACACTCTTTCCCTACACGACGCTCTTCCGATCTTGC |
| MPLEXA9R | TGCT | 5' phosphate | GCAAGATCGGAAGAGCTCGTATGCCGTCTTCTGCTTG |
Figure 2Comparison of input DNA signal tracks among all four barcoded adapters relative to standard Illumina adapters. An input sample was split in five aliquots. Four were barcoded differentially (top four lanes) and one had non-barcoded, Illumina adapters (fifth lane, labeled 'None'). Barcoded inputs were scored against non-barcoded input. IGB signal tracks of yeast chromosome 16 are shown for each sample, with ORF locations on the x-axis. ORFs are depicted in purple. On the y-axis, a normalized scale represents the number of read counts at a particular location. Each scale is normalized according to the number of mapped reads (Table 10). A box in the left panel depicts the enlarged section shown in the right panel for positions between 828,000 and 833,000 to demonstrate the overlap among all signal tracks.
Comparisons of the average tag count in 500 bp windows across the yeast genome among different samples
| Experiment A | Experiment B | ||||
| Experiment # | Experiment ID | Experiment # | Experiment ID | R2 | p-value |
| 10 | Input_NB | 14 | Input_NB | 0.9455 | < 2.2e-16 |
| 14 | Input_NB | 15 | Input_ACGT | 0.9556 | < 2.2e-16 |
| 14 | Input_NB | 16 | Input_CATT | 0.9473 | < 2.2e-16 |
| 14 | Input_NB | 17 | Input_GTAT | 0.9685 | < 2.2e-16 |
| 14 | Input_NB | 18 | Input_TGCT | 0.9325 | < 2.2e-16 |
| 15 | Input_ACGT | 16 | Input_CATT | 0.9422 | < 2.2e-16 |
| 17 | Input_GTAT | 18 | Input_TGCT | 0.9304 | < 2.2e-16 |
| 1 | PolII_Rep1_ACGT | 8 | PolII_Rep2_TGCT | 0.8904 | < 2.2e-16 |
| 1 | PolII_Rep1_ACGT | 9 | PolII_Rep3_ACGT | 0.9104 | < 2.2e-16 |
| 1 | PolII_Rep1_ACGT | 13 | PolII_Rep4_NB | 0.9040 | < 2.2e-16 |
| 8 | PolII_Rep2_TGCT | 9 | PolII_Rep3_ACGT | 0.9288 | < 2.2e-16 |
| 8 | PolII_Rep2_TGCT | 13 | PolII_Rep4_NB | 0.9464 | < 2.2e-16 |
| 9 | PolII_Rep3_ACGT | 13 | PolII_Rep4_NB | 0.9302 | < 2.2e-16 |
| 4 | Ste12_Rep1_TGCT | 7 | Ste12_Rep2_GTAT | 0.9655 | < 2.2e-16 |
| 4 | Ste12_Rep1_TGCT | 12 | Ste12_Rep3_NB | 0.9058 | < 2.2e-16 |
| 7 | Ste12_Rep2_GTAT | 12 | Ste12_Rep3_NB | 0.8963 | < 2.2e-16 |
Correlation coefficients on the average tag count in 500 bp bins between two experiments among barcoded and non-barcoded input DNA samples from the section "Similar barcode behavior with the same DNA sample" (Comparison 1–7), among barcoded and non-barcoded PolII replicates (Comparison 8–13) and among barcoded and non-barcoded Ste12 replicates (Comparison 14–16). p-values are inferior to 2.2e-16, the lowest p-value that R can calculate. See Methods for details of the analysis.
Experimental design and values for ChIP replicates and input samples
| PolII_Rep1 | ACGT | PolII | 16.6 | 1.83 | 1110177 | 623328 | 56.15 | 14 | |
| Input_CATT | CATT | Input | 51.6 | 1.85 | 2526689 | 1719855 | 68.07 | Not being scored | |
| Cse4_Rep1 | GTAT | Cse4 | 12.0 | 1.99 | 2606497 | 1080537 | 41.46 | 81 | |
| Ste12_Rep1 | TGCT | Ste12 | 11.6 | 1.85 | 2020432 | 1072428 | 53.08 | 42 | |
| Input_ACGT | ACGT | Input | 66.9 | 1.80 | 4678579 | 2857615 | 61.08 | Not being scored | |
| Cse4_Rep2 | CATT | Cse4 | 8.6 | 1.85 | 679315 | 357384 | 52.61 | 25 | |
| Ste12_Rep2 | GTAT | Ste12 | 9.4 | 1.86 | 2316023 | 1067739 | 46.10 | 44 | |
| PolII_Rep2 | TGCT | PolII | 15.5 | 2.00 | 3991833 | 2561147 | 64.16 | 12 | |
| PolII_Rep3 | ACGT | PolII | 9.7 | 1.97 | 2661304 | 1725793 | 64.85 | 42 | |
| Input_NB | None | Input | 60.4 | 1.83 | 2432221 | 1582226 | 65.05 | Not being scored | |
| Cse4_Rep3 | None | Cse4 | 10.7 | 1.69 | 1168583 | 239580 | 20.50 | 60 | |
| Cse4_Rep3 | None | Cse4 | 10.7 | 1.69 | 3245287 | 583876 | 17.99 | ||
| Ste12_Rep3 | None | Ste12 | 9.1 | 1.64 | 2106917 | 473965 | 22.50 | 39 | |
| Ste12_Rep3 | None | Ste12 | 9.1 | 1.64 | 2668067 | 551040 | 20.65 | ||
| PolII_Rep4 | None | PolII | 18.8 | 1.94 | 4649077 | 3151068 | 67.78 | 40 | |
| Input_NB | None | Input | 60.4 | 1.83 | 4373256 | 2455181 | 56.14 | Not being scored | |
| Input_ACGT | ACGT | Input | 66.9 | 1.80 | 873037 | 529038 | 60.60 | 5 | |
| Input_CATT | CATT | Input | 51.6 | 1.85 | 799430 | 479546 | 59.99 | 5 | |
| Input_GTAT | GTAT | Input | 34.6 | 1.69 | 1380820 | 850262 | 61.58 | 6 | |
| Input_TGCT | TGCT | Input | 68.1 | 1.78 | 611123 | 362458 | 59.31 | 4 |
NB (No Barcode) indicates that standard non-barcoded Illumina adapters were used. DNA concentrations of Illumina DNA libraries and their A260/280 ratios are given in columns 5 and 6.
Mapping values after sequencing (total sequencing reads, total mapped reads and percentage of mapped reads) are shown in columns 7, 8 and 9.
Threshold values for PeakSeq scoring algorithm are displayed in column 10.
* Sample ran twice on sequencer and reads pooled together from both lanes for scoring as a single experiment
DNA libraries pooled for multiplex sequencing
| PolII_Rep1 | Input_CATT | Cse4_Rep1 | Ste12_Rep1 | |
| Input_ACGT | Cse4_Rep2 | Ste12_Rep2 | PolII_Rep2 | |
| PolII_Rep3 | Input_CATT | Cse4_Rep1 | Ste12_Rep1 |
Figure 3Barcoded adapters perform similarly to standard Illumina adapters and do not crossover to other samples in the same lane. (a) RNA PolII binding profiles from different biological replicates with the same barcode (PolII_Rep1, dark blue; PolII_Rep3, red), with different barcodes (PolII_Rep2, orange) or without barcode (PolII_Rep4, green) strongly overlap. See also Table 3. Input DNA serves as a reference (light blue). IGB signal tracks of chromosome 5 between 130,000 and 320,000 are shown for each library. A box in the left panel depicts the enlarged section shown in the right panel between positions 298,000 and 309,000 to illustrate the overlap among all PolII signal tracks. (b) Binding profiles from four different libraries pooled and sequenced in the same flowcell lane show very little resemblance. Shown here are the binding profiles for Cse4_Rep2 (dark blue), Ste12_Rep2 (red), PolII_Rep2 (green) and Input_ACGT (light blue). IGB signal tracks of chromosome 12 between 80,000 and 210,000 are shown for each sample. For (a) and (b), axis and scale normalizations are similar to Figure 2. (c) Left: Rank-rank comparison of target lists between all pairwise barcoded replicates for Cse4, PolII and Ste12. The horizontal axis shows the fraction of the two lists being compared and the vertical axis shows the fraction of those targets that agree between a given pair of target lists. All comparisons show strong agreement, although the rank lists for Cse4 differ more than PolII or Ste12 for the second half of their length. Right: Rank-rank comparison between barcoded replicates from the same factors (averaged over all pairwise comparisons) compared to rank-rank comparisons for barcoded replicates between different factors: PolII_Rep1 (ACGT) vs. Ste12_Rep1 (TGCT) and Cse4_Rep2 (CATT) vs. Ste12_Rep2 (GTAT).
Figure 4Ste12 distribution during pseudohyphal growth is similar across three different biological replicates. Two barcoded replicates (Ste12_Rep2, dark blue; Ste12_Rep1, red) and a non-barcoded replicate (Ste12_Rep3, green) were compared to input DNA (light blue). Ste12 ChIP samples were scored against a pool of input DNA. IGB signal tracks of chromosome 2 between 340,000 and 410,000 are shown for each sample. Axis and scale normalizations are similar to Figure 2. A box in the left panel containing the TEC1 gene and its surrounding intergenic region was enlarged in panel B and rescaled to emphasize the strong signal at the TEC1 promoter. The same normalization as in Figure 2 was applied. Ste12p and Tec1p act as a dimer during pseudohyphal growth [31].
GO process analysis of gene targets for Cse4, Ste12 and PolII. For a complete GO analysis, please refer to Additional File 5.
| GO process categories for Cse4 | |||
| glycolysis | 8 out of 132 genes, 6.1% | 22 out of 7158 background genes, 0.3% | 8.32e-07 |
| glucose catabolic process | 8 out of 132 genes, 6.1% | 32 out of 7158 background genes, 0.4% | 2.34e-05 |
| alcohol metabolic process | 16 out of 132 genes, 12.1% | 179 out of 7158 background genes, 2.5% | 4.49e-05 |
| hexose catabolic process | 8 out of 132 genes, 6.1% | 37 out of 7158 background genes, 0.5% | 7.96e-05 |
| cellular biosynthetic process | 57 out of 132 genes, 43.2% | 1689 out of 7158 background genes, 23.6% | 0.00013 |
| monosaccharide catabolic process | 8 out of 132 genes, 6.1% | 42 out of 7158 background genes, 0.6% | 0.00023 |
| alcohol catabolic process | 8 out of 132 genes, 6.1% | 45 out of 7158 background genes, 0.6% | 0.00039 |
| pyruvate metabolic process | 7 out of 132 genes, 5.3% | 39 out of 7158 background genes, 0.5% | 0.00174 |
| hexose metabolic process | 9 out of 132 genes, 6.8% | 78 out of 7158 background genes, 1.1% | 0.00349 |
| biosynthetic process | 58 out of 132 genes, 43.9% | 1929 out of 7158 background genes, 26.9% | 0.00496 |
| hexose biosynthetic process | 6 out of 132 genes, 4.5% | 31 out of 7158 background genes, 0.4% | 0.00529 |
| glucose metabolic process | 8 out of 132 genes, 6.1% | 64 out of 7158 background genes, 0.9% | 0.00601 |
| GO process categories for Ste12 | |||
| biological regulation | 193 out of 823 genes, 23.5% | 1132 out of 7158 background genes, 15.8% | 7.46e-07 |
| regulation of cellular process | 158 out of 823 genes, 19.2% | 889 out of 7158 background genes, 12.4% | 2.02e-06 |
| regulation of biological process | 160 out of 823 genes, 19.4% | 904 out of 7158 background genes, 12.6% | 2.13e-06 |
| monosaccharide transport | 15 out of 823 genes, 1.8% | 24 out of 7158 background genes, 0.3% | 3.48e-06 |
| hexose transport | 15 out of 823 genes, 1.8% | 24 out of 7158 background genes, 0.3% | 3.48e-06 |
| multi-organism process | 40 out of 823 genes, 4.9% | 136 out of 7158 background genes, 1.9% | 9.27e-06 |
| response to pheromone | 32 out of 823 genes, 3.9% | 96 out of 7158 background genes, 1.3% | 1.03e-05 |
| response to chemical stimulus | 86 out of 823 genes, 10.4% | 412 out of 7158 background genes, 5.8% | 1.18e-05 |
| response to stimulus | 144 out of 823 genes, 17.5% | 818 out of 7158 background genes, 11.4% | 2.41e-05 |
| cellular carbohydrate metabolic process | 47 out of 823 genes, 5.7% | 187 out of 7158 background genes, 2.6% | 0.00011 |
| growth | 40 out of 823 genes, 4.9% | 148 out of 7158 background genes, 2.1% | 0.00012 |
| regulation of transcription from RNA Pol II | 56 out of 823 genes, 6.8% | 243 out of 7158 background genes, 3.4% | 0.00017 |
| regulation of cell size | 34 out of 823 genes, 4.1% | 117 out of 7158 background genes, 1.6% | 0.00017 |
| sexual reproduction | 34 out of 823 genes, 4.1% | 120 out of 7158 background genes, 1.7% | 0.00033 |
| conjugation | 34 out of 823 genes, 4.1% | 120 out of 7158 background genes, 1.7% | 0.00033 |
| conjugation with cellular fusion | 34 out of 823 genes, 4.1% | 120 out of 7158 background genes, 1.7% | 0.00033 |
| response to abiotic stimulus | 32 out of 823 genes, 3.9% | 111 out of 7158 background genes, 1.6% | 0.00047 |
| cellular structure morphogenesis | 40 out of 823 genes, 4.9% | 156 out of 7158 background genes, 2.2% | 0.00057 |
| anatomical structure development | 40 out of 823 genes, 4.9% | 156 out of 7158 background genes, 2.2% | 0.00057 |
| cell morphogenesis | 40 out of 823 genes, 4.9% | 156 out of 7158 background genes, 2.2% | 0.00057 |
| anatomical structure morphogenesis | 40 out of 823 genes, 4.9% | 156 out of 7158 background genes, 2.2% | 0.00057 |
| signal transduction | 51 out of 823 genes, 6.2% | 222 out of 7158 background genes, 3.1% | 0.00067 |
| carbohydrate metabolic process | 49 out of 823 genes, 6.0% | 210 out of 7158 background genes, 2.9% | 0.00068 |
| external encapsulating structure organization | 48 out of 823 genes, 5.8% | 206 out of 7158 background genes, 2.9% | 0.00093 |
| cell wall organization and biogenesis | 48 out of 823 genes, 5.8% | 206 out of 7158 background genes, 2.9% | 0.00093 |
| carboxylic acid metabolic process | 67 out of 823 genes, 8.1% | 332 out of 7158 background genes, 4.6% | 0.00195 |
| organic acid metabolic process | 67 out of 823 genes, 8.1% | 332 out of 7158 background genes, 4.6% | 0.00195 |
| carbohydrate transport | 15 out of 823 genes, 1.8% | 35 out of 7158 background genes, 0.5% | 0.00254 |
| cell growth | 26 out of 823 genes, 3.2% | 87 out of 7158 background genes, 1.2% | 0.00268 |
| reproduction | 65 out of 823 genes, 7.9% | 323 out of 7158 background genes, 4.5% | 0.00309 |
| hexose metabolic process | 24 out of 823 genes, 2.9% | 78 out of 7158 background genes, 1.1% | 0.00367 |
| glucose metabolic process | 21 out of 823 genes, 2.6% | 64 out of 7158 background genes, 0.9% | 0.00478 |
| pseudohyphal growth | 21 out of 823 genes, 2.6% | 65 out of 7158 background genes, 0.9% | 0.00631 |
| cell communication | 56 out of 823 genes, 6.8% | 271 out of 7158 background genes, 3.8% | 0.00665 |
| nitrogen compound metabolic process | 51 out of 823 genes, 6.2% | 241 out of 7158 background genes, 3.4% | 0.00860 |
| establishment and maintenance of cell polarity | 32 out of 823 genes, 3.9% | 126 out of 7158 background genes, 1.8% | 0.00953 |
| Top 40 GO process categories for PolII | |||
| cellular process | 1864 out of 2508 genes, 74.3% | 4710 out of 7158 background genes, 65.8% | 8.82e-27 |
| ribonucleoprotein complex biogenesis | 289 out of 2508 genes, 11.5% | 494 out of 7158 background genes, 6.9% | 3.36e-25 |
| ribosome biogenesis | 246 out of 2508 genes, 9.8% | 407 out of 7158 background genes, 5.7% | 7.01e-24 |
| cellular component organization and biogenesis | 785 out of 2508 genes, 31.3% | 1725 out of 7158 background genes, 24.1% | 4.02e-22 |
| organelle organization and biogenesis | 621 out of 2508 genes, 24.8% | 1331 out of 7158 background genes, 18.6% | 2.59e-19 |
| primary metabolic process | 1276 out of 2508 genes, 50.9% | 3187 out of 7158 background genes, 44.5% | 1.59e-12 |
| maturation of SSU-rRNA from tricistronic rRNA | 49 out of 2508 genes, 2.0% | 60 out of 7158 background genes, 0.8% | 1.86e-10 |
| RNA metabolic process | 486 out of 2508 genes, 19.4% | 1082 out of 7158 background genes, 15.1% | 2.71e-10 |
| nucleotide and nucleic acid metabolic process | 649 out of 2508 genes, 25.9% | 1512 out of 7158 background genes, 21.1% | 6.60e-10 |
| maturation of SSU-rRNA | 49 out of 2508 genes, 2.0% | 62 out of 7158 background genes, 0.9% | 1.97e-09 |
| rRNA metabolic process | 143 out of 2508 genes, 5.7% | 257 out of 7158 background genes, 3.6% | 7.39e-09 |
| ribosomal large subunit biogenesis | 50 out of 2508 genes, 2.0% | 65 out of 7158 background genes, 0.9% | 7.46e-09 |
| ribonucleoprotein complex assembly | 96 out of 2508 genes, 3.8% | 156 out of 7158 background genes, 2.2% | 1.23e-08 |
| macromolecule metabolic process | 1096 out of 2508 genes, 43.7% | 2752 out of 7158 background genes, 38.4% | 1.71e-08 |
| biopolymer metabolic process | 1033 out of 2508 genes, 41.2% | 2580 out of 7158 background genes, 36.0% | 2.55e-08 |
| ribosomal subunit assembly | 44 out of 2508 genes, 1.8% | 56 out of 7158 background genes, 0.8% | 4.08e-08 |
| cellular localization | 303 out of 2508 genes, 12.1% | 644 out of 7158 background genes, 9.0% | 4.38e-08 |
| ribosome assembly | 49 out of 2508 genes, 2.0% | 65 out of 7158 background genes, 0.9% | 4.46e-08 |
| cellular metabolic process | 1340 out of 2508 genes, 53.4% | 3448 out of 7158 background genes, 48.2% | 4.55e-08 |
| metabolic process | 1356 out of 2508 genes, 54.1% | 3496 out of 7158 background genes, 48.8% | 6.00e-08 |
| rRNA processing | 137 out of 2508 genes, 5.5% | 250 out of 7158 background genes, 3.5% | 9.13e-08 |
| ncRNA processing | 172 out of 2508 genes, 6.9% | 335 out of 7158 background genes, 4.7% | 3.92e-07 |
| cellular macromolecular complex organization | 184 out of 2508 genes, 7.3% | 364 out of 7158 background genes, 5.1% | 4.79e-07 |
| establishment of localization in cell | 278 out of 2508 genes, 11.1% | 594 out of 7158 background genes, 8.3% | 6.51e-07 |
| RNA processing | 234 out of 2508 genes, 9.3% | 487 out of 7158 background genes, 6.8% | 9.22e-07 |
| gene expression | 704 out of 2508 genes, 28.1% | 1708 out of 7158 background genes, 23.9% | 9.27e-07 |
| intracellular transport | 260 out of 2508 genes, 10.4% | 551 out of 7158 background genes, 7.7% | 9.36e-07 |
| regulation of translation | 38 out of 2508 genes, 1.5% | 49 out of 7158 background genes, 0.7% | 1.88e-06 |
| maturation of 5.8S rRNA | 37 out of 2508 genes, 1.5% | 48 out of 7158 background genes, 0.7% | 4.22e-06 |
| maturation of 5.8S rRNA from tricistronic rRNA | 37 out of 2508 genes, 1.5% | 48 out of 7158 background genes, 0.7% | 4.22e-06 |
| posttranscriptional regulation of gene expression | 44 out of 2508 genes, 1.8% | 61 out of 7158 background genes, 0.9% | 4.99e-06 |
| macromolecular complex subunit organization | 197 out of 2508 genes, 7.9% | 404 out of 7158 background genes, 5.6% | 5.30e-06 |
| nuclear export | 64 out of 2508 genes, 2.6% | 102 out of 7158 background genes, 1.4% | 1.28e-05 |
| localization | 448 out of 2508 genes, 17.9% | 1046 out of 7158 background genes, 14.6% | 1.35e-05 |
| nuclear transport | 76 out of 2508 genes, 3.0% | 128 out of 7158 background genes, 1.8% | 1.86e-05 |
| nucleocytoplasmic transport | 76 out of 2508 genes, 3.0% | 128 out of 7158 background genes, 1.8% | 1.86e-05 |
| cleavages during rRNA processing | 29 out of 2508 genes, 1.2% | 36 out of 7158 background genes, 0.5% | 3.89e-05 |
| RNA 5'-end processing | 26 out of 2508 genes, 1.0% | 31 out of 7158 background genes, 0.4% | 4.30e-05 |
| transcription from RNA polymerase I promoter | 30 out of 2508 genes, 1.2% | 38 out of 7158 background genes, 0.5% | 5.25e-05 |
| regulation of cellular biosynthetic process | 46 out of 2508 genes, 1.8% | 68 out of 7158 background genes, 0.9% | 5.66e-05 |
GO process analysis (p < 0.01) for Cse4 combined targets, for Ste12 combined targets and for PolII combined targets.
For a complete GO analysis, please refer to Additional File 5.
Figure 5PolII signal profiles recapitulate findings from Steinmetz et al. PolII ChIP-Seq signal profiles resemble very closely to those published in Figure 3 of Steinmetz et al [54]. We obtained consistent binding at the Bap2-Tat1 loci (a) and at the Sed1-Shu2 loci (b). As expected, we did not observe binding at the Flo11 locus (c). For PolII ChIP-Seq experiments, two biological replicates were barcoded with ACGT (PolII_Rep1, dark blue; PolII_Rep2, orange), one was barcoded with TGCT (PolII_Rep3, red) and a fourth replicate had non-barcoded adapters (PolII_Rep4, green). Input DNA serves as a reference (light blue). Axis and scale normalizations are similar to Figure 2. ORFs above the coordinates axis are on the Watson strand while ORFs below this axis are on the Crick strand.
Figure 6Cse4p is found robustly at centromeres. All biological replicates were strongly and tightly bound to centromeres, as it is depicted here in the case of CEN11. Two barcoded replicates (Cse4_Rep2, dark blue; Cse4_Rep1, red) and a non-barcoded replicate (Cse4_Rep3, green) were compared to input DNA (light blue). Cse4 ChIP samples were scored against a pool of input DNA. IGB signal tracks of the CEN11 on chromosome 11 are shown for each sample. CEN11 is highlighted in a yellow box. Axis and scale normalizations are similar to Figure 2.
Cost per sample and number of mapped reads with increasing multiplexing
| 1 | 100% | 10.00 M |
| 4 | 34.61% | 2.50 M |
| 8 | 23.71% | 1.25 M |
| 12 | 20.08% | 0.83 M |
| 16 | 18.26% | 0.63 M |
| 40 | 15.00% | 0.25 M |
*Includes Illumina DNA library preparation (fixed cost), sequencing run
*Excludes ChIP reagents, Illumina GAII sequencer, HR and data storage
*For a non-barcoded sample, the cost per sample is $498.23 USD as of October 2008.
**Assumes optimal mapping values for a lane run on a GA II (10 million mapped reads)
ChIP-Seq simulations of depth of sequencing for S. cerevisiae
| Fold Enrichment | 2× | 5× | 10× | 50× |
| Number of Reads | 2,500,000 | 260,000 | 90,000 | 18,000 |
Number of reads required in order to identify greater than 95% of 600 simulated binding sites distributed evenly over the 12 Mb S. cerevisiae genome with average fold enrichments of 2×, 5×, 10× and 50×. See Methods for details of the simulation.
ChIP-Seq simulations of depth of sequencing for three model organisms
| Fold Enrichment of Simulated Sites | |||
| Genome Size (Mb) | 5× | 10× | |
| 100 | 2,400,000 | 800,000 | |
| 123 | 3,500,000 | 1,000,000 | |
| 157 | 6,800,000 | 1,500,000 | |
Number of reads required in order to identify greater than 95% of 1,000 simulated binding sites distributed over the genomes of C. elegans, D. melanogaster and A. thaliana for average fold enrichments of 5× and 10×. See Methods for details of the simulation.
Yeast strains used in this study
| CMY 288-1B | MATα his3Δ1 leu2Δ0 lys2Δ0 ura3Δ0 | BY 4741 | Christopher Yellman, unpublished |
| YJM 339 | MATa/α HO/HO | Clinical isolate | McCusker et al., 1994 |
| CMY 291 | MATa/α HO/HO STE12-13MYC-kanMX6/STE12-13MYC-kanMX6 | YJM 339 | Christopher Yellman, unpublished |
| CMY 8058-3-4 | MATa ade2-1 bar1::loxP can1H cyh2H gal1H his3Δ1 leu2-3,112 trp1-289 ura3-52 | A364a | Christopher Yellman, unpublished |
| CMY 8082-20-3 | MATa ade2-1 bar1::loxP can1H cyh2H gal1H his3Δ1 leu2-3,112 trp1-289 ura3-52 CSE4-3HA-URA3 | A364a | Christopher Yellman, unpublished; Meluh et al., 1998 |
Primer sequences for qPCR analysis of Cse4, PolII and Ste12 ChIPs
| Cse4 | Cse4P1 | Cse4P1for | GATCAGCGCCAAACAATATGGAAAATCC |
| Cse4 | Cse4P1 | Cse4P1rev | AACTTCCACCAGTAAACGTTTCATATATCC |
| Cse4 | Cse4P2 | Cse4P2for | CGTATTACAATGGCCGAAGGC |
| Cse4 | Cse4P2 | Cse4P2rev | GCGACAACAAGAGGGAAATGA |
| Cse4 | Cse4P3 | Cse4P3for | CGTCCAAACATGAAAGTGCTCC |
| Cse4 | Cse4P3 | Cse4P3rev | CAGCGATTGACTTTCTCCCATT |
| Cse4 | Cse4P4 | Cse4P4for | GAAGCGTTATGGAACCTGTCGAA |
| Cse4 | Cse4P4 | Cse4P4rev | GTCGGTCGTCCAATATCATCGTAAA |
| PolII | Pol2P1 | Pol2P1for | ACCGGTACAAGGACAAGACG |
| PolII | Pol2P1 | Pol2P1rev | GTTCGTTCTCACGCACTTCA |
| PolII | Pol2P2 | Pol2P2for | AAGACGCTCGAAACCAAGTG |
| PolII | Pol2P2 | Pol2P2rev | GCTCACGTTTTGCAATGATG |
| PolII | Pol2P3 | Pol2P3for | GACCGTTGCAAGGATTGATAA |
| PolII | Pol2P3 | Pol2P3rev | TCAACCGAAGGAAGGAGAAA |
| PolII | Pol2P4 | Pol2P4for | ATTGCCTGGTTCTTGTCCTG |
| PolII | Pol2P4 | Pol2P4rev | CGTTGGCATATCACACCTTG |
| Ste12 | Ste12P1 | Pseudo35 | CCCGTAGTCCGGTTTAATCA |
| Ste12 | Ste12P1 | Pseudo36 | AACTGTGCATGAGCCAAGAG |
| Ste12 | Ste12P2 | Pseudo37 | AAAAGGAGATAGGGCCCAGA |
| Ste12 | Ste12P2 | Pseudo38 | CCAGAACAGCCAGCTAGACC |
| Ste12 | Ste12P3 | Pseudo39 | TCGGGCTTCTAAGGCAAATA |
| Ste12 | Ste12P3 | Pseudo40 | TCCTTTAAATGATGTTGCGATG |
| Ste12 | Ste12P4 | Pseudo41 | TGTAGCCCAACGGATTCTTC |
| Ste12 | Ste12P4 | Pseudo42 | AGAAGCTTTGCCAGGTGAAA |
Figure 7qPCR analysis for ChIP samples Cse4-3HA (a), Ste12-13Myc (b) and RNA polymerase PolII (c). For all qPCR analysis, normalization using the 2-ΔΔCp method was used to compare results from a given primer pair to a negative primer pair (respectively Cse4P2, Ste12P1 and Pol2P4) (Table 9). Error bars represent standard deviation across three biological replicates for relative enrichment to the negative primer pair. (a) Cse4p is enriched preferentially at the centromeres (Cse4P1 for CEN3 and Cse4P4 for CEN6) but not at two random non-centromeric locations (Cse4P2 and Cse4P3). (b) Ste12p binds known target sites in pseudohyphal growth. Ste12P1, Ste12P2, Ste12P3 and Ste12P4 represent respectively sites with no enrichment, low enrichment, medium enrichment and high enrichment as determined by ChIP-chip studies (Christopher M. Yellman, unpublished data). While the low enrichment site was not found to be significantly enriched for this ChIP sample, the medium and high enrichment sites were strongly present in our samples used for qPCR. (c) PolII primer pairs were selected using Steinmetz microarray data [54,64] to have three positive pairs (Pol2P1, Pol2P2 and Pol2P3) and one negative pair (Pol2P4). Positive targets were all significantly enriched over the negative control.