| Literature DB >> 27769162 |
Arvind Y M Sundaram1, Timothy Hughes1, Shea Biondi2, Nathalie Bolduc3, Sarah K Bowman4,5, Andrew Camilli6, Yap C Chew2, Catherine Couture7, Andrew Farmer3, John P Jerome8, David W Lazinski6, Andrew McUsic7, Xu Peng9, Kamran Shazand8, Feng Xu9, Robert Lyle10, Gregor D Gilfillan11.
Abstract
BACKGROUND: ChIP-seq is the primary technique used to investigate genome-wide protein-DNA interactions. As part of this procedure, immunoprecipitated DNA must undergo "library preparation" to enable subsequent high-throughput sequencing. To facilitate the analysis of biopsy samples and rare cell populations, there has been a recent proliferation of methods allowing sequencing library preparation from low-input DNA amounts. However, little information exists on the relative merits, performance, comparability and biases inherent to these procedures. Notably, recently developed single-cell ChIP procedures employing microfluidics must also employ library preparation reagents to allow downstream sequencing.Entities:
Keywords: Chromatin immunoprecipitation; HTS; Low-input; Micro-ChIP; NGS
Mesh:
Year: 2016 PMID: 27769162 PMCID: PMC5073829 DOI: 10.1186/s12864-016-3135-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Low-input library preparation methods tested in this study
| Technique | Reference/Commercial supplier | Salient details | Reported DNA input range | Sequencing platform compatibility |
|---|---|---|---|---|
| Accel-NGS® 2S (Accel-2S) | Swift Biosciences, Inc. | 5-step process of DNA repair, adapter ligation and PCR amplification. 5 purification steps. | 0.01 – 1000 ng | Illumina |
| Modified Illumina method (Bowman) | Kingston Lab [ | 4 step procedure of end-repair, A-tailing, adapter ligation and PCR. 4 purification steps. | 0.1 – 1000 ng | Illumina |
| HTML-PCR (HTML) | Camilli lab [ | 4-step procedure of end-repair, poly-C-tailing, poly-G-adapter oligo ligation and PCR. 4 purification steps. | 0.01 – 100 ng | Illumina |
| SeqPlex™ | Sigma Aldrich, Inc. | 3-stage process of semi-random primed pre-amplification, PCR amplification, and primer removal. 2 purification steps. | 0.1 – 1 ng | Agnostic (subsequent library prep required) |
| DNA SMART™ ChIP-Seq Kit (SMART) | Takara Bio USA, Inc. | 5-step procedure of denaturation, dephosphorylation, T-tailing, DNA replication and template switching by reverse transcriptase and PCR. Compatible also with ssDNA. 1 purification step. | 0.1 – 10 ng | Illumina |
| TELP | Xu lab [ | 5-step procedure of end-repair, poly-C-tailing, biotinylated primer extension, exonuclease digestion & streptavidin purification, adapter ligation and PCR. Compatible also with ssDNA. 3 purification steps. | 0.025–25 ng | Illumina |
| ThruPLEX® | Rubicon Genomics | 3 stage process of end repair, stem-loop adapter ligation and PCR amplification. 1 purification step. | 0.05–50 ng | Illumina |
Fig. 1Experimental design and sequencing read mapping. a Experimental design overview. b Genomic mapping of sequence reads. The proportion of reads that were unmapped (red), those mapping to single genomic positions (green), and those mapping to multiple locations (repeats, in blue) are illustrated. Reads mapping to single genomic positions are broken down into reads present as a unique copy, and those present in two or more identical copies (duplicates). Results shown are the mean of 5 replicates for each method, using 25 million reads per replicate. Error bars show the standard deviation from the mean
Fig. 2Library complexity and H3K4me3 ChIP signals intensity. a Library complexity curves generated using Preseq. Yield of uniquely mapping reads based on down-sampled data of 25 million reads. PCR-free (three replicates) method from 100 ng input is shown in black. Red and blue lines represent the five replicates used in 1 ng and 0.1 ng input across six low-input methods. Actual and extrapolated values are shaded in white and grey, respectively. X-axis is represented in log2 scale. b Read depth in a 4 kb window centered on known TSS. Colours as in (a)
Fig. 3Data visualized in the Integrative Genomics Viewer [47]. a H3K4me3 distribution in an 80 kb genomic region. A single example PCR-free library is shown in black at the top, and libraries derived from 1 ng and 0.1 ng are shown in grey and light grey respectively.b Data for all libraries shown as heat maps of the same genomic region. Y-axis scale in all cases is read depth 0–28
Peak calling, sensitivity and specificity
| Method | Input Amount (ng) | Mean number of peaks called | Mean number of reference peaks overlapped by sample peaks | Mean number of sample peaks not found in reference dataset | Mean sensitivity (% reference peaks detected) | Mean specificity (% method peaks found in reference dataset) |
|---|---|---|---|---|---|---|
| PCR-free | 100 | 19 221 ± 157 | 17 124 ± 0 | 2 097 ± 157 | 100 % | 89 % |
| Accel-2S | 1 | 18 190 ± 123 | 16 574 ± 38 | 1 616 ± 115 | 97 % | 91 % |
| 0.1 | 18 179 ± 124 | 16 505 ± 62 | 1 675 ± 103 | 96 % | 91 % | |
| Bowman | 1 | 19 082 ± 334 | 16 096 ± 111 | 2 986 ± 228 | 94 % | 84 % |
| 0.1 | 18 986 ± 365 | 16 155 ± 48 | 2 831 ± 384 | 94 % | 85 % | |
| SeqPlex | 1 | 21 382 ± 265 | 13 612 ± 78 | 7 770 ± 255 | 79 % | 64 % |
| 0.1 | 35 867 ± 3 861 | 13 905 ± 113 | 21 962 ± 3 902 | 81 % | 39 % | |
| SMART | 1 | 17 906 ± 765 | 15 723 ± 99 | 2 182 ± 807 | 92 % | 88 % |
| 0.1 | 19 893 ± 3 196 | 15 622 ± 166 | 4 271 ± 3 349 | 91 % | 80 % | |
| TELP | 1 | 20 529 ± 1 592 | 15 528 ± 81 | 5 001 ± 1 641 | 91 % | 76 % |
| 0.1 | 20 149 ± 1 619 | 15 370 ± 105 | 4 778 ± 1 683 | 90 % | 77 % | |
| ThruPLEX | 1 | 18 377 ± 152 | 16 462 ± 36 | 1 916 ± 138 | 96 % | 90 % |
| 0.1 | 18 015 ± 178 | 16 298 ± 44 | 1 717 ± 200 | 95 % | 90 % |
Peaks were called using MACS, using only 5.5 million uniquely mapping non-duplicate reads per sample. Peak regions present in all three PCR-free datasets (n = 17,124) were used as the reference dataset to which all other samples were compared to measure sensitivity and specificity. Data presented are mean +/− standard deviation
Fig. 4Correlation of peak calling and reproducibility of datasets. a Number of overlapping peaks found in all 5 replicates for the 1 ng and 0.1 ng input methods, and all three PCR-free replicates. b 2d density estimation of number of read bases within PCR-free peaks (present in all three replicates) against the number of reads bases within the same regions for all replicates of other methods. Data calculated using 5.5 million uniquely mapping non-duplicate reads for all methods, with 1 ng and 0.1 ng input level. Only reads mapping to peaks found in the PCR-free datasets were analyzed and included in correlation calculations. Spearman correlation coefficients for each method are given. Black line represents slope 1 and is provided for reference. c Irreproducible discovery rate (IDR) at different numbers of selected peaks, plotted at various IDR cutoffs for all methods. IDR for PCR-free (100 ng) method is shown in both 1 ng and 0.1 ng input panels for illustration