| Literature DB >> 23148716 |
A Eliot Shearer1, Michael S Hildebrand, Harini Ravi, Swati Joshi, Angelica C Guiffre, Barbara Novak, Scott Happe, Emily M LeProust, Richard J H Smith.
Abstract
BACKGROUND: Targeted genomic enrichment (TGE) is a widely used method for isolating and enriching specific genomic regions prior to massively parallel sequencing. To make effective use of sequencer output, barcoding and sample pooling (multiplexing) after TGE and prior to sequencing (post-capture multiplexing) has become routine. While previous reports have indicated that multiplexing prior to capture (pre-capture multiplexing) is feasible, no thorough examination of the effect of this method has been completed on a large number of samples. Here we compare standard post-capture TGE to two levels of pre-capture multiplexing: 12 or 16 samples per pool. We evaluated these methods using standard TGE metrics and determined the ability to identify several classes of genetic mutations in three sets of 96 samples, including 48 controls. Our overall goal was to maximize cost reduction and minimize experimental time while maintaining a high percentage of reads on target and a high depth of coverage at thresholds required for variant detection.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23148716 PMCID: PMC3534602 DOI: 10.1186/1471-2164-13-618
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Experimental design showing pertinent steps in the TGE protocol including the indexing PCR during which barcodes are incorporated into separate DNA libraries.A) standard post-capture TGE, 96 captures are required as barcodes are incorporated during second PCR after capture and prior to sequencing, B) pre-capture multiplexing with pools of 12 (pre-capture 12a and pre-capture 12b), 8 captures required as barcodes are incorporated when adaptors are ligated prior to the first PCR and capture, and C) pre-capture multiplexing with pools of 16 (pre-capture 16), 6 captures required.
Figure 2Sequencing performance. Average results for 96 samples run in a single lane using different multiplexing methods. Bars show standard error from the mean. A) Average percent of reads mapped to the reference human genome using BWA per multiplexing method, B) Average capture efficiency, as defined by the percentage of mapped sequencing reads overlapping targeted intervals, and C) Average percent of all reads identified as duplicates and optical duplicates by Picard tools. * p < 0.01 (independent samples T-test).
Figure 3Targeted region coverage statistics. Average results for 96 samples run in a single lane using different multiplexing methods. Bars show standard error from the mean. Dashed boxes show simulated data for pre-capture 16 (see Results) in varying decreasing numbers of pools per lane. A) Coverage at 1X, 10X, and 20X, and B) Mean depth of coverage. * p < 0.01 (Independent samples T-test).
Figure 4Comparison of frequency histograms for average depth of coverage for each multiplexing method. Histograms show the average depth of coverage results for 96 samples run in a single lane using different multiplexing methods with normal curve in blue.