| Literature DB >> 21559511 |
Dale J Hedges1, Toumy Guettouche, Shan Yang, Guney Bademci, Ashley Diaz, Ashley Andersen, William F Hulme, Sara Linker, Arpit Mehta, Yvonne J K Edwards, Gary W Beecham, Eden R Martin, Margaret A Pericak-Vance, Stephan Zuchner, Jeffery M Vance, John R Gilbert.
Abstract
Despite the ever-increasing throughput and steadily decreasing cost of next generation sequencing (NGS), whole genome sequencing of humans is still not a viable option for the majority of genetics laboratories. This is particularly true in the case of complex disease studies, where large sample sets are often required to achieve adequate statistical power. To fully leverage the potential of NGS technology on large sample sets, several methods have been developed to selectively enrich for regions of interest. Enrichment reduces both monetary and computational costs compared to whole genome sequencing, while allowing researchers to take advantage of NGS throughput. Several targeted enrichment approaches are currently available, including molecular inversion probe ligation sequencing (MIPS), oligonucleotide hybridization based approaches, and PCR-based strategies. To assess how these methods performed when used in conjunction with the ABI SOLID3+, we investigated three enrichment techniques: Nimblegen oligonucleotide hybridization array-based capture; Agilent SureSelect oligonucleotide hybridization solution-based capture; and Raindance Technologies' multiplexed PCR-based approach. Target regions were selected from exons and evolutionarily conserved areas throughout the human genome. Probe and primer pair design was carried out for all three methods using their respective informatics pipelines. In all, approximately 0.8 Mb of target space was identical for all 3 methods. SOLiD sequencing results were analyzed for several metrics, including consistency of coverage depth across samples, on-target versus off-target efficiency, allelic bias, and genotype concordance with array-based genotyping data. Agilent SureSelect exhibited superior on-target efficiency and correlation of read depths across samples. Nimblegen performance was similar at read depths at 20× and below. Both Raindance and Nimblegen SeqCap exhibited tighter distributions of read depth around the mean, but both suffered from lower on-target efficiency in our experiments. Raindance demonstrated the highest versatility in assay design.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21559511 PMCID: PMC3084696 DOI: 10.1371/journal.pone.0018595
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Enrichment methods performed for each sample.
| sample_id | source | Agilent SureSelect | Nimbelgen SeqCap | Raindance |
|
| ||||
| s1 | blood | Y | Y | Y |
| s2 | blood | Y | Y | Y |
| s3 | blood | Y | Y | Y |
| s4 | blood | Y | Y | Y |
| s5 | blood | Y | Y | Y |
| s6 | blood | Y | Y | Y |
|
| ||||
| s7 | blood | Y | N | N |
| s8 | blood | N | Y | N |
| s9 | blood | N | N | Y |
| s10 | blood | N | N | Y |
| s11 | blood | N | N | Y |
| s12 | blood | N | N | Y |
| s13 | blood | Y | Y | N |
| s14 | blood | Y | Y | N |
| s15 | cell | Y | Y | N |
| s16 | cell | Y | Y | N |
| s17 | blood | Y | N | Y |
| s18 | blood | N | Y | Y |
Oligonucleotide sequences used, in conjunction with the Universal Probe Library # 149 (Roche), for qPCR of the SOLiD sequencing library.
| Sequence | Sequence Name |
|
|
|
|
|
|
Percent on-target, matched sample sets (N = 6).
| SampleID | Nimblegen | Agilent | Raindance |
|
| 53.33 | 60.79 | 52.50 |
|
| 53.35 | 61.45 | 49.90 |
|
| 49.64–57.31 | 56.45–63.09 | 44.71–63.57 |
|
| 2.91 | 2.46 | 7.11 |
Percent on-target, unmatched sample sets (N = 6).
| SampleID | Nimblegen | Agilent | Raindance |
|
| 55.56 | 61.64 | 46.18 |
|
| 56.13 | 62.56 | 45.28 |
|
| 53.29–57.75 | 54.48–68.55 | 38.05–52.65 |
|
| 1.86 | 5.09 | 5.65 |
Figure 1Depth of sequencing coverage of matched sample set (N = 6 unique samples).
Percent of on-target bases (y-axis) covered at a given sequence depth (x-axis). On target percentage calculated as the fraction of nucleotide bases falling on targeted regions divided by the total number of nucleotides mapping anywhere in the genome. Thick lines represent average coverage for each platform (Agilent Surelect = blue circles; Nimblegen SeqCap = green triangles; Raindance parallel PCR = red diamonds). Dashed lines represent two standard deviations above and below the average for each platform.
Figure 2Depth of sequencing coverage of matched sample set (N = 6 samples (per method)).
Percent of on-target bases (y-axis) covered at a given sequence depth (x-axis). On target percentage calculated as the fraction of nucleotide bases falling on targeted regions divided by the total number of nucleotides mapping anywhere in the genome. Thick lines represent average coverage for each platform (Agilent Surelect = blue circles; Nimblegen SeqCap = green triangles; Raindance parallel PCR = red diamonds). Dashed lines represent two standard deviations above and below the average for each platform.
Figure 3Kernel density of coverage depth.
Depicts kernel density function for the three enrichment platforms studied. The set of coverage depth values at each target position were pooled across all individuals from the matched sample set and the frequency of values at each depth were used to calculate the density function.
Figure 4Pearson correlation matrix for coverage depth.
Pearson correlation matrix depicting sample to sample comparisons for each independent platform. Only matched samples (i.e. individual samples that were separately enriched on across all three platforms) were used for this analysis. Cells with higher correlation values appear in darker shades.
Genotype concordance.
| Sample ID | matches/comparisons | Agilent concord. | matches/comparisons | Nimblegen concord. | matches/comparisons | Raindance concord. |
| s2 | 186/187 | 0.99 | 191/192 | 0.99 | 170/173 | 0.98 |
| s3 | 190/190 | 1.00 | 191/192 | 0.99 | 167/172 | 0.97 |
| s4 | 188/188 | 1.00 | 192/192 | 1.00 | 177/179 | 0.99 |
| s5 | 186/187 | 0.99 | 190/191 | 0.99 | 176/178 | 0.99 |
| s6 | 186/186 | 1.00 | 188/188 | 1.00 | 177/180 | 0.98 |
|
| 0.998 | 0.997 | 0.983 | |||
|
|
|
|
|