| Literature DB >> 25711446 |
Gareth Highnam1, Jason J Wang1, Dean Kusler1, Justin Zook2, Vinaya Vijayan3, Nir Leibovich1, David Mittelman4.
Abstract
The standardization and performance testing of analysis tools is a prerequisite to widespread adoption of genome-wide sequencing, particularly in the clinic. However, performance testing is currently complicated by the paucity of standards and comparison metrics, as well as by the heterogeneity in sequencing platforms, applications and protocols. Here we present the genome comparison and analytic testing (GCAT) platform to facilitate development of performance metrics and comparisons of analysis tools across these metrics. Performance is reported through interactive visualizations of benchmark and performance testing data, with support for data slicing and filtering. The platform is freely accessible at http://www.bioplanet.com/gcat.Entities:
Mesh:
Year: 2015 PMID: 25711446 PMCID: PMC4351570 DOI: 10.1038/ncomms7275
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Benchmarking the accuracy of read alignments and the calibration of mapping quality scores.
Mapping benchmarks were performed using simulated paired-end 100-bp Illumina reads. (a) The ROC-like curve illustrates, for each mapper, the number of incorrectly mapped reads as a function of correctly mapped reads, sorted by map quality. As such, greater accuracy is graphically represented as a lower curve that is farther right. Mapping quality thresholds begin at the highest quality and then progressively decrease. (b) To directly characterize mapping quality scores, a histogram indicates the distribution of incorrect reads across normalized mapping quality scores for various tools. Read count is displayed on a log scale, and mapping qualities are binned by 10%.
Figure 2Performance testing variant callers.
The Genome in a Bottle confident call set is used as the ‘ground truth’ for the NA12878 genome. Variant calling pipelines are evaluated based on their concordance to the confident call set in the high-confidence regions. (a) Precision, sensitivity and specificity metrics are shown for pipelines in which various mappers are used to generate the read alignments, but the same variant caller, GATK UnifiedGenotyper, is used to identify variants. (b) Precision, sensitivity and specificity metrics are shown for Illumina’s Isaac pipeline compared with three pipelines in which the same mapper, Novoalign3, was used to generate read alignments and different variant callers were used. (c) True-positive rate (TP/(TP+FN)) is plotted as a ROC-like curve and as a function of false-positive rate (FP/(FP+TN)), sorted by the variant quality score threshold. For each threshold, sites with variant quality scores above the given threshold are counted as true or false positives, and sites with variant quality scores below the given threshold are counted as true or false negatives. (d) Variant calling precision as a function of read depth for the different pipelines. The abbreviations ‘UG’ and ‘HC’ represent UnifiedGenotyper and HaplotypeCaller, respectively.