| Literature DB >> 27604516 |
Steve Laurie1,2, Marcos Fernandez-Callejo1,2, Santiago Marco-Sola1,2, Jean-Remi Trotta1,2, Jordi Camps1,2, Alejandro Chacón3, Antonio Espinosa3, Marta Gut1,2, Ivo Gut1,2, Simon Heath1,2, Sergi Beltran1,2.
Abstract
As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next-generation sequencing as standard practice in research and diagnostics. However, computing cost-performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state-of-the-art read aligners (BWA-MEM and GEM3) and variant callers (FreeBayes, GATK HaplotypeCaller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available.Entities:
Keywords: NA12878; NGS; alignment; benchmark; bioinformatics; computing speed; variant calling; whole exome sequencing; whole genome sequencing
Mesh:
Year: 2016 PMID: 27604516 PMCID: PMC5129537 DOI: 10.1002/humu.23114
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Summary of Variant Calling for Eight Pipelines for the WGS Sample
| Dataset | Total calls | TP | FP | FN | Specificity | Sensitivity | F1 score |
|---|---|---|---|---|---|---|---|
| Whole genome SNVs | |||||||
| NIST v2.18 Gold Standard | 2,740,732 | ||||||
| BWA‐MEM‐MEM + FreeBayes | 2,744,545 | 2,738,200 | 6,345 | 2,532 | 0.99769 | 0.99908 | 0.99838 |
| BWA‐MEM + HaplotypeCaller | 2,748,582 | 2,738,426 | 10,156 | 2,306 | 0.99631 | 0.99916 | 0.99773 |
| BWA‐MEM + SAMtools fast | 2,748,866 | 2,738,489 | 10,377 | 2,243 | 0.99622 | 0.99918 | 0.99770 |
| BWA‐MEM + SAMtools normal | 2,736,410 | 2,732,882 | 3,528 | 7,850 | 0.99871 | 0.99714 | 0.99792 |
| GEM3 + FreeBayes | 2,742,937 | 2,735,581 | 7,356 | 5,151 | 0.99732 | 0.99812 | 0.99772 |
| GEM3 + HaplotypeCaller | 2,745,423 | 2,738,414 | 7,009 | 2,318 | 0.99745 | 0.99915 | 0.99830 |
| GEM3 + SAMtools fast | 2,749,554 | 2,736,718 | 12,836 | 4,014 | 0.99533 | 0.99854 | 0.99693 |
| GEM3 + SAMtools normal | 2,736,871 | 2,732,313 | 4,558 | 8,419 | 0.99833 | 0.99693 | 0.99763 |
| Whole genome deletions | |||||||
| NIST v2.18 Gold Standard | 85,958 | ||||||
| BWA‐MEM + FreeBayes | 82,263 | 75,674 | 6,589 | 10,284 | 0.91990 | 0.88036 | 0.89970 |
| BWA‐MEM + HaplotypeCaller | 86,323 | 84,789 | 1,534 | 1,169 | 0.98223 | 0.98640 | 0.98431 |
| BWA‐MEM + SAMtools fast | 77,671 | 68,591 | 9,080 | 17,367 | 0.88310 | 0.79796 | 0.83837 |
| BWA‐MEM + SAMtools normal | 77,712 | 68,615 | 9,097 | 17,343 | 0.88294 | 0.79824 | 0.83846 |
| GEM3 + FreeBayes | 81,602 | 76,002 | 5,600 | 9,956 | 0.93137 | 0.88418 | 0.90716 |
| GEM3 + HaplotypeCaller | 86,132 | 84,783 | 1,349 | 1,175 | 0.98434 | 0.98633 | 0.98533 |
| GEM3 + SAMtools fast | 80,905 | 69,096 | 11,809 | 16,862 | 0.85404 | 0.80383 | 0.82818 |
| GEM3 + SAMtools normal | 80,955 | 69,124 | 11,831 | 16,834 | 0.85386 | 0.80416 | 0.82826 |
| Whole genome insertions | |||||||
| NIST v2.18 Gold Standard | 84,583 | ||||||
| BWA‐MEM + FreeBayes | 78,592 | 73,890 | 4,702 | 10,693 | 0.94017 | 0.87358 | 0.90565 |
| BWA‐MEM + HaplotypeCaller | 84,521 | 83,473 | 1,048 | 1,110 | 0.98760 | 0.98688 | 0.98724 |
| BWA‐MEM + SAMtools fast | 79,762 | 69,389 | 10,373 | 15,194 | 0.86995 | 0.82037 | 0.84443 |
| BWA‐MEM + SAMtools normal | 79,762 | 69,396 | 10,366 | 15,187 | 0.87004 | 0.82045 | 0.84452 |
| GEM3 + FreeBayes | 78,417 | 73,154 | 5,263 | 11,429 | 0.93288 | 0.86488 | 0.89760 |
| GEM3 + HaplotypeCaller | 83,973 | 83,189 | 784 | 1,394 | 0.99066 | 0.98352 | 0.98708 |
| GEM3 + SAMtools fast | 92,775 | 71,928 | 20,847 | 12,655 | 0.77530 | 0.85038 | 0.81111 |
| GEM3 + SAMtools normal | 92,795 | 71,938 | 20,857 | 12,645 | 0.77524 | 0.85050 | 0.81113 |
TP, true positives; FP, false positives; FN, false negatives; specificity, number of TP calls as a proportion of total calls; sensitivity, number of TP calls as a proportion of the number of NIST reference set calls; F1‐score, measure of overall accuracy calculated as (2 × TP)/((2 × TP) + FP + FN).
Figure 1Venn diagrams illustrating concordance of variant identification for BWA‐MEM alignments. Separate Venn diagrams show the number and percentage of concordant calls for a particular variant type by pipeline for the NIST reliably callable regions, the NIST non‐reliably callable but mappable regions, and the NIST non‐reliably callable regions. SNVs, single nucleotide variants; Dels, deletions; Ins, insertions.
Figure 2Venn diagrams illustrating concordance of variant identification for GEM3 alignments. Separate Venn diagrams show the number and percentage of concordant calls for a particular variant type by pipeline for the NIST reliably callable regions, the NIST non‐reliably callable but mappable regions, and the NIST non‐reliably callable regions. SNVs, single nucleotide variants; Dels, deletions; Ins, insertions.
Figure 3Venn diagrams illustrating concordance of WGS and WES variant identification. Separate Venn diagrams show the number and percentage of concordant calls for a particular variant type for the GEM3 pipelines for variants identified in the intersection of the NIST reliably callable region and exome capture regions (∼34.7 MB). SNVs, single nucleotide variants.
Elapsed and CPU Times (Minutes), and Maximum RAM (GB) Requirements for Alignment Algorithms
| Exome | Genome | |||||||
|---|---|---|---|---|---|---|---|---|
| BWA‐MEM | GEM3 | BWA‐MEM | GEM3 | |||||
| Total | Largest Chunk | Total | Largest Chunk | Total | Largest Chunk | Total | Largest Chunk | |
| Elapsed time | 16 | 7 | 6 | 2.5 | 570 | 149 | 120 | 31 |
| CPU time | 468 | 192 | 162 | 66 | 17,706 | 4,633 | 3,458 | 904 |
| Max RAM | 7.6 | 15.4 | 8.3 | 15.7 | ||||
Alignment was performed on a compute node that allowed for up to 32 processing threads, supported by 256 GB of available RAM. Elapsed time is the real time required for the process to complete, whereas CPU time is the sum of the times that CPU threads were actively processing. Exome data came from 3 FASTQ files, whereas WGS came from 4 FASTQ files, the reads of which were mapped independently. The largest chunk time is the maximum time required for mapping of the largest individual chunk of a particular dataset.
Elapsed and CPU Times (Minutes) for Variant Calling
| BWA‐MEM Exome | GEM3 Exome | BWA‐MEM WGS | GEM3 WGS | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Total elapsed time | CPU time | Total elapsed time | CPU time | Total elapsed time | Largest chunk elapsed time | CPU time | Total elapsed time | Largest chunk elapsed time | CPU time | |
| FreeBayes | 45 | 558 | 41 | 497 | 155 | N/A | 2,104 | 165 | N/A | 2,156 |
| HaplotypeCaller | 362 | 1,119 | 269 | 1,284 | 2,155 | 241 | 7,930 | 2,681 | 238 | 8,231 |
| SAMtools normal | 199 | 201 | 211 | 213 | 3,045 | 250 | 3,025 | 3,559 | 334 | 3,537 |
| SAMtools fast | 103 | 104 | 114 | 116 | 1,328 | 117 | 1,309 | 1,610 | 139 | 1,591 |
Elapsed time is the real time required for the process to complete, whereas CPU time is the sum of the times that CPU threads were actively processing. For exome samples, variant calls were generated in a single process, whereas for WGS samples variant calling was performed on each chromosome independently, except in the case of FreeBayes which was sufficiently fast that separating into chunks was unnecessary.