| Literature DB >> 27489955 |
Yongchao Liu1, Martin Loewer2, Srinivas Aluru3, Bertil Schmidt4.
Abstract
BACKGROUND: Various approaches to calling single-nucleotide variants (SNVs) or insertion-or-deletion (indel) mutations have been developed based on next-generation sequencing (NGS). However, most of them are dedicated to a particular type of mutation, e.g. germline SNVs in normal cells, somatic SNVs in cancer/tumor cells, or indels only. In the literature, efficient and integrated callers for both germline and somatic SNVs/indels have not yet been extensively investigated.Entities:
Keywords: Bayesian model; Indel calling; SNP calling; Somatic SNV calling
Mesh:
Year: 2016 PMID: 27489955 PMCID: PMC4977481 DOI: 10.1186/s12918-016-0300-5
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Performance and runtimes on SMASH synthetic benchmark
| Caller | SNP calling (%) | Indel calling (%) | Time(s) | ||||
|---|---|---|---|---|---|---|---|
| Recall | Precision |
| Recall | Precision |
| ||
| Venter | |||||||
| SNVSniffer(M1) |
| 97.1 | 97.8 | 68.9 | 83.0 | 75.3 |
|
| SNVSniffer(M2) | 98.3 | 97.4 | 97.8 | 69.0 | 83.7 | 75.6 | 320 |
| SNVSniffer(M3) | 97.9 | 98.5 | 98.2 | 70.4 | 83.6 | 76.4 | 1331 |
| SAMtools | 98.3 | 97.1 | 97.7 | 63.7 | 69.5 | 66.5 | 2046 |
| GATK | 98.1 |
|
|
|
|
| 2538 |
| FaSD | 98.0 | 97.5 | 97.7 | − | − | − | 2005 |
| Contaminated venter | |||||||
| SNVSniffer(M1) | 98.0 | 97.4 | 97.7 | 69.0 | 84.0 | 75.8 |
|
| SNVSniffer(M2) | 97.8 | 97.8 | 97.8 | 68.7 | 84.5 | 75.8 | 336 |
| SNVSniffer(M3) | 97.3 |
|
| 69.7 | 84.3 | 76.3 | 1387 |
| SAMtools |
| 97.3 | 97.7 | 62.7 | 72.1 | 67.1 | 2046 |
| GATK | 97.9 | 96.8 | 97.3 |
|
|
| 2803 |
| FaSD | 97.4 | 97.5 | 97.4 | − | − | − | 2070 |
Best results are highlighted in boldface
Performance and runtimes on SMASH sampled human benchmark
| Caller | NA12878 | NA12878+ | NA18507 | NA19240 |
|---|---|---|---|---|
| Sensitivity (%) | ||||
| SNVSniffer(M1) | 98.9 | 98.9 | 99.0 | 99.1 |
| SNVSniffer(M2) | 98.9 | 98.9 | 98.9 | 99.0 |
| SNVSniffer(M3) | 98.8 | 98.8 | 98.9 | 99.0 |
| SAMtools |
|
| 99.3 | 99.4 |
| GATK | 99.1 | 99.1 |
|
|
| FaSD | 98.9 | 98.9 | 99.1 | 99.2 |
| Time(s) | ||||
| SNVSniffer(M1) |
|
|
|
|
| SNVSniffer(M2) | 560 | 541 | 474 | 1065 |
| SNVSniffer(M3) | 2550 | 2543 | 2093 | 3379 |
| SAMtools | 3730 | 3694 | 3147 | 3379 |
| GATK | 6541 | 6249 | 6321 | 5936 |
| FaSD | 2132 | 2054 | 1979 | 2068 |
Best results are highlighted in boldface
Performance and runtimes on GCAT Illumina 150× exome sequencing data
| Caller | SNP calling (%) | Indel calling (%) | Time(s) | |||
|---|---|---|---|---|---|---|
| Sensitivity | Specificity |
| Sensitivity | Specificity | ||
| SNVSniffer(M1) | 94.86 | 99.9982 | 2.223 | 50.48 | 99.9981 |
|
| SNVSniffer(M2) | 94.86 | 99.9983 | 2.235 | 50.94 | 99.9981 | 931 |
| SNVSniffer(M3) | 94.69 | 99.9982 |
| 50.94 | 99.9981 | 7597 |
| SAMtools |
|
| 1.450 | 43.08 | 99.9987 | 12825 |
| GATK | 97.31 | 99.9982 | 1.920 |
|
| 16568 |
| FaSD | 79.83 | 99.9922 | 1.123 | − | − | 17986 |
Best results are highlighted in boldface
Somatic SNV calling performance comparison
| Metric | Error | SNVSniffer(M1) | SomaticSniper | VarScan2 | JSM2 | MuTect |
|---|---|---|---|---|---|---|
| Recall | 1.0 % |
| 86.62 | 77.43 | 91.26 | 93.47 |
| 1.5 % |
| 86.62 | 80.67 | 91.24 | 93.47 | |
| 2.0 % |
| 86.62 | 80.50 | 91.23 | 93.47 | |
| Precision | 1.0 % | 95.87 |
| 95.02 | 99.68 | 84.76 |
| 1.5 % | 95.86 |
| 95.29 | 99.68 | 84.76 | |
| 2.0 % | 95.85 |
| 95.36 | 99.68 | 84.76 | |
|
| 1.0 % |
| 92.73 | 85.33 | 95.28 | 88.90 |
| 1.5 % | 95.17 | 92.73 | 87.37 |
| 88.90 | |
| 2.0 % | 94.92 | 92.74 | 87.30 |
| 88.90 | |
| Time(s) | 1.0 % | 275 |
| 1828 | 1364 | 3677 |
| 1.5 % | 273 |
| 1947 | 1361 | 3644 | |
| 2.0 % | 271 |
| 1990 | 1368 | 3658 |
Best results are highlighted in boldface
Somatic indel calling performance comparison
| Error | Caller | Recall | Precision |
|
|---|---|---|---|---|
| 1.0 % | SNVSniffer(M1) |
|
|
|
| VarScan2 | 14.91 | 17.71 | 16.19 | |
| 1.5 % | SNVSniffer(M1) |
|
|
|
| VarScan2 | 15.26 | 17.53 | 16.32 | |
| 2.0 % | SNVSniffer(M1) |
|
|
|
| VarScan2 | 14.83 | 17.22 | 15.94 |
Best results are highlighted in boldface
Fig. 1Recall on virtual tumors in the function of tumor purity
Fig. 2Precision on virtual tumors in the function of tumor purity
Fig. 3F-score on virtual tumors in the function of tumor purity
Sensitivity and runtimes comparison using real tumors
| Dataset | SNVSniffer(M1) | VarScan2 | SomaticSniper | JSM2 | MuTect |
|---|---|---|---|---|---|
| Sensitivity (%) | |||||
| T1 | 66.86 | 38.29 | 61.148 | 0.001 |
|
| T2 | 82.35 | 35.29 | 63.73 | 0.00 |
|
| T3 | 72.36 | 59.35 | 75.61 | 0.00 |
|
| T4 | 93.55 | 83.87 | 87.10 | 0.00 |
|
| T5 | 75.00 | 43.75 | 68.75 | 0.00 |
|
| Time (h) | |||||
| T1 | 2.67 | 5.75 | 1.58 | 10.41 | 22.87 |
| T2 | 2.56 | 4.73 | 1.33 | 9.48 | 19.37 |
| T3 | 2.41 | 4.30 | 1.34 | 7.39 | 19.27 |
| T4 | 2.91 | 5.26 | 1.39 | 10.00 | 24.16 |
| T5 | 2.87 | 4.75 | 1.55 | 8.11 | 20.50 |
Best results are highlighted in boldface
Specificity and runtime comparison using real tumors
| Caller | FP | Specificity (%) | Time (h) |
|---|---|---|---|
| SNVSniffer(M1) | 12387 | 99.9995 | 1.8 |
| SomaticSniper | 209530 | 99.9923 |
|
| VarScan2 |
|
| 2.9 |
| JSM2 | 38550 | 99.9986 | 5.0 |
| MuTect | 2463700 | 99.9096 | 8.7 |
Best results are highlighted in boldface
Fig. 4Program diagram of SNVSniffer for variant calling
Type classification of somatic SNVs
|
| AA | AB | BB |
|---|---|---|---|
| AA | Wild | Somatic | Somatic |
| AB | LOH | Germline | LOH |
| BB | Unknown | Unknown | Germline |