| Literature DB >> 29945233 |
Sarah Sandmann1, Mohsen Karimi2, Aniek O de Graaf3, Christian Rohde4, Stefanie Göllner4, Julian Varghese1, Jan Ernsting1, Gunilla Walldin5, Bert A van der Reijden3, Carsten Müller-Tidow4, Luca Malcovati6, Eva Hellström-Lindberg5, Joop H Jansen3, Martin Dugas1.
Abstract
Motivation: The application of next-generation sequencing in research and particularly in clinical routine requires valid variant calling results. However, evaluation of several commonly used tools has pointed out that not a single tool meets this requirement. False positive as well as false negative calls necessitate additional experiments and extensive manual work. Intelligent combination and output filtration of different tools could significantly improve the current situation.Entities:
Mesh:
Year: 2018 PMID: 29945233 PMCID: PMC6289140 DOI: 10.1093/bioinformatics/bty518
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Overview of the analysis performed by appreci8
Fig. 2.General principle of filtration with appreci8. Calls are classified as ‘Mutations’, ‘Polymorphism’ or ‘Artifact’ on the basis of an artifact- and a polymorphism score
Main characteristics of the training- and test sets analyzed with appreci8
| Set | n | Sequencer | Disease | Target [bp] | Coverage | Background | ||
|---|---|---|---|---|---|---|---|---|
| all | Coding | >50x (%) | noise | |||||
| Training | 1 | 54 | Illumina HiSeq | MDS | 42 322 | 23 162 | 95 | |
| 2 | 111 | Illumina NextSeq | MDS | 42 322 | 23 162 | 97 | ||
| Test | 1 | 237 | Illumina HiSeq | MDS | 42 322 | 23 162 | 92 | |
| 2 | 46 | Illumina HiSeq | MDS | 42 322 | 23 162 | 93 | ||
| 3 | 89 | Roche 454 | MDS | 42 322 | 23 162 | 84 | ||
| 4 | 22 | Illumina NextSeq | AML | 125 459 | 78 866 | 99 | ||
| 5 | 119 | Illumina HiScanSQ | AML | 958 547 | 218 179 | 94 | ||
Fig. 3.Relation between positive predictive value and sensitivity in case of GATK, Platypus, VarScan, LoFreq, FreeBayes, SNVer, SAMtools, VarDict, the combined output of all tools (eight tools), single-appreci8 and appreci8 in training sets 1 and 2
Positive predictive value and sensitivity in case of GATK, Platypus, VarScan, LoFreq, FreeBayes, SNVer, SAMtools, VarDict, the combined output of all tools (eight tools), single-appreci8 and appreci8 in training sets 1 and 2
| Approach | Training set 1 | Training set 2 | ||
|---|---|---|---|---|
| Sens | PPV | Sens | PPV | |
| GATK | 0.92 | 0.85 | 0.82 | 0.71 |
| Platypus | 0.93 | 0.80 | 0.83 | 0.42 |
| VarScan | 0.89 | 0.97 | 0.47 | 0.73 |
| LoFreq | 0.91 | 0.35 | 0.78 | 0.23 |
| FreeBayes | 1.00 | 0.03 | 0.99 | 0.02 |
| SNVer | 0.93 | 0.92 | 0.55 | 0.07 |
| SAMtools | 0.85 | 0.87 | 0.64 | 0.77 |
| VarDict | 0.97 | 0.96 | 0.94 | 0.15 |
| 8 tools | 1.00 | 0.03 | 1.00 | 0.02 |
| single-appreci8 | 0.98 | 0.98 | 0.99 | 0.35 |
| appreci8 | 0.98 | 0.99 | 0.98 | 0.94 |
Fig. 4.Relation between positive predictive value and sensitivity in case of GATK, Platypus, VarScan, LoFreq, FreeBayes, SNVer, SAMtools, VarDict, the combined output of all tools (eight tools), single-appreci8 and appreci8 in test sets 1–5
Positive predictive value and sensitivity in case of GATK, Platypus, VarScan, LoFreq, FreeBayes, SNVer, SAMtools, VarDict, the combined output of all tools (eight tools), single-appreci8 and appreci8 in test sets 1–5
| Approach | Test set 1 | Test set 2 | Test set 3 | Test set 4 | Test set 5 | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Sens | PPV | Sens | PPV | Sens | PPV | Sens | PPV | Sens | PPV | |
| GATK | 0.92 | 0.82 | 0.90 | 0.73 | 0.95 | 0.31 | 0.86 | 0.73 | 0.92 | 0.94 |
| Platypus | 0.93 | 0.81 | 0.93 | 0.83 | 0.97 | 0.13 | 0.91 | 0.30 | 0.90 | 0.94 |
| VarScan | 0.84 | 0.84 | 0.82 | 0.73 | 0.91 | 0.51 | 0.73 | 0.55 | 0.71 | 0.94 |
| LoFreq | 0.98 | 0.29 | 0.98 | 0.71 | 0.94 | 0.31 | 0.73 | 0.01 | 0.79 | 0.88 |
| FreeBayes | 0.99 | 0.02 | 1.00 | 0.01 | 0.99 | 0.07 | 0.95 | 0.01 | 0.96 | 0.25 |
| SNVer | 0.91 | 0.91 | 0.94 | 0.81 | 0.97 | 0.10 | 0.64 | 0.04 | 0.73 | 0.98 |
| SAMtools | 0.82 | 0.83 | 0.81 | 0.75 | 0.93 | 0.68 | 0.70 | 0.53 | 0.86 | 0.99 |
| VarDict | 0.96 | 0.78 | 0.99 | 0.30 | 0.99 | 0.25 | 0.91 | 0.09 | 0.97 | 0.83 |
| 8 tools | 1.00 | 0.02 | 1.00 | 0.01 | 0.99 | 0.05 | 0.99 | 0.01 | 1.00 | 0.25 |
| single-appreci8 | 0.99 | 0.88 | 1.00 | 0.76 | 0.99 | 0.36 | 0.93 | 0.19 | 0.96 | 0.97 |
| appreci8 | 0.98 | 0.99 | 1.00 | 0.99 | 0.99 | 0.76 | 0.93 | 0.65 | 1.00 | 1.00 |