| Literature DB >> 31874647 |
Qing Wang1, Vassiliki Kotoula2,3, Pei-Chen Hsu1,4, Kyriaki Papadopoulou3, Joshua W K Ho1,5,6, George Fountzilas3,7, Eleni Giannoulatou8,9.
Abstract
BACKGROUND: The application of next-generation sequencing in cancer has revealed the genomic landscape of many tumour types and is nowadays routinely used in research and clinical settings. Multiple algorithms have been developed to detect somatic variation from sequencing data using either paired tumour-blood or tumour-only samples. Most of these methods have been developed and evaluated for the identification of somatic variation using Illumina sequencing datasets of moderate coverage. However, a comprehensive evaluation of somatic variant detection algorithms on Ion Torrent targeted deep sequencing data has not been performed.Entities:
Keywords: Cancer genome; Ion torrent deep sequencing; Methods evaluation; Mutational signature; Read depth; Somatic variant calling
Mesh:
Year: 2019 PMID: 31874647 PMCID: PMC6929331 DOI: 10.1186/s12920-019-0636-y
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Features of the variant calling tools that were compared for somatic variant detection from IONT data
| Somatic variant caller | Statistical Method | Version | Variant Types | Somatic Variation Identification strategy | Input | Output |
|---|---|---|---|---|---|---|
| Torrent Variant Caller (TVC) | Subtraction | 4.2 | Somatic, germline | Call tumour and normal sample separately | Original BAM file | VCF file |
| MuTect2 | Bayesian classifier | 4.0 | Somatic | Call paired tumour-normal or tumour-only samples | Sorted, indexed BAM file | VCF file |
| VarScan2 | Fisher’s Exact Test | 2.2.3 | Somatic, germline | Call paired tumour-normal sample only | Pileup BAM file | VCF file |
Fig. 1Concordance and discrepancy of the three somatic variant callers. The Venn diagrams illustrate the total counts and comparison of somatic mutations called by each of the somatic variant callers for a SNVs and b INDELs identified from paired tumour-blood samples as well as c SNVs and d INDELs from unpaired tumour-only samples. The numbers in the parenthesis reveal the percentage of variants against all the variants
Fig. 2Comparison between SNVs identified by the three somatic variant callers. The Y-axis of each scatter plot indicates the read depth of SNVs in tumour tissue sample, the X-axis shows the VAF of each SNV. The dots represent SNVs from tumour-blood pairs. The colour indicates the variant calling method. a All callers: SNVs identified by all three callers; Two callers: SNVs identified by any two of the three callers. b MuTect2: SNVs uniquely identified by MuTect2. c Torrent: SNVs uniquely identified by TVC. d VarScan2: SNVs uniquely identified by VarScan2
Fig. 3Mutational signatures of SNVs called from paired samples. The Y-axis reveals the frequency of each mutation type in the whole human genome, the X-axis indicates the mutation types according to 96 possible substitution types defined by the bases 5′ and 3′ to the mutated base. Signature A: mutational signature inferred from SNVs detected by all three callers; Signature M: mutational signature inferred from all the SNVs detected by MuTect2; Signature T: mutational signature inferred from all the SNVs detected by TVC; Signature V: mutational signature inferred from all the SNVs detected by VarScan2
Correlation between detected mutational signatures and the COSMIC ovarian cancer mutational signatures
| Signature 1 | Signature 3 | Signature 5 | |
|---|---|---|---|
| Signature A (All) | 0.44 (< 0.05) | 0.06 (0.59) | 0.64 (< 0.05) |
| Signature M (MuTect2) | 0.27 (< 0.05) | 0.09 (0.37) | 0.46 (< 0.05) |
| Signature T (TVC) | 0.34 (< 0.05) | 0.08 (0.43) | 0.57 (< 0.05) |
| Signature V (VarScan2) | 0.04 (< 0.05) | −0.01 (0.90) | 0.24 (< 0.05) |
The number of pathogenic mutations identified in 208 paired tumour-blood samples in five genes that are frequently mutated in ovarian cancer. The number of unique mutations is shown and in brackets the number of samples with at least one mutation is shown
| MuTect2 | 1056 ( | 364 ( | 51 ( | 45 ( | 738 ( |
| TVC | 126 ( | 30 ( | 10 ( | 16 ( | 139 ( |
| VarScan2 | 94 ( | 57 ( | 7 ( | 10 (20) | 111 ( |
| Common | 33 ( | 8 ( | 7 ( | 7 ( | 81 ( |
| Common (%) | 2.9% (37%) | 2.1% (16.4%) | 13.7% (10.8%) | 14.9% (10.6%) | 10.7% (57.8%) |
The number of pathogenic mutations identified in 253 unpaired tumour samples in five genes that are frequently mutated in ovarian cancer. The number of unique mutations is shown and in brackets the number of samples with at least one mutation is shown
| MuTect2 | 1190 ( | 390 ( | 56 ( | 47 ( | 820 ( |
| TVC | 164 ( | 54 ( | 11 ( | 18 ( | 215 ( |
| Common | 163 ( | 52 ( | 11 ( | 14 ( | 210 ( |
| Common (%) | 13.7% (34.4%) | 13.3% (16.9%) | 19.6% (16.8%) | 27.5% (12.7%) | 25.5% (69.8%) |