| Literature DB >> 26647970 |
Tyler S Alioto1,2, Ivo Buchhalter3,4, Sophia Derdak1,2, Barbara Hutter4, Matthew D Eldridge5, Eivind Hovig6,7, Lawrence E Heisler8, Timothy A Beck8, Jared T Simpson8, Laurie Tonon9, Anne-Sophie Sertier9, Ann-Marie Patch10,11, Natalie Jäger3,12, Philip Ginsbach3, Ruben Drews3, Nagarajan Paramasivam3, Rolf Kabbe3, Sasithorn Chotewutmontri13, Nicolle Diessl13, Christopher Previti13, Sabine Schmidt13, Benedikt Brors4, Lars Feuerbach4, Michael Heinold4, Susanne Gröbner14, Andrey Korshunov15, Patrick S Tarpey16, Adam P Butler16, Jonathan Hinton16, David Jones16, Andrew Menzies16, Keiran Raine16, Rebecca Shepherd16, Lucy Stebbings16, Jon W Teague16, Paolo Ribeca1,2, Francesc Castro Giner1,2, Sergi Beltran1,2, Emanuele Raineri1,2, Marc Dabad1,2, Simon C Heath1,2, Marta Gut1,2, Robert E Denroche8, Nicholas J Harding8, Takafumi N Yamaguchi8, Akihiro Fujimoto17, Hidewaki Nakagawa17, Víctor Quesada18, Rafael Valdés-Mas18, Sigve Nakken6, Daniel Vodák6,19, Lawrence Bower5, Andrew G Lynch5, Charlotte L Anderson5,20, Nicola Waddell10,11, John V Pearson10,11, Sean M Grimmond10,21, Myron Peto22, Paul Spellman22, Minghui He23, Cyriac Kandoth24, Semin Lee25, John Zhang25,26, Louis Létourneau27, Singer Ma28, Sahil Seth26, David Torrents29, Liu Xi30, David A Wheeler30, Carlos López-Otín18, Elías Campo31, Peter J Campbell16, Paul C Boutros9,32, Xose S Puente18, Daniela S Gerhard33, Stefan M Pfister14,34, John D McPherson8,32, Thomas J Hudson8,32,35, Matthias Schlesner3, Peter Lichter36,37, Roland Eils3,37,38,39, David T W Jones34, Ivo G Gut1,2.
Abstract
As whole-genome sequencing for cancer genome analysis becomes a clinical tool, a full understanding of the variables affecting sequencing analysis output is required. Here using tumour-normal sample pairs from two different types of cancer, chronic lymphocytic leukaemia and medulloblastoma, we conduct a benchmarking exercise within the context of the International Cancer Genome Consortium. We compare sequencing methods, analysis pipelines and validation methods. We show that using PCR-free methods and increasing sequencing depth to ∼ 100 × shows benefits, as long as the tumour:control coverage ratio remains balanced. We observe widely varying mutation call rates and low concordance among analysis pipelines, reflecting the artefact-prone nature of the raw data and lack of standards for dealing with the artefacts. However, we show that, using the benchmark mutation set we have created, many issues are in fact easy to remedy and have an immediate positive impact on mutation detection accuracy.Entities:
Mesh:
Year: 2015 PMID: 26647970 PMCID: PMC4682041 DOI: 10.1038/ncomms10001
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Summary of medulloblastoma tumour-normal pair library construction and sequencing.
| Library | Starting DNA (μg) | Fragment Size (bp) | Size selection | Library protocol | PCR cycles | Sequencing machine | Chemistry (Illumina) | Depth (×) control:tumour |
|---|---|---|---|---|---|---|---|---|
| L.A | 4 | ∼400 | 2% Agarose gel | KapaBio | 0 | HiSeq 2500 HiSeq 2000MiSeq | V1 (RR)V3V2 | 29.6 : 40.5 |
| L.B | 1 | ∼400 | 2% Agarose gel, Invitrogen E-gel | TrueSeq DNA | 10 | HiSeq 2000 | V3 | 44.9 : 62.8 |
| L.C | 2.5 | ∼500 | 2% Agarose gel | NEBNext | 12 | HiSeq 2500HiSeq 2000 | V1 (RR)V3 | 58.9 : 66.8 |
| L.D | 1 | ∼550 | Agarose gel | TrueSeq DNA | 10 | HiSeq 2000 | V3 | 35.3 : 39.1 |
| L.E | 2.8 | ∼620 | 1.5% Agarose gel pippin | NEBNext | 0 | HiSeq 2000 | V3 | 40.5 : 60.4 |
| L.F | 1 | ∼400 | AMPureXP beads | NEBDNA | 10 | HiSeq 2000 | V3 | 38.7 : 37.9 |
| L.G | 1 | ∼350 | AMPureXP beads | TrueSeq DNA PCR-Free | 0 | HiSeq 2000 | V3 | 19.4 : 19.3 |
| L.H | 0.5 | ∼175 | AMPureXP beads | SureSelect WGS | 10 | HiSeq 2500 | V3 | 28.7 : 26.5 |
Figure 1Differences between the different sample libraries.
Libraries A, E and G are PCR-free. (a) GC bias of the different libraries. The genome was segmented into 10-kb windows. For each window, the GC content was calculated and the coverage for the respective library was added. For better comparability, the coverage was normalized by dividing by the mean. The major band in normal corresponds to autosomes, while the lower band corresponds to sex chromosomes. The increased number of bands in the tumour is because of a higher number of ploidy states in the (largely) tetraploid tumour sample. (b) Cumulative coverage displayed for different libraries. Displayed are all libraries sequenced to at least 28 ×. To make the values comparable, we downsampled all samples to a coverage of 28 × (the lowest coverage of the initially sequenced libraries). The plot shows the percentage of the genome (y axis) covered with a given minimum coverage (x axis). (c) Percentage of certain regions of interest covered with less than 10 ×. Different colours are used to distinguish centres.
Figure 2Effect of sequencing coverage on the ability to call SSMs.
(a) Overlap of SSMs called on different balanced coverages. (b) Density plots of the variant allele frequencies for different balanced coverages of tumour and control (tumour_versus_control) and number of SSMs called in total (calls were performed using the DKFZ calling pipeline, MB.I). (c) Plot of the number of SSMs (y axis) found for a given coverage (x axis). The different colours represent different levels of normal ‘contamination' in the tumour (0% black, 17% blue, 33% green and 50% orange). Solid lines represent the real data and dashed lines are simulated. Lines are fitted against the Michaelis–Menten model using the ‘drc' package in R. Solid lines are fitted to the data points and dashed lines are simulated using a mixed inhibition model for enzyme kinetics.
Classification of SSM and SIM Gold Set mutations for the medulloblastoma benchmark.
| Definition | MB SSM | MB SIM | |
|---|---|---|---|
| Class 1 | Mutant AF≥0.10 | 962 | 337 |
| Class 2 | 0.05≤Mutant AF<0.10 | 139 | |
| Class 3 | Mutant AF<0.05 | 154 | |
| Class 4 | Ambiguous alignment | 8 | 10 |
| Class 5 | High or low depth | 29 | |
| Tier 1 | Class 1 | 962 | 337 |
| Tier 2 | Classes 1 and 2 | 1,101 | |
| Tier 3 | Classes 1, 2 and 3 | 1,255 | |
| Tier 4 | Classes 1, 2, 3 and 4 | 1,263 | 347 |
| Tier 5 | Classes 1, 2, 3, 4 and 5 | 1,292 |
AF, allele frequency; MB, medulloblastoma; SIM, somatic insertion/deletion mutations; SNP, single-nucleotide polymorphisms; SNV, single-nucleotide variant; SSM, somatic single-base mutation.
Numbers of curated mutations falling in each class or tier are shown. Successive tiers represent cumulative addition of lower AF mutations, followed by those supported by ambiguous alignments, and finally those with either too low or too high a depth. SIMs were not subjected to such fine classification, with calls only assigned to classes 1 and 4. Note that we use the terms SSM and SIM for somatic mutations instead of more commonly used terms that ought to be reserved for germline variants such as SNP (refers to a single base variable position in the germline with a frequency of >1% in the general population) or SNV (refers to any single base variable position in the germline including those with a frequency <1% in the general population).
Figure 3Overlap of somatic mutation calls for each level of concordance.
Shared sets of calls are vertically aligned. GOLD indicates the Gold Set. (a) Medulloblastoma SSM calls shared by at least two call sets. (b) Medulloblastoma SIM calls shared by at least two call sets.
Figure 4Somatic mutation calling accuracy against Gold Sets.
Decreasing sensitivity on Tiers 1, 2 and 3 shown as series for each SSM call set, while precision remains the same. (a) Medulloblastoma SSMs. (b) Medulloblastoma SIMs.
Summary of accuracy measures.
| SSM calls | Aligner | SSM Detection Software | TP | FP | FN | P | R | F1 |
|---|---|---|---|---|---|---|---|---|
| MB.GOLD | BWA, GEM | Curated | 1,255 (8) | 0 | 0 | 1.00 | 1.00 | 1.00 |
| MB.A | BWA | In-house | 775 (0) | 147 | 480 | 0.84 | 0.62 | 0.71 |
| MB.B | BWA | samtools, Varscan | 788 (1) | 12 | 467 | 0.99 | 0.63 | 0.77 |
| MB.C | GEM | samtools, bcftools | 766 (3) | 1,025 | 489 | 0.43 | 0.61 | 0.50 |
| MB.D | n.a. | SMuFin | 737 (4) | 1,086 | 518 | 0.41 | 0.59 | 0.48 |
| MB.E | BWA | SomaticSniper | 750 (4) | 229 | 505 | 0.77 | 0.60 | 0.67 |
| MB.F | BWA | Strelka | 884 (2) | 165 | 371 | 0.84 | 0.70 | 0.77 |
| MB.G | BWA | Caveman, Picnic | 899 (3) | 140 | 356 | 0.87 | 0.72 | 0.78 |
| MB.H | Novoalign | MuTect | 947 (3) | 6,296 | 308 | 0.13 | 0.22 | |
| MB.I | BWA | samtools | 879 (7) | 129 | 376 | 0.87 | 0.70 | 0.78 |
| MB.J | None, BWA | SGA+freebayes | 856 (1) | 62 | 399 | 0.93 | 0.68 | |
| MB.K | BWA | Atlas2-snp | 945 (8) | 7,923 | 310 | 0.11 | 0.75 | 0.19 |
| MB.L1 | BWA | MuTect, Strelka | 385 (0) | 3 | 870 | 0.31 | 0.47 | |
| MB.L2 | BWA | MuTect, Strelka | 900 (1) | 253 | 355 | 0.78 | 0.72 | 0.75 |
| MB.M | BWA mem | samtools, GATK+MuTect | 937 (4) | 1,695 | 318 | 0.36 | 0.75 | 0.48 |
| MB.N | BWA | Strelka | 847 (1) | 289 | 408 | 0.75 | 0.68 | 0.71 |
| MB.O | BWA | MuTect | 944 (3) | 272 | 311 | 0.78 | 0.75 | 0.76 |
| MB.P | BWA | Sidron | 833 (3) | 256 | 422 | 0.77 | 0.66 | 0.71 |
| MB.Q | BWA | qSNP+GATK | 842 (2) | 25 | 413 | 0.97 | 0.67 | |
| MB.GOLD | BWA, GEM | Curated | 337 (10) | 0 | 0 | 1.00 | 1.00 | 1.00 |
| MB.A | BWA | In-house | 16 (0) | 63 | 321 | 0.20 | 0.05 | 0.08 |
| MB.B | BWA | GATK SomaticIndelDetector, Varscan | 167 (0) | 20 | 173 | 0.89 | 0.49 | 0.63 |
| MB.C | GEM | samtools, bcftools | 103 (0) | 26 | 236 | 0.80 | 0.30 | 0.44 |
| MB.D | none | SMuFin | 29 (0) | 25 | 308 | 0.54 | 0.09 | 0.15 |
| MB.F | BWA | Strelka | 147 (8) | 12 | 193 | 0.93 | 0.43 | 0.58 |
| MB.G | BWA | Pindel | 189 (2) | 82 | 152 | 0.70 | 0.55 | 0.61 |
| MB.H | Novoalign | VarScan2 | 55 (0) | 248 | 282 | 0.18 | 0.16 | 0.17 |
| MB.I | BWA | Platypus | 271 (7) | 224 | 70 | 0.55 | ||
| MB.J | None | SGA | 90 (1) | 34 | 249 | 0.72 | 0.26 | 0.38 |
| MB.K | BWA | Atlas2-indel | 268 (6) | 444 | 72 | 0.38 | 0.79 | 0.51 |
| MB.L1 | BWA | Strelka | 64 (1) | 3 | 273 | 0.19 | 0.32 | |
| MB.L2 | BWA | Strelka | 130 (3) | 13 | 210 | 0.91 | 0.38 | 0.53 |
| MB.N | BWA | Strelka | 128 (6) | 16 | 209 | 0.89 | 0.38 | 0.53 |
| MB.O | BWA | GATK SomaticIndelDetector | 140 (1) | 47 | 197 | 0.75 | 0.42 | 0.53 |
| MB.P | BWA | bcftools, PolyFilter | 37 (0) | 57 | 301 | 0.39 | 0.11 | 0.17 |
| MB.Q | BWA | Pindel | 100 (2) | 61 | 237 | 0.63 | 0.30 | 0.40 |
F1, F1 score; FN, false negative; FP, false positives; P, precision; R, recall; TP, true positives.
Shown are the evaluation results with respect to the medulloblastoma Gold Set (Tier 3). Shown are the number of true calls (TP) with additional Tier 4 calls in parentheses, the number of FP, the number of FN, P, R and F1. The submissions with the best precision, recall and F1 score are in bold.
Figure 5Rainfall plot showing distribution of called mutations on the genome.
The distance between mutations is plotted in the log scale (y axis) versus the genomic position on the x axis. TPs (blue), FPs (green) and FNs (red). Four MB submissions representative of distinct patterns are shown. (a) MB.Q is one of best balanced between FPs and FNs, with low positional bias. (b) MB.L1 has many FNs. (c) MB.C has clusters of FPs near centromeres and FNs on the X chromosome. (d)MB.K has a high FP rate with short distance clustering of mutations.
Figure 6Enrichment or depletion of genomic and alignment features in FP calls for each medulloblastoma SSM submission.
For each feature, the difference in frequency with respect to the Gold Set is multiplied by the FP rate. Blue indicates values less than zero and thus the proportion of variants or their score on that feature is lower in the FP set with respect to the true variants. Reddish colours correspond to a higher proportion of variants or higher scores for the feature in FP calls versus the Gold Set. Both features and submissions are clustered hierarchically. The features shown here include same AF (the probability that the AF in the tumour sample is not higher than that in the normal samples, derived from the snape-cmp-counts score), DacBL (in ENCODE DAC mappability blacklist region), DukeBL (in Encode Duke Mappability blacklist region), centr (in centromere or centromeric repeat), mult100 (1—mappability of 100mers with 1% mismatch), map150 (1—mappability of 150mers with 1% mismatch), DPNhi (high depth in normal), DPNlo (low depth in normal), dups (in high-identity segmental duplication), nestRep (in nested repeat), sRep (in simple repeat), inTR (in tandem repeat), adjTR (immediately adjacent to tandem repeat), msat (in microsatellite), hp (in or next to homopolymer of length >6), AFN (mutant AF in normal) and AFTlo (mutant AF in tumour<10%).
Figure 7Accuracy of re-filtered pipeline SSM calls.
Unfiltered calls (MB.F0 and CLL.F0) are shown as a red squares, while the calls using the tuned filters (MB.F2 and CLL.F2) are shown as red circles for the medulloblastoma (a) and CLL (b) benchmark GOLD sets. For MB, only the recall versus Tier 3 is shown. Overall, 1,019 (81.2%) of the medulloblastoma SSMs (indicated by the dotted line) are considered callable at 40 × coverage; 236 MB SSMs (18.2%) were not called by any pipeline. For CLL, verification was carried out on SSMs originally called on the 40 × data, which explains the higher recall.