| Literature DB >> 29237398 |
Matthias Beyens1,2, Nele Boeckx3,4, Guy Van Camp3,4, Ken Op de Beeck3,4, Geert Vandeweyer3.
Abstract
BACKGROUND: Haloplex targeted resequencing is a popular method to analyze both germline and somatic variants in gene panels. However, involved wet-lab procedures may introduce false positives that need to be considered in subsequent data-analysis. No variant filtering rationale addressing amplicon enrichment related systematic errors, in the form of an all-in-one package, exists to our knowledge.Entities:
Keywords: Germline; Next-generation sequencing; Somatic; Targeted resequencing; Variant filtering
Mesh:
Year: 2017 PMID: 29237398 PMCID: PMC5729461 DOI: 10.1186/s12859-017-1985-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Sashimi target enrichment plots. mTOR Exon 54 coverage for two different target enrichment methods is represented by a Sashimi plot: 1) hybridization capture-based technique, showing more bell-shaped enrichment of random fragments (red histogram) and 2) amplicon- or multiplex PCR-based approach, showing vertical enrichment blocks of identical fragments (blue histogram)
Fig. 2Vertical read blocks and variant calling bias visualization. Typical vertically enriched read blocks are illustrated in (2a and 2b). Aspecific fragments, not corresponding to predicted amplicons are shown in (2a) and variants restricted to read ends are shown in (2b). Called variants are indicated by a red dashed rectangular. Reads are given in blue and pink colored horizontal bars, indicating read orientation. Theoretical manufacturer designed Haloplex probes are presented in green colored horizontal bars below their corresponding enriched reads
Additional VCF information fields. After running pyAmpli a new VCF is generated with additional INFO fields. These fields provide the user information on amplicon fractions, counts and offsets of reference and alternative alleles
| INFO ID | Description |
|---|---|
| AmpFR | Amplicon fraction for reference allele |
| AmpFA | Amplicon fraction for alternative allele |
| AmpCR | Amplicon count for reference allele |
| AmpCA | Amplicon count for alternative allele |
| AmpC | Amplicon total count |
| AmpF_OA | Amplicon count offset compared to allelic depth, for alternative allele |
| AmpF_OR | Amplicon count offset compared to allelic depth, for reference allele |
pyAmpli true and false positive rates.False and true positive rates in percentages before (−) and after (+) pyAmpli germline, somatic and LOH variant filtering with corresponding total variant number for ratio calculation. Germline variants were called with the GATK Unified Genotyper. Somatic and LOH variants were called with VarScan2
| Variant | Filter | True positive rate (%) | False positive rate (%) | Number of variants |
|---|---|---|---|---|
| Germline | – | 39 | 61 | 15,673 |
| + | 64 | 36 | 11,368 | |
| Somatic | – | 29 | 71 | 4028 |
| + | 37 | 63 | 885 | |
| LOH | – | 45 | 55 | 934 |
| + | 81 | 19 | 208 |