| Literature DB >> 31745186 |
Mark J McCabe1,2,3, Marie-Emilie A Gauthier1,4,5, Chia-Ling Chan1, Tanya J Thompson2, Sunita M C De Sousa6,7,8,9, Clare Puttick1, John P Grady1, Velimir Gayevskiy1, Jiang Tao1, Kevin Ying1, Arcadi Cipponi10, Niantao Deng3,10, Alex Swarbrick10, Melissa L Thomas11, Reginald V Lord11,12, Amber L Johns10, Maija Kohonen-Corish10,13,14, Sandra A O'Toole15,16,17,18, Jonathan Clark4,19,20, Simon A Mueller1,4,21, Ruta Gupta4,19,20, Ann I McCormack2,3,22, Marcel E Dinger1,3, Mark J Cowley23,24,25.
Abstract
Next generation sequencing has revolutionised genomic studies of cancer, having facilitated the development of precision oncology treatments based on a tumour's molecular profile. We aimed to develop a targeted gene sequencing panel for application to disparate cancer types with particular focus on tumours of the head and neck, plus test for utility in liquid biopsy. The final panel designed through Roche/Nimblegen combined 451 cancer-associated genes (2.01 Mb target region). 136 patient DNA samples were collected for performance and application testing. Panel sensitivity and precision were measured using well-characterised DNA controls (n = 47), and specificity by Sanger sequencing of the Aryl Hydrocarbon Receptor Interacting Protein (AIP) gene in 89 patients. Assessment of liquid biopsy application employed a pool of synthetic circulating tumour DNA (ctDNA). Library preparation and sequencing were conducted on Illumina-based platforms prior to analysis with our accredited (ISO15189) bioinformatics pipeline. We achieved a mean coverage of 395x, with sensitivity and specificity of >99% and precision of >97%. Liquid biopsy revealed detection to 1.25% variant allele frequency. Application to head and neck tumours/cancers resulted in detection of mutations aligned to published databases. In conclusion, we have developed an analytically-validated panel for application to cancers of disparate types with utility in liquid biopsy.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31745186 PMCID: PMC6864073 DOI: 10.1038/s41598-019-52000-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Genes targeted in Panel Version 1 and added to Panel Version 2 (bold).
*Two protein coding isoforms were targeted of the same gene. **Including promoter region.
Figure 1Performance metrics for PV1 and PV2. (A) Box plots presenting data comparisons between PV1 (n = 80 samples tested) and PV2 (n = 111 samples tested) for unique reads (in millions; M), percent duplication, target coverage and percent base coverage at >20x and >100x. (B) Mean coverage over coding regions of shared genes was compared between PV1 and PV2 after additional probes to target poorly-covered regions were added to the latter. (C) Changes to individual shared targets between PV1 and PV2 are also presented. (D) The final mean coverage for PV2 is presented following the addition of 139 new genes for a total of 451 targeted genes, as well as (E) the final percentage of targeted coding bases in PV2 with a percent mean coverage of >100x. Dashed lines in panels (B–E) represent mean coverage. ***p < 0.001, ns = not significant.
Detection of kConFab variants.
| Chr | Pos | Ref | Alt | Mut type | Gene | Detected CDS | Detected AA | Detection | VAF (%) |
|---|---|---|---|---|---|---|---|---|---|
| 2 | 47702181 | C | T | SNV | c.1777C > T | p.Gln593Ter | D | 48 | |
| 2 | DUP | Exon 9-11 duplication | D | ||||||
| 2 | 215645969 | GTT | G | DEL | c.627_628delAA | p.Lys209Asnfs*4 | D | 51 | |
| 9 | 97912356 | G | A | SNV | c.535C > T | p.Arg179Ter | D | 53 | |
| 9 | 98011506 | TC | T | DEL | c.67delG | p.Asp23Ilefs | D | 49 | |
| 10 | 89690810 | G | T | SNV | c.217G > T | p.Glu73Ter | D | 56 | |
| 10 | 89692818 | TCA | CC | CV | c.302_304delTCAinsCC | p.Ile101Thrfs*12 | D | 47 | |
| 10 | 89692904 | C | T | SNV | c.388C > T | p.Arg130Ter | D | 49 | |
| 11 | 108165787 | G | A | DEL | c.4909 + 1G > A | p.Met297_Ser301del | D | 49 | |
| 13 | 32913708 | ATTAAGT | A | DEL | c.5217_5223delTTTAAGT | p.Tyr1739Terfs | D | 53 | |
| 13 | 32913778 | T | A | SNV | c.5286T > A | p.Tyr1762Ter | D | 45 | |
| 13 | 32914174 | C | G | SNV | c.5682C > G | p.Tyr1894Ter | D | 50 | |
| 13 | 32937635 | AC | A | DEL | c.8297delC | p.Thr2766Asnfs*11 | D | 52 | |
| 13 | 32954180 | C | T | SNV | c.9154C > T | p.Arg305Trp | D | 51 | |
| 15 | 91328183 | C | T | SNV | c.2695C > T | p.Arg899Ter | D | 44 | |
| 16 | 23632683 | C | T | SNV | c.3113G > A | p.Trp1038Ter | D* | 45 | |
| 16 | 23632683 | C | T | SNV | c.3113G > A | p.Trp1038Ter | D* | 42 | |
| 16 | 23634303 | CA | CAA | INS | c.2982dupT | p.Ala995Cysfs*16 | D | 43 | |
| 16 | 23641239 | CT | C | DEL | c.2235delA | p.Lys745Lysfs * 19 | D | 51 | |
| 16 | 23641528 | CT | C | DEL | c.1947_1948insA | p.Glu650Argfs * 13 | D | 44 | |
| 17 | 7577539 | G | A | SNV | c.742C > T | p.Arg248Trp | D | 53 | |
| 17 | 7578406 | C | T | SNV | c.524G > A | p.Arg175His | D | 39 | |
| 17 | 7578552 | G | T | SNV | c.378C > A | p.Tyr126Ter | D | 45 | |
| 17 | 7674945 | G | A | SNV | c.586C > T | p.Arg196Ter | D | 52 | |
| 17 | DUP | Exons 2–6 duplication | D | ||||||
| 17 | 41197774 | A | T | SNV | c.5513T > A | p.Val1838Glu | D | 49 | |
| 17 | 41197784 | G | A | SNV | c.5503C > T | p.Arg1835Ter | D | 47 | |
| 17 | 41243788 | TAGAC | T | DEL | c.3756_3759delGTCT | p.Ser1253Argfs * 10 | D | 52 | |
| 17 | 41244353 | A | AC | INS | c.3193dupG | p.Asp1065Glyfs * 2 | D# | 52 | |
| 17 | 41245666 | T | TGG | INS | c.1879_1880insCC | p.Val627Alafs * 6 | D# | 45 | |
| 17 | 41246260 | C | CT | INS | c.1287dupA | p.Asp430Argfs * 6 | D | 45 | |
| 17 | 41258551 | C | A | SNV | c.135-1G > T | p.Phe46_Arg71del | ND§ | ||
| 17 | DUP | Exon 13 duplication | D† | ||||||
| 22 | 29095862 | AG | A | DEL | c.1100delC | D | 50 | ||
| 22 | 29121230 | C | T | SNV | c.444 + 1G > A | p.Thr367Metfs | D† | 41 |
Abbreviations: AA; Amino acid, Alt; Alternative, CDS; Coding sequence, Chr; Chromosome, CV; Complex variant, D; Detected, Del; Deletion, Dup; Duplication, INS; Insertion, Mut; Mutation, ND; Not detected, Pos; Position, Ref; Reference, SNV; Single nucleotide variant, VAF; Variant allele frequency. *The same variant was expected in two samples. #Reported in kConFab depository as c.3194_3195insG, p.Asp1065Glufs * 2 and c.1881_1882insCC, p.Ser628Profs * 5 respectively. §Visible as a single read against ~1200 wild type reads on Integrative Genomics Viewer. †Detected on PV2 only.
Figure 2Intragenic copy number detection and comparison to commercial panels. (A) Control genomic DNA samples were acquired from kConFab for sensitivity testing. Three of these samples included known exon duplications in BRCA1, TP53 and MSH2, which were assessed by the DeCON tool. Exons are numbered along the x-axis, and those of normal copy number are presented as blue dots. Amplifications are shown in red. The TP53 and AURKB genes are on opposing DNA strands hence the presence of the latter and its exons in this Figure. A similar genetic-overlap is observed for MSH2 to the left of the panel. (B) A commercially available pool of synthetic oligos against a normal genomic background was also obtained. Mutations were provided at variant allele frequencies (VAF) of 5–15% and 15–35%, or at germline frequencies. Presented are the number of detected and missed variants in our PV1 and PV2 panels relative to what was expected in AcroMetrix. This was compared to three other panels [AmpliSeq Cancer Hotspot Panel v2 (CHPv2), Illumina TruSeq Amplicon – Cancer Panel (TSCAP) and TruSight Tumor Panel 26 (TSTP)], the data for which were provided by the AcroMetrix manufacturer. Percent values on the right indicate the proportion of AcroMetrix variants actually targeted by the panels.
Figure 3Copy number evaluation in breast cancer cell lines. (A) Heat map representation of ~300 targeted genes in the SK-BR-3 and BT-474 breast cancer cells lines. Data was extracted from PV1-captured DNA and processed through CNVkit and compared to those expected according to the Broad Institute’s Cancer Cell Line Encyclopedia (CCLE). Copy number deletions are represented in blue and as copy numbers approach diploidy colours become white. Amplification in copy number is represented in red. (B) Regression analysis was carried out using the correlation coefficient to quantify the strength of relationship between PV1 called variants in SK-BR-3 and (C) BT-474, and those expected according to CCLE. An r-value of >0.7 is considered a strong, positive relationship. (D–G) Ten lowest (red) and ten largest CNV (blue) changes as detected by PV1 in SK-BR-3 (D) and BT-474 (F). These were compared to expected values according to CCLE (E,G). Note the log2 y-axes and the gene names on the x-axes. Complete data is presented in Supplementary Fig. S4.
Sensitivity and precision in well characterised genomes, and specificity measure.
| Sample | True positive (TP) | False positive (FP) | True negative (TN) | False negative (FN) | Precision** | Sensitivity*** | Specificity**** |
|---|---|---|---|---|---|---|---|
| NA12878 | 602 | 16 | ND | 4 | 0.974 | 0.993 | ND |
| NA23485 | 562 | 13 | ND | 1 | 0.977 | 0.998 | ND |
| NA24149 | 580 | 11 | ND | 1 | 0.981 | 0.998 | ND |
| NA24143 | 552 | 12 | ND | 2 | 0.979 | 0.996 | ND |
| 13 | 0 | 88,377 | 0 | 1.000 | 1.000 | 1.000 |
Abbreviations: ND; Not done. *AIP gene (993 bp) was Sanger sequenced in 89 FPTS samples alongside application of PV1 and PV2. **Precision calculated by TP/(TP + FP). ***Sensitivity calculated by TP/(TP + FN). ****Specificity (AIP only) calculated by TN/(TN + FP).
Figure 4Rare variants are detectable in FPTS, OSCC and cSCC samples, at a similar ratio to online databases. Our validated panel was applied to cohorts of patients with FPTS (n = 18) (A), OSCC (n = 39) (B), and cSCC (n = 27) (C), for the detection of rare (≤1% control population) variations of various categories. Data for OSCC and cSCC was compared to TCGA and Pickering/Inman databases[51,53], with genes arranged in order of most commonly mutated within those databases. Variants were restricted to high and medium impact. No such database exists for pituitary tumours, and genes were restricted to the 8 described in FPTS. Variants of any impact were included in the data presented herein for FPTS.
Detection of synthetic ctDNA oligos.
| Gene/mutation | 5.00% VAF | 1.25% VAF | 0.625% VAF | 0.125% VAF |
|---|---|---|---|---|
| Yes | Yes | No | No | |
| Yes | No | No | No | |
| Yes | Yes | No | No | |
| Yes | Yes | No | No | |
| Yes | No | No | No | |
| Yes | Yes | No | No | |
| Yes | Yes | No | No | |
| Yes | No | No | No | |
| No | Yes | No | No |
Abbreviation: VAF; Variant Allele Frequency.