Literature DB >> 31745186

Development and validation of a targeted gene sequencing panel for application to disparate cancers.

Mark J McCabe^1,2,3, Marie-Emilie A Gauthier^1,4,5, Chia-Ling Chan¹, Tanya J Thompson², Sunita M C De Sousa^6,7,8,9, Clare Puttick¹, John P Grady¹, Velimir Gayevskiy¹, Jiang Tao¹, Kevin Ying¹, Arcadi Cipponi¹⁰, Niantao Deng^3,10, Alex Swarbrick¹⁰, Melissa L Thomas¹¹, Reginald V Lord^11,12, Amber L Johns¹⁰, Maija Kohonen-Corish^10,13,14, Sandra A O'Toole^15,16,17,18, Jonathan Clark^4,19,20, Simon A Mueller^1,4,21, Ruta Gupta^4,19,20, Ann I McCormack^2,3,22, Marcel E Dinger^1,3, Mark J Cowley^23,24,25.

Abstract

Next generation sequencing has revolutionised genomic studies of cancer, having facilitated the development of precision oncology treatments based on a tumour's molecular profile. We aimed to develop a targeted gene sequencing panel for application to disparate cancer types with particular focus on tumours of the head and neck, plus test for utility in liquid biopsy. The final panel designed through Roche/Nimblegen combined 451 cancer-associated genes (2.01 Mb target region). 136 patient DNA samples were collected for performance and application testing. Panel sensitivity and precision were measured using well-characterised DNA controls (n = 47), and specificity by Sanger sequencing of the Aryl Hydrocarbon Receptor Interacting Protein (AIP) gene in 89 patients. Assessment of liquid biopsy application employed a pool of synthetic circulating tumour DNA (ctDNA). Library preparation and sequencing were conducted on Illumina-based platforms prior to analysis with our accredited (ISO15189) bioinformatics pipeline. We achieved a mean coverage of 395x, with sensitivity and specificity of >99% and precision of >97%. Liquid biopsy revealed detection to 1.25% variant allele frequency. Application to head and neck tumours/cancers resulted in detection of mutations aligned to published databases. In conclusion, we have developed an analytically-validated panel for application to cancers of disparate types with utility in liquid biopsy.

Entities: CellLine Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2019 PMID： 31745186 PMCID： PMC6864073 DOI： 10.1038/s41598-019-52000-3

Source DB: PubMed Journal: Sci Rep ISSN： 2045-2322 Impact factor: 4.379

Introduction

Next generation sequencing (NGS) has revolutionised our understanding of cancer. New taxonomies defined by molecular features rather than tissue of origin are emerging as powerful tools to better diagnose and treat cancers, and are becoming routine clinical practice in oncology and pathology[1,2]. The unprecedented success of gene- or genetic pathway-targeting therapies such as imatinib[3,4], gefitinib[5,6] and trastuzumab[7] over a decade ago was arguably the driving force behind this re-classification of cancer, with 75.6% of United States-based oncologists now using NGS to guide treatment decisions[8]. While whole exome/genome sequencing remains largely cost prohibitive[9,10], targeted gene panels have proliferated in mainstream clinical care due to their relative affordability and focused application; allowing for simplified data interpretation by omitting less-relevant genes and reducing the number of variants of unknown significance, and increased depth of coverage to improve detection sensitivity[9,11,12]. While to date, standardised validation criteria for diagnostic accreditation do not exist due to the lack of uniformity across the multitude of NGS platforms and within NGS processes[13], stringent validation guidelines have been published to provide standardised frameworks for clinical genetic tests[11,14,15]. Transparency in the literature has promoted the detailing of targeted panels’ development and validation across a variety of diseases, and highlighting their sensitivity, specificity and precision[9,16-18]; factors viewed as paramount for enabling clinicians to make important clinical management and treatment decisions[19]. In oncology, targeted panels have been developed for detecting hereditary cancer, monitoring somatic changes in progressive cancer, and importantly for elucidating the landscape of genetic aberrations that occur across multiple cancers, and using this information to identify novel therapeutics or repurpose existing ones[8,19-21]. In a landmark study in 2013, Frampton et al.[22] identified clinically-actionable mutations in 76% of 2221 tumours studied; a 3-fold increase in drug-actionable detection as compared with contemporary diagnostic tests including Sanger sequencing, mass spectrometric genotyping, fluorescent in situ hybridisation and immunohistochemistry. As an example of therapeutic repurposing, the HER2-amplified breast cancer treatment trastuzumab (Herceptin) has been shown to be effective in advanced gastric cancer, and potentially HER2 amplified pancreatic ductal adenocarcinoma[23,24]. Targeted gene panels are now also being produced for liquid biopsy and ctDNA sequencing; an emerging tool that holds promise to monitor for cancer initiation or relapse, tumour burden and evolution including the onset of treatment resistance[25-28]. Unlike standard surgical-core biopsies, liquid biopsies can be performed repeatedly, non-invasively, and are not spatio-temporally constrained, that is, results reflect all individual tumour mutations in real-time; thus informing the broadest range of therapeutic options available. In a preliminary study, we recently applied a targeted gene sequencing panel to 44 patients with familial pituitary tumour syndromes (FPTS) and detected rare germline variants across the 8 analysed genes (AIP, MEN1, CDKN1B, PRKAR1A, SDHA, SDHB, SDHC, SDHD) in 11 patients[29]. Recognising the utility of this approach to poorly studied cancers, we hypothesised herein that further developing, expanding and validating our targeted gene panel would benefit patients with other cancers of the head and neck, and beyond. We therefore aimed to re-access our extended FPTS cohort, and expand our panel by adding hundreds of known cancer-associated genes, validate its sensitivity, precision and specificity against several well-characterised controls, and then apply it to two patient cohorts with oral and cutaneous squamous cell carcinoma of the head and neck (OSCC and cSCC respectively). We also aimed to determine our panel’s utility in liquid biopsy by targeting known mutations in genes at decreasing variant allele frequencies (VAFs), in a commercially available synthetic standard.

Methods

Patients

DNA samples were obtained from patients with the following conditions: FPTS [n = 38 (blood-derived) and 10 (tumour-derived)], sporadic pituitary tumours [n = 10 (blood-derived) and 6 (tumour-derived)], OSCC [n = 39 (tumour-derived)], cSCC [n = 26 (tumour-derived)], miscellaneous [n = 3 adrenocortical carcinoma (1 blood-derived and 2 tumour-derived), 4 pancreatic ductal carcinoma (3 blood-derived and 1 tumour-derived)]. Informed consent had been provided, and prior ethics approval obtained through St Vincent’s Hospital Sydney Human Research Ethics Committee, NSW, Australia (FPTS, sporadic pituitary tumours and adrenocortical carcinomas; protocol X13-0109), and Royal Prince Alfred Hospital, NSW, Australia (OSCC, cSCC and pancreatic ductal carcinomas; protocols X13-0417, LNR/13/RPAH/582 and HREC/11/RPAH/329). All experiments were performed in accordance with relevant guidelines and regulations.

Development of the cancer gene sequencing panel PV1 and PV2

Two methods were used for deciding the genetic targets of our panels. Firstly, genes were selected from the following commercially available panels: TruSight Tumour 26 and TruSeq Amplicon Cancer Panels (Illumina), SureSeq Solid Tumour panel (Oxford Gene Technology), Foundation One Panel (Foundation Medicine), OncoCarta Panels Versions 1–3 (OncoCarta) and Haloplex Cancer Research Panel (Agilent). While each of these panels except Foundation One target specific exons or mutational hotspots, our design incorporated the entire coding region plus 10 bp of flanking intron of each gene included. Secondly, considering our panel was to expand upon our previously published FPTS data[29], we searched for more pituitary tumour targets by conducting a literature search on PubMed (https://www.ncbi.nlm.nih.gov/pubmed/) for genes associated with aggressive pituitary tumours, familial pituitary tumours, and embryonic pituitary development. In total 312 cancer-associated genes were chosen for panel version 1 – PV1 (Table 1), with a target region of 0.97 Mb across 4,837 exons. The panel prototype was designed using the Roche/NimbleGen EZ Choice Library NimbleDesign platform [https://design.nimblegen.com/nimbledesign/#/ (cat #06266282001)].

Table 1

Genes targeted in Panel Version 1 and added to Panel Version 2 (bold).

ABL1	BLM	CENPE	EPAS1	FLT4	IKZF1	MAP3K1	NFE2L2	PIK3R2	RYR1	SUFU
ACTN4	BMP4	CGA	EPCAM	FOXA1	IL7R	MAP3K20	NFKBIA	PMS2	S100A9	SUPV3L1
ADAMTS6	BMP8	CHD1	EPHA3	FOXL2	INHBA	MAPK2	NGN2	POMC	SDHA	TBXAS1
ADAMTS7	BMPR1A	CHD7	EPHA5	FSHB	IRF4	MAST4	NKX2-1	POU1F1	SDHAF2	TCF7L1
ADBRK2	BMPR2	CHEK1	EPHB1	GADD45B	IRS2	MAX	NKX3-1	PPP2R1A	SDHB	TERT**
AHR	BRAF	CHEK2	EPBH2	GADD45G	JAK1	MCC	NME1-NME2	PRB3	SHDC	TESC
AIP	BRCA1	CHRM3	ERBB2	GATA1	JAK2	MCL1	NOTCH1	PRDM1	SDHD	TET2
AKT1	BRCA2	CIC	ERBB3	GATA2	JAK3	MDM2	NOTCH2	PRDM2	SETD2	TFG
AKT2	BRIP1	CITED1	ERBB4	GATA3	JPH2	MDM4	NOTCH3	PRDM8	SF3B1	TGFBR2
AKT3	BTK	CLDN11	ERG	GID4	JRK	MED12	NPM1	PRDX2	SHH	TH
ALK	C2CD3	COX2	ESR1	GLI2	JUN	MEF2B	NR0B1	PRKAA2	SIRT1	TMEM43
AMER1	CACNA1H	CREBBP	ESR2	GNA11	KAT6A	MEN1	NR2C2	PRKAR1A	SIX3	TMEM127
ANOS1	CACNA1S	CRKL	EZH2	GNA13	KCNA5	MET	NR3C1	PRKDC	SIX6	TNFAIP3
AOX1	CAPN1	CRLF2	FAM46C	GNAQ	KDM5A	MGAM	NRAS	PRLR	SKI	TNFRSF14
APC	CARD11	CRMP1	FANCA	GNAS	KDM5C	MGMT	NTR3	PROCA1	SLIT2	TOP1
AR	C8FB	CSF1R	FANCC	GNRHR	KDM6A	MITF	NTKR1	PROP1	SMAD2	TOP2A
ARAF	CBL	CTCF	FANCD2	GPR101	KDR	MLH1	NTRK2	PTCH1	SMAD4	TP53
ARFRP1	CCAR2	CTNNA1	FANCE	GPR124	KEAP1	MLL1	NTRK3	PTEN	SMARCA4	TPM3
ARID1A	CCNB1	CTNNB1	FANCF	GREM1	KIAA0226	MLL2	NUP93	PTPN11	SMARCB1	TPM4
ARID2	CCND1	CXADR	FANCG	GRIN1	KIAA0363	MLL3	OCLN	PTTG	SMO	TRIOBP
ARMC5	CCND2	CYP19A1	FANCL	GRIN2A	KIF1B	MMP2	OGDH	PYCR1	SNAP25	TSC1
ASXL1	CCND3	CYP2D6	FANCM	GRIN2B	KIT	MMP9	OR51B4	RAB18	SNIP1	TSC2
ATAD2B	CCNE1	CYP21A2	FBXO4	GSK3B	KLHL6	MMP17	OSBP	RAB1A	SNX3	TSHB
ATM	CCR10	DAXX	FBXW7	HESX1	KRAS	MPL	OTX2	RAC1	SOCS1	TSHR
ATP6V0A1	CD79A	DBF4	FGF2	HGF	LATS2	MRE11A	PAK3	RAD50	SOCS2	UBR4
ATPAF2	CD79B	DCAMKL3	FGF3	HIF1α	LGALS3	MSH2	PALB2	RAD51	SON	UCP3
ATR	CDC2L2	DDR2	FGF4	HMGA1	LHB	MSH6	PALLD	RAD51C	SOS1	UGT1A1
ATRX	CDC73	DDX3×	FGF6	HMGA2	LHX2	MTOR	PARP10	RAD51D	SOX2	UGT2B7
AURKA	CDH1	DICER1	FGF8	HMGB1	LHX3	MUTYH	PAX5	RAF1	SOX3	USP8
AURKB	CDK3	DMD	FGF10	HNF1A	LHX4	MX2	PAX6	RARA	SOX9	VEGFA
AXL	CDK4	DNMT3A	FGF14	HNRPU	LOXL1	MYC	PAX7	RARS	SOX10	VHL
BAP1	CDK6	DOC1	FGF19	HOXB13	LRP1B	MYCL1	PBRM1	RASA4	SPANXN2	WIF1
BARD1	CDK8	DOT1L	FGF23	HPGD	LRP2	MYCN	PCDH11X	RB1	SPEN	WISP3
BAX	CDK12	DPCR1	FGFR1	HRAS	LRRC50	MYD88	PDGFD	RET	SPOP	WNT4
BBC3	CDKN1A	DSPP	FGFR2	HSP90AA1	LTBP4	MYO5A	PDGFRA	REXO4	SPTA1	WNT9A
BCL2	CDKN1B	DST	FGFR3	IDH1	MAG	NBN	PDGFRB	RICTOR	SRC	WT1
BCL2L2	CDKN2A (p14ARF)*	EGF	FGFR4	IDH2	MAN1A1	NCAM1	PDK1	RNF43	SRP9	XPO1
BCL6	CDKN2A (p16)*	EGFL7	FH	IGF1R	MAP1LC3B	NDRG4	PIK3CA	RPA1	SSR3	ZFHX3
BCL9	CDKN2B	EGFR	FLNA	IGFBP5	MAP2K1	NF1	PIK3CB	RPTOR	STAG2	ZNF217
BCOR	CDKN2C	EGLN1	FLT1	IGSF1	MAP2K2	NF2	PIK3CG	RUNX1	STAT4	ZNF703
BCORL1	CEBPA	EMSY	FLT3	IKBKE	MAP2K4	NFE2L1	PIK3R1	RXRG	STK11	ZNF717

*Two protein coding isoforms were targeted of the same gene. **Including promoter region.

Genes targeted in Panel Version 1 and added to Panel Version 2 (bold). *Two protein coding isoforms were targeted of the same gene. **Including promoter region. Panel version 2 (PV2) was a refined design leveraging off PV1 data to include more probes to improve read depth over poorly covered regions, plus incorporate the coding region (and 10 bp flanking intron) of 139 new genes (Table 1) associated with pituitary tumours and pituitary function; acquired through further PubMed literature search and clinical consultation. The target region of PV2 was 2.01 Mb, for a total of 6,929 target exons and 451 genes. We used 80 and 111 DNA samples to measure the performance metrics for PV1 and PV2 respectively (see Bioinformatics below). These comprised 31 samples from the patient cohorts, 6 extracted DNA samples from 4 cancer cell lines (liposarcoma 94T778, breast cancer SK-BR-3, and melanoma SK-MEL-28 and A375), and 43 control samples for PV1; and 105 samples from the patient cohorts and 6 controls for PV2.

Controls

Several controls were obtained to test the ability of PV1 and PV2 to detect well-characterised single nucleotide variants (SNVs), small insertions and deletions (indels) and copy number variants (CNVs): (1) 42 controls from breast cancer patients with known inherited mutations in 11 genes (Supplementary Fig. S1). We only retained controls that kConFab reviewed and classified/endorsed (n = 35) to test kit sensitivity; (2) an AcroMetrix Oncology Hotspot Control (Thermo Fisher Scientific, cat #969056), which consists of a pool of synthetic oligos against a normal genomic DNA background encompassing 521 somatic and 34 germline mutations in 53 genes (Supplementary Fig. S1); (3) a pair of DNA controls for testing our panel’s sensitivity to CNVs was derived from the well-characterised breast cancer cell lines SK-BR-3 and BT-474; (4) four controls from individuals NA12878, NA24385, NA24149 and NA21143 (Coriell Institute, NJ, USA), whose genomes have been deeply characterised using 12 technologies used to calibrate, benchmark and validate genomic tools for clinical practice worldwide[30]. The kConFab controls, the cell lines and the AcroMetrix Oncology Hotspot Control were tested on PV1. Two repeat kConFab control samples and a repeat of the AcroMetrix Oncology Hotspot Control where expected mutations were missed in PV1 due to lack of coverage, and the Coriell samples were tested on PV2.

Panel specificity testing

Routine clinical practice in St Vincent’s Hospital’s Department of Endocrinology for patients with suspected FPTS, includes Sanger sequencing of the AIP gene. Sanger sequencing is required for panel specificity validation, as it verifies true negatives, that is, it confirms that alleles labelled as wild type by next generation sequencing panels, are truly wild type and not mutated as previously defined[18]. We accessed the Sanger sequencing data for 89 FPTS patients who had been sequenced with our panels for this study, and the aforementioned preliminary study[29].

Liquid biopsy for ctDNA

To test for the applicability of our panel to liquid biopsy, we acquired synthetic oligos of analogous lengths to ctDNA, representing 9 known mutations across 6 cancer-associated genes (Seraseq Circulating Tumor DNA (ctDNA) Reference Material, SeraCare Life Sciences, MA, USA; cat #0710-0018). The mutated oligos were provided at different allelic frequencies (5%, 1.25%, 0.625%, 0.125% and 0%) in a matrix containing human proteins to closely resemble human plasma.

NGS library preparation and sequencing

Genomic DNA

Genomic germline DNA was extracted from peripheral blood leukocytes using the QIAamp DNA Blood Midi Kit (Qiagen; cat #51183), or from fresh-frozen tumour samples using Qiagen’s DNeasy kit (cat #69504), through Garvan Molecular Genetics. DNA library preparation for all PV1 samples was conducted on 100 ng of input using the KAPA Hyper Library Preparation Kit (Roche, cat #KK8504), according to manufacturer’s instructions. For PV2, library preparation was conducted using the KAPA HyperPlus Kit (Roche, cat #KK8514) according to manufacturer’s instructions. KK8514 uses enzymatic digestion rather than sonication to fragment genomic DNA as per KK8504. NGS was performed using Illumina’s HiSeq2500 for all samples in multiplexed pools of 17–24/lane.

ctDNA

Libraries were prepared using the QIAseq Ultralow Input Library Kit (Qiagen; cat #180495), according to manufacturer’s instructions. Oligos were captured with PV1 and sequenced on Illumina’s NextSeq500 platform in multiplexed pools.

Bioinformatics

Our bioinformatics pipeline had been accredited for Medical Testing (ISO15189) by the National Association of Testing Authorities, Australia (NATA)/Royal College of Pathologists of Australasia (RCPA) in 2016. In brief, raw fastq files were transferred to the cloud-based genomic analysis platform DNAnexus (www.dnanexus.com). Bioinformatic read alignment and improvement, variant annotation, conversion, filtration and prioritisation were conducted as previously published[31-33]; the exception being for ctDNA where read duplications were not marked. Germline and somatic variants were called using the Genome Analysis Toolkit’s (GATK) HaplotypeCaller (v3.3) in GVCF mode[34] and VarDictJava (v1.4.6)[35] respectively. Copy number variants occurring across entire genes was determined using CNVkit (v0.9.1)[36], and within genes using DECoN[37]. Panel performance metrics including number of total reads, number of unique reads, mean coverage across regions captured by each panel version and targeted genes specifically (RefSeq coding exons), read enrichment and read duplication were assessed using the GATK’s CollectHsMetrics (Picard) and DepthofCoverage tools. By using controls with well-characterised SNVs, indels and CNVs, we were able to measure the sensitivity and/or precision of PV1 and PV2 as previously defined[18]. To measure precision recall and sensitivity for the four Coriell samples, we added a hard filter step recommended by GATK for small panels and then applied vcfeval in RTG Tools[38].

Statistics

Statistical analysis was performed using R (v3.6). Differences between groups were measured by ANOVA and identified using post-hoc Tukey’s HSD test. For the comparison between two groups, t-test was employed. p < 0.05 was used to determine if results were significantly different.

Results and Discussion

Performance evaluation

The mean total number of reads between PV1 and PV2 were similar at approximately 22.5 M (Supplementary Table S1), however the total number of unique reads for PV2 was nearly double that of PV1 (p < 0.001), consistent with the significantly lower PCR duplication rate in PV2 (p < 0.001) (Fig. 1A).

Figure 1

Performance metrics for PV1 and PV2. (A) Box plots presenting data comparisons between PV1 (n = 80 samples tested) and PV2 (n = 111 samples tested) for unique reads (in millions; M), percent duplication, target coverage and percent base coverage at >20x and >100x. (B) Mean coverage over coding regions of shared genes was compared between PV1 and PV2 after additional probes to target poorly-covered regions were added to the latter. (C) Changes to individual shared targets between PV1 and PV2 are also presented. (D) The final mean coverage for PV2 is presented following the addition of 139 new genes for a total of 451 targeted genes, as well as (E) the final percentage of targeted coding bases in PV2 with a percent mean coverage of >100x. Dashed lines in panels (B–E) represent mean coverage. ***p < 0.001, ns = not significant. We achieved the vendor performance recommendations for both PV1 and PV2 designs, with a mean depth of coverage across regions captured by each panel version of >100x and a mean percent of captured bases covered at >20x across panel versions of ≥95% (Supplementary Table S1). While mean depth of coverage and percent bases with >100x coverage were not significantly different between PV1 and PV2 (Fig. 1A), greater variability was observed for PV1 for percent bases with >20x coverage, with a mean significantly lower than for PV2 (p < 0.001) (Fig. 1A). We then evaluated how well the designed probes covered the actual exons of targeted genes. As mentioned previously, exons identified with low coverage in PV1 were balanced with more probes in PV2. The result was an increase in the mean depth of coverage across the targeted genes shared between the panels, from 351x (PV1) to 460x (PV2) (Fig. 1B), and at individual gene level, the mean depth of coverage across samples was consistently >100x in PV2 but not in PV1 (Fig. 1C). If we take into consideration the full PV2 design, which includes an additional 139 genes, the global mean gene depth of coverage for this panel version was 395x (Fig. 1D) and the mean percent of targeted bases across genes with > 100x coverage was 84% (Fig. 1E). Overall the performance of PV2 was an improvement upon PV1. Our panel far exceeds the typical 30x coverage recommended by providers of whole genome sequencing for germline analyses, and the 100x coverage recommended for somatic variant detection.

Sensitivity assessment

kConFab

42 DNA samples from patients with mutations of various types in 11 genes were acquired from kConFab (Table 2, Supplementary Fig. S1), with the 35 that had been classified and reviewed proceeding through to library preparation and analysis. We detected 34 of the expected mutations (Table 2). The undetected variant, BRCA1 c.135-1G > T, had been identified in the donor by two different clinical laboratories plus a research laboratory, and appeared to segregate with 12 other family members being affected. Our data however raises the potential of an annotation error, or potential sample swap, with just one read on the Integrative Genomics Viewer (IGV)[39] exhibiting the mutation against a backdrop of ~1200 wild type reads of excellent mapping quality (Supplementary Fig. S2A). Two other potential annotation errors were observed among the detected variants (Table 2). Firstly, we detected BRCA1 c.3193_3194dupG, p.Asp1065Glyfs*2, but it had been reported as c.3194_3195insG, p.Asp1065Glufs * 2 by clinical and research laboratories across numerous family members (Supplementary Fig. S2B). Secondly, we detected insertion BRCA1 c.1879_1880insCC, resulting in a p.Val627Alafs*6 change (Table 2 and Supplementary Fig. S2C) instead of c.1881_1882_insCC, p.Ser628Profs*5 that was recorded by other laboratories. Next generation sequencing does have difficulties resolving indels in highly repetitive or homopolymer regions, as well as regions of high A-T and G-C content[15], however the regions in Supplementary Fig. S2B,C do not appear to be problematic. As per the ‘undetected’ variant above, mapping quality for these two potential annotation errors was excellent, and excellent depth was achieved over both wild type and inserted alleles, providing confidence in our calls.

Table 2

Detection of kConFab variants.

Chr	Pos	Ref	Alt	Mut type	Gene	Detected CDS	Detected AA	Detection	VAF (%)
2	47702181	C	T	SNV	MSH2	c.1777C > T	p.Gln593Ter	D	48
2				DUP	MSH2	Exon 9-11 duplication		D
2	215645969	GTT	G	DEL	BARD1	c.627_628delAA	p.Lys209Asnfs*4	D	51
9	97912356	G	A	SNV	FANCC	c.535C > T	p.Arg179Ter	D	53
9	98011506	TC	T	DEL	FANCC	c.67delG	p.Asp23Ilefs	D	49
10	89690810	G	T	SNV	PTEN	c.217G > T	p.Glu73Ter	D	56
10	89692818	TCA	CC	CV	PTEN	c.302_304delTCAinsCC	p.Ile101Thrfs*12	D	47
10	89692904	C	T	SNV	PTEN	c.388C > T	p.Arg130Ter	D	49
11	108165787	G	A	DEL	ATM	c.4909 + 1G > A	p.Met297_Ser301del	D	49
13	32913708	ATTAAGT	A	DEL	BRCA2	c.5217_5223delTTTAAGT	p.Tyr1739Terfs	D	53
13	32913778	T	A	SNV	BRCA2	c.5286T > A	p.Tyr1762Ter	D	45
13	32914174	C	G	SNV	BRCA2	c.5682C > G	p.Tyr1894Ter	D	50
13	32937635	AC	A	DEL	BRCA2	c.8297delC	p.Thr2766Asnfs*11	D	52
13	32954180	C	T	SNV	BRCA2	c.9154C > T	p.Arg305Trp	D	51
15	91328183	C	T	SNV	BLM	c.2695C > T	p.Arg899Ter	D	44
16	23632683	C	T	SNV	PALB2	c.3113G > A	p.Trp1038Ter	D*	45
16	23632683	C	T	SNV	PALB2	c.3113G > A	p.Trp1038Ter	D*	42
16	23634303	CA	CAA	INS	PALB2	c.2982dupT	p.Ala995Cysfs*16	D	43
16	23641239	CT	C	DEL	PALB2	c.2235delA	p.Lys745Lysfs * 19	D	51
16	23641528	CT	C	DEL	PALB2	c.1947_1948insA	p.Glu650Argfs * 13	D	44
17	7577539	G	A	SNV	TP53	c.742C > T	p.Arg248Trp	D	53
17	7578406	C	T	SNV	TP53	c.524G > A	p.Arg175His	D	39
17	7578552	G	T	SNV	TP53	c.378C > A	p.Tyr126Ter	D	45
17	7674945	G	A	SNV	TP53	c.586C > T	p.Arg196Ter	D	52
17				DUP	TP53	Exons 2–6 duplication		D
17	41197774	A	T	SNV	BRCA1	c.5513T > A	p.Val1838Glu	D	49
17	41197784	G	A	SNV	BRCA1	c.5503C > T	p.Arg1835Ter	D	47
17	41243788	TAGAC	T	DEL	BRCA1	c.3756_3759delGTCT	p.Ser1253Argfs * 10	D	52
17	41244353	A	AC	INS	BRCA1	c.3193dupG	p.Asp1065Glyfs * 2	D^#	52
17	41245666	T	TGG	INS	BRCA1	c.1879_1880insCC	p.Val627Alafs * 6	D^#	45
17	41246260	C	CT	INS	BRCA1	c.1287dupA	p.Asp430Argfs * 6	D	45
17	41258551	C	A	SNV	BRCA1	c.135-1G > T	p.Phe46_Arg71del	ND^§
17				DUP	BRCA1	Exon 13 duplication		D^†
22	29095862	AG	A	DEL	CHEK2	c.1100delC		D	50
22	29121230	C	T	SNV	CHEK2	c.444 + 1G > A	p.Thr367Metfs	D^†	41

Abbreviations: AA; Amino acid, Alt; Alternative, CDS; Coding sequence, Chr; Chromosome, CV; Complex variant, D; Detected, Del; Deletion, Dup; Duplication, INS; Insertion, Mut; Mutation, ND; Not detected, Pos; Position, Ref; Reference, SNV; Single nucleotide variant, VAF; Variant allele frequency. *The same variant was expected in two samples. #Reported in kConFab depository as c.3194_3195insG, p.Asp1065Glufs * 2 and c.1881_1882insCC, p.Ser628Profs * 5 respectively. §Visible as a single read against ~1200 wild type reads on Integrative Genomics Viewer. †Detected on PV2 only.

Detection of kConFab variants. Abbreviations: AA; Amino acid, Alt; Alternative, CDS; Coding sequence, Chr; Chromosome, CV; Complex variant, D; Detected, Del; Deletion, Dup; Duplication, INS; Insertion, Mut; Mutation, ND; Not detected, Pos; Position, Ref; Reference, SNV; Single nucleotide variant, VAF; Variant allele frequency. *The same variant was expected in two samples. #Reported in kConFab depository as c.3194_3195insG, p.Asp1065Glufs * 2 and c.1881_1882insCC, p.Ser628Profs * 5 respectively. §Visible as a single read against ~1200 wild type reads on Integrative Genomics Viewer. †Detected on PV2 only. Two of the expected events, BRCA1 exon 13 duplication (Fig. 2A) and CHEK2 c.444 + 1G > A (Supplementary Fig. S3A), were only detected using PV2 after not being detected in PV1. This adds to the evidence of improved coverage in PV2 described above. In addition to the BRCA1 duplication, two other expected multi-exonic duplications were successfully detected in MSH2 and TP53 using DECoN (Fig. 2A, Table 2).

Figure 2

Intragenic copy number detection and comparison to commercial panels. (A) Control genomic DNA samples were acquired from kConFab for sensitivity testing. Three of these samples included known exon duplications in BRCA1, TP53 and MSH2, which were assessed by the DeCON tool. Exons are numbered along the x-axis, and those of normal copy number are presented as blue dots. Amplifications are shown in red. The TP53 and AURKB genes are on opposing DNA strands hence the presence of the latter and its exons in this Figure. A similar genetic-overlap is observed for MSH2 to the left of the panel. (B) A commercially available pool of synthetic oligos against a normal genomic background was also obtained. Mutations were provided at variant allele frequencies (VAF) of 5–15% and 15–35%, or at germline frequencies. Presented are the number of detected and missed variants in our PV1 and PV2 panels relative to what was expected in AcroMetrix. This was compared to three other panels [AmpliSeq Cancer Hotspot Panel v2 (CHPv2), Illumina TruSeq Amplicon – Cancer Panel (TSCAP) and TruSight Tumor Panel 26 (TSTP)], the data for which were provided by the AcroMetrix manufacturer. Percent values on the right indicate the proportion of AcroMetrix variants actually targeted by the panels.

Acrometrix oncology hotspot control (synthetic oligos)

PV1 detected 97% (n = 538) of the 555 expected mutations of various types across 53 genes (Supplementary Fig. S1, Supplementary Table S2) in the synthetic oligos. This included 98% of the somatic variants at 5–15% VAF, 97% of the somatic variants at 15–35% VAF, and 97% of germline variants. The addition to PV2 of new probes to cover poorly or non-targeted regions, greatly improved sensitivity, such that 553 (99.6%) of the expected mutations were detected (Figs. 2B and S3B). In comparison to the three other commercial sequencing panels presented in Fig. 2B, our panel had greater breadth, targeting 100% of the AcroMetrix oligo pool, and greater sensitivity, detecting proportionally more expected variants, with AmpliSeq Cancer Hotspot Panel v2 (CHPv2), Illumina’s TruSeq Amplicon – Cancer Panel (TSCAP) and TruSight Tumor Panel 26 (TSTP) detecting 98.8%, 97.1% and 97.4% of their respective targets. With respect to the missed variants using PV2, both of which were expected at low VAFs (5–15%); the HNF1A c.872_873insC, p.G292fs*25 change, occurred in a highly repetitive region involving a poly-C tract, possibly creating alignment difficulties[15] (Supplementary Fig. S3C). This is supported by the insertion being placed either side of the guanine nucleotide as shown in the top and bottom IGV panels of Supplementary Fig. S3C. Potentially compounding difficulties with this particular call, was the presence of HNF1A variant c.864G > C, p.G288G in the background genomic DNA. This variant was detected by both HaplotypeCaller and VarDict in both panel versions, with the former occasionally calling the correct insertion in PV1 at the same time. Consistent with this potential interference of accurate calling, was a large somatic indel (in addition to the substitution) called by VarDict in PV2 at position c.866_914. Thus in a real patient we may accurately identify c.872_873insC. We did detect all other expected indels and hence this is not an intrinsic issue with PV2. Our inability to detect PIK3CA c.1633G > A, p.E545K might reflect the threshold of detection of VarDict, since the variant was observed on IGV with 3.8% VAF (8/209 reads). However, this mutation occurs at a hotspot, with three other nearby mutations in cis all detected at the same VAF (Supplementary Fig. S3D). Thus, while PV2 will not detect 100% of expected variants, its sensitivity of >99% is comparable to commercially available panels. Further, our ability to detect SNVs and indels at various VAFs confirms that our achieved coverage is sufficient for application to somatic cancer analyses. Nonetheless, we next investigated which variants of known clinical significance our panel would have difficulty detecting, using two well established databases; (i) ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/), a catalogue of genomic variation and (ii) the Catalogue Of Somatic Mutations In Cancer (COSMIC; https://cancer.sanger.ac.uk/cosmic). Based on the literature, 20–30x coverage is sufficient for the detection of germline variants[15,40]. Assessment of our targeted regions which recorded mean depths of coverage of <20x, revealed that five targeted mutations of clinical significance in the ClinVar database, would be poorly covered and may not be identified (Supplementary Table S3). These five variants all occurred across three alleles within the same targeted region of the LHB gene. With respect to detecting somatic variants, 100x coverage is sufficient for detecting 80% of expected variants in tumour tissue of at least 55% purity[40,41]. Using 100x as a cut off, 158 targeted mutations encompassing 137 alleles in 23 genes in the COSMIC database may be missed in tumours of lower purity (Supplementary Table S3). These somatic alleles fall under 30 targeted regions of our panel, which when added to the poorly covered region identified in the ClinVar comparison, amounts to <0.5% of the total targeted region of our panel. Improvements could be made in future iterations of our panel design to compensate for these very few insufficiencies, or greater amounts of highly pure input DNA could be used at initial library preparations when searching for variants in these regions using PV2.

Control cell lines

We applied PV1 to breast cancer cell lines SK-BR-3 and BT-474. Both are well characterised with respect to CNVs, and fully annotated data is available through the Broad Institute’s Cancer Cell Line Encyclopedia (CCLE; https://portals.broadinstitute.org/ccle/data)[42]. Heat map analysis qualitatively revealed large concordance between PV1 detected amplifications and deletions, with those expected in the CCLE data for both cell lines (Fig. 3A). These were confirmed quantitatively through regression analysis, with correlation coefficients of 0.79 and 0.83, demonstrating strong relationships between PV1 and CCLE in SK-BR-3 and BT-474 respectively (Fig. 3B,C). The top ten deletions and amplifications detected by PV1 compared well to those expected according to CCLE in both cell lines (Fig. 3D–G), with notable deletions including CDKN2A and CDKN2B, and notable amplifications in ERBB2, CDK12 and MYC. The data highlight the utility of our panel in detecting larger CNVs, with all detected and expected deletions (copy number <1.5) and amplifications (copy number >2.5) presented in Supplementary Fig. S4. It is also noteworthy that our panel detected 100% of the expected SNVs and indels in both cell lines (data not shown).

Figure 3

Copy number evaluation in breast cancer cell lines. (A) Heat map representation of ~300 targeted genes in the SK-BR-3 and BT-474 breast cancer cells lines. Data was extracted from PV1-captured DNA and processed through CNVkit and compared to those expected according to the Broad Institute’s Cancer Cell Line Encyclopedia (CCLE). Copy number deletions are represented in blue and as copy numbers approach diploidy colours become white. Amplification in copy number is represented in red. (B) Regression analysis was carried out using the correlation coefficient to quantify the strength of relationship between PV1 called variants in SK-BR-3 and (C) BT-474, and those expected according to CCLE. An r-value of >0.7 is considered a strong, positive relationship. (D–G) Ten lowest (red) and ten largest CNV (blue) changes as detected by PV1 in SK-BR-3 (D) and BT-474 (F). These were compared to expected values according to CCLE (E,G). Note the log2 y-axes and the gene names on the x-axes. Complete data is presented in Supplementary Fig. S4.

Coriell controls and specificity

As previously stated, the four genomes originally sourced from the Coriell Institute have been extensively characterised by 12 NGS technologies and are used as benchmarks for validation. As per our sensitivity controls above, our panel detected known variants with a sensitivity of >99% in all four samples (Table 3). Precision, which is a measure of reproducibility, was at >97% in all four samples.

Table 3

Sensitivity and precision in well characterised genomes, and specificity measure.

Sample	True positive (TP)	False positive (FP)	True negative (TN)	False negative (FN)	Precision**	Sensitivity***	Specificity****
NA12878	602	16	ND	4	0.974	0.993	ND
NA23485	562	13	ND	1	0.977	0.998	ND
NA24149	580	11	ND	1	0.981	0.998	ND
NA24143	552	12	ND	2	0.979	0.996	ND
AIP*	13	0	88,377	0	1.000	1.000	1.000

Abbreviations: ND; Not done. *AIP gene (993 bp) was Sanger sequenced in 89 FPTS samples alongside application of PV1 and PV2. **Precision calculated by TP/(TP + FP). ***Sensitivity calculated by TP/(TP + FN). ****Specificity (AIP only) calculated by TN/(TN + FP).

Sensitivity and precision in well characterised genomes, and specificity measure. Abbreviations: ND; Not done. *AIP gene (993 bp) was Sanger sequenced in 89 FPTS samples alongside application of PV1 and PV2. **Precision calculated by TP/(TP + FP). ***Sensitivity calculated by TP/(TP + FN). ****Specificity (AIP only) calculated by TN/(TN + FP). The third essential parameter for validation is specificity, which relies on ‘true negative’ detection, such that a second technology, usually Sanger sequencing, is employed for verification. We did not use Sanger sequencing on our four Coriell samples. However, as stated, Sanger sequencing data was available for AIP screening from 89 FPTS patients also screened on our panel. All bases called as wild type in AIP using our panel, were shown to be true, providing a specificity value for our panel of 100% (Table 3). This is comparable to current commercially available panels. Given we only tested one gene which encompasses a target region much smaller than the panel in its entirety (~1 kb), one could reasonably speculate that when taking the whole panel into account, that specificity may drop. However, given the extremely low numbers of false positives and false negatives relative to true positives detected across the entire panel (Table 3), we expect that true negatives will be proportionately high, and that any change in specificity would be negligible. Further, our data for true negatives in AIP was reproducible across a large number of patients.

Application of PV2 to FPTS, OSCC and cSCC samples

Following validation, we tested our ability to detect rare variants (≤1% occurrence in control populations) in three cohorts of patients with different tumour types of the head and neck. Pituitary adenomas are common, with 1:1000 patients presenting with clinically significant disease[29,43]. They are classically associated with gross morbidity encompassing the effects of excess hormone production, vision loss, osteoporosis, diabetes, cardiovascular disease, infertility and disfigurement among others, but their low mortality in general has contributed to poor molecular classification[44]; unlike for OSCC and cSCC below, no extensive online databases are available for pituitary tumours. Germline variants in eight genes have been associated with FPTS (AIP, MEN1, CDKN1B, PRKAR1A, SDHA, SDHB, SDHC and SDHD)[29], and as the occurrence of pathogenic variants is low and we assessed just 18 patients, we included variants of predicted low significance in our data presented herein. SNVs and indels occurring within exons, introns, splice sites and untranslated regions (UTRs) were successfully detected across the genes, except in SDHD (Fig. 4A). Coverage of this gene was a high 479x and hence we postulate that the lack of variants detected was due to the low sample numbers and mutation rarity. Indeed in our previously published data in 44 FPTS patients using the developmental version of this panel, just one predicted benign variant was detected in SDHD[29].

Figure 4

Rare variants are detectable in FPTS, OSCC and cSCC samples, at a similar ratio to online databases. Our validated panel was applied to cohorts of patients with FPTS (n = 18) (A), OSCC (n = 39) (B), and cSCC (n = 27) (C), for the detection of rare (≤1% control population) variations of various categories. Data for OSCC and cSCC was compared to TCGA and Pickering/Inman databases[51,53], with genes arranged in order of most commonly mutated within those databases. Variants were restricted to high and medium impact. No such database exists for pituitary tumours, and genes were restricted to the 8 described in FPTS. Variants of any impact were included in the data presented herein for FPTS. The data presented for our tumour cohorts of OSCC (n = 39; Fig. 4B) and cranio-facial cSCC (n = 24; Fig. 4C), includes variants of high and medium impact only. 8% of worldwide malignancies are mucosal head and neck squamous cell carcinomas, which represent a heterogeneous group of malignancies including OSCC[45]. OSCC is classically associated with alcohol and tobacco abuse in older men[46]. cSCC is associated with UV-induced DNA damage due to sunlight exposure[47]. Its incidence is poorly defined[48], though like for OSCC, it is increasing. Prognosis is poor for OSCC and also for the <5% of cSCC that is metastatic to regional lymph nodes[49-52]. Genetic stratification for both tumours is improving; for OSCC on The Cancer Genome Atlas (TCGA; a collection of >20,000 extensively-characterised primary tumours spanning 33 cancer types) and for cSCC in the literature[51,53]. The genes presented in Fig. 4B,C are ordered according to their mutation rate in TCGA and Pickering et al.[51] and Inman et al.[53]. Our panel identified mutations in these genes at a similar order of occurrence in our cohorts. The exception is for CACNA1C in cSCC which was not targeted by PV2. Consistent with our panel validation data, we were able to detect mutations of various categories, and when taking into account that the TCGA and Pickering/Inman databases used larger cohorts than that used herein, we detected these mutation categories at similar proportions to the databases. Detailed analysis of these three cohorts of data will be presented elsewhere.

Application of our panel to synthetic ctDNA oligos

Having produced a panel with high sensitivity, precision and specificity, and having demonstrated its application to cancers of the head and neck, we tested the utility of our panel in liquid biopsy; an emerging tool in tumour burden monitoring and early detection of cancer or relapse that aids personalised medicine. The commercially available synthetic oligos used were produced at fragment sizes analogous to ctDNA (167 bp)[54], suspended in a substrate similar to human plasma, and provided in aliquots of four different VAFs. We achieved a mean depth of coverage of 2,196x across the aliquots, and detected 8/9 and 6/9 mutations at 5.00% and 1.25% VAF respectively (Table 4). We could not detect any expected variants at 0.625% or 0.125% VAF (Table 4).

Table 4

Detection of synthetic ctDNA oligos.

Gene/mutation	5.00% VAF(2314x mean coverage)	1.25% VAF(2015x mean coverage)	0.625% VAF(2091x mean coverage)	0.125% VAF(2364x mean coverage)
BRAF; p.V600E	Yes	Yes	No	No
EGFR; p.T790M	Yes	No	No	No
EGFR; p.D770_N771insG	Yes	Yes	No	No
EGFR; p.E746_A750del	Yes	Yes	No	No
PIK3CA; p.H1047R	Yes	No	No	No
PIK3CA; p.N1068fs*4	Yes	Yes	No	No
NRAS; p.Q61R	Yes	Yes	No	No
KRAS; p.G12D	Yes	No	No	No
KIT; p.D816V	No	Yes	No	No

Abbreviation: VAF; Variant Allele Frequency.

Detection of synthetic ctDNA oligos. Abbreviation: VAF; Variant Allele Frequency. Shu et al.[54] employed a panel of similar size to ours and detected tumour specific mutations in 87% of 605 ctDNA samples sourced from various cancer types, with VAF >1% at 300–500x depth. Increasing depth of coverage to ~2000x did not significantly alter variant-detectability[54]. However, in early stage cancer ctDNA can exist at levels around 0.1% VAF, and thus for early detection of cancer initiation or recurrence, previous studies sought to achieve a 10,000x coverage; a target both expensive to achieve and prone to sequencing errors and false positives[54-57]. New technologies have been developed to identify and reduce sequencing errors to enable ultra-deep sequencing of ctDNA[54-59], but to aid this economically, smaller sequencing panels may need to be developed. Early data suggest our panel’s applicability for the monitoring of established cancers shedding higher quantities of ctDNA. To detect earlier cancers, or variants in tumours known to shed less ctDNA (eg glioma and renal cell carcinoma)[60], modifications would need to be made such as reducing the breadth of our target region to common cancer genes or to mutational hotspots, or adopting some of the new, emerging technologies[54-59]. In conclusion, we have developed a targeted gene panel for application to disparate cancer types that encompasses 451 cancer-associated genes. The novelty of our panel is that compared to many commercially available products, PV2 targets genes in their entirety and not just select exons or mutational hotspots. Additionally, our panel targets a broad range of pituitary tumour-associated genes. We validated our panel by assessing its analytical sensitivity, precision and specificity. This was achieved through application to 47 well-characterised controls, including 4 that are routinely used for benchmarking. Our sensitivity and specificity results were aligned to commercially available panels and early data suggest promise in liquid biopsy, an emerging tool in precision oncology. Finally, we applied our validated panel to cohorts of patients with pituitary tumours and OSCC and cSCC. We were able to detect variants of various categories and for OSCC and cSCC, at similar proportions to those expected, suggesting that our panel would be beneficial to patients with poorly studied cancers of the head and neck, and beyond.

57 in total

1. Trastuzumab in combination with chemotherapy versus chemotherapy alone for treatment of HER2-positive advanced gastric or gastro-oesophageal junction cancer (ToGA): a phase 3, open-label, randomised controlled trial.

Authors: Yung-Jue Bang; Eric Van Cutsem; Andrea Feyereislova; Hyun C Chung; Lin Shen; Akira Sawaki; Florian Lordick; Atsushi Ohtsu; Yasushi Omuro; Taroh Satoh; Giuseppe Aprile; Evgeny Kulikov; Julie Hill; Michaela Lehle; Josef Rüschoff; Yoon-Koo Kang
Journal: Lancet Date: 2010-08-19 Impact factor: 79.321

2. High prevalence of pituitary adenomas: a cross-sectional study in the province of Liege, Belgium.

Authors: Adrian F Daly; Martine Rixhon; Christelle Adam; Anastasia Dempegioti; Maria A Tichomirowa; Albert Beckers
Journal: J Clin Endocrinol Metab Date: 2006-09-12 Impact factor: 5.958

3. FACTERA: a practical method for the discovery of genomic rearrangements at breakpoint resolution.

Authors: Aaron M Newman; Scott V Bratman; Henning Stehr; Luke J Lee; Chih Long Liu; Maximilian Diehn; Ash A Alizadeh
Journal: Bioinformatics Date: 2014-08-20 Impact factor: 6.937

Review 4. Cutaneous squamous cell carcinoma: Incidence, risk factors, diagnosis, and staging.

Authors: Syril Keena T Que; Fiona O Zwald; Chrysalyne D Schmults
Journal: J Am Acad Dermatol Date: 2018-02 Impact factor: 11.527

5. Sanger Confirmation Is Required to Achieve Optimal Sensitivity and Specificity in Next-Generation Sequencing Panel Testing.

Authors: Wenbo Mu; Hsiao-Mei Lu; Jefferey Chen; Shuwei Li; Aaron M Elliott
Journal: J Mol Diagn Date: 2016-10-06 Impact factor: 5.568

6. Detection of circulating tumor DNA in early- and late-stage human malignancies.

Authors: Chetan Bettegowda; Mark Sausen; Rebecca J Leary; Isaac Kinde; Yuxuan Wang; Nishant Agrawal; Bjarne R Bartlett; Hao Wang; Brandon Luber; Rhoda M Alani; Emmanuel S Antonarakis; Nilofer S Azad; Alberto Bardelli; Henry Brem; John L Cameron; Clarence C Lee; Leslie A Fecher; Gary L Gallia; Peter Gibbs; Dung Le; Robert L Giuntoli; Michael Goggins; Michael D Hogarty; Matthias Holdhoff; Seung-Mo Hong; Yuchen Jiao; Hartmut H Juhl; Jenny J Kim; Giulia Siravegna; Daniel A Laheru; Calogero Lauricella; Michael Lim; Evan J Lipson; Suely Kazue Nagahashi Marie; George J Netto; Kelly S Oliner; Alessandro Olivi; Louise Olsson; Gregory J Riggins; Andrea Sartore-Bianchi; Kerstin Schmidt; le-Ming Shih; Sueli Mieko Oba-Shinjo; Salvatore Siena; Dan Theodorescu; Jeanne Tie; Timothy T Harkins; Silvio Veronese; Tian-Li Wang; Jon D Weingart; Christopher L Wolfgang; Laura D Wood; Dongmei Xing; Ralph H Hruban; Jian Wu; Peter J Allen; C Max Schmidt; Michael A Choti; Victor E Velculescu; Kenneth W Kinzler; Bert Vogelstein; Nickolas Papadopoulos; Luis A Diaz
Journal: Sci Transl Med Date: 2014-02-19 Impact factor: 17.956

7. Mutational landscape of aggressive cutaneous squamous cell carcinoma.

Authors: Curtis R Pickering; Jane H Zhou; J Jack Lee; Jennifer A Drummond; S Andrew Peng; Rami E Saade; Kenneth Y Tsai; Jonathan L Curry; Michael T Tetzlaff; Stephen Y Lai; Jun Yu; Donna M Muzny; Harshavardhan Doddapaneni; Eve Shinbrot; Kyle R Covington; Jianhua Zhang; Sahil Seth; Carlos Caulin; Gary L Clayman; Adel K El-Naggar; Richard A Gibbs; Randal S Weber; Jeffrey N Myers; David A Wheeler; Mitchell J Frederick
Journal: Clin Cancer Res Date: 2014-10-10 Impact factor: 12.531

8. Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and genome analyzer systems.

Authors: André E Minoche; Juliane C Dohm; Heinz Himmelbauer
Journal: Genome Biol Date: 2011-11-08 Impact factor: 13.583

9. CNVkit: Genome-Wide Copy Number Detection and Visualization from Targeted DNA Sequencing.

Authors: Eric Talevich; A Hunter Shain; Thomas Botton; Boris C Bastian
Journal: PLoS Comput Biol Date: 2016-04-21 Impact factor: 4.475

10. Integrated digital error suppression for improved detection of circulating tumor DNA.

Authors: Aaron M Newman; Alexander F Lovejoy; Daniel M Klass; David M Kurtz; Jacob J Chabon; Florian Scherer; Henning Stehr; Chih Long Liu; Scott V Bratman; Carmen Say; Li Zhou; Justin N Carter; Robert B West; George W Sledge; Joseph B Shrager; Billy W Loo; Joel W Neal; Heather A Wakelee; Maximilian Diehn; Ash A Alizadeh
Journal: Nat Biotechnol Date: 2016-03-28 Impact factor: 54.908

5 in total

Review 1. Comprehensive comparison of theranostic nanoparticles in breast cancer.

Authors: Amin Nikdouz; Nima Namarvari; Ramin Ghasemi Shayan; Arezoo Hosseini
Journal: Am J Clin Exp Immunol Date: 2022-02-15

2. The MURAL collection of prostate cancer patient-derived xenografts enables discovery through preclinical models of uro-oncology.

Authors: Gail P Risbridger; Ashlee K Clark; Laura H Porter; Mitchell G Lawrence; Renea A Taylor; Roxanne Toivanen; Andrew Bakshi; Natalie L Lister; David Pook; Carmel J Pezaro; Shahneen Sandhu; Shivakumar Keerthikumar; Rosalia Quezada Urban; Melissa Papargiris; Jenna Kraska; Heather B Madsen; Hong Wang; Michelle G Richards; Birunthi Niranjan; Samantha O'Dea; Linda Teng; William Wheelahan; Zhuoer Li; Nicholas Choo; John F Ouyang; Heather Thorne; Lisa Devereux; Rodney J Hicks; Shomik Sengupta; Laurence Harewood; Mahesh Iddawala; Arun A Azad; Jeremy Goad; Jeremy Grummet; John Kourambas; Edmond M Kwan; Daniel Moon; Declan G Murphy; John Pedersen; David Clouston; Sam Norden; Andrew Ryan; Luc Furic; David L Goode; Mark Frydenberg
Journal: Nat Commun Date: 2021-08-19 Impact factor: 14.919

3. HGNChelper: identification and correction of invalid gene symbols for human and mouse.

Authors: Sehyun Oh; Jasmine Abdelnabi; Ragheed Al-Dulaimi; Ayush Aggarwal; Marcel Ramos; Sean Davis; Markus Riester; Levi Waldron
Journal: F1000Res Date: 2020-12-21

4. Cancer Type Classification in Liquid Biopsies Based on Sparse Mutational Profiles Enabled through Data Augmentation and Integration.

Authors: Alexandra Danyi; Myrthe Jager; Jeroen de Ridder
Journal: Life (Basel) Date: 2021-12-21

Review 5. Shifting Gears in Precision Oncology-Challenges and Opportunities of Integrative Data Analysis.

Authors: Ka-Won Noh; Reinhard Buettner; Sebastian Klein
Journal: Biomolecules Date: 2021-09-04

5 in total