Literature DB >> 31415627

A research-based gene panel to investigate breast, ovarian and prostate cancer genetic risk.

Madison R Bishop1,2, Anna L W Huskey1,2, John Hetzel2, Nancy D Merner1,2.   

Abstract

There is a need to investigate and better understand the inherited risk of cancer to ensure that clinical applications provide more accurate assessments and management strategies. Developing research-based next-generation sequencing gene panels that not only target (present-day) clinically actionable susceptibility genes but also genes that currently lack sufficient evidence for risk as well as candidate genes, such as those in DNA repair pathways, can help aid this effort. Therefore, gene panel B.O.P. (Breast, Ovarian, and Prostate) was developed to evaluate the genetic risk of breast, ovarian and/or prostate cancer, and this manuscript serves as an introduction to B.O.P. and highlights its initial analytical validity assessment. B.O.P targets 87 genes that have been suggested, predicted, or clinically proven to be associated with breast, ovarian, and/or prostate cancer risk using Agilent Technologies Haloplex probes. The probes were designed for 100 base pair reads on an Illumina platform and target both coding and non-coding exons as well as 10 intronic base pairs flanking the intron-exon boundaries. The initial B.O.P screening involved 43 individuals from the Alabama Hereditary Cancer Cohort, and an average sequencing depth of 809X was obtained. Upon variant filtering and validation, true positives had an average sequencing depth of 659X and allele balance of 0.51. The average false positive sequencing depth was 34X and allele balance was 0.33. Although low sequencing depth was not always indicative of a false positive, high sequencing depths (>100X) signified a true positive. Furthermore, sensitivity and specificity of BRCA1/2 were calculated to be 100% and 92.3%, respectively. Overall, this screening enabled the establishment of criteria to alleviate future validation efforts and strongly supports the use of B.O.P. to further elucidate hereditary cancer susceptibility. Ultimately, continued B.O.P. screening will provide insights toward the genetic risk of and overlap between breast, ovarian, and/or prostate cancer.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31415627      PMCID: PMC6695138          DOI: 10.1371/journal.pone.0220929

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Gene panels enable the simultaneous screening of a number of genes. Panels are typically customized for specific screening purposes; thus, the genes (and/or specific gene regions) on such panels are unique to the screening goals. In recent years, with technological sequencing advances, panel-based screening has become extremely efficient and cost-effective. These advancements involve the targeted enrichment of selected genes followed by massively parallel sequencing, which is also known as next-generation sequencing (NGS) [1, 2]. NGS gene panels have been implemented into clinical practice to assess inherited risk of cancer [1-3]. These panels include clinically valid genes for which clinical management guidelines have been established, such as genetic risk assessment criteria and mutation-positive management strategies. In the U.S., the National Comprehensive Cancer Network (NCCN) provides such guidelines to maximize clinical utility [4]. The American College of Medical Genetics and Genomics (ACMG) has established clinical laboratory standards for NGS gene panels but, ultimately, these panels are regulated by the Clinical Laboratory Improvement Amendments (CLIA), a federal program that certifies and oversees clinical laboratory testing [5, 6]. CLIA primarily assesses analytical validity–the accuracy of mutation detection–in order to maintain quality standards and ensure the effectiveness of each laboratory test [6]. CLIA does not regulate research-based genetic testing, but similar analytical assessments can be carried out to ensure accurate mutation detection in a research setting. We have developed a research-based gene panel, B.O.P. (reast, varian, and rostate), to assess inherited risk of hereditary breast cancer (BC) and associated cancers. B.O.P. is an exploratory gene panel. In addition to targeting clinically valid genes that have NCCN management strategies, it also targets genes that have been suggested to be associated with an increased risk but currently lack sufficient evidence, as well as candidate genes, such as those in DNA repair pathways. Therefore, the ultimate goal in utilizing this panel is to better elucidate risk. Regarding hereditary BC, NCCN clinically valid genes only account for a minority of the associated genes reported in the literature [1, 7, 8]. Furthermore, NCCN risk management strategies have primarily been developed for overtly pathogenic, truncation mutations in clinically valid genes–resulting in the detection of many variants of unknown significance (VUS), and clinically valid mutations explain less than 30% of hereditary BC cases. Additional exploration is critical to fill these knowledge gaps, and B.O.P. can aid in this investigation. However, prior to using B.O.P. as a way to increase knowledge in these areas, it must be evaluated for its ability to accurately detect variants. The purpose of this manuscript is to introduce B.O.P., present the analytical assessment of 10 NCCN regulated genes (in order to ensure the accurate detection of clinically relevant variants), and discuss the future potential of the panel.

Materials and methods

Ethical compliance and informed consent

All procedures performed in studies involving human participants were in accordance with the ethical standards of Auburn University and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. Specifically, this research was reviewed and approved by the Auburn University Institutional Review Board for the recruitment, enrollment and biobanking of the Alabama Hereditary Cancer Cohort (AHCC; IRB protocols 14–232, 14–335, and 15–111) [9]. Informed consent was obtained in writing from all individual participants included in the study.

Panel design

B.O.P. targets ~500 kilobases (Kb) of DNA including 87 genes that are suggested, predicted, or clinically proven to be associated with BC, OvC, and/or prostate cancer (PC) risk (S1 Table). Agilent Technologies Haloplex probes were designed using Agilent Technologies SureDesign software (https://earray.chem.agilent.com/suredesign/). The “Advanced HaloPlex” design allowed for the selection of the desired genes of which the targeted regions included both coding and non-coding exons as well as 10 intronic base pairs flanking the intron-exon boundaries. The probe set was designed for 100 base pair reads on an Illumina platform. Overall, probes were predicted to cover 98.93% of the targeted genes/regions (Table 1).
Table 1

Ten clinically relevant genes on the B.O.P. panel assessed for analytical validity.

Gene or targeted regionsAccession number# of targeted regionsSize (bp)Predicted target 1X coverage* (%)Average sequencing depth (X)Interquartile range (X)% bases covered greater than or equal to:
First quartileMedianThird quartile1X10X20X50X100X250X500X1000X10000X
Genes investigated for analytical validationATMNM_000051651554598.0781336659106897.595.793.687.478.360.041.020.80.8
BRCA1NM_00730024775098.51017397787127198.197.195.690.581.965.247.427.21.1
BRCA2NM_000059281207899.1960445803122598.797.896.591.683.566.648.125.71.1
CDH1NM_00436016526998.9934418834124298.597.195.591.183.466.348.326.61.0
CHEK2NM_00100573523460596.5726279588102395.893.491.384.574.756.638.518.90.7
NBNNM_00248522668198.069631261095997.696.093.886.676.857.938.017.90.6
PALB2NM_024675134318100.01001557876126199.999.098.094.286.870.952.328.31.1
PTENNM_000314101024898.059722146885396.993.290.081.370.650.831.314.10.5
STK11NM_000455103476100.050517538777493.289.085.977.667.546.528.311.50.3
TP53NM_00054614421699.0788316676103997.494.692.687.377.759.341.721.40.8
All targeted B.O.P. regions1417499,521 98.9809359687109298.296.694.788.879.961.943.021.60.8

*Based on design report by Agilent Technologies

*Based on design report by Agilent Technologies

Capture and sequencing

The genomic DNA of 43 cancer–affected individuals (23 African American [AA] and 20 European American [EA]) from the AHCC [9] was selected to undergo the first B.O.P. screening (Figs 1 and 2). Two study participants (1CAD-a and 1CAD-f) were knowingly related (first cousins). The HaloPlex HS Target Enrichment System For Illumina Sequencing Protocol (Version C0, December 2015) was followed for the targeted-capture, allowing each of the 43 samples to be uniquely barcoded/indexed, individually captured, and pooled in equimolar amounts for Illumina paired-end sequencing. One pooled sample with a final concentration of 24.13 nanomoles/liter (and DNA fragments ranging from 175 to 625 base pairs) was sent for sequencing on one lane of a flow cell on an Illumina HiSeq 2500 at the Genomic Services Laboratory (GSL) at HudsonAlpha Institute for Biotechnology. The final DNA quality/quantity of the pooled sample was assessed using the High Sensitivity DNA kit using the ABI 2100 Bioanalyzer.
Fig 1

Screening process for individuals in the AHCC.

*In situations where blood is unattainable, another set of three distinct biological samples (i.e. saliva) is collected.

Fig 2

Pipeline for B.O.P. panel screening: Targeted capture, next-generation sequencing, bioinformatics pipeline, and variant filtering.

Screening process for individuals in the AHCC.

*In situations where blood is unattainable, another set of three distinct biological samples (i.e. saliva) is collected.

Bioinformatics analyses

The sequencing data generated for each indexed sample (43 forward and 43 reverse FASTQ files) were downloaded using the GSL’s wget downloader (Fig 2). Trimmomatic (v.0.35) was used to trim the unique barcodes and Illumina adaptors. After generating trimmed FASTQ files, FastQC (v.0.10.1) was used to ensure that the repeated sequences had been trimmed from the sequences. The trimmed, paired forward and reverse FASTQ files were then aligned to the soft-masked human reference genome (GRCh38/hg38) using Burrows-wheeler Aligner (BWA v.0.7.12), generating SAM (Sequence Alignment/Mapping) files, which were then compressed into BAM (binary SAM) files and sorted using SAMtools (v.1.2). PicardTools (v.1.79) was used to add read groups and then index the sorted, compressed BAM files along with realigning insertions and deletions (indels). As previously suggested for Haloplex, duplicates were not marked or removed [10]. Variants in B.O.P targeted-regions were called from the sorted BAM files using the HaplotypeCaller tool in the Genome Analysis Tool Kit (GATK; v.3.4–46); the generated VCF (Variant Calling Format) files were then merged using Tabix (v.0.2.6) and VCFTools (v.0.1.12a). Variants in the merged VCF files were annotated using ANNOVAR (June2017). Overall, this pipeline was adapted from the GATK Best Practice Pipeline [11]. Lastly, Samtools flagstat (v.1.2) was used to gather metrics of the analyzed reads, and the DepthOfCoverage tool within the GATK (v.3.4–46) was used to calculate the depth of coverage for the targeted regions.

Analytical assessment

Ten of the 87 genes were selected for our initial analytical validation efforts (Table 1). The selected genes represented clinically actionable BC susceptibility genes in order to facilitate analytical validity calculations, as well as to begin to determine the landscape of potential clinically significant variants in the AHCC. Using Exome Variant Server (EVS) as a control repository, B.O.P. variants in those 10 genes were filtered for minor allele frequencies (MAFs) of less than or equal to 2% in both EAs and AAs [12]. Variants were further filtered; all coding variants as well as intronic variants that were located within 10 base pairs of an intron-exon boundary were carried through for validation using polymerase chain reactions (PCR) and Sanger sequencing (Fig 1). Primer sequences and amplification conditions are available upon request. P values and Odds ratios (ORs) were calculated using Fisher exact test in R (v 3.5.1), which were not adjusted for multiple testing. Upon consent and enrollment into the AHCC, study participants provided information about previous clinical genetic screening [9]. Thus, for genes with clinical screening results provided by the 43 participants involved in this initial B.O.P. screening, sensitivity and specificity were calculated. Sensitivity was defined as the total number of true positives (TPs) divided by the sum of the total number of TPs and false negatives (FNs; TPs/ [TPs + FNs]). TPs were defined as (i) variants that had been previously identified through clinical gene screening, initially confirmed in the research laboratory by PCR and Sanger sequencing, and, subsequently, detected upon B.O.P. screening, or, (ii) in the case of no clinical screening results, as variants detected upon B.O.P. screening and then validated through PCR and Sanger sequencing (Figs 1 and 2). FNs were variants that had been previously identified through clinical gene screening and confirmed in the research laboratory by PCR and Sanger sequencing but not detected through B.O.P. screening. Specificity was defined as the ratio of the total number of true negatives (TNs) over the sum the total number of TNs and false positives (FPs; TNs / [TNs + FPs]). TNs were defined as individuals who did not have a pathogenic variant detected through clinical gene screening as well as B.O.P. screening. FPs were defined as variants detected through B.O.P. screening but were not validated upon subsequent PCR and Sanger sequencing (Figs 1 and 2). Lastly, the false discovery rate (FDR; FP / [TP + FP]) was calculated for the 10 genes individually. FDRs were calculated in two ways, (i) including all B.O.P. called variants, and (ii) excluding B.O.P. variants with an allele balance less than or equal to 0.20.

Results

The number of reads that passed QC assessment per individual averaged 11.3 million (M), with 98.6% of those reads mapping to the human genome. On average, 50.9% of the reads mapped to B.O.P. targeted regions (S2 Table). The average sequencing depth for all targeted base pairs was 809X; however, sequencing depth was not uniform with a large interquartile range (Table 1). A probe design report provided by Agilent Technologies predicted that 98.9% of the targeted base pairs would be covered at least 1X, which was similar to the actual coverage of 98.2%; thus, 1.8% of the targeted base pairs were not covered at all (Table 1). The 10 assessed genes had average sequencing depths that ranged from 505X-1017X (Table 1). Furthermore, assessment of the 225 different regions that targeted those 10 genes revealed that the majority had average sequencing depths between 800-899X but ranged from 68X-2053X (Table 2; S3 Table). Although rare, regions with average sequencing depths less than 100X missed on average 24.3% of the targeted base pairs and only covered 52.2%, 34.6%, and 21.3% of targeted base pairs at or greater than 20X, 50X, and 100X, respectively (Table 2; S3 Table). Regions with the highest average sequencing depth, 1500X or greater, had over 99% and 96.5% of the targeted base pairs covered at least 50X and 100X, respectively. However, 28.0% of the 225 regions-of-focus did not, on average, obtain 100% coverage at 1X, which included regions with average sequencing depths ranging from 68X-1354X (Table 2; S3 Table).
Table 2

Average sequencing depth analyses of the 225 regions targeting the ten B.O.P. genes being assessed.

Average region sequencing depth (X)# of regionsAverage % bases covered greater than or equal to:
1X10X20X50X100X250X500X1000X10000X
<100275.763.252.234.621.37.31.80.90.0
100–199484.475.969.154.839.515.65.02.30.0
200–299791.588.882.670.154.926.18.12.70.0
300–3991297.093.188.277.964.538.516.55.00.0
400–4992297.895.692.883.271.847.924.87.00.1
500–5992198.496.994.887.576.056.732.911.10.2
600–6992799.398.296.289.679.259.337.914.90.5
700–7992899.197.896.691.181.664.542.719.10.7
800–8992999.698.998.093.885.268.849.424.10.9
900–99913100.099.799.296.388.673.254.027.81.2
1000–10991199.899.498.996.189.474.357.631.41.3
1100–11991299.599.198.195.189.072.457.234.61.4
1200–12991499.899.799.497.291.477.261.837.61.6
1300–1399399.899.799.698.793.680.668.341.32.0
1400–14997100.0100.099.898.894.781.668.244.21.7
>150013100.0100.099.999.296.584.872.451.32.2
Upon variant annotation (Fig 2), a total of 24,915 variants (2,858 unique) were called. After filtering for variants in the 10 genes (Table 1), a total of 1960 (287 unique) remained, 74 (56 unique) of which had MAFs less than 2% in both ethnicities (Table 3; Fig 2). A total of 61 of the 74 variants were validated and classified as TPs, averaging a sequencing depth of 659X and an allele balance of 0.51; this included 100% of the variants in seven out of the 10 genes (ATM, BRCA2, CHEK2, NBN, PALB2, STK11, and TP53), resulting in FDRs of 0 (Table 3). BRCA1 and CDH1 each had one FP, and PTEN had the highest FDR with 11 FPs, all in intron7/exon8 (Table 3). Despite that the average FP sequencing depth was 34X (ranging from 12X-63X), sequencing depths of the three regions harboring FPs revealed that all achieved an average greater than 427X (Table 3 and S3 Table). The average FP allele balance was 0.33, ranging from 0.13–0.68.
Table 3

Summary of called variants after bioinformatics pipeline and variant filtering.

Gene NameChrStart positionRef. AlleleAlt. AlleleFunctionExon/ IntronDNA ChangeAmino Acid ChangePolyphen2 prediction~EA EVS#AA EVS#CLINVAR^Number of individuals with variant calledIndividual/ SampleEthnicityGTGQTotal DepthAllele balance≥ 100X Depth AND 0.40 Allele BalanceValidation results (TP/FP)Percent validatedFDRSensitivitySpecificity
Including all called variantsExcluding variants with ≤ 0.20 allele balance
ATM (NM_000051)chr11108227849CGNSexon 3c.146C>Gp.S49CP0.01360.0027Risk Factor11CAI-aAAHet998300.48YesTP100%0.000.00N/AN/A
chr11108229171CTIntronicintron 3c.186-7C>TN/A..0.0142Benign14CA-aAAHet9916850.43YesTP
chr11108244860CTNSexon 7c.735C>Tp.V245V.0.01340.0023Likely Benign11ED-aEAHet999070.50YesTP
chr11108248927TGIntronicintron 8c.1066-6T>GN/A.0.00260.0002VUS11CBE-aEAHet99790.43NoTP
chr11108249096TCNSexon 9c.1229T>Cp.V410AB0.00220.0009VUS11EE-aEAHet993760.64YesTP
chr11108251973TCNSexon 11c.1744T>Cp.F582LB0.0009.Likely Benign11EAD-aEAHet992400.58YesTP
chr11108253901TCSexon 13c.1986T>Cp.F662F.0.0006.Likely Benign11EAC-aEAHet993310.54YesTP
chr11108254034TCNSexon 13c.2119T>Cp.S707PB0.01090.0039Likely Benign11CBC-aAAHet995810.41YesTP
chr11108259051CANSexon 16c.2442C>Ap.D814EB0.00010.0198Likely Benign11EBA-aAAHet991950.37NoTP
chr11108267276TCNSexon 17c.2572T>Cp.F858LP0.0120.0032Benign11EAJ-aEAHet992270.47YesTP
chr11108284478GTIntronicintron 26c.3993+5G>TN/A..0.0098Likely Benign11CAD-fAAHet993680.59YesTP
chr11108315883GANSexon 41c.6067G>Ap.G2023RD0.00310.0007VUS11CG-aEAHet9917250.54YesTP
chr11108315904AGNSexon 41c.6088A>Gp.I2030VB.0.0148Likely Benign11CAF-aAAHet997450.43YesTP
chr11108317409GANSexon 43c.6235G>Ap.V2079IB0.00060.0166Benign11EA-aEAHet999970.51YesTP
chr11108327713GASexon 48c.7044G>Ap.T2348T..0.0014Likely Benign11CAC-aAAHet991030.59YesTP
BRCA1 (NM_007300)chr1743049113AGIntronicintron 22c.5469+8T>CN/A..0.0145Likely Benign23CC-aAAHet9917560.45YesTP91.67%0.080.00100%92.3%
4CA-aAAHet9913430.51YesTP
chr1743051071ACNSexon 21c.5387T>Gp.M1796RD..Pathogenic11CAD-aAAHet991140.60YesTP
chr1743070958CTNSexon 16c.5019G>Ap.M1673IB0.01520.002VUS11EA-aEAHet994570.49YesTP
chr1743091492TCNSexon 10c.4039A>Gp.R1347GB0.00670.0011VUS11EAC-aEAHet9915790.49YesTP
chr1743092362TCNSexon 10c.3169A>Gp.S1057GB..VUS11CE-aAAHet991310.51YesTP
chr1743092509TCNSexon 10c.3022A>Gp.M1008VB.0.0023VUS11EAG-aAAHet991870.42YesTP
chr1743093035TASexon 10c.2496A>Tp.P832P...Likely benign11CD-aAAHet9914560.48YesTP
chr1743093626AGSexon 10c.1905T>Cp.N635N..0.0005Likely benign11CF-aAAHet992640.52YesTP
chr1743094408GTNSexon 10c.1123C>Ap.L375IP...11CG-aEAHet99630.13NoFP
chr1743097280GTNSexon 7c.557C>Ap.S186YD.0.0068VUS23CC-aAAHet998140.50YesTP
4CA-aAAHet994070.50YesTP
BRCA2 (NM_000059)chr1332332629CTNSexon 10c.1151C>Tp.S384FD0.00150.0002VUS11CD-aAAHet9912290.52YesTP100%0.000.00
chr1332332753AGSexon 10c.1275A>Gp.E425E..0.0098Benign11CAI-aAAHomo9910461.00YesTP
chr1332333266TCSexon 10c.1788T>Cp.D596D.0.00030.0163Likely Benign21CCB-aAAHet9911150.51YesTP
1EB-aAAHet993470.50YesTP
chr1332333276TCNSexon 10c.1798T>Cp.Y600HB.0.0055VUS11CAG-aAAHet99720.42NoTP
chr1332333395TG-Intronicintron 10c.1909+8delTGN/A.0.00050.0059VUS11CAG-aAAHet99190.58NoTP
chr1332339375AGNSexon 11c.5020A>Gp.S1674GB..VUS21CAD-aAAHet993440.61YesTP
1CAD-fAAHet9912940.47YesTP
chr1332339554CTSexon 11c.5199C>Tp.S1733S.0.00540.0009Likely Benign11CBE-aEAHet9913980.47YesTP
chr1332339966_ 32339970AGTAA-FSDexon 11c.5611_5615 delAGTAA$p.S1871fs$...Pathogenic11CB-aAAHet992790.57YesTP
chr1332340678GANSexon 11c.6323G>Ap.R2108HB0.00150.0068VUS11CAH-aAAHomo991,0901.00YesTP
chr1332357750GASexon 16c.7626G>Ap.T2542T..0.0061Likely Benign11EAH-aAAHet99360.33NoTP
chr1332363385TCNSexon 18c.8183T>Cp.V2728AP..VUS11CAD-fAAHet9910010.52YesTP
chr1332371035ACNSexon 20c.8567A>Cp.E2856AD0.0020.0005VUS11CAB-aEAHet99470.55NoTP
CDH1 (NM_004360)chr1668801830AGSexon 3c.324A>Gp.R108R.0.00030.0061Likely Benign21CAF-aAAHet991,2100.49YesTP75%0.250.00N/AN/A
1CBH-aAAHet991200.48YesTP
chr1668813324GTNSexon 9c.1149G>Tp.Q383HB...11EE-aEAHet99200.20NoFP
chr1668819394GCSexon 11c.1680G>Cp.T560T.0.00290.0007Likely Benign11CAI-aAAHet993,0530.53YesTP
CHEK2 (NM_001005735)chr2228695232AGNSexon 13c.1399T>Cp.Y467HD0.0003.VUS11EA-aEAHet995670.47YesTP100%0.000.00N/AN/A
NBN (NM_002485)chr889955458TCNSexon 10c.1222A>Gp.K408EP.0.0082Likely Benign11CC-aAAHet9913410.53YesTP100%0.000.00N/AN/A
chr889980833AGSexon 4c.381T>Cp.A127A.0.00450.0018Likely Benign11EAJ-aEAHet991640.45YesTP
chr889984520CTIntronicintron 1c.37+5G>AN/A.0.0010.0182Likely Benign21CAH-aAAHet991770.54YesTP
1CC-aAAHet991090.53YesTP
PALB2 (NM_024675)chr1623629898TCSexon 5c.2256A>Gp.G752G..0.0055Likely Benign11CCB-aAAHet9911520.51YesTP100%0.000.00N/AN/A
chr1623635127TGSexon 4c.1419A>Cp.P473P..0.0084Likely Benign11CAI-aAAHet999550.49YesTP
chr1623635536AGNSexon 4c.1010T>Cp.L337SB0.01970.0036VUS11EG-aEAHet992810.52YesTP
chr1623638125TCNSexon 2c.53A>Gp.K18RD.0.0155VUS21CC-aAAHet992760.49YesTP
1EAG-aAAHet991110.42YesTP
PTEN (NM_000314)chr1087931070CTSexon 4c.234C>Tp.T78T..0.0002Likely Benign11CA-aEAHet993390.53YesTP8.33%0.920.92N/AN/A
chr1087960892ATIntronicintron 7c.802-2A>TN/A...Pathogenic41CAF-aAAHet91270.30NoFP
1CD-aAAHet99450.29NoFP
1CF-aAAHet96410.46NoFP
1EAJ-aEAHet86120.42NoFP
chr1087960896CANSexon 8c.804C>Ap.D268EB..Not provided31CC-aAAHet99320.34NoFP
1EE-aEAHet83340.24NoFP
3CC-aAAHet99630.27NoFP
chr1087960902GTNSexon 8c.810G>Tp.M270IP...31CAB-aEAHet90190.32NoFP
1CAF-aAAHet90250.32NoFP
1CC-aAAHet41340.68NoFP
chr1087960906CTNSexon 8c.814C>Tp.H272YD...11CC-aAAHet89310.29NoFP
STK11 (NM_000455)chr191218495GASexon 2c.369G>Ap.Q123Q.0.00020.0154Likely Benign51CAG-aAAHet991070.46YesTP100%0.000.00N/AN/A
1CBH-aAAHet99810.57NoTP
1CE-aAAHet99880.49NoTP
1EBA-aAAHet998490.51YesTP
1EB-aAAHet999040.49YesTP
TP53 (NM_000546)chr177670613ACNSexon 10c.1096T>Gp.S366AB..VUS11EBA-aAAHet998630.43YesTP100%0.000.00N/AN/A
chr177673776GANSexon 8c.844C>Tp.R282WD0.0002.Pathogenic11EC-aAAHet992400.28NoTP

Table 3 Key: (Chr) Chromosome; (Ref.) Reference; (Alt.) Alternate; (NS) Nonsynonymous; (S) Synonymous; (FSD) Frame-shift deletion; (VUS) Variant of Uncertain Significance; (EA) European American; (AA) African American

(~) Polyphen2 HDIV prediction; (B) Benign; (P) Probably damaging; (D) Damaging

(#) esp6500siv2

(^) most severe clinical significance classification; (GT) Genotype; (GQ) Genotype quality; (Het) Heterozygous; (Homo) Homozygous; (FDR) False discovery rate; (TP) True positive; (FP) False positive

($) The deletion was named using ANNOVAR (v.); however, it is within a short tandem repeat and commonly referred to as BRCA2 c.5616_5620del5 (p.K1872Nfs) since Human Genome Variant Society (HGVS) nomenclature rules state to arbitrarily assign the deletion to the most 3’ nucleotide.

Table 3 Key: (Chr) Chromosome; (Ref.) Reference; (Alt.) Alternate; (NS) Nonsynonymous; (S) Synonymous; (FSD) Frame-shift deletion; (VUS) Variant of Uncertain Significance; (EA) European American; (AA) African American (~) Polyphen2 HDIV prediction; (B) Benign; (P) Probably damaging; (D) Damaging (#) esp6500siv2 (^) most severe clinical significance classification; (GT) Genotype; (GQ) Genotype quality; (Het) Heterozygous; (Homo) Homozygous; (FDR) False discovery rate; (TP) True positive; (FP) False positive ($) The deletion was named using ANNOVAR (v.); however, it is within a short tandem repeat and commonly referred to as BRCA2 c.5616_5620del5 (p.K1872Nfs) since Human Genome Variant Society (HGVS) nomenclature rules state to arbitrarily assign the deletion to the most 3’ nucleotide. Though not optimal, low sequencing depth was not always indicative of a FP. Of the 20 variants covered less than 100X, 13 were FPs, and 7 were TPs with average sequencing depth of 60X and allele balance of 0.48, ranging from 0.33–0.58 (Table 3). In contrast, higher coverage, such as sequencing depth greater than 100X, was an indicator of a TP; all 54 variants covered over 100X were determined to be TPs. This included two homozygous TPs, which were each covered over 1000X with the alternate allele being the only one detected (Table 3), and 52 heterozygous TPs that had an average sequencing depth of 724X and allele balance of 0.50. To note the importance of allele balance, 95% of the TPs had an allele balance above 0.40 compared to only 23% of the FPs. Eighty five percent of the TPs had over 100X coverage and an allele balance of 0.40 (Table 3). No variants with an allele balance less than 0.20 were TPs, and additional filtering to exclude such variants improved FDRs (Table 3). Prior to B.O.P. screening, positive and negative BRCA1 and BRCA2 mutation status was known for eight of the 43 study participants; thus, sensitivity and specificity could be calculated for those genes. Seven study participants had previously undergone clinical BRCA1/2 screening; six reported negative results with no pathogenic variants identified. One individual, 1CB-a, received a positive report indicating a pathogenic BRCA2 frame-shifting mutation (c.5611_5615delAGTAA [p.S1871fs] also known as c.5616_5620delAGTAA [p.K1872Nfs), which was confirmed using PCR and Sanger sequencing prior to B.O.P. screening (Fig 3 and Table 3). The eighth individual, 1CAD-a, had not personally obtained clinical gene screening; however, a deceased family member had undergone clinical BRCA1/2 screening and received a positive report indicating a pathogenic missense mutation (BRCA1 c.5387T>G [p.M1796R]). Thus, this individual was screened for the familial mutation using PCR and Sanger sequencing prior to B.O.P. screening and tested positive (Fig 3 and Table 3). Noteworthy, another BC-affected family member, 1CAD-f, tested negative for BRCA1 p.M1796R in the research laboratory prior to B.O.P. screening but could not be considered a TN for the specificity calculation since full gene screening had not been carried out (Table 3). B.O.P. variant calling reported 12 and 14 variants in BRCA1 and BRCA2, respectively (Table 3). Upon Sanger sequencing confirmation, this included 11 TPs, zero FNs, six TNs and one FP in BRCA1, and 14 TPs, six TNs, and zero FNs and FPs in BRCA2 (Table 3), which corroborated the previously reported BRCA1 and BRCA2 mutation statuses. Therefore, B.O.P. screening of BRCA1/2 resulted in 100% sensitivity and 92.3% specificity. However, specificity became 100% with the elimination of variants with an allele balance of 0.20 or less.
Fig 3

B.O.P. positive controls: Mutations that were previously reported through clinical gene screening.

The Sanger sequence electropherogram (above) and the Integrative Genomics Viewer (v.2.4.5) image (below) are depicted for two B.O.P. screened positive controls: 1CB-a (Panel A) and 1CAD-a (Panel B). ($) This deletion is within a short tandem repeat and is commonly referred to as BRCA2 c.5616_5620del5 (p.K1872Nfs) since HGVS nomenclature rules arbitrarily assign the deletion to the most 3’ nucleotide.

B.O.P. positive controls: Mutations that were previously reported through clinical gene screening.

The Sanger sequence electropherogram (above) and the Integrative Genomics Viewer (v.2.4.5) image (below) are depicted for two B.O.P. screened positive controls: 1CB-a (Panel A) and 1CAD-a (Panel B). ($) This deletion is within a short tandem repeat and is commonly referred to as BRCA2 c.5616_5620del5 (p.K1872Nfs) since HGVS nomenclature rules arbitrarily assign the deletion to the most 3’ nucleotide. Of the 61 TPs, 45 were detected in AAs; this included 34 unique variants, eight of which were detected in multiple individuals (Table 3). According to ClinVar [13, 14], the 34 variants were categorized as pathogenic/risk factor (n = 4), VUSs (n = 11), or benign/likely benign (n = 19). A total of five variants were predicted to be deleterious in Polyphen, two of which have been defined as pathogenic non-synonymous variants in ClinVar; the other three are currently classified as VUSs (Table 3). Of the eight variants detected in more than one individual, BRCA2 c.5020A>G; p.S1674G, was identified in two first cousins. The remaining seven were in seemingly unrelated individuals. This includes STK11 c.369G>A;p.Q123Q, a seemingly benign variant, which is reported to have a MAF of 1.5% in the general AA population but was detected in five of the 23 AAs in this study, indicating a MAF of 10.8% (P value 8.50 X 10−4; Odds ratio 7.79 CI95[2.32–20.70]). Furthermore, the 45 AA TPs were detected in 96% of the AAs screened, and multiple variants were detected in 70% of the cases. In contrast, 16 TPs were validated in 55% of the EAs, and only 20% had multiple variants. The difference in the number of individuals from each ethnicity with at least one TP was significant (P value 2.71 X 10−3; Odds ratio 16.83 CI95[1.93–819.72]) as well as the number of cases from each ethnicity with multiple TPs (P value 1.95 X 10−3; Odds ratio 8.60 CI95[1.89–49.30]). No overtly pathogenic variants were validated in EAs, but 50% of the EA TPs (8/16) were listed as a VUS, three of which were predicted to be deleterious in Polyphen [15].

Discussion

Our group has developed B.O.P., a research-based NGS gene panel, which targets 87 genes that have been suggested, predicted, or clinically proven to be associated with risk of BC, OvC, and/or PC. The overall purpose of this new panel is to gain additional insights toward the genetic risk of and overlap between those three cancers. This manuscript served to introduce B.O.P. by reporting its initial screening, which involved 43 cancer-affected individuals from the AHCC [16]. Targeting ~500 Kb of DNA, 98.9% of the base pairs were covered at least 1X, and an average sequencing depth of 809X was obtained. We took a closer look at 10 NCCN regulated genes in order to begin the analytical assessment of the panel and ensure the accurate detection of clinically relevant variants; upon variant filtering and validation, 100% of the variants in seven of the 10 genes were TPs. TPs had an average sequencing depth of 659X and allele balance of 0.51, whereas the average FP sequencing depth and allele balance was 34X and 0.33, respectively. Although FPs had a much lower average sequencing depth compared to TPs, a low sequencing depth was not always indicative of a FP. Contrarily, all variants called with high sequencing depths (>100X) signified a TPs. Furthermore, sensitivity and specificity of BRCA1/2 were calculated to be 100% and 92.3%, respectively. There are a number of different targeted enrichment options to choose from when designing gene panels, all of which can have different affects on sequencing outcomes [10]. When comparing methods, Samorodnitsky et al. concluded that Haloplex had the highest on-target read alignment and normalized sequencing depth but the least uniformity [10]. Noteworthy, despite reports of Haloplex resulting in a high percentage (>90%) of on-target read alignments [10, 17], 50.9% of our QC passed reads mapped to the B.O.P. targeted regions. With other Haloplex gene panel studies not reporting such data [18-20], it is difficult to make general conclusions about Haloplex on-target read alignment specificity. However, similar on-target read alignment percentages have been reported; Castera et al. used SureSelect baits in order to target hereditary BC and OvC susceptibility genes, and reported an average of 42% of reads on-target [21]. Ultimately, the percentage of off-target reads is likely dependent on a number of factors, including the specific genes/regions being targeted [22]. Of the reads that mapped on-target, B.O.P.’s overall sequencing depth averaged 809X, and each individually assessed gene obtained average sequencing depths from 505X-1017X. Nevertheless, large interquartile ranges indicated that depth was not uniform. This was expected since no current enrichment and sequencing approach provides complete uniformity primarily because of complex genomic regions that are very difficult to capture/sequence and result in low sequencing depths or even no coverage at all [10, 22]. By focusing on a select set of genes/regions, NGS gene panels target a smaller number of base pairs compared to more broad applications such as exome and whole genome sequencing. The smaller target-capture provides the option to achieve a high average sequencing depth, which aids in variant identification [10, 23]. Therefore, the overall goal is to obtain 100% coverage as well as the appropriate/desired sequencing depths at all targeted base pairs. Since this goal is not generally achieved, complementary assays can be used to fill in gaps, which is commonly implemented for clinical applications. In such cases, regions of low/no coverage are normally Sanger sequenced [22, 23]. B.O.P. was able to cover, on average, 98.2% of its targeted base pairs at 1X. Being a research panel, no gap-filling assays were carried out; however, region-specific coverage analyses provided insight towards the feasibility of gap-filling. Gap-filling criteria has been described in a number of BC NGS gene panel publications, specifically, those that highlighted panel performance and analytical validity [21, 24–26]. Being clinical panels, regions covered less than 20X [21] or 50X [24-26] were checked by conventional methods. Interestingly, only two B.O.P. regions had average sequencing depths less than 100X (68X and 82X), which would not have required complementary assays to fill in gaps according to the criteria set in the referenced studies [21, 24–26]. This is despite that, on average, those two B.O.P. regions missed 24.3% of the targeted base pairs and only covered 52.2%, 34.6%, and 21.3% of targeted base pairs at or greater than 20X, 50X, and 100X, respectively. Furthermore, 63 of the 225 B.O.P. regions-of-focus did not, on average, obtain 100% coverage at 1X; these regions had average sequencing depths ranging from 68X-1354X. Thus, regions with high sequencing depths still had base pairs with no/low coverage, which happened to be where FPs were detected in this study. Therefore, only gap-filling regions with ‘low’ (20X or 50X) sequencing depths, will not guarantee 100% coverage. Establishing mapping criteria to ensure all base pairs are covered at a desired depth is ideal but would likely reveal gaps in too many regions, making gap-filling infeasible. Overall, gaps in B.O.P. as well as other panels, even with gap-filling criteria, can provide less than definitive negative results [23]; however, in noting that, zero FNs were identified in BRCA1 and BRCA2, resulting in 100% sensitivity. In addition to gap-filling, conventional approaches are also used to validate called variants. A total of 1960 variants were detected in the 10 B.O.P. assessed genes and, to reduce the number of variants to validate for this analytical assessment, only variants with MAFs less than 2% in both ethnicities were Sanger sequenced. This included 74 variants, 61 of which were confirmed and defined as TPs revealing 13 FPs. The validation process ultimately provided insight regarding the likelihood of confirmation based on variant quality, such as sequencing depth since all 54 variants covered over 100X were TPs. These results corroborated the criteria established by Mu et al., which set high confidence calls as having a minimum sequencing depth of 100X and allele balance of 40%. Additionally, Mu et al. indicated that such calls did not require confirmation. Although, all B.O.P. variants covered at or above 100X were TPs despite allele balance, the criteria from Mu et al. will be implemented in the future in order to be thorough. This will limit validation efforts to low confidence calls, reducing the cost and time of validation. In this study, 22 variants had low confidence calls. This included nine TPs, seven of which were covered less than 100X and two that failed to meet the required allele balance. The remaining 13 were FPs with an average sequencing depth of 34X and allele balance of 0.33, reiterating that low sequencing depths are susceptible to sequencing artifacts [10, 23]. Interestingly, as mentioned in the previous paragraph, the regions harboring the FPs did not have low sequencing depths, stressing the potential lack of uniformity within a targeted region. On another note, 11 of the 13 FPs were in PTEN. Considering PTEN has a processed pseudogene, PTENP1 on chromosome 9, their homology could have contributed to probe mis-priming as well as read mis-alignments. Overall, encountering problematic regions, such as regions with high homology or GC rich content, is common and referred to in many studies [18, 22, 23]. Overall, for each assessed gene, FDRs ranged from 0 to 0.92, the latter being PTEN. Of course, FDRs improved as additional filtering was implemented. Initial B.O.P. specificity, which could only be calculated for BRCA1/BRCA2, was 92.3%. Upon filtering out variants with an allele balance equal to or less than 0.20, specificity was 100%. Ultimately, Sanger sequencing all low confidence calls will eliminate FPs and provide 100% specificity; therefore, it is common to complement NGS gene panels with Sanger sequencing validation in order to consider the test complete and optimize specificity [21, 24–28]. In addition to enabling B.O.P.’s initial analytical assessment, the first B.O.P. screening, which involved 23 AAs and 20 EAs from the AHCC [9], has provided insight regarding variant contributions and ethnic differences. Overall, compared to EAs, AAs had a significantly higher number of individuals with at least one TP (P value 2.71 X 10−3) as well as individuals with multiple TPs (P value 1.95 X 10−3). Of course, comparisons to ethnic-specific controls will determine if these differences contribute to an inherited cancer risk. Interestingly, according to ClinVar [13, 14], none of the variants identified in EAs were considered pathogenic/risk variants, whereas 17.4% (4/23) of the AAs had a variant with that classification. The majority of the detected variants were classified as VUSs or benign/likely benign; ultimately, elucidating how VUSs and, even, synonymous variants contribute towards risk is very important. Synonymous variants, though normally ignored and considered benign, can affect splicing, gene expression or translation dynamics, all of which can contribute to a disease phenotype [29]. To further stress their importance, they have been reported to act as driver mutations in human cancers [30] and, through this initial B.O.P. screening, STK11 c.369G>A;p.Q123Q was detected in significantly more AA cases than controls (P value 8.50 X 10−4). Additionally, despite recognizing that hereditary BC risk is polygenic [31], little effort has been put forth to thoroughly investigate all variants in clinically relevant BC susceptibility genes, not to mention variants in genes currently lacking clinical significance, and determine if different variant combinations increase risk. Altogether, seemingly benign variant combinations could, in fact, be pathogenic, and paired with the striking difference between ethnicities regarding the number of cases with multiple variants, further investigation is warranted. Of course, a larger number of cases will be required for this effort. In summary, this effort assesses the analytical validity of the B.O.P. panel and demonstrates the panel’s ability to accurately detect mutations in 10 NCCN clinically actionable genes [8]. Despite the potential biases of the B.O.P. capture and NGS, the high depth of coverage, low FDR, and great sensitivity and specificity strongly support the use of this research gene panel to further elucidate hereditary BC/OvC/PC genetics. Although the cohort for this initial assessment is small, B.O.P. has begun to determine the mutation contributions of clinically valid genes in different ethnicities as well as permit the investigation of VUSs and other variant types and their effect towards polygenic risk. Furthermore, continued B.O.P. screening can provide additional evidence to confirm or refute previously suggested susceptibility genes, lessening the number of genes that lack clinical validity on commercially available panels [1]. Additionally, with the incorporation of candidate genes on B.O.P., it has the potential to identify novel genetic risk factors that are contributing towards BC, OvC, and PC. Lastly, as new susceptibility genes are discovered that are not currently on the B.O.P. panel, it is important to stress the ability to edit the targeted genes in order to best reflect clinical screening. As described herein, there are many potential benefits of B.O.P. screening, and we aim to make advances in cancer genetics research through its implementation in our research efforts.

Targeted genes on the B.O.P. panel.

(XLSX) Click here for additional data file.

Percentage of mapped reads.

(XLSX) Click here for additional data file.

Targeted regions of the 10 assessed genes and coverage.

(XLSX) Click here for additional data file.
  26 in total

1.  Identification of deleterious synonymous variants in human genomes.

Authors:  Orion J Buske; AshokKumar Manickaraj; Seema Mital; Peter N Ray; Michael Brudno
Journal:  Bioinformatics       Date:  2013-06-04       Impact factor: 6.937

2.  Synonymous mutations frequently act as driver mutations in human cancers.

Authors:  Fran Supek; Belén Miñana; Juan Valcárcel; Toni Gabaldón; Ben Lehner
Journal:  Cell       Date:  2014-03-13       Impact factor: 41.582

3.  From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline.

Authors:  Geraldine A Van der Auwera; Mauricio O Carneiro; Christopher Hartl; Ryan Poplin; Guillermo Del Angel; Ami Levy-Moonshine; Tadeusz Jordan; Khalid Shakir; David Roazen; Joel Thibault; Eric Banks; Kiran V Garimella; David Altshuler; Stacey Gabriel; Mark A DePristo
Journal:  Curr Protoc Bioinformatics       Date:  2013

4.  Next-generation sequencing for the diagnosis of hereditary breast and ovarian cancer using genomic capture targeting multiple candidate genes.

Authors:  Laurent Castéra; Sophie Krieger; Antoine Rousselin; Angélina Legros; Jean-Jacques Baumann; Olivia Bruet; Baptiste Brault; Robin Fouillet; Nicolas Goardon; Olivier Letac; Stéphanie Baert-Desurmont; Julie Tinat; Odile Bera; Catherine Dugast; Pascaline Berthet; Florence Polycarpe; Valérie Layet; Agnes Hardouin; Thierry Frébourg; Dominique Vaur
Journal:  Eur J Hum Genet       Date:  2014-02-19       Impact factor: 4.246

5.  Predicting functional effect of human missense mutations using PolyPhen-2.

Authors:  Ivan Adzhubei; Daniel M Jordan; Shamil R Sunyaev
Journal:  Curr Protoc Hum Genet       Date:  2013-01

6.  ACMG clinical laboratory standards for next-generation sequencing.

Authors:  Heidi L Rehm; Sherri J Bale; Pinar Bayrak-Toydemir; Jonathan S Berg; Kerry K Brown; Joshua L Deignan; Michael J Friez; Birgit H Funke; Madhuri R Hegde; Elaine Lyon
Journal:  Genet Med       Date:  2013-07-25       Impact factor: 8.822

Review 7.  Solving the molecular diagnostic testing conundrum for Mendelian disorders in the era of next-generation sequencing: single-gene, gene panel, or exome/genome sequencing.

Authors:  Yuan Xue; Arunkanth Ankala; William R Wilcox; Madhuri R Hegde
Journal:  Genet Med       Date:  2014-09-18       Impact factor: 8.822

8.  Evaluation of next generation sequencing platforms for population targeted sequencing studies.

Authors:  Olivier Harismendy; Pauline C Ng; Robert L Strausberg; Xiaoyun Wang; Timothy B Stockwell; Karen Y Beeson; Nicholas J Schork; Sarah S Murray; Eric J Topol; Samuel Levy; Kelly A Frazer
Journal:  Genome Biol       Date:  2009-03-27       Impact factor: 13.583

9.  ClinVar: public archive of relationships among sequence variation and human phenotype.

Authors:  Melissa J Landrum; Jennifer M Lee; George R Riley; Wonhee Jang; Wendy S Rubinstein; Deanna M Church; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2013-11-14       Impact factor: 16.971

10.  The validation and clinical implementation of BRCAplus: a comprehensive high-risk breast cancer diagnostic assay.

Authors:  Hansook Kim Chong; Tao Wang; Hsiao-Mei Lu; Sara Seidler; Hong Lu; Steven Keiles; Elizabeth C Chao; A J Stuenkel; Xiang Li; Aaron M Elliott
Journal:  PLoS One       Date:  2014-05-15       Impact factor: 3.240

View more
  3 in total

1.  PARP Inhibition in Prostate Cancer With Homologous Recombination Repair Alterations.

Authors:  Alexander von Werdt; Laura Brandt; Orlando D Schärer; Mark A Rubin
Journal:  JCO Precis Oncol       Date:  2021-10-22

2.  Rare and potentially pathogenic variants in hydroxycarboxylic acid receptor genes identified in breast cancer cases.

Authors:  Cierla McGuire Sams; Kasey Shepp; Jada Pugh; Madison R Bishop; Nancy D Merner
Journal:  BMC Med Genomics       Date:  2021-12-01       Impact factor: 3.063

Review 3.  Lemur Tyrosine Kinases and Prostate Cancer: A Literature Review.

Authors:  Elena Ferrari; Valeria Naponelli; Saverio Bettuzzi
Journal:  Int J Mol Sci       Date:  2021-05-21       Impact factor: 5.923

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.