| Literature DB >> 32242007 |
Deepak N Subramanian1,2, Magnus Zethoven3, Simone McInerny4, James A Morgan4, Simone M Rowley1, Jue Er Amanda Lee1,2, Na Li1,2, Kylie L Gorringe2, Paul A James1,2,4, Ian G Campbell5,6.
Abstract
High-grade serous ovarian carcinoma (HGSOC) has a significant hereditary component, approximately half of which cannot be explained by known genes. To discover genes, we analyse germline exome sequencing data from 516 BRCA1/2-negative women with HGSOC, focusing on genes enriched with rare, protein-coding loss-of-function (LoF) variants. Overall, there is a significant enrichment of rare protein-coding LoF variants in the cases (p < 0.0001, chi-squared test). Only thirty-four (6.6%) have a pathogenic variant in a known or proposed predisposition gene. Few genes have LoF mutations in more than four individuals and the majority are detected in one individual only. Forty-three highly-ranked genes are identified with three or more LoF variants that are enriched by three-fold or more compared to GnomAD. These genes represent diverse functional pathways with relatively few involved in DNA repair, suggesting that much of the remaining heritability is explained by previously under-explored genes and pathways.Entities:
Mesh:
Year: 2020 PMID: 32242007 PMCID: PMC7118163 DOI: 10.1038/s41467-020-15461-z
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Flowchart illustrating the filtering, ranking, prioritisation and curation steps used on the processed exome variant (vcf) data.
Steps performed in the post-sequencing pipeline (i.e. alignment of FASTQ reads, variant calling and annotation) are not displayed. Numerical figures refer to unique variants and genes. LoF loss of function, VEP Variant Effect Predictor, MAF minor allele frequency, RF failed random forests filter.
Known ovarian carcinoma predisposition genes with loss-of-function (LoF) and deleterious missense variants in the total case cohort.
| Gene | No. of cases (%) | Variants in cases | ||
|---|---|---|---|---|
| Transcript | Protein | Consequence | ||
| 1 (0.19) | c.2731 C > T | p.Arg911Ter | Stop-gain | |
| 3 (0.58) | c.313delT | p.Ser105GlnfsTer3 | Frameshift | |
| c.145 + 1_145 + 2insC | — | Splice donor | ||
| c.773 G > A | p.Arg258His | Missense | ||
| 1 (0.19) | c.556 C > T | p.Arg186Ter | Stop-gain | |
| 1 (0.19) | c.1372 G > T | p.Glu458Ter | Stop-gain | |
| Total | 6 (1.2) | |||
aOther Lynch syndrome genes (MLH1, MSH2, PMS2) had no deleterious variants present within the cohort.
Proposed ovarian carcinoma predisposition genes with loss-of-function (LoF) and known deleterious missense variants in the discovery case cohort.
| Gene | No. of pathogenic alleles in casesa (%) | No. of LoF alleles in GnomADb (%) | OR (95% CI)c | |
|---|---|---|---|---|
| 7 (0.69)d | 195 (0.17) | 2.98 (0.95−7.1) | 0.030 | |
| 3 (0.29)e | 131 (0.11) | 2.66 (0.54−7.96) | 0.11 | |
| 2 (0.20) | 401 (0.34) | 0.57 (0.07−2.09) | 0.59 | |
| 4 (0.39) | 344 (0.29) | 1.35 (0.36−3.49) | 0.55 | |
| 3 (0.29) | 57 (0.048) | 6.11 (1.22−18.8) | 0.015 | |
| 1 (0.098) | 90 (0.076) | 1.29 (0.03−7.37) | 0.54 | |
| 1 (0.098) | 19 (0.016) | 6.10 (0.15−38.5) | 0.16 | |
| 3 (0.29) | 86 (0.073) | 4.05 (0.82−12.3) | 0.041 | |
| 2 (0.20) | 168 (0.14) | 1.38 (0.17−5.08) | 0.66 | |
| 2 (0.20) | 289 (0.24) | 0.80 (0.10−2.92) | 1.0 | |
| Total | 28 (2.7) |
aTotal n = 1020 alleles tested per gene.
bGnomAD non-Finnish European (NFE), non-cancer sub-population.
cFisher’s exact test results (OR odds ratio, CI confidence interval). Calculations exclude missense variants.
dFigure for ATM includes two missense variants (c.7271 T > G, c.8147 T > C) classed as ‘pathogenic’ in NCBI ClinVar.
eFigure for BLM excludes additional stop-gain variant (c.2208 T > G) found in cis with frameshift variant (c.2206dupT) in the same individual.
Fig. 2Waterfall bar chart displaying the degree of enrichment for all protein-coding genes represented on the WES panel (n = 19,818) with filtered LoF variants in comparison to GnomAD, ordered by decreasing log10p value. Any negative log10p values for genes with calculated odds ratios > 1 were transformed into positive values prior to plotting.
Genes with log10 p values of 0 (equivalent to a p value of 1 i.e. no difference to GnomAD frequency) were not plotted. Shaded areas to left and right of dashed lines represent genes with p values < 0.01 and odds ratios > 1 (n = 133) or <1 (n = 19), respectively. Genes are labelled on the x-axis every 50 rows from the ordered list for illustration only.
Fig. 3Frequency bar chart displaying number of genes containing LoF variants vs. the number of individual cases they were present in.
Total number of genes stated above bar for each class of number of cases.
Top-ranked candidate genes remaining after curation with loss-of-function (LoF) variants present in three or more discovery case cohort individuals.
| Gene | Protein functiona | No. of LoF alleles in casesb (%) | No. of LoF alleles in GnomADc (%) | OR (95% CI)d | No. of COSMIC cases with pathogenic somatic tumour mutationse (%) | |
|---|---|---|---|---|---|---|
| K/Cl transporter | 6 (0.59) | 54 (0.046) | 12.9 (4.54−30.1) | 0.000013 | 0 (0) | |
| Microtubule stabilisation | 3 (0.29) | 9 (0.0076) | 38.6 (6.72−156) | 0.00013 | 0 (0) | |
| Sorbitol metabolism | 6 (0.59) | 89 (0.075) | 7.85 (2.80−17.8) | 0.00018 | 0 (0) | |
| Apoptosis inhibition | 5 (0.49) | 57 (0.048) | 10.2 (3.19−25.3) | 0.00020 | 1 (0.18) | |
| Mitochondrial metabolic transport | 4 (0.39)g | 36 (0.031) | 12.9 (3.33−36.2) | 0.00038 | 1 (0.18) | |
| Unknown | 4 (0.39) | 37 (0.031) | 12.6 (3.25−35.0) | 0.00042 | 0 (0) | |
| Elastin fibre remodelling | 4 (0.39) | 39 (0.033) | 11.9 (3.09−33.2) | 0.00050 | 0 (0) | |
| Gonadal transcription repressor | 3 (0.29) | 11 (0.013) | 22.6 (4.04−85.8) | 0.00056 | 0 (0) | |
| Ribosomal RNA methylation | 12 (1.2) | 444 (0.38) | 3.14 (1.61−5.57) | 0.00070 | 2 (0.36) | |
| DNA damage checkpoint signalling | 3 (0.29) | 18 (0.015) | 19.4 (3.65−66.5) | 0.00074 | 0 (0) | |
| Mitochondrial cobalamin transport | 5 (0.49) | 78 (0.066) | 7.46 (2.35−18.2) | 0.00076 | 1 (0.18) | |
| Guanine nucleotide synthesis | 3 (0.29) | 19 (0.016) | 18.3 (3.47−62.3) | 0.00085 | 0 (0) | |
| Unknown | 3 (0.29) | 19 (0.016) | 18.3 (3.47−62.3) | 0.00086 | 0 (0) | |
| Telomere maintenance/p53 regulation | 4 (0.39) | 49 (0.041) | 9.49 (2.48−26.0) | 0.0011 | 0 (0) | |
| Unknown | 3 (0.29) | 22 (0.019) | 15.8 (3.03−52.8) | 0.0013 | 1 (0.18) | |
| aPKC regulation | 4 (0.39) | 55 (0.047) | 8.45 (2.22−23.0) | 0.0017 | 1 (0.18) | |
| Cell-extracellular matrix regulator | 3 (0.29) | 26 (0.022) | 13.4 (2.59−43.8) | 0.0019 | 0 (0) | |
| Intercellular adhesion | 3 (0.29) | 26 (0.022) | 13.3 (2.58−43.7) | 0.0020 | 2 (0.36) | |
| Apoptosis stimulator | 4 (0.39) | 58 (0.049) | 7.99 (2.10−21.6) | 0.0020 | 1 (0.18) | |
| Unknown | 4 (0.39) | 60 (0.051) | 7.75 (2.04−21.0) | 0.0023 | 0 (0) | |
| Developmental role | 4 (0.39) | 60 (0.051) | 7.74 (2.04−20.9) | 0.0023 | 1 (0.18) | |
| TGF-beta targeting | 3 (0.29) | 28 (0.024) | 12.4 (2.42−40.4) | 0.0023 | 4 (0.71) | |
| Thromboxane A2 synthesis | 4 (0.39) | 61 (0.052) | 7.62 (2.01−20.6) | 0.0024 | 2 (0.36) | |
| Centriole regulator | 6 (0.59) | 150 (0.13) | 4.65 (1.67−10.4) | 0.0024 | 0 (0) | |
| Unknown | 3 (0.29) | 32 (0.027) | 10.8 (2.12−34.7) | 0.0034 | 0 (0) | |
| Mitotic microtubule protein | 3 (0.29) | 33 (0.028) | 10.6 (2.07−33.8) | 0.0036 | 1 (0.18) | |
| Glutamine-H+ antiporter | 3 (0.29) | 34 (0.029) | 10.2 (2.01−32.7) | 0.0039 | 0 (0) | |
| Intracellular signalling | 3 (0.29) | 36 (0.031) | 9.63 (1.89−30.5) | 0.0046 | 1 (0.18) | |
| Unknown | 6 (0.59) | 174 (0.15) | 4.01 (1.45−8.94) | 0.0048 | 0 (0) | |
| Cell adhesion and migration | 3 (0.29) | 37 (0.031) | 9.42 (1.85−29.8) | 0.0049 | 0 (0) | |
| MAPK signalling repressor | 5 (0.49) | 127 (0.11) | 4.58 (1.46−11.0) | 0.0058 | 0 (0) | |
| Cell cycle arrest after DNA damage | 4 (0.39) | 80 (0.068) | 5.78 (1.53−15.4) | 0.0061 | 0 (0) | |
| DNA damage-sensing cell cycle inhibitor | 5 (0.49) | 129 (0.11) | 4.51 (1.44−10.8) | 0.0061 | 0 (0) | |
| Mitotic microtubule regulator | 3 (0.29) | 41 (0.035) | 8.49 (1.68−26.7) | 0.0064 | 0 (0) | |
| Unknown | 4 (0.39) | 82 (0.069) | 5.67 (1.51−15.2) | 0.0065 | 0 (0) | |
| Unknown | 3 (0.29) | 44 (0.037) | 7.91 (1.57−24.8) | 0.0077 | 0 (0) | |
| Mitochondrial tRNA synthesis | 3 (0.29) | 45 (0.038) | 7.74 (1.54−24.2) | 0.0081 | 3 (0.53) | |
| Unknown | 3 (0.29) | 46 (0.039) | 7.58 (1.50−23.6) | 0.0086 | 1 (0.18) | |
| cAMP-dependent kinase subunit | 3 (0.29) | 46 (0.039) | 7.57 (1.50−23.6) | 0.0086 | 0 (0) | |
| Cyclin-dependent kinase-like protein | 4 (0.39) | 90 (0.077) | 5.13 (1.37−13.7) | 0.0090 | 0 (0) | |
| Junctional adhesion molecule | 3 (0.29) | 33 (0.039) | 7.54 (1.48−24.1) | 0.0091 | 1 (0.18) | |
| Unknown | 4 (0.39) | 91 (0.077) | 5.11 (1.36−13.6) | 0.0092 | 0 (0) | |
| Unknown | 3 (0.29) | 47 (0.040) | 7.38 (1.46−23.0) | 0.0092 | 0 (0) |
aProtein function information sourced from NCBI Gene, OMIM and references contained therein.
bTotal n = 1020 alleles tested per gene.
cGnomAD non-Finnish European (NFE), non-cancer sub-population.
dFisher’s exact test results (OR odds ratio, CI confidence interval). Genes ranked by ascending p value.
eConfirmed somatic serous ovarian tumour sample mutation data for each candidate gene (sourced from COSMIC v90[38], n = 562 samples) includes pathogenic missense, splice site, frameshift and stop-gain mutations.
fGene with data (as referenced) supporting a role in tumourigenesis.
gTwo variants found in one individual—phase unknown.
List of DNA repair genes with loss-of-function (LoF) variants in the discovery case cohort, grouped by pathway.
| Repair pathway | Genesa | No. of LoF alleles in casesb (%) | No. of LoF alleles in GnomADc (%) | OR (95% CI)d | |
|---|---|---|---|---|---|
| Base excision repair | 16 (0.052) | 2146 (0.061) | 0.854 (0.49−1.39) | 0.64 | |
| Mismatch repair | 4 (0.028) | 1041 (0.063) | 0.439 (0.12−1.13) | 0.095 | |
| Nucleotide excision repair | 17 (0.043) | 1691 (0.037) | 1.15 (0.67−1.84) | 0.51 | |
| Homologous recombination repair | 23 (0.090) | 1614 (0.056) | 1.62 (1.02−2.43) | 0.032 | |
| Fanconi anaemia genes | 24 (0.12) | 1980 (0.090) | 1.39 (0.89−2.07) | 0.12 | |
| Non-homologous end-joining | 3 (0.042) | 275 (0.033) | 1.27 (0.26−3.74) | 0.52 | |
| Direct DNA damage reversal | 1 (0.033) | 378 (0.11) | 0.305 (0.01−1.71) | 0.39 | |
| All unique DNA repair genes | 77 (0.066) | 8268 (0.063) | 1.06 (0.84−1.33) | 0.59 | |
| Other associated genes, i.e. indirect regulators of genomic stability | 42 (0.057) | 4733 (0.056) | 1.02 (0.73−1.38) | 0.88 | |
| Total (all DNA repair and other associated genes) | 119 (0.063) | 13,001 (0.061) | 1.05 (0.87 to 1.25) | 0.60 | |
aGenes and their associated pathways sourced from list curated by Chae et al.[30]. Table and calculations exclude known HBOC genes (BRCA1, BRCA2, MLH1, MSH2, MSH6, PMS2, RAD51C, RAD51D, BRIP1), as samples with pathogenic LoF variants in these genes were removed from the cohort during filtering.
bTotal n = 1020 multiplied by number of genes in pathway.
cGnomAD non-Finnish European (NFE), non-cancer sub-population.
dFisher’s exact test results (OR odds ratio, CI confidence interval).
eFigure for BLM excludes additional stop-gain variant (c.2208 T > G) found in cis with frameshift variant (c.2206dupT) in the same individual.
Characteristics of total case cohort.
| Number (%) | |
|---|---|
| Total patients | 516 (100) |
| Age at diagnosis of ovarian carcinoma | |
| <30 | 5 (1) |
| 30−39 | 14 (3) |
| 40−49 | 72 (14) |
| 50−59 | 149 (29) |
| 60−69 | 176 (34) |
| 70−79 | 80 (15) |
| ≥80 | 20 (4) |
| Histopathology | |
| High-grade serous (incl. carcinosarcoma) | 443 (86) |
| High-grade endometrioid | 35 (7) |
| Mixed epithelial (with predominant high-grade serous component) | 11 (2) |
| Serous (grade unknown/uncertain, presumed high-grade) | 5 (1) |
| Adenocarcinoma NOS (presumed high-grade serous) | 9 (2) |
| Unknown (presumed high-grade serous) | 13 (2) |
| Personal history of cancer | |
| Breast (excl. DCIS) | 47 (9) |
| Other (incl. breast DCIS) | 62 (12) |
| No history of cancer | 407 (79) |
| Family history of ovarian or breast cancer (first- and second-degree relatives only) | |
| One ovarian cancer case (no breast cancer) | 46 (9) |
| ≥2 ovarian cancer cases (no breast cancer) | 4 (1) |
| One breast cancer case (no ovarian cancer) | 131 (25) |
| ≥2 breast cancer cases (no ovarian cancer) | 47 (9) |
| ≥2 ovarian and breast cancer cases | 34 (7) |
| No known cases of breast or ovarian cancer | 254 (49) |
NOS not otherwise specified, DCIS ductal carcinoma in situ.