| Literature DB >> 33277540 |
Roxana Moslehi1, Hui-Shien Tsao2,3, Nur Zeinomar2,4, Cristy Stagnar2,5, Sean Fitzpatrick2, Amiran Dzutsev6.
Abstract
Up to 30% of all breast cancer cases may be inherited and up to 85% of those may be due to segregation of susceptibility genes with low and moderate risk [odds ratios (OR) ≤ 3] for (mostly peri- and post-menopausal) breast cancer. The majority of low/moderate-risk genes, particularly those with minor allele frequencies (MAF) of < 30%, have not been identified and/or validated due to limitations of conventional association testing approaches, which include the agnostic nature of Genome Wide Association Studies (GWAS). To overcome these limitations, we used a hypothesis-driven integrative genomics approach to test the association of breast cancer with candidate genes by analyzing multi-omics data. Our candidate-gene association analyses of GWAS datasets suggested an increased risk of breast cancer with ERCC6 (main effect: 1.29 ≤ OR ≤ 2.91, 0.005 ≤ p ≤ 0.04, 11.8 ≤ MAF ≤ 40.9%), and implicated its interaction with ERCC8 (joint effect: 3.03 ≤ OR ≤ 5.31, 0.01 ≤ pinteraction ≤ 0.03). We found significant upregulation of ERCC6 (p = 7.95 × 10-6) and ERCC8 (p = 4.67 × 10-6) in breast cancer and similar frequencies of ERCC6 (1.8%) and ERCC8 (0.3%) mutations in breast tumors to known breast cancer susceptibility genes such as BLM (1.9%) and LSP1 (0.3%). Our integrative genomics approach suggests that ERCC6 may be a previously unreported low- to moderate-risk breast cancer susceptibility gene, which may also interact with ERCC8.Entities:
Year: 2020 PMID: 33277540 PMCID: PMC7718875 DOI: 10.1038/s41598-020-77037-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Breast cancer susceptibility.
ERCC6 and ERCC8 single nucleotide polymorphisms (SNPs) significantly associated with breast cancer in at least one breast cancer genome-wide association study (GWAS) dataset used as secondary data in our study.
| SNP | Alleles | Odds ratio (OR), 95% confidence interval (CI), p-value | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| CGEMSa | NSABPb | WHIc | BPC3d | |||||||
| Case (N = 1089) | Control (N = 1093) | Case (N = 430) | Control (N = 822) | Case (N = 465) | Control (N = 1394) | Case (N = 977) | Control (N = 1026) | |||
| rs3750751 | 0 | GG | 1.00 | 1.00 | 1.00 | 1.00 | ||||
| 1 | GA | 0.99 (0.79–1.25), 0.96 | 0.63 (0.45–0.88), 0.01 | NA | 1.35 (1.01–1.80), 0.04 | |||||
| 2 | AA | 2.91 (1.05–8.06), 0.04 | 2.53 (0.59–10.97), 0.21 | NA | 2.90 (0.52–16.07), 0.22 | |||||
| Trend | 1.10 (0.89–1.35), 0.37 | 0.74 (0.54–1.00), 0.05 | NA | 1.38 (1.05–1.81), 0.02 | ||||||
| rs2229760 | 0 | GG | 1.00 | 1.00 | 1.00 | 1.00 | ||||
| 1 | GA | 1.08 (0.89–1.31), 0.43 | 1.40 (1.08–1.83), 0.01 | NA | 0.88 (0.70–1.09), 0.24 | |||||
| 2 | AA | 1.03 (0.80–1.33), 0.80 | 1.28 (0.90–1.81), 0.16 | NA | 1.15 (0.84–1.56), 0.39 | |||||
| Trend | 1.03 (0.91–1.16), 0.68 | 1.17 (0.99–1.38), 0.07 | NA | 1.02 (0.88–1.19), 0.76 | ||||||
| rs3750749 | 0 | TT | 1.00 | 1.00 | 1.00 | 1.00 | ||||
| 1 | TC | 1.40 (1.07–1.82), 0.01 | 1.00 (0.69–1.46), 1.00 | NA | 0.86 (0.64–1.15), 0.31 | |||||
| 2 | CC | 1.06 (0.21–5.28), 0.94 | 0.40 (0.05–3.43), 0.40 | NA | 0.20 (0.02–1.73), 0.14 | |||||
| Trend | 1.36 (1.05–1.75), 0.02 | 0.94 (0.66–1.34), 0.74 | NA | 0.81 (0.61–1.07), 0.14 | ||||||
| rs4253082 | 0 | GG | 1.00 | 1.00 | 1.00 | 1.00 | ||||
| 1 | GA | 1.32 (1.09–1.61), 0.005 | 1.04 (0.80–1.34), 0.79 | 1.27 (0.99–1.64), 0.06 | 1.24 (0.64–2.38), 0.52 | |||||
| 2 | AA | 0.79 (0.44–1.41), 0.43 | 0.81 (0.34–1.93), 0.64 | 0.98 (0.45–2.11), 0.96 | 1.27 (0.67–2.40), 0.46 | |||||
| Trend | 1.17 (0.99–1.39), 0.06 | 0.91 (0.71–1.16), 0.44 | NA | 1.05 (0.87–1.28), 0.59 | ||||||
| rs2228528 | 0 | GG | 1.00 | 1.00 | 1.00 | 1.00 | ||||
| 1 | GA | NA | 1.04 (0.80–1.35), 0.77 | 1.29 (1.01–1.66), 0.04 | 0.99 (0.79–1.24), 0.91 | |||||
| 2 | AA | NA | 0.82 (0.35–1.93), 0.64 | 1.02 (0.47–2.21), 0.95 | 0.82 (0.43–1.56), 0.55 | |||||
| Trend | NA | 1.00 (0.80–1.26), 0.98 | NA | 0.96 (0.79–1.16), 0.69 | ||||||
| rs1012553 | 0 | TT | 1.00 | 1.00 | 1 | 1.00 | ||||
| 1 | TA | NA | NA | 1.35 (1.07–1.71), 0.01 | NA | |||||
| 2 | AA | NA | NA | 1.13 (0.69–1.84), 0.63 | NA | |||||
| Trend | NA | NA | 1.20 (1.00–1.44), 0.05 | NA | ||||||
NA Not analyzed due to missing SNPs or subjects (i.e., cells containing 0 subjects).
aCancer Genetic Markers of Susceptibility[10,11]: Our analysis of raw data involved Caucasian women ≥ 55 years of age using unconditional logistic regression adjusting for family history of breast cancer.
bNational Surgical Adjuvant Breast and Bowel Project (NSABP) Prevention Trials (P-1[14] and P-2[13]): Our analysis of raw data involved Caucasian women ≥ 50 years of age using conditional logistic regression maintaining matching criteria set by the original study investigators (i.e., age at trial entry, time in the study, history of lobular carcinoma in situ, and 5-year predicted breast cancer risk based on the Gail model).
cWomen's Health Initiative (WHI)[15,16] Hormone Therapy Trials data was used to create a nested case–control dataset of women diagnosed with invasive breast cancer ≥ 50 years of age (N = 465) and healthy controls (N = 1394) frequency-matched to the cases based on age in 3:1 control to case ratio: Our analysis of raw data involved Caucasian women ≥ 50 years of age using unconditional logistic regression adjusting for family history of breast cancer, parity, oral contraceptive use, breast feeding and body mass index.
dBreast and Prostate Cancer Cohort Consortium (BPC3)[17,18]: Our analysis of raw data involved Caucasian women ≥ 50 years of age using unconditional logistic regression adjusting for family history of breast cancer.
Joint effect analysis of ERCC6 and ERCC8 diplotypes.
| A. Cancer Genetic Markers of Susceptibility (CGEMS) | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Hap0/Hap0 | Hap2/Hap5 | Hap1/Hap6 | ||||||||||||||||
| Control | Case | Adjusted OR (95% CI) | p-Value | Control | Case | Adjusted OR (95% CI) | p-Value | Control | Case | Adjusted OR (95% CI) | p-Value | |||||||
| n | % | n | % | n | % | n | % | n | % | n | % | |||||||
| Hap1/Hap1 | 162 | 78.3 | 163 | 69.10 | 1 | |||||||||||||
| Hap1/Hap2 | 18 | 8.7 | 9 | 3.80 | 0.50 (0.22, 1.14) | 0.100 | 1 | 0.5 | 1 | 0.4 | 1.11 (0.07,17.89) | 0.943 | NA | NA | ||||
| Hap1/Hap3 | 9 | 4.3 | 15 | 6.40 | 1.72 (0.73, 4.05) | 0.217 | 1 | 0.5 | 2 | 0.8 | 1.92 (0.17,21.59) | 0.599 | NA | NA | ||||
CGEMS: p-value for interaction = 0.010 (Wald Test, Chi-Square 19.98).
WHI: p-value for interaction = 0.034 (Wald Test, Chi-Square 31.66).
BPC3: p-value interaction = 0.047 (Wald Test, Chi-Square 21.21).
NA Not analyzed due to missing diplotypes or subjects (i.e., cell containing 0 subjects).
AUnconditional logistic regression analysis adjusted for family history of breast cancer.
BUnconditional logistic regression adjusted for family history of breast cancer, parity, oral contraceptive use, number of months of breast feeding, and body mass index.
CUnconditional logistic regression models adjusted for family history of breast cancer and consent group [i.e., three cohorts that make up our BPC3 dataset, namely Prostate, Lung, and Colorectal Cancer (PLCO), European Prospective Investigation into Cancer and Nutrition (EPIC), and Polish Breast Cancer Study (PBCS) and restricted to Caucasian ≥ 50 years of age.
Stratified analysis of ERCC6 rs3750751 in the three cohorts of our Breast and Prostate Cancer Consortium (BPC3) dataset, namely Prostate, Lung, and olorectal Cancer (PLCO), European Prospective Investigation into Cancer and Nutrition (EPIC), and Polish Breast Cancer Study (PBCS).
| Study | SNP | Alleles | Case (N = 255) | Case % | Control (N = 340) | Control % | OR (95% CI) | p-value |
|---|---|---|---|---|---|---|---|---|
| PLCO | GG | 211 | 82.75 | 283 | 83.24 | 1 | ||
| GA | 40 | 15.70 | 55 | 16.18 | 1.04 (0.66 | 0.87 | ||
| AA | 4 | 1.57 | 2 | 0.59 | 2.78 (0.50 | 0.25 | ||
| EPIC | ||||||||
| GG | 303 | 82.34 | 294 | 83.05 | 1 | |||
| GA | 65 | 17.66 | 58 | 16.38 | 1.25 (0.64 | 0.51 | ||
| AA | 0 | 0.00 | 2 | 0.56 | NA | NA | ||
| PBCS | Case % | |||||||
| GG | 298 | 84.18 | 302 | 90.96 | 1 | |||
| GA | 56 | 15.82 | 30 | 9.04 | 1.88 (1.17–3.02) | 0.009 | ||
| AA | 0 | 0.00 | 0 | 0.00 | NA | NA |
Unconditional logistic regression models adjusted for family history of breast cancer and restricted to Caucasian subjects ≥ 50 years.
NA Not analyzed due to small number of subjects (cells with 0 value).
Figure 2Individual analysis of gene expression microarray dataset GSE10780 containing invasive ductal breast cancer (IDBC) cases (n = 42) and control samples (n = 143) among peri- and post-menopausal women.
Figure 3(a,b) Frequency of somatic mutations in ERCC6 and ERCC8 in comparison with known breast cancer susceptibility genes in the Cancer Genome Atlas (TCGA) Dataset. (a) Mutation frequency in all cancers versus breast cancer. (b) Mutation frequencies of breast cancer susceptibility genes in TCGA breast cancers.
Figure 4(a,b) Mutation analysis of selected genes in all cancers in the Cancer Genome Atlas (TCGA) dataset. (a) Ratio of high- to moderate-impact mutations in ERCC6 and ERCC8 in comparison to known breast cancer susceptibility genes and control genes. (b) Somatic mutation landscape of ERCC6, ERCC8, BRCA1 and BRCA2.