| Literature DB >> 28362817 |
Joshua D Hoffman1, Rebecca E Graff1, Nima C Emami1,2, Caroline G Tai1, Michael N Passarelli3, Donglei Hu4, Scott Huntsman4, Dexter Hadley5, Lancelote Leong1, Arunabha Majumdar1, Noah Zaitlen4, Elad Ziv1, John S Witte1,6,7.
Abstract
Breast cancer is the most common solid organ malignancy and the most frequent cause of cancer death among women worldwide. Previous research has yielded insights into its genetic etiology, but there remains a gap in the understanding of genetic factors that contribute to risk, and particularly in the biological mechanisms by which genetic variation modulates risk. The National Cancer Institute's "Up for a Challenge" (U4C) competition provided an opportunity to further elucidate the genetic basis of the disease. Our group leveraged the seven datasets made available by the U4C organizers and data from the publicly available UK Biobank cohort to examine associations between imputed gene expression and breast cancer risk. In particular, we used reference datasets describing the breast tissue and whole blood transcriptomes to impute expression levels in breast cancer cases and controls. In trans-ethnic meta-analyses of U4C and UK Biobank data, we found significant associations between breast cancer risk and the expression of RCCD1 (joint p-value: 3.6x10-06) and DHODH (p-value: 7.1x10-06) in breast tissue, as well as a suggestive association for ANKLE1 (p-value: 9.3x10-05). Expression of RCCD1 in whole blood was also suggestively associated with disease risk (p-value: 1.2x10-05), as were expression of ACAP1 (p-value: 1.9x10-05) and LRRC25 (p-value: 5.2x10-05). While genome-wide association studies (GWAS) have implicated RCCD1 and ANKLE1 in breast cancer risk, they have not identified the remaining three genes. Among the genetic variants that contributed to the predicted expression of the five genes, we found 23 nominally (p-value < 0.05) associated with breast cancer risk, among which 15 are not in high linkage disequilibrium with risk variants previously identified by GWAS. In summary, we used a transcriptome-based approach to investigate the genetic underpinnings of breast carcinogenesis. This approach provided an avenue for deciphering the functional relevance of genes and genetic variants involved in breast cancer.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28362817 PMCID: PMC5391966 DOI: 10.1371/journal.pgen.1006690
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Characteristics of the Up for a Challenge datasets (discovery) and the UK Biobank (replication).
| Dataset (Source Dataset) | Race / Ethnicity | # Cases | # Controls | Genotyping Platform |
|---|---|---|---|---|
| AABC (AABC) | African | 2,755 | 2,461 | Illumina Human1M-Duo BeadChip |
| African (African Diaspora) | African | 699 | 606 | Illumina HumanOmni2.5-Quad |
| African American / Barbadian (African Diaspora) | African | 934 | 1,400 | Illumina HumanOmni2.5-Quad |
| CGEMS (CGEMS) | European | 1,125 | 1,126 | Illumina HumanHap550 |
| CPSII (BPC3) | European | 289 | 292 | HumanHap550; HumanHap 660 |
| EPIC (BPC3) | European | 501 | 491 | HumanHap550; HumanHap 660 |
| Latina Admixture (Latina Admixture) | Latina | 800 | 365 | Affymetrix GWAS SNP Array 6.0 |
| MEC–European (BPC3) | European | 85 | 98 | HumanHap550; HumanHap 660 |
| MEC–Japanese (MEC) | East Asian | 885 | 822 | Human660W; Human-1M |
| MEC–Latina (MEC) | Latina | 520 | 544 | Human660W; Human-1M |
| NHS2 (BPC3) | European | 71 | 372 | HumanHap550; HumanHap 660 |
| PBCS (BPC3) | European | 532 | 495 | HumanHap550; HumanHap 660 |
| PLCO (BPC3) | European | 252 | 337 | HumanHap550; HumanHap 660 |
| Shanghai (Shanghai) | East Asian | 2,631 | 2,033 | Affymetrix GWAS SNP Array 6.0 |
| UK Biobank | European | 3,370 | 19,717 | UK BiLEVE Axiom; UK Biobank Axiom |
Abbreviations: AABC: African American Breast Cancer GWAS; African Diaspora: GWAS of Breast Cancer in the African Diaspora; BPC3: Breast and Prostate Cancer Cohort Consortium GWAS; CGEMS: Cancer Genetic Markers of Susceptibility Breast Cancer GWAS; CPSII: Cancer Prevention Study II; EPIC: European Prospective Investigation into Cancer and Nutrition; GWAS: genome-wide association study; Latina Admixture: San Francisco Bay Area Latina Breast Cancer Study; MEC: Multiethnic Cohort GWAS in African Americans, Latinos, and Japanese; NHS2: Nurses' Health Study 2; PBCS: Polish Breast Cancer Study; PLCO: Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial; Shanghai: Shanghai Breast Cancer Genetics Study; SNP: single nucleotide polymorphism
a After all quality control steps
Effect estimates and standard errors for gene expression suggestively (p-value < 1.0x10-04) associated with breast cancer risk in a meta-analysis of the Up for a Challenge and UK Biobank datasets.
| # SNPs in | Imputation | U4C | UK Biobank | Meta-analysis | |||||
|---|---|---|---|---|---|---|---|---|---|
| Gene | Location | Prediction | Quality | Beta (SE) | Beta (SE) | Beta (SE) | |||
| | 15q26.1 | 24 | 0.16 | -0.11 (0.038) | 5.8x10-03 | -0.24 (0.057) | 2.6x10-05 | -0.15 (0.032) | 3.6x10-06 |
| | 16q22.2 | 7 | 0.026 | -0.52 (0.12) | 2.4x10-05 | -0.29 (0.15) | 0.056 | -0.43 (0.095) | 7.1x10-06 |
| | 19p13.11 | 6 | 0.081 | 0.19 (0.093) | 0.044 | 0.43 (0.12) | 1.9x10-04 | 0.28 (0.072) | 9.3x10-05 |
| | 15q26.1 | 20 | 0.35 | -0.074 (0.026) | 4.7x10-03 | -0.14 (0.039) | 2.7x10-04 | -0.095 (0.022) | 1.2x10-05 |
| | 17p13.1 | 19 | 0.39 | 0.098 (0.037) | 7.9x10-03 | 0.11 (0.033) | 7.9x10-04 | 0.11 (0.025) | 1.9x10-05 |
| | 19p13.11 | 33 | 0.35 | 0.086 (0.029) | 2.7x10-03 | 0.094 (0.034) | 6.5x10-03 | 0.089 (0.022) | 5.2x10-05 |
Abbreviations: SE: standard error; SNP: single nucleotide polymorphism; U4C: Up for a Challenge
a According to human reference genome GRCh37/hg19
b r2 estimate derived from 10 fold cross-validation of true gene expression and predicted gene expression
SNPs nominally (p-value < 0.05) associated with breast cancer risk that contribute to expression of genes suggestively associated with breast cancer risk.
| Proportion | U4C | UK Biobank | Meta-analysis | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SNP | Alleles | of Weight | EAF | OR (95% CI) | EAF | OR (95% CI) | OR (95% CI) | |||
| RCCD1 | ||||||||||
| rs3826033 | G / A | 0.13 | 0.32 | 0.92 (0.88, 0.98) | 4.1x10-03 | 0.13 | 0.86 (0.79,0.93) | 2.3x10-04 | 0.90 (0.86,0.94) | 9.5x10-06 |
| rs2290202 | G / T | 0.24 | 0.3 | 0.93 (0.89, 0.98) | 5.3x10-03 | 0.13 | 0.86 (0.79,0.93) | 1.9x10-04 | 0.91 (0.88,0.95) | 1.7x10-05 |
| rs4347602 | A / C | 0.025 | 0.72 | 0.94 (0.90,0.98) | 6.5x10-03 | 0.77 | 0.96 (0.90,1.02) | 0.16 | 0.94 (0.91,0.98) | 2.4x10-03 |
| rs11207 | C / T | 0.030 | 0.35 | 0.97 (0.93, 1.02) | 0.21 | 0.24 | 0.93 (0.87,0.98) | 0.015 | 0.96 (0.93,0.99) | 0.016 |
| DHODH | ||||||||||
| rs3213422 | C / A | 0.56 | 0.42 | 0.92 (0.88,0.96) | 2.8x10-05 | 0.48 | 0.95 (0.90,1.00) | 0.039 | 0.93 (0.90,0.96) | 4.5x10-06 |
| rs2240243 | G / A | 0.055 | 0.47 | 0.93 (0.89,0.97) | 2.7x10-04 | 0.34 | 0.98 (0.93,1.04) | 0.53 | 0.95 (0.92,0.98) | 1.0x10-03 |
| rs12708928 | C / A | 0.019 | 0.47 | 0.93 (0.89,0.96) | 2.5x10-04 | 0.34 | 0.99 (0.93,1.04) | 0.59 | 0.95 (0.92,0.98) | 1.2x10-03 |
| ANKLE1 | ||||||||||
| rs34084277 | A / G | 0.23 | 0.19 | 1.09 (1.02,1.15) | 7.1x10-03 | 0.19 | 1.11 (1.04,1.18) | 2.0x10-03 | 1.10 (1.05,1.14) | 4.7x10-05 |
| rs8170 | G / A | 0.26 | 0.19 | 1.08 (1.02,1.15) | 7.2x10-03 | 0.19 | 1.11 (1.04,1.18) | 2.6x10-03 | 1.09 (1.05,1.14) | 6.3x10-05 |
| RCCD1 | ||||||||||
| rs3826033 | G / A | 0.33 | 0.32 | 0.92 (0.88,0.98) | 4.1x10-03 | 0.13 | 0.86 (0.79,0.93) | 2.3x10-04 | 0.90 (0.86,0.94) | 9.5x10-06 |
| rs2290202 | G / T | 0.29 | 0.3 | 0.93 (0.89,0.98) | 5.3x10-03 | 0.13 | 0.86 (0.79,0.93) | 1.9x10-04 | 0.91 (0.88,0.95) | 1.7x10-05 |
| rs7180016 | G / A | 0.012 | 0.49 | 0.97 (0.93,1.01) | 0.13 | 0.16 | 0.90 (0.84,0.97) | 5.7x10-03 | 0.95 (0.92,0.99) | 7.3x10-03 |
| rs11073961 | A / G | 0.049 | 0.35 | 0.97 (0.93,1.01) | 0.21 | 0.27 | 0.92 (0.87,0.98) | 7.5x10-03 | 0.95 (0.93,0.99) | 9.9x10-03 |
| rs11207 | C / T | 0.0092 | 0.35 | 0.97 (0.93,1.02) | 0.21 | 0.24 | 0.93 (0.87,0.98) | 0.015 | 0.96 (0.93,0.99) | 0.016 |
| rs2285937 | A / G | 0.0064 | 0.46 | 0.98 (0.94,1.02) | 0.31 | 0.16 | 0.90 (0.84,0.97) | 4.9x10-03 | 0.96 (0.93,0.99) | 0.023 |
| rs3809583 | A / G | 0.0035 | 0.36 | 0.97 (0.93,1.01) | 0.12 | 0.32 | 0.96 (0.91,1.01) | 0.15 | 0.96 (0.93,1.00) | 0.035 |
| ACAP1 | ||||||||||
| rs35776863 | A / G | 0.49 | 0.85 | 1.08 (1.00,1.16) | 0.045 | 0.77 | 1.11 (1.04,1.18) | 0.15 | 1.10 (1.04,1.15) | 1.4x10-04 |
| rs9892383 | C / T | 0.030 | 0.76 | 1.04 (0.98,1.09) | 0.17 | 0.73 | 1.10 (1.03,1.18) | 0.76 | 1.06 (1.02,1.11) | 3.6x10-03 |
| rs5412 | G / A | 0.060 | 0.12 | 1.04 (0.97,1.12) | 0.26 | 0.17 | 1.09 (1.02,1.17) | 0.12 | 1.07 (1.02,1.12) | 8.0x10-03 |
| rs4791423 | A / C | 0.0068 | 0.45 | 1.04 (1.00,1.09) | 0.033 | 0.34 | 1.03 (0.98,1.09) | 0.55 | 1.04 (1.01,1.08) | 0.018 |
| rs35721044 | T / C | 0.031 | 0.84 | 1.11 (1.02,1.22) | 0.012 | 0.76 | 1.03 (0.97,1.10) | 0.16 | 1.06 (1.01,1.12) | 0.019 |
| LRRC25 | ||||||||||
| rs11668719 | C / T | 0.25 | 0.5 | 1.06 (1.01,1.11) | 0.011 | 0.54 | 1.10 (1.05,1.16) | 1.87x10-04 | 1.08 (1.04,1.12) | 1.2x10-05 |
| rs7257932 | A / G | 0.091 | 0.55 | 1.05 (1.01,1.10) | 0.011 | 0.67 | 1.08 (1.02,1.14) | 7.01x10-03 | 1.06 (1.03,1.10) | 2.5x10-04 |
| rs13344313 | A / G | 0.16 | 0.68 | 1.06 (1.02,1.11) | 6.6x10-03 | 0.71 | 1.04 (0.98,1.10) | 0.20 | 1.05 (1.02,1.09) | 3.2x10-03 |
| rs3795026 | C / T | <0.001 | 0.54 | 1.04 (1.00,1.08) | 0.051 | 0.68 | 1.05 (0.99,1.11) | 0.12 | 1.04 (1.01,1.08) | 0.013 |
| rs7251067 | A / G | 0.031 | 0.85 | 1.00 (0.95,1.06) | 0.94 | 0.86 | 1.14 (1.06,1.23) | 6.70x10-04 | 1.05 (1.00,1.10) | 0.041 |
Abbreviations: CI: confidence interval; EAF: effect allele frequency; OR: odds ratio; SNP: single nucleotide polymorphism; U4C: Up for a Challenge
a Reference allele / effect allele
b Proportion of total weight attributed to SNP in gene prediction model
c Effect allele frequency in controls
d Previously implicated in breast cancer or in high linkage disequilibrium (r2 > 0.5 in 1000 Genomes Phase 3 populations) with known risk variants