| Literature DB >> 27792995 |
Yosr Hamdi1, Penny Soucy1, Véronique Adoue2,3,4, Kyriaki Michailidou5,6, Sander Canisius7, Audrey Lemaçon8, Arnaud Droit8, Irene L Andrulis9,10, Hoda Anton-Culver11, Volker Arndt12, Caroline Baynes13, Carl Blomqvist14, Natalia V Bogdanova15,16, Stig E Bojesen17,18,19, Manjeet K Bolla5, Bernardo Bonanni20, Anne-Lise Borresen-Dale21, Judith S Brand22, Hiltrud Brauch23,24,25, Hermann Brenner12,25,26, Annegien Broeks7, Barbara Burwinkel27,28, Jenny Chang-Claude29,30, Fergus J Couch31, Angela Cox32, Simon S Cross33, Kamila Czene22, Hatef Darabi22, Joe Dennis5, Peter Devilee34,35, Thilo Dörk16, Isabel Dos-Santos-Silva36, Mikael Eriksson22, Peter A Fasching37,38, Jonine Figueroa39,40, Henrik Flyger41, Montserrat García-Closas40, Graham G Giles42,43, Mark S Goldberg44,45, Anna González-Neira46, Grethe Grenaker-Alnæs21, Pascal Guénel47, Lothar Haeberle37, Christopher A Haiman48, Ute Hamann49, Emily Hallberg50, Maartje J Hooning51, John L Hopper43, Anna Jakubowska52, Michael Jones53, Maria Kabisch49, Vesa Kataja54,55, Diether Lambrechts56,57, Loic Le Marchand58, Annika Lindblom59, Jan Lubinski52, Arto Mannermaa54,60,61, Mel Maranian13, Sara Margolin62, Frederik Marme27,63, Roger L Milne42,43, Susan L Neuhausen64, Heli Nevanlinna65, Patrick Neven66, Curtis Olswold50, Julian Peto36, Dijana Plaseska-Karanfilska67, Katri Pylkäs68,69, Paolo Radice70, Anja Rudolph29, Elinor J Sawyer71, Marjanka K Schmidt7, Xiao-Ou Shu72, Melissa C Southey73, Anthony Swerdlow74, Rob A E M Tollenaar75, Ian Tomlinson76, Diana Torres49,77, Thérèse Truong47, Celine Vachon50, Ans M W Van Den Ouweland, Qin Wang5, Robert Winqvist68,69, Wei Zheng72, Javier Benitez46,78, Georgia Chenevix-Trench79, Alison M Dunning13, Paul D P Pharoah5,13, Vessela Kristensen21,80, Per Hall22, Douglas F Easton5,13, Tomi Pastinen81,82, Silje Nord21, Jacques Simard1.
Abstract
There are significant inter-individual differences in the levels of gene expression. Through modulation of gene expression, cis-acting variants represent an important source of phenotypic variation. Consequently, cis-regulatory SNPs associated with differential allelic expression are functional candidates for further investigation as disease-causing variants. To investigate whether common variants associated with differential allelic expression were involved in breast cancer susceptibility, a list of genes was established on the basis of their involvement in cancer related pathways and/or mechanisms. Thereafter, using data from a genome-wide map of allelic expression associated SNPs, 313 genetic variants were selected and their association with breast cancer risk was then evaluated in 46,451 breast cancer cases and 42,599 controls of European ancestry ascertained from 41 studies participating in the Breast Cancer Association Consortium. The associations were evaluated with overall breast cancer risk and with estrogen receptor negative and positive disease. One novel breast cancer susceptibility locus on 4q21 (rs11099601) was identified (OR = 1.05, P = 5.6x10-6). rs11099601 lies in a 135 kb linkage disequilibrium block containing several genes, including, HELQ, encoding the protein HEL308 a DNA dependant ATPase and DNA Helicase involved in DNA repair, MRPS18C encoding the Mitochondrial Ribosomal Protein S18C and FAM175A (ABRAXAS), encoding a BRCA1 BRCT domain-interacting protein involved in DNA damage response and double-strand break (DSB) repair. Expression QTL analysis in breast cancer tissue showed rs11099601 to be associated with HELQ (P = 8.28x10-14), MRPS18C (P = 1.94x10-27) and FAM175A (P = 3.83x10-3), explaining about 20%, 14% and 1%, respectively of the variance inexpression of these genes in breast carcinomas.Entities:
Keywords: association studies; breast cancer; cis-regulatory variants; differential allelic expression; genetic susceptibility
Mesh:
Substances:
Year: 2016 PMID: 27792995 PMCID: PMC5340257 DOI: 10.18632/oncotarget.12818
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Associations with breast cancer risk for SNPs showing evidence of differential allelic expression (overall p <0.01)
| SNP | Chr | Position | Alleles | MAF | OR | P1df | OR | P1df | OR | P1df | Genes |
|---|---|---|---|---|---|---|---|---|---|---|---|
| rs697004 | 1 | 211842808 | G/T | 0.32 | 0.96 (0.94-0.98) | 3.67 × 10−04 | 0.96 (0.94-0.99) | 2.84 × 10−03 | 0.97 (0.93-1.01) | 1.46 × 10−01 | |
| rs13447450 | 1 | 91965850 | C/T | 0.37 | 0.97 (0.95-0.99) | 1.16 × 10−03 | 0.96 (0.94-0.99) | 1.53 × 10−03 | 0.96 (0.92-1.00) | 3.61 × 10−02 | |
| rs12125947 | 1 | 91990487 | T/C | 0.49 | 0.97 (0.95-0.99) | 1.59 × 10−03 | 0.97 (0.95-0.99) | 4.23 × 10−03 | 0.97 (0.93-1.01) | 9.47 × 10−02 | |
| rs10490250 | 2 | 58509628 | A/C | 0.21 | 1.03 (1.01-1.06) | 7.51 × 10−03 | 1.03 (1.01-1.06) | 1.85 × 10−02 | 1.05 (1.01-1.10) | 4.29 × 10−02 | |
| rs13099560 | 3 | 48204768 | C/T | 0.34 | 0.97 (0.95-0.99) | 3.99 × 10−03 | 0.96 (0.94-0.98) | 8.75 × 10−04 | 1.00 (0.96-1.04) | 9.24 × 10−01 | |
| 4 | 84382763 | A/G | 0.50 | 1.05 (1.03-1.07) | 1.05 (1.03-1.08) | 1.07 (1.03-1.11) | 4.08 × 10−04 | ||||
| rs17355027 | 4 | 84388915 | C/T | 0.08 | 0.95 (0.92-0.98) | 4.46 × 10−03 | 0.95 (0.91-0.99) | 1.17 × 10−02 | 0.92 (0.86-0.98) | 1.55 × 10−02 | |
| rs2362974 | 5 | 36156654 | C/T | 0.12 | 0.96 (0.93-0.99) | 5.96 × 10−03 | 0.97 (0.93-1.00) | 5.45 × 10−02 | 0.99 (0.94-1.06) | 9.56 × 10−01 | |
| rs733590 | 6 | 36645203 | T/C | 0.36 | 1.04 (1.03-1.06) | 1.77 × 10−04 | 1.03 (1.01-1.06) | 7.30 × 10−03 | 1.05 (1.01-1.09) | 1.99 × 10−02 | |
| 11 | 65621057 | C/T | 0.33 | 1.05 (1.02-1.07) | 1.05 (1.03-1.07) | 1.03 (0.99-1.07) | 2.25 × 10−01 | ||||
| rs570933 | 15 | 43824030 | T/C | 0.29 | 0.97 (0.95-0.99) | 7.18 × 10−03 | 0.97 (0.95-0.99) | 1.78 × 10−02 | 0.96 (0.92-1.00) | 2.89 × 10−02 | |
| rs7234479 | 18 | 20599564 | A/C | 0.11 | 1.05 (1.02-1.08) | 1.59 × 10−03 | 1.05 (1.01-1.09) | 7.57 × 10−03 | 1.04 (0.98-1.10) | 1.52 × 10−01 | |
| 22 | 28792887 | C/T | 0.10 | 1.07 (1.03-1.10) | 1.09 (1.05-1.13) | 1.03 (0.97-1.09) | 3.87 × 10−01 |
Chromosome
Build 37 position
Major/minor allele, based on the forward strand and minor allele frequency in Europeans
Mean minor allele frequency over all European controls in iCOGS
Per-allele OR for the minor allele relative to the major allele
One-degree-of-freedom P-value
SNPs highlighted in bold are those with associations for overall breast cancer risk reaching p<10-4 (significance cut-off after Bonferroni correction)
Figure 1Regional plots of breast cancer risk association at 4q21
Regional plot of association result, recombination hotspots and LD for the 4q21: 84,132,874-84,631,193 loci. The index SNP rs11099601 is plotted as a blue triangle. Directly genotyped SNPs are represented as triangles and imputed SNPs (r2 > 0.3, MAF > 0.02) are represented as circles. The LD (r2) for the index SNP with each SNP was computed based on European ancestry subjects included in the 1000 Genome Mar 2012 EUR. Pairwise r2 values are plotted using a red scale, where white and red signify r2 = 0 and 1, respectively. P-values were from the single-marker analysis based on logistic regression models after adjusted for age, study sites and the first six principal components plus one additional principal component for the LMBC in analyses of data from European descendants. SNPs are plotted according to their chromosomal position: physical locations are based on GRCh37/hg19. Gene annotation was based on the NCBI RefSeq genes from the UCSC Genome Browser.
Figure 2Functional annotation of the 4q21 locus
A. Functional annotations using data from the ENCODE and NIH Roadmap Epigenomics projects. From top to bottom, epigenetic signals evaluated included DNase clusters in MCF7 and HMEC cells, chromatin state segmentation by Hidden Markov Model (ChromHMM) in HMEC, breast myoepithelial cells (BMC) and Variant human mammary epithelial cells (vHMEC), where red represents an active promoter region, orange a strong enhancer and yellow a poised enhancer respectively (the detailed color scheme of chromatin states is described in the UCSC browser), histone modifications in MCF7, HMEC and BMC cell lines; and overlap between candidate variants and Max binding site in MCF7 cells. All tracks were generated by the UCSC genome browser (hg 19). B. Long-range chromatin interactions. From top to bottom, ChIA-Pet interactions for PolII and CTCF in MCF7 cells and Hi-C interactions in HMEC cells. The ChIA-PET raw data available on GEO under the following accession (GSE63525.K56, GSE33664, GSE39495) were processed with the GenomicRanges package. C. Maps of mammary cell super-enhancer locations as defined in Hnisz et al. are shown in HMEC cells. Predicted enhancer-promoter determined interactions in MCF7 and HMEC cells, as defined by the integrated method for predicting enhancer targets (IM-PET) are shown. D. RNA-Seq data from MCF7 and HMEC cell lines. The value of the RNA-Seq analysis corresponds to the mean RPM value for FAM175A, MRPS18C, HELQ, AGPAT9, HSPE and COQ2 from four HMEC and 19 MCF7 datasets, respectively. The annotation was obtained through the Bioconductor annotation package TxDb.Hsapiens.UCSC.hg19.knownGene. The tracks have been generated using ggplot2 and ggbio library in R.
Figure 3Boxplots representing differential expression of HELQ (A), MRPS18C (B), FAM175A (C) and HPSE (D) in breast tissues
Differential expression between normal breast and tumor tissue was determined by a Kruskal-Wallis rank sum test using TCGA breast cancer RNAseq data from primary tumor, metastasis and adjacent normal. Horizontal bars indicate mean expression levels.
Figure 4Boxplots representing expression levels of HELQ (A), MRPS18C (B), FAM175A (C) and HPSE (D) in the 5 molecular subtypes (PAM50 classifier) of breast primary tumors
Differential expression between normal breast and tumor tissue was determined by a Kruskal-Wallis rank sum test. Analysis was performed using TCGA breast cancer RNAseq data from five molecular subtypes of breast primary tumors: Luminal A (LumA), Luminal B (LumB), Human epidermal growth factor receptor 2-enriched (Her2), Basal-like (Basal) and Normal-like (Normal). Horizontal bars indicate mean expression levels.
Figure 5Manhattan plots of association for the eQTL results at the 4q21 locus in normal breast and breast cancer tissue
Y-axis shows -log10(P-value) while x-axis shows physical position. Circles of various shades of blue represent breast cancer risk associations for all breast cancer tumors, ER+ and ER- tumors. Other colored circles represent eQTL results in the following datasets: normal breast (NB93, NB116) in various shades of green, breast carcinomas in pink (BC241) and red (BC765). Risk association results as well as eQTL results are for both imputed and genotyped SNPs for all datasets.
Figure 6Boxplots representing the most significant eQTL results for variant rs11099601 in normal breast tissue and breast tumor datasets
Box plots represent the expression levels of the indicated transcripts with respect to the rs11099601 genotypes. Expression levels are shown for A. HELQ in breast carcinoma BC241 dataset, B. HELQ in breast carcinoma BC765 dataset normalized per isoform, C. HELQ in normal breast NB93 dataset normalized by gene isoform, D. MRPS18C in breast carcinoma BC765 dataset, E. MRPS18C in breast carcinoma BC765 dataset normalized per isoform, F. FAM175A in breast carcinoma BC765 dataset and G. HSPE in normal breast NB116 dataset. Horizontal bars indicate mean expression level per genotype. r2 values indicate the percentage of variance in respective gene expression levels explained by rs11099601.