| Literature DB >> 31949161 |
Sander Canisius1,2, Marjanka K Schmidt3,4, Maria Escala-Garcia5, Jean Abraham6,7,8, Irene L Andrulis9,10, Hoda Anton-Culver11, Volker Arndt12, Alan Ashworth13, Paul L Auer14,15, Päivi Auvinen16,17,18, Matthias W Beckmann19, Jonathan Beesley20, Sabine Behrens21, Javier Benitez22,23, Marina Bermisheva24, Carl Blomqvist25,26, William Blot27,28, Natalia V Bogdanova29,30,31, Stig E Bojesen32,33,34, Manjeet K Bolla35, Anne-Lise Børresen-Dale36,37, Hiltrud Brauch38,39,40, Hermann Brenner12,40,41, Sara Y Brucker42, Barbara Burwinkel43,44, Carlos Caldas45,46, Federico Canzian47, Jenny Chang-Claude21,48, Stephen J Chanock49, Suet-Feung Chin50, Christine L Clarke51, Fergus J Couch52, Angela Cox53, Simon S Cross54, Kamila Czene55, Mary B Daly56, Joe Dennis35, Peter Devilee57,58, Janet A Dunn59, Alison M Dunning6, Miriam Dwek60, Helena M Earl8,61, Diana M Eccles62, A Heather Eliassen63,64, Carolina Ellberg65, D Gareth Evans66,67,68, Peter A Fasching19,69, Jonine Figueroa49,70,71, Henrik Flyger72, Manuela Gago-Dominguez73,74, Susan M Gapstur75, Montserrat García-Closas49,76, José A García-Sáenz77, Mia M Gaudet75, Angela George78, Graham G Giles79,80,81, David E Goldgar82, Anna González-Neira22, Mervi Grip83, Pascal Guénel84, Qi Guo85, Christopher A Haiman86, Niclas Håkansson87, Ute Hamann88, Patricia A Harrington6, Louise Hiller59, Maartje J Hooning89, John L Hopper80, Anthony Howell90, Chiun-Sheng Huang91, Guanmengqian Huang88, David J Hunter64,92,93, Anna Jakubowska94,95, Esther M John96, Rudolf Kaaks21, Pooja Middha Kapoor21,97, Renske Keeman5, Cari M Kitahara98, Linetta B Koppert99, Peter Kraft64,92, Vessela N Kristensen36,37, Diether Lambrechts100,101, Loic Le Marchand102, Flavio Lejbkowicz103, Annika Lindblom104,105, Jan Lubiński94, Arto Mannermaa18,106,107, Mehdi Manoochehri88, Siranoush Manoukian108, Sara Margolin109,110, Maria Elena Martinez74,111, Tabea Maurer48, Dimitrios Mavroudis112, Alfons Meindl113, Roger L Milne79,80,114, Anna Marie Mulligan115,116, Susan L Neuhausen117, Heli Nevanlinna118, William G Newman66,67, Andrew F Olshan119, Janet E Olson120, Håkan Olsson65, Nick Orr121, Paolo Peterlongo122, Christos Petridis123, Ross L Prentice14, Nadege Presneau60, Kevin Punie124, Dhanya Ramachandran30, Gad Rennert103, Atocha Romero125, Mythily Sachchithananthan51, Emmanouil Saloustros126, Elinor J Sawyer123, Rita K Schmutzler127,128, Lukas Schwentner129, Christopher Scott120, Jacques Simard130, Christof Sohn131, Melissa C Southey114,132, Anthony J Swerdlow78,133, Rulla M Tamimi63,64,92, William J Tapper134, Manuel R Teixeira135,136, Mary Beth Terry137, Heather Thorne138,139, Rob A E M Tollenaar140, Ian Tomlinson141,142, Melissa A Troester119, Thérèse Truong84, Clare Turnbull78, Celine M Vachon120, Lizet E van der Kolk143, Qin Wang35, Robert Winqvist144,145, Alicja Wolk87,146, Xiaohong R Yang49, Argyrios Ziogas11, Paul D P Pharoah6,35, Per Hall55,109, Lodewyk F A Wessels147,148, Georgia Chenevix-Trench20, Gary D Bader10,149, Thilo Dörk30, Douglas F Easton6,35.
Abstract
Identifying the underlying genetic drivers of the heritability of breast cancer prognosis remains elusive. We adapt a network-based approach to handle underpowered complex datasets to provide new insights into the potential function of germline variants in breast cancer prognosis. This network-based analysis studies ~7.3 million variants in 84,457 breast cancer patients in relation to breast cancer survival and confirms the results on 12,381 independent patients. Aggregating the prognostic effects of genetic variants across multiple genes, we identify four gene modules associated with survival in estrogen receptor (ER)-negative and one in ER-positive disease. The modules show biological enrichment for cancer-related processes such as G-alpha signaling, circadian clock, angiogenesis, and Rho-GTPases in apoptosis.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31949161 PMCID: PMC6965101 DOI: 10.1038/s41467-019-14100-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Network analysis pipeline (see “Methods” for details).
a Cox models were used to estimate the association between each genetic variant and breast cancer-specific survival in 84,457 patients of the Breast Cancer Association Consortium (BCAC) dataset (discovery set). b The P values of the survival analyses for the genetic variants (blue diamonds) were used to compute gene scores using the Pascal algorithm. These gene scores were based on the maximum chi-squared signal within a window size of 50-kb around the gene region and accounted for linkage disequilibrium structure (depicted in a gradient blue scale). c The HotNet2 method was used to identify gene modules based on the −log10 P value of the computed gene scores. d The modules found by HotNet2 were filtered to obtain a selection of high-confidence germline-related prognostic modules (GRPMs). We constructed a polygenic hazard score (PHS) summarizing the prognostic effects of a set of selected genetic variants in the module. We then tested the association of this PHS with survival in both the discovery set (gray) and the independent set (orange). e We performed a functional characterization of the high-confidence GRPMs by studying the downstream transcriptional effects. For that, we used genotype and expression data from The Cancer Genome Atlas (TCGA). We computed the correlation between a GRPM’s polygenic hazard score and the expression of all available genes. Based on these correlation values, a gene set enrichment analysis assigned biological processes that were enriched among the genes most correlated with the prognostic variants in the GRPM.
Fig. 2Manhattan plots of the gene-level associations with breast cancer-specific survival.
Plots show the association in a all breast cancer cases (n = 84,457), b estrogen receptor (ER)-negative (n = 14,529), and c ER-positive (n = 55,701). The −log10 gene P values from the Pascal algorithm is shown on the y axis and genomic position on the x axis. The top significant genes and the most significant gene per chromosome (if −log10(P) > 3) are shown in red.
Fig. 3High-confidence germline-related prognostic modules (GRPMs).
The GRPM is shown at the center of the circles, surrounded by the biological processes enriched among the downstream transcriptional effects of each module. Three modules were found for estrogen receptor (ER)-negative breast cancer (a–c) and one module was found for ER-positive breast cancer (d). a G-alpha signaling GRPMs. b Circadian clock GRPM. c Regulators of cell growth and angiogenesis GRPM. d Rho GTPases and apoptosis GRPM. e Plots illustrating the association between each GRPM’s PHS and 10-year breast cancer specific-survival in the discovery and independent sets. HR hazard ratio (per standard deviation of the PHS), CI confidence interval. The error bars show the 95% confidence interval. The confidence intervals shown are two sided, whereas the significance test performed was one sided (see “Methods”).
Fig. 4Genomic region 19p13.3 with the two genes GNA11 and GNA15.
The two G-alpha signaling high-confidence germline-related prognostic modules (GRPMs) identified in the estrogen receptor (ER)-negative subtype have a shared genetic signal in the same genomic region. a Top: −log10(P) for the association with survival (y axis) of all variants in the region 19p13.3 (y axis). Bottom: regression coefficients from the survival model for the genetic variants in the module’s polygenic hazard scores (PHSs). b Scatter plot comparing the two modules’ PHSs in the iCOGS independent validation set. PHS of the GNA11 GRPM on the x axis and PHS of the GNA15 GRPM on the y axis.