| Literature DB >> 30554720 |
Nasim Mavaddat1, Kyriaki Michailidou2, Joe Dennis3, Michael Lush3, Laura Fachal4, Andrew Lee3, Jonathan P Tyrer4, Ting-Huei Chen5, Qin Wang3, Manjeet K Bolla3, Xin Yang3, Muriel A Adank6, Thomas Ahearn7, Kristiina Aittomäki8, Jamie Allen3, Irene L Andrulis9, Hoda Anton-Culver10, Natalia N Antonenkova11, Volker Arndt12, Kristan J Aronson13, Paul L Auer14, Päivi Auvinen15, Myrto Barrdahl16, Laura E Beane Freeman7, Matthias W Beckmann17, Sabine Behrens16, Javier Benitez18, Marina Bermisheva19, Leslie Bernstein20, Carl Blomqvist21, Natalia V Bogdanova22, Stig E Bojesen23, Bernardo Bonanni24, Anne-Lise Børresen-Dale25, Hiltrud Brauch26, Michael Bremer27, Hermann Brenner28, Adam Brentnall29, Ian W Brock30, Angela Brooks-Wilson31, Sara Y Brucker32, Thomas Brüning33, Barbara Burwinkel34, Daniele Campa35, Brian D Carter36, Jose E Castelao37, Stephen J Chanock7, Rowan Chlebowski38, Hans Christiansen27, Christine L Clarke39, J Margriet Collée40, Emilie Cordina-Duverger41, Sten Cornelissen42, Fergus J Couch43, Angela Cox30, Simon S Cross44, Kamila Czene45, Mary B Daly46, Peter Devilee47, Thilo Dörk48, Isabel Dos-Santos-Silva49, Martine Dumont50, Lorraine Durcan51, Miriam Dwek52, Diana M Eccles53, Arif B Ekici54, A Heather Eliassen55, Carolina Ellberg56, Christoph Engel57, Mikael Eriksson45, D Gareth Evans58, Peter A Fasching59, Jonine Figueroa60, Olivia Fletcher61, Henrik Flyger62, Asta Försti63, Lin Fritschi64, Marike Gabrielson45, Manuela Gago-Dominguez65, Susan M Gapstur36, José A García-Sáenz66, Mia M Gaudet36, Vassilios Georgoulias67, Graham G Giles68, Irina R Gilyazova69, Gord Glendon70, Mark S Goldberg71, David E Goldgar72, Anna González-Neira73, Grethe I Grenaker Alnæs74, Mervi Grip75, Jacek Gronwald76, Anne Grundy77, Pascal Guénel41, Lothar Haeberle17, Eric Hahnen78, Christopher A Haiman79, Niclas Håkansson80, Ute Hamann81, Susan E Hankinson82, Elaine F Harkness83, Steven N Hart84, Wei He45, Alexander Hein17, Jane Heyworth85, Peter Hillemanns48, Antoinette Hollestelle86, Maartje J Hooning86, Robert N Hoover7, John L Hopper87, Anthony Howell88, Guanmengqian Huang81, Keith Humphreys45, David J Hunter89, Milena Jakimovska90, Anna Jakubowska91, Wolfgang Janni92, Esther M John93, Nichola Johnson61, Michael E Jones94, Arja Jukkola-Vuorinen95, Audrey Jung16, Rudolf Kaaks16, Katarzyna Kaczmarek76, Vesa Kataja96, Renske Keeman42, Michael J Kerin97, Elza Khusnutdinova69, Johanna I Kiiski98, Julia A Knight99, Yon-Dschun Ko100, Veli-Matti Kosma101, Stella Koutros7, Vessela N Kristensen25, Ute Krüger56, Tabea Kühl102, Diether Lambrechts103, Loic Le Marchand104, Eunjung Lee79, Flavio Lejbkowicz105, Jenna Lilyquist84, Annika Lindblom106, Sara Lindström107, Jolanta Lissowska108, Wing-Yee Lo109, Sibylle Loibl110, Jirong Long111, Jan Lubiński76, Michael P Lux17, Robert J MacInnis112, Tom Maishman51, Enes Makalic87, Ivana Maleva Kostovska90, Arto Mannermaa101, Siranoush Manoukian113, Sara Margolin114, John W M Martens86, Maria Elena Martinez115, Dimitrios Mavroudis67, Catriona McLean116, Alfons Meindl117, Usha Menon118, Pooja Middha119, Nicola Miller97, Fernando Moreno66, Anna Marie Mulligan120, Claire Mulot121, Victor M Muñoz-Garzon122, Susan L Neuhausen20, Heli Nevanlinna98, Patrick Neven123, William G Newman58, Sune F Nielsen124, Børge G Nordestgaard23, Aaron Norman84, Kenneth Offit125, Janet E Olson84, Håkan Olsson56, Nick Orr126, V Shane Pankratz127, Tjoung-Won Park-Simon48, Jose I A Perez128, Clara Pérez-Barrios129, Paolo Peterlongo130, Julian Peto49, Mila Pinchev105, Dijana Plaseska-Karanfilska90, Eric C Polley84, Ross Prentice131, Nadege Presneau52, Darya Prokofyeva132, Kristen Purrington133, Katri Pylkäs134, Brigitte Rack92, Paolo Radice135, Rohini Rau-Murthy136, Gad Rennert105, Hedy S Rennert105, Valerie Rhenius4, Mark Robson136, Atocha Romero129, Kathryn J Ruddy137, Matthias Ruebner17, Emmanouil Saloustros138, Dale P Sandler139, Elinor J Sawyer140, Daniel F Schmidt141, Rita K Schmutzler78, Andreas Schneeweiss142, Minouk J Schoemaker94, Fredrick Schumacher143, Peter Schürmann48, Lukas Schwentner92, Christopher Scott84, Rodney J Scott144, Caroline Seynaeve86, Mitul Shah4, Mark E Sherman145, Martha J Shrubsole111, Xiao-Ou Shu111, Susan Slager84, Ann Smeets123, Christof Sohn142, Penny Soucy50, Melissa C Southey146, John J Spinelli147, Christa Stegmaier148, Jennifer Stone149, Anthony J Swerdlow150, Rulla M Tamimi151, William J Tapper152, Jack A Taylor153, Mary Beth Terry154, Kathrin Thöne102, Rob A E M Tollenaar155, Ian Tomlinson156, Thérèse Truong41, Maria Tzardi157, Hans-Ulrich Ulmer158, Michael Untch159, Celine M Vachon84, Elke M van Veen58, Joseph Vijai125, Clarice R Weinberg160, Camilla Wendt114, Alice S Whittemore161, Hans Wildiers123, Walter Willett162, Robert Winqvist134, Alicja Wolk163, Xiaohong R Yang7, Drakoulis Yannoukakos164, Yan Zhang12, Wei Zheng111, Argyrios Ziogas10, Alison M Dunning4, Deborah J Thompson3, Georgia Chenevix-Trench165, Jenny Chang-Claude166, Marjanka K Schmidt167, Per Hall168, Roger L Milne169, Paul D P Pharoah170, Antonis C Antoniou3, Nilanjan Chatterjee171, Peter Kraft172, Montserrat García-Closas7, Jacques Simard50, Douglas F Easton170.
Abstract
Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57-1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628-0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.Entities:
Keywords: breast; cancer; epidemiology; genetic; polygenic; prediction; risk; score; screening; stratification
Mesh:
Substances:
Year: 2018 PMID: 30554720 PMCID: PMC6323553 DOI: 10.1016/j.ajhg.2018.11.002
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.025
Comparison of Methods for Deriving the PRS: Results for Overall Breast Cancer in the Validation Set
| 77 | 77 | 1.49 | 1.44–1.56 | 0.612 | |
| <5 × 10−8 | 1,817 | 123 | 1.59 | 1.52–1.66 | 0.626 |
| <10−6 | 2,603 | 197 | 1.62 | 1.55–1.68 | 0.634 |
| <10−5 | 3,818 | 305 | 1.65 | 1.58–1.72 | 0.637 |
| <10−4 | 6,743 | 669 | 1.62 | 1.56–1.69 | 0.631 |
| <10−3 | 14,760 | 1,707 | 1.55 | 1.49–1.62 | 0.623 |
| Lasso | 15,032 | 3,820 | 1.71 | 1.64–1.79 | 0.647 |
The p value cut off refers to the SNPs considered based on their marginal associations in the training set; the same p value threshold was used in each case in the stepwise regression. Parameter selection and effect size estimation for derivation of the PRS was carried out in the training set as described in the Material and Methods.
OR per 1 SD for the PRS. OR for association with breast cancer in the validation set was derived using logistic regression adjusting for country and ten PCs. AUCs were adjusted for country. The lasso was carried out after pre-selecting SNPs at p < 10−3 based on their marginal association in the training set. For the lasso λ = 0.003 gave the optimal PRS in the validation set.
Association between PRS and Breast Cancer Risk in the Validation Set and Prospective Test Datasets
| Overall BC | 1.49 | 1.44–1.56 | 0.612 | 1.46 | 1.42–1.49 | 0.603 |
| ER-positive | 1.56 | 1.49–1.63 | 0.623 | 1.52 | 1.48–1.56 | 0.615 |
| ER-negative | 1.40 | 1.30–1.50 | 0.596 | 1.35 | 1.27–1.43 | 0.584 |
| Overall BC | 1.65 | 1.59–1.72 | 0.639 | 1.61 | 1.57–1.65 | 0.630 |
| ER-positive | 1.74 | 1.66–1.82 | 0.651 | 1.68 | 1.63–1.73 | 0.641 |
| ER-negative | 1.47 | 1.37–1.58 | 0.611 | 1.45 | 1.37–1.53 | 0.601 |
| Overall BC | 1.71 | 1.64–1.79 | 0.646 | 1.66 | 1.61–1.70 | 0.636 |
| ER-positive | 1.81 | 1.73–1.89 | 0.659 | 1.73 | 1.68–1.78 | 0.647 |
| ER-negative | 1.48 | 1.37–1.59 | 0.611 | 1.44 | 1.36–1.53 | 0.600 |
Parameter selection and effect size estimation for derivation of the PRS was carried out in the training set as described in the Material and Methods. The optimal subtype-specific PRS was obtained by carrying out case-only logistic regression and estimating effect sizes in the relevant subtype for SNPs passing a p value of 0.025 in case-only ordinary logistic regression (ER-positive versus ER-negative disease). OR for association with breast cancer in the validation set derived using logistic regression adjusting for country and ten PCs. AUCs were adjusted for by country. In the prospective test set, logistic regression models were adjusted for study and 15 PCs. AUCs were adjusted for by study.
OR per 1 SD for the PRS.
Figure 1Association between the 313 SNP Polygenic Risk Score and Breast Cancer Risk
Association between the 313 SNP polygenic risk score (PRS) and breast cancer risk in women of European origin for (A) overall breast cancers, (B) estrogen receptor (ER)-positive disease, and (C) ER-negative disease, in the validation (dashed line) and test (solid line) sets. Odds ratios are for different quantiles of the PRS relative to the mean PRS. Odds ratios and 95% confidence intervals are shown.
Figure 2Prospective Validation for the 313 SNP Polygenic Risk Score
Prospective validation for the 313 SNP polygenic risk score (PRS) by study for (A) overall breast cancer, (B) ER-positive disease, and (C) ER-negative disease. Association between the 313 SNP PRS and breast cancer risk in women of European origin. Odds ratios and 95% confidence intervals are shown. I-squared and p value for heterogeneity were calculated using fixed effect meta-analysis.
Associations between the 313-SNP PRS (PRS313) and Breast Cancer Risk by First-Degree Family History of Breast Cancer in the Combined Validation and Prospective Test Dataset
| PRS unadjusted | 1.67 | 1.62–1.72 | 1.44 | 1.37–1.54 |
| PRS in women without family history | 1.71 | 1.65–1.78 | 1.45 | 1.36–1.57 |
| PRS in women with family history | 1.55 | 1.48–1.65 | 1.40 | 1.27–1.55 |
| Interaction between PRS and family history | 0.91 | 0.85–0.97 (p = 0.004) | 0.96 | 0.85–1.09 (p = 0.53) |
| Family history unadjusted for PRS | 1.59 | 1.46–1.72 | 1.66 | 1.41–1.95 |
| Family history adjusted for PRS | 1.44 | 1.33–1.57 | 1.56 | 1.32–1.83 |
Association with breast cancer risk was tested for using logistic regression adjusting for study and ten PCs. For these analyses the validation and test datasets were combined. Analyses were restricted to women with known age and family history information. For ER-negative disease, 4,440 women with and 13,132 women without a family history of breast cancer were included in these analyses. For ER-positive disease, 6,787 women with and 17,351 women without a family history of breast cancer were included in these analyses.
OR per 1 SD for the PRS.
Figure 3Cumulative and 10-Year Absolute Risk of Developing Breast Cancer
Cumulative and 10-year absolute risk of developing breast cancer for (A) overall breast cancer, (B) ER-positive disease, and (C) ER-negative disease by percentiles of the 313 SNP polygenic risk scores (PRSs). Note different scales and PRS categories in the different panels. The red line shows the 2.6% risk threshold corresponding to the mean risk for women aged 47 years. Absolute risks were calculated based on UK incidence and mortality data and using the PRS relative risks estimated as described in the Material and Methods.