| Literature DB >> 35027648 |
Eileen O Dareng1, Jonathan P Tyrer2, Daniel R Barnes1, Michelle R Jones3, Xin Yang1, Katja K H Aben4,5, Muriel A Adank6, Simona Agata7, Irene L Andrulis8,9, Hoda Anton-Culver10, Natalia N Antonenkova11, Gerasimos Aravantinos12, Banu K Arun13, Annelie Augustinsson14, Judith Balmaña15,16, Elisa V Bandera17, Rosa B Barkardottir18,19, Daniel Barrowdale1, Matthias W Beckmann20, Alicia Beeghly-Fadiel21, Javier Benitez22,23, Marina Bermisheva24, Marcus Q Bernardini25, Line Bjorge26,27, Amanda Black28, Natalia V Bogdanova11,29,30, Bernardo Bonanni31, Ake Borg32, James D Brenton33, Agnieszka Budzilowska34, Ralf Butzow35, Saundra S Buys36, Hui Cai21, Maria A Caligo37, Ian Campbell38,39, Rikki Cannioto40, Hayley Cassingham41, Jenny Chang-Claude42,43, Stephen J Chanock44, Kexin Chen45, Yoke-Eng Chiew46,47, Wendy K Chung48, Kathleen B M Claes49, Sarah Colonna36, Linda S Cook50,51, Fergus J Couch52, Mary B Daly53, Fanny Dao54, Eleanor Davies55, Miguel de la Hoya56, Robin de Putter49, Joe Dennis1, Allison DePersia57,58, Peter Devilee59,60, Orland Diez61,62, Yuan Chun Ding63, Jennifer A Doherty64, Susan M Domchek65, Thilo Dörk30, Andreas du Bois66,67, Matthias Dürst68, Diana M Eccles69, Heather A Eliassen70,71, Christoph Engel72,73, Gareth D Evans74,75, Peter A Fasching20,76, James M Flanagan77, Renée T Fortner42, Eva Machackova78, Eitan Friedman79,80, Patricia A Ganz81, Judy Garber82, Francesca Gensini83, Graham G Giles84,85,86, Gord Glendon8, Andrew K Godwin87, Marc T Goodman88, Mark H Greene89, Jacek Gronwald90, Eric Hahnen91,92, Christopher A Haiman93, Niclas Håkansson94, Ute Hamann95, Thomas V O Hansen96, Holly R Harris97,98, Mikael Hartman99,100, Florian Heitz66,67,101, Michelle A T Hildebrandt102, Estrid Høgdall103,104, Claus K Høgdall105, John L Hopper85, Ruea-Yea Huang106, Chad Huff102, Peter J Hulick57,58, David G Huntsman107,108,109,110, Evgeny N Imyanitov111, Claudine Isaacs112, Anna Jakubowska90,113, Paul A James39,114, Ramunas Janavicius115,116, Allan Jensen103, Oskar Th Johannsson117, Esther M John118,119, Michael E Jones120, Daehee Kang121,122,123, Beth Y Karlan124, Anthony Karnezis125, Linda E Kelemen126, Elza Khusnutdinova24,127, Lambertus A Kiemeney4, Byoung-Gie Kim128, Susanne K Kjaer103,105, Ian Komenaka129, Jolanta Kupryjanczyk34, Allison W Kurian118,119, Ava Kwong130,131,132, Diether Lambrechts133,134, Melissa C Larson135, Conxi Lazaro136, Nhu D Le137, Goska Leslie1, Jenny Lester124, Fabienne Lesueur138,139,140, Douglas A Levine54,141, Lian Li45, Jingmei Li142, Jennifer T Loud89, Karen H Lu143, Jan Lubiński90, Phuong L Mai144, Siranoush Manoukian145, Jeffrey R Marks146, Rayna Kim Matsuno147, Keitaro Matsuo148,149, Taymaa May25, Lesley McGuffog1, John R McLaughlin150, Iain A McNeish151,152, Noura Mebirouk138,139,140, Usha Menon153, Austin Miller154, Roger L Milne84,85,86, Albina Minlikeeva155, Francesmary Modugno156,157, Marco Montagna7, Kirsten B Moysich155, Elizabeth Munro158,159, Katherine L Nathanson65, Susan L Neuhausen63, Heli Nevanlinna160, Joanne Ngeow Yuen Yie161,162, Henriette Roed Nielsen163, Finn C Nielsen96, Liene Nikitina-Zake164, Kunle Odunsi165, Kenneth Offit166,167, Edith Olah168, Siel Olbrecht169, Olufunmilayo I Olopade170, Sara H Olson171, Håkan Olsson14, Ana Osorio23,172, Laura Papi83, Sue K Park121,122,123, Michael T Parsons173, Harsha Pathak87, Inge Sokilde Pedersen174,175,176, Ana Peixoto177, Tanja Pejovic158,159, Pedro Perez-Segura56, Jennifer B Permuth178, Beth Peshkin112, Paolo Peterlongo179, Anna Piskorz33, Darya Prokofyeva180, Paolo Radice181, Johanna Rantala182, Marjorie J Riggan183, Harvey A Risch184, Cristina Rodriguez-Antona22,23, Eric Ross185, Mary Anne Rossing97,98, Ingo Runnebaum68, Dale P Sandler186, Marta Santamariña172,187,188, Penny Soucy189, Rita K Schmutzler91,92,190, V Wendy Setiawan93, Kang Shan191, Weiva Sieh192,193, Jacques Simard194, Christian F Singer195, Anna P Sokolenko111, Honglin Song196, Melissa C Southey84,86,197, Helen Steed198, Dominique Stoppa-Lyonnet199,200,201, Rebecca Sutphen202, Anthony J Swerdlow120,203, Yen Yen Tan195, Manuel R Teixeira177,204, Soo Hwang Teo205,206, Kathryn L Terry70,207, Mary Beth Terry208, Mads Thomassen163, Pamela J Thompson88, Liv Cecilie Vestrheim Thomsen26,27, Darcy L Thull209, Marc Tischkowitz210,211, Linda Titus212, Amanda E Toland213, Diana Torres95,214, Britton Trabert28, Ruth Travis215, Nadine Tung216, Shelley S Tworoger70,178, Ellen Valen26,27, Anne M van Altena4, Annemieke H van der Hout217, Els Van Nieuwenhuysen169, Elizabeth J van Rensburg218, Ana Vega172,219,220, Digna Velez Edwards221, Robert A Vierkant135, Frances Wang222,223, Barbara Wappenschmidt91,92, Penelope M Webb224, Clarice R Weinberg225, Jeffrey N Weitzel226, Nicolas Wentzensen28, Emily White98,227, Alice S Whittemore118,228, Stacey J Winham135, Alicja Wolk94,229, Yin-Ling Woo230, Anna H Wu93, Li Yan231, Drakoulis Yannoukakos232, Katia M Zavaglia37, Wei Zheng21, Argyrios Ziogas10, Kristin K Zorn144, Zdenek Kleibl233, Douglas Easton1,2, Kate Lawrenson3,234, Anna DeFazio46,47, Thomas A Sellers235, Susan J Ramus236,237, Celeste L Pearce238,239, Alvaro N Monteiro178, Julie Cunningham240, Ellen L Goode240, Joellen M Schildkraut241, Andrew Berchuck183, Georgia Chenevix-Trench173, Simon A Gayther3, Antonis C Antoniou1, Paul D P Pharoah242,243.
Abstract
Polygenic risk scores (PRS) for epithelial ovarian cancer (EOC) have the potential to improve risk stratification. Joint estimation of Single Nucleotide Polymorphism (SNP) effects in models could improve predictive performance over standard approaches of PRS construction. Here, we implemented computationally efficient, penalized, logistic regression models (lasso, elastic net, stepwise) to individual level genotype data and a Bayesian framework with continuous shrinkage, "select and shrink for summary statistics" (S4), to summary level data for epithelial non-mucinous ovarian cancer risk prediction. We developed the models in a dataset consisting of 23,564 non-mucinous EOC cases and 40,138 controls participating in the Ovarian Cancer Association Consortium (OCAC) and validated the best models in three populations of different ancestries: prospective data from 198,101 women of European ancestries; 7,669 women of East Asian ancestries; 1,072 women of African ancestries, and in 18,915 BRCA1 and 12,337 BRCA2 pathogenic variant carriers of European ancestries. In the external validation data, the model with the strongest association for non-mucinous EOC risk derived from the OCAC model development data was the S4 model (27,240 SNPs) with odds ratios (OR) of 1.38 (95% CI: 1.28-1.48, AUC: 0.588) per unit standard deviation, in women of European ancestries; 1.14 (95% CI: 1.08-1.19, AUC: 0.538) in women of East Asian ancestries; 1.38 (95% CI: 1.21-1.58, AUC: 0.593) in women of African ancestries; hazard ratios of 1.36 (95% CI: 1.29-1.43, AUC: 0.592) in BRCA1 pathogenic variant carriers and 1.49 (95% CI: 1.35-1.64, AUC: 0.624) in BRCA2 pathogenic variant carriers. Incorporation of the S4 PRS in risk prediction models for ovarian cancer may have clinical utility in ovarian cancer prevention programs.Entities:
Mesh:
Year: 2022 PMID: 35027648 PMCID: PMC8904525 DOI: 10.1038/s41431-021-00987-7
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 5.351
Fig. 1PRS model development using penalized regression and LDPred Bayesian approach.
Shown in the left panel is the two-stage approach with five-fold cross validation used for individual level genotype data while the right panel shows the LDPred approach used for summary level data.
Performance of different PRS models in five-fold cross-validation of OCAC data.
| Model | Number of SNPsa | Tuning parameter for best performance | AUC | OR per 1 SD of PRS | 95% CI |
|---|---|---|---|---|---|
| (a) Models based on individual level genotype data | |||||
| Lasso | 1403 | λ = 3.3 | 0.583 | 1.35 | 1.30–1.39 |
| Elastic net | 10,797 | λ = 3.3, | 0.586 | 1.36 | 1.31–1.40 |
| MCP | 1403 | λ = 3.3 | 0.583 | 1.35 | 1.30–1.39 |
| (b) Models based on summary statistics | |||||
| LDPred | 5,291,719 | 0.552 | 1.21 | 1.13–1.29 | |
| Stepwise | 22 | λ = 5.4 | 0.572 | 1.30 | 1.26–1.34 |
| Select and Shrink (OCAC) | 27,240 | a = 2.75, b = 2, φ = 3e−6 | 0.593 | 1.39 | 1.34–1.44 |
AUC area under the receiver operating characteristic (ROC) curve AUC), OR odds ratio, SD standard deviation, PRS polygenic risk score, CI confidence interval, NA not applicable.
aNumber of SNPs in PRS model run on full OCAC data set after selection of model parameters.
External validation of PRS models in European populations using data from UK Biobank and CIMBA.
| Model (data set) | SNPs | UK Biobank | CIMBA | CIMBA | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | OR | 95% CI | AUC | HR | 95% CI | AUC | HR | 95% CI | |||
| (a) PRS models based on OCAC data | |||||||||||
| Lasso (OCAC) | 1403 | 0.587 | 1.37 | 1.27–1.48 | 0.573 | 1.27 | 1.21–1.34 | 0.627 | 1.48 | 1.33–1.63 | |
| Elastic net (OCAC) | 10,797 | 0.588 | 1.36 | 1.26–1.47 | 0.583 | 1.32 | 1.26–1.39 | 0.617 | 1.47 | 1.33–1.63 | |
| Stepwise (OCAC) | 22 | 0.588 | 1.35 | 1.26–1.46 | 0.563 | 1.21 | 1.16–1.26 | 0.605 | 1.39 | 1.26–1.54 | |
| Select and shrink (OCAC) | 27,240 | 0.588 | 1.38 | 1.28–1.48 | 0.592 | 1.36 | 1.29–1.43 | 0.624 | 1.49 | 1.35–1.64 | |
| (b) PRS models based on meta-analysis of OCAC and CIMBA data | |||||||||||
| Stepwise (OCAC-CIMBA)b | 36 | 0.595 | 1.39 | 1.29–1.50 | NA | NA | NA | NA | NA | NA | |
| Select and shrink (OCAC-CIMBA) | 18,007 | 0.596 | 1.42 | 1.32–1.54 | NA | NA | NA | NA | NA | NA | |
AUC area under the receiver operating characteristic curve, OR odds ratio, HR hazards ratio.
aEstimates are from unadjusted models.
bResults in CIMBA are overfitted as the CIMBA data was used for model development.
Fig. 2Association between the PLR PRS models and non-mucinous ovarian cancer by PRS percentiles.
Shown are estimated odds ratios (OR) and confidence intervals for women of European ancestries by percentiles of polygenic risk scores derived from lasso (A), elastic net (B), stepwise (C) and S4 (D) models relative to the middle quintile.
Association between polygenic risk scores and non-mucinous EOC by PRS percentiles and ancestry.
| UK Biobank | East Asian | African | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Percentile | Controls ( | Cases ( | OR (95% CI) | Controls ( | Cases ( | OR (95% CI) | Controls ( | Cases ( | OR (95% CI) | |
| (a) Lasso | ||||||||||
| 0–5 | 9880 | 12 | 0.42 (0.22–0.72) | 278 | 106 | 0.65 (0.51–0.83) | 35 | 19 | 0.89 (0.47–1.65) | |
| 5–10 | 9870 | 24 | 0.83 (0.52–1.27) | 271 | 112 | 0.71 (0.55–0.90) | 41 | 13 | 0.52 (0.25–1.01) | |
| 10–20 | 19,733 | 53 | 0.92 (0.66–1.27) | 487 | 280 | 0.98 (0.82–1.18) | 81 | 26 | 0.53 (0.31–0.88) | |
| 20–40 | 39,468 | 104 | 0.90 (0.69–1.18) | 993 | 541 | 0.93 (0.80–1.08) | 154 | 60 | 0.64 (0.42–0.99) | |
| 40–60 | 39,457 | 115 | 1 | 967 | 566 | 1 | 133 | 81 | 1 | |
| 60–80 | 39,425 | 147 | 1.28 (1.00–1.64) | 941 | 593 | 1.08 (0.93–1.25) | 136 | 78 | 0.94 (0.64–1.39) | |
| 80–90 | 19,699 | 87 | 1.52 (1.14–2.00) | 466 | 301 | 1.10 (0.92–1.32) | 63 | 44 | 1.15 (0.71–1.84) | |
| 90–95 | 9842 | 51 | 1.78 (1.27–2.46) | 214 | 169 | 1.35 (1.07–1.69) | 34 | 20 | 0.97 (0.51–1.78) | |
| 95–100 | 9830 | 64 | 2.23 (1.64–3.02) | 211 | 173 | 1.40 (1.12–1.76) | 27 | 27 | 1.64 (0.90–3.00) | |
| (b) Elastic net | ||||||||||
| 0–5 | 9876 | 17 | 0.67 (0.39–1.09) | 277 | 107 | 0.72 (0.56–0.92) | 35 | 19 | 0.90 (0.47–1.64) | |
| 5–10 | 9876 | 17 | 0.67 (0.39–1.09) | 271 | 112 | 0.78 (0.61–0.99) | 41 | 13 | 0.52 (0.25–1.01) | |
| 10–20 | 19,740 | 45 | 0.89 (0.62–1.26) | 497 | 270 | 1.02 (0.85–1.22) | 81 | 26 | 0.53 (0.31–0.88) | |
| 20–40 | 39,453 | 120 | 1.19 (0.91–1.55) | 967 | 567 | 1.10 (0.95–1.28) | 154 | 60 | 0.64 (0.42–0.96) | |
| 40–60 | 39,471 | 101 | 1 | 1000 | 533 | 1 | 133 | 81 | 1 | |
| 60–80 | 39,413 | 159 | 1.58 (1.23–2.03) | 926 | 608 | 1.23 (1.06–1.43) | 136 | 78 | 0.94 (0.64–1.39) | |
| 80–90 | 19,695 | 91 | 1.80 (1.36–2.40) | 457 | 310 | 1.27 (1.06–1.52) | 63 | 44 | 1.15 (0.71–1.84) | |
| 90–95 | 9841 | 52 | 2.07 (1.47–2.87) | 226 | 157 | 1.30 (1.04–1.64) | 34 | 20 | 0.97 (0.51–1.78) | |
| 95–100 | 9839 | 55 | 2.18 (1.56–3.02) | 207 | 177 | 1.60 (1.28–2.01) | 27 | 27 | 1.64 (0.90–3.00) | |
| (c) Stepwise | ||||||||||
| 0–5 | 9880 | 13 | 0.39 (0.21–0.67) | 254 | 130 | 0.90 (0.71–1.14) | 40 | 14 | 0.75 (0.37–1.44) | |
| 5–10 | 9874 | 19 | 0.57 (0.34–0.91) | 268 | 115 | 0.76 (0.59–0.96) | 43 | 11 | 0.55 (0.26–1.10) | |
| 10–20 | 19,742 | 44 | 0.67 (0.47–0.93) | 494 | 273 | 0.98 (0.81–1.17) | 80 | 27 | 0.72 (0.42–1.21) | |
| 20–40 | 39,470 | 102 | 0.77 (0.60–1.00) | 970 | 564 | 1.03 (0.89–1.19) | 142 | 72 | 1.09 (0.73–1.63) | |
| 40–60 | 39,440 | 132 | 1 | 979 | 564 | 1 | 146 | 68 | 1 | |
| 60–80 | 39,414 | 158 | 1.20 (0.95–1.51) | 951 | 583 | 1.08 (0.94–1.25) | 130 | 84 | 1.39 (0.93–2.07) | |
| 80–90 | 19,697 | 88 | 1.33 (1.02–1.75) | 456 | 311 | 1.21 (1.01–1.44) | 61 | 46 | 1.62 (1.00–2.61) | |
| 90–95 | 9853 | 41 | 1.24 (0.86–1.75) | 236 | 147 | 1.10 (0.87–1.38) | 35 | 19 | 1.17 (0.61–2.17) | |
| 95–100 | 9834 | 60 | 1.82 (1.33–2.46) | 220 | 164 | 1.32 (1.04–1.65) | 27 | 27 | 2.15 (1.17–3.95) | |
| (d) Select and shrink | ||||||||||
| 0–5 | 9957 | 16 | 0.54 (0.31–0.89) | 279 | 105 | 0.63 (0.49–0.81) | 38 | 16 | 0.71 (0.36–1.33) | |
| 5–10 | 9888 | 15 | 0.51 (0.29–0.85) | 254 | 129 | 0.85 (0.67–1.08) | 41 | 13 | 0.53 (0.26–1.03) | |
| 10–20 | 19,812 | 51 | 0.87 (0.62–1.20) | 489 | 278 | 0.96 (0.80–1.14) | 81 | 26 | 0.54 (0.32–0.90) | |
| 20–40 | 39,435 | 113 | 0.97 (0.75–1.25) | 1013 | 521 | 0.86 (0.75–1.00) | 156 | 58 | 0.62 (0.41–0.94) | |
| 40–60 | 39,512 | 117 | 1 | 961 | 572 | 1 | 134 | 80 | 1 | |
| 60–80 | 39,316 | 158 | 1.36 (1.07–1.73) | 950 | 584 | 1.03 (0.89–1.20) | 137 | 77 | 0.94 (0.63–1.40) | |
| 80–90 | 19,718 | 77 | 1.32 (0.98–1.76) | 434 | 333 | 1.29 (1.08–1.54) | 61 | 46 | 1.26 (0.79–2.02) | |
| 90–95 | 9791 | 45 | 1.55 (1.09–2.17) | 233 | 150 | 1.08 (0.86–1.36) | 30 | 24 | 1.34 (0.73–2.45) | |
| 95–100 | 9775 | 65 | 2.25 (1.65–3.03) | 215 | 169 | 1.32 (1.05–1.66) | 26 | 28 | 1.80 (0.99–3.31) | |
OR odds ratio, CI confidence interval.
Fig. 3Cumulative risk of ovarian cancer between birth and age 80 by PRS percentiles and PRS models.
Shown are the cumulative risk of ovarian cancer risk in UK women by polygenic risk score percentiles. The lasso (A) and elastic net (B) penalized regression models were applied to individual level genotype data, while the stepwise (C) and S4 (D) models were applied to summary level statistics. Note that the median and the mean risk differ because the distribution of the relative risk in the population is left-skewed (the log relative risk is a Normal distribution).
External validation of PRS models in East Asian and African Populations.
| Model | East Asian ancestries | African ancestries | ||||
|---|---|---|---|---|---|---|
| AUC | OR | 95% CI | AUC | OR | 95% CI | |
| Lasso | 0.541 | 1.16 | (1.11–1.22) | 0.576 | 1.28 | (1.13–1.45) |
| Elastic net | 0.543 | 1.17 | (1.12–1.23) | 0.574 | 1.29 | (1.14–1.47) |
| Stepwise (OCAC) | 0.528 | 1.11 | (1.06–1.16) | 0.581 | 1.34 | (1.18–1.52) |
| Select and shrink (OCAC) | 0.538 | 1.14 | (1.08–1.19) | 0.593 | 1.38 | (1.21–1.58) |
| Stepwise (OCAC-CIMBA) | 0.542 | 1.17 | (1.11–1.23) | 0.594 | 1.37 | (1.20–1.56) |
| Select and shrink (OCAC-CIMBA) | 0.537 | 1.14 | (1.08–1.19) | 0.596 | 1.41 | (1.23–1.61) |