| Literature DB >> 35585550 |
Leila Dorling1, Sara Carvalho1, Jamie Allen1, Michael T Parsons2, Cristina Fortuno2, Anna González-Neira3, Stephan M Heijl4, Muriel A Adank5, Thomas U Ahearn6, Irene L Andrulis7,8, Päivi Auvinen9,10,11, Heiko Becher12, Matthias W Beckmann13, Sabine Behrens14, Marina Bermisheva15, Natalia V Bogdanova16,17,18, Stig E Bojesen19,20,21, Manjeet K Bolla1, Michael Bremer16, Ignacio Briceno22, Nicola J Camp23, Archie Campbell24,25, Jose E Castelao26, Jenny Chang-Claude14,27, Stephen J Chanock6, Georgia Chenevix-Trench2, J Margriet Collée28, Kamila Czene29, Joe Dennis1, Thilo Dörk17, Mikael Eriksson29, D Gareth Evans30,31,32,33, Peter A Fasching13,34, Jonine Figueroa6,25,35, Henrik Flyger36, Marike Gabrielson29, Manuela Gago-Dominguez37,38, Montserrat García-Closas6, Graham G Giles39,40,41, Gord Glendon7, Pascal Guénel42, Melanie Gündert43,44,45, Andreas Hadjisavvas46,47, Eric Hahnen48,49, Per Hall29,50, Ute Hamann51, Elaine F Harkness32,33,52, Mikael Hartman53,54,55, Frans B L Hogervorst5, Antoinette Hollestelle56, Reiner Hoppe57,58, Anthony Howell33,59, Anna Jakubowska60,61, Audrey Jung14, Elza Khusnutdinova15,62, Sung-Won Kim63, Yon-Dschun Ko64, Vessela N Kristensen65,66, Inge M M Lakeman67,68, Jingmei Li54,69, Annika Lindblom70,71, Maria A Loizidou46,47, Artitaya Lophatananon72, Jan Lubiński60, Craig Luccarini73, Michael J Madsen23, Arto Mannermaa9,74,75, Mehdi Manoochehri51, Sara Margolin50,76, Dimitrios Mavroudis77, Roger L Milne39,40,41, Nur Aishah Mohd Taib78,79, Kenneth Muir72, Heli Nevanlinna80, William G Newman30,31,33, Jan C Oosterwijk81, Sue K Park82,83,84, Paolo Peterlongo85, Paolo Radice86, Emmanouil Saloustros87, Elinor J Sawyer88, Rita K Schmutzler48,49,89, Mitul Shah73, Xueling Sim53, Melissa C Southey39,41,90, Harald Surowy43,44, Maija Suvanto80, Ian Tomlinson91,92, Diana Torres51,93, Thérèse Truong42, Christi J van Asperen68, Regina Waltes17, Qin Wang1, Xiaohong R Yang6, Paul D P Pharoah1,73, Marjanka K Schmidt94,95, Javier Benitez3,96, Bas Vroling4,97, Alison M Dunning73, Soo Hwang Teo78,98, Anders Kvist99, Miguel de la Hoya100, Peter Devilee67,101, Amanda B Spurdle2, Maaike P G Vreeswijk67, Douglas F Easton102,103.
Abstract
BACKGROUND: Protein truncating variants in ATM, BRCA1, BRCA2, CHEK2, and PALB2 are associated with increased breast cancer risk, but risks associated with missense variants in these genes are uncertain.Entities:
Keywords: Breast cancer; Genetic epidemiology; Missense variants; Risk prediction
Mesh:
Year: 2022 PMID: 35585550 PMCID: PMC9116026 DOI: 10.1186/s13073-022-01052-8
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 15.266
Breast cancer risk association results from logistic regression and mixture models of population training samples
| Logistic regression model | Mixture model | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Risk group | Variantsa | Cases | Controls | ORb | 95% CIc | Missense OR (95% CI)d | 95% CIf | ||
| Log-likelihood = − 48,624.97 | Log-likelihood = − 48,624.64 | ||||||||
| Non-carriers | – | 33,351 | 37,001 | 1 | – | – | 0 | – | |
| Carriers | 2.16 (1.78–2.63)h | ||||||||
| Variant outside FAT and PIK domains | 714 | 1259 | 1443 | 0.98 | (0.91–1.06) | 0.67 | 0.0041 | (0.001–0.02) | |
| Variant inside FAT or PIK domain and CADD score quintiles 1–4 g | 171 | 317 | 333 | 1.10 | (0.94–1.29) | 0.24 | 0.055 | (0.03–0.12) | |
| Variant inside FAT or PIK domain and CADD score quintile 5 g | 103 | 239 | 162 | 1.64 | (1.33–2.02) | 3.1 × 10−6 | 0.54 | (0.41–0.68) | |
| Log-likelihood = − 48,652.14 | Log-likelihood = − 48,652.29 | ||||||||
| Non-carriers | – | 34,191 | 37,996 | 1 | – | – | 0 | – | |
| Carriers | 10.61 (7.92–14.21)h | ||||||||
| Variant outside RING and BRCT domains | 479 | 811 | 856 | 1.01 | (0.92–1.12) | 0.79 | 0.0015 | (9.4 × 10−5–0.025) | |
| Variant inside RING or BRCT domain and low Helix score | 79 | 120 | 103 | 1.18 | (0.90–1.55) | 0.23 | 1.0 × 10−11 | NA | |
| Variant inside RING or BRCT domain and high Helix score | 23 | 63 | 16 | 4.94 | (2.83–8.61) | 1.9 × 10−8 | 0.48 | (0.19–0.78) | |
| Log-likelihood = − 48,641.97 | Log-likelihood = − 48,638.78 | ||||||||
| Non-carriers | – | 33,006 | 36,517 | 1 | – | – | 0 | – | |
| Carriers | 5.87 (4.75–7.24)h | ||||||||
| Variant with low Helix score | 1160 | 2062 | 2323 | 0.98 | (0.92–1.04) | 0.47 | 5.1 × 10−5 | (2.4 × 10−9–0.52) | |
| Variant with high Helix score | 62 | 114 | 94 | 1.28 | (0.96–1.70) | 0.087 | 0.11 | (0.04–0.25) | |
| Log-likelihood = − 48,728.96 | Log-likelihood = − 48,728.70 | ||||||||
| Non-carriers | – | 34,582 | 38,480 | 1 | – | – | 0 | – | |
| Carriers | 1.75 (1.47–2.08)i | ||||||||
| Variant with low Helix score | 157 | 403 | 363 | 1.26 | (1.08–1.46) | 0.0025 | 0.33 | (0.25–0.43) | |
| Variant with high Helix score | 121 | 265 | 177 | 1.73 | (1.42–2.11) | 4.7 × 10−8 | 0.95 | (0.86–0.98) | |
| Log-likelihood = − 48,728.67 | Log-likelihood = − 48,729.17 | ||||||||
| Non-carriers | – | 34,622 | 38,291 | 1 | – | – | 0 | – | |
| Carriers | 424 | 618 | 713 | 0.95 | (0.85–1.06) | 0.34 | 4.87 (3.50–6.77)h | 1.1 × 10−4 | (1.6 × 10−9–0.88) |
a Number of unique missense substitutions in population dataset
b Logistic regression odds ratio estimate for missense variant carriers
c 95% confidence interval for logistic regression OR estimate for missense variant carriers
d Mixture model odds ratio and 95% confidence interval for missense variant carriers
e Alpha: estimated proportion of risk associated missense variants
f 95% confidence interval for alpha
g CADD quintiles 1–4 includes all CADD score values ≤ 3.736542; CADD quintile 5 includes all CADD score values > 3.736542
h Missense variant odds ratio constrained to equal odds ratio for protein truncating variants
i Missense variant odds ratio unconstrained
Fig. 1Odds ratios and alpha estimates for each of five genes in population training samples. A ATM. Odds ratios for breast cancer risk from logistic regression models. Alpha is the estimated proportion of risk associated variants from mixture models, based on variants in control samples. ATM risk categories: variants lying within the FAT or PI3K/PI4K protein domains with CADD score in the fifth quintile (FAT/PIK + CADD5); variants lying within the FAT or PI3K/PI4K protein domains with CADD score in any of the first four quintiles (FAT/PIK + CADD1-4); variants lying outside the FAT and PI3K/PI4K protein domains (Outside FAT/PIK). B BRCA1. Odds ratios for breast cancer risk from logistic regression models. Alpha is the estimated proportion of risk associated variants from mixture models, based on variants in control samples. BRCA1 risk categories: variants lying within the RING or BRCT domains with a high Helix score (RING/BRCT + Helix-high); variants lying with the RING or BRCT domains with a low Helix score (RING/BRCT + Helix-low); variants lying outside the RING and BRCT domains (Outside RING/BRCT). C BRCA2. Odds ratios for breast cancer risk from logistic regression models. Alpha is the estimated proportion of risk associated variants from mixture models, based on variants in control samples. BRCA2 risk categories: variants with a high Helix score (Helix-high); variants with a low Helix score (Helix-low). D CHEK2. Odds ratios for breast cancer risk from logistic regression models. Alpha is the estimated proportion of risk associated variants from mixture models, based on variants in control samples. CHEK2 risk categories: variants with a high Helix score (Helix-high); variants with a low Helix score (Helix-low). E PALB2. Odds ratios for breast cancer risk from logistic regression models. Alpha is the estimated proportion of risk associated variants from mixture models, based on variants in control samples. PALB2 risk categories: carriers of any missense variant (Carriers)
Fig. 2Case and control carriers across all samples for each observed missense variant by gene. A ATM. ATM risk categories: variants lying within the FAT or PI3K/PI4K protein domains with CADD score in fifth quintile (FAT/PIK + CADD5); variants lying within the FAT or PI3K/PI4K protein domains with CADD score in any of first four quintiles (FAT/PIK + CADD1-4); variants lying outside the FAT and PI3K/PI4K protein domains (Outside FAT/PIK). B BRCA1. BRCA1 risk categories: variants lying within the RING or BRCT domains with a high Helix score (RING/BRCT + Helix-high); variants lying with the RING or BRCT domains with a low Helix score (RING/BRCT + Helix-low); variants lying outside the RING and BRCT domains (Outside RING/BRCT). C BRCA2. BRCA2 risk categories: variants with a high Helix score (Helix-high); variants with a low Helix score (Helix-low). D CHEK2. CHEK2 risk categories: variants with a high Helix score (Helix-high); variants with a low Helix score (Helix-low). E PALB2. PALB2 risk categories: carriers of any missense variant (Carriers)
Fig. 3Breast cancer risk estimates from composite gene model in validation samples. Black marks indicate corresponding ORs from training models. Risk categories: ATM FAT/PIK + CADD5: ATM variants lying within the FAT or PI3K/PI4K protein domains with CADD score in fifth quintile; ATM FAT/PIK + CADD1-4: ATM variants lying within the FAT or PI3K/PI4K protein domains with CADD score in any of first four quintiles; ATM outside FAT/PIK: variants lying outside the FAT and PI3K/PI4K protein domains; BRCA1 RING/BRCT + Helix-high: BRCA1 variants lying within the RING or BRCT domains with a high Helix score; BRCA1 RING/BRCT + Helix-low: BRCA1 variants lying with the RING or BRCT domains with a low Helix score; BRCA1 outside RING/BRCT: BRCA1 variants lying outside the RING and BRCT domains; BRCA2 Helix-high: BRCA2 variants with a high Helix score; BRCA2 Helix-low: BRCA2 variants with a low Helix score; CHEK2 Helix-high: CHEK2 variants with a high Helix score; CHEK2 Helix-low: CHEK2 variants with a low Helix score; PALB2 carriers: carriers of any missense variant in PALB2