| Literature DB >> 35311964 |
Yaohua Yang1, Ran Tao2, Xiang Shu3, Qiuyin Cai1, Wanqing Wen1, Kai Gu4, Yu-Tang Gao5, Ying Zheng6, Sun-Seog Kweon7,8, Min-Ho Shin7, Ji-Yeob Choi9,10,11, Eun-Sook Lee12,13,14, Sun-Young Kong12,13,14, Boyoung Park14,15, Min Ho Park16, Guochong Jia1, Bingshan Li17, Daehee Kang10,11,18,19, Xiao-Ou Shu1, Jirong Long1, Wei Zheng1.
Abstract
Importance: Polygenic risk scores (PRSs) have shown promise in breast cancer risk prediction; however, limited studies have been conducted among Asian women. Objective: To develop breast cancer risk prediction models for Asian women incorporating PRSs and nongenetic risk factors. Design, Setting, and Participants: This diagnostic study included women of Asian ancestry from the Asia Breast Cancer Consortium. PRSs were developed using data from genomewide association studies (GWASs) of breast cancer conducted among 123 041 women with Asian ancestry (including 18 650 women with breast cancer) using 3 approaches: (1) reported PRS for women with European ancestry; (2) breast cancer-associated single-nucleotide variations (SNVs) identified by fine-mapping of GWAS-identified risk loci; and (3) genomewide risk prediction algorithms. A nongenetic risk score (NGRS) was built, including 7 well-established nongenetic risk factors, using data of 416 case participants and 1558 control participants from a prospective cohort study. PRSs were initially validated in an independent data set including 1426 case participants and 1323 control participants and further evaluated, along with the NGRS, in the second data set including 368 case participants and 736 control participants nested within a prospective cohort study. Main Outcomes and Measures: Logistic regression was used to examine associations of risk scores with breast cancer risk to estimate odds ratios (ORs) with 95% CIs and area under the receiver operating characteristic curve (AUC).Entities:
Mesh:
Year: 2022 PMID: 35311964 PMCID: PMC8938714 DOI: 10.1001/jamanetworkopen.2021.49030
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Summary of Participating Studies Included in the Current Study
| Study | Case participants, No. | Control participants, No. | Age at enrollment, mean (SD), y | |
|---|---|---|---|---|
| Case participants | Control participants | |||
| PRS training and testing | ||||
| Training set | ||||
| SBCGS | 5384 | 6347 | 52.8 (9.3) | 52.1 (9.2) |
| HCES-Br | 274 | 273 | 49.1 (10.8) | 54.0 (7.4) |
| KPOP | 963 | 921 | NA | NA |
| BBJ2 | 5552 | 89 731 | NA | NA |
| SeBCS | 2246 | 2052 | NA | NA |
| BCAC-Asians | 4231 | 5067 | 54.4 (10.4) | 53.8 (10.0) |
| Validation set | ||||
| SBCGS | 1426 | 1323 | 50.1 (11.3) | 50.6 (9.5) |
| Subtotal | 20 076 | 105 714 | NA | NA |
| Prospective study | ||||
| SWHS | 368 | 736 | 52.1 (8.7) | 51.6 (9.5) |
Abbreviations: BBJ2, The Biobank Japan Project 2; BCAC, Breast Cancer Association Consortium; HCES-Br, Hwasun Cancer Epidemiology Study–Breast; KPOP, Korea Precision Oncology Program; NA, not applicable; PRS, polygenic risk score; SBCGS, Shanghai Breast Cancer Genetic Study; SeBCS, Seoul Breast Cancer Study; SWHS, Shanghai Women’s Health Study.
Individual level data were not available for KPOP, BBJ2, and SeBCS.
Associations of PRSs With Breast Cancer Risk in the Validation Set and Prospective Test Set, the Asia Breast Cancer Consortium
| PRS development methods | Validation set (1426 case participants vs 1323 control participants) | Prospective test set (368 case participants vs 736 control participants) | ||||
|---|---|---|---|---|---|---|
| OR (95% CI) | AUC (95% CI) | OR (95% CI) | AUC (95% CI) | |||
| Published European PRS | ||||||
| PRS263-European | 1.42 (1.31 to 1.53) | 0.597 (0.575 to 0.618) | 2.47 × 10−18 | 1.62 (1.42 to 1.85) | 0.625 (0.590 to 0.659) | 2.71 × 10−12 |
| PRS263-Asian | 1.44 (1.33 to 1.56) | 0.601 (0.580 to 0.622) | 5.47 × 10−20 | 1.58 (1.38 to 1.80) | 0.616 (0.582 to 0.651) | 1.41 × 10−11 |
| PRS263-meta | 1.44 (1.33 to 1.55) | 0.600 (0.579 to 0.621) | 1.54 × 10−19 | 1.63 (1.43 to 1.87) | 0.626 (0.592 to 0.661) | 1.25 × 10−12 |
| Fine-mapping | ||||||
| PRS111, with COJO | 1.45 (1.34 to 1.57) | 0.603 (0.582 to 0.624) | 2.72 × 10−20 | 1.67 (1.46 to 1.92) | 0.639 (0.604 to 0.674) | 1.28 × 10−13 |
| PRS112, with COJO | 1.42 (1.31 to 1.53) | 0.597 (0.575 to 0.618) | 1.38 × 10−18 | 1.63 (1.42 to 1.87) | 0.632 (0.597 to 0.667) | 1.70 × 10−12 |
| PRS135, with COJO | 1.38 (1.28 to 1.49) | 0.592 (0.571 to 0.613) | 3.30 × 10−16 | 1.54 (1.35 to 1.76) | 0.619 (0.584 to 0.655) | 1.55 × 10−10 |
| Genomewide risk prediction algorithms | ||||||
| LDpred, with 4 487 284 SNVs | 1.44 (1.34 to 1.56) | 0.600 (0.579 to 0.621) | 4.96 × 10−20 | 1.52 (1.34 to 1.74) | 0.616 (0.581 to 0.651) | 4.08 × 10−10 |
| LDpred2, with 855 680 SNVs | 1.40 (1.29 to 1.51) | 0.591 (0.570 to 0.612) | 4.77 × 10−17 | 1.51 (1.33 to 1.72) | 0.612 (0.577 to 0.648) | 7.47 × 10−10 |
| PRS-CSx, with 855 680 SNVs | 1.51 (1.39 to 1.63) | 0.613 (0.592 to 0.634) | 3.03 × 10−24 | 1.70 (1.49 to 1.95) | 0.642 (0.608 to 0.676) | 1.37 × 10−14 |
Abbreviations: AUC, area under the receiver operating characteristic curve; COJO, conditional and joint association; OR, odds ratio; PRS, polygenic risk score; SNVs, single-nucleotide variants.
OR per SD increase in PRS scores; P values were estimated using logistic regression.
Of the 330 SNVs included in the European-ancestry PRS reported by Zhang et al,[11] data on 263 SNVs were available in our validation and prospective test sets and thus included in this analysis. These PRSs were developed using weights from Breast Cancer Association Consortium–European data only (PRS263-European), Asian data only (PRS263-Asian), and meta-analyses of these 2 data sets (PRS263-meta), respectively.
Developed using SNVs selected from fine-mapping of Asian data and showing consistent association directions in Breast Cancer Association Consortium–European data with P < .05. All weights were derived using Asian data.
For each algorithm, only the most predictive PRS in the validation set is presented. Weights of SNVs from our training set were estimated using each algorithm.
Figure 1. Distributions of Scores on Standardized Polygenic Risk Scores (PRSs) Among Patients With and Without Breast Cancer in the Prospective Test Set
The PRS was standardized by subtracting the mean and dividing by the standard deviation. B and D, The upper edge, center line, and lower edge of the box represent the first, second, and third quartiles, respectively, of PRS percentile. The whiskers indicate the full range of the data. PRS111 indicates the PRS using 111 single-nucleotide variants; PRS263-meta, the PRS based on a meta-analysis of European and Asian data.
Performance of Risk Scores in the Prospective Test Set
| Model | AUC (95% CI) | Predictor | OR (95% CI) | |
|---|---|---|---|---|
| NGRS | 0.565 (0.529-0.601) | NGRS | 1.29 (1.14-1.46) | 6.36 × 10−5 |
| PRS111 | 0.639 (0.604-0.674) | PRS111 | 1.67 (1.46-1.92) | 1.28 × 10−13 |
| PRS263-meta | 0.626 (0.592-0.661) | PRS263-meta | 1.63 (1.43-1.87) | 1.25 × 10−12 |
| IRPM111 | 0.648 (0.613-0.682) | PRS111 | 1.66 (1.46-1.91) | 2.14 × 10−13 |
| NGRS | 1.17 (1.03-1.33) | .02 | ||
| IRPM263-meta | 0.632 (0.597-0.666) | PRS263-meta | 1.62 (1.42-1.86) | 2.91 × 10−12 |
| NGRS | 1.16 (1.02-1.32) | .02 |
Abbreviations: AUC, area under the receiver operating characteristic curve; IRPM, integrated risk prediction model; NGRS, nongenetic risk score; OR, odds ratio; PRS, polygenic risk score.
OR per SD increase; P values were estimated using logistic regression.
The NGRS was derived from body mass index, menopause status, waist-to-hip ratio, a previous diagnosis of benign breast disease, age at menarche, age at first live birth, family history of breast cancer, and an interaction term between body mass index and menopause status. Weights of these factors were derived from the training set including 416 individuals with breast cancer and 1558 control participants from the Shanghai Women’s Health Study.
Results for PRS111, the best PRS derived in the present study, are presented for comparison purposes. IRPM111 was the model including PRS111 and the NGRS.
Results for PRS263-meta, which was derived based on meta-analysis results of Asian and Breast Cancer Association Consortium–European data for 330 single-nucleotide variants initially reported in populations with European ancestry (Zhang et al[11]), are presented for comparison purposes. IRPM263-meta was the model including PRS263-meta and the NGRS.
Figure 2. Ten-Year Absolute Risk of Developing Breast Cancer Estimated Using Data From 10 207 Chinese Women
A and B, Odds ratios of breast cancer for percentiles of scores on the polygenic risk score using 111 single-nucleotide variants (PRS111) and the polygenic risk score based on a meta-analysis of European and Asian data (PRS263-meta) compared with the average risk group (ie, 40th-60th percentile). C and D, Ten-year absolute risk of breast cancer by percentiles of PRS111 and PRS263-meta score for women in different age categories.