Literature DB >> 34983122

Prediction of clinically significant prostate cancer using polygenic risk models in Asians.

Sang Hun Song1,2, Eunae Kim3, Eunjin Woo3, Eunkyung Kwon1,3, Sungroh Yoon4, Jung Kwon Kim1, Hakmin Lee1, Jong Jin Oh1,2, Sangchul Lee1, Sung Kyu Hong1,2, Seok-Soo Byun1,3,5.   

Abstract

PURPOSE: To develop and evaluate the performance of a polygenic risk score (PRS) constructed in a Korean male population to predict clinically significant prostate cancer (csPCa).
MATERIALS AND METHODS: Total 2,702 PCa samples and 7,485 controls were used to discover csPCa susceptible single nucleotide polymorphisms (SNPs). Males with biopsy-proven or post-radical prostatectomy Gleason score 7 or higher were included for analysis. After genotype imputation for quality control, logistic regression models were applied to test association and calculate effect size. Extracted candidate SNPs were further tested to compare predictive performance according to number of SNPs included in the PRS. The best-fit model was validated in an independent cohort of 311 cases and 822 controls.
RESULTS: Of the 83 candidate SNPs with significant PCa association reported in previous literature, rs72725879 located in PRNCR1 showed the highest significance for PCa risk (odds ratio, 0.597; 95% confidence interval [CI], 0.555-0.641; p=4.3×10-45). Thirty-two SNPs within 26 distinct loci were further selected for PRS construction. Best performance was found with the top 29 SNPs, with AUC found to be 0.700 (95% CI, 0.667-0.734). Males with very-high PRS (above the 95th percentile) had a 4.92-fold increased risk for csPCa.
CONCLUSIONS: Ethnic-specific PRS was developed and validated in Korean males to predict csPCa susceptibility using the largest csPCa sample size in Asia. PRS can be a potential biomarker to predict individual risk. Future multi-ethnic trials are required to further validate our results. © The Korean Urological Association, 2022.

Entities:  

Keywords:  Genome-wide association study; Multifactorial inheritance; Polymorphism, single nucleotide; Prostatic neoplasms

Mesh:

Year:  2022        PMID: 34983122      PMCID: PMC8756152          DOI: 10.4111/icu.20210305

Source DB:  PubMed          Journal:  Investig Clin Urol        ISSN: 2466-0493


INTRODUCTION

The incidence of prostate cancer (PCa) is constantly on the rise, currently second in cancer diagnosis and sixth in cancer mortality in men worldwide as of 2020 [1]. This trend is more apparent in Asia where PCa has traditionally had low prevalence, with literature pointing to the gradual transition to a westernized high-fat intake diet and obesity increase as well as broader implementation of nationwide PCa screening as notable factors [2]. As such, genetic origins of PCa pathogenesis have been of keen clinical interest, and with the approval of PARP (poly [adenosine diphosphate-ribose] polymerase) inhibitors as novel therapy for actionable genetic mutations including BRCA 1/2, the National Comprehensive Cancer Network (NCCN) now recommends germline testing for PCa not only in men with a positive family history but also in high- to very-high risk regional or metastatic PCa. Alterations in homologous DNA repair and mismatch repair genes are strongly linked to 3- to 6-fold increase in overall risk [3]. However, not all PCa can be explained with nominal genes of high penetrance, but rather can be attributed to the cumulative effect of small variant alleles that confer varying levels of individual risk to disease [34]. With the pivotal study that identified 5 singular single nucleotide polymorphisms (SNPs) having strong association with metastatic PCa [5], the search for PCa susceptible variants have culminated in over 100 SNPs found in large-scale genome-wide association studies (GWASs), attributing to 60% of heritability [6]. Associations from germline mutations provide valuable information in predicting lifetime trajectory, allowing individualized medical care at an early stage of screening and diagnosis. Polygenic risk score (PRS) takes advantage of common polymorphisms to assess a person’s genetic risk of disease, and when used with other clinical variables, has added value in anticipating tumor aggressiveness and escaping unnecessary biopsy [78]. Current literature on PRS in PCa are scarce compared to the vast number of reported SNPs with variable levels of performance, limiting application in clinical practice. Most large scale models are also constructed in males of mainly Caucasian and European ancestry, resulting in lower predictive power when replicated in Asian cohorts likely due to non-shared genomic loci [4]. As such, ethnic-specific PRS is required to account for genetic variations. In this study, we developed and validated a PRS model predicting clinically significant PCa (csPCa) in a Korean population.

MATERIALS AND METHODS

1. Study population and genotyping

Participants of this study include PCa patients who received radical prostatectomy (RP) and/or core transrectal ultrasound prostate biopsy at one of the four tertiary hospitals in the Republic of Korea (Seoul National University of Bundang Hospital, Seoul National University Hospital, Chungbuk National University Hospital, and the Catholic University of Korea, St. Vincent’s Hospital). All participants provided informed consent. This multi-center study was approved by the Institutional Review Board of Seoul National University Bundang Hospital (SNUBH) (approval number: B-1607/355-302). Gleason score (GS) of PCa was reviewed based on biopsy and/or RP specimens. For selection of csPCa cases, patients with GS 7 or higher were included for this study. Intact genomic DNA (200 ng) extracted from blood or saliva samples was genotyped using the Korea Biobank array (K-CHIP) following the manufacturer’s instructions. The control genotype and phenotype data were obtained from the Korean Genome and Epidemiology Study (KoGES) conducted by the Center for Genome Science, the Korea National Institute of Health (KNIH) [9]. The genotype data were produced using K-CHIP array. The control samples were further selected for analyses who are male, 60 years or older and had never been diagnosed with any cancer. More detailed information about the external validation cohort is available in a previously published article [9].

2. Quality control of genotype data and genotype imputation

All sample and marker quality control (QC) was performed using PLINK software (v1.90 beta) [10]. Samples were excluded based on the following criteria: i) low genotype call rate (<95%), ii) excessive heterogeneity, iii) genetic relatedness, and iv) sex inconsistencies. Markers i) that are not SNPs, ii) MAF <5%, iii) low call rate (<95%), or iv) significantly deviated from Hardy–Weinberg equilibrium (p<1.0×10-6) were excluded. Imputation of missing genotypes in the PCa cases were conducted using the Michigan Imputation Server which utilizes Eagle v2.4 for phasing and Minimac 4 with the 1000 Genomes Project Phase 3 as the reference genome panel for genotype imputation [1112]. Genotypeimputed SNPs were filtered with genotype quality (R2>0.8) for the further analyses. For control samples, the genotype data that had been imputed were provided by KNIH, after phasing using Eagle v2.3 and genotype imputation using IMPUTE 4 with 1000 Genomes project phase 3 and Korean reference genome (397 samples) as the reference panel [1112]. The imputed SNPs were filtered based on INFO score >0.8 and MAF >0.1. The imputed genotype data for cases and controls were merged after SNPs commonly found in both cases and controls were kept for the further analyses. The post-imputation QC was performed based on the same criteria applied in the pre-imputation QC. After sample and marker QC, the resulting 11,320 samples (3,013 cases and 8,307 controls) and 4,724,872 SNPs were included for the downstream analyses.

3. Statistical analysis

A total of 10,187 samples (2,702 cases and 7,485 controls) were used for discovery of PCa-associated SNPs and their summary statistics. We performed association tests between csPCa and previously reported PCa-associated SNPs. Previously reported SNPs were extracted after literature review of GWAS studies on PCa. Logistic regression models were applied to test associations for each of those SNPs. Based on the association results, SNPs were filtered based on statistical significance of p<0.001. Linkage disequilibrium (LD)-clumping was performed to extract a set of lead SNPs within each LD block. Based on the association results, we obtained odds ratio (OR) of PCa-associated SNPs on the risk of PCa development. PRS was calculated as log(OR)-weighted sum of the number of risk alleles as following: β The individual PRS was computed upon the training set for internal validation and the test set for external validation. To test performance of PRS models in predicting risk of PCa development, the test set (311 cases and 822 controls) was used. The predictive performance of PRS was evaluated using the area under the ROC (receiver operating characteristics) curve (AUC) [13]. The AUC was compared according to the number of SNPs included for PRS models. Improvement in AUC between ROC curves was tested using Delong’s method [6]. The optimal PRS cutoff value was determined at the maximal Youden’s Index (J, sensitivity+specificity-1) value. For calculation of individual risk, we fitted the logistic regression model to obtain the regression coefficient of PRS on PCa risk and calculated the individual risk with OR = exp(βPRS*(PRS, where βPRS is the regression coefficient of PRS on PCa risk, PRS is the ith individual’s PRS, µPRS is the mean PRS of the controls.

RESULTS

The mean age of PCa cases and controls were 67.8 and 64.7 years, respectively. The mean BMI of cases and controls was 24.6 kg/m2 and 24.1 kg/m2, respectively. The mean prostate-specific antigen (PSA) level of PCa cases were 35.1 ng/mL with standard deviation of 209.4 ng/mL. Most of PCa cases had GS 7 (n=2,105), followed by GS 9 or higher (n=497) and GS 8 (n=411) (Table 1).
Table 1

Characteristics of the study population

VariableAll (n=11,320)Training set (n=10,187)Test set (n=1,133)
Case (n=3,013)Control (n=8,307)Case (n=2,702)Control (n=7,485)Case (n=311)Control (n=822)
Age (y)67.8±7.564.7±3.667.6±7.564.7±3.668.8±7.764.9±3.6
BMI (kg/m2)24.6±2.724.1±2.724.6±2.724.1±2.7NA24.2±2.8
PSA (ng/mL)35.1±209.4NA36.7±221.1NA21.3±31.0NA
Gleason score-
72,105-1,928-177-
8411-338-73-
≥9497-436-61-

Values are presented as mean±standard deviation or number only.

BMI, body mass index; PSA, prostate-specific antigen; NA, not available.

We identified 83 SNPs that were previously reported to be PCa-associated in one or more studies (Table 2). rs72725879 located within PRNCR1 was most significantly associated with PCa risk (OR, 0.597; 95% confidence interval [CI], 0.555–0.641; p=4.3×10-45). Other 38 SNPs also showed negative associations between minor allele and PCa risk. Of the 83 SNPs, 23 SNPs were located within 8q24.21 region, spanning PRNCR1, CASC8, PCAT2, PCAT1, LOC105375751, and CCAT2, 8 SNPs were located within HNF1B (17q12) and 5 SNPs were located within 11q13.3. After LD clumping, 31 SNPs located within 26 distinct genomic loci had been extracted as candidate SNPs for PRS construction (Table 3).
Table 2

SNPs associated with development of PCa

SNPCHRBPMinor alleleOR (95% CI)p-valueLocus
rs727258798128103969C0.597 (0.555–0.641)4.30E-45 PRNCR1
rs42423848128518554C1.761 (1.628–1.906)4.70E-45 8q24.21
rs78430318128533473T1.790 (1.650–1.942)1.10E-44 8q24.21
rs78376888128539360T1.784 (1.645–1.935)3.50E-44 8q24.21
rs14563158128103937C0.584 (0.541–0.630)8.10E-44 PRNCR1
rs45825248128528435G1.748 (1.613–1.894)2.30E-42 8q24.21
rs100901548128532137T1.703 (1.570–1.846)5.10E-38 8q24.21
rs119862208128531689A1.690 (1.559–1.832)4.60E-37 8q24.21
rs132547388128104343A0.633 (0.590–0.680)4.60E-36 PRNCR1
rs14472958128485038A1.598 (1.476–1.730)4.30E-31 CASC8
rs560052458128113426T1.435 (1.342–1.533)1.80E-26 8q24.21
rs10163438128093297T1.417 (1.328–1.512)7.00E-26 PRNCR1/PCAT2
rs126823448128106784G1.441 (1.342–1.547)6.60E-24 8q24.21
rs69835618128106880C1.439 (1.341–1.545)8.80E-24 8q24.21
rs169019798128124916A1.413 (1.317–1.516)6.30E-22 8q24.21
rs105054838128125195T1.413 (1.317–1.516)6.90E-22 8q24.21
rs112637631736103565G0.700 (0.650–0.753)1.00E-21 HNF1B
rs80644541736101586A0.700 (0.651–0.753)1.00E-21 HNF1B
rs116510521736102381A0.702 (0.652–0.755)1.60E-21 HNF1B
rs75019391736101156T0.704 (0.654–0.757)5.10E-21 HNF1B
rs132522988128095156G0.712 (0.662–0.765)2.80E-20 PRNCR1/PCAT2
rs48710098128108416T1.367 (1.277–1.465)4.40E-19 8q24.21
rs44307961736098040G0.743 (0.692–0.797)9.40E-17 HNF1B
rs20057051736096300A0.746 (0.695–0.802)1.20E-15 HNF1B
rs3393316117210052C0.775 (0.724–0.829)1.40E-13 RFX6
rs3393516117200434A0.778 (0.727–0.832)2.80E-13 RFX6
rs1512268823526463T1.276 (1.195–1.363)3.80E-13 LOC107986930
rs1160267823529521G1.272 (1.191–1.358)7.90E-13 LOC107986930
rs125497618128540776G0.762 (0.706–0.823)3.20E-12 8q24.21
rs116497431736074979A0.781 (0.729–0.838)4.00E-12 HNF1B/LOC105371754
rs10503733823534018T1.246 (1.167–1.331)6.00E-11 LOC107986930
rs11125927262752975G1.247 (1.165–1.336)2.30E-10 2p15
rs14078391710122834482T0.748 (0.683–0.818)2.50E-10 LOC105378521
rs7821330823522452C1.228 (1.150–1.310)6.30E-10 LOC107986930
rs74894091373716861C1.213 (1.139–1.292)1.50E-9 13q22.1
rs108964491168994667G1.404 (1.253–1.574)5.30E-9 11q13.3
rs74633268128027954A0.788 (0.723–0.858)4.40E-8 PCAT1/LOC105375751
rs79299621168985583T1.375 (1.226–1.543)5.70E-8 11q13.3
rs3765923641169011693T1.383 (1.229–1.557)7.60E-8 11q13.3
rs100869088128011937C0.796 (0.732–0.867)1.30E-7 LOC105375751
rs80237931566942093C0.837 (0.783–0.895)1.50E-7 LINC01169
rs122706411169012244A1.374 (1.219–1.549)2.10E-7 11q13.3
rs26591241951354597A0.845 (0.792–0.902)4.30E-7 LOC105372441
rs58235267263277843C0.843 (0.788–0.903)8.90E-7 OTX1
rs109939941051549496T1.161 (1.091–1.235)2.40E-6 MSMB
rs112285831169009114T1.348 (1.190–1.527)2.60E-6 11q13.3
rs771675342173319930T0.833 (0.771–0.899)3.30E-6 ITGA6
rs96000791373728139T1.158 (1.087–1.233)5.00E-6 13q22.1
rs7591218243637998G0.850 (0.791–0.912)6.60E-6 THADA
rs27358391951364623A0.862 (0.808–0.920)7.30E-6 19q13.33
rs1983891641536427T1.155 (1.083–1.233)1.30E-5 FOXP4
rs4714485641536587G1.154 (1.081–1.231)1.50E-5 FOXP4
rs224265251280028A0.835 (0.769–0.906)1.70E-5 TERT
rs118175441080236999A0.837 (0.772–0.909)2.00E-5 10q22.3
rs126212782173311553G0.849 (0.787–0.916)2.30E-5 ITGA6
rs1466184432173309803T0.851 (0.789–0.918)3.10E-5 ITGA6
rs1038822243738173C0.863 (0.804–0.926)4.40E-5 THADA
rs75204040262780325G1.141 (1.070–1.215)5.00E-5 2p15
rs105054778128407443A1.139 (1.069–1.213)5.60E-5 CASC8
rs69832678128413305G1.138 (1.068–1.212)6.00E-5 CASC8/CCAT2
rs8178729110144887C1.143 (1.070–1.221)7.20E-5 LOC107987111, RAD23B
rs1465618243553949C0.866 (0.806–0.930)8.00E-5 THADA
rs561593481176267331G0.836 (0.764–0.915)1.00E-4 11q13.5, EMSY
rs746344571566835704G1.202 (1.095–1.320)1.10E-4 ZWILCH
rs45548251080244623T0.853 (0.787–0.925)1.20E-4 10q22.3
rs66605381163295678C0.878 (0.821–0.940)1.70E-4 NUF2
rs225200410122844709A0.872 (0.812–0.937)2.00E-4 10q26.12
rs4711748643694598T1.125 (1.057–1.197)2.10E-4 POLR1C, RP1-261G23.5
rs26590511951345568C0.882 (0.825–0.943)2.20E-4 LOC105372441
rs22387762219757892A0.889 (0.835–0.947)2.40E-4 TBX1
rs10807290643710381T0.889 (0.835–0.947)2.40E-4 POLR1C
rs6955627792577760T0.879 (0.820–0.942)2.60E-4 7q21.2
rs2660753387110674T1.135 (1.060–1.216)2.90E-4 3p12.1
rs17023900387134800G1.163 (1.070–1.264)3.80E-4 3p12.1
rs12831043106962521G1.119 (1.051–1.192)4.70E-4 DUBR
rs71536481461122526C1.149 (1.062–1.244)5.60E-4 14q23.1
rs9284813387152169G1.130 (1.054–1.211)6.10E-4 LINC00506
rs31106411736047417A1.124 (1.051–1.202)6.90E-4 HNF1B/LOC105371755
rs143745027387144017G1.155 (1.062–1.256)7.80E-4 LINC00506
rs80056211461106699G1.160 (1.064–1.266)7.90E-4 SALRNA1
rs12500426495514609A1.113 (1.045–1.185)8.00E-4 PDLIM5
rs6545977263301164A0.881 (0.817–0.949)8.50E-4 2p15
rs561035031154980351C0.849 (0.771–0.936)9.50E-4 ZBTB7B

SNP, single nucleotide polymorphism; PCa, prostate cancer; CHR, chromosome; BP, base pairs; OR, odds ratio; CI, confidence interval.

Table 3

Candidate SNPs for PRS construction

SNPCHRBP>Minor alleleβ (95% CI)p-valueSNPs in LD blockLocus
rs727258798128103969C-0.517 (-0.589 to -0.445)4.30E-45rs1016343, rs13252298, rs1456315, rs13254738, rs12682344, rs6983561, rs4871009, rs56005245, rs16901979, rs10505483 PRNCR1
rs42423848128518554C0.566 (0.487 to 0.645)4.70E-45rs1447295, rs4582524, rs11986220, rs10090154, rs7843031, rs7837688, rs12549761 8q24.21
rs112637631736103565G-0.357 (-0.431 to -0.284)1.00E-21rs3110641, rs11649743, rs2005705, rs4430796, rs7501939, rs8064454, rs11651052 HNF1B
rs3393316117210052C-0.255 (-0.323 to -0.188)1.40E-13rs339351 RFX6
rs1512268823526463T0.244 (0.178 to 0.310)3.80E-13rs7821330, rs1160267, rs10503733 LOC107986930
rs11125927262752975G0.221 (0.153 to 0.290)2.30E-10rs75204040 2p15
rs14078391710122834482T-0.291 (-0.381 to -0.201)2.50E-10rs2252004 LOC105378521
rs74894091373716861C0.193 (0.130 to 0.256)1.50E-9rs9600079 13q22.1
rs108964491168994667G0.339 (0.226 to 0.454)5.30E-9rs7929962, rs11228583, rs376592364, rs12270641 11q13.3
rs74633268128027954A-0.239 (-0.324 to -0.153)4.40E-8rs10086908 PCAT1/LOC105375751
rs80237931566942093C-0.178 (-0.244 to -0.111)1.50E-7rs74634457 LINC01169
rs26591241951354597A-0.168 (-0.233 to -0.103)4.30E-7rs2659051, rs2735839 LOC105372441
rs58235267263277843C-0.171 (-0.239 to -0.103)8.90E-7rs6545977 OTX1
rs109939941051549496T0.149 (0.087 to 0.211)2.40E-6- MSMB
rs771675342173319930T-0.183 (-0.260 to -0.106)3.30E-6rs146618443, rs12621278 ITGA6
rs7591218243637998G-0.163 (-0.234 to -0.092)6.60E-6rs1465618, rs1038822 THADA
rs1983891641536427T0.144 (0.080 to 0.209)1.30E-5rs4714485 FOXP4
rs224265251280028A-0.180 (-0.262 to -0.098)1.70E-5- TERT
rs118175441080236999A-0.177 (-0.259 to -0.096)2.00E-5rs4554825 10q22.3
rs105054778128407443A0.130 (0.067 to 0.193)5.60E-5rs6983267 CASC8
rs8178729110144887C0.134 (0.068 to 0.200)7.20E-5- LOC107987111, RAD23B
rs561593481176267331G-0.179 (-0.269 to -0.089)1.00E-4- 11q13.5, EMSY
rs66605381163295678C-0.130 (-0.197 to -0.062)1.70E-4- NUF2
rs4711748643694598T0.118 (0.055 to 0.180)2.10E-4rs10807290 POLR1C, RP1-261G23.5
rs22387762219757892A-0.118 (-0.181 to -0.055)2.40E-4- TBX1
rs6955627792577760T-0.129 (-0.198 to -0.060)2.60E-4- 7q21.2
rs2660753387110674T0.127 (0.058 to 0.196)2.90E-4rs17023900, rs143745027, rs9284813 3p12.1
rs12831043106962521G0.112 (0.050 to 0.176)4.70E-4- DUBR
rs71536481461122526C0.139 (0.060 to 0.218)5.60E-4rs8005621 14q23.1
rs12500426495514609A0.107 (0.044 to 0.170)8.00E-4- PDLIM5
rs561035031154980351C-0.164 (-0.261 to -0.067)9.50E-4- ZBTB7B

SNP, single nucleotide polymorphism; PRS, polygenic risk score; CHR, chromosome; BP, base pairs; CI, confidence interval; LD, linkage disequilibrium.

We calculated individual PRS in the test set (311 cases and 822 controls) and tested the performance in predicting PCa risk (Fig. 1). PRS models composed of top 29 SNPs, top 26 SNPs and top 28 SNPs showed comparably superior performance compared to other models (Table 4). The AUC of the prediction model of PRS composed of 29 SNPs was estimated to be 0.700 (95% CI, 0.667–0.734), with a sensitivity and specificity of 0.672 and 0.662, respectively (Fig. 2A).
Fig. 1

Distribution of PRS in cases and controls. Mean PRS in controls: pink dashed line, Mean PRS in cases: blue dashed line, and optimal PRS cutoff: black solid line. PRS, polygenic risk score.

Table 4

Predictive performance of PRS models for development of PCa

Number of top SNPs included for PRS calculationMean PRS of controlsMean PRS of casesAUC (95% CI)JSensitivitySpecificityImprovement in AUC (p)
29-0.0090.0000.700 (0.667–0.734)0.3340.6720.662-
26-0.014-0.0040.700 (0.667–0.734)0.3210.6590.6620 (0.993)
28-0.010-0.0010.700 (0.666–0.733)0.3200.6370.684-0.001 (0.726)
27-0.012-0.0030.698 (0.664–0.732)0.3230.6460.676-0.002 (0.411)
31-0.0080.0010.697 (0.664–0.731)0.3130.5980.715-0.003 (0.263)
30-0.0070.0020.697 (0.664–0.731)0.3140.6750.639-0.003 (0.115)
25-0.013-0.0030.696 (0.663–0.730)0.3370.6400.697-0.004 (0.350)
17-0.017-0.0040.692 (0.658–0.726)0.3100.6880.622-0.008 (0.292)
24-0.012-0.0010.692 (0.658–0.726)0.3150.6240.691-0.009 (0.065)
18-0.018-0.0050.690 (0.656–0.724)0.3190.6660.653-0.010 (0.173)
20-0.015-0.0030.690 (0.656–0.724)0.3310.6560.675-0.011 (0.106)
22-0.013-0.0020.689 (0.655–0.723)0.3150.6750.640-0.012 (0.035)
21-0.012-0.0010.688 (0.654–0.722)0.3110.6300.681-0.012 (0.420)
13-0.024-0.0080.687 (0.654–0.721)0.3050.5880.717-0.013 (0.168)
16-0.021-0.0070.687 (0.653–0.722)0.2980.5690.729-0.013 (0.110)
23-0.014-0.0040.687 (0.653–0.721)0.3120.6240.689-0.013 (0.013)
19-0.019-0.0070.687 (0.653–0.721)0.3160.6750.641-0.013 (0.059)
12-0.022-0.0040.684 (0.650–0.718)0.3090.6430.665-0.016 (0.094)
15-0.019-0.0050.684 (0.650–0.718)0.2900.5400.749-0.016 (0.060)
14-0.018-0.0030.683 (0.649–0.718)0.2930.5720.720-0.017 (0.061)
11-0.0180.0000.683 (0.649–0.717)0.2920.6560.636-0.018 (0.090)
10-0.0130.0060.680 (0.646–0.714)0.2840.6210.663-0.020 (0.064)
8-0.0140.0090.675 (0.641–0.709)0.2830.6370.646-0.026 (0.030)
5-0.040-0.0060.675 (0.640–0.709)0.2600.6530.607-0.026 (0.060)
7-0.028-0.0020.674 (0.639–0.708)0.2640.7780.485-0.027 (0.027)
6-0.0240.0050.672 (0.638–0.707)0.2700.5950.675-0.028 (0.029)
4-0.070-0.0310.670 (0.636–0.705)0.2850.6750.609-0.030 (0.042)
9-0.0100.0110.669 (0.635–0.704)0.2730.6660.607-0.031 (0.006)

PRS, polygenic risk score; PCa, prostate cancer; SNP, single nucleotide polymorphism; AUC, area under the receiver operating characteristics curve.

Fig. 2

(A) ROC curve of PRS models in prediction of PCa risk. (B) Distribution of PRS in cases and controls with two cutoff values to categorize PRS risk groups into moderate, high and very-high risk group. (C) Risk of development of PCa by PRS risk group. AUC, area under the receiver operating characteristics curve; OR, odds ratio; ROC, receiver operating characteristic; PRS, polygenic risk score; PCa, prostate cancer.

From the best performing PRS model composed of top 29 SNPs, we chose two different cutoff values, i) the optimal PRS value at the maximum Youden’s index (-0.05) and ii) 95th percentile PRS cutoff (0.02) (Fig. 2B). We defined those with PRS below the optimal cutoff, those with PRS above 95th percentile cutoff and those with PRS between the two cutoff values as moderate, very-high and high-risk group, respectively. High PRS and very-high PRS group showed 3.80-fold and 5.93-fold increased risk, respectively, for PCa, compared to the moderate PRS group (the reference group) (Fig. 2C). The very-high PRS group had a 4.92-fold increased risk compared to the remaining population.

DISCUSSION

Based on the OR estimates of each SNP in PCa specific to a Korean population, we developed and evaluated PRS in Korean males. Of the 83 SNPs identified with significant association, 31 SNPs were selected for PRS construction. The best performing PRS composed of 29 SNPs reported an AUC of 0.700, with the top 5th percentile having more than a 4-fold increased risk of Grade Group (GG) ≥2 PCa. To note, minor alleles of many SNPs included in this study showed protective effects on PCa risk, which highlights that common risk alleles may have a cumulative impact in the development of PCa. With trends of increasing prevalence worldwide, familial PCa and heritability is thoroughly researched for potential identification of actionable mutations. Hereditary PCa is caused by inherited genes of high penetrance, including homologous DNA pair genes (e.g., BRCA1/2, ATM, CHEK2, PALB2) and mismatch repair genes (e.g., MLH1, MSH2, MSH6, PMS2), the latter related to Lynch syndrome. While such nominal mutations are easily identified and linked to PCa risk, not all familial PCa can be explained by singular mutations, but rather is accountable by the sum of multiple common variants with small effect size. PRS is a novel and only biomarker that uses the concept of polygenic risk as a single combinatory value estimate of the individual’s genetic propensity of a complex disease to potentially stratify high- and low-risk patients in order to apply different methods of screening and prevention. While the exact number required to achieve the full potential of PRS is unknown, summary statistics based on sufficient data is required to detect rare but significant variants shared between phenotypes and approximate the populational burden of disease. PCa is reported to have variants with relatively larger effect size and hence has been suggested to require a smaller sample to assess heritability compared to other, more polygenic cancers. While falling short of over 60,000 cases required to explain 80% of heritability as suggested by Zhang et al. (2020) [14], one of the largest cohorts of over 2,700 males in biopsy and RP-proven GS ≥7 PCa was utilized, compared to previous Korean [15] and Japanese [16] PRSs developed from 1,001 and 689 cases, respectively. Selection of appropriate SNPs for inclusion critically influences PRS performance, due to obvious uncertainties inherent in estimating effect size from a potentially biased population. As such, statistical techniques must be applied prior to PRS development, a process necessary to weed out insignificant variants and to avoid overfitting or under-fitting of risk alleles. p-value thresholds are often implemented as a cut-off, and while p<1×10-5 is most commonly used, it is often an arbitrary value. A strict, parsimonious threshold risks the loss of otherwise powerful variants confounded by nearby SNPs resulting in false negatives, whereas an overly flexible threshold risks inclusion less predictive alleles as well as a resulting SNP combination too large to be feasible in actual practice. Hence, to optimize SNP selection, a relative lenient p-value threshold of p<10-3 was applied to filter a relatively large set of potential candidates, followed by LD clumping to extract lead SNPs and quality control via imputation for missing genotypes as well as SNP combination comparisons, a similar approach described in our previous report [15]. To additionally bolster performance, our study selectively used previously reported, well-established PCa-susceptible SNPs, many found in the 8q24.21 region. 8q24 is commonly considered a “desert” region due to scarcity of genes, but has been the source of numerous PCa susceptible variants, contributing to 8% to 9% of two-fold increase in familial risk [17]. A proto-oncogene Myc is closest, and while polymorphisms in this region have been associated with regulatory enhancers and androgen receptor (AR) response, mechanisms of action are not yet well understood. Nonetheless, strong associations have been made in both European and East Asian populations [1819], with rs7837688 and rs1016343 more common in Korean males [15], a finding comparable in this study as 4th and 12th in statistical significance. Eleven candidate SNPs were associated with PRNCR1 in the 8q24 region, whose upregulation is implicated in aggressive PCa via AR-mediated transcriptional programs [20]. Thirteen were located in HNF1B (17q12) and 11q13.3, both well-documented as susceptible loci in males of European and Asian descent [2122]. The ultimate goal of this research was to evaluate whether PRS can be utilized in actual clinical practice to avoid unnecessary intervention in low-risk PCa. Therefore, we selected patients with biopsy or pathology-proven csPCa with ≥GG2, excluding males under the age of 60. While generally preferred in the Korean population over active surveillance (AS) or watchful waiting, early intervention and surgery in low-grade PCa risks overtreatment and economic burden. As such, imaging modalities and biomarkers have been introduced to focus biopsy and treatment in csPCa, which generally includes any of histopathology ISUP grade ≥2 (≥GG2) and/or tumor volume ≥0.5 cc [23]. Our study, selecting only males of intermediate- to high-risk PCa category, showed performance comparable to previous literature. Seibert et al. (2018) [7], defining aggressive PCa as any cancer not eligible for surveillance based on the NCCN guidelines (i.e., any ≥GG2, stage T3–4, PSA ≥10 ng/mL, and any nodal or distant metastasis), developed a PRS from 54 SNPs to predict early age at onset of aggressive PCa (z=11.2, p<10-16), with males in the 98th percentile having a 2.9-fold increased risk. Stockholm3 combined a PRS generated from 232 SNPs with 6 plasma proteins and other clinical variables to predict ≥GG2 PCa, outperforming PSA alone with a resulting AUC of 0.74 (95% CI, 0.72–0.75) [8]. Takata et al. (2019) [24] published results from a large Japanese cohort with a development set of 5,088 PCa cases from Biobank Japan and 10,682 controls from multiple institutions, but 1,806 (35.5%) patients had missing GS, and 1,293 (25.4%) had GS ≤6. Our effect size estimates calculated from csPCa better represent genetic polymorphisms that require intervention in actual practice. Generalizability is key to the performance of a PRS, and population bias from a limited cohort poses a unique challenge in genomic prediction. Previously reported models are generated from cohorts of largely European ancestry and hence often fail to have the same predictive power in Asian males, primarily because unshared genomic loci, as well as variable population-specific effect size. A meta-analysis of multiancestry GWAS data found East Asians to have a 0.73-fold lower risk score compared to their European counterparts, highlighting the probability of potent germline variations among ethnic populations [25]. As such, accurate prediction of PCa risk requires tailoring to each race and ethnicity. Also, with increasing prevalence, even a modest performance is able to substantially stratify absolute populational risk, allowing application into clinical practice of early detection and prevention. AUC of 0.700 in our study is one of the highest discriminatory performances of PRS developed in an Asian cohort to date [4], compared to a previous report by our team conferring an AUC of 0.605 for any PCa utilizing 5 SNPs [15] and those developed in Japanese and Chinese populations with AUCs of 0.60 and 0.659, respectively [1626]. The 4.92-fold increased risk of csPCa in the 95th percentile of our study is comparable to results validated in western counterparts with a 4 to 5-fold increased risk in the 99th percentile [1827]. Despite successful replication of significant PCa susceptible polymorphisms and outperforming some previous models [15162829], our study is not without limitations. First, although we utilized data from 2,702 males with csPCa, it is still a relatively small sample size to accurately represent true populational variation. Second, because both development and validation cohorts consisted of Koreans, future evaluation in other race and ethnicities are required to fully assess predictive performance and generalizability. Due to the inherent selection bias in design, not all significant SNPs may be adequately represented in our data. In addition, while we limited the study population to csPCa, applying our model to males with any PCa at any age may further PRS utility, as both patients and clinicians can have additive information to guide screening schedules and opt for early intervention in AS-eligible males with high genetic risk. Nonetheless, this study provides valuable evidence for prediction of csPCa based on the cumulative effects of small genetic variants specific to Asian populations. PRS calculated from individual GWAS data is the hallmark of modern precision medicine with the possibility of predicting a patient’s lifetime trajectory for disease, offering the chance for early screening and personalized treatment, as in the case of coronary heart disease [30] where genetics is making great strides in real-world practice. Future acquisition of more sample summary statistics will further potentiate PRS performance in PCa and produce more accurate stratification of risk groups.

CONCLUSIONS

We successfully developed and validated PRS models for csPCa risk developed in a large sample of 2,702 ≥GS7 PCa cases and 7,485 healthy controls. A model utilizing the top 29 SNPs conferred the best predictive performance with an AUC 0.700, with over 4-fold risk predicted in males in above the 95th percentile of PRS. Future prospective, large-scale studies are required to further validate our results.
  30 in total

1.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

Review 2.  8q24 rs4242382 polymorphism is a risk factor for prostate cancer among multi-ethnic populations: evidence from clinical detection in China and a meta-analysis.

Authors:  Cheng-Xiao Zhao; Ming Liu; Yong Xu; Kuo Yang; Dong Wei; Xiao-Hong Shi; Fan Yang; Yao-Guang Zhang; Xin Wang; Si-Ying Liang; Fan Zhao; Yu-Rong Zhang; Na-Na Wang; Xin Chen; Liang Sun; Xiao-Quan Zhu; Hui-Ping Yuan; Ling Zhu; Yi-Ge Yang; Lei Tang; Hai-Yan Jiao; Zheng-Hao Huo; Jian-Ye Wang; Ze Yang
Journal:  Asian Pac J Cancer Prev       Date:  2014

Review 3.  Bringing Prostate Cancer Germline Genetics into Clinical Practice.

Authors:  Sanjay Das; Simpa S Salami; Daniel E Spratt; Samuel D Kaffenberger; Michelle F Jacobs; Todd M Morgan
Journal:  J Urol       Date:  2019-07-08       Impact factor: 7.450

4.  Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.

Authors:  Hyuna Sung; Jacques Ferlay; Rebecca L Siegel; Mathieu Laversanne; Isabelle Soerjomataram; Ahmedin Jemal; Freddie Bray
Journal:  CA Cancer J Clin       Date:  2021-02-04       Impact factor: 508.702

5.  Two independent prostate cancer risk-associated Loci at 11q13.

Authors:  S Lilly Zheng; Victoria L Stevens; Fredrik Wiklund; Sarah D Isaacs; Jielin Sun; Shelly Smith; Kristen Pruett; Kathleen E Wiley; Seong-Tae Kim; Yi Zhu; Zheng Zhang; Fang-Chi Hsu; Aubrey R Turner; Jan-Erik Johansson; Wennuan Liu; Jin Woo Kim; Bao-Li Chang; David Duggan; John Carpten; Carmen Rodriguez; William Isaacs; Henrik Grönberg; Jianfeng Xu
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2009-06       Impact factor: 4.254

6.  Association of six susceptibility Loci with prostate cancer in northern chinese men.

Authors:  Yu-Rong Zhang; Yong Xu; Kuo Yang; Ming Liu; Dong Wei; Yao-Guang Zhang; Xiao-Hong Shi; Jian-Ye Wang; Fan Yang; Xin Wang; Si-Ying Liang; Cheng-Xiao Zhao; Fei Wang; Xin Chen; Liang Sun; Xiao-Quan Zhu; Ling Zhu; Yi-Ge Yang; Lei Tang; Hai-Yan Jiao; Zheng-Hao Huo; Ze Yang
Journal:  Asian Pac J Cancer Prev       Date:  2012

7.  Association analyses of more than 140,000 men identify 63 new prostate cancer susceptibility loci.

Authors:  Fredrick R Schumacher; Ali Amin Al Olama; Sonja I Berndt; Sara Benlloch; Mahbubl Ahmed; Edward J Saunders; Tokhir Dadaev; Daniel Leongamornlert; Ezequiel Anokian; Clara Cieza-Borrella; Chee Goh; Mark N Brook; Xin Sheng; Laura Fachal; Joe Dennis; Jonathan Tyrer; Kenneth Muir; Artitaya Lophatananon; Victoria L Stevens; Susan M Gapstur; Brian D Carter; Catherine M Tangen; Phyllis J Goodman; Ian M Thompson; Jyotsna Batra; Suzanne Chambers; Leire Moya; Judith Clements; Lisa Horvath; Wayne Tilley; Gail P Risbridger; Henrik Gronberg; Markus Aly; Tobias Nordström; Paul Pharoah; Nora Pashayan; Johanna Schleutker; Teuvo L J Tammela; Csilla Sipeky; Anssi Auvinen; Demetrius Albanes; Stephanie Weinstein; Alicja Wolk; Niclas Håkansson; Catharine M L West; Alison M Dunning; Neil Burnet; Lorelei A Mucci; Edward Giovannucci; Gerald L Andriole; Olivier Cussenot; Géraldine Cancel-Tassin; Stella Koutros; Laura E Beane Freeman; Karina Dalsgaard Sorensen; Torben Falck Orntoft; Michael Borre; Lovise Maehle; Eli Marie Grindedal; David E Neal; Jenny L Donovan; Freddie C Hamdy; Richard M Martin; Ruth C Travis; Tim J Key; Robert J Hamilton; Neil E Fleshner; Antonio Finelli; Sue Ann Ingles; Mariana C Stern; Barry S Rosenstein; Sarah L Kerns; Harry Ostrer; Yong-Jie Lu; Hong-Wei Zhang; Ninghan Feng; Xueying Mao; Xin Guo; Guomin Wang; Zan Sun; Graham G Giles; Melissa C Southey; Robert J MacInnis; Liesel M FitzGerald; Adam S Kibel; Bettina F Drake; Ana Vega; Antonio Gómez-Caamaño; Robert Szulkin; Martin Eklund; Manolis Kogevinas; Javier Llorca; Gemma Castaño-Vinyals; Kathryn L Penney; Meir Stampfer; Jong Y Park; Thomas A Sellers; Hui-Yi Lin; Janet L Stanford; Cezary Cybulski; Dominika Wokolorczyk; Jan Lubinski; Elaine A Ostrander; Milan S Geybels; Børge G Nordestgaard; Sune F Nielsen; Maren Weischer; Rasmus Bisbjerg; Martin Andreas Røder; Peter Iversen; Hermann Brenner; Katarina Cuk; Bernd Holleczek; Christiane Maier; Manuel Luedeke; Thomas Schnoeller; Jeri Kim; Christopher J Logothetis; Esther M John; Manuel R Teixeira; Paula Paulo; Marta Cardoso; Susan L Neuhausen; Linda Steele; Yuan Chun Ding; Kim De Ruyck; Gert De Meerleer; Piet Ost; Azad Razack; Jasmine Lim; Soo-Hwang Teo; Daniel W Lin; Lisa F Newcomb; Davor Lessel; Marija Gamulin; Tomislav Kulis; Radka Kaneva; Nawaid Usmani; Sandeep Singhal; Chavdar Slavov; Vanio Mitev; Matthew Parliament; Frank Claessens; Steven Joniau; Thomas Van den Broeck; Samantha Larkin; Paul A Townsend; Claire Aukim-Hastie; Manuela Gago-Dominguez; Jose Esteban Castelao; Maria Elena Martinez; Monique J Roobol; Guido Jenster; Ron H N van Schaik; Florence Menegaux; Thérèse Truong; Yves Akoli Koudou; Jianfeng Xu; Kay-Tee Khaw; Lisa Cannon-Albright; Hardev Pandha; Agnieszka Michael; Stephen N Thibodeau; Shannon K McDonnell; Daniel J Schaid; Sara Lindstrom; Constance Turman; Jing Ma; David J Hunter; Elio Riboli; Afshan Siddiq; Federico Canzian; Laurence N Kolonel; Loic Le Marchand; Robert N Hoover; Mitchell J Machiela; Zuxi Cui; Peter Kraft; Christopher I Amos; David V Conti; Douglas F Easton; Fredrik Wiklund; Stephen J Chanock; Brian E Henderson; Zsofia Kote-Jarai; Christopher A Haiman; Rosalind A Eeles
Journal:  Nat Genet       Date:  2018-06-11       Impact factor: 38.330

8.  Influence of age on predictiveness of genetic risk score for prostate cancer in a Chinese hospital-based biopsy cohort.

Authors:  Yao Zhu; Cheng-Tao Han; Hai-Tao Chen; Fang Liu; Gui-Ming Zhang; Wei-Yi Yang; Jian-Feng Xu; Ding-Wei Ye
Journal:  Oncotarget       Date:  2015-09-08

Review 9.  Epidemiology and genomics of prostate cancer in Asian men.

Authors:  Yao Zhu; Miao Mo; Yu Wei; Junlong Wu; Jian Pan; Stephen J Freedland; Ying Zheng; Dingwei Ye
Journal:  Nat Rev Urol       Date:  2021-03-10       Impact factor: 14.432

10.  Reference-based phasing using the Haplotype Reference Consortium panel.

Authors:  Po-Ru Loh; Petr Danecek; Pier Francesco Palamara; Christian Fuchsberger; Yakir A Reshef; Hilary K Finucane; Sebastian Schoenherr; Lukas Forer; Shane McCarthy; Goncalo R Abecasis; Richard Durbin; Alkes L Price
Journal:  Nat Genet       Date:  2016-10-03       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.