| Literature DB >> 34520132 |
Chong Wu1, Jingjing Zhu2, Austin King1, Xiaoran Tong3, Qing Lu4, Jong Y Park5, Liang Wang6, Guimin Gao7, Hong-Wen Deng8, Yaohua Yang9, Karen E Knudsen10, Timothy R Rebbeck11,12, Jirong Long9, Wei Zheng9, Wei Pan13, David V Conti14, Christopher A Haiman14, Lang Wu2.
Abstract
BACKGROUND: DNA methylation and gene expression are known to play important roles in the etiology of human diseases such as prostate cancer (PCa). However, it has not yet been possible to incorporate information of DNA methylation and gene expression into polygenic risk scores (PRSs). Here, we aimed to develop and validate an improved PRS for PCa risk by incorporating genetically predicted gene expression and DNA methylation, and other genomic information using an integrative method.Entities:
Keywords: integrative models; polygenic risk scores; predicted DNA methylation; predicted gene expression; prostate cancer; risk prediction
Mesh:
Year: 2021 PMID: 34520132 PMCID: PMC8696216 DOI: 10.1002/cac2.12205
Source DB: PubMed Journal: Cancer Commun (Lond) ISSN: 2523-3548
FIGURE 1Study design and workflow. Multiple sets of genome‐wide polygenic risk scores (PRSs) were derived by combining summary association statistics from association studies using data of the PRACTICAL consortium and a reference panel of 45,216 males in the UK Biobank Phase I dataset. Candidate PRSs were derived using six strategies: 1) pruning and thresholding (P + T)– aggregation of independent polymorphisms that exceed a specified level of significance in the discovery genome‐wide association study (GWAS) (24 candidates); 2) LDpred computational algorithm, a Bayesian approach to calculate a posterior mean effect for all variants based on a prior (effect size in the prior GWAS) and subsequent shrinkage based on linkage disequilibrium (8 candidates); 3) AnnoPred (6 candidates); 4) LDpredfun (1 score); 5) EBPRS (1 score); and 6) revised P + T approach incorporating predicted gene expression (55 candidates for blood and 55 scores for prostate tissue) and DNA methylation (55 candidates for blood). For each of the above categories, the optimal PRS was chosen based on the area under the receiver‐operator curve (AUC) in the UK Biobank tuning dataset (1,458 prevalent cases and 1,467 controls). We then derived the integrative model combining information from constructed scores (1,467 prevalent cases and 1,458 controls). We subsequently tested the model performance in an independent UK Biobank testing dataset (4,832 incident cases and 142,869 controls)
FIGURE 2Polygenic risk score assessment with incident cases. (A) Receiver operator characteristic curves and C statistics for different models in the independent testing dataset of 147,701 participants with 4,832 incident prostate cancer events. (B) The cumulative absolute risk of developing prostate cancer by quantiles of the overall polygenic score. The absolute risk was calculated based on UK incidence and mortality data and using the PRS relative risks estimated as described in the Material and Methods. The shaded part is 95% confidence interval. (C) The absolute risk of prostate cancer according to 100 groups of the testing cohort binned according to the percentile of the integrative polygenic risk score
Net reclassification improvement (NRI) of the developed polygenic risk score and family history in predicting the risk of PCa
| Population | No. of subjects | Five‐year risk | Ten‐year risk | ||
|---|---|---|---|---|---|
| NRI for PRS (95% CI) | NRI for family history (95% CI) | NRI for PRS (95% CI) | NRI for family history (95% CI) | ||
| PCa cases | 4,832 | 0.294 (0.283 to 0.324) | −0.711 (−0.731 to −0.686) | 0.266 (0.234 to 0.277) | −0.717 (−0.731 to −0.659) |
| Non‐cases | 142,869 | 0.419 (0.411 to 0.435) | 0.84 (0.837 to 0.841) | 0.423 (0.400 to 0.434) | 0.842 (0.824 to 0.844) |
| Full population | 147,701 | 0.713 (0.697 to 0.756) | 0.129 (0.109 to 0.151) | 0.690 (0.649 to 0.705) | 0.125 (0.113 to 0.165) |
The NRI (in continuous case) of 5‐ and 10‐year risk was calculated by adding the PRS or family history to the baseline model.
PCa, prostate cancer; CI, confidence interval.