| Literature DB >> 32764609 |
Lang Wu1, Yaohua Yang2, Xingyi Guo2, Xiao-Ou Shu2, Qiuyin Cai2, Xiang Shu2, Bingshan Li3,4, Ran Tao4,5, Chong Wu6, Jason B Nikas7, Yanfa Sun8,9, Jingjing Zhu8, Monique J Roobol10, Graham G Giles11,12, Hermann Brenner13,14,15, Esther M John16, Judith Clements17,18, Eli Marie Grindedal19, Jong Y Park20, Janet L Stanford21,22, Zsofia Kote-Jarai23, Christopher A Haiman24, Rosalind A Eeles23, Wei Zheng2, Jirong Long25.
Abstract
It remains elusive whether some of the associations identified in genome-wide association studies of prostate cancer (PrCa) may be due to regulatory effects of genetic variants on CpG sites, which may further influence expression of PrCa target genes. To search for CpG sites associated with PrCa risk, here we establish genetic models to predict methylation (N = 1,595) and conduct association analyses with PrCa risk (79,194 cases and 61,112 controls). We identify 759 CpG sites showing an association, including 15 located at novel loci. Among those 759 CpG sites, methylation of 42 is associated with expression of 28 adjacent genes. Among 22 genes, 18 show an association with PrCa risk. Overall, 25 CpG sites show consistent association directions for the methylation-gene expression-PrCa pathway. We identify DNA methylation biomarkers associated with PrCa, and our findings suggest that specific CpG sites may influence PrCa via regulating expression of candidate PrCa target genes.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32764609 PMCID: PMC7413371 DOI: 10.1038/s41467-020-17673-9
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fifteen novel methylation-prostate cancer associations for CpG sites located at genomic loci at least 500 kb away from any known prostate cancer risk varianta.
| CpG site | Chr | Position (build37) | Classification | OR (95% CI)c | risk SNP | Distance to the risk SNP (kb) | |||
|---|---|---|---|---|---|---|---|---|---|
| cg18800143 | 1 | 16393791 | Intronic | 0.10 | 1.12 (1.07–1.17) | 7.56 × 10−8 | rs636291 | 5837.7 | |
| cg07645299 | 2 | 63991864 | Intergenic | 0.01 | 1.49 (1.30–1.71) | 1.58 × 10−8 | rs58235267 | 714.0 | 0.80 |
| cg12627844 | 2 | 64245000 | Intronic | 0.03 | 1.38 (1.28–1.50) | 1.98 × 10−15 | rs58235267 | 967.2 | 0.61 |
| cg16397176 | 5 | 110899314 | ncRNA_intronic | 0.05 | 1.15 (1.09–1.22) | 6.42 × 10−7 | rs10793821 | 22936.9 | |
| cg11562153 | 6 | 28493500 | Upstream | 0.04 | 1.22 (1.13–1.31) | 1.57 × 10−7 | rs7767188 | 1580.3 | 1.56 × 10−4 |
| cg13866093 | 6 | 28502727 | UTR3 | 0.05 | 1.14 (1.09–1.20) | 2.09 × 10−7 | rs7767188 | 1571.0 | 3.26 × 10−5 |
| cg24388424 | 6 | 28565403 | Intronic | 0.01 | 0.78 (0.71–0.86) | 3.31 × 10−7 | rs7767188 | 1508.4 | 1.08 × 10−5 |
| cg00444740 | 8 | 129162178 | Upstream | 0.02 | 1.21 (1.13–1.30) | 1.55 × 10−7 | rs7837688 | 622.8 | 1.01 × 10−3 |
| cg06836406 | 9 | 130461544 | Intergenic | 0.02 | 0.79 (0.72−0.86) | 3.55 × 10−7 | rs1182 | 2114.5 | |
| cg20100049 | 11 | 67979188 | Intronic | 0.02 | 1.30 (1.22–1.39) | 2.79 × 10−15 | rs11228565 | 999.4 | 2.44 × 10−4 |
| cg22370235 | 11 | 68451852 | Upstream | 0.02 | 1.29 (1.17–1.41) | 1.50 × 10−7 | rs11228565 | 526.7 | 0.37 |
| cg04739953 | 11 | 68451858 | Upstream | 0.01 | 1.62 (1.41–1.87) | 2.06 × 10−11 | rs11228565 | 526.7 | 0.15 |
| cg01715842 | 16 | 85045600 | Upstream | 0.47 | 1.05 (1.03–1.07) | 2.95 × 10−7 | rs199737822 | 2866.7 | NA |
| cg13230424 | 17 | 45930033 | Intronic | 0.05 | 0.87 (0.82–0.91) | 3.16 × 10−7 | rs138213197 | 875.7 | |
| cg23397578 | 19 | 37742925 | ncRNA_exonic | 0.01 | 1.40 (1.24–1.57) | 1.81 × 10−8 | rs8102476 | 992.7 | 1.57 × 10−3 |
NA not available. Bold values represent that these association p values remain largely unchanged after adjusting for risk SNP.
aRisk SNPs identified in previous GWAS or fine-mapping studies.
bR2: model prediction performance (R2) derived using FHS data.
cOR (odds ratio) and CI (confidence interval) per one standard deviation increase in genetically predicted DNA methylation.
dP value: derived from association analyses of 79,194 cases and 61,112 controls (two-sided); associations with P ≤ 6.47 × 10−7 based on Bonferroni correction of 77,243 tests (0.05/77,243) are shown.
eUsing COJO method.
Fig. 1A Manhattan plot of the association results from the prostate cancer methylome-wide association study using S-PrediXcan.
The red line represents P = 6.47 × 10−7 (Bonferroni correction of 77,243 tests (0.05/77,243)). Each dot represents the genetically predicted DNA methylation of one specific CpG site. The x axis represents the genomic position of the corresponding CpG site, and the y axis represents the negative logarithm of the association P value. CpG sites at novel loci were highlighted with green color. Two-sided test was conducted.
Associations between genetically predicted mRNA expression levels of candidate target genes of identified CpG sites and prostate cancer risk.
| Gene | Blood tissue prediction model | Prostate tissue prediction model | ||||
|---|---|---|---|---|---|---|
| OR (95% CI)b | OR (95% CI)b | |||||
| 0.14 | 3.80 (2.91–4.96) | 1.39 × 10−22 | 0.18 | 1.41 (0.67–2.96) | 0.36 | |
| 0.06 | 0.36 (0.29–0.45) | 1.55 × 10−19 | NAd | NA | NA | |
| 0.03 | 2.21 (1.84–2.67) | 5.86 × 10−17 | NA | NA | NA | |
| 0.46 | 0.78 (0.73–0.83) | 2.03 × 10−16 | NA | NA | NA | |
| 0.09 | 1.86 (1.56–2.22) | 8.81 × 10−12 | NA | NA | NA | |
| 0.04 | 0.43 (0.34–0.55) | 1.19 × 10−11 | 0.03 | 0.97 (0.53–1.78) | 0.93 | |
| 0.40 | 1.11 (1.07–1.15) | 1.99 × 10−8 | 0.21 | 1.18 (1.11–1.25) | 3.24 × 10−8 | |
| 0.03 | 0.71 (0.62–0.80) | 2.13 × 10−8 | NA | NA | NA | |
| 0.05 | 3.20 (2.12–4.83) | 2.81 × 10−8 | 0.09 | 1.35 (1.17–1.55) | 3.59 × 10−5 | |
| 0.22 | 0.92 (0.89–0.95) | 3.65 × 10−8 | 0.06 | 0.79 (0.69–0.89) | 2.18 × 10−4 | |
| 0.44 | 1.07 (1.04–1.10) | 2.40 × 10−7 | 0.15 | 1.10 (1.06–1.14) | 5.61 × 10−7 | |
| 0.04 | 1.25 (1.14–1.36) | 7.44 × 10−7 | 0.03 | 1.91 (1.61–2.26) | 9.11 × 10−14 | |
| 0.48 | 1.06 (1.03–1.10) | 9.52 × 10−5 | 0.17 | 1.11 (1.04–1.18) | 1.16 × 10−3 | |
| 0.37 | 0.94 (0.90–0.97) | 8.86 × 10−4 | 0.18 | 0.89 (0.85–0.94) | 3.32 × 10−6 | |
| 0.01 | 0.66 (0.51–0.85) | 1.37 × 10−3 | 0.09 | 1.08 (0.99–1.18) | 0.08 | |
| 0.10 | 0.80 (0.69–0.92) | 2.52 × 10−3 | 0.15 | 0.83 (0.77–0.89) | 3.78 × 10−7 | |
| 0.10 | 1.19 (1.05–1.34) | 5.01 × 10−3 | NA | NA | NA | |
| 0.63 | 1.03 (1.01–1.06) | 9.02 × 10−3 | 0.45 | 0.95 (0.92–0.98) | 2.86 × 10−3 | |
aR2: mRNA expression prediction model performance (R2) derived using GTEx data.
bOR (odds ratio) and CI (confidence interval) per one standard deviation increase in genetically predicted mRNA expression levels.
cP value: derived from association analyses (two-sided); associations of genetically predicted expression in blood tissue with FDR < 0.05 are shown.
dNA: no prostate tissue prediction model was built.
Associations showing consistent direction of effect for the methylation–gene expression–prostate cancer risk pathway.
| CpG site | Chr | Position | Associated gene | Classification | DNA methylation and prostate cancer risk | DNA methylation and gene expression | Gene expression and prostate cancer risk | |||
|---|---|---|---|---|---|---|---|---|---|---|
| OR | Association coefficient | Association | OR | |||||||
| cg20240347 | 1 | 204465584 | Upstream | 0.93 | 2.61 × 10−19 | 0.21 | 1.69 × 10−14 | 0.36 | 1.55 × 10−19 | |
| cg15199181 | 1 | 205670604 | Upstream | 0.94 | 5.10 × 10−9 | −0.08 | 2.18 × 10−3 | 3.20 | 2.81 × 10−8 | |
| cg14893161 | 1 | 205819251 | UTR5 | 0.97 | 1.11 × 10−7 | −0.08 | 2.70 × 10−3 | 1.07 | 2.40 × 10−7 | |
| cg07167872 | 1 | 205819463 | Upstream | 0.97 | 1.47 × 10−7 | −0.08 | 1.83 × 10−3 | |||
| cg24503407 | 1 | 205819492 | Upstream | 0.97 | 1.27 × 10−7 | −0.08 | 2.78 × 10−3 | |||
| cg07157834 | 1 | 205819609 | Upstream | 0.96 | 1.07 × 10−7 | −0.08 | 2.12 × 10−3 | |||
| cg02652597 | 2 | 85811292 | Upstream | 0.93 | 6.31 × 10−7 | −0.16 | 8.76 × 10−9 | 1.19 | 5.01 × 10−3 | |
| cg10165864 | 2 | 173419899 | Upstream | 0.89 | 6.02 × 10−14 | −0.14 | 9.34 × 10−8 | 1.86 | 8.81 × 10−12 | |
| cg16797009 | 2 | 173472347 | Downstream | 0.90 | 2.31 × 10−16 | −0.17 | 3.52 × 10−10 | |||
| cg25053018 | 2 | 173477995 | Downstream | 1.19 | 4.47 × 10−20 | 0.11 | 3.10 × 10−5 | |||
| cg07128416 | 3 | 113160490 | Upstream | 1.25 | 9.81 × 10−11 | 0.09 | 6.67 × 10−4 | 1.25 | 7.44 × 10−7 | |
| cg07054641 | 3 | 113160554 | Upstream | 1.22 | 6.46 × 10−11 | 0.09 | 6.47 × 10−4 | |||
| cg20138861 | 3 | 169775992 | Intronic | 1.17 | 3.70 × 10−14 | −0.11 | 5.97 × 10−5 | 0.78 | 2.03 × 10−16 | |
| cg24064041 | 6 | 30165027 | Intronic | 0.91 | 3.36 × 10−9 | 0.13 | 8.69 × 10−7 | 0.43 | 1.19 × 10−11 | |
| cg00266604 | 6 | 30178343 | Intronic | 1.21 | 2.05 × 10−12 | −0.10 | 3.84 × 10−4 | |||
| cg12001709 | 6 | 31466798 | Intronic | 0.96 | 4.25 × 10−8 | 0.10 | 1.73 × 10−4 | 0.94 | 8.86 × 10−4 | |
| cg13892322 | 6 | 31648564 | Upstream | 0.88 | 5.48 × 10−7 | −0.12 | 4.42 × 10−6 | 1.06 | 9.52 × 10−5 | |
| cg22786465 | 6 | 31649502 | Downstream | 1.23 | 7.28 × 10−10 | 0.08 | 2.49 × 10−3 | |||
| cg02733847 | 6 | 31649519 | Downstream | 1.27 | 2.76 × 10−7 | 0.11 | 1.05 × 10−4 | |||
| cg25769566 | 6 | 31651278 | Downstream | 1.05 | 5.09 × 10−8 | 0.26 | <2.00 × 10−16 | |||
| cg24520975 | 6 | 31651362 | Downstream | 1.15 | 6.87 × 10−10 | 0.10 | 2.37 × 10−4 | |||
| cg07306190 | 6 | 34760872 | Intronic | 0.95 | 2.36 × 10−8 | −0.33 | <2.00 × 10−16 | 1.11 | 1.99 × 10−8 | |
| cg01715842 | 16 | 85045600 | Upstream | 1.05 | 2.95 × 10−7 | −0.09 | 6.68 × 10−4 | 0.80 | 2.52 × 10−3 | |
| cg01799818 | 17 | 594735 | Intronic | 1.10 | 7.40 × 10−19 | 0.09 | 4.81 × 10−4 | 1.03 | 9.02 × 10−3 | |
| cg10288850 | 22 | 43539588 | Upstream | 2.18 | 6.23 × 10−19 | −0.09 | 8.52 × 10−4 | 0.71 | 2.13 × 10−8 | |
Fig. 2Study design.
a Study design flow chart; b overview of the integrative-omics analysis. (1) Genetic prediction model building for blood DNA methylation levels; (2) associations of genetically predicted DNA methylation in blood and prostate cancer risk; (3) expression quantitative trait methylation; (4) genetic prediction models for blood and prostate tissue gene expression levels; (5) associations of genetically predicted gene expression in blood and prostate tissue with prostate cancer risk. Results in 1 were based on data of the Framingham Heart Study (FHS) (N = 1595). Results in 2 and 5 were based on the summary statistics of the PRACTICAL, CRUK, CAPS, BPC3, and PEGASUS consortia (N = 79,194 cases and 61,112 controls). Results in 3 were based on data of the FHS (N = 1367) and The Cancer Genome Atlas (N = 34). Results in 4 were based on data of the Genotype-Tissue Expression project (version 8).