| Literature DB >> 34355204 |
Hongjie Chen1, Arunabha Majumdar2,3, Lu Wang4, Siddhartha Kar5, Kevin M Brown6, Helian Feng7, Constance Turman8, Joe Dennis9, Douglas Easton9, Kyriaki Michailidou10,11, Jacques Simard12, Timothy Bishop13, Iona C Cheng14, Jeroen R Huyghe15, Stephanie L Schmit16, Tracy A O'Mara17, Amanda B Spurdle17, Puya Gharahkhani18, Johannes Schumacher19, Janusz Jankowski20,21, Ines Gockel22, Melissa L Bondy23, Richard S Houlston24, Robert B Jenkins25, Beatrice Melin26, Corina Lesseur27,28, Andy R Ness29,30, Brenda Diergaarde31,32, Andrew F Olshan33,34, Christopher I Amos35, David C Christiani8,36, Maria T Landi6, James D McKay28, Myriam Brossard37,38, Mark M Iles39, Matthew H Law18,40, Stuart MacGregor18, Jonathan Beesley17, Michelle R Jones41, Jonathan Tyrer42, Stacey J Winham43, Alison P Klein44,45, Gloria Petersen43, Donghui Li46, Brian M Wolpin47, Rosalind A Eeles48,49, Christopher A Haiman50, Zsofia Kote-Jarai48,49, Fredrick R Schumacher51,52, Paul Brennan29, Stephen J Chanock6, Valerie Gaborieau29, Mark P Purdue6, Paul Pharoah9, Rayjean J Hung38, Laufey T Amundadottir6, Peter Kraft7,8, Bogdan Pasaniuc2,53,54, Sara Lindström1,15.
Abstract
Genome-wide association studies (GWASs) have identified thousands of cancer risk loci revealing many risk regions shared across multiple cancers. Characterizing the cross-cancer shared genetic basis can increase our understanding of global mechanisms of cancer development. In this study, we collected GWAS summary statistics based on up to 375,468 cancer cases and 530,521 controls for fourteen types of cancer, including breast (overall, estrogen receptor [ER]-positive, and ER-negative), colorectal, endometrial, esophageal, glioma, head/neck, lung, melanoma, ovarian, pancreatic, prostate, and renal cancer, to characterize the shared genetic basis of cancer risk. We identified thirteen pairs of cancers with statistically significant local genetic correlations across eight distinct genomic regions. Specifically, the 5p15.33 region, harboring the TERT and CLPTM1L genes, showed statistically significant local genetic correlations for multiple cancer pairs. We conducted a cross-cancer fine-mapping of the 5p15.33 region based on eight cancers that showed genome-wide significant associations in this region (ER-negative breast, colorectal, glioma, lung, melanoma, ovarian, pancreatic, and prostate cancer). We used an iterative analysis pipeline implementing a subset-based meta-analysis approach based on cancer-specific conditional analyses and identified ten independent cross-cancer associations within this region. For each signal, we conducted cross-cancer fine-mapping to prioritize the most plausible causal variants. Our findings provide a more in-depth understanding of the shared inherited basis across human cancers and expand our knowledge of the 5p15.33 region in carcinogenesis.Entities:
Year: 2021 PMID: 34355204 PMCID: PMC8336922 DOI: 10.1016/j.xhgg.2021.100041
Source DB: PubMed Journal: HGG Adv ISSN: 2666-2477
Overview of the cancer GWAS datasets included in this study
| Cancer types | No. of cases | No. of controls | No. of SNPs after QC[ | Reference |
|---|---|---|---|---|
| Breast, overall | 122,977 | 105,974 | 9,934,907 | Michailidou et al., 2017[ |
| Breast, ER-negative | 21,468 | 100,564 | 9,942,394 | Michailidou et al., 2017[ |
| Breast, ER-positive | 69,501 | 95,042 | 10,267,258 | Michailidou et al., 2017[ |
| Colorectal | 55,168 | 65,160 | 7,910,462 | Huyghe et al., 2019[ |
| Endometrial | 12,906 | 108,979 | 11,595,492 | O’Mara et al., 2018[ |
| Esophageal | 4,112 | 13,663 | 9,038,176 | Gharahkhani et al., 2016[ |
| Glioma | 12,488 | 18,169 | 6,931,587 | Melin et al., 2017[ |
| Head/neck | 6,034 | 6,585 | 7,471,918 | Lesseur et al., 2016[ |
| Lung | 29,266 | 56,450 | 7,673,197 | McKay et al., 2017[ |
| Melanoma | 12,814 | 23,203 | 7,748,523 | Law et al., 2015[ |
| Ovarian | 22,406 | 40,951 | 9,870,154 | Phelan et al., 2017[ |
| Pancreatic | 8,638 | 12,217 | 9,568,913 | Klein et al.,2018[ |
| Prostate | 79,166 | 61,106 | 10,002,813 | Schumacher et al., 2018[ |
| Renal | 10,784 | 20,407 | 8,362,393 | Scelo et al., 2017[ |
Filtered out variants with imputation quality score < 0.3, minor allele frequency (MAF) < 1%, and |log odds ratio| > 3.
Figure 1.Analytical pipeline for the study
Regions with significant pairwise local genetic correlation were first identified by ρHESS. For regions harboring disproportionally high shared heritability across cancers, joint test of ASSET two-sided meta-analysis and COJO conditional analysis was then repeatedly conducted to identify independent signals, until no variant reached genome-wide significance (p < 5 × 10−8) in two-sided ASSET meta-analysis. For each signal, GWAS summary statistics conditional on other signals of selected cancer were used in multi-trait fine-mapping to estimate the posterior probability of being causal.
Genomic regions with statistically significant local genetic correlations between cancers
| Cancer site 1 | Cancer site 2 | Region | Region start | Region end | No. of SNPs | Direction | p value[ |
|---|---|---|---|---|---|---|---|
| ER-negative breast | prostate | 1q32 | 203334734 | 204681068 | 2,364 | negative | 3.45E–06 |
| Colorectal | prostate | 4q24 | 105305294 | 107501305 | 2,986 | positive | 1.05E–05 |
| Glioma | prostate | 5p15.33 | 982252 | 2132442 | 2,631 | negative | 4.03E–19 |
| Colorectal | glioma | 5p15.33 | 982252 | 2132442 | 2,465 | negative | 1.24E–05 |
| ER-negative breast | prostate | 5p15.33 | 982252 | 2132442 | 3,111 | negative | 1.90E–05 |
| ER-negative breast | glioma | 5p15.33 | 982252 | 2132442 | 2,631 | positive | 2.40E–05 |
| Melanoma | pancreatic | 5p15.33 | 982252 | 2132442 | 2,849 | positive | 4.85E–06 |
| Lung | pancreatic | 5p15.33 | 982252 | 2132442 | 2,935 | negative | 1.39E–07 |
| Overall breast | colorectal | 5q11.2 | 55417349 | 56621102 | 2,131 | positive | 1.97E–05 |
| Colorectal | prostate | 8q24 | 126410917 | 128659111 | 4,275 | positive | 1.97E–16 |
| ER-positive breast | prostate | 10q26.13 | 123231465 | 123900545 | 1,481 | negative | 1.22E–06 |
| Endometrial | prostate | 17q12 | 34469036 | 36809344 | 2,748 | positive | 5.01E–09 |
| ER-negative breast | ovarian | 19p13.11 | 16374416 | 18409862 | 4,103 | positive | 1.11E–07 |
Local genetic correlation between cancers across the genome (N = 1,703 regions) was estimated using HESS.
Cutoff of the statistical significance was defined as p < 0.05/1,703 = 2.94E–05, after adjusting for multiple comparison.
Figure 2.Pairwise local genetic correlation between selected cancer types at chromosome 5p15.33 (982,252–2,132,442 bp)
Cancer pairs with statistically significant (p value < 0.05/1,703 = 2.94 × 10−5) local genetic correlation are annotated with an asterisk.
Figure 3.Categorizing 14 cancer types into three tiers based on their p value distribution at 5p15.33
Pattern A cancers (A) have one single peak by the TERT gene; pattern B cancers (B) have a broader signal at the CLPTM1L gene as well as a signal by the TERT gene; pattern C cancers (C) have no genome-wide significant association in this region. Genome-wide significant levels at p value = 5 × 10−8 are marked with red dashed line in (A)–(C). Distribution of Z scores at the 5p15.33 region from the GWAS results of ER-negative breast, glioma, and prostate cancer (D). Only variants with p < 0.05 for both cancers are included. While the associations for ER-negative breast cancer and glioma overlap, the SNP associations with ER-negative breast cancer and prostate, as well as glioma and prostate, are in opposite directions.
Figure 4.Distribution of two-sided subset-based meta-analysis p values across eight cancer types at the 5p15.33 region
Index variants of ten independent candidate signals, identified by the iterative COJO-ASSET analysis, are annotated and marked in red.
Ten independent cross-cancer signals in 5p15.33 region identified in the joint analysis of COJO-ASSET
| No. of iteration | SNP with top ASSET p value | ASSET p value[ | Significant cancer subset, identified by ASSET[ | GWAS p values[ | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| ER-neg BrCa | Colorectal | Glioma | Lung | Melanoma | Ovarian | Pancreatic | Prostate | ||||
| Initiation | rs10069690 (5:1279790:C:T) | 4.05E–126 | set 1: ER-neg BrCa, glioma; | 1.34E–35 | 1.13E–01 | 2.32E–66 | 9.39E–01 | 9.08E–01 | 1.74E–08 | 3.29E–03 | 1.44E–45 |
| 1 | rs465498 (5:1325803:A:G) | 1.75E–59 | set 1: melanoma, pancreatic; | 1.61E–02 | 3.89E–02 | 6.85E–05 | 2.68E–32 | 2.08E–17 | 1.49E–01 | 7.45E–17 | 9.15E–05 |
| 2 | rs2853677 (5:1287194:G:A) | 3.24E–39 | set 1: ER-neg BrCa, colorectal, lung; | 4.94E–05 | 3.65E–10 | 1.08E–28 | 2.66E–18 | 1.12E–02 | 1.49E–06 | 2.87E–08 | 1.53E–02 |
| 3 | rs11414507 (5:1291331:A:AC) | 1.29E–19 | set 1: ER-neg BrCa; | 1.34E–16 | NA | NA | NA | NA | 1.63E–04 | 8.18E–01 | 1.61E–45 |
| 4 | rs35033501 (5:1253918:C:T) | 9.18E–15 | set 1: lung, melanoma, prostate; | 9.90E–05 | 8.55E–03 | 1.24E–02 | 3.84E–01 | 3.66E–02 | 5.55E–01 | 4.48E–05 | 2.37E–15 |
| 5 | rs7705526 (5:1285974:C:A) | 1.42E–11 | set 1: glioma, lung, melanoma; | 1.37E–04 | 3.17E–04 | 5.01E–61 | 1.01E–18 | 3.24E–03 | 1.34E–09 | 2.15E–03 | 2.78E–14 |
| 6 | rs192723047 (5:1273183:A:G) | 1.63E–11 | set 1: prostate; | 4.37E–17 | NA | NA | NA | NA | 1.21E–02 | 8.96E–02 | 5.40E–24 |
| 7 | rs35226131 (5:1295373:C:T) | 2.32E–09 | set 1: pancreatic; | 8.83E–02 | 5.17E–07 | 8.66E–01 | NA | NA | 3.63E–01 | 4.30E–08 | 3.20E–06 |
| 8 | rs35334674 (5:1292299:G:A) | 1.24E–08 | set 1: ER-neg BrCa, pancreatic; | 3.36E–02 | 4.40E–07 | 4.07E–02 | 2.15E–02 | NA | 5.93E–01 | 1.43E–02 | 3.23E–02 |
| 9 | rs3888705 (5:1298645:G:A) | 2.65E–08 | set 1: ER-neg BrCa, colorectal, ovarian; set 2: pancreatic, prostate | 1.78E–02 | 1.04E–03 | 9.32E–07 | 3.62E–03 | NA | 2.75E–03 | 7.13E–01 | 1.94E–03 |
| 10 | rs148487301 (5:1318797:T:C) | 8.70E–06 | not reached genome-wide significance, iteration stopped | ||||||||
Genome-wide significance with p value < 5 × 10−8. BrCa, breast cancer; NA, the SNP was not included in the GWAS results of corresponding cancer..
p values from the ASSET meta-analysis allowing opposite direction of the effect (two-sided analysis).
Cancer subsets included in the two-sided ASSET meta-analysis with best p value. Set 1/2 represents the selected cancer types with positive/negative association with the SNP.
p values from the original GWAS results of eight cancers.
Figure 5.Correlation matrix showing the pairwise linkage disequilibrium (LD) between 10 candidate signals, identified using an iterative COJO-ASSET analysis
LD was calculated based on the European ancestry populations in 1000 Genomes (1000G) Project.
Statistical fine-mapping prioritized the potential causal SNP within 10 independent cross-cancer signals in 5p15.33 region, using PAINTOR v.3.0
| Index SNP | Fine-mapped cancer types | Fine-mapping without functional prior | Fine-mapping with functional prior[ | ||||||
|---|---|---|---|---|---|---|---|---|---|
| 95% PP credible set[ | PP, index SNP | SNP with highest PP | Highest PP | 95% PP credible set | PP, index SNP | SNP with highest PP | Highest PP | ||
| rs35033501 (5:1253918:C:T) | ER-neg BrCa, lung, melanoma, pancreatic, prostate | rs35033501, rs71595003 | 0.875 | rs35033501 (5:1253918:C:T) | 0.875 | rs71595003 | <0.001 | rs71595003 (5:1292118:G:A) | 0.999 |
| rs192723047 (5:1273183:A:G) | ER-neg BrCa, prostate | rs192723047 | 1.000 | rs192723047 (5:1273183:A:G) | 1.000 | rs192723047 | 1.000 | rs192723047 (5:1273183:A:G) | 1.000 |
| rs10069690 (5:1279790:C:T) | ER-neg BrCa, glioma, ovarian, pancreatic, prostate | rs10069690 | 1.000 | rs10069690 (5:1279790:C:T) | 1.000 | rs10069690 | 1.000 | rs10069690 (5:1279790:C:T) | 1.000 |
| rs7705526 (5:1285974:C:A) | ER-neg BrCa, colorectal, glioma, lung, melanoma, ovarian, pancreatic, prostate | rs7705526 | 1.000 | rs7705526 (5:1285974:C:A) | 1.000 | rs7705526 | 1.000 | rs7705526 (5:1285974:C:A) | 1.000 |
| rs2853677 (5:1287194:G:A) | ER-neg BrCa, colorectal, glioma, lung, melanoma, ovarian, pancreatic, prostate | rs2853677 | 1.000 | rs2853677 (5:1287194:G:A) | 1.000 | rs2853677 | 1.000 | rs2853677 (5:1287194:G:A) | 1.000 |
| rs11414507 (5:1291331:A:AC) | ER-neg BrCa, prostate | rs7712562, rs74682426, rs11414507, rs7449190 | 0.265 | rs7712562 (5:1296072:A:G) | 0.367 | rs7712562, rs11414507 | 0.419 | rs7712562 (5:1296072:A:G) | 0.581 |
| rs35334674 (5:1292299:G:A) | ER-neg BrCa, colorectal, glioma, lung, pancreatic, prostate | rs35334674 | 0.987 | rs35334674 (5:1292299:G:A) | 0.987 | rs35334674 | 0.988 | rs35334674 (5:1292299:G:A) | 0.988 |
| rs35226131 (5:1295373:C:T) | colorectal, glioma, pancreatic, prostate | rs35226131, rs35161420, rs61748181, rs33958877, rs114616103 | 0.273 | rs35226131 (5:1295373:C:T) | 0.273 | rs35226131, rs35161420, rs61748181, rs33958877, rs114616103 | 0.273 | rs35226131 (5:1295373:C:T) | 0.273 |
| rs3888705 (5:1298645:G:A) | ER-neg BrCa, colorectal, ovarian, pancreatic, prostate | rs34156553, rs4075202, rs3888705, rs77776598, rs4975539, rs6875445, rs4583925, rs78844046, rs79323805, rs4507531, rs78368589, rs4487533, rs6554678, rs4498293, rs4532396 | 0.092 | rs34156553 (5:1243245:C:T) | 0.103 | rs34156553, rs4075202, rs3888705, rs77776598, rs4975539, rs6875445, rs4583925, rs78844046, rs79323805, rs4507531, rs78368589, rs4487533, rs6554678, rs4498293, rs4532396 | 0.092 | rs34156553 (5:1243245:C:T) | 0.103 |
| rs465498 (5:1325803:A:G) | lung, melanoma, pancreatic | rs380286, rs421629, rs465498, rs452932, rs459961, rs455433, rs13178866, rs460073 | 0.146 | rs380286 (5:1320247:G:A) | 0.462 | rs421629, rs465498 | 0.437 | rs421629 (5:1320136:G:A) | 0.563 |
Used open chromatin narrow peaks identified from the normal tissue or primary cells of the disease-related organs as the functional prior. Open chromatin narrow peaks were obtained from the ENCODE project.
SNPs within the credible set were ranked by the posterior probability (PP).
Figure 6.Open chromatin in different cancer types
Genomic location of tissue-specific open chromatin narrow peaks, which were used as functional prior in the fine-mapping analysis.