| Literature DB >> 27379672 |
Mengmeng Du1,2, Shuo Jiao2, Stephanie A Bien2,3, Manish Gala4, Goncalo Abecasis5, Stephane Bezieau6, Hermann Brenner7,8, Katja Butterbach7, Bette J Caan9, Christopher S Carlson2, Graham Casey10, Jenny Chang-Claude11, David V Conti10, Keith R Curtis2, David Duggan12, Steven Gallinger13, Robert W Haile10, Tabitha A Harrison2, Richard B Hayes14, Michael Hoffmeister7, John L Hopper15, Thomas J Hudson16,17, Mark A Jenkins15, Sébastien Küry6, Loic Le Marchand18, Suzanne M Leal19, Polly A Newcomb2,3, Deborah A Nickerson20, John D Potter2,3,21, Robert E Schoen22, Fredrick R Schumacher10, Daniela Seminara23, Martha L Slattery24, Li Hsu2, Andrew T Chan4,25, Emily White2,3, Sonja I Berndt26, Ulrike Peters2,3.
Abstract
Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) associated with colorectal cancer risk. These SNPs may tag correlated variants with biological importance. Fine-mapping around GWAS loci can facilitate detection of functional candidates and additional independent risk variants. We analyzed 11,900 cases and 14,311 controls in the Genetics and Epidemiology of Colorectal Cancer Consortium and the Colon Cancer Family Registry. To fine-map genomic regions containing all known common risk variants, we imputed high-density genetic data from the 1000 Genomes Project. We tested single-variant associations with colorectal tumor risk for all variants spanning genomic regions 250-kb upstream or downstream of 31 GWAS-identified SNPs (index SNPs). We queried the University of California, Santa Cruz Genome Browser to examine evidence for biological function. Index SNPs did not show the strongest association signals with colorectal tumor risk in their respective genomic regions. Bioinformatics analysis of SNPs showing smaller P-values in each region revealed 21 functional candidates in 12 loci (5q31.1, 8q24, 11q13.4, 11q23, 12p13.32, 12q24.21, 14q22.2, 15q13, 18q21, 19q13.1, 20p12.3, and 20q13.33). We did not observe evidence of additional independent association signals in GWAS-identified regions. Our results support the utility of integrating data from comprehensive fine-mapping with expanding publicly available genomic databases to help clarify GWAS associations and identify functional candidates that warrant more onerous laboratory follow-up. Such efforts may aid the eventual discovery of disease-causing variant(s).Entities:
Mesh:
Year: 2016 PMID: 27379672 PMCID: PMC4933364 DOI: 10.1371/journal.pone.0157521
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Association results for variants showing the smallest P-values (top SNPs) in 31 regions surrounding previous GWAS-identified variants (index SNPs).
| Locus | Index SNP | Level | # SNPs | Top SNP | Level | Position | Genetic | Top SNP results | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| in | region | index | Ref/ | Ref | OR (95% CI) | ||||||||
| region | SNP | other | allele | ||||||||||
| allele | freq | ||||||||||||
| 1q25.3 | rs10911251 | 1886 | rs6669796 | 183082825 | 0.87 | G/C | 0.56 | 1.11 (1.07, 1.16) | 5.8E-07 | 6.0E-01 | |||
| 1q41 | rs6687758 | 1885 | rs143030473 | 222161943 | <0.2 | C/T | 0.99 | 1.97 (1.35, 2.87) | 4.5E-04 | 9.2E-01 | |||
| rs6691170 | 2096 | rs143030473 | 222161943 | <0.2 | C/T | 0.99 | 1.97 (1.35, 2.87) | 4.5E-04 | 9.2E-01 | ||||
| 2q32.3 | rs11903757 | 1536 | rs6731095 | 192589442 | 1.00 | A/G | 0.84 | 0.89 (0.84, 0.95) | 2.3E-04 | 1.1E-01 | |||
| 3q26.2 | rs10936599 | 1651 | rs2421771 | 169411370 | <0.2 | C/T | 0.96 | 1.21 (1.05, 1.40) | 7.5E-03 | 9.2E-01 | |||
| 5q31.1 | rs647161 | 1499 | rs2193941 | 134469594 | 0.57 | G/A | 0.72 | 1.09 (1.05, 1.14) | 4.0E-05 | 9.2E-01 | |||
| 6p21 | rs1321311 | 2364 | rs13215272 | 36589502 | <0.2 | C/T | 0.73 | 1.10 (1.04, 1.15) | 5.3E-04 | 8.0E-02 | |||
| 8q23.3 | rs16892766 | 1432 | rs16888589 | 117635602 | 0.92 | A/G | 0.91 | 0.80 (0.75, 0.86) | 3.3E-10 | 7.1E-01 | |||
| 8q24 | rs6983267 | 2257 | rs7013278 | 128414892 | 0.40 | C/T | 0.66 | 0.88 (0.85, 0.92) | 7.8E-11 | 7.4E-01 | |||
| 9p24 | rs719725 | 1907 | rs7875812 | 6364533 | 1.00 | A/T | 0.63 | 1.09 (1.04, 1.13) | 7.9E-05 | 7.4E-01 | |||
| 10p14 | rs10795668 | 2363 | rs1537603 | 8734295 | 0.45 | C/T | 0.50 | 1.06 (1.02, 1.10) | 1.2E-03 | 7.9E-01 | |||
| 11q13.4 | rs3824999 | 1788 | rs72977282 | 74300441 | 0.59 | T/A | 0.59 | 0.92 (0.88, 0.95) | 9.8E-06 | 8.1E-01 | |||
| 11q23 | rs3802842 | 1830 | rs7130173 | 111154072 | 0.95 | C/A | 0.73 | 0.89 (0.85, 0.93) | 2.7E-08 | 3.8E-01 | |||
| 12p13.32 | rs10774214 | 1656 | rs3217874 | 4400808 | <0.2 | C/T | 0.58 | 0.89 (0.85, 0.93) | 3.1E-08 | 7.8E-01 | |||
| rs3217810 | 1571 | rs3217874 | 4400808 | <0.2 | C/T | 0.58 | 0.89 (0.85, 0.93) | 3.1E-08 | 7.8E-01 | ||||
| rs3217901 | 1539 | rs3217874 | 4400808 | 0.65 | C/T | 0.58 | 0.89 (0.85, 0.93) | 3.1E-08 | 7.8E-01 | ||||
| 12q13.13 | rs11169552 | 1310 | rs7306677 | 51205763 | <0.2 | C/T | 0.62 | 0.92 (0.89, 0.96) | 6.3E-05 | 3.4E-01 | |||
| rs7136702 | 967 | rs11169524 | 51089734 | 0.67 | T/A | 0.67 | 0.92 (0.88, 0.96) | 8.5E-05 | 5.3E-01 | ||||
| 12q24.21 | rs59336 | 2072 | rs1427760 | 115100714 | 0.71 | T/C | 0.50 | 0.90 (0.87, 0.94) | 5.0E-07 | 2.5E-01 | |||
| 14q22.2 | rs1957636 | 1613 | rs10130587 | 54419110 | <0.2 | G/C | 0.63 | 0.89 (0.85, 0.93) | 4.1E-08 | 9.0E-01 | |||
| rs4444235 | 1659 | rs10130587 | 54419110 | 0.67 | G/C | 0.63 | 0.89 (0.85, 0.93) | 4.1E-08 | 9.0E-01 | ||||
| 15q13 | rs11632715 | 1735 | rs2293582 | 33010412 | 0.23 | G/A | 0.80 | 0.86 (0.82, 0.91) | 1.0E-09 | 3.8E-01 | |||
| rs16969681 | 1692 | rs2293582 | 33010412 | 0.22 | G/A | 0.80 | 0.86 (0.82, 0.91) | 1.0E-09 | 3.8E-01 | ||||
| rs4779584 | 1701 | rs2293582 | 33010412 | 0.71 | G/A | 0.80 | 0.86 (0.82, 0.91) | 1.0E-09 | 3.8E-01 | ||||
| 16q22.1 | rs9929218 | 1867 | rs9932005 | 68822019 | 0.63 | C/T | 0.76 | 1.08 (1.03, 1.13) | 1.2E-03 | 6.6E-01 | |||
| 18q21 | rs4939827 | 1931 | rs2337113 | 46452327 | 0.91 | A/G | 0.55 | 1.13 (1.09, 1.18) | 1.0E-10 | 2.5E-01 | |||
| 19q13.1 | rs10411210 | 2307 | rs75414102 | 33302424 | <0.2 | G/A | 0.99 | 0.51 (0.33, 0.79) | 2.6E-03 | 9.1E-01 | |||
| 20p12.3 | rs2423279 | 1901 | rs118184022 | 7719449 | <0.2 | C/T | 0.98 | 0.58 (0.41, 0.83) | 2.8E-03 | 9.4E-01 | |||
| rs4813802 | 1825 | rs6085662 | 6698372 | 1.00 | G/C | 0.65 | 0.91 (0.87, 0.95) | 5.1E-06 | 3.0E-01 | ||||
| rs961253 | 1990 | rs56083061 | 6430696 | 0.28 | G/A | 0.79 | 0.90 (0.85, 0.94) | 5.6E-06 | 7.6E-01 | ||||
| 20q13.33 | rs4925386 | 2173 | rs1760073 | 60926106 | 0.81 | G/A | 0.67 | 1.08 (1.04, 1.12) | 1.9E-04 | 8.8E-02 | |||
Abbreviations: SNP, single nucleotide polymorphism; Ref, reference; Freq, frequency; OR, odds ratio; CI, confidence interval; P-het, P value for test of heterogeneity across studies
aLevel of statistical evidence:
*SNP with P ≤ 0.05
**SNP with P ≤ 0.01
***SNP with P≤0.001. P-values are not adjusted for multiple testing as these data are intended to help refine previously identified CRC-related regions and highlight sets of variants warranting bioinformatics-based follow-up.
bBased on NCBI build 37 data.
cBased on 1000 Genomes Project data in European populations.
dAdjusted for age in years, sex, first 3 principal components, study center, batch (ASTERISK only), and smoking (PHS only).
eEstimate calculated using the log-additive genetic model for each additional reference allele.
Fig 1Comparison of P-values for GWAS-identified variants (index SNPs) vs. variants with the smallest P-values (top SNPs) in 31 regions.
The height of each bar reflects the–log10 P-value of each SNP in our study population. A grey bar indicates the index SNP, and a black bar indicates the top SNP.
SNPs showing strong evidence for functional importance based on bioinformatics.
| Index SNP | Strong | Mean | Predicted | Genomic | # Cell lines | Show histone regulatory marks | Proteins | Altered binding | Evidence | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| functional | imput- | index | regulated | location of | showing | # Cancer/ | # Normal | Normal | bound | motifs | for | ||
| candidate | ation | SNP | gene | functional | open | progenitor | cell lines | colorectal | conserved | ||||
| candidate | chromatin | cell lines | tissues | region | |||||||||
| 5q31.1 | rs647161 | rs1366111 | 0.89 | 0.57 | intronic | 2 | 2 | — | 1, 2 | POL2,EGR1 | Zfx | 0 | |
| 8q24 | rs6983267 | rs6983267 | 0.95 | index | intergenic | 1 CRC line | 1 | 2 | 2 | TCF4 (CRC line), | AP-1, Sox, TCF4 | 1 | |
| P300, FOXA1, | |||||||||||||
| RXRA, SP1 | |||||||||||||
| 11q13.4 | rs3824999 | rs3824999 | 0.99 | index | intronic | 4 | — | 2 | 4 | JUND | AP-3, Evi-1, Mef2 | 0 | |
| 11q23 | rs3802842 | rs7130173 | 0.95 | 0.93 | intronic | 31 including | — | — | 2, 3 | RAD21, SMC3, | GSP1, SRF | 0 | |
| 1 CRC line | YY1, CTCF | ||||||||||||
| 12p13.32 | rs3217901 | rs3217827 | 0.85 | 0.61 | intronic | 1 | — | — | 3 | RAD21, CTCF | — | 0 | |
| 12q24.21 | rs59336 | rs71807 | 0.89 | 0.86 | intergenic | 1 | 2 | 4 | 1, 3 | BAF155, HAE2F1, | — | 0 | |
| CTCF | |||||||||||||
| rs484443 | 0.91 | 0.90 | intronic | 8 | 1 | 2 | 2, 4 | P300,USF1 | RREB-1, Rad21 | 0 | |||
| 14q22.2 | rs4444235 | rs2071047 | 1.00 | 0.75 | intronic | 9 | 3 | 3 | 1–4 | TCF4 (CRC line) | BCL, HNF4, RXRA | 0 | |
| rs10130587 | 0.74 | 0.67 | intronic | 2 | 3 | 3 | 1–4 | GATA3 | 5 motifs | 0 | |||
| rs35107139 | 0.82 | 0.64 | intronic | 2 | 3 | 3 | 1, 3, 4 | GATA3 | 12 motifs | 0 | |||
| 15q13 | rs4779584 | rs2293582 | 0.94 | 0.71 | intronic | 4 | 3 | 5 | 1–4 | POL2 | — | 0 | |
| rs2293581 | 0.97 | 0.66 | intronic | 36 | 4 | 5 | 1–4 | SUZ12 | — | 0 | |||
| rs1406389 | 0.97 | 0.66 | intergenic | 12 | 3 | 4 | 1, 2, 4 | SUZ12 | Irf, SIX5 | 0 | |||
| 18q21 | rs4939827 | rs11874392 | 0.93 | 1.00 | intronic | 4 | 3 | 1 | 1, 2, 4 | 11 proteins | — | 1 | |
| rs4939567 | 0.91 | 1.00 | intronic | 1 | 4 | 3 | 1–4 | MAFK | RREB-1, VDR_2 | 1 | |||
| rs34007497 | 0.91 | 0.57 | intronic | 22 | 2 | 4 | 1–4 | CTCF,STAT1 | 4 motifs | 0 | |||
| 19q13.1 | rs10411210 | rs10411210 | 0.94 | index | intronic | 48 including | 2 | 1 | 1, 2 | 17 proteins | Mef2, STAT | 1 | |
| 1 CRC line | |||||||||||||
| 20p12.3 | rs961253 | rs966817 | 1.00 | 0.87 | intergenic | 18 | — | 1 | — | CJUN | Evi-1, FAC1, GLI | 1 | |
| rs4813802 | rs4813802 | 0.86 | index | intergenic | 10 | 2 | 2 | 2 | 6 proteins | — | 1 | ||
| rs6085661 | 0.89 | 0.76 | intergenic | 19 | 1 | — | 2 | 7 proteins | BCL, NRSF, VDR | 1 | |||
| 20q13.33 | rs4925386 | rs1741634 | 0.96 | 0.80 | intronic | 7 | — | 2 | 1, 2 | GR | EWSR1-FLI1 | 0 | |
aEstimated value of the squared correlation between imputed genotypes and true (unobserved) genotypes for the strong functional candidate, averaged across studies.
bBased on data from 1000 Genomes Pilot 1 CEU.
cBased on HaploReg data using proximity of each variant to a gene as well as the orientation (3’ or 5’) relative to the nearest end of the gene.
dBased on data from DNAse I hypersensitivity assays.
eENCODE ChIP-seq assay.
fBased on data from ChIP-seq assays in the Roadmap Epigenomics Project in the following normal (non-cancerous) tissues: (1) colon mucosa, (2) rectal mucosa, (3) colon smooth muscle, and (4) rectal smooth muscle.
gPosition weight matrix score between SNP alleles confers a change in log-odds (LOD) score > 5.
hBased on PhastCons scores: 1 = strong evidence of conserved element; 0 = no evidence of conserved element.