| Literature DB >> 30510241 |
Jeroen R Huyghe1, Stephanie A Bien1, Tabitha A Harrison1, Hyun Min Kang2, Sai Chen2, Stephanie L Schmit3, David V Conti4, Conghui Qu1, Jihyoun Jeon5, Christopher K Edlund4, Peyton Greenside6, Michael Wainberg7, Fredrick R Schumacher8, Joshua D Smith9, David M Levine10, Sarah C Nelson10, Nasa A Sinnott-Armstrong11, Demetrius Albanes12, M Henar Alonso13,14,15, Kristin Anderson16, Coral Arnau-Collell17, Volker Arndt18, Christina Bamia19,20, Barbara L Banbury1, John A Baron21, Sonja I Berndt12, Stéphane Bézieau22, D Timothy Bishop23, Juergen Boehm24, Heiner Boeing25, Hermann Brenner18,26,27, Stefanie Brezina28, Stephan Buch29, Daniel D Buchanan30,31,32, Andrea Burnett-Hartman33, Katja Butterbach18, Bette J Caan34, Peter T Campbell35, Christopher S Carlson1,36, Sergi Castellví-Bel17, Andrew T Chan37,38,39,40,41,42, Jenny Chang-Claude43,44, Stephen J Chanock12, Maria-Dolores Chirlaque14,45, Sang Hee Cho46, Charles M Connolly1, Amanda J Cross47,48, Katarina Cuk18, Keith R Curtis1, Albert de la Chapelle49, Kimberly F Doheny50, David Duggan51, Douglas F Easton52,53, Sjoerd G Elias54, Faye Elliott23, Dallas R English55,56, Edith J M Feskens57, Jane C Figueiredo58,59, Rocky Fischer60, Liesel M FitzGerald56,61, David Forman62, Manish Gala37,39, Steven Gallinger63, W James Gauderman4, Graham G Giles55,56, Elizabeth Gillanders64, Jian Gong1, Phyllis J Goodman65, William M Grady66, John S Grove67, Andrea Gsur28, Marc J Gunter68, Robert W Haile69, Jochen Hampe29, Heather Hampel70, Sophia Harlid71, Richard B Hayes72, Philipp Hofer28, Michael Hoffmeister18, John L Hopper55,73, Wan-Ling Hsu10, Wen-Yi Huang12, Thomas J Hudson74, David J Hunter41,75, Gemma Ibañez-Sanz13,76,77, Gregory E Idos4, Roxann Ingersoll50, Rebecca D Jackson78, Eric J Jacobs35, Mark A Jenkins55, Amit D Joshi39,41, Corinne E Joshu79, Temitope O Keku80, Timothy J Key81, Hyeong Rok Kim82, Emiko Kobayashi1, Laurence N Kolonel83, Charles Kooperberg1, Tilman Kühn43, Sébastien Küry22, Sun-Seog Kweon84,85, Susanna C Larsson86, Cecelia A Laurie10, Loic Le Marchand67, Suzanne M Leal87, Soo Chin Lee88,89, Flavio Lejbkowicz90,91,92, Mathieu Lemire74, Christopher I Li1, Li Li93, Wolfgang Lieb94, Yi Lin1, Annika Lindblom95,96, Noralane M Lindor97, Hua Ling50, Tin L Louie10, Satu Männistö98, Sanford D Markowitz99, Vicente Martín14,100, Giovanna Masala101, Caroline E McNeil102, Marilena Melas4, Roger L Milne55,56, Lorena Moreno17, Neil Murphy68, Robin Myte71, Alessio Naccarati103,104, Polly A Newcomb1,36, Kenneth Offit105,106, Shuji Ogino40,41,107,108, N Charlotte Onland-Moret54, Barbara Pardini104,109, Patrick S Parfrey110, Rachel Pearlman70, Vittorio Perduca111,112, Paul D P Pharoah52, Mila Pinchev91, Elizabeth A Platz79, Ross L Prentice1, Elizabeth Pugh50, Leon Raskin113, Gad Rennert91,92,114, Hedy S Rennert91,92,114, Elio Riboli115, Miguel Rodríguez-Barranco14,116, Jane Romm50, Lori C Sakoda1,117, Clemens Schafmayer118, Robert E Schoen119, Daniela Seminara64, Mitul Shah53, Tameka Shelford50, Min-Ho Shin84, Katerina Shulman120, Sabina Sieri121, Martha L Slattery122, Melissa C Southey123, Zsofia K Stadler124, Christa Stegmaier125, Yu-Ru Su1, Catherine M Tangen65, Stephen N Thibodeau126, Duncan C Thomas4, Sushma S Thomas1, Amanda E Toland127, Antonia Trichopoulou19,20, Cornelia M Ulrich24, David J Van Den Berg4, Franzel J B van Duijnhoven57, Bethany Van Guelpen71, Henk van Kranen128, Joseph Vijai124, Kala Visvanathan79, Pavel Vodicka103,129,130, Ludmila Vodickova103,129,130, Veronika Vymetalkova103,129,130, Korbinian Weigl18,27,131, Stephanie J Weinstein12, Emily White1, Aung Ko Win32,55, C Roland Wolf132, Alicja Wolk86,133, Michael O Woods134, Anna H Wu4, Syed H Zaidi74, Brent W Zanke135, Qing Zhang136, Wei Zheng137, Peter C Scacheri138, John D Potter1, Michael C Bassik11, Anshul Kundaje7,11, Graham Casey139, Victor Moreno13,14,15,77, Goncalo R Abecasis2, Deborah A Nickerson9, Stephen B Gruber4, Li Hsu1,10, Ulrike Peters140,141.
Abstract
To further dissect the genetic architecture of colorectal cancer (CRC), we performed whole-genome sequencing of 1,439 cases and 720 controls, imputed discovered sequence variants and Haplotype Reference Consortium panel variants into genome-wide association study data, and tested for association in 34,869 cases and 29,051 controls. Findings were followed up in an additional 23,262 cases and 38,296 controls. We discovered a strongly protective 0.3% frequency variant signal at CHD1. In a combined meta-analysis of 125,478 individuals, we identified 40 new independent signals at P < 5 × 10-8, bringing the number of known independent signals for CRC to ~100. New signals implicate lower-frequency variants, Krüppel-like factors, Hedgehog signaling, Hippo-YAP signaling, long noncoding RNAs and somatic drivers, and support a role for immune function. Heritability analyses suggest that CRC risk is highly polygenic, and larger, more comprehensive studies enabling rare variant analysis will improve understanding of biology underlying this risk and influence personalized screening strategies and drug development.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30510241 PMCID: PMC6358437 DOI: 10.1038/s41588-018-0286-6
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 41.307
New CRC risk loci reaching genome-wide significance (P < 5×10−8) in the combined (Stage 1 and Stage 2) meta-analysis.
| Stage 1 meta-analysis: up to 34,869 cases and 29,051 controls | Stage 2 meta-analysis: up to 23,262 cases and 38,296 controls | Combined meta-analysis: up to 58,131 cases and 67,347 controls | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Locus | Nearby gene(s) | rsID lead variant | Chr. | Position (Build 37) | Alleles (risk/other) | RAF (%) | OR | 95% CI | OR | 95% CI | OR | 95% CI | |||
| 5q21.1 | rs145364999 | 5 | 98,206,082 | T/A | 99.69 | 1.57 | 1.20–2.05 | 9.0×10−4 | 1.93 | 1.48–2.52 | 1.0×10−6 | 1.74 | 1.45–2.10 | 6.3×10−9 | |
| 3q13.2 | rs72942485 | 3 | 112,999,560 | G/A | 98.02 | 1.16 | 1.07–1.26 | 2.5×10−4 | 1.23 | 1.12–1.35 | 1.5×10−5 | 1.19 | 1.12–1.26 | 2.1×10−8 | |
| 1p34.3 | rs4360494[ | 1 | 38,455,891 | G/C | 45.39 | 1.05 | 1.03–1.08 | 2.9×10−5 | 1.06 | 1.03–1.08 | 3.3×10−5 | 1.05 | 1.04–1.07 | 3.8×10−9 | |
| 1p32.3 | rs12144319 | 1 | 55,246,035 | C/T | 25.48 | 1.07 | 1.04–1.10 | 1.4×10−6 | 1.07 | 1.04–1.10 | 5.5×10−6 | 1.07 | 1.05–1.09 | 3.3×10−11 | |
| 2q24.2 | rs448513[ | 2 | 159,964,552 | C/T | 32.60 | 1.06 | 1.03–1.08 | 1.9×10−5 | 1.05 | 1.02–1.08 | 5.8×10−4 | 1.05 | 1.03–1.07 | 4.4×10−8 | |
| 2q33.1 | rs983402 | 2 | 199,781,586 | T/C | 33.12 | 1.05 | 1.03–1.08 | 7.2×10−5 | 1.08 | 1.05–1.11 | 1.0×10−8 | 1.07 | 1.05–1.09 | 7.7×10−12 | |
| 3q22.2 | rs10049390[ | 3 | 133,701,119 | A/G | 73.53 | 1.06 | 1.03–1.09 | 4.9×10−5 | 1.07 | 1.04–1.10 | 1.8×10−5 | 1.06 | 1.04–1.08 | 3.8×10−9 | |
| 4q24 | rs1391441 | 4 | 106,128,760 | A/G | 67.20 | 1.05 | 1.02–1.07 | 1.5×10−4 | 1.06 | 1.03–1.09 | 2.3×10−5 | 1.05 | 1.03–1.07 | 1.6×10−8 | |
| 4q31.21 | rs11727676 | 4 | 145,659,064 | C/T | 9.80 | 1.08 | 1.03–1.13 | 4.5×10−4 | 1.10 | 1.05–1.14 | 1.5×10−5 | 1.09 | 1.06–1.12 | 2.9×10−8 | |
| 6p21.32 | rs9271695 | 6 | 32,593,080 | G/A | 79.54 | 1.09 | 1.06–1.13 | 1.3×10−7 | 1.09 | 1.05–1.12 | 1.7×10−7 | 1.09 | 1.07–1.12 | 1.1×10−13 | |
| 7p13 | rs12672022[ | 7 | 45,136,423 | T/C | 83.45 | 1.07 | 1.04–1.11 | 1.6×10−5 | 1.06 | 1.03–1.10 | 4.4×10−4 | 1.07 | 1.04–1.09 | 2.8×10−8 | |
| 9p21.3 | rs1537372[ | 9 | 22,103,183 | G/T | 56.92 | 1.05 | 1.02–1.07 | 1.4×10−4 | 1.06 | 1.03–1.08 | 2.4×10−5 | 1.05 | 1.03–1.07 | 1.4×10−8 | |
| 9q22.33 | rs34405347[ | 9 | 101,679,752 | T/G | 90.34 | 1.08 | 1.04–1.13 | 5.5×10−5 | 1.09 | 1.04–1.13 | 1.5×10−4 | 1.09 | 1.05–1.12 | 3.1×10−8 | |
| 9q31.3 | rs10980628 | 9 | 113,671,403 | C/T | 21.06 | 1.05 | 1.02–1.09 | 3.1×10−4 | 1.08 | 1.05–1.11 | 1.3×10−6 | 1.07 | 1.04–1.09 | 2.8×10−9 | |
| 11q22.1 | rs2186607 | 11 | 101,656,397 | T/A | 51.78 | 1.05 | 1.03–1.08 | 1.1×10−5 | 1.05 | 1.03–1.08 | 3.3×10−5 | 1.05 | 1.04–1.07 | 1.5×10−9 | |
| 12q12 | rs11610543[ | 12 | 43,134,191 | G/A | 50.13 | 1.05 | 1.03–1.08 | 1.1×10−5 | 1.06 | 1.03–1.08 | 2.8×10−5 | 1.05 | 1.04–1.07 | 1.3×10−9 | |
| 12q13.3 | rs4759277 | 12 | 57,533,690 | A/C | 35.46 | 1.07 | 1.04–1.09 | 8.4×10−7 | 1.04 | 1.02–1.07 | 1.6×10−3 | 1.05 | 1.04–1.07 | 9.4×10−9 | |
| 13q13.3 | rs7333607 | 13 | 37,462,010 | G/A | 23.50 | 1.09 | 1.06–1.12 | 2.5×10−8 | 1.07 | 1.04–1.10 | 4.4×10−6 | 1.08 | 1.06–1.10 | 6.3×10−13 | |
| 13q22.1 | rs78341008[ | 13 | 73,791,554 | C/T | 7.19 | 1.13 | 1.07–1.18 | 1.4×10−6 | 1.11 | 1.05–1.16 | 4.8×10−5 | 1.12 | 1.08–1.16 | 3.2×10−10 | |
| 13q34 | rs8000189 | 13 | 111,075,881 | T/C | 64.01 | 1.05 | 1.02–1.07 | 2.1×10−4 | 1.07 | 1.04–1.10 | 1.3×10−6 | 1.06 | 1.04–1.08 | 1.8×10−9 | |
| 14q23.1 | rs17094983[ | 14 | 59,189,361 | G/A | 87.73 | 1.10 | 1.07–1.15 | 8.4×10−8 | 1.08 | 1.04–1.12 | 9.0×10−5 | 1.09 | 1.06–1.12 | 4.6×10−11 | |
| 15q22.33 | rs56324967 | 15 | 67,402,824 | C/T | 67.57 | 1.07 | 1.04–1.10 | 2.2×10−7 | 1.08 | 1.05–1.11 | 9.8×10−8 | 1.07 | 1.05–1.09 | 1.1×10−13 | |
| 16q23.2 | rs9930005[ | 16 | 80,043,258 | C/A | 43.03 | 1.05 | 1.03–1.08 | 1.3×10−5 | 1.05 | 1.02–1.07 | 4.0×10−4 | 1.05 | 1.03–1.07 | 2.1×10−8 | |
| 17p12 | rs1078643 | 17 | 10,707,241 | A/G | 76.36 | 1.07 | 1.04–1.10 | 9.2×10−6 | 1.09 | 1.05–1.12 | 1.1×10−7 | 1.08 | 1.05–1.10 | 6.6×10−12 | |
| 17q24.3 | rs983318[ | 17 | 70,413,253 | A/G | 25.26 | 1.07 | 1.04–1.10 | 1.2×10−6 | 1.05 | 1.02–1.08 | 8.0×10−4 | 1.06 | 1.04–1.08 | 5.6×10−9 | |
| 17q25.3 | rs75954926 | 17 | 81,061,048 | G/A | 65.68 | 1.10 | 1.07–1.13 | 9.4×10−11 | 1.09 | 1.06–1.12 | 4.8×10−9 | 1.09 | 1.07–1.11 | 3.0×10−18 | |
| 19p13.11 | rs34797592[ | 19 | 16,417,198 | T/C | 11.82 | 1.09 | 1.05–1.13 | 8.2×10−6 | 1.09 | 1.05–1.13 | 1.2×10−5 | 1.09 | 1.06–1.12 | 4.2×10−10 | |
| 19q13.43 | rs73068325 | 19 | 59,079,096 | T/C | 18.26 | 1.06 | 1.03–1.09 | 2.1×10−4 | 1.07 | 1.04–1.11 | 5.0×10−5 | 1.07 | 1.04–1.09 | 4.2×10−8 | |
| 20q13.12 | rs6031311 | 20 | 42,666,475 | T/C | 75.91 | 1.07 | 1.04–1.10 | 1.7×10−6 | 1.05 | 1.02–1.08 | 7.6×10−4 | 1.06 | 1.04–1.08 | 6.8×10−9 | |
| 20q13.33 | rs2738783[ | 20 | 62,308,612 | T/G | 20.29 | 1.07 | 1.04–1.10 | 2.6×10−6 | 1.05 | 1.02–1.08 | 3.3×10−3 | 1.06 | 1.04–1.08 | 5.3×10−8 | |
Lead variant is the most associated variant at the locus. rsIDs based on NCBI dbSNP Build 150. Alleles are on the + strand. Chr.: Chromosome. RAF: Risk allele frequency, based on stage 2 data. OR, odds ratio estimate for the risk allele. All P-values reported in this table are based on fixed-effects inverse variance-weighted meta-analysis.
Indicates that variant or LD proxy (r2>0.7) was selected for our custom genotyping panel and formally replicates in the Stage 2 meta-analysis at a Bonferroni significance threshold of P < 7.8×10−6.
Indicates that variant or LD proxy (r2>0.7) was selected for our custom genotyping panel but did not attain Bonferroni significance in the Stage 2 meta-analysis.
This SNP reached genome-wide significance in the combined (Stage 1 + Stage 2) sample-size weighted meta-analysis based on likelihood ratio test results (P = 4.9×10−8).
Additional new conditionally independent association signals at known and newly identified CRC risk loci that reach genome-wide significance (P < 5×10−8) in the combined meta-analysis of up to 125,478 individuals.
| Joint multiple-variant analysis | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Locus | Nearby gene(s) | rsID lead variant | Chr. | Position (Build 37) | Alleles (risk/other) | RAF (%) | ORunconditional | 95% CI | Conditioning variant(s) | ORconditional | 95% CI | ||
| 11q13.4 | rs61389091 | 11 | 74,427,921 | C/T | 96.06 | 1.23 | 1.18–1.29 | 1.2×10−18 | rs7121958 | 1.21 | 1.16–1.27 | 3.7×10−16 | |
| 2q33.1 | rs11884596 | 2 | 199,612,407 | C/T | 38.23 | 1.06 | 1.04–1.08 | 1.1×10−9 | rs983402 | 1.06 | 1.04–1.07 | 3.6×10−9 | |
| 5p15.33 | rs78368589 | 5 | 1,240,204 | T/C | 5.97 | 1.14 | 1.10–1.18 | 9.4×10−12 | rs2735940 | 1.12 | 1.08–1.16 | 4.1×10−9 | |
| 5p13.1 | rs7708610 | 5 | 40,102,443 | A/G | 35.64 | 1.04 | 1.02–1.06 | 1.5×10−5 | rs12514517 | 1.06 | 1.04–1.08 | 3.8×10−9 | |
| 6p21.32 | rs2516420 | 6 | 31,449,620 | C/T | 92.63 | 1.10 | 1.06–1.13 | 1.3×10−7 | rs9271695, rs116685461, rs116353863 | 1.12 | 1.08–1.16 | 2.0×10−10 | |
| 8q24.21 | rs4313119 | 8 | 128,571,855 | G/T | 74.86 | 1.06 | 1.04–1.08 | 1.0×10−9 | rs6983267 | 1.06 | 1.04–1.08 | 2.1×10−9 | |
| 12p13.32 | rs3217874 | 12 | 4,400,808 | T/C | 42.82 | 1.08 | 1.06–1.10 | 1.2×10−17 | rs3217810 | 1.06 | 1.04–1.08 | 2.4×10−9 | |
| 15q13.3 | rs17816465 | 15 | 33,156,386 | A/G | 20.55 | 1.07 | 1.04–1.09 | 6.8×10−9 | rs2293581 | 1.07 | 1.05–1.10 | 1.4×10−10 | |
| 20p12.3 | rs28488 | 20 | 6,762,221 | T/C | 63.88 | 1.06 | 1.04–1.08 | 2.6×10−11 | rs189583 | 1.07 | 1.05–1.09 | 2.6×10−14 | |
| 20p12.3 | rs994308 | 20 | 6,603,622 | C/T | 59.39 | 1.08 | 1.06–1.10 | 4.8×10−18 | rs189583 | 1.06 | 1.05–1.08 | 8.6×10−12 | |
Lead variant is the most associated variant at the locus in the conditional analysis. rsIDs based on NCBI dbSNP Build 150. Alleles are on the + strand. Chr.: Chromosome. RAF: Risk allele frequency, based on stage 2 data. OR, odds ratio estimates are for the risk allele. Conditioning variants are the lead variant of other conditionally independent association signals with P < 1×10−5 within 1-Mb of the new association signal. Because of extensive LD we used a 2-Mb distance for the MHC region (6p21.32). All lead variants for the new association signals are in linkage equilibrium with any previously reported CRC risk variants at the locus (r2 <0.10).
Indicates that the conditioning variant is either the index variant, or a variant in LD with the index variant reported in previous GWAS. Details and full results are provided in Supplementary Table 5.
Figure 1Conditionally independent association signals at the BMP2 locus.
Regional association plot showing the unconditional −log10(P-value) for the association with CRC risk in the combined meta-analysis of up to 125,478 individuals, as a function of genomic position (Build 37) for each variant in the region. The lead variants are indicated by a diamond symbol and its positions are indicated by dashed vertical lines. The color-labeling and shape of all other variants indicate the lead variant with which they are in strongest LD. The two new genome-wide significant signals are indicated by an asterisk.
Figure 2Functional genomic annotation of new CRC risk locus overlapping KLF5 super-enhancer.
Top: Regional association plot showing the unconditional −log10(P-value) for the association with CRC risk in the combined meta-analysis of up to 125,478 individuals, as a function of genomic position (Build 37) for each variant in the region. The lead variants are indicated by a diamond symbol and its positions are indicated by dashed vertical lines. The color-labeling and shape of all other variants indicate the lead variant with which they are in strongest LD. Bottom: UCSC genome browser annotations for region overlapping the super-enhancer flanked by KLF5 and KLF12, and spanning variants in LD with rs78341008, and with two conditionally independent association signals indexed by rs45597035 and rs1924816. The region is annotated with the following tracks (from top to bottom): UCSC gene annotations; epigenomic profiles showing MACS2 peak calls as transparent overlays for different samples taken from non-diseased colonic crypt cells or colon tissue (purple) and from different primary CRC cell lines or tumor samples (teal); position of the lead variants and variants in LD with the lead; variants in the 99% credible set; the union of super-enhancers called using the ROSE package; gray bars highlight the targeted enhancers (e1,e3, and e4) previously shown by Zhang et al.[31] to have combinatorial effects on KLF5 expression. ATAC-seq data newly generated for this study show high resolution annotation of putative binding regions within the active super-enhancer further fine-mapping putative causal variants at each of the three signals.
Figure 3Recommended age to start CRC screening based on a polygenic risk score (PRS).
The PRS was constructed using the 95 known and newly discovered variants. The horizontal lines represent the recommended age for the first endoscopy for an average-risk person in the current screening guideline for CRC. The risk threshold to determine the age for the first screening was set as the average of 10-year CRC risks for a 50-year-old man (1.25%) and woman (0.68%), i.e. (1.25%+0.68%)/2 = 0.97%, who have not previously received an endoscopy. Details are given in the Online Methods.