| Literature DB >> 30820706 |
Stephanie A Bien1,2, Yu-Ru Su3,4, David V Conti5,6,4, Tabitha A Harrison3,4, Conghui Qu3,4, Xingyi Guo7,4, Yingchang Lu7,4, Demetrius Albanes8,4, Paul L Auer9,4, Barbara L Banbury3,4, Sonja I Berndt8,4, Stéphane Bézieau10,11,4, Hermann Brenner12,13,14,4, Daniel D Buchanan15,16,17,4, Bette J Caan18,4, Peter T Campbell19,4, Christopher S Carlson3,4, Andrew T Chan20,21,4, Jenny Chang-Claude22,23,4, Sai Chen24,4, Charles M Connolly3,4, Douglas F Easton25,4, Edith J M Feskens26,4, Steven Gallinger27,4, Graham G Giles15,28,4, Marc J Gunter29,4, Jochen Hampe30,4, Jeroen R Huyghe3,4, Michael Hoffmeister12,4, Thomas J Hudson31,32,4, Eric J Jacobs19,4, Mark A Jenkins15,4, Ellen Kampman26,4, Hyun Min Kang24,4, Tilman Kühn33,4, Sébastien Küry10,11,4, Flavio Lejbkowicz34,35,4, Loic Le Marchand36,4, Roger L Milne15,28,4, Li Li37,4, Christopher I Li3,4, Annika Lindblom38,39,4, Noralane M Lindor40,4, Vicente Martín41,42,4, Caroline E McNeil5,4, Marilena Melas5,4, Victor Moreno42,43,44,4, Polly A Newcomb3,4, Kenneth Offit45,4, Paul D P Pharaoh46,4, John D Potter3,4, Chenxu Qu5,4, Elio Riboli47,4, Gad Rennert34,35,4, Núria Sala48,49,4, Clemens Schafmayer50,4, Peter C Scacheri51,4, Stephanie L Schmit52,53,4, Gianluca Severi54,4, Martha L Slattery55,4, Joshua D Smith56,4, Antonia Trichopoulou57,58,4, Rosario Tumino59,4, Cornelia M Ulrich60,4, Fränzel J B van Duijnhoven26,4, Bethany Van Guelpen61,4, Stephanie J Weinstein8,4, Emily White3,4, Alicja Wolk62,63,4, Michael O Woods64,4, Anna H Wu5,6,4, Goncalo R Abecasis24,4, Graham Casey54,4, Deborah A Nickerson56,4, Stephen B Gruber5,4, Li Hsu3,4, Wei Zheng7,4,65, Ulrike Peters3,4.
Abstract
Genome-wide association studies have reported 56 independently associated colorectal cancer (CRC) risk variants, most of which are non-coding and believed to exert their effects by modulating gene expression. The computational method PrediXcan uses cis-regulatory variant predictors to impute expression and perform gene-level association tests in GWAS without directly measured transcriptomes. In this study, we used reference datasets from colon (n = 169) and whole blood (n = 922) transcriptomes to test CRC association with genetically determined expression levels in a genome-wide analysis of 12,186 cases and 14,718 controls. Three novel associations were discovered from colon transverse models at FDR ≤ 0.2 and further evaluated in an independent replication including 32,825 cases and 39,933 controls. After adjusting for multiple comparisons, we found statistically significant associations using colon transcriptome models with TRIM4 (discovery P = 2.2 × 10- 4, replication P = 0.01), and PYGL (discovery P = 2.3 × 10- 4, replication P = 6.7 × 10- 4). Interestingly, both genes encode proteins that influence redox homeostasis and are related to cellular metabolic reprogramming in tumors, implicating a novel CRC pathway linked to cell growth and proliferation. Defining CRC risk regions as one megabase up- and downstream of one of the 56 independent risk variants, we defined 44 non-overlapping CRC-risk regions. Among these risk regions, we identified genes associated with CRC (P < 0.05) in 34/44 CRC-risk regions. Importantly, CRC association was found for two genes in the previously reported 2q25 locus, CXCR1 and CXCR2, which are potential cancer therapeutic targets. These findings provide strong candidate genes to prioritize for subsequent laboratory follow-up of GWAS loci. This study is the first to implement PrediXcan in a large colorectal cancer study and findings highlight the utility of integrating transcriptome data in GWAS for discovery of, and biological insight into, risk loci.Entities:
Mesh:
Year: 2019 PMID: 30820706 PMCID: PMC6483948 DOI: 10.1007/s00439-019-01989-8
Source DB: PubMed Journal: Hum Genet ISSN: 0340-6717 Impact factor: 4.132
Fig. 1Schematic illustration of the study design training data was comprised of joint observations of imputed variant genotypes and tissue-specific gene expression from reference datasets (DGN and GTEx). Elastic net regularization was used to train genetic variant predictors of gene expression and downloaded from PredictDB.org. Models for colon transverse tissues and whole blood were used for imputation of expression into independent GWAS datasets for Colorectal Cancer (CRC). Imputed gene expression was then tested for association with case (ca.)–control (co.) status in the discovery stage. Novel gene associations with a false discovery rate (FDR) = 0.2 were assessed in an independent CRC GWAS dataset. As a secondary analysis, the association of genetically determined expression of genes in 44 GWAS-associated risk regions was examined
Genes passing discovery threshold in novel loci from colon transverse PrediXcan
| Locus | Gene | Direction of gene expression for increased CRC risk | Discovery ( | Replication ( | PrediXcan gene model information | |
|---|---|---|---|---|---|---|
|
|
|
| Number of predictive variants | |||
| 7q22.1 |
| Decrease | 1.7 × 10− 4 | 1.1 × 10− 2 | 0.51 | 62 |
| 14q22.1 |
| Decrease | 2.3 × 10− 4 | 8.7 × 10− 4 | 0.26 | 23 |
| 16q24.3 |
| Increase | 1.3 × 10− 4 | 0.62 | 0.14 | 29 |
P For the association between CRC and the genetically determined gene expression in discovery and replication GWAS studies
R2 = the cross-validated R2 value found when training the model (predictive R2 from PredictDB.org). Replicated at α = 0.05/3 genes = 1.7 × 10− 2
Known GWAS-risk regions overlapping genes that show association of genetically regulated gene expression with CRC
| Region | Gene count in region | PrediXcan results for genes with | GWAS publication for independent index variant(s)c | Variant(s) with differential allelic effects and gene regulated | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CCDS gene build | Genes with genetically imputed gene expressiona | Gene set (decreasing order of significance) | Number of genes (% reduced from CCDS)b | Reported gene(s) | rsID dbSNP function (note) | References | |||||||
| CT | WB | CT∩WB (% overlap)d | CT | WB | CT + WB | CT | WB | ||||||
| 1p36.12 | 20 | 11 | 16 | 9 (50) |
|
| 1 (95) | – | 0.02 |
| rs72647484, intergenic | Al-Tassan et al. ( | – |
| 1q25.3 | 19 | 8 | 14 | 7 (47) |
|
| 4 (79) | 0.02 | 3 × 10− 6 |
| rs10911251, intronic | Peters et al. ( | – |
| 1q41 | 8 | 6 | 8 | 5 (56) |
|
| 3 (63) | 0.05 | 0.02 |
| rs6691170, intergenic | Houlston et al. ( | – |
| 2q35 | 41 | 12 | 34 | 10 (27) |
|
| 8 (80) | 3 × 10− 3 | 8 × 10− 5 |
| rs992157, intronic (tags missense) | Orlando et al. ( | – |
| 3p22.1 | 9 | 2 | 5 | 2 (40) |
|
| 1 (88) | – | 0.04 |
| rs35360328, intergenic | Schumacher et al. ( | – |
| 3p14.1 | 4 | 2 | 4 | 2 (50) |
|
| 1 (75) | 2 × 10− 3 | 1 × 10− 3 |
| rs812481, intronic | Schumacher et al. ( | – |
| 5p15.33 | 20 | 17 | 16 | 10 (43) |
|
| 2 (90) | 0.02 | 0.02 |
| rs2736100, intronic | Kinnersley et al. ( | – |
| 5q22.2 | 8 | 6 | 6 | 6 (100) |
|
| 1 (87) | 9 × 10− 3 | – |
| rs1801155, missense- | Niell et al. ( | – |
| 5q31.1 | 24 | 10 | 14 | 8 (50) |
|
| 2 (92) |
| 0.04 |
| rs647161, intergenic | Jia et al. ( | – |
| 6p21.2 | 31 | 21 | 24 | 15 (50) |
|
| 4 (87) | – | 0.01 |
| rs1321311, intergenic | Dunlop et al. ( | – |
| 6p21.1 | 30 | 14 | 22 | 10 (38) |
|
| 1 (97) |
| 0.01 |
| rs4711689, intronic | Zeng et al. ( | – |
| 6q22.1 | 16 | 9 | 6 | 3 (25) |
|
| 3 (81) | 9 × 10− 3 | 0.01 |
| rs4946260, intronic | Schumacher et al. ( | – |
| 6q25.3 | 16 | 11 | 11 | 8 (57) |
|
| 1 (94) | 7 × 10− 3 |
|
| rs7758229 | Cui et al. ( | – |
| 8q23.3 | 6 | 5 | 5 | 3 (43) |
|
| 3 (50) | 0.02 | 6 × 10− 3 |
| rs2450115, intergenic; rs16892766, intergenic; rs6469656, intergenic | Tomlinson et al. ( | rs16888589; |
| 8q24.21 | 5 | 2 | 4 | 2 (67) |
|
| 2 (60) | 6 × 10− 10 | 0.01 |
| rs6983267, intergenic | Tomlinson et al. ( | rs6983267; |
| 9q24 | 11 | 8 | 9 | 6 (55) |
|
| 1 (91) | 0.03 |
|
| rs719725, intergenic | Zanke et al. ( | – |
| 10p14 | 5 | 2 | 4 | 2 (50) |
|
| 1 (80) | 0.01 |
|
| rs10795668 | Tomlinson et al. ( | – |
| 10q24.2 | 74 | 26 | 42 | 9 (53) |
|
| 6 (70) | 5 × 10− 3 | 9 × 10− 4 |
| rs1035209, intergenic | Whiffin et al. ( | – |
| 10q25.2 | 12 | 6 | 7 | 5 (63) |
|
| 1 (92) |
| 0.05 |
| rs12241008, intronic; rs11196172 | Zhang et al. ( | – |
| 11q12.2 | 74 | 26 | 42 | 15 (28) |
|
| 8 (89) | 4 × 10− 3 | 5 × 10− 4 |
| rs174537, intronic rs60892987, intergenic | Zhang et al. ( | – |
| 11q13.4 | 29 | 14 | 22 | 9 (33) |
|
| 4 (67) | 7 × 10− 5 | 8 × 10− 3 |
| rs3824999, intronic | Dunlop et al. ( | – |
| 11q23.1 | 27 | 14 | 13 | 8 (42) |
|
| 4 (85) | 1 × 10− 6 | – |
| rs3802842, intronic | Tenesa et al. ( | rs7130173; |
| 12p13.32 | 71 | 40 | 53 | 33 (55) |
|
| 3 (96) | 0.04 | 6 × 10− 3 |
| rs10774214, intergenic; rs3217810, intergenic rs10849432, intergenic rs11064437, splice donor- | Jia et al. ( | – |
| 12q13.12 | 32 | 16 | 20 | 9 (33) |
|
| 13 (59) | 8 × 10− 6 | 3 × 10− 4 |
| rs11169552, intronic | Houlston et al. ( | – |
| 12q24.12 | 24 | 12 | 18 | 9 (43) |
|
| 9 (63) | 2 × 10− 3 | 1 × 10− 6 |
| rs3184504, missense | Schumacher et al. ( | – |
| 12q24.22 | 14 | 6 | 11 | 4 (31) |
|
| 2 (86) | 1 × 10− 2 | 9 × 10− 3 |
| rs7320812 | Schumacher et al. ( | – |
| 15q13.3 | 9 | 5 | 2 | 2 (40) |
| – | 1 (88) | 0.04 | – |
| rs16969681 intergenic rs11632715, intergenic | Tomlinson et al. ( | rs16969681; |
| 16q22.1 | 41 | 23 | 35 | 19 (49) | – |
| 2 (98) | – | 8 × 10− 3 |
| rs9929218, intronic | COGENT Study et al. ( | rs5030625; |
| 17p13.3 | 27 | 19 | 24 | 17 (65) | FAM57A, GEMIN4, BMLHA9 |
| 3 (89) | 1 × 10− 3 | 0.01 |
| rs12603526, intronic | Zhang et al. ( | – |
| 18q21.1 | 10 | 5 | 7 | 3 (33) |
|
| 3 (70) | 8 × 10− 3 | 0.04 |
| rs7229639 intronic rs4939827 intronic | Broderick et al. ( | rs6507874, rs6507875, rs8085824, and rs5892087, |
| 19q13.11 | 20 | 13 | 17 | 11 (58) |
|
| 1 (95) | 0.04 | 0.02 |
| rs10411210 intronic | COGENT Study et al. ( | – |
| 19q13.2 | 59 | 24 | 37 | 15 (33) |
|
| 5 (92) | 0.03 | 6 × 10− 3 |
| rs1800469 intronic (tags missense) | Zhang et al. ( | – |
| 20q13.13 | 9 | 4 | 7 | 3 (38) |
|
| 2 (78) | 7 × 10− 3 | 7 × 10− 3 |
| rs6066825 intronic | Schumacher et al. ( | – |
| 20q13.33 | 27 | 20 | 23 | 15 (54) |
|
| 3 (89) | 0.05 | 5 × 10− 3 |
| rs4925386 intronic | Houlston et al. ( | – |
CCDS genes were counted, regardless of tissue relevance, 500 kb upstream or downstream of an index variant
CT colon transverse, WB whole blood, No. number— no genes meeting criteria. In known loci, genes with gene expression predictive R2 < 0.01 were included
aGenes with predicted expression in the corresponding tissue
bNumber of genes with a P value ≤ 0.05. % Red. = (# of genes with P value ≤ 0.05/# CCDS genes) × 100
cConditionally independent in statistical models containing both variants or LD r2 < 0.2
dThe intersect of genes in CT and WB models