Literature DB >> 31818908

Systematic meta-analyses, field synopsis and global assessment of the evidence of genetic association studies in colorectal cancer.

Zahra Montazeri1, Xue Li2, Christine Nyiraneza1, Xiangyu Ma3, Maria Timofeeva4, Victoria Svinti4, Xiangrui Meng2, Yazhou He2, Yacong Bo5, Samuel Morgan1, Sergi Castellví-Bel6, Clara Ruiz-Ponte7, Ceres Fernández-Rozadilla7, Ángel Carracedo7, Antoni Castells6, Timothy Bishop8, Daniel Buchanan9,10,11, Mark A Jenkins9, Temitope O Keku12, Annika Lindblom13,14, Fränzel J B van Duijnhoven15, Anna Wu16, Susan M Farrington4, Malcolm G Dunlop4, Harry Campbell2, Evropi Theodoratou17,18, Wei Zheng19, Julian Little1.   

Abstract

OBJECTIVE: To provide an understanding of the role of common genetic variations in colorectal cancer (CRC) risk, we report an updated field synopsis and comprehensive assessment of evidence to catalogue all genetic markers for CRC (CRCgene2).
DESIGN: We included 869 publications after parallel literature review and extracted data for 1063 polymorphisms in 303 different genes. Meta-analyses were performed for 308 single nucleotide polymorphisms (SNPs) in 158 different genes with at least three independent studies available for analysis. Scottish, Canadian and Spanish data from genome-wide association studies (GWASs) were incorporated for the meta-analyses of 132 SNPs. To assess and classify the credibility of the associations, we applied the Venice criteria and Bayesian False-Discovery Probability (BFDP). Genetic associations classified as 'positive' and 'less-credible positive' were further validated in three large GWAS consortia conducted in populations of European origin.
RESULTS: We initially identified 18 independent variants at 16 loci that were classified as 'positive' polymorphisms for their highly credible associations with CRC risk and 59 variants at 49 loci that were classified as 'less-credible positive' SNPs; 72.2% of the 'positive' SNPs were successfully replicated in three large GWASs and the ones that were not replicated were downgraded to 'less-credible' positive (reducing the 'positive' variants to 14 at 11 loci). For the remaining 231 variants, which were previously reported, our meta-analyses found no evidence to support their associations with CRC risk.
CONCLUSION: The CRCgene2 database provides an updated list of genetic variants related to CRC risk by using harmonised methods to assess their credibility. © Author(s) (or their employer(s)) 2020. Re-use permitted under CC BY. Published by BMJ.

Entities:  

Keywords:  colorectal cancer

Mesh:

Substances:

Year:  2019        PMID: 31818908      PMCID: PMC7398467          DOI: 10.1136/gutjnl-2019-319313

Source DB:  PubMed          Journal:  Gut        ISSN: 0017-5749            Impact factor:   23.059


Colorectal cancer (CRC) is a global public health challenge. A large number of genetic association studies have been conducted to assess the potential correlation between common genetic variations and CRC risk. Using an established framework for grading credibility of genetic associations, we classified 14 independent variants at 12 loci (MUTYH, SMAD7, TERT, CDH1, RHPN2, BMP2, TGFB1 and common variants tagging loci at 8q24, 8q23.3, 10p14, 11q23.1, 20p12.3) as highly credibly associated with CRC risk. A total of 63 variants at 52 loci were classified as ‘less-credible positive’ SNPs; variants of nine of these genes could be mostly highly prioritised for further investigation. For 231 variants previously reported to be associated with CRC, our meta-analyses found no evidence to support such associations. This database will be helpful for future research by promoting the investigation of these variants and corresponding genetic loci in populations other than of European origin, serving as a genetic basis for predicting risk estimates for population groups and providing candidate genes for further functional studies or gene-environment interaction studies.

Introduction

Colorectal cancer (CRC) is one of the most commonly diagnosed malignancies and the leading cause of cancer deaths in the world, with 1.65 million new cases and about 835 000 deaths in 2015.1 The global burden of CRC is expected to increase by 60%, with more than 2.2 million new cases and almost 1.1 million deaths occurring annually by 2030.2 The distribution of CRC global burden varies widely, with more than two-thirds of all cases and about 60% of all deaths occurring in countries with a high or very high human development index, including Australia and New Zealand, Europe and North America, while incidence and mortality rates in Africa and South-Central Asia are relatively low.1 2 These geographic differences appear to be attributable to the differences in both environmental exposures and the background of genetically determined susceptibility.3 It is estimated that 15%–25% of CRC risk variance is attributed to inherited genetic factors, and the first-degree relatives of CRC patients have two to four times higher risk of developing CRC.4 5 The inherited genetic risk of CRC can be partly accounted for by a combination of rare high-penetrance mutations and large numbers of common genetic variants each conferring small risk.6 Although a number of highly penetrant mutations (eg, DNA mismatch repair genes, APC, SMAD4, LKB1/STK11, MUTYH) have been identified to influence CRC susceptibility with large effects, overall they account for only 2%–5% of incident CRC cases in the general population because these mutations are very rare.7–9 Candidate gene association studies have investigated the role of a large number of common genetic variants in CRC risk, but only a small number of them have been successfully replicated in subsequent studies.10 11 In 2012 and 2014, we reported two independently conducted series of meta-analyses to systematically evaluate associations between CRC and common variants using data from candidate gene studies and genome-wide association studies (GWASs) and identified a number of promising genetic risk variants for CRC risk.10 11 Our first field synopsis (published in 2012) reported 16 variants in 13 independent loci (MUTYH, MTHFR, SMAD7, 8q24, 8q23.3, 11q23.1, 14q22.2, 1q41, 20p12.3, 20q13.33, 3q26.2, 16q22.1, 19q13.1),10 and the second field synopsis (published in 2014) identified 8 additional variants in 5 independent loci (APC, CHEK2, DNMT3B, MLH1, MUTYH) that were strongly associated with CRC.11 These two synopses used slightly different methodologies, in that the 2012 field synopsis only included variants reported in four or more studies in meta-analyses, whereas the 2014 included variants with three or more studies, and there were some differences in the criteria applied for the evidence appraisal.10 11 In this study, we aimed to perform an updated field synopsis for CRC risk by including the most recently published genetic association studies, by following established guidelines12 13 and using harmonised methods for evidence appraisal.14–16 We systematically captured all published genetic association data on CRC for meta-analyses and subsequently incorporated data from GWAS consortia for interrogation. This study provides an up-to-date and publicly available database for CRC genetics (CRCgene2) and presents these data within a defined statistical and causal inference framework to aid interpretation of the results.13 We aimed to provide new insights into the fundamental biological mechanisms involved in colorectal carcinogenesis.

Methods

Literature search and data collection

To identify genetic association studies of CRC risk, we searched the Medline database via the Ovid gateway and the search terms comprising medical subject headings (MeSH) and keywords relating to colorectal neoplasms, the MeSH heading ‘genetic predisposition to disease’, and the keywords ‘gene$’ and ‘associate$’ were applied to terms in the entire article. The latest literature search was performed on 21 November 2018. We screened the eligibility of retrieved publications in a three-step parallel review of title, abstract and full text by following the predefined inclusion and exclusion criteria. Thus, each eligible study evaluated the association between a polymorphic genetic variant (with minor allele frequency ≥0.01 in the reference panel of the 1000 genomes) and a sporadic CRC. Studies investigating only premalignant conditions such as adenomas, polyps or dysplastic tissue were excluded. Studies investigating hereditary CRC syndromes, such as familial adenomatous polyposis, hereditary non-polyposis CRC, juvenile polyposis syndrome and Gardner’s syndrome; solely focusing on the progression or histological phenotype of CRC; or studies in animals; were excluded. Case-control, cohort and GWASs were included, while family-based studies were excluded. All included studies were published in English in a peer-reviewed journal; studies only reported as conference abstracts were excluded. Data from the eligible studies were abstracted into two standardised tables, including the key variables with regard to the study identifiers and context, study design and limitations, genotype information and outcome effects. A list of genetic variants that were investigated in meta-analyses was summarised and data from three GWAS consortia (Scotland, Canada and Spain) were incorporated for meta-analyses, when the genotype data are available for the listed variants. In brief, the Study of CRC in Scotland (SOCCS) is a population-based case-control GWAS that includes 3417 cases and 3500 controls. The Assessment of Risk for Colorectal Tumors In Canada (ARCTIC) is a case-control GWAS database that includes 1231 cases and 1240 controls. The population-based cohort study in Spain comprised two phases (EPICOLON I and EPICOLON II) adding up to 2000 cases and 2000 controls. Restricted candidate gene genotyping data were available from both phases and GWAS data were only accessible from 881 cases and 667 controls from phase 2.17 18 More details about these GWAS datasets are present in online supplementary text.

Statistical analysis

Meta-analysis was performed for genetic variants with data available from at least three independent studies. Summary crude ORs and 95% CI for allelic, recessive and dominant genetic models were calculated by applying either the fixed-effect model (Mantel-Haenszel method) or the random-effects model (DerSimonian-Laird method) in case of the existence of substantial heterogeneity. The Q statistic (with a threshold of p value <0.05) and I2 metric were calculated to quantify between-study heterogeneity. Funnel plot analysis with an Egger test was conducted to test for small study effects. We also estimated the statistical power of each meta-analysis based on the significance level of α=0.05, the effect sizes and the allele frequencies of genetic variants (an integral component of the Bayesian False-Discovery Probability (BFDP) analysis).19 All statistical analysis was conducted by using R software (R x64 3.1.0).

Credibility of the identified genetic associations

We first applied the BFDP19 and the Venice criteria12 13 to assess the credibility of any observed genetic associations with p<0.05 in at least one genetic model. We then validated these associations in three additional GWAS consortia: Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO),20 Colorectal Transdisciplinary Study (CORECT, https://research.fhcrc.org/peters/en/corect-study.html) and Colon Colorectal Cancer Family Registry (CFR).21 With meta-analysis of these three GWAS datasets, we validated the observed genetic associations using data from 58 131 CRC cases and 67 347 controls (online supplementary text), and the statistical power of validation was estimated accordingly. The BFDP assesses the noteworthiness of an observed association. The BFDP was selected rather than the false-positive report probability (FPRP) because it uses more information, defines the noteworthiness threshold explicitly in terms of the costs of false discovery and non-discovery, and does not suffer from the inferential limitations identified for the FPRP.22 We calculated BFDP values at two levels of prior probability: a medium/low prior level (0.05 to 10−3), close to what would be expected for a candidate gene, and a very low prior level (10−4 to 10−6), close to what would be expected for a random SNP. A noteworthy threshold was defined as 0.2 based on the assumption that the cost of false discovery would be four times higher than that of false non-discovery.19 According to the Venice criteria, the credibility of associations is assessed for three aspects: the amount of evidence, the extent of replication and the protection from bias.12 13 We used statistical power to assess the volume of evidence and a grade of A, B and C was assigned, respectively, when statistical power was greater than 80%, 50%–79% or less than 50%. The extent of replication was assessed by the measurement of heterogeneity (I2 criterion), and a grade of A, B and C was assigned, respectively, when I2 was less than 25%, 25%–49% or greater than 50%.12 For protection from bias, a complete assessment is difficult; instead, we considered the following aspects: (1) the phenotype definition was addressed by our inclusion criteria—namely that cases would have newly incident CRC; (2) genotyping error rates are generally low; (3) the criterion of replication across studies in part addresses potential concerns about variation in genotyping quality between studies; and (4) the magnitude of effect of population stratification appears to be small in general.23 Genetic associations were then classified into four categories based on the following criteria. Associations were classified as ‘positive’ if they: (1) were statistically significant at a p value level of 0.05 in at least two of the genetic models, (2) had a BFDP less than 0.20 at least at the p value level of 0.05, (3) had a statistical power greater than 80%, (4) had an I2 less than 50%. A class of ‘less-credible positive’, with a less-stringent threshold, was assigned to the associations (1) that were statistically significant at a p value threshold of 0.05 in at least one of the genetic models, but (2) their BFDP was greater than 0.20 or their statistical power was between 50% and 79% or had an I2 greater than 50%. Associations with p value large than 0.05 were further classified as ‘null’ or ‘negative’ by assessing if there are more than 5000 cases. After credibility assessment, genetic variants classified as ‘positive’ and ‘less-credible positive’ were sent to the CORECT coordinating centre to validate their associations with CRC risk using additive, dominant and recessive models. At this stage, ‘positive’ associations that failed to be validated (at p<0.05) were downgraded to ‘less-credible positive’. A schematic diagram is shown in figure 1 to demonstrate datasets included in each phase of the analysis.
Figure 1

Diagram of the study design. ARCTIC, Assessment of Risk for Colorectal Tumors In Canada; BFDP, Bayesian False-Discovery Probability; CFR, Colorectal Cancer Family Registry; CORECT, Colorectal Transdisciplinary Study; CRC, colorectal cancer; EPICOLON, Gastrointestinal Oncology Group of the Spanish Gastroenterological Association; GECCO, Genetics and Epidemiology of Colorectal Cancer Consortium; GWASs, genome-wide association studies; SOCCS, Study of CRC in Scotland.

Diagram of the study design. ARCTIC, Assessment of Risk for Colorectal Tumors In Canada; BFDP, Bayesian False-Discovery Probability; CFR, Colorectal Cancer Family Registry; CORECT, Colorectal Transdisciplinary Study; CRC, colorectal cancer; EPICOLON, Gastrointestinal Oncology Group of the Spanish Gastroenterological Association; GECCO, Genetics and Epidemiology of Colorectal Cancer Consortium; GWASs, genome-wide association studies; SOCCS, Study of CRC in Scotland.

Results

A total of 20 900 citations were identified from literature search. Of these, 6770 (32.4%) papers were published after the search period of the most recent field synopsis (31 December 2012).11 After eligibility screening, we finally included and extracted data from 869 publications (figure 2), reporting the association of CRC risk with 1063 polymorphisms in 303 different genes, of which 308 polymorphisms were reported in at least three independent studies.
Figure 2

The distribution of included studies published from 1991 to 2018.

The distribution of included studies published from 1991 to 2018.

Meta-analyses

Meta-analyses were conducted for 308 polymorphisms in 158 different loci with data available in three or more candidate or GWA studies (online supplementary table 1). On average, these meta-analyses were based on 6149 CRC cases (median; IQR=2301–7334) and 7337 controls (median; IQR=2809–8885) originating from 8 (median; IQR=4–9) case–control studies. Data from the Scottish, Canadian and/ or Spanish GWAS were incorporated in the meta-analyses for 132 SNPs. Summary crude ORs and 95% CI for the allelic, dominant and recessive models are presented in table 1. Of the meta-analyses for 308 polymorphisms (tagged at different 158 loci), a total of 77 SNPs (25.6%) (tagged in 61 different loci) were identified to have a nominally statistically significant association (p value <0.05) with CRC risk in at least one of the three genetic models and were eligible for credibility assessment using the BFDP19 (online supplementary tables 2–4) and the Venice criteria12 13 and for validation in the three (GECCO, CORECT and CCFR) GWAS consortia (online supplementary tables 5–7).
Table 1

Summary of ‘credible positive associations’ from meta-analyses and credibility assessment

GeneVariantCases vs controls(number of samples)Allelic modelPowerRecessive model: var/var vs wt/wt and wt/varDominant model: wt/var and var/var vs wt/wtClassification
NEffect sizeHeterogeneityNEffect sizeHeterogeneityPowerNEffect sizeHeterogeneityPower
OR (95% CI)P valueI2 (95% CI)P valueOR (95% CI)P valueI2 (95% CI)P valueOR (95% CI)P valueI2 (95% CI)P value
MUTYH rs3605399328 302 vs 20 935 (19*)191.42 (1.22 to 1.66)4.89E-060 (0 to 33)0.7030.99nananananana191.31 (1.12 to 1.53)0.0010 (0 to 24)0.8370.88Positive
MUTYH rs3461234228 180 vs 20 923 (19*)171.89 (1.47 to 2.42)5.72E-070 (0 to 62)0.8281.0073.40 (1.22 to 9.48)0.0190 (0 to 0)0.9981.00171.80 (1.39 to 2.31)6.42E-060 (0 to 52)0.8841.00Positive
SMAD7 rs1295371728 006 vs 26 539 (13*)131.11 (1.07 to 1.16)6.24E-0749 (0 to 93)0.0241.00131.17 (1.08 to 1.26)8.34E-0545 (0 to 94)0.0411.00131.14 (1.07 to 1.21)5.00E-0554 (10 to 95)0.0201.00Positive
SMAD7 rs446414817 772 vs 17 356 (9*)91.12 (1.08 to 1.16)2.66E-0912 (0 to 92)0.3371.0091.13 (1.00 to 1.28)0.05449 (0 to 98)0.0470.9791.16 (1.11 to 1.21)2.09E-102 (0 to 86)0.4161.00Positive
8q24 rs1050547718 562 vs 20 132 (14)141.14 (1.10 to 1.19)2.04E-1144 (0 to 82)0.0401.00141.16 (1.10 to 1.23)4.06E-0735 (0 to 74)0.0921.00141.24 (1.17 to 1.32)1.04E-1136 (0 to 81)0.0841.00Positive
20p12.3 rs96125322 971 vs 25 270 (14*)141.11 (1.06 to 1.16)1.58E-0560 (28 to 91)0.0021.00131.14 (1.06 to 1.22)7.10E-0436 (0 to 96)0.0981.00141.14 (1.07 to 1.21)4.75E-0557 (21 to 88)0.0041.00Positive
8q23.3 rs168927668351 vs 8878 (6*)61.24 (1.15 to 1.34)1.67E-084 (0 to 92)0.3930.9961.21 (0.88 to 1.67)0.2400 (0 to 85)0.7470.2261.27 (1.17 to 1.38)4.00E-093 (0 to 90)0.4001.00Positive
10p14 rs1079566816 763 vs 18 302 (16*)160.88 (0.83 to 0.94)4.35E-0567 (34 to 89)6.94E-051.00160.83 (0.77 to 0.89)4.77E-0711 (0 to 70)0.3321.00160.85 (0.79 to 0.93)2.27E-0468 (34 to 87)3.06E-051.00Positive
11q23.1 rs380284222 320 vs 22 965 (20*)201.15 (1.11 to 1.20)5.21E-1243 (0 to 78)0.0241.00201.24 (1.13 to 1.35)3.62E-0645 (5 to 86)0.0171.00201.19 (1.13 to 1.25)7.22E-1233 (0 to 70)0.0751.00Positive
BMP2 rs35552713 539 vs 14 375 (9)91.12 (1.08 to 1.17)1.65E-100 (0 to 0)0.9770.9991.18 (1.09 to 1.27)3.54E-050 (0 to 94)0.4580.9991.16 (1.10 to 1.21)2.43E-090 (0 to 9)0.9661.00Positive
CDH1 rs186274817 436 vs 18 418 (11)110.91 (0.88 to 0.94)1.98E-080 (0 to 63)0.7030.98110.83 (0.77 to 0.90)1.59E-062 (0 to 68)0.4271.00110.91 (0.87 to 0.95)6.31E-060 (0 to 70)0.5110.99Positive
RHPN2 rs725937115 762 vs 16 700 (9*)90.87 (0.82 to 0.93)1.65E-0552 (0 to 90)0.0341.0090.85 (0.73 to 0.98)0.02717 (0 to 86)0.2950.7190.86 (0.80 to 0.92)3.81E-0642 (0 to 87)0.0841.00Positive
TERT rs273610016 176 vs 18 135 (8)81.07 (1.04 to 1.10)2.95E-050 (0 to 82)0.5260.8881.06 (1.00 to 1.13)0.06915 (0 to 86)0.3160.6481.13 (1.08 to 1.20)2.03E-060 (0 to 64)0.7881.00Positive
TGFB1 rs18004694021 vs 6024 (10)100.89 (0.80 to 0.99)0.03653(2t o 92)0.0231.00100.84 (0.74 to 0.94)0.0030 (0 to 75)0.5271.00100.84 (0.71 to 1.00)0.05158 (15 to 92)0.0111.00Positive

*Includes GWAS data from SOCCS.

GWASs, genome-wide association studies; SOCCS, Study of CRC in Scotland.

Summary of ‘credible positive associations’ from meta-analyses and credibility assessment *Includes GWAS data from SOCCS. GWASs, genome-wide association studies; SOCCS, Study of CRC in Scotland. Credibility assessment indicated 18 variants (5.8% of the meta-analysed SNPs) tagging 16 loci (rs36053993 and rs34612342 in MUTYH, rs2066847 in NOD2, rs12953717 and rs4464148 in SMAD7, rs1569686 in DNMT3B, rs2736100 in TERT, rs9858822 in PPAR-gamma, rs1862748 in CDH1, rs7259371 in RHPN2, rs355527 in BMP2, rs1800469 in TGFB1, rs10505477 in 8q24, rs16892766 in 8q23.3, rs3802842 in 11q23.1, rs961253 in 20p12.3, rs10795668 in 10p14, rs4951291 in 1q32.1) had the most credible associations with CRC risk and are therefore referred to as ‘positive’ SNPs (table 1, online supplementary tables 2–4). These findings are based on accrued data on 1224 to 43 652 cases and on 1381 to 60 883 controls, with a median of 17 100 cases per meta-analysis. The linkage disequilibrium (LD) between these ‘positive’ polymorphisms was checked pairwise using the Ensembl LD calculator with reference to the 1000 genomes: phase 3 CEU population and we found two pairs of SNPs (rs355527 and rs961253, rs12953717 and rs4464148) with r2 >0.20 (online supplementary tables 8). The other 59 variants (19.2% of the meta-analysed SNPs) in 49 loci with p value <0.05 were classified as ‘less-credible positive’ SNPs, given high heterogeneity, low statistical power or a high possibility of being false positive (BFDP >0.2) for their association with CRC risk (table 2, online supplementary tables 2–4). The summary findings for ‘less-credible positive’ SNPs were based on accrued data on 246 to 51 730 CRC cases and on 399 to 53 589 controls, with a median of at least 4287 CRC cases per meta-analysis.
Table 2

Summary of ‘less-credible associations’ from meta-analyses and credibility assessment

GeneVariantCases vs controls(number of samples)Allelic modelRecessive model: var/var vs wt/wt and wt/varDominant model: wt/var and var/var vs wt/wtClassification
NEffect sizeHeterogeneityNEffect sizeHeterogeneityPowerNEffect sizeHeterogeneityPower
OR (95% CI)P valueI2 (95% CI)P valuePowerOR (95% CI)P valueI2 (95% CI)P valueOR (95% CI)P valueI2 (95% CI)P value
MTHFR rs180113343 652 vs 60 883 (94*†)910.96 (0.93 to 0.99)8.79E-0353 (59 to 85)3.61E-090.84930.87 (0.81 to 0.93)7.36E-0556 (63 to 86)6.79E-111.00930.98 (0.94 to 1.02)0.24948 (52 to 82)1.98E-070.36Less credible
miR rs8958191322 vs 1641 (4)41.18 (1.06 to 1.32)0.0034 (0 to 95)0.3730.0941.50 (1.23 to 1.83)6.32E-051 (0 to 96)0.3850.4441.08 (0.93 to 1.26)0.3020 (0 to 94)0.4740.05Less credible
ADH1B rs12299844449 vs 6995 (9)91.05 (0.94 to 1.18)0.40157 (8 to 91)0.0180.1691.23 (1.02 to 1.48)0.0280 (0 to 70)0.4930.6891.04 (0.90 to 1.20)0.59560 (14 to 92)0.0100.11Less credible
XRCC1 rs2548713 017 vs 18 166 (37)371.11 (1.03 to 1.19)0.00871 (68 to 90)8.64E-120.74361.17 (1.02 to 1.34)0.02559 (39 to 80)3.14E-060.91371.11 (1.01 to 1.21)0.02363 (58 to 89)1.44E-070.88Less credible
HFE rs18005626547 vs 34 156 (10‡)101.10 (1.01 to 1.20)0.0350 (0 to 40)0.8500.46101.15 (0.69 to 1.92)0.58224 (0 to 84)0.2400.12101.10 (1.01 to 1.21)0.0370 (0 to 25)0.9370.68Less credible
MTHFR rs180113123 523 vs 35 193 (57*)570.98 (0.95 to 1.01)0.25822 (0 to 57)0.0750.19570.91 (0.84 to 0.98)0.01023 (0 to 62)0.0640.94571.00 (0.96 to 1.04)0.97010 (0 to 48)0.2700.50Less credible
CYP2C9 rs17998539588 vs 11 428 (12)120.95 (0.90 to 1.01)0.0860 (0 to 73)0.5300.26121.16 (0.96 to 1.40)0.1270 (0 to 73)0.6670.39120.93 (0.87 to 0.99)0.0150 (0 to 72)0.4520.65Less credible
CRP rs18009472853 vs 3381 (3)21.03 (0.88 to 1.20)0.7450 (0 to 100)0.3340.1023.95 (1.35 to 11.56)0.0127 (0 to 99)0.3011.0030.96 (0.82 to 1.12)0.6040 (0 to 81)0.9110.08Less credible
CRP rs12053037 vs 9333 (4)41.07 (1.00 to 1.15)0.0650 (0 to 95)0.4790.3441.18 (1.02 to 1.37)0.0270 (0 to 94)0.4310.7341.05 (0.96 to 1.16)0.2880 (0 to 90)0.6690.21Less credible
EGF rs4444903899 vs 976 (5)50.83 (0.64 to 1.07)0.14370 (18 to 97)0.0100.5250.91 (0.63 to 1.31)0.60150 (0 to 92)0.0930.1350.65 (0.44 to 0.96)0.03067 (12 to 97)0.0160.99Less credible
ESR2 rs9285542574 vs 2977 (3)31.09 (1.01 to 1.18)0.0250 (0 to 89)0.7430.3531.12 (0.97 to 1.29)0.1180 (0 to 93)0.5900.3631.13 (1.01 to 1.26)0.0410 (0 to 89)0.8760.57Less credible
HPGD rs87523968 vs 4830 (3‡)31.05 (0.92 to 1.21)0.46780 (23 to 99)0.0070.2031.12 (1.00 to 1.26)0.0476 (0 to 97)0.3450.5331.03 (0.83 to 1.29)0.78683 (34 to 99)0.0030.10Less credible
LIPC rs60837667 vs 7980 (4‡)40.96 (0.90 to 1.02)0.17241 (0 to 95)0.1670.2340.89 (0.80 to 0.99)0.03322 (0 to 94)0.2780.6840.98 (0.91 to 1.05)0.51911 (0 to 92)0.3400.10Less credible
miR rs22928322355 vs 2571 (5‡)50.94 (0.86 to 1.02)0.1520 (0 to 82)0.6500.1751.06 (0.89 to 1.27)0.5220 (0 to 73)0.7890.1050.87 (0.78 to 0.98)0.0210 (0 to 88)0.7040.68Less credible
MSH2 rs46085774308 vs 4011 (3‡)30.94 (0.86 to 1.02)0.1120 (0 to 91)0.7950.1930.72 (0.56 to 0.92)0.0080 (0 to 95)0.6480.6830.96 (0.88 to 1.05)0.4060 (0 to 62)0.9450.14Less credible
MSH3 rs1849678151 vs 10 103 (4‡)41.08 (1.00 to 1.16)0.04935 (0 to 98)0.2040.4741.23 (1.02 to 1.49)0.0310 (0 to 91)0.7020.5941.07 (0.99 to 1.16)0.09828 (0 to 98)0.2420.54Less credible
MSH3 rs267796050 vs 8024 (6‡)61.07 (1.02 to 1.14)0.01311 (0 to 92)0.3430.4661.13 (0.98 to 1.31)0.08536 (0 to 93)0.1660.5961.08 (1.01 to 1.15)0.0300 (0 to 85)0.6820.62Less credible
MTHFD1 rs19509029059 vs 11 358 (7‡)70.94 (0.89 to 0.99)0.0263 (0 to 82)0.4000.3971.01 (0.87 to 1.18)0.8780 (0 to 91)0.7670.0570.92 (0.86 to 0.98)0.01515 (0 to 90)0.3130.79Less credible
PARP1 rs11364107002 vs 8328 (7‡)71.05 (0.95 to 1.17)0.30052 (0 to 98)0.0520.2471.23 (1.05 to 1.44)0.0120 (0 to 92)0.4750.7871.03 (0.93 to 1.15)0.52936 (0 to 94)0.1530.14Less credible
PGR rs10428385232 vs 5733 (4‡)41.08 (1.01 to 1.16)0.0340 (0 to 95)0.5070.3240.97 (0.77 to 1.23)0.8190 (0 to 85)0.8800.0641.11 (1.02 to 1.21)0.0120 (0 to 96)0.4500.71Less credible
rs495103913 791 vs 14 288 (7‡)70.91 (0.81 to 1.02)0.10681 (49 to 95)1.48E-050.7970.78 (0.66 to 0.93)0.0050 (0 to 68)0.6200.8270.91 (0.80 to 1.04)0.16382 (51 to 96)9.30E-060.94Less credible
SCD rs78492011 vs 2580 (3)30.85 (0.73 to 0.98)0.02529 (0 to 98)0.2470.5330.64 (0.40 to 1.02)0.05841 (0 to 99)0.1850.6830.85 (0.71 to 1.02)0.08433 (0 to 99)0.2250.72Less credible
XPC rs22280002677 vs 4253 (6)60.99 (0.71 to 1.36)0.92994 (82 to 99)1.35E-160.0560.95 (0.49 to 1.81)0.86691 (71 to 99)2.18E-100.0960.58 (0.41 to 0.82)0.00292 (83 to 99)1.49E-111.00Less credible
VEGF rs6999474497 vs 5334 (10‡)101.09 (1.00 to 1.19)0.05646 (0 to 91)0.0530.56101.14 (1.01 to 1.29)0.02914 (0 to 89)0.3170.74101.09 (0.95 to 1.24)0.21947 (0 to 87)0.0470.50Less credible
IGF1 rs357672792 vs 4988 (3)30.95 (0.80 to 1.14)0.59959 (0 to 99)0.0870.1530.75 (0.62 to 0.91)0.0030 (0 to 97)0.5670.7930.98 (0.79 to 1.21)0.82660 (0 to 99)0.0810.07Less credible
NAT2 rs17999291861 vs 1952 (5*)50.84 (0.74 to 0.95)0.00420 (0 to 94)0.2890.5350.68 (0.47 to 0.99)0.04468 (14 to 98)0.0150.8150.85 (0.74 to 0.97)0.0160 (0 to 85)0.8720.61Less credible
NAT2 rs17999306446 vs 7193 (8*)81.07 (0.99 to 1.16)0.09944 (0 to 96)0.0820.2580.99 (0.88 to 1.12)0.8970 (0 to 64)0.8120.0881.13 (1.00 to 1.27)0.04351 (0 to 97)0.0450.70Less credible
ABCB1 (MDR1)rs92825649536 vs 9145 (6‡)61.07 (1.00 to 1.14)0.0530 (0 to 77)0.5610.2960.88 (0.65 to 1.19)0.42418 (0 to 86)0.2990.1461.09 (1.01 to 1.17)0.0180 (0 to 70)0.7290.65Less credible
ABCB1 (MDR1)rs10456427537 vs 8396 (16)160.94 (0.88 to 1.01)0.10846 (1 to 88)0.0240.50160.89 (0.81 to 0.97)0.01218 (0 to 78)0.2480.87160.97 (0.85 to 1.10)0.60857 (32 to 91)0.0030.14Less credible
ABCB1 (MDR1)rs12021687000 vs 6485 (5)51.06 (0.99 to 1.14)0.07145 (0 to 93)0.1240.3951.04 (0.95 to 1.14)0.3500 (0 to 81)0.6990.1451.12 (1.00 to 1.24)0.04252 (0 to 93)0.0800.88Less credible
ADIPOR1 rs13423872472 vs 2848 (5‡)50.87 (0.78 to 0.98)0.01848 (0 to 95)0.1050.7150.84 (0.73 to 0.96)0.0120 (0 to 62)0.9000.6850.85 (0.69 to 1.03)0.09863 (0 to 96)0.0290.79Less credible
AXIN2 rs22403084723 vs 4932 (4‡)41.06 (1.00 to 1.12)0.0440 (0 to 92)0.3970.3041.07 (0.96 to 1.19)0.21917 (0 to 95)0.3060.2941.10 (1.01 to 1.21)0.0370 (0 to 76)0.7580.54Less credible
MMP1 rs17997501660 vs 2024 (10)90.78 (0.66 to 0.93)0.00559 (8 to 88)0.0120.9590.78 (0.61 to 0.99)0.04121 (0 to 68)0.2540.75100.70 (0.56 to 0.88)0.00257 (9 to 88)0.0131.00Less credible
CYP1A1 rs10489439661 vs 11 774 (19*)191.28 (1.07 to 1.52)0.00684 (83 to 96)1.15E-151.00181.26 (1.04 to 1.52)0.0160 (0 to 55)0.4560.72191.36 (1.08 to 1.72)0.00988 (93 to 98)2.16E-231.00Less credible
GSTT1 Null variant19 133 vs 27 821 (56)70.91 (0.70 to 1.20)0.50871 (21 to 99)0.0150.49531.17 (1.07 to 1.28)0.00170 (71 to 90)1.62E-151.0061.00 (0.73 to 1.37)0.97953 (0 to 98)0.0590.05Less credible
VDR rs154441014 789 vs 15 922 (16‡)160.83 (0.72 to 0.96)0.01193 (91 to 95)5.59E-401.00160.81 (0.67 to 0.98)0.03385 (77 to 89)1.08E-141.00160.82 (0.69 to 0.96)0.01588 (83 to 91)2.62E-201.00Less credible
8q24 rs698326751 730 vs 53 589 (34‡)341.11 (1.06 to 1.17)1.75E-0585 (77 to 93)1.93E-281.00341.16 (1.08 to 1.25)4.92E-0579 (66 to 89)2.80E-181.00341.16 (1.08 to 1.24)2.98E-0580 (69 to 92)8.68E-191.00Less credible
RAD18 rs3735726560 vs 6906 (5‡)51.15 (1.01 to 1.30)0.03175 (33 to 98)0.0030.9651.23 (1.01 to 1.49)0.04154 (0 to 95)0.0670.9351.15 (0.98 to 1.36)0.08073 (21 to 98)0.0050.98Less credible
NQ01 rs180056611 183 vs 12 525 (16‡)161.17 (1.05 to 1.30)0.00680 (66 to 94)1.17E-091.00161.21 (0.99 to 1.48)0.06055 (17 to 87)0.0040.91161.20 (1.06 to 1.37)0.00678 (64 to 94)8.77E-091.00Less credible
PTGS2/COX2 rs204177785 vs 11 371 (18)171.10 (1.00 to 1.2)0.03537 (0 to 83)0.0620.57171.24 (1.04 to 1.47)0.0160 (0 to 46)0.9280.64181.12 (1.00 to 1.26)0.05051 (14 to 87)0.0190.90Less credible
NOD2 rs20668443297 vs 3088 (9)91.38 (1.04 to 1.84)0.02639 (0 to 91)0.1090.6691.97 (0.55 to 7.04)0.2990 (0 to 83)0.6890.1791.35 (1.02 to 1.78)0.03333 (0 to 90)0.1550.86Less credible
CCND1 rs178521536500 vs 8885 (20)201.10 (1.02 to 1.19)0.01958 (45 to 92)0.0010.83201.11 (0.97 to 1.28)0.12863 (54 to 94)7.69E-050.82201.15 (1.05 to 1.26)0.00318 (0 to 76)0.2260.94Less credible
GSTM1 Null variant28 240 vs 38 880 (74)131.12 (0.89 to 1.40)0.33570 (22 to 98)0.0051.0061.18 (0.90 to 1.55)0.23467 (0 to 98)0.0281.00741.10 (1.05 to 1.16)0.00147 (34 to 75)6.63E-061.00Less credible
9p24 rs71972513 513 vs 14 999 (14)141.07 (1.02 to 1.11)0.00224 (0 to 79)0.1900.79141.09 (1.02 to 1.15)0.00629 (0 to 75)0.1480.94141.09 (1.02 to 1.17)0.0080 (0 to 66)0.5210.72Less credible
ERCC5 rs176559653 vs 11 367 (14)131.07 (1.02 to 1.12)0.0030 (0 to 73)0.6260.60130.98 (0.89 to 1.08)0.6880 (0 to 77)0.5380.07141.14 (1.04 to 1.24)0.00444 (0 to 85)0.0381.00Less credible
GH1 rs26658023275 vs 3848 (7)70.89 (0.80 to 0.99)0.02549 (0 to 97)0.0690.6870.83 (0.71 to 0.98)0.02837 (0 to 95)0.1440.8770.89 (0.78 to 1.02)0.10336 (0 to 96)0.1570.62Less credible
TP73 G4C14858 vs 1173 (4)41.21 (1.04 to 1.40)0.0134 (0 to 92)0.3710.4641.75 (1.23 to 2.48)0.0020 (0 to 94)0.5130.9141.10 (0.83 to 1.45)0.50155 (0 to 96)0.0830.18Less credible
TGFB1 rs48034553747 vs 4513 (3‡)31.11 (1.04 to 1.18)0.0010 (0 to 71)0.9030.6631.12 (1.01 to 1.24)0.0270 (0 to 80)0.8450.5931.18 (1.06 to 1.30)0.0010 (0 to 85)0.8790.90Less credible
APC rs18011552389 vs 4223 (5)51.60 (1.21 to 2.11)0.00127 (0 to 94)0.2440.85nananananana51.62 (1.21 to 2.16)0.00128 (0 to 94)0.2350.99Less credible
MLH1 rs63750447937 vs 812 (4)32.14 (1.12 to 4.10)0.02239 (0 to 99)0.1940.75nananananana32.17 (1.14 to 4.14)0.01936 (0 to 98)0.2080.96Less credible
VDR rs115688204001 vs 4682 (5*)41.14 (1.05 to 1.23)0.0010 (0 to 92)0.6480.7141.37 (1.02 to 1.85)0.03935 (0 to 97)0.2030.9051.14 (1.04 to 1.24)0.0040 (0 to 0)0.9890.85Less credible
ABCB1 (MDR1)rs1128503246 vs 399 (3)30.70 (0.51 to 0.96)0.02745 (0 to 98)0.1640.5930.77 (0.37 to 1.59)0.48371 (0 to 99)0.0310.3130.52 (0.35 to 0.76)0.0010 (0 to 96)0.6170.93Less credible
ADIPOQ rs22417661517 vs 1909 (5)51.17 (1.01 to 1.35)0.03329 (0 to 97)0.2310.5051.04 (0.74 to 1.45)0.83014 (0 to 98)0.3250.0651.26 (1.09 to 1.47)0.0028 (0 to 96)0.3640.92Less credible
CHEK2 rs17879961687 vs 3370 (3)31.47 (1.12 to 1.92)0.0060 (0 to 98)0.4990.53nananananana31.48 (1.13 to 1.95)0.0050 (0 to 98)0.4960.81Less credible
CHEK2 1100delC2417 vs 3615 (4)41.66 (1.01 to 2.73)0.0480 (0 to 83)0.8350.60nananananana41.66 (1.01 to 2.75)0.0470 (0 to 83)0.8350.86Less credible
TP53 rs104252210591 vs 12 673 (30)301.11 (0.99 to 1.24)0.07785 (84 to 95)4.42E-260.73301.21 (1.02 to 1.44)0.02670 (55 to 88)2.77E-090.98301.11 (0.96 to 1.28)0.16382 (84 to 05)7.58E-200.64Less credible
TP53 rs178783621812 vs 2319 (6)60.98 (0.76 to 1.26)0.85970 (23 to 96)0.0050.0661.28 (0.81 to 2.03)0.2820 (0 to 79)0.6380.2260.95 (0.71 to 1.27)0.73471 (25 to 96)0.0040.40Less credible
KRAS rs7121982 vs 2194 (3)31.22 (0.86 to 1.72)0.27090 (55 to 100)6.87E-050.1931.72 (0.73 to 4.06)0.21587(44 to 100)3.74E-040.1631.17 (0.84 to 1.65)0.35083 (32 to 99)0.0030.28Less credible
ARLTS1 rs38031852281 vs 3196 (5†)51.09 (1.01 to 1.18)0.0260 (0 to 76)0.8030.3551.08 (0.90 to 1.29)0.43243 (0 to 93)0.1330.2151.18 (0.95 to 1.46)0.13162 (0 to 96)0.0320.77Less credible
1q32.1 rs495129115 835 vs 16 724 (9‡)90.91 (0.84 to 0.99)0.02168 (22 to 88)0.0020.8590.74 (0.62 to 0.87)2.72E-040 (0 to 69)0.6160.9590.92 (0.83 to 1.01)0.07772 (30 to 90)4.33E-040.91Less credible
PPAR-gamma rs98588222152 vs 2630 (5)41.40 (1.19 to 1.65)4.69E-050 (0 to 98)0.6970.9621.69 (1.19 to 2.39)0.0030 (0 to 98)0.8010.5441.47 (1.19 to 1.83)4.55E-040 (0 to 98)0.6181.00Less credible
DNMT3B rs15696861224 vs 1381 (5)40.57 (0.47 to 0.68)1.81E-090 (0 to 0)0.9921.0040.93 (0.54 to 1.58)0.7750 (0 to 97)0.4140.0650.46 (0.36 to 0.59)2.40E-0930 (0 to 95)0.2231.00Less credible
NOD2 rs20668474573 vs 3733 (10)101.39 (1.16 to 1.66)3.21E-040 (0 to 71)0.5470.78102.36 (0.69 to 8.13)0.1730 (0 to 76)0.7200.31101.38 (1.14 to 1.66)0.0010 (0 to 74)0.4880.96Less credible

*Includes GWAS data from Ontario.

†Tomlinson 2008 was based on 10 samples.

‡Includes GWAS data from SOCCS.

GWASs, genome-wide association studies; SOCCS, Study of CRC in Scotland.

Summary of ‘less-credible associations’ from meta-analyses and credibility assessment *Includes GWAS data from Ontario. †Tomlinson 2008 was based on 10 samples. ‡Includes GWAS data from SOCCS. GWASs, genome-wide association studies; SOCCS, Study of CRC in Scotland. Polymorphisms classified as either ‘positive’ or ‘less-credible positive’ SNPs were sent for validation in synthesised data from the GECCO, CORECT and CCFR consortia. Validation was only able to be performed for 68 out of 77 polymorphisms, as one ‘positive’ variant (rs34612342 in MUTYH [Y179C] gene) was dropped due to its low imputation quality in these GWASs; 5 ‘less-credible positive’ polymorphisms (in GSTT1, GSTM1, TP73, CHEK2 and CYP2E1) had no rs numbers; and 3 polymorphisms (in TP53 and KRAS) were not available in these GWASs; therefore, they were not able to be validated. Of the 17 ‘positive’ SNPs sent for validation, 7 polymorphisms (41.2%) in 6 different loci (rs12953717 and rs4464148 in SMAD7, rs355527 in BMP2, rs10505477 in 8q24, rs961253 in 20p12.3, rs16892766 in 8q23.3, rs3802842 in 11q23.1) reached a genome-wide statistical significance (p≤5×10−8) in at least one meta-analysis model and 6 polymorphisms (35.3%) in 6 different loci (rs7259371 in RHPN2, rs2736100 in TERT, rs10795668 in 10p14, rs36053993 in MUTYH, rs1862748 in CDH1 and rs1800469 in TGFB1) reached a nominally statistical significance (p<0.05) in at least one meta-analysis model. However, the remaining 4 polymorphisms (23.5%) in 4 different loci (rs2066847 in NOD2, rs1569686 in DNMT3B, rs9858822 in PPAR-gamma and rs4951291 in 1q32.1) failed in the GWAS validation of all genetic models (online supplementary tables 5–7) and they were therefore downgraded to ‘less-credible positive’. Of the 51 ‘less-credible positive’ SNPs sent for validation, 2 polymorphisms (3.9%) in 2 different loci (rs6983267 in 8q24 and rs1801155 in APC) reached a genome-wide statistical significance (p≤5×10−8) in at least one meta-analysis model and 11 polymorphisms (21.6%) in 11 different loci (rs1136410 in PARP1, rs11568820 in VDR, rs1342387 in ADIPOR1, rs719725 in TPD52L3, rs20417 in PTGS2/COX2, rs2665802 in GH1, rs4803455 in TGFB1, rs7849 in SCD, rs8752 in HPGD, rs36053993 in MUTYH and rs928554 in ESR2) reached a nominally statistical significance (p<0.05) in at least one meta-analysis model, whereas the remaining 38 polymorphisms (74.5%) failed in the GWAS validation of all genetic models (online supplementary tables 5–7). Overall, 26 (33.8%) out of the 77 nominally significant polymorphisms tested via meta-analysis were successfully validated in these GWASs (p<0.05). Funnel plots were produced for all ‘credible’ (online supplementary figures 1–14) and ‘less-credible’ SNPs (online supplementary figures 15–77) for their summary estimates in allelic model. Small study effects were reported for 2 (14.3%) of the 14 ‘positive’ polymorphisms (rs36053993 in MUTYH and rs4464148 in SMAD7) and for 10 (15.9%) of 63 ‘less-credible’ polymorphisms (rs1229984 in ADH1B, rs25487 in XRCC1, rs2240308 in AXIN2, rs1799750 in MMP1, rs1048943 in CYP1A1, rs373572 in RAD18, rs1800566 in NQ01, rs2066844 in NOD2, rs2665802 in GH1, rs712 in KRAS). Therefore, their reported association with CRC risk should be interpreted with caution. The remaining 231 polymorphisms (75.0%) assessed in meta-analyses were reported to have no statistically significant association (p>0.05) with CRC risk in all three genetic models (online supplementary table 9), based on accrued data on 192 to 21 929 cases and 251 to 24 054 controls, with median of at least 4017 cases per meta-analysis. Of them, 148 polymorphisms were classified as having negative associations with CRC risk because the number of cases was less than 5000 for which the null results could be due to limited statistical power, while another 83 SNPs with p>0.05 and the number of cases>5000 were classified as null variants with adequate statistical power (online supplementary table 9).

Discussion

This systematic, comprehensive field synopsis of genetic association studies on CRC updates the two previous field synopses10 11 with application of harmonised methods for evidence appraisal and further validation of the identified genetic associations in three GWAS consortia. Specifically, we extracted and collated data for 1063 polymorphisms in 303 different genes from 869 publications and performed up-to-date meta-analyses for 308 variants in 158 different genes that had data from at least three independent studies available for analysis. After credibility assessment and validation, we identified a total of 12 genetic loci credibly associated with CRC risk, of which 6 loci (MUTYH, SMAD7, 8q24, 8q23.3, 11q23.1, 20p12.3) were also classified as credibly associated with CRC risk in the previous field synopses and the other 6 loci (TGFB1, TERT, CDH1, RHPN2, BMP2 and 10p14) are novel findings as they have not been assessed or reported as credible risk loci in the previous field synopses. We note that a synopsis was undertaken of literature on genetic associations with CRC published in the period 2012–2017.24 Our analysis differs from that study, in that it includes and updates data from our previous field synopses,10 11 includes data from three GWA studies in the meta-analysis and further validates the findings in three GWA consortia. Similar to our previous field synopses,10 11 the present study reported two SNPs at 8q24 locus (rs6983267 and rs10505477) with strong evidence supporting significant associations with CRC risk and with these associations further replicated in three GWAS consortia. In biopsies of the rectum, sigmoid colon and cecum mucosa, proliferation has been reported to be higher among homozygotes for the risk alleles of the rs6983267 and rs10505477 variants compared with those with other genotypes in the general population.25 The rs6983267 variant has been assessed as having a highly credible association with colorectal adenomas.26 27 In fine-mapping and bioinformatic analysis performed within the GECCO-CCFR consortia, the rs6983267 variant was appraised as having a strong functional evidence.28 The rs6983267 may be a somatic target in CRC29 and may be associated with enhanced responsiveness to Wnt signalling.30 Furthermore, rs6983267 has also been found to be associated with other types of cancer, including prostate cancer.31–33 Interaction with the MYC proto-oncogene has been controversial,34–37 but in functional studies in cell lines, interaction between enhancer elements in the 8q24 locus and the MYC promoter, via transcription factor Tcf-4 binding and allele-specific regulation of MYC expression, has been demonstrated.38 Expression levels of one of these, CARLo-5, in normal colon tissue have been found to be statistically significantly correlated with rs6983267, and chromosome conformation capture analysis of genomic DNA from CRC-derived cell lines provided evidence of physical interaction between the active regulatory region of the CARLo-5 promoter and the MYC enhancer region.39 Since the end of our search period (21 November 2018), an analysis of GWAS data on 22 775 cases and 47 731 controls from 14 studies in East Asia detected a genome-wide significant association with the rs6983267 variant.40 In addition, in a combined meta-analysis of up to 58 131 cases and 67 347 controls from the GECCO, CORECT and CCFR consortia, in which imputed variants from a whole-genome sequencing analysis and Haplotype Reference Consortium panel variants were included, analysis in the 8q24.21 region conditioned on the rs6983267 and rs7013278 variants identified a genome-wide significant association with the rs4313119.41 Two genetic polymorphisms (rs12953717 and rs4464148) tagging in SMAD7 were identified to be associated with CRC with highly credible evidence. Associations with rs12953717 and rs4464148 were successfully validated in the three GWAS consortia. In fine-mapping and bioinformatics analysis performed within the GECCO-CCFR consortia, rs4464148 was appraised as having less strong functional evidence than the highly correlated rs9932005 variant, which is located within 5 kb away.28 The SMAD7 protein is an inhibitor for the TGF-ß signalling pathway.42 43 There were highly credible associations with the rs1862748 (tagging CDH1), rs355527 (BMP2), rs961253 (BMP2) and rs7259371 (RHPN2) variants, which are TGF-ß related, and replicated in the data from the GWAS consortia. In the recent East Asian analysis, the association with the rs961253 (BMP2) variant was replicated, as well as additional variants of SMAD7 (rs7229639, rs4939827), CDH1 (rs9929218), BMP2 (rs4813802) and RHPN2 (rs10411210).40 In the analysis from the GECCO, CORECT and CCFR consortia, conditioned on the rs4813802 and rs189583 variants of BMP2 as well as each other, novel associations with the BMP2 variants rs28488 and rs994308 were detected.41 Additionally, two variants (rs34612342 and rs36053993) tagging the MUTYH gene were highly credibly associated with increased CRC risk, of which rs36053993 was validated in the three GWAS consortia data, while rs34612342 was not tested due to its poor imputation quality. The MUTYH gene is known to be involved in the dysfunction of base-excision repair, which is the major pathway for repairing oxidative damage. This biological pathway in which the MUTYH gene is involved contributes to the development of multiple colorectal adenomas and carcinomas (MUTYH-associated polyposis (MAP) syndrome).44 In our analysis, the p values were nominally significant for all models for the rs34612342 variant and for the allelic and dominant models for the rs36053993 variant. For the former variant, the magnitude of effect was greatest for the recessive model, although with wide CIs, which is intriguing as MAP, in which highly penetrant mutations are implicated, and is inherited in an autosomal recessive manner.45 Highly credible associations were also reported for four variants tagging four different genes that are involved in inflammation or immune response. First, a positive association with the rs3802842 variant in the 11q23.1 was identified for CRC risk and this association was replicated in the data from the GWAS consortia and in the recent East Asian analysis.40 Fine-mapping and bioinformatics analysis performed within the GECCO-CCFR consortia support this variant with strong functional evidence.28 Fine mapping identified two genes COLCA1 and COLCA2 arranged on opposite strands and sharing a regulatory region containing rs3802842.46 It is reported that carrying the risk allele of rs3802842 is associated with the expression levels of COLCA1 and COLCA2, which is further correlated with lymphocyte infiltration of colonic lamina propia. Further, in an expression quantitative trait locus analysis in colorectal tissue, there were signals for COLCA1 and COLCA2.47 The polymorphism rs10795668 in 10p14 locus tagging GATA3 gene was reported with a highly credible association with CRC, and this association was further replicated in the GWAS consortia and in the recent East Asian analysis.40 However, in fine-mapping and bioinformatics analysis, this variant presents with weak functional evidence.28 Another less-credible positive association with the rs9858822 variant of PPARγ was identified; however, this association was not replicated in the GWAS consortia. Bioinformatics analysis showed that PPARγ and its ligands have been found to block proinflammatory genes in colon cancer cell lines, activated macrophages and monocytes.48 PPARγ is also involved in lipid metabolism, adipocyte differentiation, and glucose homeostasis and insulin sensitivity.48 A less-credible positive association also reported one variant (rs2066847) in NOD2 gene; however, this association was not replicated. Evidence from experimental studies in mice investigating the role of Nod2 in colorectal tumour risk has also been inconsistent.49–51 The recent analyses in East Asia,40 the GECCO, CORECT and CCFR consortia41 and of five UK studies and a further 10 from the COGENT consortium47 identified new associations in the major histocompatibility region. The remaining two variants with highly credible associations with CRC risk included rs2736100 tagging TERT and rs16892766 at 8q23.3 tagging EIF3H. These variants were all validated in the GWAS consortia datasets. Fine-mapping and bioinformatics analysis performed within the GECCO-CCFR consortia supported these variants with strong functional evidence, for which polymorphism rs2736100 in TERT gene has been reported to be associated with telomere length and the risk of different types of cancer and chronic diseases other than cancer.52 For the association with the rs16892766 variant at 8q23.3 tagging EIF3H, another variant rs16888589 was previously reported with the lowest p value for the association with CRC in this locus.25 Functional analysis and chromosome conformation capture analysis in CRC cell lines have found that the genomic region harbouring rs16888589 increases EIF3H expression,53 but analysis of expression quantitative trait loci around rs16892766 suggested that UTP23 rather than EIF3H is the target of genetic variation associated with CRC in this region.54 The less credible associations with 63 variants of 52 genes involved the following pathways—adhesion (AXIN2, MMP1); alcohol metabolism (ADH1B); angiogenesis (VEGF); blood clotting (SERPINE1); DNA repair (CHEK2, ERCC5, MSH2, MSH3, PARP1, RAD18, XPC, XRCC1); hormone metabolism (ESR2, GH1, PGR); inflammation and immune response (CRP, HPGD, PTGS2/COX2); inhibition of cell growth (CCND1, EGF, TGFB1); iron metabolism (HFE); lipid metabolism (ADIPOQ, ADIPOR1, LIPC, SCD); one-carbon metabolism (MTHFR, MTFD1, MTRR); substrate metabolism (ABCB1; CYP1A1, CYP2C9, CYP2E1, GSTM1, GSTT1, NAT2, NQ01); tumour suppression (ARLTS1, miR, TP73); vitamin D metabolism (VDR)—common low penetrance variants at 1q32.1 (rs4951039, LINC00303) and 9p24 (rs719725); and the common rs1801155 (I1307K) variant of APC, for which large numbers of rare variants have been identified,55 and rs63750447 (V384D) variant of MLH1, for which rare variants confer a high risk of Lynch syndrome.56 These variants were classified as less-credible SNPs because of either the substantial heterogeneity or the high possibility of false positive; however, we would like to highlight a number of less-credible genetic loci (PARP1, MYC, VDR, ADIPOR1, APC, PTGS2/COX2, SCD, HPGD and ESR2), which were replicated in the GWAS data and for which their linked pathways are worthy of further investigation in future studies. Updating field synopses is challenging because genetic analysis is such a fast-moving field. Recent trends highlighted by three articles published since we completed our search in November 2018 include the extension of consortia to increase statistical power to detect and replicate associations,47 the investigation of populations other than of European origin,40 and the use of whole-genome sequencing and more comprehensive reference panels to extend the range of genetic variants considered to include those that are rare or of low frequency.41 These three articles have added new loci for CRC susceptibility and indicate that most of the risk loci previously associated with CRC in populations of European origin are also associated with CRC risk in East Asian populations. We checked whether the 14 variants we classified as ‘highly credible’ were replicated in the paper of Law et al.47 For one variant, rs34612342 MUTYH, there was poor imputation quality. However, as there was no satisfactory proxy for this, we report the available information, which supported the association (p=0.029). All the remaining 13 variants had p values for association less than 5.0×10−5 with no more than moderate heterogeneity. Such an empirical comparison was not appropriate with the data of Huyghe et al 41 because of the considerable overlap of included participants. In summary, we have conducted a comprehensive study to capture and meta-analyse all SNP data for common genetic variants. The analysis clearly identifies 14 variants at 12 loci for which there is robust evidence of their impact on CRC risk, 63 variants at 52 loci for which further evidence through international collaboration should be generated and 231 variants for which the overall evidence does not support any association with CRC risk. With increasing availability of data from multiple SNPs, it is clear that studies to test associations must achieve very high levels of statistical stringency. Nonetheless, the analysis here provides a resource for mining available data and puts into context the sample sizes required for the identification of true associations for common genetic variants. Future resequencing studies are expected to identify rarer variants (eg, prevalence 0.05%–5%) with intermediate or perhaps even large effects, and GWAS of structural variation will likely identify deletions, amplifications and other copy number variations that may also influence CRC risk.6 This study highlights a number of common genetic variants that could be incorporated into genetic risk-prediction algorithms as further risk factors are identified and highlights the loci at which further research effort should be targeted. All data are available from the CRCgene2 database.
  54 in total

1.  A critique of the false-positive report probability.

Authors:  Joseph F Lucke
Journal:  Genet Epidemiol       Date:  2009-02       Impact factor: 2.135

2.  Familial colorectal cancer.

Authors:  M S Lung; A H Trainer; I Campbell; L Lipton
Journal:  Intern Med J       Date:  2015-05       Impact factor: 2.048

3.  Genetic variation in the inflammation and innate immunity pathways and colorectal cancer risk.

Authors:  Hansong Wang; Darin Taverna; Daniel O Stram; Barbara K Fortini; Iona Cheng; Lynne R Wilkens; Terrilea Burnett; Karen W Makar; Noralane M Lindor; John L Hopper; Steve Gallinger; John A Baron; Robert Haile; Laurence N Kolonel; Brian E Henderson; Polly A Newcomb; Graham Casey; David Duggan; Cornelia M Ulrich; Loïc Le Marchand
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2013-09-17       Impact factor: 4.254

4.  The 8q24 cancer risk variant rs6983267 shows long-range interaction with MYC in colorectal cancer.

Authors:  Mark M Pomerantz; Nasim Ahmadiyeh; Li Jia; Paula Herman; Michael P Verzi; Harshavardhan Doddapaneni; Christine A Beckwith; Jennifer A Chan; Adam Hills; Matt Davis; Keluo Yao; Sarah M Kehoe; Heinz-Josef Lenz; Christopher A Haiman; Chunli Yan; Brian E Henderson; Baruch Frenkel; Jordi Barretina; Adam Bass; Josep Tabernero; José Baselga; Meredith M Regan; J Robert Manak; Ramesh Shivdasani; Gerhard A Coetzee; Matthew L Freedman
Journal:  Nat Genet       Date:  2009-06-28       Impact factor: 38.330

5.  Genome-wide association studies, field synopses, and the development of the knowledge base on genetic variation and human diseases.

Authors:  Muin J Khoury; Lars Bertram; Paolo Boffetta; Adam S Butterworth; Stephen J Chanock; Siobhan M Dolan; Isabel Fortier; Montserrat Garcia-Closas; Marta Gwinn; Julian P T Higgins; A Cecile J W Janssens; James Ostell; Ryan P Owen; Roberta A Pagon; Timothy R Rebbeck; Nathaniel Rothman; Jonine L Bernstein; Paul R Burton; Harry Campbell; Anand Chockalingam; Helena Furberg; Julian Little; Thomas R O'Brien; Daniela Seminara; Paolo Vineis; Deborah M Winn; Wei Yu; John P A Ioannidis
Journal:  Am J Epidemiol       Date:  2009-06-04       Impact factor: 4.897

6.  Allelic imbalance at rs6983267 suggests selection of the risk allele in somatic colorectal tumor evolution.

Authors:  Sari Tuupanen; Iina Niittymäki; Kari Nousiainen; Sakari Vanharanta; Jukka-Pekka Mecklin; Kyösti Nuorva; Heikki Järvinen; Sampsa Hautaniemi; Auli Karhu; Lauri A Aaltonen
Journal:  Cancer Res       Date:  2008-01-01       Impact factor: 12.701

7.  SMAD7 and MGMT genotype variants and cancer incidence in the European Prospective Investigation into Cancer and Nutrition (EPIC)-Norfolk Study.

Authors:  Yet Hua Loh; Panagiota N Mitrou; Angela Wood; Robert N Luben; Alison McTaggart; Kay-Tee Khaw; Sheila A Rodwell
Journal:  Cancer Epidemiol       Date:  2010-11-12       Impact factor: 2.984

8.  Association analyses identify 31 new risk loci for colorectal cancer susceptibility.

Authors:  Philip J Law; Maria Timofeeva; Ceres Fernandez-Rozadilla; Peter Broderick; James Studd; Juan Fernandez-Tajes; Susan Farrington; Victoria Svinti; Claire Palles; Giulia Orlando; Amit Sud; Amy Holroyd; Steven Penegar; Evropi Theodoratou; Peter Vaughan-Shaw; Harry Campbell; Lina Zgaga; Caroline Hayward; Archie Campbell; Sarah Harris; Ian J Deary; John Starr; Laura Gatcombe; Maria Pinna; Sarah Briggs; Lynn Martin; Emma Jaeger; Archana Sharma-Oates; James East; Simon Leedham; Roland Arnold; Elaine Johnstone; Haitao Wang; David Kerr; Rachel Kerr; Tim Maughan; Richard Kaplan; Nada Al-Tassan; Kimmo Palin; Ulrika A Hänninen; Tatiana Cajuso; Tomas Tanskanen; Johanna Kondelin; Eevi Kaasinen; Antti-Pekka Sarin; Johan G Eriksson; Harri Rissanen; Paul Knekt; Eero Pukkala; Pekka Jousilahti; Veikko Salomaa; Samuli Ripatti; Aarno Palotie; Laura Renkonen-Sinisalo; Anna Lepistö; Jan Böhm; Jukka-Pekka Mecklin; Daniel D Buchanan; Aung-Ko Win; John Hopper; Mark E Jenkins; Noralane M Lindor; Polly A Newcomb; Steven Gallinger; David Duggan; Graham Casey; Per Hoffmann; Markus M Nöthen; Karl-Heinz Jöckel; Douglas F Easton; Paul D P Pharoah; Julian Peto; Federico Canzian; Anthony Swerdlow; Rosalind A Eeles; Zsofia Kote-Jarai; Kenneth Muir; Nora Pashayan; Andrea Harkin; Karen Allan; John McQueen; James Paul; Timothy Iveson; Mark Saunders; Katja Butterbach; Jenny Chang-Claude; Michael Hoffmeister; Hermann Brenner; Iva Kirac; Petar Matošević; Philipp Hofer; Stefanie Brezina; Andrea Gsur; Jeremy P Cheadle; Lauri A Aaltonen; Ian Tomlinson; Richard S Houlston; Malcolm G Dunlop
Journal:  Nat Commun       Date:  2019-05-14       Impact factor: 14.919

9.  Genetic variants associated with colorectal cancer risk: comprehensive research synopsis, meta-analysis, and epidemiological evidence.

Authors:  Xiangyu Ma; Ben Zhang; Wei Zheng
Journal:  Gut       Date:  2013-08-14       Impact factor: 23.059

10.  Genetic Variants Associated with Colorectal Adenoma Susceptibility.

Authors:  Anna Abulí; Antoni Castells; Luis Bujanda; Juan José Lozano; Xavier Bessa; Cristina Hernández; Cristina Álvarez-Urturi; Maria Pellisé; Clara Esteban-Jurado; Elizabeth Hijona; Andrea Burón; Francesc Macià; Jaume Grau; Rafael Guayta; Sergi Castellví-Bel; Montserrat Andreu
Journal:  PLoS One       Date:  2016-04-14       Impact factor: 3.240

View more
  11 in total

1.  Ability of known colorectal cancer susceptibility SNPs to predict colorectal cancer risk: A cohort study within the UK Biobank.

Authors:  Aviv Gafni; Gillian S Dite; Erika Spaeth Tuff; Richard Allman; John L Hopper
Journal:  PLoS One       Date:  2021-09-15       Impact factor: 3.752

2.  Association between vitamin D receptor BsmI, FokI, and Cdx2 polymorphisms and osteoporosis risk: an updated meta-analysis.

Authors:  Bin Chen; Wang-Fa Zhu; Yi-Yang Mu; Biao Liu; Hong-Zhuo Li; Xiao-Feng He
Journal:  Biosci Rep       Date:  2020-07-31       Impact factor: 3.840

Review 3.  Non-genetic biomarkers and colorectal cancer risk: Umbrella review and evidence triangulation.

Authors:  Xiaomeng Zhang; Dipender Gill; Yazhou He; Tian Yang; Xue Li; Grace Monori; Harry Campbell; Malcolm Dunlop; Konstantinos K Tsilidis; Maria Timofeeva; Evropi Theodoratou
Journal:  Cancer Med       Date:  2020-05-12       Impact factor: 4.452

4.  Research on diagnosis-related group grouping of inpatient medical expenditure in colorectal cancer patients based on a decision tree model.

Authors:  Suo-Wei Wu; Qi Pan; Tong Chen
Journal:  World J Clin Cases       Date:  2020-06-26       Impact factor: 1.337

Review 5.  Application of artificial neural networks for automated analysis of cystoscopic images: a review of the current status and future prospects.

Authors:  Misgana Negassi; Rodrigo Suarez-Ibarrola; Simon Hein; Arkadiusz Miernik; Alexander Reiterer
Journal:  World J Urol       Date:  2020-01-10       Impact factor: 4.226

Review 6.  Prognostic value of IGFBP2 in various cancers: a systematic review and meta-analysis.

Authors:  Biao Zhang; Chao-Qun Hong; Yu-Hao Luo; Lai-Feng Wei; Yun Luo; Yu-Hui Peng; Yi-Wei Xu
Journal:  Cancer Med       Date:  2022-05-11       Impact factor: 4.711

7.  Crosstalk Between DNA Methylation and Gene Mutations in Colorectal Cancer.

Authors:  Maria Dobre; Alessandro Salvi; Iulia Andreea Pelisenco; Florina Vasilescu; Giuseppina De Petro; Vlad Herlea; Elena Milanesi
Journal:  Front Oncol       Date:  2021-07-01       Impact factor: 6.244

8.  miRNAs-Based Molecular Signature for KRAS Mutated and Wild Type Colorectal Cancer: An Explorative Study.

Authors:  Elena Milanesi; Maria Dobre; Alina Ioana Bucuroiu; Vlad Herlea; Teodora Ecaterina Manuc; Alessandro Salvi; Giuseppina De Petro; Mircea Manuc; Gabriel Becheanu
Journal:  J Immunol Res       Date:  2020-06-23       Impact factor: 4.818

9.  Association of IL-6 174G/C (rs1800795) and 572C/G (rs1800796) polymorphisms with risk of osteoporosis: a meta-analysis.

Authors:  Bin Chen; Hong-Zhuo Li
Journal:  BMC Musculoskelet Disord       Date:  2020-05-28       Impact factor: 2.362

Review 10.  Evaluation of Association Studies and an Updated Meta-Analysis of VDR Polymorphisms in Osteoporotic Fracture Risk.

Authors:  Yi-Yang Mu; Biao Liu; Bin Chen; Wang-Fa Zhu; Xiang-Hua Ye; Hong-Zhuo Li; Xiao-Feng He
Journal:  Front Genet       Date:  2022-01-07       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.