| Literature DB >> 26553438 |
Maria N Timofeeva1, Ben Kinnersley2, Susan M Farrington1, Nicola Whiffin2, Claire Palles3, Victoria Svinti1, Amy Lloyd2, Maggie Gorman3, Li-Yin Ooi1, Fay Hosking2, Ella Barclay3, Lina Zgaga1, Sara Dobbins2, Lynn Martin3, Evropi Theodoratou1,4, Peter Broderick2, Albert Tenesa5,6, Claire Smillie1, Graeme Grimes6, Caroline Hayward6, Archie Campbell6,7, David Porteous6,7, Ian J Deary8, Sarah E Harris6,8, Emma L Northwood9, Jennifer H Barrett9, Gillian Smith10, Roland Wolf10, David Forman11, Hans Morreau12, Dina Ruano12, Carli Tops13, Juul Wijnen14, Melanie Schrumpf12, Arnoud Boot12, Hans F A Vasen15, Frederik J Hes13, Tom van Wezel12, Andre Franke16, Wolgang Lieb17, Clemens Schafmayer18, Jochen Hampe19, Stephan Buch19, Peter Propping20, Kari Hemminki21,22, Asta Försti21,22, Helga Westers23, Robert Hofstra23,24, Manuela Pinheiro25, Carla Pinto25, Manuel Teixeira25, Clara Ruiz-Ponte26, Ceres Fernández-Rozadilla26,3, Angel Carracedo26, Antoni Castells27, Sergi Castellví-Bel27, Harry Campbell1,4, D Timothy Bishop9, Ian P M Tomlinson3, Malcolm G Dunlop1, Richard S Houlston2.
Abstract
Whilst common genetic variation in many non-coding genomic regulatory regions are known to impart risk of colorectal cancer (CRC), much of the heritability of CRC remains unexplained. To examine the role of recurrent coding sequence variation in CRC aetiology, we genotyped 12,638 CRCs cases and 29,045 controls from six European populations. Single-variant analysis identified a coding variant (rs3184504) in SH2B3 (12q24) associated with CRC risk (OR = 1.08, P = 3.9 × 10(-7)), and novel damaging coding variants in 3 genes previously tagged by GWAS efforts; rs16888728 (8q24) in UTP23 (OR = 1.15, P = 1.4 × 10(-7)); rs6580742 and rs12303082 (12q13) in FAM186A (OR = 1.11, P = 1.2 × 10(-7) and OR = 1.09, P = 7.4 × 10(-8)); rs1129406 (12q13) in ATF1 (OR = 1.11, P = 8.3 × 10(-9)), all reaching exome-wide significance levels. Gene based tests identified associations between CRC and PCDHGA genes (P < 2.90 × 10(-6)). We found an excess of rare, damaging variants in base-excision (P = 2.4 × 10(-4)) and DNA mismatch repair genes (P = 6.1 × 10(-4)) consistent with a recessive mode of inheritance. This study comprehensively explores the contribution of coding sequence variation to CRC risk, identifying associations with coding variation in 4 genes and PCDHG gene cluster and several candidate recessive alleles. However, these findings suggest that recurrent, low-frequency coding variants account for a minority of the unexplained heritability of CRC.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26553438 PMCID: PMC4639776 DOI: 10.1038/srep16286
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Results of meta-analysis for variants reaching exome-wide level of significance (4 × 10-7) under a fixed effects model.
| SNP rsID | Gene | Annotation | CHR | BP | Risk Allele | Reference Allele | EAF(cases/controls) | N studies | N cases | N controls | OR | P value | P value Bonferroni adjusted |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rs1129406 | coding-synon | 12 | 51203371 | A | G | 0.43/ 0.40 | 6 | 4730 | 12603 | 1.11 | 8.30 × 10−9 | 7.44 × 10−04 | |
| rs12303082 | missense | 12 | 50754563 | A | C | 037/0.35 | 9 | 10207 | 19886 | 1.09 | 7.40 × 10−8 | 6.63 × 10−03 | |
| rs6580742 | missense | 12 | 50727811 | A | G | 0.20/0.19 | 9 | 12539 | 29208 | 1.11 | 1.20 × 10−7 | 0.01 | |
| rs16888728 | missense | 8 | 117783975 | A | G | 0.11/0.10 | 8 | 10621 | 26779 | 1.15 | 1.40 × 10−7 | 0.01 | |
| rs3184504 | missense | 12 | 111884608 | G | A | 0.53/0.51 | 9 | 12530 | 29197 | 1.08 | 3.90 × 10−7 | 0.03 |
EAF – effect allele frequency.
Meta-analysis of gene-based (SKAT-O) tests.
| SetID | Gene | N of variants # | Description | Chr | band | p.value |
|---|---|---|---|---|---|---|
| (A) low frequency (MAF < 5%) variants (n = 16,585) | ||||||
| ENSG00000254245 | 89 | protocadherin gamma subfamily A, 3 | 5 | q31.3 | 7.29E-07 | |
| ENSG00000081853 | 90 | protocadherin gamma subfamily A, 2 | 5 | q31.3 | 7.49E-07 | |
| ENSG00000204956 | 91 | protocadherin gamma subfamily A, 1 | 5 | q31.3 | 7.86E-07 | |
| ENSG00000254221 | 82 | protocadherin gamma subfamily B, 1 | 5 | q31.3 | 1.43E-06 | |
| ENSG00000262576 | 79 | protocadherin gamma subfamily A, 4 | 5 | q31.3 | 2.91E-06 | |
| (B) High and Moderate low frequency (MAF < 5%) variants (n = 16,081) | ||||||
| ENSG00000254245 | 83 | protocadherin gamma subfamily A, 3 | 5 | q31.3 | 2.59E-06 | |
| ENSG00000081853 | 84 | protocadherin gamma subfamily A, 2 | 5 | q31.3 | 2.79E-06 | |
| ENSG00000204956 | 85 | protocadherin gamma subfamily A, 1 | 5 | q31.3 | 2.96E-06 | |
Top significant results for SKAT-O gene-based test for different subsets. We used Bonferroni correction to identify Exome-Wide level of significance for each of the subgroup separately. Only variants, which were observed in at least two independant studies, were included in the analysis. Genes with less than 2 variants per gene were exluded. Variants were defined High and Moderate accordind to classification adapted by SnpEff. # N of variants is based by the number of SNPs located within the genes and may vary by study, e.g. in case of monomorphic alleles.