| Literature DB >> 25990418 |
Nada A Al-Tassan1, Nicola Whiffin2, Fay J Hosking2, Claire Palles3, Susan M Farrington4, Sara E Dobbins2, Rebecca Harris5, Maggie Gorman3, Albert Tenesa6, Brian F Meyer1, Salma M Wakil1, Ben Kinnersley2, Harry Campbell7, Lynn Martin3, Christopher G Smith5, Shelley Idziaszczyk5, Ella Barclay3, Timothy S Maughan8, Richard Kaplan9, Rachel Kerr10, David Kerr11, Daniel D Buchanan12, Daniel D Buchannan, Aung Ko Win13, John Hopper13, Mark Jenkins13, Noralane M Lindor14, Polly A Newcomb15, Steve Gallinger16, David Conti17, Fred Schumacher17, Graham Casey17, Malcolm G Dunlop4, Ian P Tomlinson3, Jeremy P Cheadle5, Richard S Houlston2.
Abstract
Genome-wide association studies (GWAS) of colorectal cancer (CRC) have identified 23 susceptibility loci thus far. Analyses of previously conducted GWAS indicate additional risk loci are yet to be discovered. To identify novel CRC susceptibility loci, we conducted a new GWAS and performed a meta-analysis with five published GWAS (totalling 7,577 cases and 9,979 controls of European ancestry), imputing genotypes utilising the 1000 Genomes Project. The combined analysis identified new, significant associations with CRC at 1p36.2 marked by rs72647484 (minor allele frequency [MAF] = 0.09) near CDC42 and WNT4 (P = 1.21 × 10(-8), odds ratio [OR] = 1.21 ) and at 16q24.1 marked by rs16941835 (MAF = 0.21, P = 5.06 × 10(-8); OR = 1.15) within the long non-coding RNA (lncRNA) RP11-58A18.1 and ~500 kb from the nearest coding gene FOXL1. Additionally we identified a promising association at 10p13 with rs10904849 intronic to CUBN (MAF = 0.32, P = 7.01 × 10(-8); OR = 1.14). These findings provide further insights into the genetic and biological basis of inherited genetic susceptibility to CRC. Additionally, our analysis further demonstrates that imputation can be used to exploit GWAS data to identify novel disease-causing variants.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25990418 PMCID: PMC4438486 DOI: 10.1038/srep10442
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Genome-wide P-values (–log10P, y-axis) plotted against their respective chromosomal positions (x-axis).
Known regions attaining genome-wide significance (i.e. P = 5.0 × 10−8) are labelled with their chromosomal location. Variants in grey lie in novel regions that reach the significance threshold level (P = 1.0 × 10−7) required for variants to be analysed further in this study. Variants in black lie in novel regions attaining genome-wide significance.
Summary statistics for variants showing an association with CRC risk at < 1.0 × 10 .
| 1p36.12 | rs72647484 | 22,587,728 | TC | 0.91 | 0.94 (0.85-0.99) | 3.25 ×10−2 | 3.32 ×10−5 | 4.99 ×10−2 | 4.08 ×10−2 | 4.58 ×10−2 | 3.47 ×10−2 | 1.24 (1.15-1.33) | 1.21 ×10-8 | 0.33 | |
| 5q15 | rs202110856 | 96,129,872 | GGC | 0.99 | 0.79 (0.66-0.92) | 2.97 ×10−1 | 5.96 ×10-8 | 2.81 ×10−2 | 4.43 ×10−1 | 3.35 ×10−1 | 3.67 ×10−1 | 1.51 (1.23-1.86) | 6.67 ×10-8 | 0.13 | |
| 10p13 | rs10904849 | 16,997,266 | GT | 0.68 | 0.98 (0.97-1.00) | 2.90 ×10−2 | 3.39 ×10−1 | 2.36 ×10−2 | 8.68 ×10−3 | 7.73 ×10−2 | 1.29 ×10−3 | 1.13 (1.08-1.19) | 7.01×10-8 | 0.83 | |
| 16p13.2 | rs79900961 | 9,297,812 | GA | 0.98 | 0.70 (0.61-0.74) | 2.21 ×10−1 | 8.68 ×10−2 | 1.04 ×10−3 | 2.54 ×10−2 | 2.41 ×10−1 | 1.02 ×10−3 | 1.49 (1.26-1.76) | 4.93 ×10-8 | 0.76 | |
| 16q24.1 | rs16941835 | 86,695,720 | GC | 0.21 | 0.97 (0.92-0.99) | 1.04 ×10−1 | 1.17 ×10−1 | 1.57 ×10−4 | 3.74 ×10−3 | 1.25 ×10−2 | 3.65 ×10−1 | 1.16 (1.09-1.22) | 5.06 ×10-8 | 0.40 | |
For each variant shown along with meta-analysis test statistics are the P-values from the six individual studies and imputation Information scores. Risk alleles are given in bold.
INFO, imputation Information score; P-het, P-value of heterogeneity between studies; RAF, risk allele frequency.
Figure 2Forest plot of the odds ratios for the association between rs72647484, rs16941835, rs10904849 and CRC.
Studies were weighted according to the inverse of the variance of the log of the OR calculated by unconditional logistic regression. Horizontal lines: 95% confidence intervals (95% CI). Box: OR point estimate; its area is proportional to the weight of the study. Diamond (and broken line): overall summary estimate, with confidence interval given by its width. Unbroken vertical line: null value (OR = 1.0).
Figure 3Regional plot of association results and recombination rates for the (a) 1p36.12, (b) 10p13 and (c) 16q24.1 risk loci. Association results of both genotyped (triangles) and imputed (circles) SNPs in the GWAS samples and recombination rates within the loci at 1p36.12 (a), 10p13 (b) and 16q24 (c). For each plot, −log10 P values (y axis) of the SNPs are shown according to their chromosomal positions (x axis). The top imputed SNP in each combined analysis is shown as a large triangle and is labelled by its rsID. The colour intensity of each symbol reflects the extent of LD with the top SNP: white (r2 = 0) through to dark red (r2 = 1.0), with r2 estimated from the 1000 Genomes Phase 1 data. Genetic recombination rates (cM/Mb), are shown with a light blue line. Physical positions are based on NCBI build 37 of the human genome. Also shown are the relative positions of genes and transcripts mapping to each region of association. The lower panel shows the chromatin state segmentation track (ChromHMM).