| Literature DB >> 26751797 |
Sanford D Markowitz1,2,3, Nora L Nock4,3, Stephanie L Schmit5,6, Zsofia K Stadler7, Vijai Joseph7, Lu Zhang4, Joseph E Willis8,3, Peter Scacheri2,3, Martina Veigl3, Mark D Adams2, Leon Raskin9, John F Sullivan7, Kelly Stratton7, Jinru Shia10, Nathan Ellis11, Hedy S Rennert12,13,14, Christopher Manschreck7, Li Li4,3, Kenneth Offit7, Robert C Elston4,3, Gadi Rennert12,13,14, Stephen B Gruber5,6.
Abstract
We tested for germline variants showing association to colon cancer metastasis using a genome-wide association study that compared Ashkenazi Jewish individuals with stage IV metastatic colon cancers versus those with stage I or II non-metastatic colon cancers. In a two-stage study design, we demonstrated significant association to developing metastatic disease for rs60745952, that in Ashkenazi discovery and validation cohorts, respectively, showed an odds ratio (OR) = 2.3 (P = 2.73E-06) and OR = 1.89 (P = 8.05E-04) (exceeding validation threshold of 0.0044). Significant association to metastatic colon cancer was further confirmed by a meta-analysis of rs60745952 in these datasets plus an additional Ashkenazi validation cohort (OR = 1.92; 95% CI: 1.28-2.87), and by a permutation test that demonstrated a significantly longer haplotype surrounding rs60745952 in the stage IV samples. rs60745952, located in an intergenic region on chromosome 4q31.1, and not previously associated with cancer, is, thus, a germline genetic marker for susceptibility to developing colon cancer metastases among Ashkenazi Jews.Entities:
Mesh:
Year: 2016 PMID: 26751797 PMCID: PMC4709047 DOI: 10.1371/journal.pone.0146435
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of the Colon Cancer Study Populations.
| Discovery Dataset | Validation Dataset #1 | Validation Dataset #2 | |
|---|---|---|---|
| 323 | 462 | 343 | |
| Mean (s.d.) | 71.16 (10.33) | 72.24 (10.39) | 64.97 (11.93) |
| Males | 169 (52.32%) | 247 (53.46%) | 187 (54.52%) |
| Females | 154 (47.68%) | 215 (46.54%) | 156 (45.48%) |
| 89 (27.55%) | 89 (19.26%) | 86 (25.07%) | |
| 234 (72.45%) | 373 (80.74%) | 257 (74.93%) | |
| AA (or TT) Genotype | 170 (72.65%) | 254 (68.28%) | 173 (67.58%) |
| GA (or CT) Genotype | 63 (26.92%) | 110 (29.57%) | 77 (30.08%) |
| GG (or CC) Genotype | 1 (0.43%) | 8 (2.15%) | 6 (2.34%) |
| MAF | 0.14 | 0.17 | 0.17 |
| AA (or TT) Genotype | 45 (50.56%) | 50 (56.82%) | 55 (63.95%) |
| GA (or CT) Genotype | 35 (39.33%) | 31 (35.23%) | 28 (32.56%) |
| GG (or CC) Genotype | 9 (10.11%) | 8 (7.95%) | 3 (3.49%) |
| MAF | 0.30 | 0.26 | 0.20 |
| AA (or TT) Genotype | 215 (66.56%) | 304 (65.94%) | 228 (66.67%) |
| GA (or CT) Genotype | 98 (30.34%) | 141 (30.59%) | 105 (30.70%) |
| GG (or CC) Genotype | 10 (3.10%) | 16 (3.47%) | 9 (2.63%) |
| MAF | 0.18 | 0.19 | 0.18 |
| MAF | 0.17 | 0.16 | - |
1 MAF = Minor allele frequency
2 rs60745952 could not be genotyped in 1 stage I/II case
Associations Between SNPs and Stage IV vs. Stage I/II Colon Cancers for Top 20 SNPs Selected from Discovery Dataset for Replication.
| Discovery Dataset | Validation Dataset #1 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SNP | Chr | BP | Nearest Gene | A1 | MAF | OR | 95% CI | P-value | A1 | MAF | OR | 95% CI | P-value |
| rs2024846 | 6 | 85575780 | TBX18 | LOC100289423 | A | 0.20 | 2.62 | (1.76, 3.92) | 1.20E-06 | A | 0.27 | 0.90 | (0.62, 1.30) | 2.15E-01 |
| 4 | 149743890 | NR3C2 | LOC100287246 3 | G | 0.14 | 2.83 | (1.81, 4.44) | 2.73E-06 | G | 0.16 | 1.89 | (1.27, 2.82) | 9.12E-04 | |
| 4 | 149748994 | NR3C2 | LOC100287246 | G | 0.14 | 2.83 | (1.81, 4.44) | 2.73E-06 | C | 0.17 | 1.89 | (1.27, 2.80) | 8.05E-04 | |
| rs73351705 | 8 | 114783818 | CSMD3 | TRPS1 | A | 0.03 | 5.53 | (2.54, 12.05) | 8.49E-06 | A | 0.05 | 1.25 | (0.61, 2.57) | 2.71E-01 |
| rs77801238 | 8 | 114825951 | CSMD3 | TRPS1 | G | 0.01 | 8.87 | (3.27, 24.11) | 9.35E-06 | C | 0.04 | 1.67 | (0.79, 3.51) | 8.97E-02 |
| rs74602504 | 8 | 114823103 | CSMD3 | TRPS1 3 | G | 0.01 | 8.87 | (3.27, 24.11) | 9.35E-06 | NA | NA | NA | NA | NA |
| rs76047883 | 4 | 149768284 | NR3C2 | LOC100287246 | A | 0.13 | 2.68 | (1.70, 4.23) | 1.18E-05 | NA | NA | NA | NA | NA |
| rs7737423 | 5 | 123132486 | CSNK1G3 | ZNF608 | A | 0.04 | 3.78 | (2.04, 7.00) | 1.26E-05 | NA | NA | NA | NA | NA |
| rs78793716 | 11 | 124941849 | SLC37A2 | A | 0.06 | 3.69 | (2.01, 6.77) | 1.30E-05 | A | 0.08 | 0.64 | (0.33, 1.27) | 3.98E-01 |
| rs13058496 | 22 | 17398812 | LOC440786 | G | 0.23 | 2.18 | (1.51, 3.13) | 1.38E-05 | G | 0.24 | 1.20 | (0.83, 1.73) | 1.72E-01 |
| rs6705378 | 2 | 240193889 | HDAC4 | G | 0.43 | 2.24 | (1.53, 3.26) | 1.44E-05 | G | 0.46 | 0.97 | (0.70, 1.34) | 7.70E-02 |
| rs11068687 | 12 | 118256497 | KSR2 | A | 0.18 | 2.35 | (1.58, 3.52) | 1.48E-05 | T | 0.20 | 1.04 | (0.70, 1.54) | 4.24E-01 |
| rs7669737 | 4 | 162361217 | FSTL5 | G | 0.38 | 2.16 | (1.50, 3.11) | 1.59E-05 | G | 0.42 | 1.21 | (0.87, 1.68) | 1.26E-01 |
| rs62377104 | 5 | 123074679 | CSNK1G3 | ZNF608 3 | C | 0.05 | 3.60 | (1.95, 6.63) | 2.00E-05 | G | 0.05 | 1.01 | (0.51, 2.00) | 4.87E-01 |
| rs1556055 | 10 | 17523142 | ST8SIA6 | PTPLA | G | 0.20 | 2.40 | (1.58, 3.64) | 2.08E-05 | G | 0.24 | 1.03 | (0.70, 1.50) | 4.47E-01 |
| rs16962543 | 19 | 29578972 | LOC148145 | UQCRFS1 | A | 0.06 | 3.34 | (1.87, 5.96) | 2.17E-05 | A | 0.10 | 0.60 | (0.31, 1.15) | 4.37E-01 |
| rs12367527 | 12 | 59398014 | LRIG3 | SLC16A7 | A | 0.09 | 2.62 | (1.65, 4.16) | 2.26E-05 | A | 0.13 | 0.83 | (0.50, 1.39) | 2.55E-01 |
| rs308420 | 4 | 123767943 | FGF2 | A | 0.04 | 4.33 | (2.14, 8.75) | 2.27E-05 | A | 0.06 | 0.97 | (0.49, 1.92) | 3.76E-02 |
| rs62451177 | 6 | 85559189 | TBX18 | LOC100289423 | G | 0.10 | 2.88 | (1.73, 4.80) | 2.29E-05 | NA | NA | NA | NA | NA |
| rs2203879 | 2 | 22186722 | LOC100129278 | KLHL29 | A | 0.02 | 6.47 | (2.63, 15.88) | 2.36E-05 | NA | NA | NA | NA | NA |
1 Adjusted for age, sex and first 2 principal components (PCs); analyses conducted using a log-additive genetic model in PLINK
2 MAF = Minor allele (A1) frequency in Stage I/II cases; 3 Redundant SNPs.
Fig 1Manhattan Plot of the Discovery Dataset for Associations Between SNPs and Stage IV vs. Stage I/II Colon Cancers.
The vertical axis indicates the (-log10 transformed) observed P-value and the horizontal axis indicates the chromosomal position of each SNP. Arrow denotes position of rs60745952 and rs72737810, which are not individually discernible at this scale of presentation.
Fig 2Regional Plot of loci on chromosomal region 4 (4q31.1) associated with Stage IV vs. Stage I/II colon cancer in Discovery Dataset.
The horizontal axis shows SNPs along the chromosomal region and the left vertical axis shows (-log10 transformed) observed P-value. SNPs near the most significant SNP (rs60745952) are color coded to depict their LD with this SNP (derived as pairwise R2 values from HapMap CEU data). rs60745952, which is shown in the purple circle, is in strong LD with rs72737820, which is illustrated by the red circle. Estimated recombination rates from HapMap are plotted on the right vertical axis in cyan to reflect the local LD structure.
Fig 3Summary of Association and Test of Homogeneity Results (a) and Forest Plots (b) for the Meta-Analysis evaluating the rs60745952 SNP and Stage IV vs. Stage I and II Colon Cancers in the Discovery and Validation #1 and Validation #2 data sets under a fixed effects and random effects model.
Fig 4LD Plots for rs60745952 in Stage IV and Stage I/II, in Discovery and Validation #1 Datasets.
Vertical axis is the LD as D’ for the rs60745952 SNP (blue line) and other SNPs up and downstream of this location available on the genotyping arrays. Red dots indicate SNPs that were on the genotyping arrays in both the Discovery Dataset and Validation Dataset #1.
Number of SNPs with D’ = 1.0 in a Selected Chromosomal Region for Stage IV and Stage I/II Colon Cancers in Discovery and Validation #1 Datasets.
| Stage I/II | Stage IV | Difference | |||||
|---|---|---|---|---|---|---|---|
| Observed | S.D. | Observed | S.D. | Observed | S.D. | P-value | |
| Discovery | 41 | 2.76 | 58 | 3.42 | 17 | 4.39 | 2.7 x 10−5 |
| Validation Dataset #1 | 52 | 1.23 | 65 | 3.27 | 13 | 3.50 | 5.0 x 10−5 |
1 SD = √(V1+V2), V1 and V2 were estimated from the permutation distribution. See S4 File for details.
2 P-value from a one-sided test.