| Literature DB >> 30738427 |
Michelle E Penney1, Patrick S Parfrey2, Sevtap Savas1,3, Yildiz E Yilmaz4,5,6.
Abstract
BACKGROUND: Differentiating between cancer patients who will experience metastasis within a short time and who will be long-term survivors without metastasis is a critical aim in healthcare. The microsatellite instability (MSI)-high tumor phenotype is such a differentiator in colorectal cancer, as patients with these tumors are unlikely to experience metastasis. Our aim in this study was to determine if germline genetic variations could further differentiate colorectal cancer patients based on the long-term risk and timing of metastasis.Entities:
Keywords: Colorectal cancer; Genome-wide association study; Mixture cure model; Single nucleotide polymorphisms; Time-to-metastasis
Year: 2019 PMID: 30738427 PMCID: PMC6368959 DOI: 10.1186/s12885-019-5346-5
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Kaplan-Meier survival functions stratified by microsatellite instability (MSI) status. Kaplan-Meier survival functions stratified according to MSI status for the sub-cohort excluding stage IV patients and patients with unknown MSI tumor data (n = 431). MSI-H: microsatellite instability high; MSI-L: microsatellite instability low; MSS: microsatellite stable
Fig. 2Methods of analysis used in this study. For each SNP, all four genetic models were considered: additive, dominant, recessive, and co-dominant. However, for some SNPs, the number of patients in a genotype category was zero or very small (<2 patients) when the recessive (for 64,809 SNPs) and co-dominant (for 75,912 SNPs) genetic models were applied. As such, these SNPs were not analyzed under these specific genetic models. HR: hazard ratio; MAF: minor allele frequency; OR: odds ratio; p: metastasis-free survival probability
Baseline characteristics of the patient cohort (n = 379) including metastasis proportion
| Variable | Number of patientsa | % total | Number with metastasis | % metastasis |
|---|---|---|---|---|
| Sex | ||||
| Female | 139 | 36.7% | 29 | 20.9% |
| Male | 240 | 63.3% | 52 | 21.7% |
| Age | ||||
| ≤60 | 157 | 41.4% | 41 | 26.1% |
| 60–70 | 154 | 40.6% | 29 | 18.8% |
| > 70 | 68 | 17.9% | 11 | 16.2% |
| Familial risk | ||||
| Low | 196 | 51.7% | 34 | 17.3% |
| Intermediate/High | 183 | 48.3% | 47 | 25.7% |
| 5-FU based treatment | ||||
| 5-FU treated | 214 | 56.5% | 59 | 27.6% |
| Other/No chemo | 159 | 42.0% | 17 | 10.7% |
| Unknown | 6 | 1.6% | 5 | 83.3% |
| Stage | ||||
| I | 81 | 21.4% | 8 | 9.9% |
| II | 158 | 41.7% | 30 | 19.0% |
| III | 140 | 36.9% | 43 | 30.7% |
| Location | ||||
| Colon | 233 | 61.5% | 41 | 17.6% |
| Rectum | 146 | 38.5% | 40 | 27.4% |
| Histology | ||||
| Non-mucinous | 343 | 90.5% | 75 | 21.9% |
| Mucinous | 36 | 9.5% | 6 | 16.7% |
| Vascular invasion | ||||
| Absence | 242 | 63.9% | 45 | 18.6% |
| Presence | 111 | 29.3% | 30 | 27.0% |
| Unknown | 26 | 6.9% | 6 | 23.1% |
| Lymphatic invasion | ||||
| Absence | 237 | 62.5% | 44 | 18.6% |
| Presence | 116 | 30.6% | 31 | 26.7% |
| Unknown | 26 | 6.9% | 6 | 23.1% |
| Absence | 333 | 87.9% | 72 | 21.6% |
| Presence | 19 | 5.0% | 8 | 42.1% |
| Unknown | 27 | 7.1% | 1 | 3.7% |
a Patients with MSI-H tumors and Stage IV patients were excluded. 5-FU: 5-fluorouracil
Fig. 3Kaplan-Meier survival function for the most significant SNPs in the multivariable analysis under the (a) mixture cure model and (b) Cox proportional hazards regression model. n: number of patients in that genotype category; d: number of metastasis in that genotype category. a rs5749032 was the only SNP maintaining genome-wide significance after the multivariable analysis using the mixture cure model. In the rs5749032 GG genotype subgroup, the clear plateau at approximately 80% metastasis-free survival probability indicates the existence of a large proportion of long-term metastasis-free survivors. b In the rs2327990 TT genotype subgroup, all the patients experienced metastasis within approximately the first two years. Therefore, a standard survival analysis method is appropriate
Results from the multivariablea analysis using the mixture cure model on the significant SNPs identified by the univariable mixture cure model
| Genomic location | Genetic model | rs number (genotypes | Genotype freq. | Metastasis probability | Time-to-metastasis | ||||
|---|---|---|---|---|---|---|---|---|---|
| OR | 95% CI | HR | 95% CI | ||||||
| 22:17793969 | Recessive | rs5749032 (GG vs. AA + AG) | 14% | 0.38 | 0.14–1.07 | 0.066 | 15.86 | 6.83–36.83 | 1.28 × 10−10 |
| 17:77361176 | Co-Dominant | rs12949587 (CT vs. CC) | 20% | 0.66 | 0.32–1.37 | 0.261 | 7.56 | 3.44–16.61 | 4.63 × 10−7 |
| 20:15111138 | Co-Dominant | rs6110524 (AG vs. GG) | 17% | 0.95 | 0.44–2.04 | 0.887 | 4.80 | 2.00–11.53 | 4.52 × 10−4 |
| 7:33913404 | Recessive | rs3815652 (TT vs. CC + CT) | 4% | 0.59 | 0.13–2.65 | 0.488 | 12.97 | 3.26–51.66 | 2.78 × 10−4 |
| 14:100691178 | Recessive | rs756055 (CC vs. TT + TC) | 13% | 0.28 | 0.10–0.82 | 0.020 | 7.58 | 2.53–22.65 | 2.90 × 10−4 |
| 14:100730920 | Recessive | rs7153665 (AA vs. GG + AG) | 13% | 0.28 | 0.10–0.82 | 0.020 | 7.58 | 2.53–22.65 | 2.90 × 10−4 |
| 11:100430053 | Recessive | rs4754687 (AA vs. CC + CA) | 11% | 0.51 | 0.18–1.43 | 0.201 | 8.13 | 2.59–25.53 | 3.28 × 10−4 |
| 5:155345221 | Dominant | rs2163746 (CT + CC vs. TT) | 24% | 0.49 | 0.23–1.07 | 0.075 | 9.65 | 3.67–25.37 | 4.29 × 10−6 |
| 5:155361116 | Dominant | rs17053011 (TG + TT vs. GG) | 24% | 0.49 | 0.23–1.07 | 0.075 | 9.65 | 3.67–25.37 | 4.29 × 10−6 |
aAdjusted for the significant baseline characteristics: tumor location, 5-fluorouracil treatment status, and tumor stage. Each SNP was analyzed separately adjusting for these factors. Patients with missing data were excluded, resulting in the inclusion of 349 stage I-III patients with MSI-L/MSS tumors
Linkage disequilibrium (LD) calculations indicated that rs756055 and rs7153665 as well as rs2163746 and rs17053011 are in complete pairwise LD (r2 = 1)
The SNPs listed yielded similar hazard ratio estimates under the univariable (Additional file 1: Table S3) and multivariable analyses. Consequently, all of the SNPs identified in this study could be considered independent prognostic factors for time-to-metastasis in colorectal cancer if the results are replicated using independent cohort data
Genotype freq. frequency of genotype a calculated from the patient cohort, OR odds ratio for metastasis comparing odds of metastasis in subgroup a with that in subgroup b, HR hazard ratio comparing metastasis rate in subgroup a with that in subgroup b among those who are susceptible to metastasis, CI confidence interval
Genotypes significantly associated with time-to-metastasis after adjusting for significant baseline characteristics identified in the Cox proportional hazards regression model
| Genomic location | rs number (genotypes | Genotype freq. | Univariable | Multivariablea | ||||
|---|---|---|---|---|---|---|---|---|
| HR | 95% CI | HR | 95% CI | |||||
| 20:16189263 | rs2327990 (TT vs. CC + CT) | 1.3% | 21.97 | 8.42–57.33 | 2.74 × 10− 10 | 22.58 | 8.32–61.31 | 9.59 × 10− 10 |
| 3:134513356 | rs11918092 (CC vs. AA + AC) | 0.5% | 216.98 | 35.64–1321.13 | 5.32 × 10− 9 | 535.33 | 63.20–4534.30 | 8.23 × 10− 9 |
| 3:134515336 | rs3732568 (AA vs. CC + CA) | 0.5% | 216.98 | 35.64–1321.13 | 5.32 × 10− 9 | 535.33 | 63.20–4534.30 | 8.23 × 10− 9 |
| 3:59930672 | rs2366964 (CC vs. TT + TC) | 0.8% | 41.19 | 11.81–143.66 | 5.40 × 10−9 | 56.53 | 14.98–213.26 | 2.59 × 10− 9 |
| 2:6769988 | rs1563948 (AA vs. GG + GA) | 0.8% | 34.43 | 10.35–114.58 | 7.97 × 10−9 | 33.97 | 9.57–120.54 | 4.87 × 10− 8 |
| 2:6773920 | rs11694697 (TT vs. CC + CT) | 0.8% | 34.43 | 10.35–114.58 | 7.97 × 10−9 | 33.97 | 9.57–120.54 | 4.87 × 10− 8 |
| 2:6777992 | rs11692570 (TT vs. CC + CT) | 0.8% | 34.43 | 10.35–114.58 | 7.97 × 10−9 | 33.97 | 9.57–120.54 | 4.87 × 10− 8 |
| 2:6779277 | rs2219613 (TT vs. CC + CT) | 0.8% | 34.43 | 10.35–114.58 | 7.97 × 10−9 | 33.97 | 9.57–120.54 | 4.87 × 10−8 |
| 6:91187510 | rs1145724 (GG vs. AA + AG) | 0.8% | 30.76 | 9.27–102.03 | 2.14 × 10−8 | 36.43 | 10.21–129.93 | 3.00 × 10−8 |
aAdjusted for tumor location, 5-fluorouracil treatment status, BRAF V600E somatic mutation status, and tumor stage. Each SNP was analyzed separately adjusting for these factors. Patients with missing data were excluded, resulting in the inclusion of 349 stage I-III patients with MSI-L/MSS tumors
LD calculations indicated that rs11918092 and rs3732568 are in high pairwise LD (r2 = 0.96). In addition, rs1563948, rs11694697, rs11692570, and rs2219613 are all highly linked to each other (0.94 ≤ r2 ≤ 1)
The SNPs listed yielded similar risk estimates under the univariable and multivariable analyses. Consequently, all of the SNPs identified in this study could be considered independent prognostic factors for time-to-metastasis in colorectal cancer if the results are replicated using independent cohort data
Genotype freq. frequency of genotype a calculated from the patient cohort, HR hazard ratio comparing metastasis rate in subgroup a with that in subgroup b, CI confidence interval
Variant information for the significant genotypes in the multivariable mixture cure and Cox proportional hazards regression models
| Genomic location | rs number (genotypea) | MAFb | Statistical modelc | Type of variant (gene)d | DNA binding evidencee |
|---|---|---|---|---|---|
| 22:17793969 | rs5749032 (GG) | 40% | Mixture cure | Intergenic | ND |
| 20:16189263 | rs2327990 (TT) | 11% | Cox proportional hazards | Intergenic | Less likely to affect binding |
| 3:134513356 | rs11918092 (CC) | 8% | Cox proportional hazards | Intronic ( | Minimal binding evidence |
| 3:134515336 | rs3732568 (AA) | 8% | Cox proportional hazards | Intronic ( | Minimal binding evidence |
| 3:59930672 | rs2366964 (CC) | 8% | Cox proportional hazards | Intronic ( | ND |
| 2:6769988 | rs1563948 (AA) | 11% | Cox proportional hazards | Intronic ( | Minimal binding evidence |
| 2:6773920 | rs11694697 (TT) | 11% | Cox proportional hazards | Intronic ( | ND |
| 2:6777992 | rs11692570 (TT) | 11% | Cox proportional hazards | Intronic ( | Minimal binding evidence |
| 2:6779277 | rs2219613 (TT) | 11% | Cox proportional hazards | Intronic ( | Minimal binding evidence |
| 6:91187510 | rs1145724 (GG) | 9% | Cox proportional hazards | Intergenic | Minimal binding evidence |
a Risk increasing/decreasing genotype, b MAF calculated from patient cohort analyzed. Values comparable to CEU population based on 1000 Genomes Project Phase 3 28 data obtained through the Ensembl database (http://grch37.ensembl.org/), c Statistical model identifying the association, d based on Ensembl database [25], e based on RegulomeDB database [26]. ND: no data
Fig. 4Known and hypothesized relationships between the identified SNPs, genes they are located in, and the risk of metastasis