| Literature DB >> 28544645 |
Salem Werdyani1, Yajun Yu1, Georgia Skardasi1, Jingxiong Xu2, Konstantin Shestopaloff3, Wei Xu2,3, Elizabeth Dicks4, Jane Green1,5, Patrick Parfrey4, Yildiz E Yilmaz1,4,6, Sevtap Savas1,5.
Abstract
INDELs and CNVs are structural variations that may play roles in cancer susceptibility and patient outcomes. Our objectives were a) to computationally detect and examine the genome-wide INDEL/CNV profiles in a cohort of colorectal cancer patients, and b) to examine the associations of frequent INDELs/CNVs with relapse-free survival time. We also identified unique variants in 13 Familial Colorectal Cancer Type X (FCCX) cases. The study cohort consisted of 495 colorectal cancer patients. QuantiSNP and PennCNV algorithms were utilized to predict the INDELs/CNVs using genome-wide signal intensity data. Duplex PCR was used to validate predictions for 10 variants. Multivariable Cox regression models were used to test the associations of 106 common variants with relapse-free survival time. Score test and the multivariable Cox proportional hazards models with time-varying coefficients were applied to identify the variants with time-varying effects on the relapse-free survival time. A total of 3486 distinct INDELs/CNVs were identified in the patient cohort. The majority of these variants were rare (83%) and deletion variants (81%). The results of the computational predictions and duplex PCR results were highly concordant (93-100%). We identified four promising variants significantly associated with relapse-free survival time (P < 0.05) in the multivariable Cox proportional hazards regression models after adjustment for clinical factors. More importantly, two additional variants were identified to have time-varying effects on the risk of relapse. Finally, 58 rare variants were identified unique to the FCCX cases; none of them were detected in more than one patient. This is one of the first genome-wide analyses that identified the germline INDEL/CNV profiles in colorectal cancer patients. Our analyses identified novel variants and genes that can biologically affect the risk of relapse in colorectal cancer patients. Additionally, for the first time, we identified germline variants that can potentially be early-relapse markers in colorectal cancer.Entities:
Keywords: CNVs; colorectal cancer; early-relapse markers; genetic variation; markers with time-varying effects; relapse-free survival
Mesh:
Year: 2017 PMID: 28544645 PMCID: PMC5463068 DOI: 10.1002/cam4.1074
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
Figure 1The main steps of the computational analysis that were used to detect, describe, and examine the INDELs/CNVs in the patient cohort.CNV, copy number variation; CNVR, CNV region; INDEL, insertion/deletion.
The baseline features of the patient cohort
| Features | Number | % |
|---|---|---|
| Sex | ||
| Female | 194 | 39.19 |
| Male | 301 | 60.81 |
| Age at diagnosis | ||
| <65 | 312 | 63.03 |
| ≥65 | 183 | 36.97 |
| Location | ||
| Colon | 328 | 66.26 |
| Rectum | 167 | 33.74 |
| Histology | ||
| Nonmucinous | 438 | 88.48 |
| Mucinous | 57 | 11.52 |
| Stage | ||
| I | 89 | 17.98 |
| II | 193 | 38.99 |
| III | 164 | 33.13 |
| IV | 49 | 9.90 |
| Grade | ||
| Well/moderately differentiated | 457 | 92.32 |
| Poorly differentiated | 34 | 6.87 |
| Unknown | 4 | 0.81 |
| Vascular invasion | ||
| Absent | 300 | 60.61 |
| Present | 158 | 31.92 |
| Unknown | 37 | 7.47 |
| Lymphatic invasion | ||
| Absent | 290 | 58.59 |
| Present | 166 | 33.54 |
| Unknown | 39 | 7.88 |
| MSI status | ||
| MSI‐L/MSS | 421 | 85.05 |
| MSI‐H | 53 | 10.71 |
| Unknown | 21 | 4.24 |
| Tumor | ||
| Absent | 402 | 81.21 |
| Present | 47 | 9.49 |
| Unknown | 46 | 9.29 |
MSI‐H, microsatellite instability‐high; MSI‐L, microsatellite instability‐low, MSS, microsatellite stable.
The main features of the distinct, high‐confidence INDELs/CNVs identified in the study cohort
| Variable | Number | |
|---|---|---|
| Total number of distinct INDELs/CNVs | 3486 | |
| Mean distinct INDEL/CNV length | 35,187 bps | |
| Length | Number | % |
| INDELs | 360 | 10.33 |
| CNVs | 3126 | 89.67 |
| Frequency | Number | % |
| Rare INDELs/CNVs (< 5% of the patients) | 2891 | 82.93 |
| Common INDELs/CNVs (≥ 5% of the patients) | 595 | 17.07 |
| Number of INDELs/CNVs per CN state | Number | % |
| INDELs/CNVs with two CN states |
|
|
| (CN = 0) Two copy deletion | 685 | 19.65 |
| (CN = 1) One copy deletion | 1596 | 45.78 |
| (CN = 3) One copy duplication | 607 | 17.41 |
| (CN = 4) Two or more copy duplication | 17 | 0.49 |
| INDELs/CNVs with multiple CN states |
|
|
| A. INDELs/CNVs with three CN states |
|
|
| CN = 0 or 1 | 543 | 15.58 |
| CN = 0 or 3 | 7 | 0.20 |
| CN = 0 or 4 | 2 | 0.06 |
| CN = 1 or 3 | 13 | 0.37 |
| CN = 3 or 4 | 12 | 0.34 |
| B. INDELs/CNVs with four CN states |
|
|
| CN = 0, 3 or 4 | 1 | 0.03 |
| CN = 0, 1 or 4 | 1 | 0.03 |
| CN = 0, 1 or 3 | 2 | 0.06 |
CN, Copy number state; CNV, copy number variation; INDEL, insertion/deletion.
The “normal” CN state of 2 copies is not shown.
Figure 2Distribution of the number of predicted INDELs/CNVs in the patient cohort. CNV, copy number variation; INDEL, insertion/deletion.
Genes possibly affected by the INDELs/CNVs
| Affected genes | Numbers |
|---|---|
| Genes completely covered by INDELs/CNVs | 659 |
| Genes partially overlapped with INDELs/CNVs | 880 |
| Genes completely or partially overlapped with different INDELs/CNVs | 134 |
CNV, copy number variation; INDEL, insertion/deletion.
Figure 3PANTHER database results showing the major biological pathways possibly affected by the INDELs/CNVs. CNV, copy number variation; INDEL, insertion/deletion.
Results of the Cox regression models with time‐varying coefficients for the three variants that violated the proportionality assumption
| Time‐point (years) | Variables in the model | HR | 95% CI for HR (lower) | 95% CI for HR (higher) |
|
|
|---|---|---|---|---|---|---|
| 4.3 | Stage (II vs. I) | 1.433 | 0.856 | 2.398 | 0.171 | 0.588 |
| Stage (III vs. I) | 2.266 | 1.374 | 3.736 |
| 0.568 | |
| Stage (IV vs. I) | 5.950 | 3.441 | 10.289 |
| 0.146 | |
| Location (Rectum vs. colon) | 1.411 | 1.046 | 1.904 |
| 0.111 | |
| MSI status (MSI‐H vs. MSS/MSI‐L) | 0.327 | 0.152 | 0.708 |
| 0.230 | |
|
| ||||||
| Before the time‐point | 1.400 | 0.848 | 2.310 | 0.188 | 0.906 | |
| After the time‐point | 0.159 | 0.022 | 1.153 | 0.069 | 0.898 | |
| 2.6 | Stage (II vs. I) | 1.502 | 0.899 | 2.509 | 0.120 | 0.832 |
| Stage (III vs. I) | 2.390 | 1.450 | 3.940 |
| 0.800 | |
| Stage (IV vs. I) | 6.591 | 3.807 | 11.412 |
| 0.082 | |
| Location (Rectum vs. colon) | 1.419 | 1.051 | 1.916 |
| 0.183 | |
| MSI status (MSI‐H vs. MSS/MSI‐L) | 0.315 | 0.145 | 0.683 |
| 0.206 | |
|
| ||||||
| Before the time‐point | 2.108 | 1.317 | 3.373 |
| 0.794 | |
| After the time‐point | 0.726 | 0.423 | 1.245 | 0.244 | 0.864 | |
| 2.8 | Stage (II vs. I) | 1.477 | 0.883 | 2.470 | 0.138 | 0.678 |
| Stage (III vs. I) | 2.354 | 1.428 | 3.879 |
| 0.693 | |
| Stage (IV vs. I) | 5.952 | 3.448 | 10.274 |
| 0.086 | |
| Location (Rectum vs. colon) | 1.421 | 1.052 | 1.919 |
| 0.103 | |
| MSI status (MSI‐H vs. MSS/MSI‐L) | 0.323 | 0.149 | 0.700 |
| 0.224 | |
|
| ||||||
| Before the time‐point | 0.416 | 0.182 | 0.955 |
| 0.770 | |
| After the time‐point | 1.511 | 0.927 | 2.463 | 0.098 | 0.848 |
Chr, chromosome; CI, confidence interval; CN, copy number state; HR, hazard ratio; MSI‐H, microsatellite instability‐high; MSI‐L, microsatellite instability‐low; MSS, microsatellite stable; PH, proportional hazards; P < 0.05 are bolded.
Genes that overlap with the variants are shown in parentheses.
Variants that satisfied the proportionality assumption and significantly associated with the relapse‐free survival time
| Gene | Variant |
| HR | 95% CI (lower) | 95% CI (higher) |
|---|---|---|---|---|---|
|
| Chr_1_92232111_92233227 (0 CN vs. 2 CN) | 0.0454 | 0.5211 | 0.2752 | 0.9867 |
|
| Chr_3_99628822_99629567 (0 CN vs. 1 or 2 CN) | 0.015 | 1.6936 | 1.1076 | 2.5896 |
|
| Chr_3_192875738_192885153 (0 CN vs. 2 or 4 CN) | 0.0394 | 1.3586 | 1.0149 | 1.8186 |
|
| Chr_7_89810608_89812114 (0 CN vs. 2 CN) | 0.0372 | 0.5776 | 0.3447 | 0.968 |
Chr, chromosome; CI, confidence interval; CN, copy number state; HR, hazard ratio.