| Literature DB >> 33193579 |
Tianyi You1, Kai Song1, Wenbing Guo1, Yelin Fu1, Kai Wang1, Hailong Zheng1, Jing Yang1, Liangliang Jin1, Lishuang Qi1, Zheng Guo1,2,3, Wenyuan Zhao1.
Abstract
A part of colorectal cancer which is characterized by simultaneous numerous hypermethylation CpG islands sites is defined as CpG island methylator phenotype (CIMP) status. Stage II and III CIMP-positive (CIMP+) right-sided colon cancer (RCC) patients have a better prognosis than CIMP-negative (CIMP-) RCC treated with surgery alone. However, there is no gold standard available in defining CIMP status. In this work, we selected the gene pairs whose relative expression orderings (REOs) were associated with the CIMP status, to develop a qualitative transcriptional signature to individually predict CIMP status for stage II and III RCC. Based on the REOs of gene pairs, a signature composed of 19 gene pairs was developed to predict the CIMP status of RCC through a feature selection process. A sample is predicted as CIMP+ when the gene expression orderings of at least 12 gene pairs vote for CIMP+; otherwise the CIMP-. The difference of prognosis between the predicted CIMP+ and CIMP- groups was more significantly different than the original CIMP status groups. There were more differential methylation and expression characteristics between the two predicted groups. The hierarchical clustering analysis showed that the signature could perform better for predicting CIMP status of RCC than current methods. In conclusion, the qualitative transcriptional signature for classifying CIMP status at the individualized level can predict outcome and guide therapy for RCC patients.Entities:
Keywords: CpG island methylator phenotype; gene pairs; relative expression ordering; right-sided colon cancer; the qualitative transcriptional signature
Year: 2020 PMID: 33193579 PMCID: PMC7658404 DOI: 10.3389/fgene.2020.00971
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
The datasets detected CpG island methylator phenotype (CIMP) status in this study.
| GSE39582 ( | GSE39084 ( | GSE25070 ( | E-TABM-328 ( | |
| I | 37 | – | – | – |
| II | 247 | 20 | – | – |
| III | 167 | 14 | – | – |
| IV | 59 | – | – | – |
| CIMP+ | 93 | 6 | 6 | 11 |
| CIMP− | 417 | 13 | 16 | 36 |
| Right | 210 | 19 | 13 | 22 |
| Left | 300 | – | 9 | 25 |
| Yes | 296 | – | – | – |
| No | 201 | – | – | – |
| NA | 16 | – | – | – |
| Affymetrix Human Genome U133 Plus 2.0 Array | Affymetrix Human Genome U133 Plus 2.0 Array | Illumina Human Ref-8v3.0 expression beadchip | Whole Human Genome Microarray 4x44K | |
The datasets detected both gene expression and DNA methylation profiles in this study.
| GSE25070 ( | GSE25062 ( | GSE79793 ( | GSE79740 ( | |
| Data type | Expression profiling | Methylation profiling | Expression profiling | Methylation profiling |
| CIMP+ | 6 | 6 | – | – |
| CIMP− | 16 | 16 | ||
| Illumina Human Ref-8 v3. 0 expression beadchip | Illumina Human Methylation27 BeadChip | Illumina Human HT-12 WG-DASL V4. 0 R2 expression beadchip | Illumina Human Methylation450 BeadChip | |
FIGURE 1Flowchart of this study. CIMP, CpG island methylator phenotype; RCC, right-sided colon cancer; FD, frequency difference; F-score, harmonic mean value; 19-GPS, 19 gene pairs signatures.
FIGURE 2The F-score and number of the gene pairs within different range of FD values (A) and composition of 19-GPS (B). The x-axis represents the range of FD value and the relative expression orderings (REOs) (gene1 > gene2) of 19-GPS, respectively.
The performance of 19-GPS for right-sided colon cancer (RCC) samples in the training and validation datasets.
| pre-CIMP+ (CIMP+:CIMP−) | pre-CIMP− (CIMP+:CIMP−) | Sensitivity | Specificity | ||
| GSE39582 | 70 (58:12) | 111 (6:105) | 0.91 | 0.90 | 0.95 |
| GSE39084 | 11 (6:5) | 8 (0:8) | 1 | 0.62 | 0.76 |
| GSE25070 | 6 (5:1) | 7 (1:6) | 0.83 | 0.86 | 0.85 |
| E-TABM-328 | 13 (10:3) | 9 (8:1) | 0.91 | 0.73 | 0.81 |
| Total RCC | 100 (79:21) | 135 (15:120) | 0.84 | 0.85 | 0.85 |
FIGURE 3The ROC curves for 19-GPS in four independent datasets. (A) The right-sided colon cancer (RCC) of the training dataset, (B) The RCC of GSE39084, (C) The RCC of GSE25070, and (D) The RCC of E-TABM-328.
FIGURE 4The Kaplan–Meier curves for the prediction of 19-GPS and original CIMP status in training dataset. (A,B) Stage III RCC of the CIMP+ and CIMP− patients treated with surgery alone. The red and blue lines represent the CIMP+ and CIMP− patients treated with surgery alone, respectively. (C,D) All of stage III RCC of the CIMP− patients. The red and blue lines represent the CIMP− patients receiving 5-FU-based ACT and treated with surgery alone, respectively.
FIGURE 5The complete linkage hierarchical clustering analysis of the RCC samples in four independent datasets. (A) GSE39582, (B) GSE39084, (C) GSE25070, and (D) E-TABM-328 based on the differentially expressed genes between the signature-confirmed CIMP+ and CIMP− samples. predict-CIMP/origin-CIMP, predict-CIMP represented the predicted CIMP status by 19-GPS and origin-CIMP represented the original CIMP status.
FIGURE 6Volcano plots of the differentially methylated CpG sites between CIMP+ and CIMP− samples. (A) The samples with predicted CIMP status by 19-GPS in GSE25062. (B) The samples with original CIMP status in GSE25062. (C) The samples with predicted CIMP status by 19-GPS in GSE79740. The log2 (fold change) beta value difference in DNA methylation between the samples with CIMP+ and CIMP− status is plotted on the x-axis, and the P value (−1 × log10 P value) for limma test of differences between the two subtypes is plotted on the y-axis. The CpG sites which are significantly different and log2 (fold change) > 0 between the two subtypes are shown in red, and the CpG sites which are significantly different and log2 (fold change) < 0 are shown in blue.
The predicted CIMP status of samples with different percentage of tumor cells.
| Sample ID | Percentage of tumor cells (%) | Predicted CIMP status |
| HCF1 | 40 | Negative |
| HCF2 | 100 | Negative |
| HCF3 | 100 | Negative |
| LGL1 | 50 | Negative |
| LGL2 | 90 | Positive |
| LGL3 | 90 | Positive |
| SDL1 | 100 | Negative |
| SDL2 | 100 | Negative |
| WCY1 | 60 | Negative |
| WCY2 | 100 | Negative |
| WCY3 | 100 | Negative |
| ZCH1 | 70 | Negative |
| ZCH3 | 40 | Negative |