| Literature DB >> 26463173 |
Matahi Moarii1,2,3, Fabien Reyal4,5,6, Jean-Philippe Vert7,8,9.
Abstract
BACKGROUND: The CpG island methylator phenotype (CIMP) was first characterized in colorectal cancer but since has been extensively studied in several other tumor types such as breast, bladder, lung, and gastric. CIMP is of clinical importance as it has been reported to be associated with prognosis or response to treatment. However, the identification of a universal molecular basis to define CIMP across tumors has remained elusive.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26463173 PMCID: PMC4603341 DOI: 10.1186/s40246-015-0048-9
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Patients’ dataset. Number of samples available for the different cancer types (first column) for methylation (Meth) and gene expression (GE). The “Meth/GE” column summarizes the number of samples with both methylation and gene expression, while the “Meth/Mutations” column shows the number of samples with both methylation and DNA mutation data
| Tissue | Meth | GE | Meth/GE | Meth/Mutations |
|---|---|---|---|---|
| Bladder | 373 | 56 |
|
|
| Breast | 626 | 778 |
|
|
| Colon | 291 | 193 |
|
|
| Lung | 452 | 125 |
|
|
| Stomach | 338 | 373 |
|
|
| Overall |
|
|
|
|
CIMP proportion. For each cancer type, this table shows the number of samples clustered in the CIMP-negative and CIMP-positive clusters and the percentage of CIMP-positive samples
| Tissue | Negative | Positive | Ratio (%) |
|---|---|---|---|
| Bladder | 262 | 111 | 30 |
| Breast | 509 | 117 | 19 |
| Colon | 232 | 59 | 20 |
| Lung | 136 | 316 | 70 |
| Stomach | 144 | 194 | 57 |
| Overall |
|
|
|
Fig. 1Pan-cancer clustering on common epigenetic signature clusters CIMP-positive and CIMP-negative tumors independently of tissue type. a Venn diagram of the CIMP signatures for each tissue. b Hierarchical clustering on the common epigenetic signature for all tissues. c Gene ontology analysis of the genes associated with the promoters of the common epigenetic signature
List of genes associated with the common set of CGIs that define CIMP in each tissue
| LOC339524, GSTM1, CD1D, LMX1A | |
| CACNA1E,NR5A2, WNT3A, GNG4 | |
| EMX1, CTNNA2,LRRTM1, DLX1 | |
| EVX2, HOXD13, GBX2, SYN2 | |
| HAND2, NBLA00301, EBF1, HIST1H2BB | |
| Epigenetic | HIST1H3C, HLA-DRB1, C6orf186, IKZF1 |
| Signature | CDKN2A, HMX3, KNDC1, KLHL35 |
| HOTAIR, SLC6A15, ALX1, RFX4 | |
| CLDN10, ADCY4, RIPK3, NID2 | |
| OTX2, OTX2OS1, GSC, KIF26A | |
| GREM1, SEC14L5, HS3ST3B1, IGF2BP1 | |
| HOOK2, NFIX, ZNF577, ZNF649 | |
| CPXM1, CDH22, CHRNA4 |
Fig. 2Variation in the empirical cumulative distributive function Δ(K) for each tissue. The empirical cumulative distributive function Δ(K) is a data-driven criterion which can indicate the number of clusters K in the data when it reaches its maximum [25]. This plots shows how Δ(K) varies as a function of K, for the different tissues
CIMP Proportion in samples with both methylation and gene expression data. This table shows the number of CIMP-positive and CIMP-negative samples characterized by both methylation and gene expression data, for each cancer type, as well as the proportion of CIMP-positive samples
| Tissue | Negative | Positive | Ratio (%) |
|---|---|---|---|
| Bladder | 27 | 16 | 37 |
| Breast | 385 | 93 | 20 |
| Colon | 27 | 7 | 20 |
| Lung | 22 | 60 | 75 |
| Stomach | 131 | 178 | 58 |
| Overall |
|
|
|
Accuracy of CIMP prediction using gene expression profiles
| Accuracy | ||||
|---|---|---|---|---|
| Random | Lasso | Combined lasso | Group lasso | |
| Bladder | 62.8 | 62.9 ( | 74.2 ( | 72.1 ( |
| Breast | 80.5 | 83.9 ( | 84.7 ( | 85.5 ( |
| Colon | 79.4 | 79.5 ( | 95.0 ( | 94.2 ( |
| Lung | 73.2 | 84.2 ( | 76.2 ( | 86.6 ( |
| Stomach | 57.6 | 81.2 ( | 83.0 ( | 84.8 ( |
| Overall |
|
|
|
|
This table shows the accuracy, assessed by threefold cross-validation repeated 100 times over each tissue (first column), of sample classification in CIMP-positive and CIMP-negative classes from gene expression data using random classification (second column), lasso logistic regression (third column), combined lasso (fourth column), or group lasso logistic regression (fifth column)
Fig. 3Gene expression variations predictive of CIMP. a Stability of each gene signature for each tissue-specific CIMP prediction as well as the “Combined-Lasso” and the “Group-Lasso” CIMP prediction task obtained and ranked by frequency of appearance using bootstrap (n=100 repeats). For bladder and colon CIMP prediction task, the signature was non-robust (frequency of the most redundant gene inferior to 10 %). The combined prediction task signature outperforms the tissue-specific signatures in robustness. b Venn diagram of the tissue-specific gene signatures using lasso for each tissue separately. c Distribution of the accuracy of the CIMP-phenotype prediction task given the patient gene expression profile using n=100 bootstrap and threefold cross-validation for several methods (pink = “tissue-specific” lasso, green = “Combined-Lasso,” blue = “Group-Lasso,” red star = random prediction). d Venn diagram representing the intersection between the “Combined” and “Group” lasso gene signatures
Intersection of the genetic signatures for “Combined-Lasso” and “Group-Lasso” predictive of CIMP ranked by decreasing level of robustness
|
| |
|
| |
| Over-expressed |
|
|
| |
|
| |
|
| |
|
| |
| Under-expressed |
|
|
|
Fig. 4Mutation analysis. a Association between specific mutations (IDH1, IDH2, BRAF, and KRAS) with the CIMP phenotype for all tissues (yellow = CIMP positive, blue = CIMP negative). b Significantly higher mutation rate for CIMP-positive (yellow) compared to CIMP-negative (blue) tumors is observed for colon and gastric cancers only and is concordant with CIMP association with microsatellite instability for these tissues
Clinical impact of CIMP. Overall survival proportion given the CIMP phenotype and the p value associated with the survival analysis (logrank test)
| Tissue | Event |
| |
|---|---|---|---|
| CIMP − | CIMP + | ||
| BLCA | 47/214 | 21/96 | 0.74 |
| BRCA | 29/495 | 9/114 | 0.20 |
| COAD | 28/218 | 6/54 | 0.57 |
| LUAD | 24/127 | 67/295 | 0.49 |
| STAD | 26/141 | 20/193 | 0.29 |