| Literature DB >> 30954552 |
Lochlan Fennell1, Troy Dumenil2, Leesa Wockner3, Gunter Hartel3, Katia Nones4, Catherine Bond2, Jennifer Borowsky2, Cheng Liu2, Diane McKeone2, Lisa Bowdler2, Grant Montgomery2, Kerenaftali Klein3, Isabell Hoffmann5, Ann-Marie Patch4, Stephen Kazakoff4, John Pearson4, Nicola Waddell4, Pratyaksha Wirapati6, Paul Lochhead7, Yu Imamura8, Shuji Ogino9, Renfu Shao10, Sabine Tejpar11, Barbara Leggett12, Vicki Whitehall13.
Abstract
BACKGROUND & AIMS: Colorectal cancer is an epigenetically heterogeneous disease, however, the extent and spectrum of the CpG island methylator phenotype (CIMP) is not clear.Entities:
Keywords: BRAF; CIMP; Colorectal Cancer; DNA Methylation; Epigenetics; KRAS
Mesh:
Substances:
Year: 2019 PMID: 30954552 PMCID: PMC6699251 DOI: 10.1016/j.jcmgh.2019.04.002
Source DB: PubMed Journal: Cell Mol Gastroenterol Hepatol ISSN: 2352-345X
Clinicopathologic Details of the 216 Colorectal Adenocarcinomas as Stratified for Methylation-Based CIMP Clustering, Measured on Illumina HM450 Arrays, Using the 5000 Most Variable CpG Sites That Were Not Hypermethylated in Normal Mucosal Tissue
| n | CIMP-H1 | CIMP-H2 | CIMP-L1 | CIMP-L2 | CIMP-Neg | ||
|---|---|---|---|---|---|---|---|
| Total, n | 216 | 23 | 22 | 52 | 66 | 53 | |
| Mean age, | 67.9 | 75.2 | 73.4 | 70.1 | 66.8 | 61.9 | <.0001 |
| Sex | |||||||
| Male | 100 (46.4%) | 5 (21.7%) | 9 (40.9%) | 24 (46.2%) | 35 (53.0%) | 27 (50.9%) | .11 |
| Female | 116 (53.7%) | 18 (78.3%) | 13 (59.1%) | 28 (53.8%) | 31 (47.0%) | 26 (49.1%) | |
| Site | |||||||
| Proximal | 75/213 (35.2%) | 19 (82.6%) | 13 (59.1%) | 20 (39.2%) | 15 (23.4%) | 8 (15.1%) | <.0001 |
| Distal | 96/213 (45.1%) | 4 (17.4%) | 6 (27.3%) | 21 (41.2%) | 32 (50.0%) | 33 (62.3%) | |
| Rectal | 42/213 (19.7%) | 0 | 3 (13.6%) | 10 (19.6%) | 17 (26.6%) | 12 (22.6%) | |
| CIMP status | |||||||
| CIMP-high | 24 (11.1%) | 16 (69.6%) | 3 (13.6%) | 3 (5.8%) | 2 (3.0%) | 0 | <.0001 |
| CIMP-low | 44 (20.4%) | 6 (26.1%) | 13 (59.1%) | 16 (30.8%) | 8 (12.1%) | 1 (1.9%) | |
| CIMP-neg | 148 (68.5%) | 1 (4.3%) | 6 (27.3%) | 33 (63.5%) | 56 (84.8%) | 52 (98.1%) | |
| Mutation | |||||||
| | 75 (34.7%) | 4 (17.4%) | 12 (54.5%) | 34 (65.4%) | 19 (28.8%) | 7 (13.2%) | <.0001 |
| | 29 (13.4%) | 17 (73.9%) | 2 (9.1%) | 6 (11.5%) | 4 (6.0%) | 0 (0%) | <.0001 |
| | 77/185 (41.6%) | 12/21 (57.1%) | 6/21 (28.6%) | 18/45 (40.0%) | 22/54 (40.7%) | 19/44 (43.2%) | .45 |
| Microsatellite instability | |||||||
| MSI | 26 (12.0%) | 11 (47.8%) | 1 (4.8%) | 8 (15.4%) | 6 (9.1%) | 0 | <.0001 |
| MSS | 190 (88.0%) | 12 (52.2%) | 21 (95.2%) | 44 (84.6%) | 60 (90.9%) | 0 | |
| CMS | |||||||
| CMS1 | 35 (16.2%) | 16 (69.6%) | 4 (18.2%) | 5 (9.6%) | 9 (13.6%) | 1 (1.9%) | <.0001 |
| CMS2 | 68 (31.5%) | 0 | 4 (18.2%) | 10 (19.2%) | 30 (45.5%) | 24 (45.3%) | |
| CMS3 | 53 (24.5%) | 3 (13.0%) | 12 (54.5%) | 21 (40.4%) | 10 (15.2%) | 7 (13.2%) | |
| CMS4 | 60 (27.8%) | 4 (17.4%) | 2 (9.1%) | 16 (30.8%) | 17 (25.8%) | 21 (39.6%) | |
| Stage | |||||||
| I | 30/111 | 0/15 | 5/11 (45.5%) | 8/30 (26.7%) | 13/35 (37.1%) | 4/20 (20.0%) | .15 |
| II | 33/111 | 7/15 (46.7%) | 1/11 (9.1%) | 10/30 (33.3%) | 10/35 (28.6%) | 5/20 (25.0%) | |
| III | 34/111 | 6/15 (40.0%) | 4/11 (36.4%) | 7/30 (23.3%) | 11/35 (31.4%) | 6/20 (30.0%) | |
| IV | 14/111 | 2/15 (13.3%) | 1/11 (9.1%) | 5/30 (16.7%) | 1/35 (2.9%) | 5/20 (25.0%) | |
| LINE1 | 70.3 | 68.75 | 68.96 | 72.05 | 70.45 | 69.67 | .38 |
NOTE. P values reported were obtained using analysis of variance for continuous variables and chi-squared analysis for categoric variables.
MSS, microsatellite stable.
Figure 1Methylation heatmap of unselected 216 colorectal cancers using the 5000 most variable β values in CpG sites that were not hypermethylated in normal mucosal tissue. Clustering was performed using the RPMM R package. Clustering showed 5 distinct clusters, termed CIMP-H1, CIMP-H2, CIMP-L1, CIMP-L2, and CIMP-Neg. This was faithfully recapitulated in TCGA.
Clinicopathologic and Molecular Details of 374 Colorectal Adenocarcinomas From TCGA Stratified for CIMP Subtype
| n | CIMP-H1 | CIMP-H2 | CIMP-L1 | CIMP-L2 | CIMP-neg | ||
|---|---|---|---|---|---|---|---|
| Total, n | 374 | 19 (5.1%) | 39(10.4%) | 81 (21.7%) | 133 (35.6%) | 102 (27.3%) | |
| Mean age, | 64.5 | 72.2 | 67.8 | 66.5 | 64.5 | 57.1 | <.0001 |
| Sex | |||||||
| Male | 199 | 7 (36.8%) | 21 (53.8%) | 47 (58.0%) | 74 (55.6%) | 50 (49.5%) | NS |
| Female | 174 | 12 (63.2%) | 18 (46.2%) | 34 (42.0%) | 59 (44.4%) | 51 (50.5%) | |
| Site | |||||||
| Proximal | 167 | 17 (100%) | 28 (84.8%) | 53 (67.9%) | 53 (40.8%) | 16 (16.5%) | <.0001 |
| Distal | 143 | f0 | 4 (12.1%) | 18 (23.1%) | 57 (43.8%) | 64 (65.9%) | |
| Rectal | 45 | 0 | 1 (3.0%) | 7 (9.0%) | 20 (15.4%) | 17 (17.5%) | |
| Mutation | |||||||
| BRAF | 35 | 10 (52.6%) | 19 (48.7%) | 5 (6.2%) | 1 (0.8%) | 0 | <.0001 |
| KRAS | 150 | 5 (26.3%) | 17 (43.6%) | 48 (59.3%) | 59 (44.4%) | 21 (20.6%) | <.0001 |
| TP53 | 234 | 10 (52.6%) | 19 (48.7%) | 44 (54.3%) | 85 (63.9%) | 76 (74.5%) | .01 |
| Microsatellite instability | |||||||
| MSI | 51 | 10 (52.6%) | 17 (50%) | 11 (16.7%) | 7 (6.2%) | 6 (6.7%) | <.0001 |
| MSS | 269 | 9 (47.4%) | 17 (50%) | 55 (83.3%) | 105 (93.8%) | 83 (93.3%) | |
| CMS | |||||||
| CMS1 | 42 | 10 (58.8%) | 20 (69%) | 9 (14.3%) | 3 (2.8%) | 0 (0%) | <.0001 |
| CMS2 | 121 | 2 (11.8%) | 1 (3.4%) | 25 (39.7%) | 48 (45.3%) | 45 (51.1%) | |
| CMS3 | 45 | 4 (23.5%) | 4 (13.8%) | 16 (25.4%) | 14 (13.2%) | 7 (8%) | |
| CMS4 | 95 | 1 (5.9%) | 4 (13.8%) | 13 (20.6%) | 41 (38.7%) | 36 (40.9%) | |
| Stage | |||||||
| I | 54 | 3 (15%) | 9 (23.7%) | 16 (20.8%) | 11 (8.7%) | 15 (16%) | <.01 |
| II | 133 | 9 (45%) | 18 (47.4%) | 32 (41.6%) | 50 (39.4%) | 24 (25.5%) | |
| III | 119 | 5 (25%) | 11 (28.9%) | 20 (26%) | 46 (36.2%) | 37 (39.4%) | |
| IV | 50 | 3 (15%) | 0 (0%) | 9 (11.7%) | 20 (15.7%) | 18 (19.1%) |
NOTE. P values reported were obtained using analysis of variance for continuous variables and chi-squared for categoric variables and represent the P value for an association between all subtypes and the feature in question.
MSS, microsatellite stable.
Distribution of Differentially Hypermethylated Probes in Reference to CpG Islands Vs Normal Mucosal Tissue
| CpG location | CIMP-H1 | CIMP-H2 | CIMP-L1 | CIMP-L2 | CIMP-neg | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| + | - | + | - | + | - | + | - | + | - | |
| Island | 21,011 | 204 | 19,651 | 426 | 11,297 | 118 | 5685 | 127 | 754 | 162 |
| South Shore | 3196 | 586 | 3003 | 1359 | 1253 | 426 | 513 | 284 | 78 | 242 |
| North Shore | 4745 | 890 | 4641 | 1885 | 2095 | 617 | 911 | 420 | 184 | 346 |
| South Shelf | 229 | 743 | 181 | 1620 | 83 | 574 | 49 | 331 | 19 | 238 |
| North Shelf | 280 | 738 | 259 | 1660 | 92 | 591 | 58 | 342 | 35 | 246 |
| Sea | 2056 | 8396 | 1721 | 15,575 | 647 | 6812 | 297 | 4189 | 104 | 3428 |
| Total | 31,517 | 11,557 | 29,453 | 22,525 | 15,467 | 9138 | 7513 | 5693 | 1174 | 4662 |
NOTE. Cancers were stratified for CIMP clustering. Differential methylation was deemed as an absolute β value change of more than 0.2 and an FDR corrected P value less than .01 compared with 32 normal colorectal mucosal samples.
+, differential hypermethylation; -, differential hypomethylation.
Figure 2Differentially regulated hallmark gene sets between CIMP-H1 and CIMP-H2 cancers as assessed by single-sample gene set enrichment analysis. IL, interleukin; ssGSEA, single sample gene set enrichment analysis.
Figure 3(A) Number of differentially methylated promoters in each CIMP cluster vs the cohort of normal mucosal samples. (B) The proportion of methylation events within each cluster that resulted in gene repression at the transcript level.
Tumor-Suppressor Genes That Were Recurrently Methylated and Repressed in More Than 3 CIMP Subtypes
| Gene name | Description |
|---|---|
| Protocadherin 9 (source: HGNC symbol; Acc: HGNC: 8661) | |
| Cysteine dioxygenase type 1 (source: HGNC symbol; Acc: HGNC: 1795) | |
| Mal, T-cell differentiation protein (source: HGNC symbol; Acc: HGNC: 6817) | |
| Erythrocyte membrane protein band 4.1-like 3 (source: HGNC symbol; Acc: HGNC: 3380) | |
| A-kinase anchoring protein 12 (source: HGNC symbol; Acc: HGNC: 370) | |
| NDRG family member 4 (source: HGNC symbol; Acc: HGNC: 14466) | |
| LIF-receptor α (source: HGNC symbol; Acc: HGNC: 6597) | |
| Signal peptide, CUB domain, and EGF-like domain containing 2 (source: HGNC symbol; Acc: HGNC: 30425) | |
| Transmembrane protein with EGF-like and 2 follistatin-like domains 2 (source: HGNC symbol; Acc: HGNC: 11867) | |
| Dual-specificity phosphatase 26 (source: HGNC symbol; Acc: HGNC: 28161) | |
| Chromosome 2 open reading frame 40 (source: HGNC symbol; Acc: HGNC: 24642) | |
| Secreted frizzled-related protein 1 (source: HGNC symbol; Acc: HGNC: 10776) | |
| Ubiquitin C-terminal hydrolase L1 (source: HGNC symbol; Acc: HGNC: 12513) | |
| IKAROS family zinc finger 1 (source: HGNC symbol; Acc: HGNC: 13176) | |
| Cell adhesion molecule 2 (source: HGNC symbol; Acc: HGNC: 29849) | |
| C-X-C motif chemokine ligand 12 (source: HGNC symbol; Acc: HGNC: 10672) | |
| Interferon regulatory factor 4 (source: HGNC symbol; Acc: HGNC: 6119) | |
| Zinc finger and BTB domain containing 16 (source: HGNC symbol; Acc: HGNC: 12930) | |
| Checkpoint with forkhead and ring finger domains (source: HGNC symbol; Acc: HGNC: 20455) | |
| Slit guidance ligand 2 (source: HGNC symbol; Acc: HGNC: 11086) | |
| ZFP82 zinc finger protein (source: HGNC symbol; Acc: HGNC: 28682) |
Acc, accession number; BTB, Broad-Complex, Tramtrack and Bric a brac; EGF, epidermal growth factor; HGNC, Human Genome Organisation Gene Nomenclature Committee; LIF, leukocyte inhibitory factor; NDRG, N-Myc downregulated gene.
Figure 4(A) Proportion of SUZ12-occupied regions in hESC1 cells that contained hypermethylated probes in respective CIMP clusters. (B) Proportion of differential hypermethylation events that overlapped with Polycomb Repressive Complex-2 (PRC2)-occupied regions.
Motifs That Were Most Significantly and Exclusively Enriched at Methylated Promoters in CIMP-H1 and CIMP-H2
| CIMP-H1 | CIMP-H2 | ||||||
|---|---|---|---|---|---|---|---|
| Motif name | Motif | Raw | Adjusted | Motif name | Motif | Raw | Adjusted |
| Smad4 | TGTCTRGM | 1.2E-21 | 1.2E-24 | SPDEF_DBD_2 | GTGGTCCCGGATTAT | 7.2E-33 | 7.2E-30 |
| FOXP3_DBD | RTAAACA | 4.1E-20 | 4.1E-23 | UP00142_1 | VNTAATTAATTAABGSG | 2.4E-20 | 2.4E-17 |
| FOXP3 | RTAAACA | 4.1E-20 | 4.1E-23 | FLI1_full_2 | ACCGGAAATCCGGT | 1.1E-19 | 1.1E-16 |
| POU2F2_DBD_2 | HWTRMATATKCAWA | 4.5E-19 | 4.5E-22 | UP00200_1 | GVWAATTAATTAMYBBG | 3.5E-19 | 3.5E-16 |
| Zscan4_primary | DHNATGTGCACAYAHWN | 1.2E-18 | 1.3E-21 | NHLH1_DBD | CGCAGCTGCS | 2.1E-18 | 2.1E-15 |
| HOXC10 | GTCRTAAAAH | 1.3E-18 | 1.3E-21 | ERG_full_2 | ACCGGAWATCCGGT | 4.8E-18 | 4.8E-15 |
| Bbx_secondary | HVWNNGTTAACASHNRV | 3.1E-16 | 3.1E-19 | MA0680.1 | TAATCGATTA | 8.7E-18 | 8.6E-15 |
| Foxc1_DBD_1 | GTAAAYAAACA | 1.3E-15 | 1.3E-18 | PAX7_DBD | TAATYRATTA | 1.4E-16 | 1.4E-13 |
Figure 5Pathways significantly enriched for genes that contained CpG islands that were devoid of methylation in both CIMP-H clusters. VEGF, vascular endothelial growth factor.
Oncogenes With Significantly Higher Methylation Within the Body of the Gene
| CIMP-H1 | CIMP-H2 | ||||
|---|---|---|---|---|---|
| Gene | Expression | Description | Gene | Expression | Description |
| Down-regulated | FEV, ETS transcription factor | Down-regulated | LIM domain only 2 | ||
| Down-regulated | BCL2, apoptosis regulator | Down-regulated | Catenin Δ 2 | ||
| Down-regulated | KIT proto-oncogene receptor tyrosine kinase | Up-regulated | Staphylococcal nuclease and tudor domain containing 1 | ||
| Up-regulated | Paired box 3 | Up-regulated | Catenin α2 | ||
| Up-regulated | Staphylococcal nuclease and tudor domain containing 1 | Up-regulated | T-cell leukemia homeobox 1 | ||
| No difference | LIM domain only 2 | No difference | PI-3,4,5-trisphosphate-dependent Rac exchange factor 2 | ||
| No difference | R-spondin 3 | No difference | R-spondin 3 | ||
| No difference | Catenin delta 2 | No difference | Ret proto-oncogene | ||
| No difference | T-cell leukemia homeobox 3 | No difference | LIM domain only 1 | ||
| No difference | SIX homeobox 1 | No difference | Fms-related tyrosine kinase 3 | ||
| No difference | Homeobox C13 | No difference | Calcium voltage-gated channel subunit α1 D | ||
| No difference | LIM domain only 1 | No difference | WW domain containing transcription regulator 1 | ||
| No difference | Zinc finger protein 521 | No difference | Carbohydrate sulfotransferase 11 | ||
| No difference | Spalt like transcription factor 4 | No difference | Paired box 3 | ||
| No difference | Zinc finger E-box binding homeobox 1 | No difference | Fms-related tyrosine kinase 4 | ||
| No difference | PI-3,4,5-trisphosphate dependent Rac exchange factor 2 | No difference | C-X-C motif chemokine receptor 4 | ||
| No difference | Oligodendrocyte transcription factor 2 | No difference | T-cell leukemia homeobox 3 | ||
| No difference | Smoothened, frizzled class receptor | No difference | TAL bHLH transcription factor 1, erythroid differentiation factor | ||
| No difference | Fms related tyrosine kinase 3 | No difference | SIX homeobox 1 | ||
| No difference | GATA binding protein 2 | No difference | Homeobox C11 | ||
| No difference | T-cell leukemia homeobox 1 | No difference | Oligodendrocyte transcription factor 2 | ||
| No difference | TAL bHLH transcription factor 1, erythroid differentiation factor | No difference | Myogenic differentiation 1 | ||
| No difference | Calcium voltage-gated channel subunit a1 D | No difference | Zinc finger E-box binding homeobox 1 | ||
| No difference | Myogenic differentiation 1 | No difference | Homeobox C13 | ||
| No difference | Catenin a2 | No difference | Zinc finger protein 521 | ||
| No difference | Carbohydrate sulfotransferase 11 | No difference | Smoothened, frizzled class receptor | ||
| No difference | Nuclear receptor subfamily 4 group A member 3 | No difference | GATA binding protein 2 | ||
| No difference | Nuclear receptor subfamily 4 group A member 3 | ||||
BCL, B-cell lymphoma; bHLH, basic helix-loop-helix; ETS, E26 transformation specific; FEV, fifth ewing variant; PI, phosphatidylinositol; SIX, Sineoculis homeobox homolog; TAL, T-cell acute lymphocyctic.
Overlap Between Genes Marked by the PRC2 Complex and H3K27Me3 in hEScells and Genes That Undergo Significant Gene Body Methylation in Colorectal Cancer Development
| Gene set name | CIMP-H1 | CIMP-H2 | CIMP-L1 | CIMP-L2 | ||||
|---|---|---|---|---|---|---|---|---|
| Overlap fraction | FDR | Overlap fraction | FDR | Overlap fraction | FDR | Overlap fraction | FDR | |
| BENPORATH_ES_WITH_H3K27ME3 | 30.59% | 1.34E-280 | 31.04% | 2.50E-300 | 13.06% | 6.11E-122 | 8.50% | 1.60E-78 |
| BENPORATH_EED_TARGETS | 30.70% | 3.91E-267 | 31.07% | 1.12E-284 | 12.81% | 8.75E-112 | 8.66% | 8.47E-77 |
| BENPORATH_SUZ12_TARGETS | 30.92% | 5.05E-264 | 30.73% | 9.67E-273 | 12.91% | 1.29E-110 | 8.48% | 2.02E-72 |
| BENPORATH_PRC2_TARGETS | 37.27% | 1.04E-218 | 38.04% | 8.59E-235 | 16.41% | 4.56E-98 | 11.04% | 2.05E-66 |
NOTE. The overlap fraction represents the gene bodies that are methylated (k) divided by the number of genes marked by each respective mark in hES cells (K) (k/K). The FDR corrected P value was obtained through modeling a hypergeometric distribution (k-1, K, N-K, n; where k is the number of genes methylated in each cluster; K is the number of genes in the gene set; N is the number of genes in the human genome; and n is the number of genes in the query set) using the compute overlaps tool on the Gene Set Enrichment Analysis (GSEA) web portal using the Benporath gene sets, which were obtained though ChIP-on a Chip analysis of human embryonic stem cells.
Figure 6High-impact mutations in epigenetic regulator genes are frequent in cancers with higher genomic methylation. Del, deletion; Ins, insertion.
Figure 7High impact mutations in ARID1A are common in colorectal adenocarcinomas. Del, deletion; Ins, insertion.