| Literature DB >> 25473433 |
Tania Witte1, Christoph Plass1, Clarissa Gerhauser1.
Abstract
The comparison of DNA methylation patterns across cancer types (pan-cancer methylome analyses) has revealed distinct subgroups of tumors that share similar methylation patterns. Integration of these data with the wealth of information derived from cancer genome profiling studies performed by large international consortia has provided novel insights into the cellular aberrations that contribute to cancer development. There is evidence that genetic mutations in epigenetic regulators (such as DNMT3, IDH1/2 or H3.3) mediate or contribute to these patterns, although a unifying molecular mechanism underlying the global alterations of DNA methylation has largely been elusive. Knowledge gained from pan-cancer methylome analyses will aid the development of diagnostic and prognostic biomarkers, improve patient stratification and the discovery of novel druggable targets for therapy, and will generate hypotheses for innovative clinical trial designs based on methylation subgroups rather than on cancer subtypes. In this review, we discuss recent advances in the global profiling of tumor genomes for aberrant DNA methylation and the integration of these data with cancer genome profiling data, highlight potential mechanisms leading to different methylation subgroups, and show how this information can be used in basic research and for translational applications. A remaining challenge is to experimentally prove the functional link between observed pan-cancer methylation patterns, the associated genetic aberrations, and their relevance for the development of cancer.Entities:
Year: 2014 PMID: 25473433 PMCID: PMC4254427 DOI: 10.1186/s13073-014-0066-6
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Figure 1DNA methylation patterns in normal and cancer cells. (A) In normal cells, most CpGs located outside of promoters in gene bodies and intergenic regions are methylated (red circles), whereas promoter-associated CpG islands are protected from DNA methylation (white circles). (B) In cancer cells, a global or localized loss of 5-methylcytosine occurs at gene bodies and intergenic regions, whereas CpG-rich regions like promoters are usually heavily methylated, which might lead to transcriptional repression. Regions of intermediate CpG levels such as shores are associated with tissue-specific methylation. Global loss (left plot) and focal gain (right plot) of DNA methylation are depicted as tracks of the University of California Santa Cruz genome browser [118] using whole-genome bisulfite sequencing data for normal and cancer cell lines. Tracks for CpG islands and selected histone modifications, including H3K4me3, which is associated with transcriptionally active promoters, and H3K4me1 and H3K27ac as markers for enhancers, are illustrated below the gene track. Each color of the histone tracks represents an individual ENCODE cell line. The deleted in colon cancer gene (DCC) was taken as an exemplary locus for which long-range hypomethylation regions (horizontal blue bars) are observed in the breast cancer cell line HCC1954 and in the liver carcinoma cell line HepG2, but not in normal mammary epithelial cells (HMEC) or the myofibroblast cell line IMR90. The glutathione S-transferase P1 gene (GTSP1) represents an example of promoter hypermethylation (highlighted in red) in cancer cell lines compared to normal cells. TSS, transcription start site.
Pan-cancer patterns of DNA methylation
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| AML | High |
| A | Associated with patients presenting with an intermediate-risk karyotype | [ |
| Co-occurrence of | |||||
| Bladder urothelial | High |
| Smoking-pack years as predictor of CIMP phenotype | [ | |
| Low | ↑ | B | |||
| Breast | B-CIMP | ↓ mutation rate | Luminal ER/PR-positive tumors | [ | |
| Low metastatic risk and better clinical outcome | |||||
| Enriched for genes targeted by the PRC2 (e.g. | |||||
| B-CIMP-negative | ↑ | B | Basal-like tumors (ER/PR-negative) | ||
| High metastatic risk and poor clinical outcome | |||||
| Cholangiocarcinoma | High |
| A | Longer survival | [ |
| Chondrosarcoma | High |
| A | [ | |
| Colorectal | CIMP-H |
| C | MSI | [ |
| Right/ascending colonic region | |||||
| ↑ mutation rate | |||||
| ↑ | |||||
| Good prognosis | |||||
| ↑ | |||||
| CIMP-L |
| CIN (non-MSI) | |||
| Poor prognosis | |||||
| Two non-CIMP | ↑ | B | Anatomic origins distinct from CIMP groups | ||
| ↑ SCNAs | |||||
| Endometrial | High |
| C | MSI | [ |
|
| |||||
| ↑ mutation rate | |||||
| Low | ↑ | B | Serous-like tumors | ||
| ↑ SCNAs | Poor prognosis | ||||
| Two non-methylated | ↑ | Endometrioid tumors | |||
| ↑ SCNAs |
| ||||
| Gastric | EBV-CIMP | ↑ | EBV-positive tumors | [ | |
|
| |||||
| Amplifications of | |||||
| Gastric CIMP |
| C | MSI | ||
| ↑ mutation rate | |||||
| Cluster 3 – low |
| Enriched for the diffuse histological variant | |||
| Genomically stable | Also fusions involving RHO-family GTPase-activating proteins | ||||
| Cluster 4 – low | ↑ | B | CIN | ||
| Focal amplifications of receptor tyrosine kinases | |||||
| Glioblastoma | G-CIMP |
| A | Secondary tumors with proneural expression | [ |
|
| |||||
|
| |||||
| Younger age at diagnosis | |||||
| Better survival rates | |||||
| G-CIMP negative proneural | No | Relative hypomethylation | |||
|
| Proneural subtype cases without | ||||
| Pediatric glioblastoma | Global loss of methylation at non-promoter regions |
|
| [ | |
| Renal cell carcinoma | Global loss of methylation |
|
| [ | |
| Loss of methylation at non-promoter regions | |||||
| One of the tumor types with the lowest frequency of DNA methylation events | |||||
| Lung ADCA | CIMP-high |
| Associated either with ↑ ploidy, ↑ mutation and the PI subtype or with ↓ ploidy, ↓ mutation rate and the TRU subtype | [ | |
|
| |||||
| Mutations in chromatin modifiers such as | |||||
| Lung SQCC | High |
| Classical expression subtype | [ | |
|
| |||||
| Chromosomal instability | |||||
| ↑ SCNAs | |||||
| Low | Primitive expression subtype | ||||
| Serous ovarian | High | Germline and somatic | More differentiated tumors | [ | |
| Better survival | |||||
| Low | ↑ | B |
| ||
| ↑ SCNAs | |||||
|
| |||||
| Poor clinical outcome |
*Methylation patterns A, B and C indicate common genetic and epigenetic aberrations across different tumors. †These molecular aberrations were not necessarily associated with a specific methylation subgroup. ADCA, adenocarcinoma; AML, acute myeloid leukemia; CIMP, CpG island methylator phenotype; CIN, chromosomal instability; EBV, Epstein-Barr virus; ER, estrogen receptor; MSI, microsatellite instability; PI, proximal inflammatory; PR, progesterone receptor; PRC, polycomb repressor complex; SCNAs, somatic copy-number alterations; SQCC, squamous cell carcinoma; TCGA, The Cancer Genome Atlas; TRU, terminal respiratory unit.
Figure 2Pan-cancer methylome representation for ten cancer cohorts from The Cancer Genome Atlas. The Cancer Genome Atlas PANCAN12 DNA methylation data, representing 24,980 CpG sites acquired from the 27 k Illumina platform and corresponding to 2,224 tumor samples, were downloaded from the University of California Santa Cruz Cancer Genomics Browser [119]. CpG sites located on chromosome X and Y were removed, as well as the ones associated with single-nucleotide polymorphisms (n = 2,750). DNA methylation data for ten tumor entities - OV (n = 600), UCEC (n = 117), BRCA (n = 315), LUAD (n = 126), LUSC (n = 133), READ (n = 67), COAD (n = 166), GBM (n = 287), KIRC (n = 219) and AML (n = 194) - are included in the PANCAN12 dataset. For each of the tumor entities, color-coded on the top of the graph, the 500 most variable CpGs of the remaining 21,844 data points were selected. From the overlap, Qlucore Omics Explorer 3.0 software was used to select the 1,430 most variable CpGs, which were then hierarchically clustered as a heat map. Beta values are offset by −0.5 to shift the whole dataset to values between −0.5 (in dark blue) and 0.5 (in yellow) for improved graphical display [119]. DNA methylation patterns show relatively high homogeneity within tumor entities. We do not observe a common CpG island methylator phenotype-like group across several tumor types, suggesting that the ‘tissue of origin’ methylation signature is a strong decisive factor for the pattern. Colorectal cancer shows the highest overall methylation, whereas kidney cancer is characterized by low variance of methylation. The methylation patterns of ovarian, endometrial and breast cancer display a similar distribution of high and low methylation. CpG sites fall into high and intermediate DNA methylation clusters, covering all tumors entities, and a low methylation cluster with genes methylated in glioblastoma multiforme (GBM) or colorectal tumors and unmethylated in ovarian cancer. Unexpectedly, the high methylation cluster shows enrichment for membrane-associated genes including claudins (CLDN) and cadherins (CDH), while polycomb repressor complex PRC2 target genes are highly enriched in the intermediate and low methylation clusters. Some of these genes, as well as a selection of differentially methylated genes mentioned in the text such as MLH1, APC, BRCA1/2 and VHL, are indicated on the right side of the graph. For abbreviations of the tumor entities see Table 1.
International Cancer Genome Consortium projects with methylomes generated by Infinium BeadChips
|
|
|
|
|---|---|---|
| Breast | BRCA-US | 971 |
| Ovary | OV-US | 572 |
| Kidney | KIRC-US | 491 |
| Head and neck | THCA-US | 488 |
| Uterus | UCEC-US | 481 |
| Lung | LUAD-US | 460 |
| Colorectal | COAD-US | 414 |
| Lung | LUSC-US | 410 |
| Head and neck | HNSC-US | 407 |
| Brain | GBM-US | 393 |
| Skin | SKCM-US | 338 |
| Stomach | STAD-US | 328 |
| Brain | LGG-US | 293 |
| Bladder | BLCA-US | 198 |
| Prostate | PRAD-US | 196 |
| Blood | LAML-US | 194 |
| Pancreas | PACA-AU | 167 |
| Blood | CLLE-ES | 159 |
| Colorectal | READ-US | 150 |
| Liver | LIHC-US | 149 |
| Kidney | KIRP-US | 142 |
| Cervix | CESC-US | 127 |
| Brain | PBCA-DE | 115 |
| Ovary | OV-AU | 93 |
| Pancreas | PAAD-US | 72 |
| Pancreas | PAEN-AU | 23 |
Modified from the International Cancer Genome Consortium data portal [104]. AU, Australia; DE, Germany; ES, Spain; US, United States.
DNA methylation biomarkers and their potential clinical applications
|
|
|
|
|---|---|---|
|
| ||
|
| Breast | Whole blood DNA [ |
| 140 variable CpGs | Cervical | Normal uterine cervix cells [ |
|
| ||
|
| Prostate | Serum, urine, ejaculate [ |
|
| Prostate | Urine [ |
|
| Prostate | Urine [ |
|
| Prostate | Paraffin-embedded tissues [ |
|
| Colorectal | Blood plasma [ |
|
| Colorectal | Blood plasma [ |
|
| NSCLC | Bronchial fluid aspirates/ blood plasma [ |
|
| NSCLC | Sputum [ |
|
| Breast | Fine needle aspiration biopsy [ |
|
| Bladder | Urine [ |
|
| ||
| 20-gene signature | ALL | Leukemic cells from bone marrow and peripheral blood [ |
| 15-gene classifier | AML | |
|
| Breast | Serum [ |
|
| CLL | CD19 sorted mononuclear cells [ |
|
| CCR | Blood plasma [ |
|
| Head and neck | Tumor samples [ |
|
| NSCLC | Tumor samples [ |
|
| NSCLC | Primary tumors and lymph nodes [ |
|
| NSCLC | Tumor samples [ |
|
| OPSCC | Tumor samples [ |
|
| Prostate | Tumor samples [ |
|
| ||
|
| Breast | Tumor samples [ |
|
| Breast | Tumor samples [ |
|
| Breast | Tumor samples [ |
|
| Colon | Tumor samples [ |
|
| Glioma | Tumor samples [ |
|
| Melanoma | Tumor samples/cell lines [ |
|
| NSCLC | Tumor samples/cell lines [ |
|
| Ovary | Tumor samples [ |
ALL, acute lymphoblastic leukemia; AML, acute myeloid leukemia; CCR, colorectal cancer; CLL, chronic lymphocytic leukemia; DNAm, DNA methylation; NSCLC, non-small-cell lung cancer; OPSCC, oropharyngeal squamous cell carcinoma.