| Literature DB >> 28426276 |
Lucas A Salas1, Kevin C Johnson1,2, Devin C Koestler3, Dylan E O'Sullivan4, Brock C Christensen1,2,5.
Abstract
Genetic and epigenetic alterations are required for carcinogenesis and the mutation burden across tumor types has been investigated. Here, we investigate epigenetic alterations with a novel measure of global DNA methylation dysregulation, the methylation dysregulation index (MDI), across 14 cancer types in The Cancer Genome Atlas (TCGA) database. DNA methylation data-obtained using Illumina HumanMethylation450 BeadChip-was accessed from TCGA. We calculated the MDI in 14 tumor types (n = 5,592 tumors), using adjacent normal tissues (n = 701) from each tumor site. Copy number alteration, and mutation burden were retrieved from cBioportal (n = 5,152). We tested the relation of subject MDI across tumors and with age, gender, tumor stage, estimated tumor purity, and copy number alterations for both overall MDI and genomic-context-specific MDI. We also investigated the top most dysregulated loci shared across tumor types. There was a broad range of extent in methylation dysregulation across tumor types (P < 2.2E-16). However, a consistent pattern of methylation dysregulation stratified by genomic context was observed across tumor types where the highest dysregulation occurred at non-CpG island regions. Considering other summary measures of somatic alteration, MDI was correlated with copy number alterations but not with mutation burden. Using the top dysregulated CpG sites in common across tumors, 4 classes of cancer types were observed, and the functional consequences of these alterations to gene expression were confirmed. This work identified the global DNA methylation dysregulation patterns across 14 cancer types showing a higher impact for the non-CpG island areas. The most dysregulated loci across cancer types identified common clusters across cancer types that may have implications for future treatment and prevention measures.Entities:
Keywords: Copy number alterations; DNA methylation; epigenetics; pan-cancer
Mesh:
Year: 2017 PMID: 28426276 PMCID: PMC5687331 DOI: 10.1080/15592294.2017.1319043
Source DB: PubMed Journal: Epigenetics ISSN: 1559-2294 Impact factor: 4.528
General characteristics of the study population.
| AJCC tumor stage | |||||||
|---|---|---|---|---|---|---|---|
| Tumor | Normal | Female | Male | Age | Stage I/II | Stage III/IV | |
| Cancer type | n | n | n (%) | n (%) | Mean (SD) | n (%) | n (%) |
| BLCA | 411 | 21 | 108 (26.3) | 303 (73.7) | 68.56 (10.60) | 133 (32.5) | 276 (67.5) |
| BRCA | 779 | 97 | 770 (98.8) | 9 (1.2) | 58.62 (13.10) | 563 (72.8) | 210 (27.2) |
| COAD | 283 | 38 | 131 (46.3) | 152 (53.7) | 65.37 (13.20) | 150 (54.5) | 125 (45.5) |
| ESCA | 185 | 16 | 27 (14.6) | 158 (85.4) | 62.95 (11.87) | 98 (60.1) | 65 (39.9) |
| HNSC | 527 | 50 | 142 (26.9) | 385 (73.1) | 61.40 (11.93) | 103 (22.6) | 353 (77.4) |
| KIRC | 319 | 160 | 114 (35.7) | 205 (64.3) | 61.85 (11.84) | 187 (58.6) | 132 (41.4) |
| KIRP | 275 | 45 | 73 (26.5) | 202 (73.5) | 62.18 (12.09) | 188 (74.3) | 65 (25.7) |
| LIHC | 374 | 50 | 120 (32.1) | 254 (67.9) | 59.83 (13.31) | 261 (73.9) | 92 (26.1) |
| LUAD | 457 | 32 | 244 (53.4) | 213 (46.6) | 65.53 (10.16) | 362 (79.4) | 94 (20.6) |
| LUSC | 369 | 42 | 96 (26.0) | 273 (74.0) | 68.05 (8.73) | 309 (84.0) | 59 (16.0) |
| PAAD | 183 | 10 | 82 (44.8) | 101 (55.2) | 65.28 (10.99) | 171 (94.5) | 10 (5.5) |
| PRAD | 497 | 50 | NA | 497 (100.0) | 61.56 (6.79) | 185 (37.8) | 305 (62.2) |
| THCA | 504 | 56 | 368 (73.0) | 136 (27.0) | 47.83 (15.77) | 334 (66.5) | 168 (33.5) |
| UCEC | 429 | 34 | 429 (100.0) | NA | 64.60 (11.18) | 284 (69.6) | 124 (30.4) |
| Overall | 5592 | 701 | 2704 (48.4) | 2888 (51.6) | 61.66 (12.96) | 3328 (61.6) | 2078 (38.4) |
| <2.2E-16 | <2.2E-16 | <2.2E-16 | |||||
General American Joint Committee on Cancer Classification,
Normal adjacent samples
P-values based on:
Fisher exact test,
ANOVA test NA: Not Available/Not Applicable
Subject and genomic context methylation dysregulation indices, and fraction of genome altered by cancer types.
| gcMDI | |||||||
|---|---|---|---|---|---|---|---|
| sMDI | CpG Island | Shores | Shelves | Open Sea | Fraction of genome altered | ||
| Cancer type | n | median (IQR) | median (IQR) | median (IQR) | median (IQR) | median (IQR) | median (IQR) |
| BLCA | 411 | 10.7 (8.5, 13.4) | 7 (5.6, 8.4) | 11.1 (8.6, 13.6) | 11.7 (9, 15.4) | 13.6 (10.3, 17.2) | 28.2 (13.9, 45.2) |
| LIHC | 374 | 10.6 (8.4, 13.4) | 7 (5.6, 8.1) | 10.6 (8.6, 12.8) | 11.3 (8.4, 15.4) | 13.2 (9.9, 17.9) | 25.2 (15.9, 37.8) |
| UCEC | 429 | 10.4 (9.2, 11.9) | 7.2 (6.2, 8.5) | 11.3 (10.1, 12.8) | 10.6 (9, 12.3) | 12.5 (10.7, 14.6) | 8.1 (0.2, 34.9) |
| ESCA | 185 | 10.1 (8.8, 11.8) | 7.2 (5.6, 9.6) | 10.2 (8.9, 11.9) | 10.6 (9, 12.5) | 11.9 (10.2, 14) | 35.8 (22.6, 50.1) |
| COAD | 283 | 9 (7.4, 10.4) | 7.2 (5.8, 9.1) | 8.9 (7.6, 10.5) | 8.7 (7, 10.2) | 10 (8.1, 11.7) | 20.7 (9, 32.6) |
| LUSC | 369 | 8.6 (7.3, 10.1) | 5.7 (4.9, 6.8) | 9.1 (7.6, 10.9) | 9.1 (7.6, 11.3) | 10.4 (8.7, 12.6) | 38.9 (3.8, 52.5) |
| HNSC | 527 | 8.5 (7.2, 10.1) | 6.3 (5.4, 7.4) | 8.8 (7.4, 10.4) | 8.8 (7.2, 11.2) | 9.7 (8.2, 12.3) | 22.9 (12.5, 33.4) |
| BRCA | 779 | 8.5 (7, 10.2) | 6.1 (4.9, 7.8) | 9 (7.6, 10.6) | 8.4 (6.9, 10.4) | 9.8 (8, 12.3) | 22.8 (11.2, 41.3) |
| LUAD | 457 | 7.5 (6.1, 8.9) | 5.7 (4.6, 7) | 8 (6.5, 9.5) | 7.2 (5.8, 8.8) | 8.4 (6.8, 10.3) | 22 (9.2, 38.7) |
| PRAD | 497 | 7.5 (5.5, 8.9) | 5.6 (4.4, 6.8) | 8 (6, 9.6) | 7.1 (5.3, 8.6) | 8.5 (6.2, 10.4) | 6.4 (1.8, 12.2) |
| KIRP | 275 | 6.4 (5.7, 7.7) | 3.7 (3.4, 4.7) | 7.4 (6.5, 8.8) | 6.7 (6.1, 7.8) | 8 (7.1, 9.2) | 14.9 (8.3, 22.8) |
| KIRC | 319 | 6.3 (5.7, 7.1) | 3.8 (3.3, 4.6) | 6.9 (6.2, 7.8) | 6.8 (6.2, 7.8) | 7.7 (7.1, 8.8) | 12.1 (6.3, 20) |
| PAAD | 183 | 6.1 (5.1, 7.8) | 4.9 (3.8, 6.2) | 6.5 (5.4, 8.3) | 6.3 (5.3, 7.8) | 7.1 (6, 8.8) | 9.4 (0.2, 22.8) |
| THCA | 504 | 5.2 (4.8, 5.7) | 3 (2.8, 3.4) | 6.3 (5.7, 6.9) | 5.5 (5, 6.1) | 6.2 (5.7, 6.9) | 0.01 (0.003, 1.1) |
| Overall | 5592 | 8.1 (6.2, 10.2) | 5.8 (4.3, 7.4) | 8.6 (6.8, 10.7) | 8.1 (6.4, 10.7) | 9.4 (7.4, 12.3) | 17.4 (4.8, 34.3) |
| <2.2E-16 | <2.2E-16 | <2.2E-16 | <2.2E-16 | <2.2E-16 | <2.2E-16 | ||
Kruskal-Wallis rank sum test
Acronyms: Interquartile Range (IQR)
Figure 1.Comparison of subject DNA methylation dysregulation index, fraction of genome altered, mutation count burden, and proportion of methylation dysregulation by genomic context by cancer type. Each dot corresponds to the sample specific methylation dysregulation index-MDI, fraction of genome altered, or mutation count burden. Tumor types are ordered by their median frequency of each alteration. Overall the median MDI was similar between the compared tumors. Individual MDI were disaggregated according to the unweighted proportion contributed by each genomic context. Thyroid cancers showed the lowest alterations in all the 3 measures, while bladder was consistently highly altered in all 3 measurements. For this specific data set there were only reported 4 mutations in colon adenocarcinoma, and there was not information available for esophageal carcinoma.
Figure 2.MDI clustering by cancer and genomic context.
Factors affecting the subject Methylation dysregulation index.
| Age | Female vs. Male | Early vs. late stage | Estimated tumor purity | Fraction of genome altered | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| β (95% CI) | β (95% CI) | β (95% CI) | β (95% CI) | β (95% CI) | R2 | ||||||
| BLCA | −0.26 (−0.05,0.002) | 0.06 | 0.32 (−0.35,0.99) | 0.35 | −0.73 (−1.37,−0.1) | 0.02 | 0.49 (0.03,0.07) | 0.63 (0.05,0.08) | 25.1% | ||
| BRCA | 0.38 (0.03,0.05) | 1.66 (0.24,3.09) | 0.02 | −0.03 (−0.37,0.32) | 0.87 | 0.12 (0.004,0.02) | 0.42 (0.03,0.05) | 20.2% | |||
| COAD | 0.16 (−0.004,0.04) | 0.11 | −0.13 (−0.64,0.38) | 0.61 | −0.47 (−1.01,0.06) | 0.08 | 0.42 (0.02,0.06) | 0.27 (0.01,0.04) | 12.2% | ||
| ESCA | 0.01 (−0.03,0.03) | 0.93 | −0.46 (−1.41,0.5) | 0.35 | −0.11 (−0.81,0.58) | 0.75 | 0.21 (0.004,0.04) | 0.43 (0.02,0.06) | 14.8% | ||
| HNSC | −0.02 (−0.02,0.02) | 0.83 | −0.06 (−0.53,0.41) | 0.79 | 0.56 (0.06,1.05) | 0.03 | 0.12 (−0.001,0.02) | 0.07 | 0.6 (0.05,0.07) | 19.4% | |
| KIRC | 0.06 (−0.01,0.02) | 0.30 | 0.13 (−0.17,0.43) | 0.39 | 0.69 (0.4,0.98) | 0.003 (−0.01,0.01) | 0.96 | 0.32 (0.02,0.04) | 20.7% | ||
| KIRP | 0.01 (−0.02,0.02) | 0.90 | −0.17 (−0.69,0.35) | 0.52 | 1.6 (1.07,2.12) | 0.05 (−0.01,0.02) | 0.57 | 0.15 (0.003,0.03) | 18.3% | ||
| LIHC | 0.68 (0.04,0.1) | 0.39 (−0.4,1.19) | 0.33 | 0.68 (−0.18,1.54) | 0.12 | 0.35 (0.01,0.06) | 0.71 (0.05,0.09) | 18.8% | |||
| LUAD | 0.04 (−0.01,0.02) | 0.58 | 0.57 (0.26,0.89) | −0.05 (−0.44,0.34) | 0.80 | 0.04 (−0.01,0.01) | 0.39 | 0.77 (0.07,0.09) | 46.8% | ||
| LUSC | 0.08 (−0.03,0.04) | 0.63 | 0.35 (−0.32,1.02) | 0.30 | 0.21 (−0.58,1.01) | 0.60 | 0.2 (0.003,0.04) | 0.02 | 0.7 (0.06,0.08) | 25.8% | |
| PAAD | −0.03 (−0.02,0.02) | 0.79 | 0.59 (0.14,1.04) | −0.13 (−1.09,0.83) | 0.79 | 0.003 (−0.01,0.01) | 0.94 | 0.95 (0.08,0.11) | 58.9% | ||
| PRAD | 0.28 (0.001,0.06) | 0.04 | NA | NA | −0.11 (−0.5,0.28) | 0.58 | 0.35 (0.02,0.05) | 1.1 (0.09,0.13) | 34.1% | ||
| THCA | 0.09 (0.002,0.01) | 0.2 (0.01,0.38) | 0.04 | 0.14 (−0.07,0.35) | 0.18 | 0.1 (0.01,0.02) | 0.29 (0.02,0.04) | 11.7% | |||
| UCEC | 0.07 (−0.01,0.03) | 0.51 | NA | NA | −0.22 (−0.68,0.24) | 0.35 | 0.26 (0.01,0.04) | 0.02 (−0.01,0.01) | 0.60 | 3.8% | |
per each 10 y increase,
Early (I+II), and late (III+IV),
per each 10% increase
Note: In bold those P-values ≤ 0.01 (Bonferroni correction for 5 comparisons).
Figure 3.Unsupervised clustering of tumors using the top most commonly dysregulated probes (n = 275). Using 275/384 most dysregulated loci several patterns were explored using 2 clustering approaches among cancer types. On the left an unsupervised hierarchical Euclidean distance. On the right, the 4 groups derived from a semi-supervised recursively partitioned mixture model (RPMM) classification.
Relation between the top ranked dysregulated loci and the gene expression of the closest gene.
| BLCA | BRCA | COAD | HNSC | KIRC | KIRP | LIHC | LUAD | LUSC | PAAD | PRAD | THCA | UCEC | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CGID(Gene) | % Δ( | % Δ( | % Δ( | % Δ( | % Δ( | % Δ( | % Δ( | % Δ( | % Δ( | % Δ( | % Δ( | % Δ( | % Δ( |
| 1) Promoter related probes | |||||||||||||
| a) CpG located 200 bp upstream of the transcriptional start site (TSS200) | |||||||||||||
| cg15811515 (CSDAP1) | −3.1 | −0.2 | −8 | 5.4 | 4.5 | 0.3 | −0.2 | −1.8 | 0.1 | 4.6 | −0.1 | −0.2 | −7.1 |
| cg02772121 (TRIM15) | −4 | −8.7 | −17.5 | −5.4 | −40.9 | −42.7 | −30.8 | −28.9 | −12.6 | −35.2 | −2.5 | −4.8 | −23.9 |
| cg13356896 (BOLL) | −0.1 | −2.3 | −2.5 | −2.9 | −0.1 | −0.2 | −0.7 | −4.2 | −3.7 | −0.3 | −0.6 | −0.5 | −7.3 |
| cg11201447 (PVT1) | −23.5 | −23 | −10.3 | −22.2 | −28.2 | −20.7 | −25.9 | −24.2 | −27.9 | −29.1 | −22.9 | −22.4 | −16.8 |
| b) CpG located 1500 bp upstream of the transcriptional start site (TSS1500) | |||||||||||||
| cg13879483 (USP44) | −19.2 | −13.9 | −6.6 | −20 | −5.7 | −10.6 | −17.7 | −26.1 | −18.8 | −15 | −9.4 | −15.6 | NA |
| cg13591783 (ANXA1) | −32.4 | −15.1 | −17.2 | −17.4 | −16.6 | −20.4 | −12.8 | −30.9 | −24 | −23.1 | −27.1 | −20.5 | −21.2 |
| cg21039708 (OTX2) | −0.3 | 0.9 | NA | −0.5 | 1.1 | −0.1 | 0.2 | −7.6 | −12.4 | −1.4 | −0.3 | 0.3 | 15.2 |
| c) CpG located in 5’ untranslated region (5’UTR) | |||||||||||||
| cg22399133 (CRYGD) | NA | NA | −0.1 | −5.3 | −0.6 | NA | −0.2 | NA | −1.3 | −1.9 | −3.1 | −0.8 | −16.2 |
| 2) Intragenic CpG related probes | |||||||||||||
| d) 1st Exon | |||||||||||||
| cg22674699 (HOXD9) | −6.1 | −2.8 | −5.8 | −1.6 | −10.3 | −12.5 | −0.6 | −14.3 | −1 | −6.2 | −30.6 | 21.4 | −4.7 |
| e) Gene body | |||||||||||||
| cg27260772 (TFAP2B) | 7.3 | −39.5 | −6.2 | −5.8 | −7.2 | 7.1 | 2.9 | −4.3 | −17.4 | −0.1 | −0.4 | 0.4 | −12.1 |
| cg06766860 (GALNT9) | −1 | −0.1 | −3.6 | 1.9 | −10.5 | −16.2 | −0.6 | 0.8 | −0.5 | −6.4 | 7.4 | −3.2 | 15.8 |
| cg07974511 (OTX1) | 18.6 | 33 | 34.7 | 36.5 | 15.7 | 24.6 | 23.3 | 19.8 | 43.2 | 39 | 17.3 | 13.1 | 43 |
| cg00817367 (GRASP) | −3.4 | −3.7 | −10.5 | −4.6 | −6 | −6.1 | −8 | −4.4 | −7.8 | −8.3 | −19.5 | −3.8 | −6.5 |
| cg18236877 (CASZ1) | NA | NA | 5.1 | 5.2 | −14 | −8.6 | NA | −3.2 | NA | NA | −7 | −11.2 | NA |
| cg17774559 (IRX4) | 28.1 | 20.6 | −3 | 11.4 | 2.2 | 0.6 | 0.3 | 9.1 | 95.2 | −0.8 | 40.4 | 1.3 | 6.6 |
| cg14861089 (TLX1) | 16.6 | −2.2 | 24.7 | −6.1 | 9.7 | 8.5 | 26.1 | 10.4 | 11.9 | 20.1 | 4.4 | −3.7 | NA |
| cg03301058 (GABRR2) | −1.7 | 0.4 | −3.7 | −2.4 | −3.2 | 6.7 | −2.2 | 1.4 | 2.1 | −2.5 | −0.2 | 6 | −2.2 |
| cg09296001 (SND1) | −1.3 | 0.1 | 2.9 | 2.2 | 3.1 | 0.7 | 1.6 | 2 | 3.8 | 0.1 | 8.4 | 5.1 | 2.3 |
| 3) Intergenic CpG related probes | |||||||||||||
| cg10903903 (HIST1H2BL) | 0.6 | 5.4 | 6.4 | 5.5 | 1.8 | 3.2 | −2.4 | 2.2 | 5.4 | 3.2 | NA | NA | 6.2 |
| cg06962177 (FOXD3) | −6.2 | 23.9 | −5.2 | 24.8 | 0.6 | 3.4 | 2.5 | 24.7 | 45.5 | −6.6 | 30.1 | −1.4 | 17.8 |
| cg11536474 (OTX1) | 12.4 | 23.7 | 26.7 | 36.9 | 17.2 | 26.1 | 17.3 | 21.4 | 41.2 | 23.9 | 17.2 | 21.5 | 37.6 |
| cg08364561 (RCN1) | 8 | 3.6 | 3 | 13.7 | 3 | 3.2 | 6.1 | −1.5 | 26.1 | −0.6 | −0.7 | −3.8 | 14.6 |
| cg22797031 (PRRX1) | −0.6 | −3.4 | −26.1 | −12 | 7.5 | NA | 4.3 | 1.9 | 7.2 | 6.7 | −3.7 | 3.8 | 8.5 |
| cg13539545 (SIX6) | 0.1 | 1.3 | 0 | −0.6 | 0.2 | −0.1 | −0.1 | 1.1 | 1.2 | −1.6 | 3.5 | −0.1 | 3.7 |
| cg17754510 (TFAP2A) | 17.4 | 23.9 | 10.1 | 8.7 | 59.7 | 88.1 | 37.2 | 29.8 | 2.1 | 31.4 | 12.3 | 33.8 | 45 |
| cg01518607 (PRSS16) | 8 | 16.4 | 6.1 | 8.2 | 8.3 | −1.2 | 5.2 | 11.4 | 13.7 | 17.6 | −4.9 | 2.1 | 22.1 |
| cg09887059 (LHX5) | 7.8 | −0.5 | −2.4 | −18.1 | 0.1 | 2.1 | 0.2 | −5.5 | 0.1 | −7.7 | 1.4 | −0.2 | −0.2 |
| cg18177414 (KRBA1) | 6.2 | 2.7 | NA | 4.2 | 5.5 | −2.3 | 6.6 | 2.1 | 3.3 | −14.9 | 1.6 | 2.4 | 0.2 |
| cg22620090 (LIN28B) | 7.6 | 1.5 | 1.1 | −1.2 | 2.5 | 1.5 | NA | 5.6 | 8.5 | NA | 4.2 | 0.5 | 36.2 |
| cg04456219 (AHR) | −26.2 | −2 | 3 | −6.5 | −16.2 | −14.3 | 1.4 | −3.8 | 1 | −15.3 | 4.8 | −11 | −2.9 |
| cg08443563 (MERTK) | 19.8 | 3.3 | 41.4 | 48.1 | 11.1 | 3.4 | NA | 11.9 | 20.9 | 15.2 | 1.6 | 0.7 | 5.9 |
| cg18103859 (MNAT1) | 4.7 | 3.3 | NA | 6.6 | 2.4 | 0.6 | 1.5 | 5.3 | 11.6 | 4.9 | 3.5 | 1.7 | 4.1 |
| cg03498081 (DYNLRB2) | −3.4 | 13.2 | −1.3 | 4.9 | −15.2 | −12.5 | 4.6 | 4.7 | 6 | 28.2 | 7.1 | 8.8 | 14.5 |
| cg02389317 (ZDHHC17) | −0.2 | 2.3 | 2 | −4.7 | −2.4 | −0.5 | 0.4 | 0.1 | 1.8 | 4.4 | 0.9 | −2.4 | −0.2 |
% Δ: Percent change in gene expression per 10% increase of methylation. Both gene expression and methylation were adjusted for age, sex, cancer stage, percent of tumor cells on the sample, and fraction of genome altered. The residuals of both linear models were used for the calculations.
Nearest gene.
CpG is annotated to the antisense transcript