| Literature DB >> 28423671 |
Nitish Kumar Mishra1, Chittibabu Guda1,2,3,4.
Abstract
Pancreatic cancer (PC) is the fourth leading cause of cancer deaths in the United States with a five-year patient survival rate of only 6%. Early detection and treatment of this disease is hampered due to lack of reliable diagnostic and prognostic markers. Recent studies have shown that dynamic changes in the global DNA methylation and gene expression patterns play key roles in the PC development; hence, provide valuable insights for better understanding the initiation and progression of PC. In the current study, we used DNA methylation, gene expression, copy number, mutational and clinical data from pancreatic patients. We independently investigated the DNA methylation and differential gene expression profiles between normal and tumor samples and correlated methylation levels with gene expression patterns. We observed a total of ~23-thousand differentially methylated CpG sites (Δβ≥0.1) between normal and tumor samples, where majority of the CpG sites are hypermethylated in PC, and this phenomenon is more prominent in the 5'UTRs and promoter regions compared to the gene bodies. Differential methylation is observed in genes associated with the homeobox domain, cell division and differentiation, cytoskeleton, epigenetic regulation and development, pancreatic development and pancreatic signaling and pancreatic cancer core signaling pathways. Correlation analysis suggests that methylation in the promoter region and 5'UTR has mostly negative correlations with gene expression while gene body and 3'UTR associated methylation has positive correlations. Regulatory element analysis suggests that HOX cluster and histone core proteins are upstream regulators of hypomethylation, while SMAD4, STAT4, STAT5B and zinc finger proteins (ZNF) are upstream regulators of hypermethylation. Non-negative matrix factorization (NMF) clustering of differentially methylated sites generated three clusters in PCs suggesting the existence of distinct molecular subtypes. Cluster 1 and cluster 2 showed samples enriched with clinical phenotypes like neoplasm histological grade and pathologic T-stage T3, respectively, while cluster 3 showed the enrichment of samples with neoplasm histological grade G1. To the best of our knowledge, this is the first genome-scale methylome analysis of PC data from TCGA. Our clustering analysis provides a strong basis for future work on the molecular subtyping of epigenetic regulation in pancreatic cancer.Entities:
Keywords: TCGA; differential methylation; integrative analysis; molecular subtypes; pancreatic cancer
Mesh:
Substances:
Year: 2017 PMID: 28423671 PMCID: PMC5438707 DOI: 10.18632/oncotarget.15993
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Genome-wide differential DNA methylation patterns in pancreatic cancer
(A) Difference in DNA methylation in all CpG sites passing false discovery rate (FDR) with Δβ ≥ 0.1. Δβ is weighted by T-statistics such that the distance from central core (grey) indicates increasing level of statistical significance. Chromosomes are shown in clockwise from 1 to 22; we did not use sex chromosomes (X or Y) in our analysis. (B) Chromosome wise DNA methylation frequency distribution in pancreatic cancer. For each chromosome we calculated hypermethylation and hypomethylation frequency per megabase pairs. We sorted chromosomes on the basis of DNA methylation/Mb. Chromosome 4, 5, 13 and 18 have two-fold higher hypermethylation frequencies.
Total number of differentially methylated CpG sites in different sub-regions
| CpG subregions | Δβ ≥ 0.1 | Δ β ≥ 0.2 | Δβ ≥ 0.3 |
|---|---|---|---|
| 3′UTR | 797 (623) | 298 (236) | 45 (40) |
| 5′UTR | 3059 (1563) | 1523 (859) | 418 (301) |
| 1st Exon | 2168 (1086) | 1223 (674) | 407 (274) |
| Gene Body | 8593 (3663) | 3731 (1962) | 757 (530) |
| TSS200 | 2292 (1054) | 1249 (592) | 373 (224) |
| TSS1500 | 3125 (1691) | 1318 (763) | 294 (209) |
| TSS1.5Kb | 9465 (3167) | 4669 (1719) | 1243 (638) |
The numbers in parenthesis represent the total number of HGNC genes covered by these CpG sites.
UTR- Untranslated Region; TSS- Transcription Start Site, TSS1.5Kb- 1.5 Kb up and downstream from TSS
Top twenty hypermethylated and hypomethylated CpG sites in known genes
| Illumina probe id | Δβ value | Chromosome | Gene symbol | Fold change |
|---|---|---|---|---|
| cg22674699 | 0.60 | chr2 | HOXD9 | 4.713211 |
| cg03692651 | 0.59 | chr19 | ZNF729 | 3.736158 |
| cg03306374 | 0.57 | chr16 | PRKCB | 6.524701 |
| cg22784954 | 0.56 | chr5 | ADAMTS16 | 4.539619 |
| cg19717586 | 0.56 | chr11 | NTM | 5.896104 |
| cg16729415 | 0.55 | chr15 | GJD2 | 5.436232 |
| cg15811515 | 0.55 | chr16 | YBX3P1 | 4.782864 |
| cg24221648 | 0.54 | chr13 | RNF219-AS1 | 5.424716 |
| cg17495912 | 0.54 | chr13 | CCNA1 | 4.81373 |
| cg06304097 | 0.54 | chr10 | TCERG1L | 5.730066 |
| cg26296488 | 0.53 | chr4 | DRD5 | 5.07241 |
| cg22620090 | 0.53 | chr6 | LIN28B | 3.737363 |
| cg15506157 | 0.53 | chr7 | KLRG2 | 2.38998 |
| cg07915921 | 0.53 | chr12 | HOXC13-AS | 5.657705 |
| cg22797031 | 0.52 | chr1 | PRRX1 | 3.527956 |
| cg17774559 | 0.52 | chr5 | IRX4 | 4.203264 |
| cg14473102 | 0.52 | chr2 | HOXD8 | 4.714848 |
| cg20302133 | 0.51 | chr1 | KCNA3 | 6.175219 |
| cg17985646 | 0.50 | chr7 | TBX20 | 5.129886 |
| cg25397945 | 0.49 | chr19 | ZNF382 | 5.946388 |
| cg07805542 | −0.44 | chr1 | PIK3CD | 0.466222 |
| cg04214938 | −0.44 | chr2 | EN1 | 0.550001 |
| cg01077100 | −0.44 | chr10 | BTBD16 | 0.505858 |
| cg13446584 | −0.45 | chr7 | GTF2IRD1 | 0.548374 |
| cg10728351 | −0.45 | chr4 | ANXA5 | 0.590933 |
| cg09159452 | −0.45 | chr7 | IQCE | 0.540868 |
| cg09077096 | −0.45 | chr7 | CARD11 | 0.570784 |
| cg21011133 | −0.46 | chr2 | ADCY3 | 0.434894 |
| cg27411547 | −0.47 | chr8 | SLC45A4 | 0.587623 |
| cg07388969 | −0.47 | chr15 | SPRED1 | 0.464429 |
| cg20151476 | −0.48 | chr7 | PSMG3 | 0.56687 |
| cg07248223 | −0.48 | chr17 | CCR7 | 0.499523 |
| cg20518446 | −0.49 | chr11 | AHNAK | 0.501228 |
| cg20765408 | −0.50 | chr13 | PARP4 | 0.438326 |
| cg11303839 | −0.50 | chr7 | CCL26 | 0.505054 |
| cg20928945 | −0.53 | chr7 | ADAP1 | 0.436731 |
| cg09287864 | −0.53 | chr7 | AHR | 0.50094 |
| cg20852851 | −0.54 | chr2 | HDAC4 | 0.524387 |
| cg05926314 | −0.55 | chr7 | PTPRN2 | 0.483487 |
| cg23066280 | −0.56 | chr7 | PTPRN2 | 0.506833 |
Figure 2Genome-wide distribution of DNA methylation CpG islands in pancreatic cancer
For each known CpG island with more than 3 methylated CpG sites, we calculated median of β value. Difference in DNA methylation in all CpG islands passing FDR with Δβ ≥ 0.1. CpG islands that are hypomethylated are in blue color.
Top ten hypermethylated and hypomethylated CpG islands in TCGA pancreatic cancer data
| CpG Island | Delta beta | Fold change |
|---|---|---|
| chr6:105400877-105401149 | 047 | 2.91 |
| chr5:178003623-178004247 | 0.44 | 5.16 |
| chr14:57278709-57279116 | 0.43 | 5.02 |
| chr7:42267546-42267823 | 0.40 | 5.54 |
| chr1:158147433-158147854 | 0.39 | 2.15 |
| chr4:5709985-5710495 | 0.39 | 3.93 |
| chr2:127413696-127414171 | 0.39 | 7.09 |
| chr1:111216244-111217937 | 0.39 | 4.87 |
| chr7:19184818-19185033 | 0.38 | 4.30 |
| chr1:248020330-248021252 | 0.38 | 6.09 |
| chr15:66274583-66274838 | −0.18 | 0.29 |
| chr19:1856725-1857443 | −0.18 | 0.47 |
| chr19:1240154-1240546 | −0.18 | 0.26 |
| chr2:85811340-85811855 | −0.20 | 0.48 |
| chr16:29818681-29819554 | −0.20 | 0.30 |
| chr21:45148454-45149262 | −0.24 | 0.44 |
| chr12:6649677-6649897 | −0.25 | 0.46 |
| chr12:6649677-6649897 | −0.27 | 0.47 |
| chr8:103750881-103751088 | −0.31 | 0.50 |
| chr6:33244677-33245554 | −0.33 | 0.33 |
List of differentially methylated epigenetic enzyme, chromatin remodeler and histone proteins in TCGA pancreatic cancer
| Mark | Writer | Reader | Eraser/editor |
|---|---|---|---|
| DNA methylation | DNMT1, DNMT3A, DNMT3B | CHD2, CHD7, MBD1, ZBTB38, ZMYM4, ZMYM6, | APOBEC1, IDH2, MGMT, TET3 |
| Histone methylation | EHMT1, EHMT2, EZH2, KMT2C/MML3, KMT2D/MLL2, MECOM, PRDM1, PRDM2, PRDM4, PRDM6, PRDM7, PRDM8, PRDM11, PRDM12, PRDM13, PRDM14, PRDM15, PRDM16, SETBP1, SETD3, SETD7, SETMAR, SMYD2, SMYD3, WHSC1, WHSC1L1 | ATXN7, CBX5, CHD2, CHD7, DHX30, DNMT3A, EHMT1, EHMT2, GATAD2A, UHRF1, ZMYM8 | KDM2A, KDM2B, KDM3A, KDM3B, KDM4B, KDM6B |
| Histone acetylation | CREBBP, GTF3C1, KAT2A, KAT6B, NCOA1, NCOA2, NCOA7 | ATXN7, BRD1, BRD3, BRD4, DHX30, GATAD2A, ZMYM8 | HDAC4, HDAC5, HDAC9, HDAC11, SIRT6, SIRT7 |
| Arginine methylation | PRMT6, PRMT8 | ||
| *Chromatin remodeler | ARID1B, CHD2, CHD7, CHD8, DPF3, SMARCA2, SMARCD3, TTF2 | ||
| *Histone protein | H1F0, H1FOO, HIST3H2A, HIST1H1E, HIST1H2AG, HIST1H2APS1, HIST1H2BA, HIST3H2BB, HIST1H2BC, HIST2H2BF, HIST1H2BI, HIST1H2BN, HIST1H3B, HIST1H3C, HIST1H3E, HIST1H3F, HIST1H3G, HIST1H3H, HIST1H4F, HIST1H4H | ||
In this figure we are using green for hypermethylated probes, similarly red for hypomethylated and black color for genes which have both hypermethylated and hypomethylated CpG sites.
*In case of chromatin remodeler and histone proteins there is no reader, writer and eraser
Figure 3Patterns of DNA methylation in homeobox-containing gene cluster families in pancreatic cancer
For each homeobox-containing gene subfamily we calculated the number of differentially hypermethylated and hypomethylated CpG sites that meet the p-value and FDR thresholds of 0.01. Hypermethylated CpGs are on the right side of the Y-axis and hypomethylated on the left side. In the figure, blue and green color shows the total number of differentially hypermethylated and hypomethylated CpGs, respectively, and purple color shows the total number of differentially methylated CpG sites with Δβ ≥ 0.2.
WebGestalt based pathway analysis of differentially methylated genes which have delta beta value more than 0.2 in tumor vs. normal
| KEGG pathway | KEGG ID | Ratio | Raw p-value | BH adjusted p-value |
|---|---|---|---|---|
| Pathways in cancer | 05200 | 3.45 | 8.36e-31 | 1.50e-28 |
| MAPK signaling pathway | 04010 | 3.55 | 2.06e-27 | 1.84e-25 |
| Neuroactive ligand-receptor interaction | 04080 | 3.50 | 6.95e-27 | 4.15e-25 |
| Calcium signaling pathway | 04020 | 4.17 | 2.98e-26 | 1.33e-24 |
| Focal adhesion | 04510 | 3.91 | 1.33e-25 | 4.76e-24 |
| Regulation of actin cytoskeleton | 04810 | 3.72 | 2.11e-24 | 6.29e-23 |
| Endocytosis | 04144 | 3.35 | 2.12e-18 | 5.42e-17 |
| Metabolic pathways | 01100 | 1.76 | 1.63e-14 | 2.92e-13 |
| Vascular smooth muscle contraction | 04270 | 3.69 | 9.39e-14 | 1.53e-12 |
| Axon guidance | 04360 | 3.48 | 2.16e-13 | 3.22e-12 |
| ECM-receptor interaction | 04512 | 4.15 | 2.87e-13 | 3.95e-12 |
| Leukocyte transendothelial migration | 04670 | 3.60 | 4.81e-13 | 6.09e-12 |
| Pancreatic secretion | 04972 | 3.71 | 2.73e-12 | 3.05e-11 |
Figure 4Unsupervised clustering of PC patient data on the basis of differentially methylated CpG sites
We used only the CpG sites with Δβ ≥ 3. We used NMF for consensus clustering of samples by using 500 permutations. Vertical sidebars show information on the direction of methylation, fold change in tumor, probe relationship and probe annotation of differentially methylated CpG sites. Top annotations of heatmap plot are somatic mutations, copy number deletion and amplification from cBioPortal.
WebGestalt based pathway enrichment analysis of differentially expressed genes in TCGA pancreatic cancer data
| KEGG pathway | KEGG ID | Ratio | Raw p-value | BH adjusted p-value |
|---|---|---|---|---|
| Hematopoietic cell lineage | 4640 | 19.84 | 9.65e-11 | 2.22e-09 |
| Primary immunodeficiency | 5340 | 34.92 | 1.09e-09 | 1.67e-08 |
| Cytokine-cytokine receptor interaction | 4060 | 8.57 | 5.50e-09 | 6.33e-08 |
| B cell receptor signaling pathway | 4662 | 18.62 | 1.25e-08 | 1.15e-07 |
| Natural killer cell mediated cytotoxicity | 4650 | 11.55 | 1.02e-07 | 7.82e-07 |
| Chemokine signaling pathway | 4062 | 9.24 | 1.62e-07 | 1.06e-06 |
| Neuroactive ligand-receptor interaction | 4080 | 6.42 | 4.41e-06 | 2.25e-05 |
| Leukocyte transendothelial migration | 4670 | 7.53 | 0.0006 | 0.0023 |
| Graft-versus-host disease | 5332 | 12.78 | 0.0017 | 0.0056 |
| Intestinal immune network for IgA production | 4672 | 10.91 | 0.0027 | 0.0083 |
| Autoimmune thyroid disease | 5320 | 10.07 | 0.0033 | 0.0095 |
| Cell adhesion molecules (CAMs) | 4514 | 5.25 | 0.0074 | 0.0197 |
| Antigen processing and presentation | 4612 | 6.89 | 0.0096 | 0.0232 |
Figure 5CpG sites whose DNA methylation levels were significantly correlated with gene expression with Bonferroni corrected P-value < 0.05
(A) Significance level of correlation between DNA methylation β value and gene expression plotted against distance between CpG sites and transcription start site (TSS). (B) Significance level and genome-wide distribution of correlation between DNA methylation and gene expression. Red dots represent negative correlation and blue dots represent positive correlation. We did not use sex chromosomes in this analysis.
Figure 6Significant correlation between DNA methylation patterns in different gene regions and gene expression (Bonferroni corrected P-value < 0.05)
Pie chart plot shows the distribution of negative and positive correlations corresponding to the functional regions of genes. Distribution patterns are very different for the positive correlations compare to negative correlations.