| Literature DB >> 36185333 |
Hui-Ching Wang1,2,3, Hsiang-Yao Shih4, Chun-Chieh Wu5, Li-Tzong Chen6, Chi-Wen Luo7, Yi-Chang Liu2, Jeng-Shiun Du2, Min-Chin Huang8, Yung-Yeh Su6, Huan-Da Chen9, Hui-Hua Hsiao2, Sin-Hua Moi10, Mei-Ren Pan1,11,12.
Abstract
In recent years, translational research and pharmacological targeting of epigenetic modifications have become the focus of personalized therapy for patients with pancreatic cancer. Preclinical and clinical trials targeting post-translational modifications have been evaluated as monotherapy or in combination with standard chemotherapy. In this study, we selected 43 genes from seven families of chromatin-modifying enzymes and investigated the influences of epigenetic modifications and their interactions on pancreatic ductal adenocarcinoma (PDAC) using hierarchical clustering analysis. Our analysis also evaluated their effects on treatment modalities and regimens of chemotherapy for PDAC. RNA-seq data for a total of 177 patients with pancreatic cancer, obtained from The Cancer Genome Atlas database, were analyzed. Our results suggested that high-risk patients of survival significant chromatin remodeling-associated gene cluster (gene cluster 2), composed of histone methyltransferases, histone acetyltransferases, histone deacetylases, histone demethylases, and 10-11 translocation family, demonstrated inferior progression-free survival and overall survival in patients with PDAC, especially in men. Our novel biomarker, survival significant chromatin remodeling-associated gene cluster, showed superior prediction performance compared with the conventional TNM system. Overall, these findings suggest that epigenetic modifications and interactions play an important role in the prognosis and therapeutic response of patients with PDAC. © The author(s).Entities:
Keywords: Epigenetic modification; Hierarchical clustering analysis; Pancreatic Cancer
Mesh:
Substances:
Year: 2022 PMID: 36185333 PMCID: PMC9515693 DOI: 10.7150/ijms.73800
Source DB: PubMed Journal: Int J Med Sci ISSN: 1449-1907 Impact factor: 3.642
Seven families of chromatin modification enzymes.
| Family | Members |
|---|---|
| HAT | EP300, CBP, KAT2A, KAT7, KAT8 |
| DNMT | DNMT1, DNMT3A, DNMT3B |
| HMT | SUV39H1 (H3K9 HMT), SUV39H2 (H3K9 HMT), EHMT1 (H3K9 HMT), EHMT2 (H3K9 HMT), SETDB1 (H4K20 HMT), SETDB2 (H4K20 HMT), EZH2 (H3K27 HMT), EZH1 (H3K27 HMT), SETD8 (H3K4 HMT), SUV420H1 (H4K20 HMT), SUV420H2 (H4K20 HMT) |
| HDAC | HDAC1, HDAC2, HDAC3, HDAC4, HDAC6, HDAC7, SIRT1, SITR2 |
| HDM | KDM1A (H3K4 and H3K9 HDM), KDM3A (H3K9 HDM), KDM3B (H3K9 HDM), KDM4A (H3K9 HDM), KDM4B (H3K9 HDM), KDM5C (H3K4 HDM), KDM5D (H3K4 HDM), KDM6B (H3K27 HDM), KDM8 (H3K36 HDM) |
| TET | TET1, TET2, TET3 |
| BET | BRD2, BRD3, BRD4, BRDT |
Baseline characteristics of study population (n = 177).
| Characteristics | Overall, n=177 |
|---|---|
| Diagnosis age | 65 (35 - 88) |
| Sex | |
| Female | 80 (45.2%) |
| Male | 97 (54.8%) |
| Ethnicity | |
| Hispanic Or Latino | 5 (2.8%) |
| Not Hispanic Or Latino | 130 (73.4%) |
| Unknown | 42 (23.7%) |
| Race | |
| Asian | 11 (6.2%) |
| Black or African American | 6 (3.4%) |
| White | 156 (88.1%) |
| Unknown | 4 (2.3%) |
| Histology | |
| 8500/3 | 142 (80.2%) |
| Others | 35 (19.8%) |
| Pathological stage | |
| Stage 1 | 21 (12.0%) |
| Stage 2 | 146 (83.4%) |
| Stage 3 | 4 (2.3%) |
| Stage 4 | 4 (2.3%) |
| Unknown | 2 |
| LN+ | 123 (69.5%) |
| Treatment | |
| CT (Gemcitabine-based) | 70 (39.5%) |
| RT+CT (Gemcitabine-based) | 24 (13.6%) |
| CT (Others) | 8 (4.5%) |
| RT+CT (Others) | 15 (8.5%) |
| RT alone | 4 (2.3%) |
| None | 56 (31.6%) |
| Progressed disease | 103 (58.2%) |
| Died | 92 (52.0%) |
Figure 1Hierarchical clustering analysis results for candidate genes. (A) Optimal number of clusters determined using average silhouette width. (B) Dendrogram of agglomerative hierarchical clustering according to the similarity between candidate genes. (C) Cluster plot of optimal gene cluster determined by the hierarchical clustering algorithm.
RNA-seq expression of candidate genes involved in gene cluster 1 (n=177).
| Characteristics | Overall, n=177 | Gene cluster 1 (GC1) | ||
|---|---|---|---|---|
| Low-risk, n = 169 | High-risk, n = 8 |
| ||
| GC1 included genes | ||||
| BRD2 | 0.02 (-5.42 - 2.60) | 0.07 (-2.25 - 2.60) | -1.02 (-5.42 - -0.28) |
|
| BRD3 | 0.26 (-3.65 - 2.42) | 0.31 (-2.45 - 2.42) | -2.35 (-3.65 - -0.94) |
|
| BRD4 | 0.11 (-5.28 - 2.24) | 0.14 (-2.41 - 2.24) | -1.90 (-5.28 - -0.52) |
|
| BRDT | -2.37 (-2.37 - 2.78) | -2.37 (-2.37 - 2.78) | -2.37 (-2.37 - 0.54) | 0.858 |
| CREBBP | 0.13 (-3.07 - 2.83) | 0.20 (-2.27 - 2.83) | -2.68 (-3.07 - -1.26) |
|
| DNMT1 | 0.07 (-3.62 - 2.95) | 0.08 (-2.42 - 2.95) | -1.28 (-3.62 - 1.53) |
|
| EHMT1 | 0.14 (-3.33 - 2.40) | 0.17 (-3.20 - 2.40) | -1.06 (-3.33 - -0.49) |
|
| EP300 | 0.10 (-3.88 - 1.87) | 0.18 (-2.77 - 1.87) | -2.84 (-3.88 - -0.96) |
|
| EZH1 | -0.05 (-4.53 - 3.07) | 0.03 (-4.53 - 3.07) | -0.80 (-1.29 - 0.23) |
|
| HDAC4 | -0.01 (-2.67 - 4.45) | 0.01 (-2.67 - 4.45) | -0.65 (-2.40 - 0.79) | 0.060 |
| HDAC6 | 0.00 (-2.19 - 4.07) | 0.03 (-2.14 - 4.07) | -0.98 (-2.19 - -0.47) |
|
| JMJD5 | 0.03 (-2.64 - 3.60) | 0.06 (-2.64 - 3.60) | -0.43 (-2.38 - 1.44) | 0.514 |
| KDM3B | 0.18 (-4.75 - 2.59) | 0.24 (-2.13 - 2.59) | -2.13 (-4.75 - -1.70) |
|
| KDM5C | -0.08 (-4.60 - 2.74) | -0.05 (-4.60 - 2.74) | -1.78 (-2.52 - -0.98) |
|
| KDM6B | -0.12 (-3.52 - 3.05) | -0.10 (-2.16 - 3.05) | -0.89 (-3.52 - -0.52) |
|
| MYST2 | 0.03 (-3.19 - 2.99) | 0.07 (-2.88 - 2.99) | -1.83 (-3.19 - -0.15) |
|
| SETDB2 | 0.03 (-3.09 - 2.60) | 0.08 (-3.09 - 2.60) | -0.71 (-0.91 - 0.42) | 0.128 |
| SIRT1 | 0.05 (-2.98 - 2.13) | 0.10 (-2.41 - 2.13) | -2.28 (-2.98 - -1.14) |
|
| SUV420H1 | 0.07 (-3.76 - 2.03) | 0.07 (-2.90 - 2.03) | -2.08 (-3.76 - 0.57) |
|
| TET1 | 0.07 (-2.78 - 2.62) | 0.14 (-2.55 - 2.62) | -1.23 (-2.78 - 0.24) |
|
| TET2 | 0.16 (-4.09 - 2.52) | 0.17 (-3.00 - 2.52) | -2.53 (-4.09 - -0.78) |
|
P-value is estimated using Wilcoxon rank-sum test.
RNA-seq expression of candidate genes involved in gene cluster 2 (n=177).
| Characteristics | Overall, n=177 | Gene cluster 2 (GC2) | ||
|---|---|---|---|---|
| Low-risk, n = 144 | High-risk, n = 33 |
| ||
| GC2 included genes | ||||
| DNMT3A | 0.17 (-3.83 - 2.14) | 0.20 (-3.83 - 2.14) | -0.07 (-1.98 - 1.45) | 0.154 |
| DNMT3B | -0.03 (-2.40 - 3.68) | -0.03 (-2.40 - 3.68) | -0.06 (-2.16 - 1.75) | 0.292 |
| EHMT2 | 0.02 (-2.39 - 4.51) | 0.22 (-1.67 - 4.51) | -1.04 (-2.39 - 0.65) |
|
| EZH2 | -0.04 (-3.92 - 3.07) | -0.01 (-3.92 - 3.07) | -0.11 (-1.77 - 2.16) | 0.235 |
| HDAC1 | -0.11 (-2.39 - 2.98) | -0.11 (-2.39 - 2.98) | -0.12 (-1.71 - 1.84) | 0.568 |
| HDAC2 | 0.01 (-2.87 - 3.33) | -0.06 (-2.87 - 3.33) | 0.04 (-1.21 - 1.30) | 0.408 |
| HDAC3 | -0.05 (-2.76 - 4.11) | 0.07 (-2.76 - 4.11) | -0.47 (-1.38 - 0.45) |
|
| HDAC7 | 0.18 (-4.39 - 1.92) | 0.39 (-4.39 - 1.92) | -0.70 (-1.96 - 0.23) |
|
| KAT2A | -0.03 (-2.52 - 2.31) | 0.25 (-1.56 - 2.31) | -1.22 (-2.52 - 0.14) |
|
| KDM1A | -0.11 (-2.42 - 5.90) | -0.13 (-2.42 - 5.90) | -0.03 (-1.58 - 1.57) | 0.823 |
| KDM3A | 0.01 (-3.61 - 2.71) | 0.01 (-3.61 - 2.67) | 0.11 (-1.01 - 2.71) | 0.426 |
| KDM4A | 0.06 (-4.52 - 4.23) | -0.05 (-4.52 - 4.23) | 0.45 (-1.09 - 2.32) |
|
| KDM4B | 0.03 (-3.08 - 4.01) | 0.22 (-2.73 - 4.01) | -0.95 (-3.08 - 1.12) |
|
| KDM5D | 0.17 (-2.03 - 1.03) | 0.19 (-2.03 - 1.03) | -1.66 (-2.03 - 0.83) | 0.188 |
| MYST1 | 0.02 (-2.42 - 4.43) | 0.17 (-1.91 - 4.43) | -0.93 (-2.42 - 0.61) |
|
| SETD8 | -0.01 (-3.63 - 4.09) | 0.14 (-3.63 - 4.09) | -0.25 (-2.15 - 1.40) |
|
| SETDB1 | -0.13 (-2.71 - 3.53) | -0.13 (-2.71 - 3.53) | -0.13 (-1.88 - 1.70) | 0.719 |
| SIRT2 | -0.08 (-2.96 - 3.28) | -0.04 (-2.96 - 3.28) | -0.31 (-1.49 - 1.28) |
|
| SUV39H1 | -0.05 (-2.54 - 3.91) | -0.01 (-2.54 - 3.91) | -0.24 (-1.80 - 0.74) |
|
| SUV39H2 | -0.06 (-2.47 - 3.62) | -0.22 (-2.47 - 3.62) | 0.76 (-0.56 - 1.70) |
|
| SUV420H2 | 0.02 (-3.37 - 3.32) | 0.17 (-1.25 - 3.32) | -1.31 (-3.37 - 0.08) |
|
| TET3 | 0.23 (-4.94 - 1.55) | 0.15 (-4.94 - 1.55) | 0.42 (-1.05 - 1.37) |
|
P-value is estimated using Wilcoxon rank-sum test and Chi-squared test.
Baseline characteristics of risk subgroups in gene cluster 1 (n=177).
| Characteristics | Gene cluster 1 (GC1) | ||
|---|---|---|---|
| Low-risk, n = 169 | High-risk, n = 8 |
| |
| Diagnosis age | 65 (35 - 88) | 73 (57 - 82) | 0.065 |
| Sex | 0.074 | ||
| Female | 79 (46.7%) | 1 (12.5%) | |
| Male | 90 (53.3%) | 7 (87.5%) | |
| Ethnicity | 0.288 | ||
| Hispanic Or Latino | 5 (3.0%) | 0 (0.0%) | |
| Not Hispanic Or Latino | 126 (74.6%) | 4 (50.0%) | |
| Unknown | 38 (22.5%) | 4 (50.0%) | |
| Race | 0.183 | ||
| Asian | 9 (5.3%) | 2 (25.0%) | |
| Black or African American | 6 (3.6%) | 0 (0.0%) | |
| White | 150 (88.8%) | 6 (75.0%) | |
| Unknown | 4 (2.4%) | 0 (0.0%) | |
| Histology | 0.359 | ||
| 8500/3 | 134 (79.3%) | 8 (100.0%) | |
| Others | 35 (20.7%) | 0 (0.0%) | |
| Pathological stage | 1.000 | ||
| Stage 1 | 20 (11.9%) | 1 (14.3%) | |
| Stage 2 | 140 (83.3%) | 6 (85.7%) | |
| Stage 3 | 4 (2.4%) | 0 (0.0%) | |
| Stage 4 | 4 (2.4%) | 0 (0.0%) | |
| Unknown | 1 | 1 | |
| LN+ | 119 (70.4%) | 4 (50.0%) | 0.249 |
| Treatment | 0.705 | ||
| CT (Gemcitabine-based) | 68 (40.2%) | 2 (25.0%) | |
| RT+CT (Gemcitabine-based) | 22 (13.0%) | 2 (25.0%) | |
| CT (Others) | 8 (4.7%) | 0 (0.0%) | |
| RT+CT (Others) | 14 (8.3%) | 1 (12.5%) | |
| RT alone | 4 (2.4%) | 0 (0.0%) | |
| None | 53 (31.4%) | 3 (37.5%) | |
| Progressed disease | 97 (57.4%) | 6 (75.0%) | 0.471 |
| Died | 87 (51.5%) | 5 (62.5%) | 0.722 |
P-value is estimated using Wilcoxon rank-sum test and Fisher's exact test.
Baseline characteristics of risk subgroups in gene cluster 2 (n=177).
| Characteristics | Gene cluster 2 (GC2) | ||
|---|---|---|---|
| Low-risk, n = 144 | High-risk, n = 33 |
| |
| Diagnosis age | 65 (35 - 88) | 64 (41 - 84) | 0.177 |
| Sex | 0.232 | ||
| Female | 62 (43.1%) | 18 (54.5%) | |
| Male | 82 (56.9%) | 15 (45.5%) | |
| Ethnicity | 0.097 | ||
| Hispanic Or Latino | 3 (2.1%) | 2 (6.1%) | |
| Not Hispanic Or Latino | 103 (71.5%) | 27 (81.8%) | |
| Unknown | 38 (26.4%) | 4 (12.1%) | |
| Race |
| ||
| Asian | 10 (6.9%) | 1 (3.0%) | |
| Black or African American | 3 (2.1%) | 3 (9.1%) | |
| White | 130 (90.3%) | 26 (78.8%) | |
| Unknown | 1 (0.7%) | 3 (9.1%) | |
| Histology | 0.46 | ||
| 8500/3 | 114 (79.2%) | 28 (84.8%) | |
| Others | 30 (20.8%) | 5 (15.2%) | |
| Pathological stage | 0.336 | ||
| Stage 1 | 17 (12.0%) | 4 (12.1%) | |
| Stage 2 | 120 (84.5%) | 26 (78.8%) | |
| Stage 3 | 3 (2.1%) | 1 (3.0%) | |
| Stage 4 | 2 (1.4%) | 2 (6.1%) | |
| Unknown | 2 | 0 | |
| LN+ | 99 (68.8%) | 24 (72.7%) | 0.654 |
| Treatment | 0.482 | ||
| CT (Gemcitabine-based) | 58 (40.3%) | 12 (36.4%) | |
| RT+CT (Gemcitabine-based) | 20 (13.9%) | 4 (12.1%) | |
| CT (Others) | 8 (5.6%) | 0 (0.0%) | |
| RT+CT (Others) | 13 (9.0%) | 2 (6.1%) | |
| RT alone | 4 (2.8%) | 0 (0.0%) | |
| None | 41 (28.5%) | 15 (45.5%) | |
| Progressed disease | 77 (53.5%) | 26 (78.8%) |
|
| Died | 65 (45.1%) | 27 (81.8%) |
|
P-value is estimated using Wilcoxon rank-sum test, Chi-squared test, and Fisher's exact test.
Figure 2Annotated heatmap of candidate genes of (A) gene cluster 1 and (B) gene cluster 2 based on RNA-seq expression. All heatmaps were ordered according to the risk subgroup, sex, progression-free survival (PFS), and overall survival (OS) status. The red color indicates a higher expression level, green color indicates a lower expression level of RNA-seq.
Figure 3Correlation analysis results of candidate genes in (A) gene cluster 1 and (B) gene cluster 2. The lower triangular area shows the scatter plot of pairwise genes, and the upper triangular area shows the correlation coefficients of each pair. The middle diagonal cell revealed the histogram of each gene according to the distribution of RNA-seq expression in the study population. •p < 0.1, *p < 0.05, **p < 0.01, ***p < 0.001.
Figure 4Progression-free survival (PFS) and overall survival (OS) analyses result in gene clusters of the study population. Kaplan-Meier plot for (A) PFS and (B) OS of gene cluster 1. Kaplan-Meier plot for (C) PFS and (D) OS of gene cluster 2. The red solid line indicates the high-risk subgroup, and the blue solid line indicates the low-risk subgroup. P-value is estimated using the log-rank test.
Figure 5Sex-specified progression-free survival (PFS) and overall survival (OS) analyses results in two gene clusters of the study population. Kaplan-Meier plot for (A) PFS and (B) OS of gene cluster 1, (C) PFS and (D) OS of gene cluster 2 in the female cohort. Kaplan-Meier plot for (E) PFS and (F) OS of gene cluster 1, (G) PFS and (H) OS of gene cluster 2 in a male cohort. The red solid line indicates the high-risk subgroup, and the blue solid line indicates the low-risk subgroup. P-value is estimated using the log-rank test.
Figure 6Treatment-specified progression-free survival (PFS) and overall survival (OS) analyses result in gene clusters of radiotherapy and chemotherapy cohorts each. Kaplan-Meier plot for (A) PFS and (B) OS of gene cluster 1, (C) PFS and (D) OS of gene cluster 2 in the CT cohort alone. Kaplan-Meier plot for (E) PFS and (F) OS of gene cluster 1, (G) PFS and (H) OS of gene cluster 2 in CT+RT cohort. The red solid line indicates the high-risk subgroup, and the blue solid line indicates the low-risk subgroup. P-value is estimated using the log-rank test.
Cox regression analysis results for PFS and OS.
| Characteristics | Comparison | Crude-HR (95% CI) |
| Adjust-HRa (95% CI) |
|
|---|---|---|---|---|---|
|
| |||||
| Gene cluster 1 | High-risk vs low risk | 2.05 (0.89, 4.72) | 0.092 | 1.82 (0.72, 4.58) | 0.200 |
| Gene cluster 2 | High-risk vs low risk |
|
|
|
|
| Age | years | 1.01 (0.99, 1.03) | 0.180 | - | |
| Sex | Male vs female | 0.97 (0.66, 1.43) | 0.880 | - | |
| Ethnicity | Non-vs hispanic Or Latino | 1.3 (0.41, 4.15) | 0.650 | - | |
| Others vs hispanic Or Latino | 1.69 (0.51, 5.58) | 0.390 | - | ||
| Race | Black/African American vs Asian | 1.1 (0.31, 3.91) | 0.880 | - | |
| Others vs Asian | 1.45 (0.41, 5.13) | 0.570 | - | ||
| White vs Asian | 1.07 (0.47, 2.44) | 0.880 | - | ||
| Histology | Others vs 8500/3 |
|
| 0.56 (0.28, 1.08) | 0.085 |
| Pathological stage | Stage 4 vs 1 |
|
| 1.75 (0.78, 3.93) | 0.200 |
| Stage 4 vs 2 | 2.36 (0.48, 11.5) | 0.290 | 0.8 (0.10, 6.65) | 0.800 | |
| Stage 4 vs 3 | 2.85 (0.72, 11.2) | 0.130 | 1.5 (0.30, 7.47) | 0.600 | |
| LN | LN+ vs LN- |
|
| - | |
| Treatment | CT+RT vs CT only | 0.7 (0.43, 1.14) | 0.150 | - | |
| Others vs CT only | 0.81 (0.51, 1.29) | 0.380 | - | ||
|
| |||||
| Gene cluster 1 | High-risk vs low risk | 1.23 (0.50, 3.03) | 0.650 | - | |
| Gene cluster 2 | High-risk vs low risk |
|
| 1.57 (0.96, 2.58) | 0.075 |
| Age | years |
|
|
|
|
| Sex | Male vs female | 0.81 (0.54, 1.23) | 0.330 | - | |
| Ethnicity | Non-vs hispanic Or Latino | 1.52 (0.47, 4.89) | 0.480 | - | |
| Others vs hispanic Or Latino | 1.52 (0.45, 5.17) | 0.500 | - | ||
| Race | Black/African American vs Asian | 1.23 (0.33, 4.61) | 0.750 | - | |
| Others vs Asian | 1.29 (0.31, 5.41) | 0.730 | - | ||
| White vs Asian | 1.27 (0.51, 3.14) | 0.610 | - | ||
| Histology | Others vs 8500/3 |
|
|
|
|
| Pathological stage | Stage 4 vs 1 |
|
| - | |
| Stage 4 vs 2 | 1.05 (0.13, 8.60) | 0.960 | - | ||
| Stage 4 vs 3 | 2.15 (0.44, 10.5) | 0.350 | - | ||
| LN | LN+ vs LN- |
|
|
|
|
| Treatment | CT+RT vs CT only | 0.71 (0.40, 1.26) | 0.240 | 0.73 (0.41, 1.30) | 0.300 |
| Others vs CT only | 1.52 (0.96, 2.39) | 0.074 |
|
| |
a Adjust-HR is estimated using multivariate Cox regression, the included variables are selected via stepwise a selection procedure.