| Literature DB >> 35629083 |
John J Milner1, Zhao-Feng Chen2, James Grayson3, Shyang-Yun Pamela Koong Shiao4.
Abstract
Obesity with adiposity is a common disorder in modern days, influenced by environmental factors such as eating and lifestyle habits and affecting the epigenetics of adipose-based gene regulations and metabolic pathways in colorectal cancer (CRC). We compared epigenetic changes of differentially methylated regions (DMR) of genes in colon tissues of 225 colon cancer cases (154 non-obese and 71 obese) and 15 healthy non-obese controls by accessing The Cancer Genome Atlas (TCGA) data. We applied machine-learning-based analytics including generalized regression (GR) as a confirmatory validation model to identify the factors that could contribute to DMRs impacting colon cancer to enhance prediction accuracy. We found that age was a significant predictor in obese cancer patients, both alone (p = 0.003) and interacting with hypomethylated DMRs of ZBTB46, a tumor suppressor gene (p = 0.008). DMRs of three additional genes: HIST1H3I (p = 0.001), an oncogene with a hypomethylated DMR in the promoter region; SRGAP2C (p = 0.006), a tumor suppressor gene with a hypermethylated DMR in the promoter region; and NFATC4 (p = 0.006), an adipocyte differentiating oncogene with a hypermethylated DMR in an intron region, are also significant predictors of cancer in obese patients, independent of age. The genes affected by these DMR could be potential novel biomarkers of colon cancer in obese patients for cancer prevention and progression.Entities:
Keywords: biomarkers; colon cancer; differentially methylated regions; generalized regression; obesity
Year: 2022 PMID: 35629083 PMCID: PMC9142939 DOI: 10.3390/jpm12050660
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Demographic characteristics of the sample cases.
| Cancer | |||
|---|---|---|---|
| Mean ± SD | Control | Non-Obese | Obese |
| BMI, Kg/m2 | 25.5 ± 2.7 | 24.9 ± 3.1 * | 36.1 ± 2.8 * |
| Age, years | 81.7 ± 13.7 | 74.0 ± 15.1 * | 72.0 ± 12.0 * |
| Gender Female (%) | 9 (60) | 67 (43.5) | 37 (52) |
| Race White (%) | 13 (87) | 116 (75) | 49 (69) |
| Black (%) | 2 (13) | 30 (19) | 21 (30) |
| Other (%) | 0 | 8 (6) | 1 (1) |
Notes. BMI: Body Mass Index; * indicates p < 0.05.
Summary of differentially methylated regions (DMR) with unique protein-coding genes per group comparison.
| Groups | Non-Obese Cancer/Control | Obese Cancer/Control | Obese Cancer/Non-Obese Cancer | |||
|---|---|---|---|---|---|---|
| Differential Methylation | Hyper | Hypo | Hyper | Hypo | Hyper | Hypo |
| 5% | 4270 | 3744 | 4203 | 4073 | 178 | 340 |
| 10% | 2967 | 1644 | 2876 | 1909 | 25 | 48 |
| 15% | 2248 | 637 | 2173 | 828 | 6 | 10 |
Notes. Hypo refers to hypomethylated; Hyper refers to hypermethylated.
Genes associated with top hypomethylated differentially methylated regions (DMR) between obese and non-obese cancer groups.
| DMR | Dis to TSS | DNAm | Gene Function | |||
|---|---|---|---|---|---|---|
| Gene | # CpG | Region | Non-Obese | Obese | ||
|
| 25 | Promoter | 860 | 8.61 | 6.80 | DNA repair, MMR |
|
| 25 | Promoter | 701 | 8.61 | 6.80 | DNA repair, MMR |
|
| 18 | Promoter | −279 | 14.47 | 11.48 | Oncogene |
|
| 11 | Promoter | −24 | 17.43 | 14.47 | Oncogene |
|
| 3 | Intron | −593 | 9.93 | 8.26 | GTP binding |
|
| 13 | Promoter | 210 | 5.49 | 4.59 | Calcium homeostasis |
|
| 4 | Promoter | 109 | 10.32 | 8.63 | Oncogene |
|
| 4 | Intron | −10,235 | 15.79 | 13.38 | Inhibits Wnt pathway |
|
| 13 | Promoter | 107 | 8.03 | 6.82 | Suppresses adipogenesis |
|
| 13 | Promoter | 59 | 12.02 | 10.26 | Oncogene |
Notes: # CpG—number of methylated CpG sites; Dis to TSS—distance (in base pairs) to transcription start site from DMR start; DNAm—mean methylation percent; ^ represents genes that can be annotated to adiposity or glucose-related functions, † to cancer-related functions, and ‡ to both cancer and adipose/glucose-related functions.
Genes associated with top hypermethylated differentially methylated regions (DMR) between obese and non-obese cancer groups.
| DMR | Dis to TSS | DNAm | Gene Function | |||
|---|---|---|---|---|---|---|
| Gene | # CpG | Region | Non-Obese | Obese | ||
|
| 12 | Promoter | 107 | 7.86 | 9.74 | Protein metabolism |
|
| 9 | Promoter | 540 | 3.44 | 4.12 | Immune response |
|
| 11 | Promoter | 107 | 21.70 | 25.80 | Transcription regulation |
|
| 8 | Intron | −466 | 9.98 | 11.52 | Oncogene |
|
| 3 | Promoter | 31 | 2.81 | 3.24 | Transcription regulation |
|
| 9 | CDS | 3021 | 29.19 | 33.34 | Oncogene |
|
| 9 | Promoter | 879 | 29.19 | 33.34 | Tumor Suppression Gene |
|
| 3 | Promoter | 445 | 14.28 | 16.10 | Mediator of inflammation |
|
| 7 | Promoter | 188 | 7.18 | 8.07 | Transcription regulation |
|
| 3 | Intron | 2448 | 6.36 | 7.14 | Oncogene, immune response |
Notes: # CpG—number of methylated CpG sites; CDS—coding DNA sequence; Dis to TSS—distance (in base pairs) to transcription start site from DMR start; DNAm—mean methylation percent; ^ represents genes that can be annotated to adiposity or glucose-related functions, † to cancer-related functions, and ‡ to both cancer and adipose/glucose-related functions.
Figure 1Methylated genes from the generalized regression model. Notes. Genes with age interaction show mean values by age as trendlines; genes without interaction with age show overall mean trendline. Hypomethylation and hypermethylation percent indicates change between obese and non-obese groups. Genes are ordered (a–f) based on total effect from generalized regression model; ## indicates p < 0.05 for cancer group to control; ** indicates p < 0.05 between obese and non-obese cancer.
Predictors of obesity-associated differentially methylated regions in colon cancer.
| Generalized Regression Adaptive Elastic Net | ||||||
|---|---|---|---|---|---|---|
| Logistic Regression | Leave-One-Out | Validation Column | ||||
| Parameters | Estimate | Estimate | Estimate | |||
| Intercept | −0.1997 | 0.738 | 0.0114 | 0.984 | −0.2869 | 0.613 |
| Age (≥76) | −2.4211 | 0.004 | −2.3076 | 0.003 | −2.2997 | 0.004 |
| 1.1541 | 0.003 | 1.2026 | 0.001 | 1.1243 | 0.002 | |
| −1.1034 | 0.046 | −1.3256 | 0.006 | −1.1133 | 0.027 | |
| −1.2821 | 0.026 | −1.4779 | 0.006 | −1.2355 | 0.025 | |
| Age * | 1.7993 | 0.020 | 1.8545 | 0.008 | 1.7343 | 0.024 |
| Age * | 1.3051 | 0.078 | 1.1752 | 0.078 | 1.3065 | 0.069 |
| Age * | 1.0252 | 0.163 | 1.1410 | 0.084 | 0.9161 | 0.081 |
| Age * | 1.0565 | 0.153 | 1.1584 | 0.088 | 1.0062 | 0.168 |
| −0.3225 | 0.401 | −0.6169 | 0.089 | −0.3119 | 0.419 | |
| −0.2551 | 0.637 | −0.2889 | 0.550 | −0.2036 | 0.702 | |
| −0.0984 | 0.856 | 0.0115 | 0.979 | 0.0000 | 1.000 | |
| Misclassification rate | 0.290 | - | 0.277 | - | 0.290 | - |
| AICc | 76.63 | - | - | - | 71.067 | - |
| Area under the curve | 0.741 | - | 0.757 | - | 0.739 | - |
Notes—data not available; AICc—Akaike’s information criterion with corrections; * denotes interaction. † represents genes that can be annotated to cancer-related functions and ‡ to both cancer and adipose/glucose-related functions.
Figure 2Receiver operating characteristic (ROC) curve and area under the curve (AUC) for logistic and generalized regression with adaptive elastic net. (a) represents logistic regression, (b) is adaptive elastic net with validation column, and (c) is adaptive elastic net with leave-one-out validation.