| Literature DB >> 31834866 |
DaPeng Li1, Lei Zhang1, YuPeng Liu1, HongRu Sun1, Justina Ucheojor Onwuka1, ZhiGang Zhao2, WenJing Tian1, Jing Xu1, YaShuang Zhao1, HongYu Xu3.
Abstract
The early diagnosis and accurate prognosis prediction of esophageal cancer is an essential part of improving survival. However, these diseases lack effective and specific markers. A total of 1,744 samples of HumanMethylation450 data were integrated to identify and validate specific methylation markers for esophageal adenocarcinoma (EAC) and esophageal squamous cell carcinoma (ESCC) as well as for Barrett's esophagus (BE) using The Cancer Genome Atlas and the Gene Expression Omnibus. The diagnostic and prognostic methylation classifiers were constructed by moderated t-statistics and the least absolute shrinkage and selection operator method. The diagnostic methylation classifier using 12 CpG sites was constructed in training set (377 samples) that could effectively discriminate samples of BE, EAC, and ESCC from normal tissue (AUC = 0.992), which achieved highly predictive ability in both internal (187 samples, AUC = 0.990) and external validation (184 samples, AUC = 0.978). The prognostic methylation classifier with 3 CpG and 2 CpG sites for EAC and ESCC respectively, could accurately estimate the prognosis of an individual patient and improved the predictive ability of the tumor node metastasis staging system. Overall, our study systematically analyzed large-scale methylation data and provided promising markers for the diagnosis and prognosis of esophageal cancer.Entities:
Keywords: Barrett’s esophagus; DNA methylation; LASSO; biomarker; esophageal cancer
Year: 2019 PMID: 31834866 PMCID: PMC6932928 DOI: 10.18632/aging.102569
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Genomic information of 12 CpG sites for diagnostic methylation classifier.
| cg10078335 | CAPN10 | chr2 | 241535845 | Island | Body |
| cg13257812 | NA | chr3 | 27525884 | Island | NA |
| cg04607372 | NA | chr5 | 54523900 | Island | NA |
| cg13441766 | NA | chr5 | 134376442 | Island | NA |
| cg13927501 | TRIM31 | chr6 | 30079090 | OpenSea | Body |
| cg08858649 | TRIM15 | chr6 | 30139903 | Island | Body |
| cg18080046 | CLIC1 | chr6 | 31704844 | N_Shelf | TSS1500 |
| cg14534279 | NA | chr10 | 3329966 | OpenSea | NA |
| cg06966660 | TACC2 | chr10 | 123923066 | Island | Body |
| cg08436756 | SHANK2 | chr11 | 70781118 | OpenSea | Body |
| cg01025720 | ATP11A | chr13 | 113346439 | S_Shore | Body |
| cg03474687 | XRCC3 | chr14 | 104179160 | N_Shelf | 5'UTR |
Confusion matrix of training set.
| NSE | 2 | 3 | 1 | 96 | |
| BE | 2 | 4 | 0 | 70 | |
| EAC | 3 | 3 | 2 | 151 | |
| ESCC | 1 | 0 | 2 | 60 | |
| Correct | 90 | 64 | 143 | 57 | 354 |
| Total | 96 | 69 | 152 | 60 | 377 |
| Accuracy rate (%) | 93.75 | 92.75 | 94.08 | 95.00 | 93.90 |
Figure 1Diagnostic methylation classifier can differentiate for NSE, BE, EAC, and ESCC. Unsupervised hierarchical clustering and heatmap of 12 methylation markers selected for constructing diagnostic methylation classifier in (A) training (N=377), (C) test (N=187), and (E) validation set (N=184). ROC curve showing the high AUC in predicting four tissue types in (B) training, (D) test, and (F) validation set.
Confusion matrix of test set.
| NSE | 2 | 1 | 0 | 48 | |
| BE | 1 | 2 | 0 | 32 | |
| EAC | 1 | 3 | 1 | 76 | |
| ESCC | 0 | 0 | 2 | 31 | |
| Correct | 45 | 29 | 71 | 29 | 174 |
| Total | 47 | 34 | 76 | 30 | 187 |
| Accuracy rate (%) | 95.74 | 85.29 | 93.42 | 96.67 | 93.05 |
Confusion matrix of validation set.
| NSE | 4 | 0 | 0 | 64 | |
| BE | 4 | 1 | 1 | 65 | |
| EAC | 1 | 6 | 7 | 36 | |
| ESCC | 1 | 0 | 0 | 19 | |
| Correct | 60 | 59 | 22 | 18 | 159 |
| Total | 66 | 69 | 23 | 26 | 184 |
| Accuracy rate (%) | 90.91 | 85.51 | 95.65 | 69.23 | 86.41 |
Genomic information of CpG sites for prognostic methylation classifier.
| cg01192745 | NA | chr3 | 31239040 | OpenSea | NA |
| cg19801256 | ITGA1 | chr5 | 52166469 | OpenSea | Body |
| cg18276155 | MCC | chr5 | 112504356 | OpenSea | Body |
| cg14387626 | NA | chr14 | 106331803 | N_Shore | NA |
| cg04777726 | PLEKHA4 | chr19 | 49340489 | Island | 3'UTR |
Figure 2Prognostic methylation classifier can predict overall survival of EAC and ESCC. Waterfall plots show the risk scores of prognostic methylation classifier between high-risk and low risk patients for (A) EAC and (B) ESCC. The dash lines represent the median of the risk score. Kaplan-Meier curves were used of overall survival in high and low risk groups for (C) EAC and (D) ESCC. The cutoff values for the high and low risk groups were based on the median of the risk score.
Univariate and multivariate Cox regression analysis of the 3-CpG prognostic methylation classifier and clinical factors with overall survival of EAC.
| Age (> 60 vs ≤60) | 0.986(0.962-1.009) | 0.2283 | 0.986(0.960-1.014) | 0.3275 | |
| Gender (male vs female) | 0.847(0.299-2.400) | 0.7553 | 0.552(0.158-1.928) | 0.3520 | |
| BMI (> 25 vs ≤25) | 1.019(0.983-1.058) | 0.3056 | 1.063(1.015-1.112) | ||
| Smoking (yes vs no) | 1.133(0.606-2.117) | 0.6953 | 1.317(0.681-2.547) | 0.4138 | |
| Alcohol use (yes vs no) | 0.516(0.281-0.948) | 0.616(0.304-1.245) | 0.1771 | ||
| Tumor stage (III/IV vs I/II) | 2.238(1.151-4.351) | 2.028(0.930-4.420) | 0.0753 | ||
| Methylation classifier (high vs low risk) | 5.661(2.639-12.145) | 5.164(2.199-12.130) | |||
Univariate and multivariate Cox regression analysis of the 2-CpG prognostic methylation classifier and clinical factors with overall survival of ESCC.
| Age (> 60 vs ≤60) | 1.763(0.826-3.765) | 0.1428 | 1.536(0.681-3.463) | 0.3011 | |
| Gender (male vs female) | 10.290(1.358-78.001) | 3.508(0.407-30.207) | 0.2533 | ||
| BMI (> 25 vs ≤25) | 0.727(0.283-1.868) | 0.5082 | 1.211(0.453-3.240) | 0.7031 | |
| Smoking (yes vs no) | 2.113(0.939-4.754) | 0.0706 | 1.257(0.526-3.003) | 0.6066 | |
| Alcohol use (yes vs no) | 2.169(0.750-6.277) | 0.1530 | 4.562(1.311-15.880) | ||
| Tumor stage (III/IV vs I/II) | 2.987(1.432-6.230) | 1.980(0.873-4.492) | 0.1020 | ||
| Methylation classifier (high vs low risk) | 7.354(2.962-18.257) | 6.603(2.407-18.116) | |||
Figure 3Risk stratification combining prognostic methylation classifier and tumor stage in relation to overall survival of EAC and ESCC. Kaplan-Meier curves of four risk levels for (A) EAC and (B) ESCC. Multivariate Cox model of four risk levels for (C) EAC and (D) ESCC adjusting for age, gender, BMI, smoking, and alcohol use.