| Literature DB >> 28852427 |
David C Christiani1,2, Sipeng Shen3,1,2, Guanrong Wang4, Qianwen Shi3, Ruyang Zhang3,1, Yang Zhao3,1, Yongyue Wei3,1, Feng Chen3,1,5,6.
Abstract
BACKGROUND: DNA methylation has started a recent revolution in genomics biology by identifying key biomarkers for multiple cancers, including oral squamous cell carcinoma (OSCC), the most common head and neck squamous cell carcinoma.Entities:
Keywords: Gene expression; Methylation; Oral squamous cell carcinoma; Overall survival; Prognostic signature
Mesh:
Year: 2017 PMID: 28852427 PMCID: PMC5571486 DOI: 10.1186/s13148-017-0392-9
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Demographic and clinical characteristics of OSCC patients
| Characteristic | Training set ( | Validation set 1a ( | Validation set 2 ( |
|---|---|---|---|
| Censor rate | 66.4% | 71.9% | 71.6% |
| Age, median years (range) | 61.0 (19–90) | 58.0 (23–85) | 45.0 (28–79) |
| Gender, | |||
| Male | 206 (65.8) | 36 (43.9) | 11 (20.7) |
| Female | 107 (34.2) | 46 (56.1) | 42 (79.3) |
| Smoking status, | |||
| Never | 87 (27.8) | 30 (36.6) | – |
| Current/former | 217 (69.3) | 44 (53.7) | – |
| NA | 9 (2.9) | 8 (9.8) | – |
| Race, | |||
| White | 272 (86.9) | 79 (96.3) | – |
| Other | 31 (9.9) | 3 (3.7) | – |
| NA | 10 (3.2) | 0 (0) | – |
| HPV status, | |||
| Positive | 14 (4.5) | 9 (11.0) | 22 (41.5) |
| Negative | 176 (56.2) | 64 (78.0) | 16 (30.2) |
| NA | 123 (39.3) | 9 (11.0) | 15 (28.3) |
| TNM stage, | |||
| Early (I–II) | 88 (28.1) | 48 (58.5) | 18 (34.0) |
| Advanced (III–IV) | 218 (69.6) | 34 (41.5) | 35 (66.0) |
| NA | 7 (2.2) | 0 (0) | 0 (0) |
NA not available
aBaseline information of validation set 1 is collected from [19]
Fig. 1Flow chart indicating study design. We identified candidate CpG sites from 32 paired OSCC and adjacent non-tumor tissues by methylation 450k assay in the discovery set. Then, we excluded a large proportion of CpG sites that were unrelated to survival and developed prognostic scores by SIS. The seven-CpG-based classifier was validated in two independent datasets. Relationships between methylation and gene expression were also analyzed in the training dataset
Fig. 2Construction of the seven-CpG-based classifier. a Circos plot of epigenome-wide DNA methylation CpG sites. Results are presented as P values ordered by genomic position, including paired t test of the discovery set (green and red symbols) and univariable Cox regression analysis of the training set (orange and blue symbols). b Volcano plot comparing CpG methylation for OSCC tumor and non-tumor tissues. A total of 1490 CpG sites had an absolute value of differential methylation of > 0.4 and a paired t test P value of < 1 × 10−7 (blue dots). c Heatmap showing methylation of 15 CpG sites in tumor tissues and adjacent non-tumor tissues. d Coefficients of CpG sites calculated by univariate Cox regression and sure independence screening (SIS). After SIS selection, seven probes remained non-zero coefficients
Fig. 3Prognostic signature and OSCC patient survival. Left panels show Kaplan-Meier survival analyses of patients, which are categorized into low-risk and high-risk groups using a cutoff value of 0.02, for the a training set, b validation set 1, and c validation set 2. P values were calculated using log-rank test, and HR indicates hazard ratio. Right panels show time-dependent ROC curves of different months used to evaluate patient survival, with risk score using the nearest neighbor method
Fig. 4Subgroup and stratification analysis of the seven-CpG-based signature. Subgroup analysis for HPV+ cases (a) and HPV− cases (b) in the imputed combined dataset. c Kaplan-Meier curves plotting overall survival of the combined three datasets for respective prognostic score categories. d Subgroup analysis with clinical stage of the combined training set and validation set 2
Clinical characteristics of the training set with both methylation and mRNA data
| Characteristic | Subset of training set with both DNA methylation and mRNA expression data ( |
|---|---|
| Censor rate | 66.9% |
| Age, median years (range) | 61.0 (19–90) |
| Gender, | |
| Male | 206 (66.9) |
| Female | 102 (33.1) |
| Smoking status, | |
| Never | 84 (27.3) |
| Current/former | 216 (70.1) |
| NA | 9 (2.6) |
| Race, | |
| White | 267 (86.7) |
| Black or African American | 20 (6.5) |
| Asian | 10 (3.2) |
| American Indian or Alaska Native | 1 (0.3) |
| NA | 10 (3.2) |
| HPV status, | |
| Positive | 13 (4.2) |
| Negative | 175 (56.8) |
| NA | 120 (39.0) |
| TNM stage, | |
| I | 12 (3.9) |
| II | 75 (24.4) |
| III | 64 (20.8) |
| IV | 150 (48.7) |
| NA | 7 (2.3) |
| Grade, | |
| G1 | 48 (15.6) |
| G2 | 193 (62.7) |
| G3 | 63 (20.5) |
| NA | 4 (1.3) |
NA not available
Fig. 5Association between gene expression and methylation. Left panels show correlation of a AJAP1, b SHANK2, c FOXA2, d MT1A, e ZNF570, f HOXC4, and g HOXB4 expression (X-axis) with methylation (Y-axis). Right panels show Kaplan-Meier survival plots of gene expression from the TCGA cohort. HR indicates hazard ratio. Correlation coefficients and hypothesis tests were based on Spearman rank correlation tests. Patients were categorized into high-risk and low-risk groups by an optimum cutoff point according to the highest χ 2 value. h ROC curves for expression of the seven genes (left) and combinations of different types of data (right), including clinical characteristics (Clin), gene expression (Exp), and methylation (Methy)
Fig. 6Mediation analysis for methylation prognostic signature through mRNA expression. a Diagram of mediation model. b Methylation signature from the seven CpG sites was treated as “exposure”; mediator was the linear combination of the corresponding seven genes’ expression level (scoreexpression) (Overall model). Total prognostic effect in hazard ratio (HR) were described as direct effect (HRdirect), indirect effect (HRindirect), corresponding 95% confidence interval (95% CI), and the proportion of effect mediated (M%). Further, sensitivity analyses were performed by excluding each gene from scoreexpression, respectively, which retained statistical significance for mediation effect