| Literature DB >> 27433284 |
Annette M Lim1,2, Nicholas C Wong3, Ruth Pidsley4, Elena Zotenko4, June Corry5, Alexander Dobrovic6,7,8, Susan J Clark4, Danny Rischin9, Benjamin Solomon9.
Abstract
BACKGROUND: DNA methylation profiling of heterogeneous head and neck squamous cell carcinoma (HNSCC) cohorts has been reported to predict patient outcome. We investigated if a prognostic DNA methylation profile could be found in tumour tissue from a single uniform subsite, the oral tongue. The methylation status of 109 comprehensively annotated oral tongue squamous cell carcinoma (OTSCC) formalin-fixed paraffin-embedded (FFPE) samples from a single institution were examined with the Illumina HumanMethylation450K (HM450K) array. Data pre-processing, quality control and analysis were performed using R packages. Probes mapping to SNPs, sex chromosomes and unreliable probes were accounted for prior to downstream analyses. The relationship between methylation and patient survival was examined using both agnostic approaches and feature selection. The cohort was enlarged by incorporation of 331 The Cancer Genome Atlas (TCGA) HNSCC samples, which included 91 TCGA OTSCC samples with HM450K and survival data available.Entities:
Keywords: DNA methylation; Epigenetics; HM450K; Oral carcinoma; Profiling; Survival; Tongue
Mesh:
Year: 2016 PMID: 27433284 PMCID: PMC4948090 DOI: 10.1186/s13148-016-0235-0
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Patient and tumour characteristics [22]
| Number | Percent | |
|---|---|---|
| Gender | ||
| Female | 48 | 37 |
| Male | 83 | 63 |
| Age at diagnosis (in years) | ||
| Mean | 57.8 | |
| Standard deviation | 15.1 | |
| Median | 60 | |
| Range | 21–93 | |
| <40 years | 15 | 11 |
| 40–49 years | 26 | 20 |
| 50–59 years | 24 | 18 |
| 60–69 years | 32 | 24 |
| 70–79 years | 27 | 21 |
| 80+ years | 7 | 5 |
| Stage | ||
| 1 | 37 | 28 |
| 2 | 37 | 28 |
| 3 | 15 | 11 |
| 4 | 42 | 32 |
| T category | ||
| 1 | 43 | 33 |
| 2 | 51 | 39 |
| 3 | 14 | 11 |
| 4 | 23 | 18 |
| N category | ||
| 0 | 83 | 63 |
| 1 | 14 | 11 |
| 2 | 34 | 26 |
| Smoking history | ||
| No/never | 39 | 30 |
| Yes | 88 | 67 |
| Unknown | 4 | |
| Alcohol >20 g/day | ||
| No/never/social | 81 | 62 |
| Past or current | 46 | 35 |
| Unknown | 4 | |
| ECOG performance status | ||
| 0 | 76 | 58 |
| 1 | 40 | 31 |
| 2 | 9 | 7 |
| 3 | 4 | 3 |
| Unknown | 2 |
Fig. 1Workflow for bioinformatics processing of raw data
Fig. 2Plot of median beta values of the combined OTSCC cohort. A plot of the median beta values for the combined cohort of 174 OTSCC samples with outcome data, with the blue line indicating the overall median value for the group. A bimodal distribution is observed, without evidence of a hypermethylated group of samples
Fig. 3Plot of the average silhouette value according to the number of clusters. This was assessed on RnBeads using an average linkage-based algorithm. The combined cohort demonstrates no distinct clustering according to observed methylation values. A silhouette value of “1” indicates good classification of the observation into the cluster, a value of “0” indicates that the observation lies independent of the groupings, whilst a negative value indicates poor classification and likely incorrect grouping
Fig. 4Prediction performance plots for overall survival. a Prediction performance plot for overall survival of the combined oral tongue cohort of the mean value derived from the machine learning method RFE-SVM using a range of parameters (lambda range 0.01 with a poly kernel). Error bars represent the range of values obtained across the three-fold cross validation performed. The prediction performance ranged between 0.5 and 0.58 indicating a poor performance. b Representative prediction performance plot for overall survival of the combined oral tongue cohort using CFS. The training set demonstrates a decrease in performance with an increasing number of probes utilised, and the test model derived from the training set shows poor performance. Error bars represent the range of values across the three-fold cross validation performed. For both plots, a performance of 1.0 represents a 100 % chance of being able to classify groups according to the nominated variable, according to the number of probes interrogated. A prediction performance of 0.5 represents a capability of no better than chance