| Literature DB >> 36010616 |
Ilda Patrícia Ribeiro1,2,3,4, Luísa Esteves1, Francisco Caramelo2,3,5, Isabel Marques Carreira1,2,3,4, Joana Barbosa Melo1,2,3,4.
Abstract
Head and Neck Cancer (HNC) is characterized by phenotypic, biological, and clinical heterogeneity. Despite treatment modalities, approximately half of all patients will die of the disease. Several molecular biomarkers have been investigated, but until now, without clinical translation. Here, we identified an integrative nine-gene multi-omics signature correlated with HNC patients' survival independently of relapses or metastasis development. This prognosis multi-omic signature comprises genes mapped in the chromosomes 1q, 3p, 8q, 17q, 19p, and 19q and encompasses alterations at copy number, gene expression, and methylation. Copy number alterations in LMCD1-A1S and GRM7, the methylation status of CEACAM19, KRT17, and ST18, and the expression profile of RPL29, UBA7, FCGR2C, and RPSAP58 can predict the HNC patients' survival. The difference higher than two years observed in the survival of HNC patients that harbor this nine-gene multi-omics signature can represent a significant step forward to improve patients' management and guide new therapeutic targets development.Entities:
Keywords: copy number alterations; gene expression; head and neck cancer; methylation; omics data; survival biomarkers
Mesh:
Substances:
Year: 2022 PMID: 36010616 PMCID: PMC9406438 DOI: 10.3390/cells11162536
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 7.666
Clinic-pathologic characteristics of study population.
| Patients (n = 410) | |||
|---|---|---|---|
|
|
| ||
|
|
| ||
| Male | 304 (74) | Positive | 79 (19.5) |
| Female | 106 (26) | Negative | 329 (80) |
|
|
| ||
| <60 | 185 (45) | Oral Tongue | 104 (25) |
| ≥60 | 225 (55) | 88 (21) | |
|
| 54 (13) | ||
| Yes | 306 (75) | 45 (11) | |
| No | 96 (23) | 37 (9) | |
|
| 19 (5) | ||
| Yes | 280(68) | 15 (4) | |
| No | 120 (29) | 8 (2) | |
| NA | 10 (2) | 8 (2) | |
|
| 7 (2) | ||
| I | 19 (5) | 2 (0.5) | |
| II | 72 (18) | ||
| III | 82 (20) | ||
| IV | 226 (55) | ||
| NA | 11 (3) | ||
|
| |||
| Yes | 102 (25) | ||
| No | 308 (75) | ||
Figure 1Circos plots representing the copy number alterations (A), methylation (B), and expression level (C) profiles of the genes kept after correlation analysis.
Figure 2Heatmap representing the clusters obtained using the first ten principal components. Two clusters were found: Cluster 1 (C1) and Cluster 2 (C2), which, according to average silhouette coefficient (0.533), present good separation and good cohesion.
Figure 3Kaplan-Meier curves determined for each cluster found using the first ten principal components of the PCA, indicating that patients in Cluster 1 survive, in median, 2 years and 4 months longer than those in Cluster 2 Patients belonging to Cluster 1are represented in grey and Cluster 2 is represented in black.
Means and medians for survival times for the two clusters found in the data.
| Mean | Median | |||||||
|---|---|---|---|---|---|---|---|---|
| 95% Confidence Interval | 95% Confidence Interval | |||||||
| Cluster | Estimate | Std. Error | Lower Bound | Upper Bound | Estimate | Std. Error | Lower Bound | Upper Bound |
| 1 | 2879.300 | 276.379 | 2337.598 | 3421.003 | 2900.000 | 702.434 | 1523.229 | 4276.771 |
| 2 | 2085.443 | 252.901 | 1589.757 | 2581.128 | 2064.000 | 652.289 | 785.513 | 3342.487 |
| Overall | 2633.589 | 214.999 | 2212.191 | 3054.987 | 2319.000 | 378.696 | 1576.757 | 3061.243 |
Survival was determined by the Kaplan-Meier method and is shown in days. Cluster 1 patients survive, in median, up to 2 years and 4 months longer than patients found in Cluster 2.
Evaluation metrics for test set using RF classification in the first ten PCs.
| Minimum | 1st Quantile | Median | Mean | 3rd Quantile | Maximum | |
|---|---|---|---|---|---|---|
|
| 0.8654 | 0.9519 | 0.9615 | 0.9624 | 0.9808 | 1.0000 |
|
| 0.7500 | 0.9038 | 0.9423 | 0.9338 | 0.9615 | 1.0000 |
|
| 0.8846 | 0.9808 | 1.0000 | 0.9910 | 1.0000 | 1.0000 |
Accuracy, sensitivity, and specificity are reported across 5000 runs.
Evaluation metrics for the test sets from SVM and RF classification using the nine gene multi-omics signature.
| Minimum | 1st Quantile | Median | Mean | 3rd Quantile | Maximum | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | SVM | RF | SVM | RF | SVM | RF | SVM | RF | SVM | RF | SVM | RF |
| Accuracy | 0.8654 | 0.8365 | 0.9519 | 0.9423 | 0.9615 | 0.9519 | 0.9624 | 0.9548 | 0.9808 | 0.9712 | 1.0000 | 1.0000 |
| Sensitivity | 0.7500 | 0.6923 | 0.9038 | 0.9231 | 0.9423 | 0.9423 | 0.9338 | 0.9373 | 0.9615 | 0.9615 | 1.0000 | 1.0000 |
| Specificity | 0.8846 | 0.7500 | 0.9808 | 0.9615 | 1.0000 | 0.9808 | 0.9910 | 0.9723 | 1.0000 | 0.9808 | 1.0000 | 1.0000 |
Accuracy, sensitivity, and specificity are reported for 5000 runs. This signature shows high predictive ability, with the mean accuracies being 96% and 95% in the test set, respectively, for SVM and RF.
Figure 4Heatmap representing the genetic profiles of the two clusters, for the nine gene signature. Lower values are represented in blue, whereas red indicates higher values. Two distinct signatures are observable, with a prevalence of lower values in Cluster 2.
AUC and corresponding 95% Confidence Interval (95% CI) for the genes in the multi-omics model.
| 95% CI | |||
|---|---|---|---|
| Gene | AUC | Lower | Higher |
|
| 0.959 | 0.935 | 0.983 |
|
| 0.956 | 0.931 | 0.981 |
|
| 0.658 | 0.605 | 0.711 |
|
| 0.726 | 0.677 | 0.774 |
|
| 0.721 | 0.672 | 0.770 |
|
| 0.717 | 0.667 | 0.767 |
|
| 0.723 | 0.674 | 0.772 |
|
| 0.767 | 0.721 | 0.813 |
|
| 0.703 | 0.651 | 0.755 |
AUC: Area under the ROC curve. Good individual separation capability for patient survival is shown for the nine genes, with AUC higher than 65%.
Optimum cut-off points determined for each gene included in the multi-omics signature.
|
|
|
|
|
|
|
|
|
|
|
|
| −0.150 | −0.150 | 6885.475 | 7082.378 | 33.152 | 618.547 | 0.625 | 0.344 | 0.573 |
Variables included in the logistic regression model and respective coefficients determined by the model.
| B | S.E. | ORadj | 95% CI ORadj | |||
|---|---|---|---|---|---|---|
| Lower | Upper | |||||
|
| 2.209 | 0.668 | 0.001 | 9.105 | 2.456 | 33.749 |
|
| 1.729 | 0.596 | 0.004 | 5.635 | 1.752 | 18.120 |
|
| 7.528 | 0.896 | <0.001 | 1859.334 | 321.161 | 10,764.458 |
|
| 2.668 | 0.812 | 0.001 | 14.405 | 2.935 | 70.707 |
| Constant | −7.116 | 1.138 | <0.001 | 0.001 | ||
B stands for the regression coefficient, S.E. for standard error and ORadj stands for the adjusted odds ratio. The copy number alterations present in LMCD1-AS1, methylation status of CEACAM19 as well as the expression profiles of RPL29 and FCGR2C are statistically significant for the distinction between survival groups (p-value < 0.05).
Figure 5Kaplan-Meier survival curves for the group that developed metastasis (A) and for the group that did not present metastasis during the time of follow-up (B). Patients belonging to Cluster 1 are represented in grey and Cluster 2 is represented in black. Patients that did not develop metastasis or relapse in Clusters 1 and 2 present well distinguished survival profiles, as opposed to patients that did.