| Literature DB >> 32448271 |
Abstract
BACKGROUND: This study aims to identify a predictive model to predict survival outcomes of osteosarcoma (OS) patients.Entities:
Keywords: Differentially expressed genes; Osteosarcoma; Prognosis; Risk score; Support vector machine
Mesh:
Substances:
Year: 2020 PMID: 32448271 PMCID: PMC7245838 DOI: 10.1186/s12885-020-06741-4
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1The flow chart of the whole analysis in this study
Fig. 2Volcano plot and heatmap clustering of differentially expressed genes (DEGs). a: Volcano plot of DEGs. The green nodes represent DEGs; the red horizontal dashed lines show the false discovery rate (FDR) value is less than 0.05 and the red vertical dashed lines indicate the value of |log2 fold change (FC)| is more than 0.5. b: Heatmap clustering of DEGs. The white bars represent metastatic osteosarcoma samples and black bars represent non-metastatic osteosarcoma samples
Fig. 3Screening of the optimal prognostic gene set for osteosarcoma and the receiver operating characteristic (ROC) curve of SVM classification. a: The identification of optimal prognostic gene set for osteosarcoma based on the recursive feature elimination algorithm. The horizontal axis shows the number of differentially expressed genes and the vertical axis represents the cross-validation accuracy. b: The ROC curve of SVM classification in training dataset. c: The ROC curve of SVM classification in validation dataset
The performance evaluation of a SVM classifier in training and validation dataset
| ROC | |||||
|---|---|---|---|---|---|
| Datasets | AUROC | Sensitivity | Specificity | PPV | NPV |
| Training set (TCGA, | 0.969 | 0.915 | 0.884 | 0.741 | 0.966 |
| Validation set (GSE21257, | 0.907 | 0.857 | 0.778 | 0.882 | 0.737 |
SVM Support vector machine, ROC Receiver operating characteristic, AUROC Area under the receiver operating characteristic curve, PPV Positive predictive value, NPV Negative predictive value
The list of independent prognostic feature genes
| Gene | coef | Hazard Ratio | 95%CI | |
|---|---|---|---|---|
| KCNJ15 | 0.0501 | 2.220E-03 | 1.0513 | 1.0182–1.0856 |
| SLC24A4 | − 0.392 | 4.840E-03 | 0.6757 | 0.5145–0.8875 |
| ASPA | 0.0661 | 7.220E-03 | 1.0683 | 1.0180–1.1211 |
| REM1 | −0.0633 | 2.152E-02 | 0.9386 | 0.8893–0.9907 |
| SCARA5 | −0.024 | 2.287E-02 | 0.9763 | 0.9564–0.9967 |
| LANCL3 | 0.143 | 3.370E-02 | 1.1533 | 1.0111–1.3155 |
| CPA6 | 0.0522 | 3.787E-02 | 1.0536 | 1.0029–1.1067 |
| TRH | 0.0592 | 4.298E-02 | 1.0610 | 1.0019–1.1236 |
Coef Coefficient derived from multiivariate cox regression analysis, 95%CI 95% confidence interval
Fig. 4Kaplan–Meier survival analysis in training and validation sets. a: The KM curve based on RS and survival outcomes in training dataset. b: The KM curve based on RS and survival outcomes in validation dataset. HR: hazard ratio; C-index: Harrell concordance index; B-score: Brier score. The red and blue lines respectively represent high risk samples and low risk samples
The univariables and multi-variables cox regression of clinical parameters and survival outcomes of patients with osteosarcoma
| Clinical characteristics | TCGA( | Uni-variables cox | Multi-variables cox | ||||
|---|---|---|---|---|---|---|---|
| HR | 95%CI | HR | 95%CI | ||||
| Age(years, mean ± sd) | 61.01 ± 15.23 | 1.018 | 0.999–1.036 | 5.19E-02 | – | – | – |
| Gender(Male/Female) | 72/104 | 1.102 | 0.666–1.827 | 7.05E-01 | – | – | – |
| Pathologic tumor depth(years, mean ± sd) | 6.35 ± 3.69 | 1.136 | 0.951–1.227 | 9.05E-02 | – | – | – |
| Pathologic tumor length(years, mean ± sd) | 11.76 ± 7.25 | 1.061 | 0.993–1.092 | 5.91E-02 | – | – | – |
| Pathologic tumor width(years, mean ± sd) | 8.78 ± 5.51 | 1.091 | 0.940–1.143 | 2.17E-01 | – | – | – |
| Tumor multifocal(Yes/No/−) | 34/132/10 | 1.685 | 0.949–2.991 | 7.13E-02 | – | – | – |
| Tumor recurrence(Yes/No/−) | 28/141/7 | 2.692 | 1.581–4.585 | 1.48E-04 | 1.626 | 0.914–2.893 | 9.80E-02 |
| Tumor metastatic(Yes/No/−) | 58/118 | 2.898 | 1.754–4.789 | 1.36E-05 | 1.879 | 1.079–3.274 | 2.58E-02 |
| Radiotherapy(Yes/No/−) | 64/110/2 | 0.799 | 0.474–1.347 | 3.99E-01 | – | – | – |
| Tumor necrosis(No/Slight/Moderate/Severe/−) | 61/35/59/10/11 | 1.191 | 0.925–1.530 | 1.75E-01 | – | – | – |
| RS model status(High/ Low) | 88/88 | 3.13 | 1.824–5.370 | 1.28E-05 | 2.288 | 1.291–4.057 | 4.60E-03 |
| Dead(Death/Alive/−) | 63/113 | – | – | – | – | – | – |
| Overall survival time(months, mean ± sd) | 39.76 ± 32.24 | – | – | – | – | – | – |
SD Standard deviation, HR Hazard ratio, 95%CI 95% confidence interval, TCGAThe Cancer Genome Atlas
Fig. 5Kaplan–Meier survival analysis and prognostic nomogram model. a: Kaplan-Meier curve comparing the survival rate between patients with and without osteosarcoma metastasis in TCGA cohort. b: Kaplan-Meier curve comparing the survival rate between patients with and without tumor metastasis from high risk group in TCGA cohort. c: Kaplan-Meier curve comparing the survival rate between patients with and without tumor metastasis from low risk group in TCGA cohort. The blue and red curves respectively represent metastatic osteosarcoma and non-metastatic osteosarcoma samples. d: The nomogram prediction for overall survival probability at 3- and 5-year for osteosarcoma patients in TCGA cohort. e: The calibration curve of nomogram to predict the probability of overall survival at 3, and 5 years for osteosarcoma patients in TCGA cohort; the X axis represents the predicted actual overall survival while the Y axis represents the actual overall survival. TCGA The Cancer Genome Atlas
Fig. 6The differentially expressed genes (DEGs) between high and low risk groups in TCGA cohort. a: Volcano plot of DEGs. The green nodes represent DEGs and black color showed non-DEGs. b: The heatmap of DEGs. The expression changes from low to high expression levels with risk score. TCGA The Cancer Genome Atlas
The functional analyses of survival-related genes
| Category | Term | Count | FDR | |
|---|---|---|---|---|
| GO-BP | GO:0002250~adaptive immune response | 28 | 6.430E-13 | 1.560E-09 |
| GO:0050776~regulation of immune response | 28 | 5.700E-11 | 6.900E-08 | |
| GO:0070098~chemokine-mediated signaling pathway | 18 | 1.450E-10 | 1.170E-07 | |
| GO:0006955~immune response | 41 | 3.820E-09 | 1.850E-06 | |
| GO:0042110~T cell activation | 14 | 3.170E-09 | 1.920E-06 | |
| GO:0006935~chemotaxis | 19 | 1.430E-07 | 5.750E-05 | |
| GO:0006954~inflammatory response | 35 | 2.300E-07 | 7.950E-05 | |
| GO:0006968~cellular defense response | 13 | 9.040E-07 | 2.730E-04 | |
| GO:0007267~cell-cell signaling | 26 | 1.760E-06 | 4.740E-04 | |
| GO:0007166~cell surface receptor signaling pathway | 27 | 2.180E-06 | 5.280E-04 | |
| GO:0071346~cellular response to interferon-gamma | 11 | 1.840E-05 | 4.037E-03 | |
| GO:0042102~positive regulation of T cell proliferation | 11 | 2.930E-05 | 5.894E-03 | |
| GO:0007204~positive regulation of cytosolic calcium ion concentration | 16 | 4.730E-05 | 8.769E-03 | |
| GO:0035589~G-protein coupled purinergic nucleotide receptor signaling pathway | 6 | 6.700E-05 | 1.152E-02 | |
| GO:0043547~positive regulation of GTPase activity | 38 | 9.630E-05 | 1.446E-02 | |
| GO:0070374~positive regulation of ERK1 and ERK2 cascade | 18 | 9.260E-05 | 1.484E-02 | |
| GO:0007155~cell adhesion | 32 | 2.060E-04 | 2.894E-02 | |
| GO:0007165~signal transduction | 63 | 2.210E-04 | 2.926E-02 | |
| GO:0048247~lymphocyte chemotaxis | 7 | 2.880E-04 | 3.606E-02 | |
| GO:0042472~inner ear morphogenesis | 9 | 3.260E-04 | 3.873E-02 | |
| GO:0050850~positive regulation of calcium-mediated signaling | 6 | 4.390E-04 | 4.712E-02 | |
| GO:0006508~proteolysis | 33 | 4.240E-04 | 4.774E-02 | |
| GO:0002548~monocyte chemotaxis | 8 | 4.660E-04 | 4.781E-02 | |
| KEGG Pathway | hsa04060:Cytokine-cytokine receptor interaction | 38 | 4.250E-12 | 9.690E-10 |
| hsa04080:Neuroactive ligand-receptor interaction | 34 | 4.100E-08 | 4.680E-06 | |
| hsa04640:Hematopoietic cell lineage | 17 | 4.960E-07 | 3.770E-05 | |
| hsa04514:Cell adhesion molecules (CAMs) | 19 | 2.440E-05 | 1.389E-03 | |
| hsa04062:Chemokine signaling pathway | 21 | 9.560E-05 | 4.350E-03 | |
| hsa04660:T cell receptor signaling pathway | 13 | 9.200E-04 | 2.590E-02 | |
| hsa05033:Nicotine addiction | 8 | 1.213E-03 | 3.028E-02 | |
| hsa04650:Natural killer cell mediated cytotoxicity | 14 | 1.739E-03 | 3.543E-02 |
KEGG Kyoto Encyclopedia of Genes and Genomes, FDR false discovery rate, GO-BP Gene Ontology-Biology Process