| Literature DB >> 30569146 |
Kai Sun1, Jianmin Zhao1.
Abstract
The present study was conducted to establish a risk assessment model for evaluating osteosarcoma prognosis based on prognosis-associated long non-coding RNA (lncRNA) expression. Human osteosarcoma expression profiles were obtained from the NCBI GEO and EBI ArrayExpress databases and differently expressed lncRNAs between good and poor prognosis groups were evaluated using Student's t-test and Wilcoxon rank test in R (v. 3.1.0). A multivariate Cox regression was used to establish a risk assessment system based on lncRNA expression levels, with the associated regression coefficients used as the weight. Survival analysis and receiver operating characteristic (ROC) curves were constructed to verify the accuracy of the risk assessment model. Associations between the prognosis, risk assessment model and clinical features were also investigated using univariate and multivariate Cox regression analyses. Furthermore, differentially expressed genes associated with the lncRNAs in the risk assessment model were identified, and functional enrichment analysis was performed. A total of 9 from the 211 differentially expressed lncRNAs were selected to establish the risk assessment model. The risk assessment model exhibited a good prognostic prediction ability, with high area under the curve values in the training and validation sets. Additionally, the calculated risk score based on the 9 selected lncRNAs was identified to be an independent prognostic factor for osteosarcoma. Furthermore, differentially expressed genes were primarily enriched in the cell cycle, oxidative phosphorylation and cell adhesion processes. The present study described a risk assessment model based on 9 significantly differentially expressed lncRNAs, which was identified to have a high accuracy in potentially predicting patient prognosis.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30569146 PMCID: PMC6323200 DOI: 10.3892/mmr.2018.9768
Source DB: PubMed Journal: Mol Med Rep ISSN: 1791-2997 Impact factor: 2.952
Clinical information for training and validation sets.
| Variable | GSE21257 (n=53) | GSE39055 (n=37) | P-value |
|---|---|---|---|
| Age (mean ± SD) | 18.71±12.19 | 13.47±11.34 | 0.0402[ |
| Sex (male/female) | 34/19 | 20/17 | 0.4572[ |
| Metastases (yes/no) | 34/19 | – | – |
| Death (dead/alive) | 23/30 | 10/27 | 0.1728[ |
| Overall survival time (mean ± SD) | 68.55±59.34 | 52.92±50.14 | 0.1813[ |
SD, standard deviation
Student's t-test
Chi-square test.
Figure 1.Significantly differentially expressed lncRNAs identified in the training set. Volcanic maps of differentially expressed lncRNAs identified via (A) Student's t-test or (B) Wilcoxon rank test. The abscissa indicates the log2 FC and the ordinate indicates the negative logarithm of the P-value. Red nodes indicate upregulated lncRNA, green nodes represent downregulated lncRNA and black nodes represent non-differentially expressed lncRNA. (C) Overlapping differentially expressed lncRNAs between the Student's t-test and Wilcoxon rank test. (D) A two-way hierarchical clustering map based on the 211 differentially expressed lncRNAs. FC, fold change; lncRNA, long non-coding RNA.
Information of 9 lncRNAs screened from the 53 osteosarcoma samples in GSE21257 to build the risk assessment model.
| Long non-coding RNA | Coefficient | Hazard ratio | 95% confidence interval | P-value |
|---|---|---|---|---|
| CH17-360D5.2 | −2.037 | 0.112 | 0.042–0.301 | 0.005 |
| LINC00987 | −0.068 | 0.111 | 0.067–0.186 | 0.037 |
| LINC01526 | −6.092 | 0.160 | 0.023–0.213 | 0.016 |
| RP11-15A1.3 | −3.673 | 0.107 | 0.092–0.125 | 0.024 |
| RP11-213H15.1 | −4.925 | 0.163 | 0.035–0.752 | 0.034 |
| RP11-218F4.1 | −0.160 | 0.607 | 0.301–0.802 | 0.028 |
| RP11-242F11.2 | −3.758 | 0.781 | 0.296–0.964 | 0.013 |
| RP11-411H5.1 | −0.009 | 0.243 | 0.219–0.629 | 0.028 |
| RP11-834C11.5 | 7.861 | 3.510 | 1.090–4.113 | 0.016 |
lncRNA, long non-coding RNA.
Figure 2.Kaplan-Meier curves of overall survival for the 9 long non-coding RNAs in the risk assessment model. The red lines indicate samples with high expression levels and green lines indicate samples with low expression levels.
Figure 3.ROC curve assessing high-risk and low-risk patients based on the risk assessment model. ROC curves for the (A) training set and (B) validation set, with the abscissa indicating the sensitivity and the ordinate indicating the specificity. ROC, receiver operating characteristic; AUC, area under the ROC curve.
Figure 4.Expression levels for the 9 selected long non-coding RNAs in the validation set. *P<0.05, **P<0.01 and ***P<0.005.
Univariate and multivariate Cox regression analysis for independent prognostic factors of osteosarcoma according to the 53 osteosarcoma samples in the GSE21257 data set.
| Univariate analysis | Multivariate analysis | |||||
|---|---|---|---|---|---|---|
| Variables | HR | 95% CI | P-value | HR | 95% CI | P-value |
| Risk score (high/low) | 7.574 | 2.492–13.030 | 3.53×10−5 | 1.868 | 1.563–5.785 | 0.028 |
| Age (<18/≥18) | 0.363 | 0.134–0.985 | 0.038 | 0.738 | 0.228–2.386 | 0.814 |
| Sex (male/female) | 1.403 | 0.588–3.348 | 0.444 | – | – | – |
| Grade (G1+G2/G3+G4) | 1.496 | 1.284–1.867 | 0.011 | 1.504 | 1.291–1.872 | 0.014 |
| Tumor metastases (yes/no) | 2.218 | 1.963–3.649 | 2.38×10−7 | 1.211 | 1.037–2.759 | 2.71×10−3 |
HR, hazard ratio; CI, confidence interval.
Figure 5.Kaplan-Meier curves of overall survival between high-risk and low-risk patients in the training set based on the hierarchical analysis. Kaplan-Meier curves for (A) samples with grade I–II (green line) and samples with grade III–IV (red line) tumors and for (B) samples without tumor metastasis (green line) and samples with tumor metastasis (red line).
Top 10 upregulated and downregulated differentially expressed genes associated with the 9 lncRNAs in the risk assessment model.
| A, Upregulated genes | |||
|---|---|---|---|
| Log FC | P-value | FDR | |
| RPLP1 | 1.331 | 1.860×10−6 | 1.305×10−4 |
| UQCRH | 1.298 | 7.820×10−7 | 5.480×10−5 |
| PTMA | 1.293 | 6.750×10−6 | 4.734×10−4 |
| RPL23 | 1.237 | 4.850×10−6 | 3.399×10−4 |
| SUMO2 | 1.209 | 6.110×10−6 | 4.281×10−4 |
| PTGES3 | 1.165 | 6.770×10−6 | 4.743×10−4 |
| NDUFB9 | 1.133 | 1.110×10−6 | 7.790×10−5 |
| RPL27A | 1.131 | 1.520×10−6 | 1.068×10−4 |
| KPNA2 | 1.108 | 3.180×10−6 | 2.228×10−4 |
| ACTR3 | 1.061 | 4.780×10−6 | 3.348×10−4 |
| C1S | −0.501 | 6.443×10−4 | 4.517×10−2 |
| IFI44L | −0.503 | 4.218×10−4 | 2.957×10−2 |
| FAP | −0.506 | 4.576×10−4 | 3.208×10−2 |
| CYP27A1 | −0.513 | 4.440×10−6 | 3.110×10−4 |
| CD163 | −0.526 | 1.821×10−4 | 1.277×10−2 |
| ETV5 | −0.526 | 3.940×10−6 | 2.759×10−4 |
| SDC1 | −0.528 | 1.390×10−5 | 9.719×10−4 |
| HLA-DMA | −0.535 | 3.650×10−5 | 2.559×10−3 |
| CCL8 | −0.537 | 4.822×10−4 | 3.380×10−2 |
| MOXD1 | −0.574 | 5.927×10−4 | 4.155×10−2 |
FC, fold change; FDR, false discovery rate.
Significantly enriched functions and pathways for the identified prognosis-associated genes.
| Category | Term | Count | P-value | Genes |
|---|---|---|---|---|
| BP | GO:0006414; translational elongation | 11 | 3.690×10−6 | |
| BP | GO:0006412; translation | 16 | 2.080×10−4 | |
| BP | GO:0006091; generation of precursor metabolites and energy | 14 | 1.223×10−3 | |
| BP | GO:0008283; cell proliferation | 16 | 3.357×10−3 | |
| BP | GO:0070271; protein complex biogenesis | 17 | 5.484×10−3 | |
| BP | GO:0006461; protein complex assembly | 17 | 5.484×10−3 | |
| BP | GO:0015031; protein transport | 22 | 7.654×10−3 | |
| BP | GO:0045184; establishment of protein localization | 22 | 8.450×10−3 | |
| BP | GO:0009611; response to wounding | 17 | 8.564×10−3 | |
| BP | GO:0006954; inflammatory response | 12 | 1.264×10−2 | |
| BP | GO:0065003; macromolecular complex assembly | 19 | 1.558×10−2 | |
| BP | GO:0007049; cell cycle | 21 | 1.796×10−2 | |
| BP | GO:0055114; oxidation reduction | 18 | 2.148×10−2 | |
| BP | GO:0043933; macromolecular complex subunit organization | 19 | 2.790×10−2 | |
| BP | GO:0022402; cell cycle process | 16 | 3.030×10−2 | |
| BP | GO:0006886; intracellular protein transport | 12 | 3.190×10−2 | |
| BP | GO:0000279; M phase | 11 | 3.283×10−2 | |
| BP | GO:0008104; protein localization | 22 | 3.337×10−2 | |
| BP | GO:0010605; negative regulation of macromolecule metabolic process | 19 | 3.693×10−2 | |
| PATHWAY | hsa03010; ribosome | 11 | 6.270×10−7 | |
| PATHWAY | hsa00190; oxidative phosphorylation | 8 | 1.355×10−3 | |
| PATHWAY | hsa04260; cardiac muscle contraction | 6 | 1.810×10−3 | |
| PATHWAY | hsa00010; glycolysis/Gluconeogenesis | 5 | 2.965×10−3 | |
| PATHWAY | hsa04110; cell cycle | 7 | 3.570×10−3 | |
| PATHWAY | hsa04610; complement andcoagulation cascades | 4 | 1.536×10−2 | |
| PATHWAY | hsa04142; lysosome | 4 | 4.059×10−2 | |
| PATHWAY | hsa03040; spliceosome | 4 | 4.532×10−2 | |
| PATHWAY | hsa04514; cell adhesion molecules | 4 | 4.839×10−2 |
BP, biological process.