Hua Liu1,2, Chenyu Zong3, Jiacheng Sun4, Haiyang Li5, Guangzhen Qin2, Xiaojian Wang2, Jianwei Zhu3, Yang Yang6, Qiang Xue1, Xianchen Liu1. 1. Department of Radiation Oncology, Affiliated Hospital of Nantong University, Nantong, China. 2. Department of Orthopedics, Haian Hospital of Traditional Chinese Medicine, Haian, China. 3. Department of Orthopedics, Affiliated Hospital of Nantong University, Nantong, China. 4. Xinglin College, Nantong University, Nantong, China. 5. Department of Oncology, Binhai County People's Hospital, Yancheng, China. 6. Department of Trauma Center, Affiliated Hospital of Nantong University, Nantong, China.
Abstract
Background: Osteosarcoma (OS) is a disease with high mortality in children and adolescents, and metastasis is one of its important clinical features. However, the molecular mechanism of OS occurrence is not completely clear. Thus, we screened potential biomarkers of OS and analyze their prognostic value. Methods: The Cancer Genome Atlas (TCGA) datasets were used to analyze the differential lncRNAs in patients with OS of different immune score and the lncRNAs expressed by immune cells. Cox regression was used to develop the prognosis prediction model and specify the prognosis outcomes. Risk-proportional regression model was constructed, and the samples were divided into high and low groups based on the risk scores for the survival analysis. The areas under the receiver operating characteristic (ROC) curve were calculated and the risk-score model was verified. Finally, using 4 gene sets (comprising chemokines, immune checkpoint blockades, immune activity-related genes, and immune cells), and 4 analysis tools (CIBERSORT, TIMER, XCELL and MCP) to evaluated tumor immune infiltration. Results: Twenty-nine long non-coding ribonucleic acids (lncRNAs) were obtained from the intersection of the screened lncRNAs. Caspase recruitment domain-containing protein 8-antisense RNA 1 (CARD8-AS1), lncRNA five prime to Xist (FTX), KAT8 regulatory NSL complex unit 1-antisense RNA 1 (KANSL1-AS1), Neuroplastin Intronic Transcript 1 (NPTN-IT1), oligodendrocyte maturation-associated long intervening non-coding RNA (OLMALINC) and RPARP Antisense RNA 1 (RPARP-AS1) were found to be correlated with survival. Univariate and multivariate regression analysis showed risk score [HR (hazard ratio) 3.5, P value 0.0043; HR 3.7, P value 0.0033] and metastasis (HR 4.7, P value 6.60E-05; HR 4.8, P value 8.36E-05) were the key factors of patients with OS. The areas under curves (AUCs) of the 1-, 3-, and 5-year ROC curves of the prognostic model were 0.715, 0.729, and 0.771. The low-risk patients tended to have a high abundance of immune cells. Conclusions: This study showed that a risk score based on 6 lncRNAs has potential value in the prognosis of OS, and patients with low-risk scores have high immune cell infiltration and good prognosis. This study may enrich understandings of underlying mechanisms related to the occurrence and development of OS. 2022 Translational Pediatrics. All rights reserved.
Background: Osteosarcoma (OS) is a disease with high mortality in children and adolescents, and metastasis is one of its important clinical features. However, the molecular mechanism of OS occurrence is not completely clear. Thus, we screened potential biomarkers of OS and analyze their prognostic value. Methods: The Cancer Genome Atlas (TCGA) datasets were used to analyze the differential lncRNAs in patients with OS of different immune score and the lncRNAs expressed by immune cells. Cox regression was used to develop the prognosis prediction model and specify the prognosis outcomes. Risk-proportional regression model was constructed, and the samples were divided into high and low groups based on the risk scores for the survival analysis. The areas under the receiver operating characteristic (ROC) curve were calculated and the risk-score model was verified. Finally, using 4 gene sets (comprising chemokines, immune checkpoint blockades, immune activity-related genes, and immune cells), and 4 analysis tools (CIBERSORT, TIMER, XCELL and MCP) to evaluated tumor immune infiltration. Results: Twenty-nine long non-coding ribonucleic acids (lncRNAs) were obtained from the intersection of the screened lncRNAs. Caspase recruitment domain-containing protein 8-antisense RNA 1 (CARD8-AS1), lncRNA five prime to Xist (FTX), KAT8 regulatory NSL complex unit 1-antisense RNA 1 (KANSL1-AS1), Neuroplastin Intronic Transcript 1 (NPTN-IT1), oligodendrocyte maturation-associated long intervening non-coding RNA (OLMALINC) and RPARP Antisense RNA 1 (RPARP-AS1) were found to be correlated with survival. Univariate and multivariate regression analysis showed risk score [HR (hazard ratio) 3.5, P value 0.0043; HR 3.7, P value 0.0033] and metastasis (HR 4.7, P value 6.60E-05; HR 4.8, P value 8.36E-05) were the key factors of patients with OS. The areas under curves (AUCs) of the 1-, 3-, and 5-year ROC curves of the prognostic model were 0.715, 0.729, and 0.771. The low-risk patients tended to have a high abundance of immune cells. Conclusions: This study showed that a risk score based on 6 lncRNAs has potential value in the prognosis of OS, and patients with low-risk scores have high immune cell infiltration and good prognosis. This study may enrich understandings of underlying mechanisms related to the occurrence and development of OS. 2022 Translational Pediatrics. All rights reserved.
Osteosarcoma (OS) is a primary malignant bone tumor that most commonly affects children, adolescents, and young adults (1-3). OS occurs in the epiphysis of long bones, most commonly in the distal femur (43%), proximal tibia (23%), and humerus (10%) (4,5). Patients with advanced metastatic OS have a poor prognosis. OS patients often develop a resistance to standard therapies; thus, treatment regimens need to be improved and novel therapeutic targets need to identified (6). At present, the underlying molecular mechanism of OS is still unclear, which hinders the development of prognosis and treatment strategies. It is urgent to identify new prognostic and diagnostic biomarkers in OS.Numerous studies have been conducted upon this issue. With advancement of sequencing technologies, several kinds of non-coding RNAs (ncRNAs) have been discovered, such as the microRNAs, lncRNAs and circleRNAs (7-9). Recently microRNAs (i.e., miR-9, miR-21, miR-29 and miR-195) have been presented play a potential biological role in osteosarcoma and can be used as diagnostic and prognostic markers and therapeutic targets (7,8). LncRNAs play an important role in the occurrence and development of tumors, which show significant potential in osteosarcoma prognosis (10). However, roles of lncRNAs in the progress, prognosis and metastasis of osteosarcoma OS has not been widely explored (11). There is experimental evidence that many lncRNAs are involved in the pathogenesis and development of OS; For example, Ankyrin Repeat and SOCS Box Containing 16 Antisense RNA 1 (ASB16-AS1) has been identified as a novel oncogenic lncRNA in OS cells. ASB16-AS1 increases the level of hepatoma-derived growth factor expression by sponging Mir-760, which play a promoting role in OS (12). Potassium Voltage-Gated Channel Subfamily Q Member 1 Opposite Strand/Antisense Transcript 1 (KCNQ1OT1) expression has been shown to be increased in the tissues of patients with OS and is associated with OS progression and decreased overall survival (13). KCNQ1OT1 may be a drug-resistant lncRNA and a promising target for avoiding chemical resistance (13). LINC01116 targets miR-520a-3p and affects interleukin-6 receptor (IL6R), promoting the proliferation and migration of OS cells through the Janus-activated kinase/signal transducer and activator of transcription (JAK/STAT) signaling pathway (14). The high expression of lncRNA prostate cancer-associated transcript 6 (PCAT6) has been shown to be positively correlated with an advanced stage and the metastatic state of OS, and a survival analysis showed that the upregulation of PCAT6 is associated with a poor prognosis (15). The down syndrome cell adhesion molecule antisense 1 (DSCAM-AS1) silencing significantly inhibited the viability and invasion characteristics of OS cells, while DSCAM-AS1 upregulation had the opposite effect (16). Long Intergenic Non-Protein Coding RNA 1614 (LINC01614) can serve as a competing endogenous RNA and promote the proliferation and invasion of OS cells through the miR-520a-3p/sorting nexin 3 (SNX3) axis, thus serving as a novel prognostic marker for clinical OS (17). LncRNA small nucleolar RNA host gene 4 (SNHG4) is highly expressed in OS tissues and cell lines. In addition, SNHG4 expression has been shown to be associated with distant metastasis, tumor-node-metastasis staging, and survival in patients with OS (18). Melanotransferrin Antisense RNA 1 (MELTF-AS1) acts as a metastasis-promoting gene in OS via the upregulation of Matrix metalloproteinase 14 (MMP14), and may be a potential therapeutic and diagnostic target for OS (19). LncRNA LEF1-AS1 binds to heterogeneous nuclear ribonucleoprotein L, and promotes the proliferation, migration, and invasion of OS by enhancing the mRNA stability of LEF1 (20). LncRNA antisense noncoding RNA from the RNA binding motive 5 (RBM5-AS1) promotes OS cell proliferation, migration, and invasion in vitro and tumor growth in vivo. LncRNA RBM5-AS1 targets RBM5 in OS cells (21). LncRNA PCAT-1 promotes the progression of OS, and the mir-508-3p/ Zinc finger E-box binding homeobox 1 (ZEB1) axis has been shown to be associated with the functional role of PCAT-1 in OS, which suggests that PCAT-1/Mir-508-3p/ZEB1 may be a therapeutic target for OS patients (22). LncRNA titinantisense RNA1 (TTN-AS1) is highly expressed in OS and is associated with a poor prognosis (23). It regulates OS cell apoptosis and drug resistance through the Mir-134-5p/MBTD1 axis (23). A Kaplan-Meier analysis showed that the overexpression of mir-100-let-7a-2-mir-125b-1 cluster host gene (MIR100HG), HOXD cluster antisense RNA 1 (HOXD-AS1), Ewing sarcoma-associated transcript 1 (EWSAT1), LIM and Cysteine Rich Domains 1 Antisense RNA 1 (LMCD1-AS1), and many other lncRNAs predicts a poor prognosis in patients with OS (24). Thus, lncRNAs are very important in OS. Th early detection and screening of lncRNA expression and possible interventions may be of great help in reducing the mortality and improving the prognosis of OS patients.In this study, a bioinformatics analysis demonstrated the role of lncRNAs in the occurrence and development of OS, and their clinical prognostic value was explored via univariate and multivariate regression analyses and receiver operating characteristic (ROC) curve analyses. More importantly, we conducted a Gene Ontology (GO) analysis, Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis, Gene Set Enrichment Analysis (GSEA), and immune infiltration analysis to determine the biological functions of differentially expressed genes (DEGs) in high- and low-risk patients. Our results suggest that lncRNA is a possible future therapeutic target for OS. We present the following article in accordance with the TRIPOD reporting checklist (available at https://tp.amegroups.com/article/view/10.21037/tp-22-253/rc).
Methods
Data collection
Clinical data and RNA-sequencing data were downloaded from The Cancer Genome Atlas (TCGA) data set for OS (labeled x TARGET-OS; sample size n=86, one sample is excluded). Clinical data and RNA-sequencing data for cutaneous melanoma (labeled TCGA-SKCM; n=457) and IMvigor210 (n=348, http://research-pub.gene.com/IMvigor210CoreBiologies/) were also downloaded. A lncRNA data set of immune cells were defined as IM-lncRNAs. The chip data related to the following immune cells were collected: GSE13906, GSE23371, GSE25320, GSE27291, GSE27838, GSE28490, GSE28698, GSE28726, GSE37750, GSE39889, GSE42058, GSE49910, GSE51540, GSE59237, GSE6863, and GSE8059. The chip platform used was the Affymetrix HG-U133_Plus 2.0 platform. The robust multiarray analysis (RMA) method was used to standardize the data. The annotation information of lncRNA was extracted by NetAffx annotation files (HG-U133 Plus 2.0 Annotations, CSV format, release 36, July 12, 2016). As the data includes batch information, the comBat package was used to remove the batch effect. The pre-processed data were used for the subsequent analyses. High expression lncRNA in each immune cell type was screened, and the expression of lncRNA in each immune cell was determined to be normally distributed by a normal distribution test. Thus, the value at the 0.95 quantile of normal distribution was taken as the cutoff for whether a cell type was highly expressed. If the expression level of the lncRNA was higher than the cutoff, it was considered to be highly expressed lncRNA. A collection of highly expressed lncRNAs in all the immune cells was used for this analysis. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013).
Survival analysis and risk modeling
In the TARGET-OS data set, according to the immune score value obtained by the Estimation of Stromal and Immune cells in Malignant Tumour tissues using Expression data (ESTIMATE), the samples were divided into high and low immune score group by the median score. Then, differential expressed lncRNAs (DE-lncRNAs) were detected with a false discovery rate (FDR) <0.05 and |log2 (fold change)| ≥0.58 (21). Then, DE-lncRNAs intersect with IM-lncRNAs were used for univariate Cox regression analysis. lncRNAs with a P value <0.05 were kept for constructing risk model by the multivariate Cox regression and Stepwise screening analysis. Next, we built a risk-proportional regression model to calculate the risk score. Based on their risk scores, the samples were divided into high and low groups, and survival curves were drawn to compare the survival status of the 2 groups.
Bioinformatics analysis
We conducted a functional analysis of the DEGs in the samples of two risk-score groups, including a GO analysis, KEGG analysis, and GSEA analysis. To evaluate the relationship between the risk-score and tumor immune invasion, we firstly calculate the percentage of various immune cells in the sample of two risk-score groups by CIBERSORT (https://cibersort.stanford.edu/), TIMER (tumor immune estimation resource; https://cistrome.shinyapps.io/timer), XCELL (xcell.ucsf.edu/), and MCP-counter (https://github.com/ebecht/MCPcounter). Besides, 3 immune related gene signatures’ activity were calculated by GSVA. Then, checked this scores’ distribution between high and low risk group. We used 4 gene sets: comprising chemokines, immune checkpoint blockades (ICBs), immune activity-related genes, and immune cells to evaluate the relationship between the risk-score and tumor immune invasion.
Statistical analysis
To evaluate the prediction accuracy of the risk-score model, we plotted the ROC curves and calculated the areas under the curve (AUCs). Age, gender, risk score, and metastasis in the TARGET-OS data sets were analyzed by univariate and multivariate Cox regression analysis. All the statistic analysis were performed by R4.0 software.
Results
Differentially expressed lncRNAs in samples with high and low immune scores were screened
The flowchart of study was presented (Figure S1). To explore the relationship between the differentially expressed lncRNAs in the immune score samples and survival, we first conducted a differential analysis and screened DE-lncRNAs in the TARGET-OS data set. The intersection of these DE-lncRNAs and IM-lncRNAs (lncRNAs expressed by the immune cells) yielded the following 29 lncRNAs: Inositol-Tetrakisphosphate 1-Kinase antisense RNA 1 (ITPK1-AS1), Long Intergenic Non-Protein Coding RNA 599 (LINC00599), Glycerol kinase 3 pseudogene (GK3P), Ankyrin Repeat Domain 36B Pseudogene 2 (ANKRD36BP2), NPTN-IT1, RPARP-AS1, OLMALINC, Polycystin 1, Transient Receptor Potential Channel Interacting Pseudogene 6 (PKD1P6), Ectonucleoside Triphosphate Diphosphohydrolase 1 Antisense RNA 1 (ENTPD1-AS1), FTX, CH17-340M24.3, CARD8-AS1, PC-Esterase Domain Containing 1B Antisense RNA 1 (PCED1B-AS1), Napsin B Aspartic Peptidase Pseudogene (NAPSB), Glycoprotein Alpha-Galactosyltransferase 1 Pseudogene (GGTA1P), Proteasome 20S Subunit Beta 8 Antisense RNA 1 (PSMB8-AS1), Major Histocompatibility Complex, Class II, DR Beta 6 (HLA-DRB6), GTPase, Very Large Interferon Inducible Pseudogene 1 (GVINP1), HOXA Transcript Antisense RNA, Myeloid-Specific 1 (HOTAIRM1), Long Intergenic Non-Protein Coding RNA 996 (LINC00996), Small Nucleolar RNA Host Gene 9 (SNHG9), FLJ20021, Zinc Finger BED-Type Containing 5 Antisense RNA 1 (ZBED5-AS1), KANSL1-AS1, Integrin Subunit Beta 2 Antisense RNA 1 (ITGB2-AS1), T Cell Receptor Gamma Locus Antisense RNA 1 (TRG-AS1), Major Histocompatibility Complex, Class I, J (HLA-J), Trinucleotide Repeat Containing Adaptor 6C Antisense RNA 1 (TNRC6C-AS1), and PRR34 Antisense RNA 1 (PRR34-AS1) ().
Figure 1
The volcano map of the DEGs between the samples with high and low immune scores were screened. Blue represents downregulated, red represents upregulated. The genes named in the figure are the lncRNAs expressed by the immune cells. The horizontal dotted line represents FDR0.05. The 2 vertical dashed lines represent –log2(1.5) and log2(1.5). DEGs, differentially expressed genes.
The volcano map of the DEGs between the samples with high and low immune scores were screened. Blue represents downregulated, red represents upregulated. The genes named in the figure are the lncRNAs expressed by the immune cells. The horizontal dotted line represents FDR0.05. The 2 vertical dashed lines represent –log2(1.5) and log2(1.5). DEGs, differentially expressed genes.
Prognostic value of the OS markers
To examine the prognostic value of these lncRNAs for OS patients, 15 lncRNAs were obtained by a univariate Cox regression analysis of the 29 lncRNAs (P<0.05; ). Next, 15 lncRNAs were selected by a multivariate Cox regression analysis and stepwise screening. Ultimately, 6 lncRNAs were obtained (i.e., CARD8-AS1, FTX, KANSL1-AS1, NPTN-IT1, OLMALINC, and RPARP-AS1) (). The following formula was used: risk score = CARD8-AS1 * (–0.816784749) + FTX * (–1.167412412) + OLMALINC * (0.622776568) + KANSL1-AS1 * (–0.588179333) + NPTN-IT1 * (0.863758704) + RPARP-AS1 * (0.500369245). Next, the samples were divided into high- and low-risk groups according to their risk scores, and survival curves were drawn (). We found that samples with high-risk scores had poor survival. Next, we conducted a survival analysis in relation to age, gender, risk score, and metastasis in the TARGET-OS data set using univariate and multivariate regression analysis (). The results showed that both risk score and metastasis were associated with survival regardless of the univariate or multivariate outcomes, and that these may be important factors affecting the prognosis of patients with OS.
Table 1
15 lncRNAs were screened by a univariate cox regression analysis of 29 lncRNAs
A multivariate Cox regression analysis was performed for 6 lncRNAs
Term
HR
Lower
Upper
Coef
Se (coef)
Z value
P value
CARD8-AS1
0.441850032
0.214477992
0.91026333
–0.816784749
0.368763613
–2.214927719
0.026765026
FTX
0.311171083
0.110473154
0.87647939
–1.167412412
0.528361914
–2.209493872
0.027140308
KANSL1-AS1
0.55533745
0.32037526
0.962620156
–0.588179333
0.280659723
–2.095702673
0.036108575
NPTN-IT1
2.372059828
1.16966068
4.810512936
0.863758704
0.360743881
2.394382134
0.016648393
OLMALINC
1.864096654
1.09016925
3.187446659
0.622776568
0.273700748
2.275392278
0.022882408
RPARP-AS1
1.649330165
1.006990703
2.701405273
0.500369245
0.251740781
1.98763682
0.046851871
HR, hazard ratio; Coef, coefficient, regression coefficient beta; Se (coef), standard error; z value = coef/se (coef).
Figure 2
Prognostic value of the 6 lncRNAs in OS. (A) Survival curves for high- and low-risk patients. (B) ROC curves were drawn to analyze the predictive value of risk scores for the prognosis of OS. (C) The relationship between risk score and metastasis was analyzed. OS, osteosarcoma; ROC, receiver operating characteristics.
Table 3
Univariate Cox regression analysis of clinical factors + risk
Multivariate Cox regression analysis of clinical factors + risk
Term
HR
Lower
Upper
Coef
Se (coef)
Z value
P value
Age >16
1.005184606
0.426658567
2.368160794
0.005171212
0.437223547
0.011827387
0.990563331
Gender (male)
0.835465251
0.364224958
1.916404051
–0.179766523
0.423587918
–0.424390109
0.671281333
Type (high)
3.691673791
1.544561948
8.823508437
1.306079957
0.444569198
2.937855263
0.003304913
Meta. or non-metastatic
4.8087353
2.199025527
10.51553741
1.570434118
0.39920111
3.933942263
8.36E-05
HR, hazard ratio; Coef, coefficient, regression coefficient beta; Se(coef), standard error; z value = coef/se(coef).
Beta, regression coefficient beta; HR, hazard ratio; CI, confidence interval.HR, hazard ratio; Coef, coefficient, regression coefficient beta; Se (coef), standard error; z value = coef/se (coef).Prognostic value of the 6 lncRNAs in OS. (A) Survival curves for high- and low-risk patients. (B) ROC curves were drawn to analyze the predictive value of risk scores for the prognosis of OS. (C) The relationship between risk score and metastasis was analyzed. OS, osteosarcoma; ROC, receiver operating characteristics.Beta, regression coefficient beta; CI, confidence interval.HR, hazard ratio; Coef, coefficient, regression coefficient beta; Se(coef), standard error; z value = coef/se(coef).To analyze the predictive value of the risk score in the prognosis of OS, time-dependent ROC curves were drawn based on the risk scores of the 6 genes and the AUCs were calculated (). The results showed that the AUCs of the 1-, 3-, and 5-year ROC curves of the prognostic model were 0.715, 0.729, and 0.771, respectively, indicating that the prediction accuracy of the risk-score model was relatively high. Finally, we analyzed the relationship between the risk score and metastasis. The risk scores were higher in the metastatic samples than those in the non- metastatic samples ().
The risk-score model was verified
First, we divided the samples into high- and low-risk groups according to the risk scores in the IMvigor210 data set (the urothelial carcinoma data set) and drew survival curves. The high-risk patients were found to have poor survival (Figure S2A). We analyzed the ROC curves of the risk scores and found that the AUCs of the ROC curves of the prognostic model at 1, 3, and 5 years were 0.466, 0.52, and 0.537, respectively (Figure S2B). We also verified these results in the TCGA-SKCM data set (the cutaneous melanoma data set). The data set comprised 42 samples with immunotherapy data and 415 samples without immunotherapy data. Based on whether the immunotherapy data were available or not, the samples were divided into high- and low-risk groups according to the risk scores, and survival curves were drawn. The high-risk patients with immunotherapy were found to have poor survival (Figure S2C). Then, we analyzed the ROC curves of risk score. We found that the AUC of the 1-, 3-, and 5-year ROC curves for the sample prognostic models with immunotherapy data were 0.86, 0.916, and 0.711, respectively (Figure S2D). The high-risk patients without immunotherapy were found to have poor survival (Figure S2E). The AUCs of the 1-, 3-, and 5-year ROC curves for the sample prognostic models without immunotherapy data were 0.522, 0.562, and 0.599, respectively (Figure S2F). These results are consistent with the above results and suggest that the prediction value of the OS risk-score model is relatively high.Next, we verified the model by conducting a functional analysis of the DEGs between two risk group samples. Through a GO analysis of the biological processes, we found that the functions of these DEGs were mainly related to the immune response, defense response, positive regulation of immune system process, and cytokine response. Additionally, most of these DEGs were downregulated in the high-risk group (). Through a GO analysis of the cellular components, we found that the functions of these DEGs were mainly related to the intrinsic components of the plasma membrane, extracellular exosome, and cell surface. Additionally, most of these DEGs were downregulated in the high-risk group (). We conducted a GO analysis of the molecular function, and found that the functions of these DEGs were mainly related to signaling receptor binding, identical protein binding, and signaling receptor activity. Additionally, most of these DEGs were downregulated in the high-risk group (Figure 3C).
Figure 3
The DEGs in the two risk group samples were functionally analyzed. (A) The GO analysis of the biological processes. (B) The GO analysis of the cellular components. (C) The GO analysis of the molecular functions. Up gene (ratio): The ratio of the upregulated genes to total genes. If the value is 0.5, it means that the proportion of upregulated and downregulated genes is equal; if the value is >0.5, there are more upregulated genes. (D) KEGG analysis. The horizontal axis represents −log10 (P value); Color gradient fill according to −log10 (P value). The number on the right represents the number of differences; the colors are filled according to the number of downregulated genes. DEGs, differentially expressed genes; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.
The DEGs in the two risk group samples were functionally analyzed. (A) The GO analysis of the biological processes. (B) The GO analysis of the cellular components. (C) The GO analysis of the molecular functions. Up gene (ratio): The ratio of the upregulated genes to total genes. If the value is 0.5, it means that the proportion of upregulated and downregulated genes is equal; if the value is >0.5, there are more upregulated genes. (D) KEGG analysis. The horizontal axis represents −log10 (P value); Color gradient fill according to −log10 (P value). The number on the right represents the number of differences; the colors are filled according to the number of downregulated genes. DEGs, differentially expressed genes; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.The KEGG analysis revealed that these DEGs were mainly involved in the phagosomes, cell adhesion molecules, and the cytokine-cytokine signaling pathway. Most of the DEGs in these pathways were downregulated (). For the GSEA analysis, we used immune-related gene sets and found that the B cell receptor (BCR) signaling pathway, T cell receptor (TCR) signaling pathway, natural killer (NK) cell cytotoxicity, antigen processing and presentation, chemokine receptors, and other immune-related signaling pathways were downregulated in the high-risk group. No upregulated immune-related signaling pathway was found (). We used gene sets from the KEGG database for the analysis and found antigen processing and presentation, NK cell-mediated cytotoxicity, Th17 cell differentiation, the T cell receptor signaling pathway, the Toll-like receptor signaling pathway, the chemokine signaling pathway, and other immune-related signaling pathways were downregulated in the high-risk group. The ribosome pathway was upregulated in the high-risk group (). In conclusion, the function of the DEGs in the low-risk group was mainly related to immune inflammation.
Figure 4
A GSEA analysis was performed for the DEGs in the two risk group samples. (A) Immune-related gene sets were used for the analysis. (B) Gene sets from the KEGG database were used for the analysis. GSEA, Gene Set Enrichment Analysis; DEGs, differentially expressed genes; KEGG, Kyoto Encyclopedia of Genes and Genomes; NES, Normalized Enrichment Score.
A GSEA analysis was performed for the DEGs in the two risk group samples. (A) Immune-related gene sets were used for the analysis. (B) Gene sets from the KEGG database were used for the analysis. GSEA, Gene Set Enrichment Analysis; DEGs, differentially expressed genes; KEGG, Kyoto Encyclopedia of Genes and Genomes; NES, Normalized Enrichment Score.
Immuno-infiltration analysis
First, we used a chemokine (), ICB (), immune-activity-related gene (), and immune cell (Figure 6B) data set to evaluate whether the model was related to tumor immune invasion. The results showed that no matter the data set, almost all of the genes were more highly expressed in the low-risk group than the high-risk group. Next, we analyzed the relationship between the immune infiltration results and survival in the TARGET-OS data set (). The immune score was significantly associated with survival in the target-OS data sets, and cytotoxic_T cells, monocytic lineage, and cluster of differentiation (CD) 8 T cells were also associated with survival.
Figure 5
Chemokine and ICB gene sets were used to evaluate whether the model was associated with tumor immune invasion. (A) Boxplot of the chemokine gene concentration evaluation model. The horizontal axis is the name of the gene; the vertical axis is log2 expression. Each gene corresponds to 2 boxplots; that is, high risk and low risk, and are filled with blue and yellow, respectively. (B,D) Boxplot of the ICB gene concentration evaluation model. (C,E) The relationship between ICB gene expression level and risk score as presented in a heat map. Type stands for the risk score. Ns, not significant, *, P<0.05, **, P<0.01, ***, P<0.001, ****, P<0.0001. ICB, immune checkpoint blockade; TPM, transcript per million.
Figure 6
Immune activity-related and immune cell gene sets were used to evaluate whether the model was related to tumor immune invasion. (A) A boxplot of the evaluation model in the immune-activity-related gene set. (B) A boxplot of the evaluation model in the immune cell gene set. Ns, not significant, *, P<0.05, **, P<0.01, ***, P<0.001. TPM, transcript per million.
Table 5
The correlations between immune infiltration and survival were analyzed
Cell type
Survivorship curve P value (TARGET-OS Data set )
Immune Score
0.0011
Cytotoxic T cells
0.045
Myeloid dendritic cells
0.83
Monocytic lineage
0.019
Cytotoxic lymphocytes
0.71
Fibroblasts
0.96
Endothelial cells
0.26
Neutrophils
0.2
NK cells
0.14
B lineage
0.05
CD8 T cells
0.045
T cells
0.31
Chemokine and ICB gene sets were used to evaluate whether the model was associated with tumor immune invasion. (A) Boxplot of the chemokine gene concentration evaluation model. The horizontal axis is the name of the gene; the vertical axis is log2 expression. Each gene corresponds to 2 boxplots; that is, high risk and low risk, and are filled with blue and yellow, respectively. (B,D) Boxplot of the ICB gene concentration evaluation model. (C,E) The relationship between ICB gene expression level and risk score as presented in a heat map. Type stands for the risk score. Ns, not significant, *, P<0.05, **, P<0.01, ***, P<0.001, ****, P<0.0001. ICB, immune checkpoint blockade; TPM, transcript per million.Immune activity-related and immune cell gene sets were used to evaluate whether the model was related to tumor immune invasion. (A) A boxplot of the evaluation model in the immune-activity-related gene set. (B) A boxplot of the evaluation model in the immune cell gene set. Ns, not significant, *, P<0.05, **, P<0.01, ***, P<0.001. TPM, transcript per million.Next, we examined the TARGET-OS data through CIBERSORT, TIMER, XCELL, MCP analyze the relation between risk score and immunity infiltration results. The CIBERSORT results showed that activated dendritic cells were richer in the low-risk samples (). According to the results of TIMER, macrophages and mDC cells were richer in the low-risk samples (). The XCELL results showed that B plasma was enriched in the high-risk samples. Hemastem cells, immune score, macrophage M1, macrophage, mDC activated, microenvironment scores, monocytes, stroma score, and T CD8+ central memory cells were enriched in the low-risk sample (). From the MCP results, B lineage, cytotoxic_T cells, monocytic lineage, NK cells and T cell expression were richer in the low-risk samples (). Due to the high proportion of immune cells with significant differences in the MCP results, we analyzed the correlation between the results of the immune infiltration calculated by MCP and risk scores. We found that monocytic lineage, fibroblasts, T cells, and other cells were highly negatively correlated with risk score (correlation coefficient value <0, P value <0.05) ( and ).
Figure 7
Immune cell infiltration analysis. CIBERSORT (A) and TIMER (B) were used to analyze the relationship between the risk score and immune infiltration results.
Figure 8
Immune cell infiltration analysis. XCELL (A) and MCP (B) were used to analyze the relationship between the risk score and immune infiltration results. (C) The correlations between the results of immune infiltration calculated by MCP and risk score were analyzed.
Table 6
The correlations between the immune infiltration results calculated by MCP and risk score were analyzed
Cell type
Cor_value
P value
Risk
1
0
T cells
–0.263464746
0.014839614
CD8 T cells
–0.145846871
0.182904083
Cytotoxic lymphocytes
0.005366495
0.96112298
B lineage
–0.154100059
0.159094681
NK cells
–0.009482383
0.931362481
Monocytic lineage
–0.438292802
2.72E-05
Myeloid dendritic cells
–0.154471182
0.158080916
Neutrophils
–0.074284273
0.499255801
Endothelial cells
–0.058055816
0.597659975
Fibroblasts
–0.297774076
0.005644012
Cytotoxic T cells
–0.208816999
0.055124599
Cor_value, correlation coefficient value.
Immune cell infiltration analysis. CIBERSORT (A) and TIMER (B) were used to analyze the relationship between the risk score and immune infiltration results.Immune cell infiltration analysis. XCELL (A) and MCP (B) were used to analyze the relationship between the risk score and immune infiltration results. (C) The correlations between the results of immune infiltration calculated by MCP and risk score were analyzed.Cor_value, correlation coefficient value.
Discussion
OS is a malignant bone tumor commonly seen in children and adolescents, and its incidence is increasing year by year (4). At present, its pathogenesis is very complicated and unclear. The circular RNA circTADA2A promotes the progression and metastasis of OS by sponging miR-203A-3p and regulating CREB3 expression (25). Circ_001422 accelerates OS oncogenesis and metastasis by regulating the miR-195-5p/FGF2/PI3K/Akt axis, which suggests that Circ_001422 can be the target for OS therapy (26). A comprehensive bioinformatics analysis of micro RNA (miRNA) and mRNA in OS showed that miR-30D-5p, miR-17-5p, miR-98-5p, miR-301A-3p, and miR-30E-5p are the hub miRNAs (27). A protein-protein interaction network analysis indicated COL1A1 COL1A2, MMP2, CDH11, and COL4A1 may be the core regulatory molecules in the occurrence and development of OS (27). MiRNA-218 inhibits cell proliferation, migration, and invasion by targeting runt-related transcription factor 2 in human OS cells (28). The overexpression of C3AR1 mRNA was confirmed to inhibit the proliferation, migration, and invasion of OS cells and induce apoptosis. It may be a promising therapeutic target for OS, and it may be related to prognosis and the tumor microenvironment (29). M6A regulators (i.e., FTO and IGF2BP2) may play an important role in the metastasis of OS, which is of great value to the prognosis and treatment strategy for OS patients (30). In this study, we focused on the relationship between 6 lncRNAs and the occurrence and development of OS, and our findings provide further insights into the potential molecular regulatory mechanism of the occurrence and development of OS.The lncRNAs examined in this study not only play a very important role in OS, but have also been reported in the study to have related roles in other tumors (31). In this study, CARD8-AS1 was highly expressed in the samples with high immune scores. It is suggested that immune cells expressing CARD8-AS1 had a high degree of infiltration in high immune score samples. Additionally, the expression of CARD8-AS1 was negatively correlated with the risk score, indicating that the higher the expression of CARD8-AS1, the lower the risk score and the better the prognosis of patients. These results suggest that CARD8-AS1 has a protective effect in patients with OS. In ovarian cancer, card8-AS1 (HR =1.31, P=9.3E−03) was found to be significantly associated with the overall survival rate of ovarian cancer patients through a weighted gene co-expression network analysis (31). In relation to glioma, a study has found that CARD8-AS1 regulates the metastasis potential of glioma cell lines in vitro, and combined with clinical information, CARD8-AS1 has been identified as a risk lncRNA of glioma (32). Thus, the effects of the same gene in different tumors may be opposite, and the specific mechanism needs to be further explored.In this study, the low expression of FTX in high immune score sample indicated that the immune cell expressing FTX infiltration degree in high immune score samples was low. However, by constructing a risk model construction, we found a negative correlation between FTX expression and risk score, which suggests that FTX plays a risk role in patients with OS. Research has found that FTX promotes gastric cancer progression by targeting miR-215 (33). In pancreatic cancer, FTX silencing inhibits pancreatic cancer cell proliferation and invasion via the upregulation of miR-513B-5p (34). In colorectal cancer, FTX promotes colorectal cancer cell migration and invasion through the miR-590-5P/RBPJ axis (35). Additionally, it has been found that FTX knockdown inhibits the proliferation and migration of OS by regulating miR-320A/TXNRD1 (36). These findings are consistent with our results, but the risk model we constructed still needs to be verified by further experiments.In this study, the low expression of OLMALINC and RPARP-AS1 in the high immune score samples indicated that the infiltration degree of immune cell expressing OLMALINC and RPARP-AS1in high immune score samples was low. Additionally, the expression levels of OLMALINC and RPARP-AS1 were positively correlated with risk score, suggesting that OLMALINC and RPARP-AS1 play risk roles in patients with OS. Long intervening non-coding RNA (lincRNA) is a subclass of non-coding RNA (37). OLMALINC (LINC00263) and RPARP-AS1 have been identified as important regulators of the pathogenesis of non-alcoholic fatty liver disease (38,39). Studies have shown that OLMALINC (LINC00263), regulated by heterogeneous nuclear ribonucleoprotein K, upregulates CAPN2 through miR-147a, and thus plays an important role in cancer and malignant tumors (40). Additionally, Wnt-regulated OLMALINC (LINC00263) was found to modify cancer cell growth (41). This is consistent with our results, but the specific mechanism of action needs to be further studied through a large number of animal experiments. In colon cancer, a study has shown that the RPARP-AS1/miR-125A-5p axis plays an active role in promoting the proliferation, migration, and invasion of colon cancer cells (42). A bioinformatics analysis has previously shown that the high expression of RPARP-AS1 is associated with low survival in patients with OS (43). However, in lung adenocarcinoma and triple negative breast cancer, the high expression of RPARP-AS1 is associated with higher patient survival (44,45). This also suggests that the role of lncRNA in tumors is extremely complex and needs to be further studied. In summary, the prognostic effect between the lncRNAs discussed in our study in tumor development is obvious, and the model we established may be applicable to clinical practice. However, we have not yet verified it and hope that future studies will conduct molecular biology experiments to verify our conclusions.We also used gene sets to evaluate whether the model was associated with tumor immune invasion. Chemokine is a large class of small cytokines that can be divided into 4 distinct subgroups (i.e., XC, CC, CXC, and CX3C) based on cysteine residues. These molecules are responsible for immune cell transportation and shaping the immune system, are expressed by cancer cells, and play an important role in cancer progression and treatment outcomes (46). XCL1, also known as lymphotactin, is produced by T cells, NK cells, and Natural killer T (NKT) cells during infection and inflammatory responses (47). CX3CL1 is a chemokine involved in the anti-cancer function of lymphocytes (mainly NK cells, T cells, and dendritic cells), and its increased expression in tumors can improve the prognosis of cancer patients (48). In gliomas, the CXCL family contributes to the immunosuppressive microenvironment in gliomas and enhances the sensitivity of gliomas to chemotherapy (49). The CXCL12/CXCR4 axis has become a new target for cancer therapy, and many CXCL12/CXCR4 antagonists have been developed and validated and shown promising anti-cancer activity in tumor cells (50). In breast cancer, CCL8 and CCL21 are critical in targeting cancer cells, particularly by modifying stroma and immune cells through tumor suppressor mechanisms (51). In hepatocellular carcinoma, targeting tumor-infiltrating macrophages through the CCL2/CCR2 signaling pathway can be a therapeutic strategy (52). In gastric cancer, MiR484 inhibits proliferation, migration, invasion, and induces apoptosis by targeting CCL18 (53).In ICB gene sets, lymphocyte activation gene-3, CD233 (LAG3) is an activation marker of CD4+ and CD8+ T cells and can be used as a target for cancer immunotherapy (54). Some studies have shown that IDO1 inhibits the CD8+T cell response in colon cancer (55). Mir-448, as a tumor suppressor miRNA, enhances the CD8+T cell response by inhibiting IDO1 expression (55). CD200-CD200r1 signaling can reduce imiquimod-induced psoriatic skin inflammation by inhibiting macrophage activation (56). TNFSF4 is closely associated with therapies that induce anti-tumor immunity and may help induce a promising immune response in breast cancer (57). CTLA-4 has sequence homology with CD28 and is expressed on T cells after activation. They both regulate T cell proliferation and differentiation and play important roles in immune response pathways in vivo (58). In the immune activity-related gene set, CXCL9, also known as interferon-γ induced monocyte factor, is induced by interferon-gamma (IFN-γ) and mainly mediates lymphocyte infiltration into the lesion site and inhibits tumor growth (59). CXCL10 is a driver chemokine that affects CD4+ and CD8+ T cells, promoting anti-tumor immunity, and regulating autoimmunity (60).Granzymes (GZMs) have been recognized as key cell death executors of cytotoxic T and NK cells during cancer immunosurveillance. In immune surveillance, the cytotoxic molecules (i.e., perforin and Granzyme B) exert anti-tumor and anti-infective effect (61). In this study, we found that no matter what the gene concentration, almost all the genes were more highly expressed in the low-risk group than the high-risk group. Through literature reports, we also found that most genes have anti-inflammatory and anti-cancer effects, which is consistent with our conclusion.
Conclusions
In this study, we comprehensively analyzed the relationship between the expression of lncRNA in OS and the occurrence, development, and prognosis of OS. The abnormal expression of 6 lncRNAs (i.e., CARD8-AS1, FTX, KANSL1-AS1, NPTN-IT1, OLMALINC, and RPARP-AS1) was significantly associated with the progression of OS, and these lncRNAs could be used as independent markers of OS to assess the prognosis of patients with OS. In conclusion, our study identified a novel marker for assessing the prognosis of OS patients and provides important evidence for further studies on the role of OS-related genes in OS.The article’s supplementary files as
Authors: Kyoung Min Kim; Usama Khamis Hussein; See-Hyoung Park; Mi Ae Kang; Young Jae Moon; Zhongkai Zhang; Yiping Song; Ho Sung Park; Jun Sang Bae; Byung-Hyun Park; Sang Hoon Ha; Woo Sung Moon; Jung Ryul Kim; Kyu Yun Jang Journal: J Exp Clin Cancer Res Date: 2019-06-18