Literature DB >> 31173206

Identification of critical genes associated with human osteosarcoma metastasis based on integrated gene expression profiling.

Hongwu Fan1, Shan Lu2, Shengqun Wang1, Shanyong Zhang3.   

Abstract

Osteosarcoma is the most common type of malignant bone cancer, which often affects teenagers and young adults. The present study aimed to screen for critical genes and microRNAs (miRNAs/miRs) involved in osteosarcoma. A total of four microarray datasets (accession numbers GSE32981, GSE21257, GSE14827 and GSE14359) were downloaded from the Gene Expression Omnibus database. Following data preprocessing, module analysis was performed to identify the stable modules using the weighted gene co‑expression network analysis (WGCNA) package. The differentially expressed genes (DEGs) between metastatic samples and non‑metastatic samples were screened, followed by gene co‑expression network construction, and Gene Ontology function and Kyoto Encyclopedia of Genes and Genomes pathway analyses. Subsequently, prognosis‑associated genes were screened and a miRNA‑target gene regulatory network was constructed. Finally, the data for critical genes were validated. WGCNA analysis identified six modules; blue and yellow modules were significantly positively associated with osteosarcoma metastasis. A total of 1,613 DEGs were screened between primary tissue samples and metastatic samples. Following comparison of the genes in the two (blue and yellow) modules, a total of 166 DEGs were identified (metastatic samples vs. non‑metastatic samples). Functional enrichment analysis demonstrated that these DEGs were mainly involved in 'defense response', 'p53 signaling pathway' and 'lysosome'. By utilizing the clinical information in GSE21257, 10 critical genes associated with osteosarcoma prognosis were obtained, including CTP synthase 2 (CTPS2), tumor protein p53 inducible protein 3 (TP53I3) and solute carrier family 1 member 1 (SLC1A1). In addition, hsa‑miR‑422a and hsa‑miR‑194 were highlighted in the miRNA‑target gene network. Finally, matrix metallopeptidase 3 (MMP3) and vascular endothelial growth factor B (VEGFB) were predicted as critical genes in osteosarcoma metastasis. CTPS2, TP53I3 and SLC1A1 may serve major roles in osteosarcoma development, and hsa‑miR‑422a, hsa‑miR‑194, MMP3 and VEGFB may be associated with osteosarcoma metastasis.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31173206      PMCID: PMC6625205          DOI: 10.3892/mmr.2019.10323

Source DB:  PubMed          Journal:  Mol Med Rep        ISSN: 1791-2997            Impact factor:   2.952


Introduction

Osteosarcoma is the most common type of malignant bone cancer and is prevalent in teenagers and young adults (1). This type of cancer arises from primitive transformed cells of mesenchymal origin that exhibit osteoblastic differentiation and produce malignant osteoids (2). Despite considerable advances in surgery and chemotherapy, the 5-year survival rate remains at 60–70% and patients continue to succumb to osteosarcoma metastasis (3). Therefore, the exploration of novel strategies and noninvasive biomarkers that reflect disease progression is urgently required for the clinical management of patients with osteosarcoma. With the development of molecular biology techniques, tumor gene therapy for osteosarcoma exhibits a potential clinical strategy (4,5). A number of studies have demonstrated that abnormal gene expression, and alterations in microRNAs (miRNAs/miRs) and molecular signaling pathways contribute to the pathogenesis and development of osteosarcoma (6–8). These affected molecules may be considered potential diagnostic biomarkers and therapeutic targets for patients with osteosarcoma. C-X-C motif chemokine ligand 12 and matrix metallopeptidase 9 (MMP9) serve important roles in the metastasis of osteosarcoma (9). Ren et al (10) revealed that high expression levels of C-X-C motif chemokine receptor 4 and MMP9 are valuable biomarkers for osteosarcoma metastasis and survival rates. A recent study revealed that tumor protein p53 may inhibit cell proliferation and angiogenesis in osteosarcoma cell lines by inhibiting the phosphoinositide 3-kinase (PI3K)/protein kinase B (AKT)/mechanistic target of rapamycin pathway; therefore, it may be an effective novel therapeutic candidate against osteosarcoma in the future (11). In addition, Fas cell surface death receptor (Fas) is a death receptor, which has been reported to be involved in osteosarcoma metastasis. An inhibitor of the Fas pathway, c-FLIP, has been developed as a potential treatment for patients with lung metastasis (12). miRNAs are small non-coding RNA molecules (18–25 nt) and studies have revealed that miRNAs act as critical regulators involved in the pathological process of osteosarcoma (13,14). miR-30a serves as an oncogene, which regulates the proliferation, migration and invasion of human osteosarcoma by targeting runt-related transcription factor 2 (15). In addition, overexpression of miR-21 in the human osteosarcoma cell line MG63 has been reported to significantly increase cell proliferation and invasion (16). Phosphatase and tensin homolog (PTEN) may be a potential target gene of miR-21, and miR-21 may activate the PI3K/Akt pathway by decreasing PTEN expression (16). These previous studies may provide a comprehensive understanding of osteosarcoma development. Namløs et al (17) explored the potential mechanism underlying osteosarcoma and demonstrated that multiple signaling molecules serve a vital role in promoting metastasis. The present study, according to the gene expression profiles deposited by Namløs et al (17), aimed to identify metastasis-associated genes or miRNAs in osteosarcoma development and to improve the understanding of osteosarcoma metastasis. Firstly, the gene expression in metastatic osteosarcoma samples from four microarray datasets was compared with that in non-metastatic samples; subsequently, a number of differentially expressed genes (DEGs) and miRNAs were screened using the weighted gene co-expression network analysis (WGCNA) algorithm. Gene Ontology (GO) functional and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses were performed to identify the major signaling pathways involved in osteosarcoma. Subsequently, the gene co-expression network for these DEGs was constructed. Additionally, the miRNA-target gene network was constructed to screen the key miRNAs associated with disease prognosis. Finally, the critical genes and miRNAs were further verified based on validation dataset analysis. The results may provide novel diagnostic biomarkers and therapeutic target molecules in osteosarcoma metastasis.

Materials and methods

Data resources

The microarray datasets associated with osteosarcoma were downloaded from the National Center of Biotechnology Information Gene Expression Omnibus (GEO) database (www.ncbi.nlm.nih.gov/geo). The screening standards were as follows: The microarray datasets were gene expression profiles; the datasets were gene expression profiles associated with osteosarcoma tissue samples; osteosarcoma samples were of primary and metastatic origin; gene expression profiling of human osteosarcoma; and, the total number of osteosarcoma samples was >20. Datasets that did not meet any of these criteria were excluded. Eventually, four datasets were screened out for further analysis: GSE32981 (17), GSE21257 (18), GSE14827 (19) and GSE14359 (20) (Table I).
Table I.

Summary of microarray datasets.

GEO accession no.PlatformProbe numberTotal samplesMetastasis samplesNon-metastasis samplesPMID
GSE32981GPL3307-ABI14,7252318522518090
GSE21257GPL10295-Illumina48,70153341921372215
GSE14827GPL570-Affymetrix42,4502791820159990; 24448647
GSE14359GPL96-Affymetrix41,05934211321166698

GEO, Gene Expression Omnibus; PMID, PubMed unique identifier.

The GSE32981 dataset was tested based on the GPL3307 ABI Human Genome Survey Microarray v2.0 Array platform (Applied Biosystems; Thermo Fisher Scientific, Inc., Waltham, MA, USA), including 18 metastatic tissue samples and five non-metastatic samples. The GSE21257 dataset was tested based on the GPL10295 Illumina human-6 v2.0 expression beadchip (using nuIDs as identifiers) platform (Illumina, Inc., San Diego, CA, USA), including 34 metastatic tissue samples and 19 non-metastatic samples. The GSE14827 dataset was tested based on the GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array platform (Affymetrix; Thermo Fisher Scientific, Inc.), including nine metastatic tissue samples and 18 non-metastatic samples. The GSE14359 dataset was tested based on the GPL96 [HG-U133A] Affymetrix Human Genome U133A Array platform (Affymetrix; Thermo Fisher Scientific, Inc.), including 21 metastatic tissue samples and 13 non-metastatic samples.

Data preprocessing

The GSE14827 and GSE14359 gene expression profiles were preprocessed using oligo software version 1.41.1 (www.bioconductor.org/packages/release/bioc/html/oligo.html) (21) in the R3.4.1 package (22). The original microarray data were converted into gene symbols according to annotation information of the array platform. If several probes corresponded with the same gene, the average scores were calculated as the gene expression value of these probes. Quantile normalization in the preprocessCore package (23) was used to normalize the matrix. For the GSE32981 and GSE21257 datasets, the limma package (www.bioconductor.org/packages/release/bioc/html/limma.html) in R software 3.1.3 version (24) was used to preprocess the microarray data. The logarithmic value of each microarray data point was calculated and the gene expression data were converted from a skewed distribution to an approximately normal distribution. The median normalization method was used to normalize the microarray data.

Identification of gene modules associated with osteosarcoma

The WGCNA method was used to identify gene modules associated with osteosarcoma. WGCNA provides the topological properties of co-expression networks, in addition to the correlation of two node genes and relevant other genes (25). The WGCNA package version 1.61 (cran.r-project.org/web/packages/WGCNA/index.html) (26) in R3.4.1 software was used to screen for stable genetic modules. Since the GSE21257 dataset contained the most tumor samples and relatively abundant clinical information, it was used as the training dataset and the other three datasets served as validation datasets. Briefly, for the four datasets, the expression correlation between any two datasets was first calculated, followed by adjacency function definition and module division (the threshold of module screening and division was set as follows: The modules contained at least 150 RNA and cutHeight=0.99). Furthermore, the correlation between each module and clinical information provided by the GSE21257 dataset was analyzed. The clinical information in GSE21257, including age, histological subtype, tumor location and stage is presented in Table II.
Table II.

Clinical information of GSE21257.

Source nameAge (years)GroupHistological subtypeMortality (months)Tumor location
GSM53066714.5Metastases present at diagnosisGiant cell richSuccumbed after 27Left distal femur
GSM53089913.5Metastases at 16 months following diagnosis of the primary tumorOsteoblasticSuccumbed at 21 following diagnosis of the primary tumorLeft proximal humerus
GSM53128314.2Metastases present at diagnosisOsteoblasticAlive at 46 after diagnosis of the primary tumorLeft distal femur
GSM53128411.1Metastases at 10 months following diagnosis of the primary tumorFibroblasticAlive at 28 after diagnosis of the primary tumorLeft distal femur
GSM5312856.8Metastases at 5 months following diagnosis of the primary tumorOsteoblasticSuccumbed at 11 following diagnosis of the primary tumorHumerus
GSM53128612.0Metastases at 30 months following diagnosis of the primary tumorChondroblasticAlive at 37 after diagnosis of the primary tumorTibia
GSM53128715.0Metastases present at diagnosisSclerosingAlive at 45 after diagnosis of the primary tumorLeft distal femur
GSM53128815.0Metastases present at diagnosisChondroblasticSuccumbed at 13 following diagnosis of the primary tumorRight proximal tibia
GSM53128917.0Metastases at 9 months following diagnosis of the primary tumorOsteoblasticSuccumbed at 33 following diagnosis of the primary tumorLeft femur
GSM53129015.1Metastases at 25 months following diagnosis of the primary tumorSclerosingAlive at 25 after diagnosis of the primary tumorRight proximal tibia
GSM53129179.0Metastases at 12 months after diagnosis of the primary tumorOsteoblasticSuccumbed at 18 following diagnosis of the primary tumorHumerus
GSM53129216.7Metastases present at diagnosisOsteoblasticSuccumbed at 30 following diagnosis of the primary tumorRight proximal femur
GSM53129317.1Metastases at 9 months after diagnosis of the primary tumorChondroblasticSuccumbed at 35 following diagnosis of the primary tumorLeft distal femur
GSM53129410.8Metastases present at diagnosisChondroblasticSuccumbed at 4 following diagnosis of the primary tumorLeft distal femur
GSM53129515.3Metastases present at diagnosisOsteoblasticSuccumbed at 27 following diagnosis of the primary tumorDistal femur
GSM53129618. 3Metastases present at diagnosisOsteoblasticAlive at 26 after diagnosis of the primary tumorRight proximal tibia
GSM53129732.1Metastases at 10 months after diagnosis of the primary tumorPossibly chondromyxoid fibroma-likeSuccumbed at 18 following diagnosis of the primary tumorRight humerus
GSM53129839.0Metastases at 36 months after diagnosis of the primary tumorFibroblasticSuccumbed at 189 following diagnosis of the primary tumorUnknown
GSM53129922Metastases at 10 months after diagnosis of the primary tumorOsteoblasticAlive at 36 after diagnosis of the primary tumorFemur
GSM53130019Metastases at 24 months after diagnosis of the primary tumorOsteoblasticAlive at 123 after diagnosis of the primary tumorFemur
GSM53130112Metastases at 44 months after diagnosis of the primary tumorOsteoblasticSuccumbed at 110 following diagnosis of the primary tumorFemur
GSM53130217No metastasesFibroblasticAlive at 63 after diagnosis of the primary tumorHumerus
GSM53130322No metastasesFibroblasticAlive at 60 after diagnosis of the primary tumorHumerus
GSM53130458No metastasesTelangiectaticAlive at 60 after diagnosis of the primary tumorTibia
GSM53130528No metastasesFibroblasticAlive at 60 after diagnosis of the primary tumorFemur
GSM53130616.7Metastases at 6 months after diagnosis of the primary tumorOsteoblasticSuccumbed at 10 following diagnosis of the primary tumorLeft proximal tibia
GSM53130718.6Metastases at 21 months after diagnosis of the primary tumorChondroblasticSuccumbed at 39 following diagnosis of the primary tumorLeft proximal tibia
GSM53130814.8Metastases present at diagnosisOsteoblasticAlive at 95 after diagnosis of the primary tumorLeft distal femur
GSM53130917. 7Metastases at 17 months after diagnosis of the primary tumorOsteoblasticSuccumbed at 83 following diagnosis of the primary tumorRight distal femur
GSM53131018No metastasesOsteoblasticAlive at 246 after diagnosis of the primary tumorLeft distal femur
GSM53131116.7Metastases present at diagnosisTelangiectaticSuccumbed at 25 following diagnosis of the primary tumorRight distal femur
GSM53131213.7Metastases at 18 months after diagnosis of the primary tumorTelangiectaticSuccumbed at 40 following diagnosis of the primary tumorRight proximal tibia
GSM53131314.6No metastasesOsteoblasticAlive at 143 after diagnosis of the primary tumorRight distal tibia
GSM53131416.7Metastases present at diagnosisOsteoblasticSuccumbed at 11 following diagnosis of the primary tumorLeft proximal humerus
GSM5313198.4No metastasesOsteoblasticAlive at 105 after diagnosis of the primary tumorDiaphysis of left femur
GSM53132011.3No metastasesOsteoblasticAlive at 78 after diagnosis of the primary tumorLeft proximal tibia
GSM53132140.6No metastasesAnaplasticAlive at 97 after diagnosis of the primary tumorLeft proximal femur
GSM53132216.5Metastases at 10 months after diagnosis of the primary tumorOsteoblasticSuccumbed at 33 following diagnosis of the primary tumorLeft proximal tibia
GSM53132311.4No metastasesOsteoblasticAlive at 77 after diagnosis of the primary tumorLeft proximal tibia
GSM53132425.3Metastases at 27 months after diagnosis of the primary tumorOsteoblasticSuccumbed at 47 following diagnosis of the primary tumorLeft proximal tibia
GSM53132519.1No metastasesChondroblasticAlive at 120 after diagnosis of the primary tumorRight proximal humerus
GSM53132620.2No metastasesOsteoblasticAlive at 91 after diagnosis of the primary tumorRight distal femur
GSM53132713.75Metastases present at diagnosisOsteoblasticSuccumbed at 29 following diagnosis of the primary tumorLeft distal femur
GSM53132818.2Metastases at 24 months after diagnosis of the primary tumorOsteoblasticAlive at 32 after diagnosis of the primary tumorLeft proximal fibula
GSM5313298Metastases present at diagnosisOsteoblasticAlive at 31 after diagnosis of the primary tumorLeft distal femur
GSM53133010.7Metastases present at diagnosisOsteoblasticSuccumbed at 25 following diagnosis of the primary tumorRight distal femur
GSM53133118.1No metastasesOsteoblasticAlive at 219 after diagnosis of the primary tumorLeft distal femur
GSM53133216.0No metastasesOsteoblasticAlive at 193 after diagnosis of the primary tumorRight proximal tibia
GSM5313339.3No metastasesPleomorphicAlive at 184 after diagnosis of the primary tumorRight distal femur
GSM5313343.1No metastasesOsteoblasticAlive at 194 after diagnosis of the primary tumorLeft proximal tibia
GSM53133520.3No metastasesAnaplasticAlive at 94 after diagnosis of the primary tumorRight distal femur
GSM53135118.1No metastasesOsteoblasticAlive at 87 after diagnosis of the primary tumorRight proximal fibula
GSM53135216.0No metastasesOsteoblasticAlive at 60 after diagnosis of the primary tumorFemur

Meta-analysis for DEG screening

The MetaDE package (27,28) (cran.r-project.org/web/packages/MetaDE) in R3.4.1 software was used to screen consistent DEGs between metastatic and non-metastatic samples from the four datasets (GSE21257, GSE32981, GSE14827 and GSE14359). τ2=0, Qpval >0.05 and false discovery rate <0.05 were considered as the thresholds. The first two parameters were used for heterogeneity testing and the last parameter was used to evaluate significant differences. After screening the DEGs based on the MetaDE method, these genes were compared with those from screening stable gene modules according to the WGCNA method. The common genes in these two sets of DEGs were deemed the common critical genes. In addition, based on these gene interactions, the gene co-expression network was constructed and visualized using Cytoscape 3.3 (29) (www.cytoscape.org). GO function (biological process, molecular function and cellular component) and KEGG pathway enrichment analyses were performed using the online search annotation software tool Database for Annotation, Visualization and Integrated Discovery (30) (version 6.8, david.ncifcrf.gov). P<0.05 was considered to indicate a statistically significant difference.

Critical genes screening associated with osteosarcoma prognosis

Based on the key node sets in the gene co-expression network, combined with the clinical prognostic information of samples, the critical genes associated with osteosarcoma prognosis were identified using the univariate Cox regression analysis in Survival package (31) (version 2.4, cran.r-project.org/web/packages/survival/index.html) in R3.4.1 software. Survival data were plotted using Kaplan-Meier analysis and Log-Rank test was used to compare the statistical significance. P<0.05 was considered to indicate a statistically significant difference.

Construction of miRNA-target gene regulatory network

The miRNAs directly associated with osteosarcoma were searched from the miR2Disease database (32) (http://watson.compbio.iupui.edu:8080/miR2Disease/index.jsp). Each entry in miR2Disease contains information about a miRNA and its association with disease, in addition to the ID of the miRNA, disease name and a brief description of the miRNA-disease association, references and detection methods of miRNA expression. ‘Osteosarcoma’ was used as the disease name to screen key miRNAs associated with osteosarcoma in this database. Furthermore, the target genes of miRNAs were searched using miRanda database (33) (www.microrna.org/microrna/home.do). Finally, a miRNA-target gene regulatory network was constructed. Cytoscape 3.3 software was used to visualize the interactions among miRNAs and related target genes.

Validation of critical genes

To validate the universality of critical genes, the GSE39055 (34) expression profile (platform GPL14951 Illumina HumanHT-12 WG-DASL V4.0 R2 expression beadchip) was downloaded from the GEO database as a novel validation dataset. This dataset included 37 osteosarcoma samples that possessed associated survival rate information. This dataset was used to validate the associations between key genes and survival outcomes. Additionally, the expression levels of these key genes between metastatic and non-metastatic samples in GSE21257, GSE32981, GSE14827 and GSE14359 were analyzed. The analysis flow chart is presented in Fig. 1.
Figure 1.

Analysis process for the four microarray datasets. DEGs, differentially expressed genes; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; miRNA/miR, microRNA; WGCNA, weighted gene co-expression network analysis.

Results

Identification of stable modules associated with osteosarcoma

Following the normalization of the four datasets, GSE21257 was used as the training set and the other three were considered validation datasets. The WGCNA algorithm was used to screen for significant modules associated with osteosarcoma. The processes were as follows: First, consensus analysis was performed for the overlapping genes in the four datasets. The results revealed that the correlation values between either of two datasets were >0.5 and all P-values were <1×10−200, which indicated that the data were comparable (Fig. 2A). The cluster dendrogram based on the all modules is presented in Fig. 2B. Second, in order to satisfy the precondition of scale-free network distribution, the weight parameter (power) of the adjacency matrix was fixed by setting the selection range of network parameter and calculating the scale-free topological matrix. The scale-free distribution of the topological matrix was calculated based on the GSE21257 dataset and the results are presented in Fig. 3A (‘scale independence’). The horizontal axis represents weight parameters of the power, while the vertical axis represents the square values of correlation coefficient between log (k) and log [p (k)]. A higher square value meant the scale-free distribution of these data. Once the square value reached 0.9 for the first time, the power value (power=7) was selected and the mean connectivity of genes calculated. As presented in Fig. 3A (‘mean connectivity’), the mean connectivity of genes was only 1, which is in line with the connectivity feature of nodes in a scale-free small network.
Figure 2.

Identification of stable gene modules associated with osteosarcoma as determined by weighted gene co-expression network analysis. (A) Correlation values between any two datasets from GSE21257, GSE32981, GSE14827 and GSE14359. The charts represent correlations between GSE21257-GSE32981, GSE21257-GSE14359, GSE21257-GSE14827, GSE32981-GSE14359, GSE32981-GSE14827 and GSE14359-GSE14827. (B) Cluster dendrogram based on the dynamic tree (GSE21257, GSE32981, GSE14827 and GSE14359). Different dendrogram colors represent various modules.

Figure 3.

Assessment of the stability of the modules. (A) Adjacency function definition for the genes. The left chart represents the power selection diagram of adjacency matrix weight parameter. The horizontal axis represents weight parameters of the power, while the vertical axis represents the square values of correlation coefficient between log (k) and log [p (k)]. A higher square value indicates the scale-free distribution of these data. The red line represents the standard line while square value reached 0.9. The right chart represents the mean connectivity of genes under different adjacency matrix weight parameters. (B) Multidimensional scaling plot of genes in each module. The X- and Y-axes represent the first and second principal components, respectively. (C) Cluster dendrogram of modules in the four datasets, GSE21257, GSE32981, GSE14359 and GSE14827. (D) Heat map for the correlation between each module and clinical factors. The horizontal axis represents clinical factors and the vertical axis represents different colored modules; the color changes from green to pink indicate changes from negative to positive, the numbers in the grid indicate the correlation coefficient and the numbers in parentheses indicate the significance of the correlation (P-value).

Subsequently, the gene dendrogram and modules were identified based on WGCNA. The GSE21257 dataset was used as the training set to screen the modules associated with osteosarcoma. The dissimilarity in easements of different nodes was calculated and a hierarchical clustering tree was generated. Based on the dynamic tree, the minimum number of genes for each network was set as 100 and the cut height was set to 0.99. A total of six modules were obtained, namely M1-blue, M2-brown, M3-green, M4-grey, M5-turquoise and M6-yellow (Table III).
Table III.

Preservation of modules associated with microarray datasets.

ModuleColorModule sizePreservation Z-score
1Blue90829.979
2Brown3542.198
3Green1647.264
4Grey4615.492
5Turquoise1,0922.576
6Yellow18318.147
Finally, the stability of gene modules was evaluated. The other three datasets (GSE32981, GSE14827 and GSE14359) were also subjected to module partition and the stability of modules obtained from the GSE21257 dataset was evaluated. The results of module partition in GSE32981, GSE14827 and GSE14359 datasets are presented in Fig. 2B. The correlation of modules in each dataset is presented in Fig. 3B and C. The genes in the same module (same color) were inclined to cluster together, indicating that these genes had similar expression levels. The overall expression of modules on the same dendrogram branch was more similar, including brown and yellow modules, in addition to blue, green and turquoise modules. Following analysis of the correlation of gene expression for the same-colored modules, three modules (M1-blue, M3-green and M6-yellow) with preservation Z scores >5 were identified, which were considered significantly stable modules. The three stable modules may be major functional modules associated with osteosarcoma. According to the clinical information provided by the GSE21257 training dataset, the correlation between each module and clinical factors was analyzed. As presented in Fig. 3D, genes in blue and yellow modules were significantly positively correlated with osteosarcoma metastasis (correlation coefficient values, 0.51 and 0.25; P-values, 9×10−205 and 7×10−46, respectively). Finally, the 1,091 genes in these two modules were selected for further analysis.

DEG screening and gene co-expression network analysis

Based on the thresholds, a total of 1,613 consistent DEGs were screened out between the osteosarcoma primary tissue samples and metastatic samples. These DEGs were subjected to hierarchical clustering analysis using the MetaDE package (Fig. 4A). The cluster analysis revealed that DEGs screened from four datasets could accurately distinguish primary osteosarcoma samples from metastatic samples. Subsequently, these genes were compared with the 1,091 genes obtained from blue and yellow modules, and 166 common genes were obtained (Fig. 4B).
Figure 4.

DEG screening and gene co-expression network analysis. (A) Heat map for the significant DEGs. Black bars represent metastatic osteosarcoma samples and white bars represent non-metastatic osteosarcoma samples. (B) Venn diagram of key genes screened according to the WGCNA method and using the MetaDE package. (C) Co-expression network of overlapping genes. Blue and yellow represent the genes screened from blue and yellow modules, respectively. The equilateral and inverted triangles represent upregulated genes and downregulated genes; the green and gray lines represent negative and positive correlations, respectively. DEGs, differentially expressed genes; WGCNA, weighted gene co-expression network analysis.

Based on the expression correlation among the 166 genes, a gene co-expression network was constructed that consisted of 166 nodes (28 from yellow module and 138 from blue module) and 1,344 edges (434 negative correlation and 910 positive correlation). Among these nodes, 28 were upregulated genes and 138 were downregulated (Fig. 4C). To further explore the function of these 166 DEGs, GO function and KEGG pathway analyses were performed. The results demonstrated that these DEGs were enriched in several functional terms and pathways (Fig. 5 and Table IV). The DEGs were mainly involved in the following GO terms: ‘Defense response’, ‘extracellular region’, ‘calcium ion binding’, etc. The major pathways the DEGs were involved in were ‘lysosome’, ‘cytokine-cytokine receptor interaction’, ‘chemokine signaling pathway’, ‘p53 signaling pathway,’ ‘ECM-receptor interaction’, ‘cell cycle’ and ‘focal adhesion’.
Figure 5.

Functional annotation of the key overlapping genes in the co-expression network. (A) GO annotation. The horizontal axis represents the number of genes and the vertical axis represents the name of the GO terms. The size of the dot represents a significant P-value; larger dots and lower P-values indicate a higher significance. (B) Kyoto Encyclopedia of Genes and Genomes pathway analysis for genes in the network. The color changes from purple to light pink represent changes in significance from high to low. The numbers in each component represent the number of genes involved in a pathway. GO, Gene Ontology.

Table IV.

GO terms and KEGG pathways for the critical DEGs in the gene co-expression network.

TermCountP-value
Biological process
  GO:0006952-defense response176.710×10−4
  GO:0070271-protein complex biogenesis142.379×10−3
  GO:0006461-protein complex assembly142.379×10−3
  GO:0006954-inflammatory response106.995×10−3
  GO:0065003-macromolecular complex assembly159.548×10−3
  GO:0007267-cell-cell signaling149.895×10−3
  GO:0006955-immune response151.289×10−2
  GO:0043933-macromolecular complex subunit organization151.618×10−2
  GO:0042127-regulation of cell proliferation153.508×10−2
  GO:0010033-response to organic substance143.794×10−2
Cellular component
  GO:0005615-extracellular space229.670×10−6
  GO:0044421-extracellular region part241.680×10−4
  GO:0005576-extracellular region377.410×10−4
  GO:0000267-cell fraction219.733×10−3
  GO:0044459-plasma membrane part351.210×10−2
  GO:0005624-membrane fraction162.361×10−2
  GO:0005886-plasma membrane522.390×10−2
  GO:0005626-insoluble fraction163.132×10−2
  GO:0031988-membrane-bounded vesicle123.805×10−2
  GO:0005887-integral to plasma membrane204.399×10−2
  GO:0046983-protein dimerization activity123.398×10−2
  GO:0005509-calcium ion binding174.100×10−2
  GO:0042802-identical protein binding134.578×10−2
KEGG pathway
  hsa04142: Lysosome61.733×10−3
  hsa04060: Cytokine-cytokine receptor interaction91.914×10−3
  hsa04115: p53 signaling pathway45.762×10−3
  hsa04062: Chemokine signaling pathway69.418×10−3
  hsa05200: Pathways in cancer81.309×10−2
  hsa04512: ECM-receptor interaction32.978×10−2
  hsa04110: Cell cycle34.860×10−2
  hsa04510: Focal adhesion44.878×10−2

GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Screening critical genes associated with osteosarcoma metastasis

By combining the clinical information of the GSE21257 dataset, the critical genes associated with osteosarcoma metastasis were identified using a Cox regression model. Eventually, 10 genes associated with osteosarcoma prognosis were obtained (Table V). For the top three genes with higher P-values compared with the other DEGs, a Kaplan-Meier survival curve analysis was performed. All samples were divided into high expression and low expression groups in terms of their median numerical boundary (Fig. 6). The results demonstrated that tumor samples with high CTP synthase 2 (CTPS2) and tumor protein p53 inducible protein 3 (TP53I3) expression were associated with improved survival outcome. The expression levels of these genes were downregulated in metastatic tumor samples, indicating that these patients had a worse prognosis. Furthermore, the hazard ratio values were <1, meaning that these genes may be major factors for promoting osteosarcoma metastasis. Conversely, high expression of solute carrier family 1 member 1 (SLC1A1) was associated with a worse prognostic effect. In terms of expression, SLC1A1 was increased in metastatic tumor samples, which indicated that these patients had a worse prognosis. In addition, its hazard ratio value was >1 (Table V), thus suggesting that SLC1A1 expression may be a risk factor for osteosarcoma metastasis.
Table V.

Critical genes associated with the prognosis of osteosarcoma.

GeneP-valueHazard ratio95% CI
SLC1A10.0133.4971.299–9.414
CTPS20.0340.4790.204–1.126
TP53I30.0340.5050.247–1.031
CIB10.0410.5960.265–1.339
VEGFB0.0410.5550.224–1.373
ITGB50.0450.5190.233–1.159
MMP30.0450.5180.127–2.112
MOSPD20.0460.8200.320–2.099
FAP0.0470.7600.538–1.075
CLC0.0470.7260.246–2.137

CI, confidence interval.

Figure 6.

Kaplan-Meier survival curve analysis for the top three genes, (A) SLC1A1, (B) CTPS2 and (C) TP53I3, associated with the prognosis of osteosarcoma. The black and red curves represent low expression and high expression sample groups, respectively. CTPS2, CTP synthase 2; SLC1A1, solute carrier family 1 member 1; TP53I3, tumor protein p53 inducible protein 3.

Analysis of miRNA-target gene network

Following searches in the miR2Disease database, seven miRNAs and multiple target genes associated with osteosarcoma were obtained (Table VI). The interactions between miRNAs and their target genes were visualized by a biological network (Fig. 7). The miRNA-target gene network consisted of 48 nodes, including seven miRNAs (including hsa-miR-422a, hsa-miR-145 and hsa-miR-194), six disease prognosis-associated DEGs [CTPS2, fibroblast activation protein α (FAP), SLC1A1, MMP3, motile sperm domain containing 2 (MOSPD2) and vascular endothelial growth factor B (VEGFB)] and 35 DEGs co-expressed with critical genes or miRNAs.
Table VI.

Critical miRNAs related to the prognosis of OS.

Disease miRNAOS DEGsPMID
hsa-miR-422aCTPS220949564
hsa-miR-145FAP, SLC1A126339404
hsa-miR-194FAP, SLC1A126339404
hsa-miR-93MMP328260111
hsa-miR-125bMOSPD2, VEGFB25661090
hsa-miR-193a-3pSLC1A120949564
hsa-miR-140-5pVEGFBPMC2783211

DEGs, differentially expressed genes; miRNA/miR, microRNA; OS, osteosarcoma; PMID, PubMed unique identifier.

Figure 7.

miRNA-target gene network associated with osteosarcoma. Blue and yellow colors represent the genes screened from blue and yellow modules, respectively. The equilateral and inverted triangles represent upregulated and downregulated genes, respectively. The red squares refer to miRNAs associated with osteosarcoma. The green and gray lines represent the negative and positive connections. The red lines represent the interactions between miRNAs and target genes. miRNA/miR, microRNA.

KEGG analysis was performed to identify the major pathways for the miRNAs and target genes. Eventually, nine pathways were identified, including ‘pathways in cancer’, ‘ubiquitin-mediated proteolysis’ and the ‘Ras signaling pathway’ (Table VII). Among the genes, VEGFB was identified to be involved in several signaling pathways (‘pathways in cancer’, ‘focal adhesion’, ‘Ras signaling pathway’ and ‘cytokine-cytokine receptor interaction’), meaning that it may serve a vital role in the development of osteosarcoma.
Table VII.

Kyoto Encyclopedia of Genes and Genomes pathway analysis for the target genes in the regulatory network.

TermCountP-valueGenes
hsa05200: Pathways in cancer30.029VEGFB, WNT5B, FN1
hsa04120: Ubiquitin mediated proteolysis20.032NEDD4, UBE2S
hsa00230: Purine metabolism20.039PDE7B, POLA2
hsa05205: Proteoglycans in cancer20.043WNT5B, FN1
hsa04510: Focal adhesion20.044VEGFB, FN1
hsa01100: Metabolic pathways50.044MAN2A1, TDO2, B4GALT3, CTPS2, POLA2
hsa04810: Regulation of actin cytoskeleton20.045GSN, FN1
hsa04014: Ras signaling pathway20.047VEGFB, RIN1
hsa04060: Cytokine-cytokine receptor interaction20.047VEGFB, CXCL14

Expression and prognostic validation

To validate the general characteristics of the six prognosis-associated DEGs involved in the miRNA-target network, a different dataset, GSE39055 (including 37 osteosarcoma samples and related survival outcome information), was applied as a validation dataset to verify the association between the genes and survival outcome. The results demonstrated that the prognostic analysis of six genes in the GSE39055 validation dataset was consistent with that in the GSE21257 dataset (Figs. 8 and 9). Among these genes, MMP3 (Fig. 8B) and SLC1A1 (Fig. 9C) had improved survival outcomes in low expression groups of osteosarcoma samples. In addition, the expression levels of SLC1A1 were upregulated in metastatic osteosarcoma samples and patients with high expression of this gene exhibited worse survival outcomes, indicating that SLC1A1 expression may be a risk factor for osteosarcoma metastasis. In addition, VEGFB, CTPS2, MOSPD2 and FAP had improved survival outcomes in high expression groups of osteosarcoma samples (Figs. 8 and 9). The expression levels of these genes were downregulated in metastatic osteosarcoma samples and patients exhibited worse survival outcomes, indicating that decreased expressions of these four genes (VEGFB, CTPS2, MOSPD2 and FAP) may be risk factors for osteosarcoma metastasis.
Figure 8.

Expression and prognostic validation for critical genes, including (A) VEGFB, (B) MMP3 and (C) CTPS2. The black and red curves represent low expression and high expression osteosarcoma sample groups, respectively. CTPS2, CTP synthase 2; MMP3, matrix metallopeptidase 3; VEGFB, vascular endothelial growth factor B.

Figure 9.

Expression and prognostic validation for critical genes, including (A) MOSPD2, (B) FAP and (C) SLC1A1. The black and red curves represent low expression and high expression osteosarcoma sample groups, respectively. FAP, fibroblast activation protein α; MOSPD2, motile sperm domain containing 2; SLC1A1, solute carrier family 1 member 1.

Discussion

In the present study, gene expression profile analysis (accession numbers: GSE32981, GSE21257, GSE14827 and GSE14359) was performed and a total of 166 critical DEGs were identified in metastatic osteosarcoma tissue samples compared with non-metastatic samples, including 28 upregulated genes and 138 downregulated genes. Functional enrichment analysis results demonstrated that these DEGs were mainly enriched in ‘defense response’, ‘lysosome’ and ‘p53 signaling pathway’. In the gene co-expression network, CTPS2, TP53I3 and SLC1A1 were key nodes and may be considered risk factors for osteosarcoma metastasis. In addition, hsa-miR-422a and hsa-miR-194 were highlighted in the miRNA-target gene network. Finally, MMP3 and VEGFB were predicted as critical genes in osteosarcoma metastasis. CTPS2 is a critical enzyme that controls the synthesis of cytosine nucleotides, and CTPS2 serves a vital role in numerous metabolic processes (35). Cancer cells that exhibit increased cell proliferation also exhibit increased activity of CTPS2. Patients with colorectal cancer with low CTPS2 expression did not receive a survival benefit from 5-fluorouracil treatment (P=0.072), whereas those with high expression did (P=0.003); therefore, low CTPS2 expression may be a major determinant for chemoresistance (36). TP53I3 encodes the putative quinone oxidoreductase, an enzyme that is involved in cellular responses to oxidative stress and irradiation in humans (37). TP53I3 is involved in p53-mediated cell death and can be induced by the tumor suppressor p53 (38). A recent study indicated that p53 is able to directly regulate target genes, including TP53I3, associated with several drug treatments in an osteosarcoma cell line (39). SLC1A1, also known as excitatory amino-acid transporter 3, is a high-affinity glutamate transporter (40). This protein serves an essential role in glutamate transport from plasma membranes to neurons. However, studies on SLC1A1 in osteosarcoma are few. In the present study, CTPS2, TP53I3 and SLC1A1 were abnormally expressed in metastatic osteosarcoma tissue samples, thus suggesting that these three genes may be considered as prognostic biomarkers of osteosarcoma. The miRNA-target gene network demonstrated that several miRNAs were involved in osteosarcoma prognosis. Downregulation of miR-422a has been reported to be associated with poor prognosis in human osteosarcoma (41). Increased expression levels of miR-422a can inhibit cell proliferation and invasion, and can enhance chemosensitivity in osteosarcoma cells (42). Zhang et al (43) demonstrated that miR-422a may serve as a tumor inhibitor in osteosarcoma via suppression of BCL2 like 2 (BCL2L2) and KRAS proto-oncogene, GTPase (KRAS) translation. Therefore, miR-442a may be involved in the progression of osteosarcoma via targeting BCL2L2 and KRAS. In addition, hsa-miR-194 has been reported to be a major factor involved in tumor progression and metastasis (44). Han et al (45) indicated that miR-194 may suppress osteosarcoma cell proliferation and metastasis in vitro and in vivo by targeting cadherin 2 and insulin-like growth factor 1 receptor. Therefore, these findings suggested that hsa-miR-422a and hsa-miR-194 may be critical molecules in osteosarcoma metastasis. MMP3 or stromelysin-1 is an enzyme that regulates the breakdown of extracellular matrix proteins in normal physiological processes, in addition to in disease processes, including tumor metastasis or arthritis (46). The MMP family, including MMP3, has been confirmed to interact with collagen type I α 2 chain and serve an important role in osteosarcoma tumorigenesis (47). A previous study revealed that upregulation of MMP13 can result in suppression of osteosarcoma metastasis in a mouse model (48). As a member of the MMP family, MMP3 may be an important diagnostic marker for osteosarcoma metastasis. VEGFs are signaling proteins produced by cells that stimulate the formation of blood vessels, which is an important process for tumorigenesis and metastasis (49). It has also reported the abnormal expression of VEGF in osteosarcoma and explored the prognostic value in cancer patients (50). VEGFB belongs to the VEGF family and serves a role in maintaining newly formed blood vessels during pathological conditions (51). It has been reported that VEGF correlates with a poor histologic response to chemotherapy in osteogenic sarcoma (52). In the present study, the expression levels of MMP3 and VEGFB were downregulated in metastatic osteosarcoma samples and these patients exhibited worse survival outcomes. The survival analysis of MMP3 for GSE21257 training set is consistent with that of the GSE39055 validation set in the present study. Together with the present findings, it was suggested that these two genes may be risk factors for promoting osteosarcoma metastasis. However, certain limitations existed in the present study. One important factor is the lack of experimental verification. The number of osteosarcoma samples in metastatic or non-metastatic groups was small; the clinical information in microarray datasets was also scarce. Therefore, more studies are required to explore the diagnostic roles of critical genes and miRNAs in osteosarcoma metastasis; for example, the expression levels of the critical genes and miRNAs in osteosarcoma samples should be validated. Besides, the roles of these genes and miRNAs in predicting the prognosis of osteosarcoma should be analyzed in a large sample size in future studies. In conclusion, the present study identified 28 upregulated and 138 downregulated genes in metastatic osteosarcoma samples. The DEGs were associated with ‘defense response’, ‘p53 signaling pathway’ and ‘lysosome’. In addition, it was revealed that CTPS2, TP53I3 and SLC1A1 may serve a major role in osteosarcoma metastasis, and hsa-miR-422a, hsa-miR-194, MMP3 and VEGFB may also be associated with the metastasis of osteosarcoma. The present study provided a reliable strategy to discover non-invasive biomarkers for osteosarcoma prognosis.
  50 in total

1.  Identification of a cDNA encoding an isoform of human CTP synthetase.

Authors:  A B van Kuilenburg; R Meinsma; P Vreken; H R Waterham; A H van Gennip
Journal:  Biochim Biophys Acta       Date:  2000-07-24

Review 2.  Cytogenetics and molecular biology of osteosarcoma.

Authors:  Brian D Ragland; Walter C Bell; Robert R Lopez; Gene P Siegal
Journal:  Lab Invest       Date:  2002-04       Impact factor: 5.662

3.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

4.  Effect of normalization on significance testing for oligonucleotide microarrays.

Authors:  Rudolph S Parrish; Horace J Spencer
Journal:  J Biopharm Stat       Date:  2004-08       Impact factor: 1.051

5.  Expression of MMP2, MMP9 and MMP3 in breast cancer brain metastasis in a rat model.

Authors:  Odete Mendes; Hun-Taek Kim; George Stoica
Journal:  Clin Exp Metastasis       Date:  2005       Impact factor: 5.150

6.  A polymorphic microsatellite that mediates induction of PIG3 by p53.

Authors:  Ana Contente; Alexandra Dittmer; Manuela C Koch; Judith Roth; Matthias Dobbelstein
Journal:  Nat Genet       Date:  2002-02-04       Impact factor: 38.330

7.  MDM2 oncogene as a target for cancer therapy: An antisense approach.

Authors:  H Wang; X Zeng; P Oliver; L P Le; J Chen; L Chen; W Zhou; S Agrawal; R Zhang
Journal:  Int J Oncol       Date:  1999-10       Impact factor: 5.650

8.  Neoadjuvant chemotherapy with high-dose Ifosfamide, high-dose methotrexate, cisplatin, and doxorubicin for patients with localized osteosarcoma of the extremity: a joint study by the Italian and Scandinavian Sarcoma Groups.

Authors:  Stefano Ferrari; Sigbjorn Smeland; Mario Mercuri; Franco Bertoni; Alessandra Longhi; Pietro Ruggieri; Thor A Alvegard; Piero Picci; Rodolfo Capanna; Gabriella Bernini; Cristoph Müller; Amelia Tienghi; Thomas Wiebe; Alessandro Comandone; Tom Böhling; Adalberto Brach Del Prever; Otte Brosjö; Gaetano Bacci; Gunnar Saeter
Journal:  J Clin Oncol       Date:  2005-10-24       Impact factor: 44.544

9.  Bioconductor: open software development for computational biology and bioinformatics.

Authors:  Robert C Gentleman; Vincent J Carey; Douglas M Bates; Ben Bolstad; Marcel Dettling; Sandrine Dudoit; Byron Ellis; Laurent Gautier; Yongchao Ge; Jeff Gentry; Kurt Hornik; Torsten Hothorn; Wolfgang Huber; Stefano Iacus; Rafael Irizarry; Friedrich Leisch; Cheng Li; Martin Maechler; Anthony J Rossini; Gunther Sawitzki; Colin Smith; Gordon Smyth; Luke Tierney; Jean Y H Yang; Jianhua Zhang
Journal:  Genome Biol       Date:  2004-09-15       Impact factor: 13.583

10.  The microRNA.org resource: targets and expression.

Authors:  Doron Betel; Manda Wilson; Aaron Gabow; Debora S Marks; Chris Sander
Journal:  Nucleic Acids Res       Date:  2007-12-23       Impact factor: 16.971

View more
  14 in total

1.  Exploration of Potential Ewing Sarcoma Drugs from FDA-Approved Pharmaceuticals through Computational Drug Repositioning, Pharmacogenomics, Molecular Docking, and MD Simulation Studies.

Authors:  Mubashir Hassan; Muhammad Yasir; Saba Shahzadi; Andrzej Kloczkowski
Journal:  ACS Omega       Date:  2022-06-01

2.  A model of twenty-three metabolic-related genes predicting overall survival for lung adenocarcinoma.

Authors:  Zhenyu Zhao; Boxue He; Qidong Cai; Pengfei Zhang; Xiong Peng; Yuqian Zhang; Hui Xie; Xiang Wang
Journal:  PeerJ       Date:  2020-09-24       Impact factor: 2.984

3.  MiR-25-3p Serves as an Oncogenic MicroRNA by Downregulating the Expression of Merlin in Osteosarcoma.

Authors:  Hua-Chun Rao; Zhao-Ke Wu; Si-da Wei; Yun Jiang; Qing-Xin Guo; Jia-Wen Wang; Chang-Xian Chen; Hui-Yong Yang
Journal:  Cancer Manag Res       Date:  2020-09-24       Impact factor: 3.989

4.  MicroRNA-431-5p Inhibits the Tumorigenesis of Osteosarcoma Through Targeting PANX3.

Authors:  Shengliang Sun; Lei Fu; Gen Wang; Jianli Wang; Liping Xu
Journal:  Cancer Manag Res       Date:  2020-09-08       Impact factor: 3.989

5.  Increased DEF6 expression is correlated with metastasis and poor prognosis in human osteosarcoma.

Authors:  Qiao Zhang; Guo-Sheng Zhao; Ya Cao; Xue-Feng Tang; Qiu-Lin Tan; Lu Lin; Qiao-Nan Guo
Journal:  Oncol Lett       Date:  2020-06-17       Impact factor: 2.967

6.  Long non‑coding RNA NR2F1‑AS1 facilitates the osteosarcoma cell malignant phenotype via the miR‑485‑5p/miR‑218‑5p/BIRC5 axis.

Authors:  Guanghui Jia; Yalei Wang; Yali Yu; Zijun Li; Xiangyu Wang
Journal:  Oncol Rep       Date:  2020-07-20       Impact factor: 3.906

7.  SLC1A1, SLC16A9, and CNTN3 Are Potential Biomarkers for the Occurrence of Colorectal Cancer.

Authors:  Jie Zhou; Zhiman Xie; Ping Cui; Qisi Su; Yu Zhang; Lijia Luo; Zhuoxin Li; Li Ye; Hao Liang; Jiegang Huang
Journal:  Biomed Res Int       Date:  2020-05-23       Impact factor: 3.411

8.  Coupled structural transitions enable highly cooperative regulation of human CTPS2 filaments.

Authors:  Eric M Lynch; Justin M Kollman
Journal:  Nat Struct Mol Biol       Date:  2019-12-23       Impact factor: 15.369

9.  Identification of hub methylated-CpG sites and associated genes in oral squamous cell carcinoma.

Authors:  Yuxin Dai; Qiaoli Lv; Tingting Qi; Jian Qu; Hongli Ni; Yongkang Liao; Peng Liu; Qiang Qu
Journal:  Cancer Med       Date:  2020-03-10       Impact factor: 4.452

10.  LncRNA SPRY4‑IT1 promotes progression of osteosarcoma by regulating ZEB1 and ZEB2 expression through sponging of miR‑101 activity.

Authors:  Hui Yao; Gang Hou; Qi-You Wang; Wen-Bin Xu; Hui-Qing Zhao; Yi-Chun Xu
Journal:  Int J Oncol       Date:  2019-11-13       Impact factor: 5.650

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.