Literature DB >> 27314445

Identification of CD20, ECM, and ITGA as Biomarkers for Osteosarcoma by Integrating Transcriptome Analysis.

Da-Wei Wang1, Sheng-Yuan Yu1, Yang Cao1, Lei Yang1, Wei Liu1, Xiao-Qiang Er1, Gui-Jun Yao1, Zheng-Gang Bi1.   

Abstract

BACKGROUND Osteosarcoma is the most frequent primary bone cancer derived from primitive mesenchymal cells. The aim of this study was to explore the molecular mechanism of the development and progression of osteosarcoma. MATERIAL AND METHODS The gene expression profiles of osteosarcoma from 17 specimens (3 normal and 14 osteosarcoma) were downloaded from the GEO database. The differentially expressed genes were identified by use of the Limma package. DAVID and Enrichment Map were used to perform GO and KEGG pathways enrichment analysis and to integrate enrichment results of differentially expressed genes (DEGs). Protein-protein interaction network was constructed and analyzed to screen out the potential regulatory proteins using the STRING online tools. RESULTS A total of 417 DEGs were screened, including 215 up-regulated and 202 down-regulated ones, accounting for 51.56% and 48.4%, respectively. In GO term, a total of 12 up-regulated expression genes were enriched in Cellular Component. The up-regulated DEGs were enriched in 6 KEGG pathways while the down-regulated expression genes were enriched in 2 KEGG pathways. The constructed PPI network was aggregated with 1006 PPI relationships and 238 nodes, accounting for 57.07% of DEGs. We found that CD20, MCM, and CCNB1 (down-regulated) in cell cycle and ECM, ITGA, RTKin (up-regulated) in focal adhesion had important roles in the progression of osteosarcoma. CONCLUSIONS The identified DEGs and their enriched pathways provide references for the exploration of the molecular mechanism of the development and progression of osteosarcoma. Moreover, the key genes (CD20, ECM, and ITGA) may be useful in treatment and diagnosis of osteosarcoma.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27314445      PMCID: PMC4913741          DOI: 10.12659/msm.898852

Source DB:  PubMed          Journal:  Med Sci Monit        ISSN: 1234-1010


Background

Osteosarcoma (OS) is the most frequent primary bone cancer in children and adolescents. It is derived from primitive mesenchymal bone-forming cells that undergo aberrant alterations in the differentiation program [1]. The incidence of OS was reported to be 10 cases per million persons per year [2]. Patients with OS for 10 to 20 years account for 60% of cases [3]. With the development of pediatric oncology, orthopedic oncology, and biology, treatment of osteosarcoma now includes aggressive cytotoxic chemotherapy and local control surgery, which achieves a 5-year overall survival approaching 70–80% [4]. However, after patients are treated with aggressive cytotoxic chemotherapy, they finally present with metastatic disease, and there are even those whose tumors recur because of its aggressive malignancy [5]. The overall survival rate of patients with non-metastatic osteosarcoma decreases to 20% when metastases occur [6]. In most cases, unless obvious clinical manifestations such as bone fractures and local pain were observed, the patients may not be diagnosed as having OS; therefore, OS is often found at advanced stage [7]. In order to continue to make progress in the treatment and diagnosis of OS, it is very important to find biomarkers of osteosarcoma. In recent years, considerable research has focussed on osteosarcoma. Zhu found that SOX9 is up-regulated in aggressive osteosarcoma tissues compared with controls by detecting the SOX9 mRNA and protein expression levels of 30 pairs of osteosarcoma and noncancerous bone tissues [8]. It is reported that FOXM1 is over-expressed and plays an important role in development and progression in various cancers, such as lung, liver, and breast cancer [9,10]. FoxM1 was found to be up-regulated in osteosarcoma tissues, which means that FoxM1may be as a valuable prognostic biomarker for osteosarcoma [11]. Kuan et al. suggested Dual AO/EB staining is an economic and convenient method to detect apoptosis in osteosarcoma cells [12]. However, these molecules do not treat the osteosarcoma efficiently or selectively. Therefore, there is great need for new methods to elucidate the mechanism of osteosarcoma and new therapeutics for osteosarcoma. In this paper, microarrays were utilized to identify differentially expressed genes (DEGs) between osteosarcoma and normal cells. Significance of differential expression, functions of DEGs, and network module analysis were performed. The identified DEGs and their enriched pathways provide references for the exploration of the molecular mechanism of the development and progression of osteosarcoma. With the analysis of PPI (protein-to-protein interaction) network, we selected out key genes, which appeared to have potential to be used for clinical treatment and diagnosis of osteosarcoma in the future.

Material and Methods

Data source

The gene expression profile of GSE16088 [13] were downloaded from a public functional genomics data repository, the GEO (Gene Expression Omnibus, ) database. A total of 17 specimens, including 3 normal samples and 14 osteosarcoma specimens, were available based on the Affymetrix Human Genome U133A Array.

Data preprocessing

We converted the probe-level data in CEL files into expression measures using the RMA function of the Affy package in R [14]. Then, the R/Bioconductor notes package of the microarray data platform was used to map probe sets to genes. For each sample, the expression values of all probes for a given gene were reduced to a single value by taking the average expression value.

DEGs screening

Limma, the most popular method in the statistical analysis to study the DEGs, was used to compare the normal samples and osteosarcoma samples [15]. Only the genes with the p-value adjusted of 0.01 and |log2(fc)|>1 were screened out as DEGs. We also performed clustering analysis for the DEGs and drew dendrograms.

Functional enrichment analysis of the DEGs

Gene ontology (GO) analysis is a commonly used approach for functional annotation of large-scale genomic data. The KEGG pathways database is a comprehensive and recognized database involving many kinds of biochemistry pathways. To gain further insights into the function of DEGs, we used the DAVID method to perform the GO enrichment analysis and KEGG pathways enrichment analysis related with the up-regulated genes and down-regulated genes, respectively. DAVID (The Database for Annotation, Visualization and Integrated Discovery) [16] was used to perform the Gene Ontology (GO) analysis and KEGG pathways enrichment analysis of DEGs, with a false discovery rate (FDR) less than 0.05. EnrichmentMap [17], a useful tool to overcome gene-set redundancy and help in the interpretation of large gene lists, was used to integrate function enrichment results by analyzing gene-sets for enrichment significance and then organizing them as a weighted similarity network to define the relationship of different biological processes.

PPI network construction analysis

Recent key research on the interaction of proteins shows that they regularly form individual molecules in the interaction of many molecules and form complicated networks. In the present study, based on the 417 DEGs obtained above, the STRING online tools [18] were used to analyzed the PPI. The combined score >0.4 was the PPI value. Furthermore, we performed KEGG pathways enrichment analysis of the network nodes. After obtaining the PPI, the Network Analyzer plug-in [19] of Cytoscape software was used to analyze the topology property of the networks. In addition, we performed the module analysis of the network by using the ClusterONE plug-in board, then modules with p-value <0.01 were selected to perform the function analysis.

Results

Data source and preprocessing

A total of 13 272 genes from 3 normal samples and 14 osteosarcoma specimens in GSE16088 were obtained. The data before normalization and after normalization are shown in Figure 1. The median of different samples was almost on the same line after normalization, indicating an excellent degree of standardization.
Figure 1

Box-plot of expressed data before and after normalization. Horizontal axis stands for the name of samples; vertical axis stands for expression values. The black line of the box is the median of data of each group, whose distribution stands for the degree of standardization of data.

Identification of DEGs

Limma of R software was used to perform the differential expression analysis of 14 osteosarcoma cancer cell samples and 3 controls, with the p-value adjusted to 0.01 and |log2(fc)|>1. Finally, 417 DEGs were identified, including 215 (51.56%) up-regulated and 202 (48.4%) down-regulated DEGs (Figure 2, Table 1). The clustering of samples was mainly separated into 2 clusters: one was a control sample and the other was an osteosarcoma sample. The clustering of genes was mainly separated into 3 clusters: the first was the down-regulated genes of the osteosarcoma sample, the second was the up-regulated genes of the osteosarcoma sample, and the third was significantly up-regulated genes.
Figure 2

The up- and down-regulated genes. The horizontal axis above stands for the name of sample; the left vertical axis stands for the name of gene; the right vertical stands for the clustering of gene. Red stands for up-regulated gene, green stands for down-regulated gene.

Table 1

The top 10 up- and down-regulated genes.

Gene symbollogFCAdj.P.ValStage
ENPP25.175479958.55E-06UP
OLFML2B5.366602645.06E-08UP
IBSP5.4176184850.002416UP
COL1A25.4232801621.81E-05UP
A2M5.5364077295.52E-07UP
CTSK5.8871737581.61E-05UP
LUM5.9179159081.48E-06UP
MAFB5.9188692822.39E-10UP
SEPP16.9739665887.43E-12UP
SPP17.0536560443.34E-09UP
TRIP13−2.6788537355.47E-06Down
CTGF−2.5570848060.00453Down
DRAP1−2.5424903144.88E-09Down
PFKP−2.4098719770.001516Down
SLC7A5−2.2926791354.80E-05Down
KIAA0101−2.2633015867.77E-06Down
FHL2−2.22353180.004587Down
FEN1−2.2190246054.60E-06Down
CCNB1−2.1864898710.002583Down
HNRNPF−2.185919963.75E-05Down

Functional enrichment and integrated analysis of the DEGs

EnrichmentMap was used to analyze the relation between the enriched GO terms (Figure 3). In GO term, a total of 12 up-regulated expression genes were enriched in Cellular Component (CC), while 22 down-regulated ones were not mutually enriched in (Table 2). Down-regulated expression proteins were enriched in the organelles concerned with energy metabolism, such as ribosomes and spliceosomes; while up-regulated expression proteins were enriched in the organelles concerned with energy metabolism such as up-regulated expression proteins were enriched in extracellular domain such as extracellular region part, while in Biological Process (BP), 17 genes were up-regulated and 40 genes were down-regulated (Table 3). Down-regulated expression genes were mainly enriched in 4 biological processes such as positive regulation of ubiquitin-protein ligase activity during mitotic cell cycle. Down-regulated expression proteins influence the proliferation and fission of cell since ubiquitin-protein ligase is related with the cell cycle. Some expression genes of bone development and cell adhesion were up-regulated. Meanwhile, 5 up-regulated and 8 down-regulated genes were enriched in Molecular Function (MF) (Table 4). The 5 MFs which the up-regulated expression proteins enriched have no close connection, while 5 out of the 8 MFs which the down-regulated expression proteins enriched have close connections.
Figure 3

The GO terms relation schemas of Molecular Function, Cellular Component, and Biological Process after enrichment of up-regulated and down-regulated genes.

Table 2

The modules of networks.

Module nameNodesDensityp-value
1290.3997.99E-09
2210.50952.48E-05
3130.89740.0001
4120.71210.000122
5140.50550.001929
6130.52560.002855
7210.50.003462
Table 3

The KEGG pathways of 7 modules.

Module rankingKEGG pathwayGenesFDR
Module 1hsa04512: ECM-receptor interactionIBSP, COL4A1, ITGAV, COL6A3, COL3A1, COL1A2, COL1A1, COL5A2, COL11A1, COL5A1, FN15.99E-11
hsa04510: Focal adhesionIBSP, COL4A1, ITGAV, COL6A3, COL3A1, COL1A2, PDGFRB, COL1A1, COL5A2, COL11A1, COL5A1, FN11.35E-08
Module 3hsa03050: ProteasomePSMD14, PSMB6, PSMC4, PSMC3, PSMA3, PSMD1, PSMD6, PSMA72.04E-10
Module 4hsa04612: Antigen processing and presentationHLA-DRB1, HLA-A, HLA-C, HLA-B, HLA-G, B2M6.72E-04
hsa05330: Allograft rejectionHLA-DRB1, HLA-A, HLA-C, HLA-B, HLA-G0.004467
hsa05332: Graft-versus-host diseaseHLA-DRB1, HLA-A, HLA-C, HLA-B, HLA-G0.00571
hsa04940: Type I diabetes mellitusHLA-DRB1, HLA-A, HLA-C, HLA-B, HLA-G0.007163
hsa05320: Autoimmune thyroid diseaseHLA-DRB1, HLA-A, HLA-C, HLA-B, HLA-G0.012942
hsa05416: Viral myocarditisHLA-DRB1, HLA-A, HLA-C, HLA-B, HLA-G0.035202
Module 5hsa03030: DNA replicationMCM7, RFC4, MCM2, MCM3, FEN11.89E-05
hsa04110: Cell cycleCCNB1, MCM7, CDC20, MCM2, MCM30.003019
Module 6hsa03030: DNA replicationMCM7, RFC4, MCM2, MCM3, FEN11.89E-05
hsa04110: Cell cycleCCNB1, MCM7, CDC20, MCM2, MCM30.003019
Module 7hsa04510: Focal adhesionCAV1, COL4A1, ITGAV, COL6A3, COL3A1, COL1A2, PDGFRB, COL1A1, CTNNB1, FN16.85E-07
hsa04512: ECM-receptor interactionCOL4A1, ITGAV, COL6A3, COL3A1, COL1A2, COL1A1, FN16.43E-05
Table 4

Genes nriched in molecular function.

CategoryTermCount%P value
Up-regulatedGOTERM_MF_FATGO: 0048407~platelet-derived growth factor binding62.9411764718.41E-08
GOTERM_MF_FATGO: 0005201~extracellular matrix structural constituent104.9019607846.22E-07
GOTERM_MF_FATGO: 0005518~collagen binding73.4313725493.15E-06
GOTERM_MF_FATGO: 0050840~extracellular matrix binding62.9411764711.27E-05
GOTERM_MF_FATGO: 0019838~growth factor binding94.4117647062.85E-05
Down-regulatedGOTERM_MF_FATGO: 0000166~nucleotide binding6030.456852791.63E-09
GOTERM_MF_FATGO: 0003723~RNA binding3015.22842641.09E-08
GOTERM_MF_FATGO: 0051082~unfolded protein binding136.5989847721.56E-08
GOTERM_MF_FATGO: 0003735~structural constituent of ribosome147.1065989851.40E-07
GOTERM_MF_FATGO: 0017076~purine nucleotide binding4723.857868023.10E-06
GOTERM_MF_FATGO: 0030554~adenyl nucleotide binding3919.796954312.68E-05
GOTERM_MF_FATGO: 0032553~ribonucleotide binding4321.827411173.12E-05
GOTERM_MF_FATGO: 0032555~purine ribonucleotide binding4321.827411173.12E-05
On the other hand, the metabolic pathways of the up-regulated and down-regulated genes were clearly different. The up-regulated expression genes were enriched in 6 KEGG pathways and the down-regulated expression genes were enriched in 2 KEGG pathways (Table 5).
Table 5

The metabolic pathways of the up-regulated and down-regulated genes.

CategoryTermCount%P value
Up-regulatedKEGG_PATHWAYhsa04512: ECM-receptor interaction157.3529411766.48E-11
KEGG_PATHWAYhsa04510: Focal adhesion178.3333333331.64E-07
KEGG_PATHWAYhsa04940: Type I diabetes mellitus83.9215686275.17E-06
KEGG_PATHWAYhsa04612: Antigen processing and presentation104.9019607848.78E-06
KEGG_PATHWAYhsa05330: Allograft rejection73.4313725492.55E-05
KEGG_PATHWAYhsa05332: Graft-versus-host disease73.4313725494.09E-05
Down-regulatedKEGG_PATHWAYhsa03040: Spliceosome147.1065989851.00E-07
KEGG_PATHWAYhsa03050: Proteasome84.0609137069.62E-06

Network analysis

The STRING tool was used to get the PPI relationships of the 417 DEGs. The PPI with the combined score >0.4 was selected and we gained 1006 PPI relationships and 238 nodes, accounting for 57.07% of all DEGs. The network of PPI relationships appeared to be aggregated (Figure 4).
Figure 4

The network of PPI of DEG.

Network module analysis

The topology property of the network is shown in Figure 5. The node degree distribution of the PPI was in power-law distribution according to the results shown in Figure 5. We screened 7 modules with 10 nodes and p-value 0.05 by using the ClusterONE plug-in board of Cytoscape (Table 2, Figure 6).
Figure 5

The topology parameter of the networks. (A) Degree distribution; (B) Average clustering coefficient; (C) Shortest path distribution; (D) Closeness centrality.

Figure 6

The KEGG pathways of 6 modules.

Discussion

Osteosarcoma is the most frequent primary bone tumor predominantly affecting children and adolescents [20]. Patients with OS commonly are treated with aggressive cytotoxic chemotherapy and local control surgery, and the prognosis for patients is poor [21]. In addition, patients with OS were often diagnosed at advanced stage due to the limited diagnostic methods and unclear clinical manifestations. There is an urgent need to explore the mechanism of osteosarcoma, and the knowledge gained would help develop effectively diagnoses and treatment strategies. In this study, we gained 417 DEGs upon gene expression profiling of osteosarcoma patient samples, among which the down-regulated DEGs were enriched in cell cycle pathways. CDC20, MCM, and CCNB1, the key proteins of the cell cycle, were down-regulated in module analysis of PPI, which facilitated the decline of cell proliferation. Furthermore, ECM, ITGA, and RTKin of focal adhesion were up-regulated. Cell division cycle (CDC) proteins play important roles in the orderly progression of the cell cycle through complexing with the anaphase-promoting complex/cyclosome (APC/C) to initiate early mitosis [22]. Kata found the over-expression of CDC20 was closely related with the poor prognosis of patients with primary non-small cell lung cancer [23]. Wu found that CDC20 over-expression was a sign of the poor prognosis for patients with colorectal cancer [24]. The high expression of CDC20 and MAD2 were found to predict poor prognosis in urothelial bladder cancer [25]. Zhong-Yang Ding identified CDC20 as an independent marker for predicting the clinical outcome of gastric cancer patients because up-regulation of CDC20 was associated with aggressive progression and poor prognosis in gastric cancer [26]. In this study, we found CDC20 was differentially expressed, suggesting that CDC20 might play an important role in the progression of osteosarcoma. The other key proteins of the cell cycle, MCM and CCNB1, were differentially expressed. It is widely acknowledged that the tumor microenvironment, consisting of not only the cancer cells and all non-malignant cells, but also the interstitial fluids and the extracellular matrix (ECM), play an important role in the progression of cancer. The deregulation of ECM proteins, inducing biochemical and biomechanical changes of cancer, was associated with the progression of cancer [27]. Periostin and tenascin C are the key players in the ECM. The over-expression of periostin was associated with the metastatic growth of breast cancer, colon cancer, lung cancer, and pancreatic cancer [28-31]. The elevated level of tenascin C predicted squamous cell carcinoma of the head and neck [32]. In Lustosa’s study, ITGA-3 was over-expressed in tumors with lymph node and distant metastasis (III/IV-stage tumors compared with I/II tumors), which is associated with advanced-stage tumors [33]. ITGA2 might play an important role in the progress of hepatocellular carcinoma (HCC) as the target genes of miR-128, which is underexpressed in HCC [34]. In our study, ECM and ITGA of focal adhesion was up-regulated, suggesting that ECM and ITGA might be key proteins in the progression of osteosarcoma.

Conclusions

In this study, a total of 417 DEGs were found, protein-protein interaction network was constructed, and the network module analysis of the PPI was performed. Among the identified DEGs, CDC20, MCM, and CCNB1, the key proteins of the cell cycle, were down-regulated, and ECM, ITGA, and RTKin of focal adhesion were up-regulated. Furthermore, CD20, ECM, and ITGA may play important roles in the progression of osteosarcoma. Our results may provide potential data for the exploration of the development and progression of the osteosarcoma, and might be used as biological targets for the treatment of OS.
  34 in total

1.  High expression of spindle assembly checkpoint proteins CDC20 and MAD2 is associated with poor prognosis in urothelial bladder cancer.

Authors:  Jung-Woo Choi; Younghye Kim; Ju-Han Lee; Young-Sik Kim
Journal:  Virchows Arch       Date:  2013-08-31       Impact factor: 4.064

2.  Analysis of SOX9 expression in colorectal cancer.

Authors:  Bingjian Lü; Yihu Fang; Jing Xu; Lipei Wang; Fangying Xu; Enping Xu; Qiong Huang; Maode Lai
Journal:  Am J Clin Pathol       Date:  2008-12       Impact factor: 2.493

3.  Forkhead box protein M1 predicts outcome in human osteosarcoma.

Authors:  Chong-Lang Fan; Jian Jiang; Hu-Cheng Liu; Dong Yang
Journal:  Int J Clin Exp Med       Date:  2015-09-15

4.  Bone cancer.

Authors:  J Sybil Biermann; Douglas R Adkins; Mark Agulnik; Robert S Benjamin; Brian Brigman; James E Butrynski; David Cheong; Warren Chow; William T Curry; Deborah A Frassica; Frank J Frassica; Kenneth R Hande; Francis J Hornicek; Robin L Jones; Joel Mayerson; Sean V McGarry; Brian McGrath; Carol D Morris; Richard J O'Donnell; R Lor Randall; Victor M Santana; Robert L Satcher; Herrick J Siegel; Margaret von Mehren; Mary Anne Bergman; Hema Sundar
Journal:  J Natl Compr Canc Netw       Date:  2013-06-01       Impact factor: 11.908

5.  Expression characteristics of CDC20 in gastric cancer and its correlation with poor prognosis.

Authors:  Zhong-Yang Ding; Hao-Rong Wu; Jia-Ming Zhang; Guo-Rong Huang; Dong-Dong Ji
Journal:  Int J Clin Exp Pathol       Date:  2014-01-15

6.  Enrichment map: a network-based method for gene-set enrichment visualization and interpretation.

Authors:  Daniele Merico; Ruth Isserlin; Oliver Stueker; Andrew Emili; Gary D Bader
Journal:  PLoS One       Date:  2010-11-15       Impact factor: 3.240

7.  Periostin potently promotes metastatic growth of colon cancer by augmenting cell survival via the Akt/PKB pathway.

Authors:  Shideng Bao; Gaoliang Ouyang; Xuefang Bai; Zhi Huang; Chaoyu Ma; Ming Liu; Rong Shao; Ryan M Anderson; Jeremy N Rich; Xiao-Fan Wang
Journal:  Cancer Cell       Date:  2004-04       Impact factor: 31.743

8.  A novel mammalian protein, p55CDC, present in dividing cells is associated with protein kinase activity and has homology to the Saccharomyces cerevisiae cell division cycle proteins Cdc20 and Cdc4.

Authors:  J Weinstein; F W Jacobsen; J Hsu-Chen; T Wu; L G Baum
Journal:  Mol Cell Biol       Date:  1994-05       Impact factor: 4.272

9.  CDC20 overexpression predicts a poor prognosis for patients with colorectal cancer.

Authors:  Wen-jing Wu; Kai-shun Hu; De-shen Wang; Zhao-lei Zeng; Dong-sheng Zhang; Dong-liang Chen; Long Bai; Rui-hua Xu
Journal:  J Transl Med       Date:  2013-06-10       Impact factor: 5.531

10.  Human periostin gene expression in normal tissues, tumors and melanoma: evidences for periostin production by both stromal and melanoma cells.

Authors:  Gaëlle Tilman; Marina Mattiussi; Francis Brasseur; Nicolas van Baren; Anabelle Decottignies
Journal:  Mol Cancer       Date:  2007-12-17       Impact factor: 27.401

View more
  3 in total

1.  Identification of Key Genes Affecting Results of Hyperthermia in Osteosarcoma Based on Integrative ChIP-Seq/TargetScan Analysis.

Authors:  Yuxia Shi; Fan Yang; Shuqing Wei; Gang Xu
Journal:  Med Sci Monit       Date:  2017-04-28

2.  Identification of a Two-Gene (PML-EPB41) Signature With Independent Prognostic Value in Osteosarcoma.

Authors:  Shengye Liu; Jiamei Liu; Xuechen Yu; Tao Shen; Qin Fu
Journal:  Front Oncol       Date:  2020-01-24       Impact factor: 6.244

3.  The Clinical Significance and Potential Molecular Mechanism of Upregulated CDC28 Protein Kinase Regulatory Subunit 1B in Osteosarcoma.

Authors:  Chaohua Mo; Le Xie; Chang Chen; Jie Ma; Yingxin Huang; Yanxing Wu; Yuanyuan Xu; Huizhi Peng; Zengwei Chen; Rongjun Mao
Journal:  J Oncol       Date:  2021-12-10       Impact factor: 4.375

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.