Literature DB >> 30849987

Identification of co-expression modules and pathways correlated with osteosarcoma and its metastasis.

Jian-Sheng Wang1, Yun-Guo Wang2, Yong-Sheng Zhong3, Xue-Dong Li4, Shi-Xin Du4, Peng Xie4, Gui-Zhou Zheng4, Jing-Ming Han5.   

Abstract

BACKGROUND: Osteosarcoma is the most common bone tumor that occurs in children.
METHODS: To identify co-expression modules and pathways correlated with osteosarcoma and its clinical characteristics, we performed weighted gene co-expression network analysis (WGCNA) on RNA-seq data of osteosarcoma with 52 samples. Then we performed pathway enrichment analysis on genes from significant modules.
RESULTS: A total of 5471 genes were included in WGCNA, and 16 modules were identified. Module-trait analysis identified that a module involved in microtubule bundle formation, drug metabolism-cytochrome P450, and IL-17 signaling pathway was negatively correlated with osteosarcoma and positively correlated with metastasis; a module involved in DNA replication was positively correlated with osteosarcoma; a module involved in cell junction was positively correlated with metastasis; and a module involved in heparin binding negatively correlated with osteosarcoma. Moreover, expression levels in four of the top ten differentially expressed genes were validated in another independent dataset.
CONCLUSIONS: Our analysis might provide insight for molecular mechanisms of osteosarcoma.

Entities:  

Keywords:  Co-expression modules; Metastasis; Osteosarcoma; Pathways

Mesh:

Year:  2019        PMID: 30849987      PMCID: PMC6408756          DOI: 10.1186/s12957-019-1587-7

Source DB:  PubMed          Journal:  World J Surg Oncol        ISSN: 1477-7819            Impact factor:   2.754


Background

Osteosarcoma is an aggressive cancer in the skeletal system that most commonly occurs in children [1]. The proportion of patients who receive a complete clinical response after standard chemotherapy and multimodal treatment of surgery is about 70–75% [2]. For patients diagnosed with metastases, the overall 5-year survival rate was approximately 30%, and for patients with relapsed osteosarcoma, the rate decreased to 15% [3]. Since the development of chemotherapy, survival in localized high-grade osteosarcoma has improved considerably. However, there is still no worldwide consensus on a standard chemotherapy approach [3]. Therefore, exploring the molecular mechanism involved in disease progression and metastasis and its relationship with drug response is critical for identifying effective drugs that overcome drug resistance of tumor. During these decades, with the dramatic improvement on sequencing technology, the increasing accumulation of sequencing data provides us a data resource to investigate molecular mechanisms of diseases including critical genes, pathways, and networks based on an analysis of omics data. Weighted gene co-expression network analysis (WGCNA) is a method that is widely used in exploration of co-expression modules correlated with traits or phenotypes based on expression data (e.g., expression microarray, transcriptome data) [4]. And the R software package for WGCNA was developed to perform weighted correlation network analysis, including procedures of network construction, module identification, gene selection, calculations of topological properties, data simulation, and visualization [4]. In this study, we performed weighted gene co-expression network analysis based on data from RNA-seq of osteosarcoma to identify critical co-expression modules and pathways correlated with osteosarcoma and its clinical characteristics, which might provide new insights for exploring the underlying molecular mechanisms of osteosarcoma.

Materials and methods

Expression profiles of osteosarcoma

Expression profiles of osteosarcoma (GSE87624) were obtained from the Gene Expression Omnibus (GEO) (https://www.ncbi.nlm.nih.gov/geo/). This data included RNA-seq profiles of 52 samples including 44 osteosarcoma tissues, 3 normal bone tissues, 1 osteoblast, and 4 osteosarcoma cell lines based on Illumina HiSeq 2000. After quality control, read mapping, and normalization of read counts of the RNA-seq data, the values of fragments per kilobase million (FPKM) in each gene were calculated.

Co-expression module analysis of osteosarcoma

Weighted gene co-expression network analysis (WGCNA) was used to investigate co-expression modules related with osteosarcoma and its clinical characteristics. Genes with the top 25% variance of expression values among samples were included in WGCNA. Then sample clustering was performed to detect whether there were outliers in these samples. After sample clustering, scale independence and mean connectivity analysis of modules with different power values were performed to determine the soft threshold of module analysis. The power value was set from 1 to 20, and then the values of scale independence and mean connectivity were generated according to these power values. The power value was determined when the scale independence value was 0.9. Then co-expression matrix was calculated under the determined power value, with the minimal module size of 30 and the merge cut height of 0.25. A cluster dendrogram among modules and an eigengene adjacency heatmap between modules were generated.

Module-trait analysis based on clinical characteristics of osteosarcoma

Information on the clinical characteristics of samples with osteosarcoma, including sample type (osteosarcoma/normal), tissue type (osteosarcoma/normal), cell-line type (osteosarcoma/normal), and osteosarcoma type (metastasis/primary/unknown), was collected to identify significant co-expression modules related with the clinical characteristics (as trait). Module-trait relationships were calculated according to the correlation between modules and traits; modules that were significantly correlated with individual traits (P value < 0.05, module size < 500) were identified; and genes in significant modules were then exported for further analysis.

Pathway enrichment analysis of significant co-expression modules

To identify the biological pathways and functions of significant modules, pathway enrichment analysis was conducted by using the R package “clusterProfiler v3.4.4” [5, 6] with genes in significant co-expression modules related with osteosarcoma. Pathways were annotated by information from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database [7] and Gene Ontology (GO) terms [8], and the P value was adjusted by the Benjamin-Hochberg methods [9]. Pathways with a P value < 0.05 were considered as significant pathways.

Validation of differentially expressed genes in independent dataset

To validate whether differentially expressed genes identified in GSE87624 were also differentially expressed in other expression datasets, we investigated the expression levels of the top ten differentially expressed genes in another expression dataset (GSE12865), which included 14 samples (12 osteosarcoma tumor samples and 2 normal human osteoblasts as control). Differentially expressed genes of osteosarcoma were analyzed by limma package in R.

Results

For GSE87624, FPKM values of 21,884 genes from 52 samples were obtained. Then log2-transformed FPKM values were used for further analysis. Information of clinical characteristics of samples included the sample type (osteosarcoma/normal), tissue type (osteosarcoma/normal), cell-line type (osteosarcoma/normal), and osteosarcoma type (metastasis/primary/unknown). Clinical information for osteosarcoma patients is shown in Table 1.
Table 1

Sample information in expression profiling

TypeNumber (%)
Osteosarcoma
 Tissues44 (83.0%)
 Cell line4 (7.5%)
Normal bone
 Tissues3 (5.7%)
 Cell line1 (1.9%)
Tumor type
 Metastasis9 (17.0%)
 Primary20 (37.8%)
 Unknown15 (28.3%)
Sample information in expression profiling By selecting genes with the threshold of the top 25% variance of expression values, a total of 5471 genes were included in WGCNA. As shown in Fig. 1, by sample clustering, no outliers were observed in 52 samples, thus all samples were included in the analysis.
Fig. 1

Sample clustering for detecting outliers

Sample clustering for detecting outliers Then the soft threshold was determined by scale independence and mean connectivity analysis of modules with different power values ranging from 1 to 20. As shown in Fig. 2, when the power value was set to 4, the scale independence value achieved 0.9 and lower mean connectivity. Therefore, the co-expression matrix was calculated under the determined power value of 4. As shown in Figs. 3 and 4, a total of 16 modules with different genes were generated and displayed with different colors, including 198 genes in a black module, 837 genes in a blue module, 689 genes in a brown module, 60 genes in a cyan module, 408 genes in a green module, 95 genes in a green-yellow module, 31 in a light cyan module, 131 genes in a magenta module, 50 genes in a midnight blue module, 171 genes in a pink module, 127 genes in a purple module, 302 genes in a red module, 63 genes in a salmon module, 88 genes in a tan module, 1413 genes in a turquoise module, and 428 genes in a yellow module.
Fig. 2

Scale independence and mean connectivity analysis

Fig. 3

Cluster dendrogram among modules

Fig. 4

Eigengene adjacency heatmap between modules

Scale independence and mean connectivity analysis Cluster dendrogram among modules Eigengene adjacency heatmap between modules Clinical characteristics of samples with osteosarcoma were collected as a trait, and module-trait relationships were calculated according to the correlation between modules and traits. As shown in Fig. 5, for the sample type of osteosarcoma, the most significant modules with negative correlation were purple and green-yellow; for the tissue type of osteosarcoma, the most significant modules with negative correlation were purple and greenyellow and the positively correlated module was brown; and for the osteosarcoma type of metastasis, the most significant modules with positive correlation were purple and yellow.
Fig. 5

Module-trait relationships

Module-trait relationships

Pathway enrichment analysis

By pathway enrichment analysis, with the threshold of Benjamin-adjusted P value < 0.05, we obtained significant GO terms and KEGG pathways enriched in significant modules (see Additional file 1). For the purple module (Additional file 1: Table S1), we identified 22 GO terms and 6 KEGG pathways and microtubule bundle formation, IL-17 signaling pathway, and drug metabolism-cytochrome P450 were identified; for the brown module (Additional file 1: Table S2), we identified 58 GO terms and 3 KEGG pathways, which were mainly related with DNA replication and mitotic nuclear division; for the yellow module (Additional file 1: Table S3), we identified 26 GO terms, which were mainly related with the anchored component of membrane and cell junction; and for the green-yellow module (Additional file 1: Table S4), we identified 3 GO terms and 2 KEGG pathways, which were mainly related with regulation of lipolysis in adipocytes and heparin binding.

Validation of differentially expressed genes in independent datasets

With the threshold of Benjamin-adjusted P value < 0.05 and |log2 (fold change)| > 1, we obtained a total of 369 differentially expressed genes of osteosarcoma in GSE87624. The top ten differentially expressed genes were BPIFA1, AGR2, MEPE, MSMB, BPIFB1, BGLAP, SLPI, HBA2, LCN2, and SERPINB3. In an independent dataset GSE12865, we observed that all ten genes had the same trend of expression level as that in GSE87624, in which four genes, MEPE, BPIFB1, HBA2, and SERPINB3, were also identified to have significantly differential expression levels (shown in Table 2.
Table 2

Expression levels of the top ten genes in GSE87624 and GSE12865

GeneDiscovery datasetValidation dataset
Log (fold change)P valueLog (fold change)P value
BPIFA1 − 6.931.69E−05− 0.105.87E−01
AGR2 − 5.772.22E−04− 0.058.85E−01
MEPE − 5.29 7.43E03 − 3.59 8.36E03
MSMB − 4.701.02E−04− 0.776.07E−02
BPIFB1 − 4.64 2.61E03 − 0.31 4.40E02
BGLAP − 4.591.68E−01− 0.483.00E−01
SLPI − 4.242.33E−01− 1.111.13E−01
HBA2 − 3.93 3.31E02 − 1.83 7.10E04
LCN2 − 3.821.60E−03− 0.243.11E−01
SERPINB3 − 3.76 1.17E03 − 0.69 1.46E02
Expression levels of the top ten genes in GSE87624 and GSE12865

Discussion

In this study, we performed a weighted gene co-expression network analysis (WGCNA) to investigate co-expression modules related with osteosarcoma and its clinical characteristics. Significant modules were identified to be correlated with osteosarcoma. For the purple module, which was mainly related with microtubule bundle formation, drug metabolism-cytochrome P450, and IL-17 signaling pathway and was identified to be negatively correlated with the trait of osteosarcoma, while being positively correlated with the trait of metastasis in osteosarcoma, previous studies have reported that increase of microtubule destabilization was related with G1/G2 phase cell cycle arrest and apoptosis, and microtubule inhibitors could trigger autophagy and cell death in osteosarcoma cell line [10]. Besides, IL-17A/IL-17RA interaction promoted metastasis of osteosarcoma cells [11]. Moreover, the resistance of osteosarcoma to chemotherapy was related to cytochrome P450 3A4 [12]. Our results might provide supporting evidence for these previous findings. For the brown module, which was mainly involved in DNA replication and mitotic nuclear division and was observed to be positively correlated with the trait of osteosarcoma tissue, previous studies have revealed that genes involved with DNA replication and DNA damage were associated with radiosensitivity of osteosarcoma [13], as well as drug sensitivity [14]. Our results indicated that genes in the brown module might be related with carcinogenesis of osteosarcoma and might provide insights for exploring drug targets for the treatment of osteosarcoma. For yellow module, which was mainly related with the anchored component of membrane and cell junction and was observed to be positively correlated with the trait of metastasis in osteosarcoma, it has been reported that pathways related with cell junction were involved in metastasis in various tumors, e.g., lung cancer [15], osteosarcoma [16], and pancreatic cancer [17], which might be due to the molecules in cell junction, and the anchored component of membrane felicitated the migration and invasion tumor cells into other tissues and organs [18]. Our results suggest that genes in the anchored component of membrane and cell junction might also play important roles in the metastasis of osteosarcoma and were worthy of further investigation. For the green-yellow module, which was mainly related with regulation of lipolysis in adipocytes and heparin binding, it was observed to be negatively correlated with the trait of osteosarcoma. In accordance with our results, there were studies showing that heparin could reduce the osteosarcoma proliferation and growth [19, 20], and heparin binding sites might be potential therapeutic targets for osteosarcoma [21]. In the validation section, among the top ten differentially expressed genes in GSE87624, we validated four genes, MEPE, BPIFB1, HBA2, and SERPINB3, also had significantly differential expression levels in another expression dataset (GSE12865), demonstrating the results identified from the first dataset can be supported by an independent dataset. For MEPE, there were researches revealing its involvement with osteosarcoma [22-24]; for BPIFB1, it has been reported that it could inhibit radioresistance in nasopharyngeal carcinoma [25, 26]; for HBA2, previous studies have identified different expression levels in the bone marrow of prostate cancer patients [27]; and for SERPINB3, it has been identified as mediators of Ras-driven inflammation and oncogenesis [28]. The effect of these genes on osteosarcoma was worthy of further investigation.

Conclusions

By WGCNA methods on expression data, we identified significant co-expression modules and pathways correlated with osteosarcoma, as well as metastasis of osteosarcoma, and our analysis might provide insights for the molecular mechanisms of osteosarcoma. Table S1. Significant pathways of the purple module. Table S2. Significant pathways of the brown module. Table S3. Significant pathways of the yellow module. Table S4. Significant pathways of the green-yellow module. (DOCX 27 kb)
  28 in total

1.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

Authors:  Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

2.  Expression levels and activation of a PXR variant are directly related to drug resistance in osteosarcoma cell lines.

Authors:  Edith J Mensah-Osman; Dafydd G Thomas; Michelle M Tabb; Jose M Larios; Dennis P Hughes; Thomas J Giordano; Michelle L Lizyness; James M Rae; Bruce Blumberg; Paul F Hollenberg; Laurence H Baker
Journal:  Cancer       Date:  2007-03-01       Impact factor: 6.860

Review 3.  Chemotherapeutic adjuvant treatment for osteosarcoma: where do we stand?

Authors:  Jakob K Anninga; Hans Gelderblom; Marta Fiocco; Judith R Kroep; Antoni H M Taminiau; Pancras C W Hogendoorn; R Maarten Egeler
Journal:  Eur J Cancer       Date:  2011-06-22       Impact factor: 9.162

4.  Constitutively active Stat3 enhances neu-mediated migration and metastasis in mammary tumors via upregulation of Cten.

Authors:  Isaia Barbieri; Sara Pensa; Tania Pannellini; Elena Quaglino; Diego Maritano; Marco Demaria; Alessandra Voster; James Turkson; Federica Cavallo; Christine J Watson; Paolo Provero; Piero Musiani; Valeria Poli
Journal:  Cancer Res       Date:  2010-03-09       Impact factor: 12.701

5.  International patterns of cancer incidence in adolescents.

Authors:  Charles A Stiller
Journal:  Cancer Treat Rev       Date:  2007-02-27       Impact factor: 12.111

6.  Effects of phosphates on the expression of tissue-nonspecific alkaline phosphatase gene and phosphate-regulating genes in short-term cultures of human osteosarcoma cell lines.

Authors:  Hideo Orimo; Takashi Shimada
Journal:  Mol Cell Biochem       Date:  2006-01       Impact factor: 3.396

7.  Glycosaminoglycans as potential regulators of osteoprotegerin therapeutic activity in osteosarcoma.

Authors:  Francois Lamoureux; Gaëlle Picarda; Laure Garrigue-Antar; Marc Baud'huin; Valerie Trichet; André Vidal; Elisabeth Miot-Noirault; Bruno Pitard; Dominique Heymann; Françoise Rédini
Journal:  Cancer Res       Date:  2009-01-15       Impact factor: 12.701

8.  Evidence of downregulation of matrix extracellular phosphoglycoprotein during terminal differentiation in human osteoblasts.

Authors:  H Siggelkow; E Schmidt; B Hennies; M Hüfner
Journal:  Bone       Date:  2004-08       Impact factor: 4.398

9.  Analysis of invasion-metastasis mechanism in pancreatic cancer: involvement of tight junction transmembrane protein occludin and MEK/ERK signal transduction pathway in cancer cell dissociation.

Authors:  Xiaodong Tan; Yasuhiro Tamori; Hiroshi Egami; Shinji Ishikawa; Takashi Kurizaki; Eiji Takai; Masahiko Hirota; Michio Ogawa
Journal:  Oncol Rep       Date:  2004-05       Impact factor: 3.906

10.  WGCNA: an R package for weighted correlation network analysis.

Authors:  Peter Langfelder; Steve Horvath
Journal:  BMC Bioinformatics       Date:  2008-12-29       Impact factor: 3.169

View more
  6 in total

1.  The Identification of Childhood Asthma Progression-Related lncRNAs and mRNAs Suitable as Biomarkers Using Weighted Gene Coexpression Network Analysis.

Authors:  Min Hao; Jinling Zan
Journal:  Genet Res (Camb)       Date:  2021-07-27       Impact factor: 1.588

2.  Differential Gene Expression and Weighted Correlation Network Dynamics in High-Throughput Datasets of Prostate Cancer.

Authors:  Taj Mohammad; Prithvi Singh; Deeba Shamim Jairajpuri; Lamya Ahmed Al-Keridis; Nawaf Alshammari; Mohd Adnan; Ravins Dohare; Md Imtaiyaz Hassan
Journal:  Front Oncol       Date:  2022-06-01       Impact factor: 5.738

3.  CORR Insights®: Is Use of BMP-2 Associated with Tumor Growth and Osteoblastic Differentiation in Murine Models of Osteosarcoma?

Authors:  Terri A Zachos
Journal:  Clin Orthop Relat Res       Date:  2020-12       Impact factor: 4.755

4.  Characterization of long non-coding RNA and messenger RNA profiles in laryngeal cancer by weighted gene co-expression network analysis.

Authors:  Huanhuan Liu; Yi Sun; Huan Tian; Xiaolian Xiao; Jiaqi Zhang; Yongzhen Wang; Fengyan Yu
Journal:  Aging (Albany NY)       Date:  2019-11-18       Impact factor: 5.682

5.  Transcriptomic Analysis of Canine Osteosarcoma from a Precision Medicine Perspective Reveals Limitations of Differential Gene Expression Studies.

Authors:  Rebecca L Nance; Sara J Cooper; Dmytro Starenki; Xu Wang; Brad Matz; Stephanie Lindley; Annette N Smith; Ashley A Smith; Noelle Bergman; Maninder Sandey; Jey Koehler; Payal Agarwal; Bruce F Smith
Journal:  Genes (Basel)       Date:  2022-04-13       Impact factor: 4.141

6.  PKIB involved in the metastasis and survival of osteosarcoma.

Authors:  Rongxue Wan; Gu Yang; Qianzhen Liu; Xiaokang Fu; Zengping Liu; Huilai Miao; Huan Liu; Wenhua Huang
Journal:  Front Oncol       Date:  2022-08-22       Impact factor: 5.738

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.