Literature DB >> 29588600

Identification of differentially expressed genes and signaling pathways in ovarian cancer by integrated bioinformatics analysis.

Xiao Yang1, Shaoming Zhu2, Li Li3, Li Zhang1, Shu Xian1, Yanqing Wang1, Yanxiang Cheng1.   

Abstract

BACKGROUND: The mortality rate associated with ovarian cancer ranks the highest among gynecological malignancies. However, the cause and underlying molecular events of ovarian cancer are not clear. Here, we applied integrated bioinformatics to identify key pathogenic genes involved in ovarian cancer and reveal potential molecular mechanisms.
RESULTS: The expression profiles of GDS3592, GSE54388, and GSE66957 were downloaded from the Gene Expression Omnibus (GEO) database, which contained 115 samples, including 85 cases of ovarian cancer samples and 30 cases of normal ovarian samples. The three microarray datasets were integrated to obtain differentially expressed genes (DEGs) and were deeply analyzed by bioinformatics methods. The gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichments of DEGs were performed by DAVID and KOBAS online analyses, respectively. The protein-protein interaction (PPI) networks of the DEGs were constructed from the STRING database. A total of 190 DEGs were identified in the three GEO datasets, of which 99 genes were upregulated and 91 genes were downregulated. GO analysis showed that the biological functions of DEGs focused primarily on regulating cell proliferation, adhesion, and differentiation and intracellular signal cascades. The main cellular components include cell membranes, exosomes, the cytoskeleton, and the extracellular matrix. The molecular functions include growth factor activity, protein kinase regulation, DNA binding, and oxygen transport activity. KEGG pathway analysis showed that these DEGs were mainly involved in the Wnt signaling pathway, amino acid metabolism, and the tumor signaling pathway. The 17 most closely related genes among DEGs were identified from the PPI network.
CONCLUSION: This study indicates that screening for DEGs and pathways in ovarian cancer using integrated bioinformatics analyses could help us understand the molecular mechanism underlying the development of ovarian cancer, be of clinical significance for the early diagnosis and prevention of ovarian cancer, and provide effective targets for the treatment of ovarian cancer.

Entities:  

Keywords:  GEO data; differentially expressed genes; integrated bioinformatics; ovarian cancer

Year:  2018        PMID: 29588600      PMCID: PMC5858852          DOI: 10.2147/OTT.S152238

Source DB:  PubMed          Journal:  Onco Targets Ther        ISSN: 1178-6930            Impact factor:   4.147


Introduction

The mortality rate of ovarian cancer ranks the first among the malignant tumors that occur in female reproductive organs, and the incidence of this cancer is increasing each year. More than 200,000 new cases of ovarian cancer occur each year in the world, resulting in >140,000 deaths.1 The incidence of ovarian cancer is occult, early diagnosis is difficult, and invasion and metastasis occur easily. When ovarian cancer is detected, the patient is usually at an advanced stage of the disease. The 5-year survival rate of patients with advanced ovarian cancer is only ~20%, but for patients in the early stage, it can reach 85%–90%.2 At present, the commonly used methods for the early diagnosis and monitoring of ovarian cancer are ultrasonography combined with serum tumor marker assays, but there are some limitations, and the specificity is not high. Computed tomography (CT) and positron emission tomography (PET) can only detect lesions with a volume of ≥1 cm, and they cannot detect early tumor metastasis. At present, the treatment of ovarian cancer includes surgical staging, surgery, reoperation, staging surgery, cytoreductive surgery, postoperative combined chemotherapy, radiotherapy, and biological treatment. In recent years, the clinical diagnosis and treatment of ovarian cancer have improved, but the 5-year survival rate of patients is still 30%.3 The reasons that lead to a failure of treatment, tumor recurrence, and the low survival rate include its insidious onset, the fact that ovarian cancer is not usually detected at an early stage and cannot be removed effectively by surgery, and the fact that tumor cells have a primary or secondary tolerance to radiotherapy and chemotherapy.4 Therefore, it is important to study the underlying molecular mechanisms of the malignant biological behavior of ovarian cancer cells and therefore identify more effective early diagnostic techniques and more reliable molecular markers for monitoring recurrence and evaluating prognosis, as well as to explore a more effective way to block and control tumor cell proliferation, metastasis, and the reversal of drug resistance in cancer cells. As an efficient and large-scale technique for acquiring genetic data, gene expression microarrays have been widely used to collect gene chip expression profiling data and to study gene expression profiles in many human cancers. These microarrays provide a new method for studying tumor-related genes and offer promising prospects for molecular prediction, drug-based molecular targeting, and molecular therapy.5,6 With the widespread application of gene expression microarray technology, a large amount of data have been published on public database platforms, and integrating these databases can allow a deeper study of molecular mechanisms. At present, a large number of studies have been performed on ovarian gene expression profiles, and these studies have screened thousands of differentially expressed genes (DEGs) that may be involved in the development and progression of ovarian cancer.7 However, the results for the identification of significantly expressed mRNAs are inconsistent or discrepant among different studies due to tissue or sample heterogeneity among each independent experiment, different technological detection platforms, different data processing methods, and the fact that the samples come from different backgrounds. Thus, there are still some limitations in a single-cohort study, and we should integrate their results using an unbiased approach. The integration and analysis of microarray data from several gene expression profiles may resolve these problems and enable the discovery of effective and reliable molecular markers. The RobustRankAggreg (RRA) approach has been specifically designed for the comparison of several ranked gene lists.8 The RRA method uses a probabilistic model for aggregation that is robust to noise and facilitates the calculation of significance probabilities for all the elements in the final ranking. The RRA method is able to view the ranking of each item in each list, and it compares this ranking with the baseline case where all the preference lists are randomly ordered. The P-value can represent the rank location and significance of a gene. The higher the gene ranks, the smaller the P-value is. RRA is a suitable and effective integrative analysis solution for the identification of statistically significant genes. In addition, it is useful when different kinds of genes are obtained by different technology platforms and full rankings of mRNAs are not available. In this study, we have downloaded three original microarray datasets, GDS3592,9 GSE54388,10 and GSE66957 (Cheng et al, unpublished data, 2015), from the NCBI-Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih. gov/geo/), which contained a total of 115 samples, with 85 cases of ovarian cancer samples and 30 cases of normal ovarian samples. DEGs in ovarian cancer and normal ovarian samples were screened using the R software, and the gene ontology (GO) pathway enrichment analysis of DEGs was performed on DAVID (https://david.ncifcrf.gov/) and KOBAS-Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways (http://kobas.cbi.pku.edu.cn/). Then, the STRING online database protein–protein interaction (PPI) network was used to analyze the association of DEGs and discover the molecular interactions involved in tumorigenesis. In conclusion, the DEGs associated with the carcinogenesis and development of ovarian cancer were screened through the GEO database of ovarian cancer and an integrated analysis was conducted. The biological functions and key signaling pathways of these DEGs are discussed, and the encoding protein interaction network was analyzed. Our study provides reliable molecular markers for early detection and prognosis, as well as effective drug targets for treating ovarian cancer.

Materials and methods

Microarray data

DNA microarray is a new technique that can analyze genome and characteristic map of gene expression. A variety of DNA microarray and DNA chip devices and systems have now been developed and commercialized. DNA microarray analysis includes an oligonucleotide chip, cDNA chip, and genomic chip, and is divided into the following two modes: one is to fix the target DNA on the support, which is suitable for the analysis of a large number of different target DNAs, and another involves fixing a large number of probes on the support material, which is suitable for the analysis of different probe sequences of the same target DNA.11 There are various platforms available (http://arrayconsortium.tgen. org/np2/home.do; http://www1.amershambiosciences.com/APTR1X/upp01077.nsf/Contentand/codelink_bioarray_system; http://www.affymetrix.com; http://www.illumina.com/prod_expression.htm; and http://www.nimblegen.com; http://www.xeotron.com).12 Using the keywords “ovarian cancer geo accession” to search on the GEO DataSets database (https://www.ncbi.nlm.nih.gov/geo/), the gene expression profiles of GDS3592,9 GSE54388,10 and GSE66957 (Cheng et al, unpublished data, 2015) were downloaded. The platform for GDS3592 is GPL570, [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2 Array, which includes 12 normal ovarian surface epithelial samples and 12 serous papillary ovarian adenocarcinomas specimens. The platform for GSE54388 is GPL570, [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2 Array, which consists of six ovarian surface epithelium samples and 16 high-grade serous ovarian cancer samples. The platform for GSE66957 is GPL15048 Rosetta/Merck Human RSTA Custom Affymetrix 2 microarray [HuRSTA_2a520709. CDF], which includes 12 normal ovarian surface epithelium samples and 57 high-grade serous ovarian cancer samples. Platform and series matrix file(s) were downloaded as TXT files. The dataset information is shown in Table 1. The R software package was used to process the downloaded files and to convert and reject the unqualified data. The data were calibrated, standardized, and log2 transformed.
Table 1

Details for GEO ovarian cancer data

ReferenceSampleGEOPlatformNormalTumor
Bowen et al (2009)9OvarianGDS3592GPL5701212
Yeung et al (2017)10OvarianGSE54388GPL570616
Cheng et al (unpublished data, 2015)OvarianGSE66957GPL150481257

Abbreviation: GEO, Gene Expression Omnibus.

Screening for DEGs

The downloaded platform and series of matrix file(s) were converted using the R language software and annotation package. The ID corresponding to the probe name was converted into an international standard name for genes (gene symbol) and saved in a TXT file. Gene differential expression analysis was performed using the limma package in the Bioconductor package (available online: http://www.bioconductor.org/). The related operating instruction codes were put into R, and the DEGs in ovarian cancer and normal ovarian samples of the three microarray datasets were analyzed by the limma software package. Samples with a corrected P-value of <0.05 and log fold change (FC) >2 were considered DEGs. The TXT results were preserved for subsequent analysis.

Integration of microarray data

The list of DEGs from the three microarray datasets obtained by limma packet analysis was saved as a TXT file. The RRA software package was downloaded, and R was used to run the instruction code. A list of genes that were up- or downregulated in the three chips were used for subsequent analysis. The RRA approach is openly available in the Comprehensive R Network (http://cran.r-project.org/).

GO and KEGG pathway enrichment analyses of DEGs

The DAVID database (https://david.ncifcrf.gov/) is an essential foundation for the success of any high-throughput gene function analysis. The functional and pathway enrichment of the proteins encoded by candidate genes were analyzed, and these genes were annotated using the DAVID database.13 GO annotations were performed using a DAVID online tool on the screened DEGs. KEGG pathway analysis of DEGs was performed using the KOBAS online analysis database (available online: http://kobas.cbi.pku.edu.cn/). In this study, we analyzed the DEGs that were significantly up- and downregulated as determined from integrated microarray ovarian cancer data, and a P-value of <0.05 was considered statistically significant.

PPI network integration

The STRING database (http://string-db.org/) is a software system that is commonly used to identify the interactions between known proteins and predicted proteins. The results in this database are obtained from experimental data, databases, text mining, and predictive bioinformatics data.14 In addition, the core of the Cytoscape software is a network. Each node is a gene, protein, or molecule, and the connections between nodes represent the interaction of these biological molecules, which can be used to identify interactions and pathway relationships between the proteins encoded by DEGs in ovarian cancer. The corresponding proteins in the central node may be core proteins or key candidate genes with important physiological regulatory functions.

Results

Microarray data information and identification of DEGs in ovarian cancer

The ovarian cancer expression microarray datasets GDS3592, GSE54388, and GSE66957 were standardized, and the results are shown in Figure 1. When the GDS3592 dataset was screened by the limma package (corrected P-value <0.05, logFC >2), 839 DEGs were obtained. Among them, 289 downregulated genes and 550 upregulated genes were identified. Overall, 255 DEGs were screened from the GSE54388 data-set, including 96 upregulated genes and 159 downregulated genes. Additionally, 1,498 DEGs were screened from the GSE66957 dataset, including 709 upregulated genes and 789 downregulated genes. The differential expression of multiple genes from the two sets of sample data included in each of the three microarrays is shown in Figure 2. The cluster heatmaps of the top 200 DEGs are shown in Figure 3.
Figure 1

Standardization of gene expression.

Notes: (A) The standardization of GDS3592 data, (B) the standardization of GSE54388 data, and (C) the standardization of GSE66957 data. The blue bar represents the data before normalization, and the red bar represents the normalized data.

Figure 2

Differential expression of data between two sets of samples.

Notes: (A) GDS3592 data, (B) GSE54388 data, and (C) GSE66957 data. The red points represent upregulated genes screened on the basis of |fold change| >2.0 and a corrected P-value of <0.05. The green points represent downregulation of the expression of genes screened on the basis of |fold change| >2.0 and a corrected P-value of <0.05. The black points represent genes with no significant difference. FC is the fold change.

Figure 3

Hierarchical clustering heatmap of DEGs screened on the basis of |fold change| >2.0 and a corrected P-value <0.05.

Notes: (A) GDS3592 data, (B) GSE54388 data, and (C) GSE66957 data. Red indicates that the expression of genes is relatively upregulated, green indicates that the expression of genes is relatively downregulated, and black indicates no significant changes in gene expression; gray indicates that the signal strength of genes was not high enough to be detected.

Abbreviation: DEGs, differentially expressed genes.

Identification of DEGs in ovarian cancer using integrated bioinformatics

The three ovarian cancer gene expression microarray datasets were analyzed by the limma package and sorted according to logfold-change value and then analyzed by RRA (corrected P-value <0.05). The RRA method is based on the hypothesis that each gene is randomly ordered in each experiment. If a gene ranked high in all experiments, then the smaller its P-value is, the greater the likelihood of differential gene expression. Through rank analysis, we identified 190 DEGs, with 99 upregulated genes and 91 downregulated genes; the DEGs are shown in Table 2. R-heatmap software was used to draw a heatmap of the top 20 up- and downregulated genes, as shown in Figure 4.
Table 2

Screening DEGs in ovarian cancer by integrated microarray

DEGsGene names
UpregulatedCP CD24 KLK7 ST6GALNAC1 MMP7 EPCAM SCGB2A1 PTH2R PRAME SPON1 MEOX1 ESRP1 MPZL2 TMPRSS4 SOX17 TNNT1 ELF3 SCGB1D2 SST LCN2 RGS1 LYPD1 KLK6 ESM1 CDH6 BCL11A FOXQ1 MUC16 UCP2 TMTC1 DCDC2 LGR6 TLCD1 KLK8 MAL SLC4A11 EHF WFDC2 ST6GALNAC2 FZD10 TSPAN12 SLC39A4 KRT23 CXXC5 DEFB1 C12orf56 LOC100129098 NPTX2 TMC4 CLDN3 LAMP3 EPHX4 TPX2 PSAT1 ZWINT MELK SLC2A1 LPAR3 DAPL1 SCRIB C2orf88 RASSF10 VTCN1 MUC1 FJX1 C1orf106 METTL7B LOC101928554 TMEM139 CKS2 SPP1 HOXB2 MAL2 MECOM TMEM125 CTCFL E2F8 IFI27 TTK CLDN7 ASS1 GLDC TMPRSS3 PTX3 FOXM1 EYA2 SULT1C2 RAD51AP1 GGT6 CEP55 NUF2 TACSTD2 LOC100509445 ZIC1 PPL MXRA5 ERP27 ZBED2 ATP6V1B1
DownregulatedITLN1 GADL1 PRG4 BCHE SYT4 OGN REEP1 GIPC2 MGARP HOXA5 HBB CHRDL1 SPOCK1 FABP4 MUM1L1 BNC1 VGLL3 SERTM1 TCEAL2 ANXA8L1 TCEAL7 LOC101930363 TMEM255A STK31 LINC01105 NELL2 AADACL2 EFEMP1 LOC100996760 AOX1 RNASE4 MGC24103 HSD17B2 ARHGAP18 LOC100506718 LRRN4 HAND2-AS1 FGF13 ADH1B SFRP1 NPY1R ADGRD1 LHX2 HBA2 RNF128 ADH1C CALB2 LGALS2 CRNDE NDNF CAV1 MNDA KCNT2 WNT2B GFPT2 GHR CSGALNACT1 OMD C21orf62 DFNA5 PCDH17 CHGB LOC101928635 ABCA8 ARX PCDH9 PRSS35 PEX5L PDE8B FGF9 NEFH PDGFD PEG3 CLEC4M PROCR NAP1L3 SGCG TSPAN8 HBG2 MEIS2 PDZD3 EPB42 CELF2 PCOLCE2 BAMBI ALDH1A1 SHISA3 PDE7B DIRAS3 HAS1 HSPB3

Abbreviation: DEGs, differentially expressed genes.

Figure 4

LogFC heatmap of the image data of each expression microarray.

Notes: The abscissa is the GEO ID, and the ordinate is the gene name. Red represents logFC >0, green represents logFC <0, and the values in the box represent the logFC values.

Abbreviations: FC, fold change; GEO, Gene Expression Omnibus.

GO term enrichment analysis of DEGs

Biological annotation of the DEGs in ovarian cancer identified from an integrated analysis of microarray data was performed using the DAVID online analysis tool, and GO functional enrichments of up- and downregulated genes with a P-value of <0.05 were obtained. GO analysis of DEGs was divided into three functional groups, including molecular function, biological processes, and cell composition. The results are shown in Figures 5 and 6. Significant results of the GO enrichment analysis of DEGs in ovarian cancer are shown in Tables 3 and 4. In the biological process group, the upregulated genes were mainly enriched in cell proliferation, regulation of transcription, proteolysis, and epithelial cell differentiation. The downregulated genes were mainly concentrated in ethanol oxidation, mesenchymal–epithelial cell signaling, and the Wnt signaling pathway. In the molecular function group, the upregulated genes were mainly enriched in DNA binding, transcriptional activator activity, and endopeptidase activity. The downregulated genes were mainly enriched in binding, including oxygen binding, iron ion binding, and calcium ion binding, as well as peptidase activator activity and growth factor activity. In the cell composition group, the upregulated genes were mainly enriched in plasma membranes, extracellular space, extracellular exosomes, and vesicles. The downregulated genes were mainly enriched in the extracellular matrix, extracellular exosomes, and the hemoglobin complex. These results indicate that most DEGs are significantly enriched in cell proliferation, binding, cell cycle regulation, and transcriptional activity.
Figure 5

GO enrichment analysis of DEGs in ovarian cancer.

Notes: (A) GO analysis divided DEGs into three functional groups: molecular function, biological processes, and cell composition. (B) GO enrichment significance items of DEGs in different functional groups.

Abbreviations: DEGs, differentially expressed genes; GO, gene ontology.

Figure 6

Distribution of DEGs in ovarian cancer for different GO-enriched functions.

Abbreviations: DEGs, differentially expressed genes; GO, gene ontology.

Table 3

GO analysis of upregulated genes associated with ovarian cancer

TermDescriptionCountP-value
Upregulated
GO:0016328Lateral plasma membrane51.47E–04
GO:0005615Extracellular space183.70E–04
GO:0000978RNA polymerase II core promoter proximal region sequence-specific DNA binding80.001939897
GO:0016323Basolateral plasma membrane60.002186397
GO:0016324Apical plasma membrane70.003563039
GO:0043565Sequence-specific DNA binding90.004212667
GO:0004252Serine-type endopeptidase activity60.008867111
GO:0008283Cell proliferation70.010058246
GO:0070062Extracellular exosome240.011258185
GO:0045893Positive regulation of transcription, DNA-templated80.014565685
GO:0010046Response to mycotoxin20.014933085
GO:0042552Myelination30.022197031
GO:0015992Proton transport30.024979514
GO:0031982Vesicle40.0270288
GO:0001665Alpha-N-acetylgalactosaminide alpha-2,6-sialyltransferase activity20.02949141
GO:0032463Negative regulation of protein homooligomerization20.029645788
GO:0001077Transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding50.030142085
GO:0016266O-Glycan processing30.036297156
GO:0006508Proteolysis70.039180152
GO:0048672Positive regulation of collateral sprouting20.039333412
GO:0001766Membrane raft polarization20.039333412
GO:0030855Epithelial cell differentiation30.047971693
GO:0019911Structural constituent of myelin sheaths20.0486732

Abbreviation: GO, gene ontology.

Table 4

GO analysis of downregulated genes associated with ovarian cancer

TermDescriptionCountP-value
Downregulated
GO:0005576Extracellular region231.01E–06
GO:0005578Proteinaceous extracellular matrix70.001157176
GO:0005833Hemoglobin complex30.001220686
GO:0006069Ethanol oxidation30.001228863
GO:0005344Oxygen transporter activity30.001536131
GO:0015671Oxygen transport30.001938348
GO:0090263Positive regulation of the canonical Wnt signaling pathway50.001941189
GO:0008201Heparin binding50.00461454
GO:0008083Growth factor activity50.004821586
GO:0001654Eye development30.00769484
GO:0007596Blood coagulation50.008846893
GO:0016491Oxidoreductase activity50.010016724
GO:0005109Frizzled binding30.01001912
GO:0031720Haptoglobin binding20.012565482
GO:0060638Mesenchymal–epithelial cell signaling20.013163186
GO:0019825Oxygen binding30.016689042
GO:0030514Negative regulation of the BMP signaling pathway30.016784306
GO:0070062Extracellular exosome210.017300221
GO:0031838Haptoglobin–hemoglobin complex20.017445404
GO:0009267Cellular response to starvation30.01822392
GO:0071560Cellular response to transforming growth factor β stimulus30.019713919
GO:0004024Alcohol dehydrogenase activity, zinc-dependent activity20.02497527
GO:0005506Iron ion binding40.026650964
GO:0072562Blood microparticle40.029308832
GO:0050805Negative regulation of synaptic transmission20.030448545
GO:0005509Calcium ion binding80.030726722
GO:0098909Regulation of cardiac muscle cell action potentials involved in regulation of contraction20.034723005
GO:0016504Peptidase activator activity20.041282745
GO:0060484Lung-associated mesenchyme development20.04321623
GO:0030246Carbohydrate binding40.049593778

Abbreviations: BMP, bone morphogenetic protein; GO, gene ontology.

KEGG pathway analysis of DEGs

Using the KOBAS online analysis database (http://kobas.cbi.pku.edu.cn/) to analyze the DEGs identified from ovarian cancer-integrated gene microarrays, the most significantly enriched pathways of the DEGs were submitted to KEGG analysis. The results are shown in Table 5. The signaling pathways of DEGs were mainly enriched in the Wnt signaling pathways, metabolic pathways, and pathways in cancer. The data were imported into Cytoscape to calculate the topological characteristics of the network and determine each node. The genes and pathway nodes are represented by semiellipses. The results are shown in Figure 7.
Table 5

KEGG pathway analysis of DEGs associated with ovarian cancer

PathwayIDDatabaseGene countP-valueCorrected P-valueGenes
Wnt signaling pathwayhsa04310KEGG pathway67.43E–050.00765726SFRP1 MMP7 SOX17 FZD10 WNT2B BAMBI
Retinol metabolismhsa00830KEGG pathway40.0003106310.015310434ALDH1A1 AOX1 ADH1C ADH1B
Vitamin B6 metabolismhsa00750KEGG pathway20.0005927890.015310434PSAT1 AOX1
Pathways in cancerhsa05200KEGG pathway80.0006641160.015310434CKS2 WNT2B MECOM SLC2A1 FGF9 FZD10 FGF13 LPAR3
Tyrosine metabolismhsa00350KEGG pathway30.0007432250.015310434AOX1 ADH1C ADH1B
Taurine and hypotaurine metabolismhsa00430KEGG pathway20.0016262120.027916645GADL1 GGT6
Metabolic pathwayshsa01100KEGG pathway140.0023938920.035224406ALDH1A1 PSAT1 GLDC GADL1 AOX1 ASS1 ATP6V1B1 CSGALNACT1 ADH1B ADH1C GFPT2 HSD17B2 GGT6 ST6GALNAC1

Abbreviations: DEGs, differentially expressed genes; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Figure 7

Significant pathway enrichment of DEGs.

Note: Red represents the signaling pathway, green represents downregulated genes, blue represents signaling pathway, and red represents upregulated genes.

Abbreviation: DEGs, differentially expressed genes.

Analyzing DEGs in ovarian cancer using a PPI network

The DEG expression products in ovarian cancer were constructed using the STRING database (http://string-db.org) to construct PPI networks, with a total of 190 DEGs, including 99 upregulated genes and 91 downregulated genes. After removing the isolated and partially connected nodes, a complex network of DEGs was constructed, as shown in Figure 8. The 17 most significant genes showing significant interaction were HBB, ZWINT, WNT2B, SPP1, HBA2, NUF2, ALDH1A1, FZD10, MMP7, MUC16, MUC1, OMD, OGN, AOX1, ADH1B, HBG2, and TTK.
Figure 8

PPI network.

Notes: Circles represent genes, lines represent the interaction of proteins between genes, and the results within the circle represent the structure of proteins. Line color represents evidence of the interaction between the proteins.

Abbreviation: PPI, protein–protein interaction.

Discussion

Ovarian cancer is one of the most common tumors in the female reproductive system. The incidence of ovarian cancer is the second highest among gynecologic malignancies, but the mortality rate is the highest among gynecologic tumors.15 The early onset of ovarian cancer is difficult to identify, and tumors are often found in the late stages of the disease. Its occurrence and development are complex biological processes that can occur at any age, and the disease has a poor prognosis. Therefore, it is important to study the molecular mechanisms of the carcinogenesis and development of ovarian cancer. Microarray and high-throughput sequencing technologies that detect the expression levels of tens of millions of genes in humans have been widely used to predict potential targets for ovarian cancer treatment. In recent decades, there have been many basic research reports on the mechanisms underlying the occurrence of ovarian cancer, but the 5-year survival rate of ovarian cancer is still relatively low, and there are no clear and effective treatment measures because most studies focus on a single genetic event or the results are generated from a single-cohort study.16 This study integrated three gene expression profile datasets from different groups and used R software and bioinformatics to deeply analyze these datasets. The results identified 255 DEGs using the RRA analysis method, including 96 upregulated genes and 159 downregulated genes. The top 20 most significantly upregulated genes were CP, CD24, KLK7, ST6GALNAC1, MMP7, EPCAM, SCGB2A1, PTH2R, PRAME, SPON1, MEOX1, ESRP1, MPZL2, TMPRSS4, SOX17, TNNT1, ELF3, SCGB1D2, SST, and LCN2. The top 20 most significantly downregulated genes were ITLN1, GADL1, PRG4, BCHE, SYT4, OGN, REEP1, GIPC2, MGARP, HOXA5, HBB, CHRDL1, SPOCK1, FABP4, MUM1L1, BNC1, VGLL3, SERTM1, TCEAL2, and ANXABL1. In addition, 255 DEGs were divided into groups by GO functional annotation, including molecular function, biological process, and cellular component groups. DEG enrichment was determined by KEGG signal pathway analysis to construct a PPI of DEG-encoding proteins and to screen the 17 most closely related genes. The DEGs in ovarian cancer were analyzed by GO functional annotation and showed that the upregulated DEGs were mainly involved in the regulation of cell proliferation and gene differentiation in ovarian cancer, transcription, different membranes, membrane raft polarization, proteolysis, and extracellular exosomes, and that the downregulated genes were mainly involved in the positive regulation of the canonical Wnt signaling pathway, oxygen transport, and mesenchymal–epithelial cell signaling, and in the negative regulation of the bone morphogenetic protein signaling pathway, and the cellular response to starvation and transforming growth factor β (TGF-β) stimulus. This finding is consistent with the knowledge that cell proliferation and differentiation, mesenchymal–epithelial cell signaling, and the cellular response to starvation and TGF-β play important roles in the tumor development and progression and that ion transport can contribute to cancer-related processes that differ substantially from processed in normal cells.17 The TGF-β signaling pathway is an important intracellular signal transduction pathway involved in the embryonic development, tumorigenesis, wound healing, inflammatory response, and physiological and pathophysiological processes.18 TGF-β can improve the adhesion and motility of tumor cells by promoting the epithelial-to-mesenchymal transition (EMT) and can inhibit the expression of E-cadherin, damaging the connection between the epithelium and weakening the adhesion of cancer cells. Through the expression of N-cadherin, interstitial characteristics can develop, and the invasion and metastasis of tumor cells from the primary site is facilitated.19 Therefore, the abnormal expression of TGF-β is closely related to tumorigenesis and cancer progression. Furthermore, the enriched KEGG pathways of DEGs included the Wnt signaling pathway, metabolic pathways, and pathways in cancer. Recent study has shown that the Wnt/β-catenin signaling pathway can promote ovarian cancer resistance by promoting the EMT in ovarian cancer.20 Bodnar et al21 found that the activation of the Wnt/β-catenin signaling pathway could promote the proliferation and differentiation of ovarian cancer cells, inhibit apoptosis, and promote the growth of ovarian cancer. Multiple levels of negative modulators are involved in the Wnt/β-catenin signaling pathway. The DEGs that are closely associated with the Wnt/β-catenin signaling pathway in ovarian cancer were determined using the Cytoscape method, with SFRP1, MMP7, SOX17, FZD10, WNT2B, BAMBI, SOX17, FZD10, and MMP7 expressions upregulated and SFRP1, BAMBI, and WNT2B downregulated. Matrix metalloproteinases (MMPs) are closely related to tumor invasion and metastasis. The overexpression of MMPs can significantly promote the invasion and metastasis of tumor cells.22 MMP7 is an important member of the MMP family that can degrade the extracellular matrix, including the basement membrane, and inhibit the defense against tumor invasion and metastasis, thus increasing the invasive capacity of the tumor.23 The FZD10 gene is a member of the Frizzled gene family, which encodes the receptor protein of the Wnt pathway and plays an important role in the pathway. A study has found that BRMS1L mediates FZD10 silencing by promoting the recruitment of HDAC1 in the FZD10 promoter region and the acetylation of histone H3K9, thereby inhibiting the migration, invasion, and adhesion of breast cancer cell lines.24 Another study found that silencing the expression of FZD10 in ovarian cancer cells could significantly increase the sensitivity to chemotherapy drugs, but the specific mechanism was not clear.25 Our study suggests that FZD10 is closely related to Wnt and tumor-related pathways, which can provide a theoretical basis for further studies. SFRP1 is a Wnt antagonist that competes with the Frizzled protein receptor for Wnt ligands and can block Wnt signal transduction.26 Therefore, when SFRP1 is downregulated, it can promote the activation of the Wnt signaling pathway and the proliferation of tumor cells and accelerate the occurrence and development of tumors. The downregulation of BAMBI can promote tumor invasion and metastasis. Overexpression of BAMBI can reduce the TGF-β-induced EMT, and the invasion and migration of tumor cells and can slow tumor growth.27 A large number of studies have reported that WNT2B is upregulated in a variety of tumor tissues and is capable of activating the β-catenin-dependent Wnt signaling pathway.28,29 However, in our study, we found that the expression of WNT2B was downregulated in ovarian cancer, suggesting that it requires further detection in tissue samples to identify its expression patterns and explore the molecular mechanism underlying its role in ovarian cancer. Detecting these pathways and the expression of related molecules can help predict the occurrence and development of tumors. We constructed a PPI network of protein encoded by DEGs and identified the following 17 closely related genes: HBB, ZWINT, WNT2B, SPP1, HBA2, NUF2, ALDH1A1, FZD10, MMP7, MUC16, MUC1, OMD, OGN, AOX1, ADH1B, HBG2, and TTK. The proteins encoded by these genes are key nodes in the PPI network. Pathway enrichment analysis revealed that the genes were mainly involved in the Wnt signaling pathway, retinol metabolism, pathways in cancer, and metabolic pathways. Endo et al found that ZWINT was overexpressed in breast cancer cell lines and could promote tumor cell growth, and that the degradation of ZWINT negatively regulated cell proliferation. Increasing evidence shows that SPP1 is closely related to the tumorigenesis and metastasis of tumors.30 A high expression of SPP1 was detected in tumor tissues, such as colon cancer, gastric cancer, prostate cancer, and breast cancer.31–33 NUF2 is expressed in a variety of malignancies and plays an important role in tumorigenesis and progression.34 NUF2 is highly expressed in epithelial ovarian cancer and is involved in many characteristics, such as the metastasis, invasion, division, and proliferation of cancer cells.35 However, the specific molecular mechanisms underlying the abnormal expression of NUF2 in tumorigenesis and progression remain to be studied. In recent years, acetaldehyde dehydrogenases (ALDHs) have been found to play an important role in tumor cell metabolism and tumorigenesis. Among them, ALDH1 is closely related to tumor cell stemness. Among the numerous ALDH1 families, aldehyde dehydrogenase 1A1 (ALDH1A1) is one of the important members of the superfamily. The physiological function of ALDH1A1 is to participate in the metabolism of retinoic acid and promote the activation of the metabolic pathway of retinoic acid. In recent years, the role and function of this molecule in malignant tumors have attracted research attention. Studies have reported that ALDH1A1 is highly expressed in ovarian cancer cells and is involved in the resistance to chemoradiotherapy in ovarian cancer, resulting in recurrence and metastasis.36 Another study reported that ALDH1A1 is the target molecule of β-catenin and β-catenin knockdown can disrupt ovarian cancer spheroid formation, cell viability, and tumor growth and metastasis.37 Therefore, developing more specific ALDH1A1 inhibitors could increase chemotherapy effectiveness in ovarian cancer. MUC16 is an important tumor marker for the early diagnosis of epithelial ovarian cancer. It is widely used in clinical practice. A study found that 80% of patients with ovarian cancer had elevated serum MUC16 levels.38 In addition, The Cancer Genome Atlas (TCGA) effort aimed at ovarian cancer found that both the amplification of the gene encoding MUC16 and the expression of MUC16 mRNA are closely related to the poor prognosis of patients with ovarian cancer,39 suggesting that the abnormal increase of tumor marker MUC16 in ovarian cancer plays an important role in the development and progression of ovarian cancer. The Wnt signaling pathway is an important signaling pathway in biological development and tumorigenesis. β-Catenin is the major effector of the Wnt signaling pathway. After silencing MUC1, the levels of E-cadherin and E-cadherin/β-catenin complexes were elevated, and the expression levels of nuclear β-catenin, cyclin D1 (cyclinD1), and c-myc were decreased.40 In renal cell carcinoma, MUC1-C and β-catenin interact with the promoter region of SNAIL and increase SNAIL transcription, thereby facilitating the EMT of tumor cells.41 In recent years, MUC1-N monoclonal antibodies, MUC1-C peptides, and MUC1 vaccines have been extensively studied in preclinical, experimental, and clinical trials. Osteomodulin (OMD) plays an important role in the extracellular matrix in tooth and cartilage tissue and is reportedly involved in bone mineralization.42 Osteoglycin (OGN) is a member of proteoglycans (PGs) called small leucine-rich PGs (SLRPs), which are a group of extracellular matrix molecules related in structure and function that are involved in matrix assembly, the regulation of cellular growth, and migration.43 OGN, also known as osteoinductive factor, plays an important role in maintaining normal bone tissue.44 As an extracellular matrix molecule, OGN is involved in the formation and regulation of extracellular matrix and has a potential impact on tumor cell metastasis. In vivo experiments have shown that OGN is involved in the regulation of collagen fiber formation associated with tumor metastasis, suggesting that OGN may inhibit tumor metastasis by a number of mechanisms.45 MMPs are important regulators of extracellular matrix metabolism and are closely related to tumor metastasis.46 Future studies should focus on the relationship between OGN expression and MMPs secreted by tumor cells. Many studies have shown that the protein expression of the ADH1B gene is downregulated in many tumors.47,48 Compared with levels in normal tissues, the mRNA levels of ADH1B in colorectal cancer tissues were low.49 The risk of residual lesions in patients with high-grade ovarian cancer after cytoreductive surgery is closely related to the high expression of ADH1B.50 HBB is one of the globin strands of hemoglobin, whose basic function is to transport oxygen.51 A study found that the expression of HBB proteins in a rat model of ovarian cancer was significantly lower, which suggests that it can be used for early diagnosis and disease detection, and as a treatment target.52 Thus, ADH1B and HBB are negatively correlated with patient outcome. This conclusion is also consistent with previous studies reporting that HBB and ADH1B are biomarkers for early detection and found in other microarrays retrieved from oncomine.53 The important role of protein kinases in cellular activity has received increased attention. Protein kinases can phosphorylate target proteins deliver and amplify signals, and then regulate the activities of normal epithelial cells, inflammatory cells, tumor cells, and other cells, such as proliferation, migration, apoptosis, and metastasis.54 TTK kinase is a dual-specificity kinase that can phosphorylate tyrosine and serine/threonine residues and is critical for the recruitment of SAC proteins to unattached kinetochores, mitotic checkpoint complex (MCC) formation, and mitotic arrest.55 Studies have reported that TTK is overexpressed in many cancers, such as breast cancer, lung cancer, prostate cancer, and liver cancer.56–59 Another report showed that TTK is a favorable prognostic biomarker associated with triple negative breast cancer (TNBC) survival, and a high level of TTK expression predicts good survival and may safely spare the patient from adjuvant chemotherapy. A low level of TTK activates B-Raf/ERK signaling, which contributes to the invasiveness of cancer cells and poor survival of patients with TNBC.60

Conclusion

We downloaded multiple microarray datasets from the NCBI GEO database and integrated three microarray datasets. Then, we used R software and bioinformatics analysis to further investigate these datasets. We have identified 190 candidate DEGs, which may be involved in the progression of ovarian cancer. Among them, 99 genes were upregulated and 91 genes were downregulated. Using R software, we identified the top 20 most significantly up- and downregulated genes that could be the most related to the occurrence and development of ovarian cancer. By analyzing the GO and KEGG pathways, we found that DEGs were mainly enriched in the Wnt signaling pathway, metabolic pathways, and pathways in cancer, which provide a theoretical basis for studying the biological processes of ovarian cancer. We successfully constructed a PPI network of DEGs in epithelial ovarian cancer and screened several key genes encoding proteins in the network that are involved in the process of ovarian cancer in the form of molecular populations. Further study of this network would be beneficial for understanding the interaction between DEGs. These findings improve our understanding of the pathogenesis of ovarian cancer and the occurrence and development of the underlying molecular mechanisms. Our study has important clinical significance for the early diagnosis and treatment, as well as the prevention, of ovarian cancer and provides effective targets for the treatment of ovarian cancer. However, further molecular biological experiments are required to confirm the function of the identified genes associated with ovarian cancer.
  60 in total

Review 1.  Matrix metalloproteinases (MMPs) in health and disease: an overview.

Authors:  Charles J Malemud
Journal:  Front Biosci       Date:  2006-05-01

2.  High levels of the Mps1 checkpoint protein are protective of aneuploidy in breast cancer cells.

Authors:  Jewel Daniel; Jonathan Coulter; Ju-Hyung Woo; Kathleen Wilsbach; Edward Gabrielson
Journal:  Proc Natl Acad Sci U S A       Date:  2011-03-14       Impact factor: 11.205

3.  RNA sequencing identifies novel markers of non-small cell lung cancer.

Authors:  Seon-Sook Han; Woo Jin Kim; Yoonki Hong; Seok-Ho Hong; Seung-Joon Lee; Dong Ryeol Ryu; Wonho Lee; Yo Han Cho; Seungkoo Lee; Young-Joon Ryu; Jun Yeon Won; Hwanseok Rhee; Jung Hoon Park; Se Jin Jang; Jae Seung Lee; Chang-Min Choi; Jae Cheol Lee; Sang Do Lee; Yeon-Mok Oh
Journal:  Lung Cancer       Date:  2014-03-26       Impact factor: 5.705

Review 4.  Matrix metalloproteinases and tumor metastasis.

Authors:  Elena I Deryugina; James P Quigley
Journal:  Cancer Metastasis Rev       Date:  2006-03       Impact factor: 9.264

5.  Fructose-1,6-bisphosphatase is a novel regulator of Wnt/β-Catenin pathway in breast cancer.

Authors:  Kaichun Li; Mingzhen Ying; Dan Feng; Jie Du; Shiyu Chen; Bing Dan; Cuihua Wang; Yajie Wang
Journal:  Biomed Pharmacother       Date:  2016-10-22       Impact factor: 6.529

6.  Platform influence on DNA microarray data in postmortem brain research.

Authors:  Deborah Hollingshead; David A Lewis; Károly Mirnics
Journal:  Neurobiol Dis       Date:  2005-04       Impact factor: 5.996

Review 7.  Targeting the TGFβ signalling pathway in disease.

Authors:  Rosemary J Akhurst; Akiko Hata
Journal:  Nat Rev Drug Discov       Date:  2012-09-24       Impact factor: 84.694

Review 8.  Dimerization-induced allostery in protein kinase regulation.

Authors:  Hugo Lavoie; John J Li; Neroshan Thevakumaran; Marc Therrien; Frank Sicheri
Journal:  Trends Biochem Sci       Date:  2014-09-11       Impact factor: 13.807

9.  siRNA-mediated knockdown against NUF2 suppresses pancreatic cancer proliferation in vitro and in vivo.

Authors:  Peng Hu; Xi Chen; Jing Sun; Ping Bie; Lei-Da Zhang
Journal:  Biosci Rep       Date:  2015-01-14       Impact factor: 3.840

10.  Expression Atlas update--a database of gene and transcript expression from microarray- and sequencing-based functional genomics experiments.

Authors:  Robert Petryszak; Tony Burdett; Benedetto Fiorelli; Nuno A Fonseca; Mar Gonzalez-Porta; Emma Hastings; Wolfgang Huber; Simon Jupp; Maria Keays; Nataliya Kryvych; Julie McMurry; John C Marioni; James Malone; Karine Megy; Gabriella Rustici; Amy Y Tang; Jan Taubert; Eleanor Williams; Oliver Mannion; Helen E Parkinson; Alvis Brazma
Journal:  Nucleic Acids Res       Date:  2013-12-04       Impact factor: 16.971

View more
  21 in total

1.  Integrated bioinformatics analysis for the identification of potential key genes affecting the pathogenesis of clear cell renal cell carcinoma.

Authors:  Hao Cui; Lei Xu; Zhi Li; Ke-Zuo Hou; Xiao-Fang Che; Bo-Fang Liu; Yun-Peng Liu; Xiu-Juan Qu
Journal:  Oncol Lett       Date:  2020-06-05       Impact factor: 2.967

2.  Upregulated expression of MMP family genes is associated with poor survival in patients with esophageal squamous cell carcinoma via regulation of proliferation and epithelial‑mesenchymal transition.

Authors:  Guifeng Xu; Ling Ou; Ying Liu; Xiao Wang; Kaisheng Liu; Jieling Li; Junjun Li; Shaoqi Wang; Dane Huang; Kai Zheng; Shaoxiang Wang
Journal:  Oncol Rep       Date:  2020-05-07       Impact factor: 3.906

3.  Understanding the Genetic Domestication History of the Jianchang Duck by Genotyping and Sequencing of Genomic Genes Under Selection.

Authors:  Lei Wang; Jiazhong Guo; Yang Xi; Shengchao Ma; Yanying Li; Hua He; Jiwen Wang; Chunchun Han; Lili Bai; Ahsan Mustafa; Hehe Liu; Liang Li
Journal:  G3 (Bethesda)       Date:  2020-05-04       Impact factor: 3.154

4.  Construction of a novel prognostic-predicting model correlated to ovarian cancer.

Authors:  Weichun Tang; Jie Li; Xinxia Chang; Lizhou Jia; Qi Tang; Ying Wang; Yanli Zheng; Lizhou Sun; Zhenqing Feng
Journal:  Biosci Rep       Date:  2020-08-28       Impact factor: 3.840

5.  Construction and analysis of a spinal cord injury competitive endogenous RNA network based on the expression data of long noncoding, micro‑ and messenger RNAs.

Authors:  Linbang Wang; Bing Wang; Jingkun Liu; Zhengxue Quan
Journal:  Mol Med Rep       Date:  2019-02-22       Impact factor: 2.952

6.  The underlying molecular mechanism and potential drugs for treatment in papillary renal cell carcinoma: A study based on TCGA and Cmap datasets.

Authors:  Jin-Shu Pang; Zhe-Kun Li; Peng Lin; Xiao-Dong Wang; Gang Chen; Hai-Biao Yan; Sheng-Hua Li
Journal:  Oncol Rep       Date:  2019-02-14       Impact factor: 3.906

7.  Identification and Validation of Potential Biomarkers and Their Functions in Acute Kidney Injury.

Authors:  Jianwen Chen; Yalei Chen; Alberto Olivero; Xiangmei Chen
Journal:  Front Genet       Date:  2020-05-12       Impact factor: 4.599

8.  Identification of differentially expressed genes and signaling pathways involved in endometriosis by integrated bioinformatics analysis.

Authors:  Fang-Fang Dai; An-Yu Bao; Bing Luo; Zi-Hang Zeng; Xiao-Li Pu; Yan-Qing Wang; Li Zhang; Shu Xian; Meng-Qin Yuan; Dong-Yong Yang; Shi-Yi Liu; Yan-Xiang Cheng
Journal:  Exp Ther Med       Date:  2019-11-18       Impact factor: 2.447

9.  Identification of differentially expressed genes and signaling pathways in chronic obstructive pulmonary disease via bioinformatic analysis.

Authors:  Xinwei Huang; Yunwei Li; Xiaoran Guo; Zongxin Zhu; Xiangyang Kong; Fubing Yu; Qiang Wang
Journal:  FEBS Open Bio       Date:  2019-09-29       Impact factor: 2.693

10.  Identification and Exploration of Novel Macrophage M2-Related Biomarkers and Potential Therapeutic Agents in Endometriosis.

Authors:  Zhongqi Cui; Ramesh Bhandari; Qin Lei; Mingzhi Lu; Lei Zhang; Mengmei Zhang; Fenyong Sun; Lijin Feng; Shasha Zhao
Journal:  Front Mol Biosci       Date:  2021-07-06
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.