Literature DB >> 33302814

Screening and Discovery of New Potential Biomarkers and Small Molecule Drugs for Cervical Cancer: A Bioinformatics Analysis.

Hui-Zhu Qiu1, Ji Huang2, Cheng-Cheng Xiang1, Rong Li1, Er-Dong Zuo1, Yuan Zhang1, Li Shan1, Xu Cheng1.   

Abstract

BACKGROUND: Cervical cancer (CC) is the second most common type of malignant tumor survival rate is low in advanced stage, metastatic, and recurrent CC patients. This study aimed at identifying potential genes and drugs for CC diagnosis and targeting therapies.
METHODS: Three GEO mRNA microarray datasets of CC tissues and non-cancerous tissues were analyzed for differentially expressed genes (DEGs) by limma package. GO (Gene Ontologies) and KEGG (Kyoto Encyclopedia of Genes and Genomes) were used to explore the relationships between the DEGs. Protein-protein interaction (PPI) of these genes was established by the STRING database. MCODE was used for screening significant modules in the PPI networks to select hub genes. Biochemical mechanisms of the hub genes were investigated with Metascape. GEPIA database was used for validating the core genes. According to these DEGs, molecular candidates for CC were recognized from the CMAP database.
RESULTS: We identified 309 overlapping DEGs in the 2 tissue-types. Pathway analysis revealed that the DEGs were involved in cell cycle, DNA replication, and p53 signaling. PPI networks between overlapping DEGs showed 68 high-connectivity DEGs that were chosen as hub genes. The GEPIA database showed that the expression levels of RRM2, CDC45, GINS2, HELLS, KNTC1, MCM2, MYBL2, PCNA, RAD54 L, RFC4, RFC5, TK1, TOP2A, and TYMS in CC tissues were significantly different from those in the healthy tissues and were significantly relevant to the OS of CC. We found 10 small molecules from the CMAP database that could change the trend of gene expression in CC tissues, including piperlongumine and chrysin.
CONCLUSIONS: The 14 DEGs identified in this study could serve as novel prognosis biomarkers for the detection and forecasting of CC. Small molecule drugs like piperlongumine and chrysin could be potential therapeutic drugs for CC treatment.

Entities:  

Keywords:  bioinformatics analysis; candidate small molecules; cervical cancer; novel biomarker

Year:  2020        PMID: 33302814      PMCID: PMC7734488          DOI: 10.1177/1533033820980112

Source DB:  PubMed          Journal:  Technol Cancer Res Treat        ISSN: 1533-0338


Introduction

Cervical cancer (CC) is the second most common type of malignant tumor in females. Recent global figures have estimated that 265,672 deaths occur every year due to CC, and 527,624 new patients are diagnosed with CC every year. It is the main cause of deaths due to cancer in women in developing countries.[1-4] Many CC patients can be treated if diagnosed early, but the prognosis is poor because of lack of obvious signs and symptoms, which do not appear until the advanced stages.[5] Metastatic and recurrent CC patients also have a poor prognosis. Research has shown that HPV (Human Papilloma Virus) infection is the cause of CC[5]; however, it is important to note that only a fraction of women infected by HPV develop CC. Therefore, HPV infection is not the only reason for CC induction. HPV acts by blocking cellular tumor suppressor pathways and causing genetic changes.[6] The p53-mediated growth arrest is inhibited by HPV infection, and c-MYC and telomerase reverse transcriptase (TERT) are activated, but genetic changes induced by other risk factors are required for the development of CC.[7-9] Thus, studies are necessary to explore the differential expression of key genes between CC and healthy cervix tissue samples. After the diagnosis of CC, many different treatment options are available according to the stage of cancer. Pre-invasive (stage 0) and early-stage patients (IA to IIA) are treated by surgery, and advanced-stage patients (IIB to IVB) are treated by radiation therapy and chemotherapy.[10] Cisplatin is considered to be the most active single agent against CC.[11] Numerous other chemotherapeutic drugs are also active against CC, including paclitaxe, carboplatin, bevacizumab, and topotecan.[12-15] Although platinum or taxane-based chemotherapy can significantly improve the survival of CC patients, the results of conventional treatments are still limited and the 5-year survival rate of CC is still low, only about 52%.[16] Consequently, the discovery of more effective, selective, and less toxic agents is urgently needed to improve the 5-year survival rate and reduce the mortality rate of CC patients. In this study, we explored new biomarkers for CC diagnosis and targeting therapies. We manipulated the microarray data of GSE63514, GSE9750, and GSE7803 datasets from the GEO (Gene Expression Omnibus) database to distinguish differentially expressed genes (DEGs) between CC and non-cancerous tissues. GO and pathway analysis were done to elucidate the functions of the DEGs. The hub genes related to the pathogenesis of CC were chosen by protein-protein interaction (PPI) network. The key genes displaying significantly different expression level in CC tissues and significant relation with the overall survival (OS) were identified. In addition, we identified some potential therapeutic candidates from the connectivity map (CMAP) database for treatment of CC.

Materials and Methods

Microarray Data

Three gene expression datasets [GSE63514, GSE9750, and GSE7803] were downloaded from GEO based on the following criteria: 1 The datasets should include not less than 10 of control samples, and not less than 20 cervical Squamous cancer samples. 2 The datasets should include cervical Squamous cancer specimens of different stages and different HPV infection status. The probes were altered into corresponding gene symbols according to the annotations. The GSE63514 dataset consisted of 28 CC cervical tissue samples and 24 healthy cervical tissue samples; GSE9750 dataset consisted of 33 CC cervical tissue samples and 24 healthy tissue samples; GSE7803 included 21 CC cervical tissue samples and 10 healthy cervical tissue samples.

DEGs Screening

The DEGs between CC and healthy tissue samples were analyzed in R language with the limma package. In order to balance genes and false-positive discovery rates, modified P-values and Benjamini/Hochberg false discovery levels were introduced. Non-meaning probe sets, including those without gene markers or genes present in only 1 sample set were omitted or averaged. The collection of significant DEGs was set to P-value < 0.01 and |logFC| > 1.

GO Enrichment and KEGG Pathway Analysis of DEGs

We used GO analysis to explore the potential functions of the common DEGs. GO is a widely used bioinformatics tool to identify genes and study-related biological processes. We employed KEGG to search for potential pathways of the overlapping DEGs. KEGG is a database to study gene functions and pathways from big datasets sourced from high-throughput experiments. DAVID, an online biological database, was used to analyze the GO and KEGG terms. The top 5 terms were considered statistically significant.

PPI Network Construction and Analysis

PPI network was established using the STRING database (version 10.0). We found a statistically significant association of DEG proteins expected by STRING with a combined score > 0.4. To display the molecular interaction networks, Cytoscape (version 3.7.2) was used. The Cytoscape plug-in MCODE (version 1.5.1) was used to screen main modules for the collection of the hub genes in the PPI networks. The criteria for selection were: degree cut-off = 10, node score cut-off = 0.2, Max depth = 100, and k-score = 2. Metascape (http://metascape.org, an online annotation analysis tool for gene function) was used to identify the biological cycle of the hub genes.

Validation of Key Genes

In order to analyze the effect of the hub genes on the prognosis of the patient, GEPIA (Gene Expression Profiling Integrated Analysis) software was used. The mRNA expression level of the hub genes was verified by the GEPIA platform. Referring to the median expression levels of the hub genes, the overall survival analyses were executed using the Kaplan-Meier curve in the GEPIA database. The expression level of genes in CC tissues was significantly different from normal tissues and was significantly related to the OS were considered to be the key genes. Online database Oncomine was used to verify the mRNA expression of the key genes in CC tissue samples. Finally, the PPI network was established according to the key genes by STRING with a combined score > 0.7, and KEGG pathway was analyzed by STRING too. Online database Human Protein Atlas (HPA, https://www.proteinatlas.org/) was used to verify qualitative data of protein in CC. Protein expression score is based on immunohistochemical data manually scored with regard to staining intensity (negative, weak, moderate or strong) and fraction of stained cells (<25%, or >25%). Each combination of intensity and fractions is automatically converted into a protein expression level score as follows: negative—not detected; weak <25%; positive-weak combined with weak fraction of stained cells >25%, moderate or strong intensity.

Identification of Small Molecules

CMAP database has information on a large number of molecular interfering experiments and gene expression. We matched DEGs with the genes involved in small molecular interference to identify some small molecules. First, the overlapping DEGs among the 3 datasets were grouped on the basis of their up- and down-regulated expression. These probes were mapped to the database to get relevant active small molecules, and the enrichment values were calculated. Values closer to −1 suggested that the small molecule could imitate the state of the normal cell, and values closer to 1 indicated that the compound might imitate the state of colonial carcinoma cells.

Results

Microarray Data Analysis and Identification of DEGs

After normalization of the data, DEGs (3753 in GSE63514, 2064 in GSE9750, and 560 in GSE7803) were identified by the limma package; a total of 309 overlapping DEGs in both the tissue samples were ascertained from the 3 datasets, which included 155 down-regulated genes and 154 up-regulated genes (Figure 1).
Figure 1.

Venn diagram.

Venn diagram.

GO Enrichment and KEGG Pathway Analyses of DEGs

We used DAVID to perform KEGG and GO analysis of the overlapping DEGs. The functional classification of these genes (p < 0.05) indicated significant enrichment in corresponding items. BP (Biological Process) analysis revealed that these DEGs were mostly enriched in response to DNA replication, mitotic cell cycle transformation, and cell division. Cellular Component was mainly enriched in nucleoplasm, cornified envelope, and cytoplasm. MF (Molecular Feature) was enriched in protein binding, chromatin binding, and peptidase activity of the serine form. KEGG analysis showed enrichment in a variety of pathways, such as cell cycle, DNA replication, and p53 signaling (Figure 2 and Table 1).
Figure 2.

GO enrichment and signaling pathway analysis of the overlapped DEGs in CC and healthy tissues. Abbreviations: GO, Gene Ontologies; DEGs, Differentially expressed genes; CC, Cervical cancer; BP, Biological Process; MF, Molecular Feature; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Table 1.

GO and KEGG Pathway Enrichment Analysis of DEGs in CC.

TermCountP value
GOTERM_BP_FAT
GO:0006260∼DNA replication261.29E-17
GO:0000082∼G1/S transition of mitotic cell cycle191.07E-13
GO:0051301∼cell division291.13E-11
GO:0030216∼keratinocyte differentiation144.53E-10
GO:0006270∼DNA replication initiation102.12E-09
GOTERM_CC_FAT
GO:0005654∼nucleoplasm862.61E-09
GO:0001533∼cornified envelope112.76E-09
GO:0005737∼cytoplasm1311.60E-08
GO:0070062∼extracellular exosome829.73E-08
GO:0030496∼midbody141.91E-07
GOTERM_MF_FAT
GO:0005515∼protein binding1971.94E-08
GO:0003682∼chromatin binding242.72E-07
GO:0008236∼serine-type peptidase activity101.14E-06
GO:0003688∼DNA replication origin binding52.49E-05
GO:0004252∼serine-type endopeptidase activity151.29E-04
KEGG_PATHWAY
hsa04110: Cell cycle181.69E-11
hsa03030: DNA replication91.05E-07
hsa04115: p53 signaling pathway78.72E-04
hsa05215: Prostate cancer70.00355863
hsa05212: Pancreatic cancer60.004617042
GO enrichment and signaling pathway analysis of the overlapped DEGs in CC and healthy tissues. Abbreviations: GO, Gene Ontologies; DEGs, Differentially expressed genes; CC, Cervical cancer; BP, Biological Process; MF, Molecular Feature; KEGG, Kyoto Encyclopedia of Genes and Genomes. GO and KEGG Pathway Enrichment Analysis of DEGs in CC. PPI network of the potential interactions between the overlapping DEGs was built using the STRING and Cytoscape databases. (Figure 3A). The major module was chosen by the PPI network using MCODE. Sixty-eight high-connectivity DEGs were selected as the hub genes, which were speculated to play important roles in CC pathogenesis (Figure 3B). The abbreviations, names, and functions of these genes are displayed in Table 2. The functions of these hub genes were analyzed by Metascape, which showed that the hub genes were enriched in cell cycle, cell cycle phase transition, cell division, and DNA replication.(Figure 3C).
Figure 3.

(A) PPI network of DEGs, nodes with red and green represent the up- and down-regulated genes. (B) The hub genes obtained from the PPI network (all nodes are up-regulated genes). (C) Bar graph of the enriched terms across input hub genes by Metascape, colored by P-values. Abbreviations: PPI, Protein interaction; DEGs, Differentially expressed genes;

Table 2.

Name and Functional Roles of 68 Hub Genes.

Gene symbolFull nameFunction
ASPMAbnormal spindle microtubule assemblyCell Mitosis
AURKAAurora kinase ACell cycle
BIRC5Baculoviral IAP repeat containing 5inhibit apoptosis cell death
BRCA1Breast cancer gene 1DNA repair and recombination
CCNE2Cyclin-E2Cell cycle control
CCNFCyclin FCell cycle control
CDC25ACell division cycle 25ACell mitosis
CDC45Cell division cycle 45DNA replication and DNA replication
CDC7Cell division cycle 7Cell cycle and DNA replication
CDK1Cyclin-dependent kinase 1Cell cycle
CENPFCentromere protein FCell Mitosis
CENPNCentromere protein NCell mitosis
CEP55Centrosomal protein of 55 kDaCell Mitosis
CKS1BCyclin-dependent kinases regulatory subunit 1Essential for cyclin-dependent kinases
CKS2CDC28 protein kinase regulatory subunit 2Cell cycle
DTLDenticleless E3 ubiquitin protein ligaseCell cycle control, DNA damage response and DNA synthesis
E2F8E2F transcription factor 8Transcription repressor
ECT2Epithelial cell transforming 2Cell cycle
EZH2Enhancer of zeste 2 polycomb repressive complex 2 subunitTranscriptional repression
FANCIFanconi anemia complementation group IDNA repair
FEN1Flap structure-specific endonuclease 1DNA replication and repair
FOXM1Forkhead box M1Cell proliferation
GINS1GINS complex subunit 1DNA replication
GINS2G2 and S-phase expressed 1DNA replication and DNA replication
GMNNGeminin DNA replication inhibitorInhibits DNA replication
GTSE1G2 and S-phase expressed 1p53-induced cell cycle arrest
HELLSLymphoid-specific helicaseDNA replication, repair, recombination transcription and methylation
HJURPHolliday junction recognition proteinCell Mitosis
HMMRHyaluronan mediated motility receptorCell motility
KIAA0101PCNA-associated factorDNA repair
KIF14Kinesin family member 14Cell division, cytokinesis, cell proliferation and apoptosis
KIF20AKinesin family member 20ACell division
KIF4AKinesin family member 4ACell Mitosis
KIFC1Kinesin family member C1RNA splicing
KNTC1Kinetochore associated 1Cell Mitosis
MAD2L1MAD2 mitotic arrest deficient-like 1Cell Mitosis
MCM2Minichromosome maintenance complex component 2DNA replication initiation and elongation
MCM5Minichromosome maintenance complex component 5DNA replication
MCM6Minichromosome maintenance complex component 6DNA replication
MCM7Minichromosome maintenance complex component 7DNA replication initiation and elongation
MELKMaternal embryonic leucine zipper kinaseCell cycle control
MKI67Marker of proliferation Ki-67Cell Mitosis
MYBL2MYB proto-oncogene like 2Cell survival, proliferation, and differentiation
NCAPGNon-SMC condensin II complex subunit G2Cell Mitosis
NCAPHNon-SMC condensin I complex subunit HCell Mitosis
NDC80Nuclear division cycle 80Cell cycle
NEK2NIMA related kinase 2Cell Mitosis
NUSAP1Nucleolar and spindle-associated protein 1Cell Mitosis
PCNAProliferating cell nuclear antigenDNA replication, DNA repair
PLK4Polo like kinase 4Cell cycle control
POLQDNA polymerase thetaDNA replication and repair
PRC1Protein regulator of cytokinesis 1Key regulator of cytokinesis
RAD51DNA repair protein 51DNA repair
RAD51AP1DNA repair and recombination protein 51 associated protein 1DNA repair
RAD54LDNA repair and recombination protein 54-likeDNA repair
RFC4Replication factor C subunit 4DNA replication
RFC5Replication factor C subunit 5DNA replication
RRM2Ribonucleotide reductase regulatory subunit M2DNA synthesis
SMC4Structural maintenance of chromosomes protein 4Cell mitosis
SPAG5Sperm associated antigen 5Cell Mitosis
TACC3Transforming acidic coiled-coil containing protein 3Cell Mitosis
TIMELESSTimeless circadian clockDNA replication
TK1Thymidine kinase 1DNA replication and repair
TOP2ADNA topoisomerase 2-alphaDNA synthesis and transcription, cell mitosis
TTKTTK protein kinaseCell proliferation
TYMSThymidylate synthaseDNA biosynthesis
WDHD1WD repeat and HMG-box DNA-binding protein 1DNA replication
ZWINTZW10 interacting kinetochore proteinCell mitosis
(A) PPI network of DEGs, nodes with red and green represent the up- and down-regulated genes. (B) The hub genes obtained from the PPI network (all nodes are up-regulated genes). (C) Bar graph of the enriched terms across input hub genes by Metascape, colored by P-values. Abbreviations: PPI, Protein interaction; DEGs, Differentially expressed genes; Name and Functional Roles of 68 Hub Genes. We found that the expression levels of the following genes was higher in CC tissues as compared to that in the healthy tissues: RRM2, CDC45, GINS2, HELLS, KNTC1, MCM2, MYBL2, PCNA, RAD54 L, RFC4, RFC5, TK1, TOP2A, and TYMS (Figure 4A). Up-regulated expression of CDC45, GINS2, HELLS, KNTC1, MCM2, MYBL2, PCNA, RAD54 L, RFC4, RFC5, TK1, TOP2A, and TYMS was linked to high survival rate of CC patients, whereas the over-expression of RRM2 in CC tissues was linked to lower survival rate of CC patients (Figure 4B); these results were in accordance with the results of GEPIA database. We also found that mRNA of the key genes were significantly overexpressed in CC tissues as compared to that in the healthy tissues, by analyzing 5 different Oncomine databases (Figure 4C). PPI network was established according to the key genes (Figure 4D). The KEGG pathway analysis indicated that the key genes were mainly enriched in DNA replication, mismatch repair, and nucleotide excision repair (Figure 4E). We verified the qualitative data of protein in CC by HPA too. Immunohistochemistry confirmed the differential positive expression rate of key genes protein (Figure 5). Both the antibody staining and intensity of key genes protein in CC samples were higher than normal tissue (Figure 6). Thus, RRM2, CDC45, GINS2, HELLS, KNTC1, MCM2, MYBL2, PCNA, RAD54 L RFC4, RFC5, TK1, TOP2A, and TYMS could act as potential key biomarkers for CC.
Figure 4.

(A) Expression boxplots of key genes by GEPIA. (B) Survival curves of key genes by GEPIA. (C) Oncomine Log rank analysis Heat maps of the key genes mRNA expression in clinical CC samples vs. normal tissues. 1. Cervical Squamous Cell Carcinoma vs. Normal, Biewenga Cervix, Gynecol Oncol, 2008. 2. Cancer Type: Cervical Cancer, Bittner Multi-cancer, Not Published, 2006. 3. Cervical Cancer vs. Normal, Pyeon Multi-cancer, Cancer Res, 2007. 4. Cervical Squamous Cell Carcinoma vs. Normal, Scotto Cervix 2, Genes Chromosomes Cancer, 2008. 5. Cervical Squamous Cell Carcinoma Epithelia vs. Normal Zhai Cervix, Cancer Res, 2007. The rank for a gene is the median rank for that gene across each of the analyses. The p-Value for a gene is its p-Value for the median-ranked analysis. (D) PPI network of key genes. Colored nodes: query proteins and first shell of interactors, empty nodes: proteins of unknown 3D structure, filled nodes: some 3D structure is known or predicted, line thickness indicates the strength of data support. (E) KEGG pathway enrichment analysis of key genes in CC sample. Abbreviations: GEPIA, Gene Expression Profiling Integrated Analysis; CC, Cervical Cancer; KEGG; Kyoto Encyclopedia of Genes and Genomes; TPM, transcripts per million.

Figure 5.

Immunohistochemistry confirmed the differential expression of key genes in CC tissues and normal tissues based on HPA. Abbreviations: HPA, Human Protein Atlas.

Figure 6.

Immunohistochemistry confirmed the differential expression of key genes in CC tissues and normal tissues based on HPA. Abbreviations: HPA, Human Protein Atlas.

(A) Expression boxplots of key genes by GEPIA. (B) Survival curves of key genes by GEPIA. (C) Oncomine Log rank analysis Heat maps of the key genes mRNA expression in clinical CC samples vs. normal tissues. 1. Cervical Squamous Cell Carcinoma vs. Normal, Biewenga Cervix, Gynecol Oncol, 2008. 2. Cancer Type: Cervical Cancer, Bittner Multi-cancer, Not Published, 2006. 3. Cervical Cancer vs. Normal, Pyeon Multi-cancer, Cancer Res, 2007. 4. Cervical Squamous Cell Carcinoma vs. Normal, Scotto Cervix 2, Genes Chromosomes Cancer, 2008. 5. Cervical Squamous Cell Carcinoma Epithelia vs. Normal Zhai Cervix, Cancer Res, 2007. The rank for a gene is the median rank for that gene across each of the analyses. The p-Value for a gene is its p-Value for the median-ranked analysis. (D) PPI network of key genes. Colored nodes: query proteins and first shell of interactors, empty nodes: proteins of unknown 3D structure, filled nodes: some 3D structure is known or predicted, line thickness indicates the strength of data support. (E) KEGG pathway enrichment analysis of key genes in CC sample. Abbreviations: GEPIA, Gene Expression Profiling Integrated Analysis; CC, Cervical Cancer; KEGG; Kyoto Encyclopedia of Genes and Genomes; TPM, transcripts per million. Immunohistochemistry confirmed the differential expression of key genes in CC tissues and normal tissues based on HPA. Abbreviations: HPA, Human Protein Atlas. Immunohistochemistry confirmed the differential expression of key genes in CC tissues and normal tissues based on HPA. Abbreviations: HPA, Human Protein Atlas.

Identification of Active Small Molecules Related to Key Genes

Table 3 shows the top 10 small molecules that significantly corresponded to changes in expression of the key genes. The small molecules with enrichment close to 1 were considered as robust inducers of CC. In contrast, small molecules like piperlongumine (PL) and chrysin (enrichment: −0.973 and −0.934) mimicked normal cells, which suggested that they could repair the cancerous state of CC cells; hence, they are potential lead compounds for development of new drugs for CC patients.
Table 3.

List of Top 10 Small Molecules That Can Imitate the Status of Normal Cells.

CMAP nameMeannEnrichmentP value
Piperlongumine-0.8022-0.9730.0015
Chrysin-0.7563-0.9340.0005
Menadione-0.8122-0.9170.0141
Apigenin-0.7844-0.8890.0003
1,4-chrysenequinone-0.7212-0.8870.0254
Alsterpaullone-0.743-0.8810.0033
Thioguanosine-0.7994-0.8690.0006
Sulconazole-0.7144-0.8630.0007
Sanguinarine-0.772-0.8550.0418
Blebbistatin-0.6982-0.8480.0465
List of Top 10 Small Molecules That Can Imitate the Status of Normal Cells.

Discussion

CC is the second most common type of malignant tumor in females with a 5-year survival rate of only about 52%. Thus, more research is required to discover candidate genes and drugs for improving the 5-year survival rate and reducing the mortality rate of CC patients. Over the past few years, microarray assay and bioinformatics tools have been commonly applied to screen gene mutations and gene expression levels to distinguish DEGs involved in tumor formation and development, and for their functional analysis. In the current study, the DEGs between the CC and non-cancerous tissues were collected from 3 mRNA datasets. A total of 309 overlapping DEGs were identified. It was speculated that these DEGs could play a role in the pathogenesis of CC and act as potential targets for clinical treatment. To understand the roles of these DEGs, GO and KEGG analyses were performed. GO term analysis was performed via 3 directions: BP, MF, and Cellular Component. BP analysis showed that these genes were mainly enriched in DNA replication, mitotic cycle transformation, and cell division. These Process were related to the proliferation and differentiation of tumor. MF analysis suggested that the DEGs were mostly associated with the binding of proteins, chromatin binding, and peptidase activity of the serine form. Protein function changes caused by protein binding, and chromatin changes caused by chromatin binding have been confirmed to play a key role in tumorigenesis.[17] Serine protease has been proved as potential diagnostic, prognostic targets for several human tumors, including CC.[18,19] Cellular Component was mainly enriched in nucleoplasm, cornified envelope, and cytoplasm. The KEGG analysis showed involvement of many pathways, such as cell cycle, DNA and p53 signaling pathway. The HPV oncoproteins, E7, play a major role in the development of CC. The major characteristic of E7 is to influence the cell cycle by inactivation of pRb, so the major driver of CC may be the cell cycle.[20] Moreover, previous studies have confirmed that numerous DEGs, for example, AURKA, CDK1, and PLK4, can adjust the cell cycle to influence CC progression.[21-23] Many DEGs, such as MCM, PCNA, RFC4, RFC5 are involved in the control of cell cycle by approving DNA replication, which highlights the significance of DNA replication in CC.[24,25] It is well known that HPV infection can inhibit the p53-mediated growth arrest. In accordance with this report, we found that the p53 signaling pathway was another critical pathway in the development of CC.[26] Thus, GO and KEGG pathways provide a range of explanations behind the molecular mechanism of CC. A PPI network was established using 309 DEGs and 68 DEGs were identified as the hub genes. The functional analysis of the hub genes by Metascape indicated that they were mainly enriched in cell cycle, cell cycle phase transition, cell division, and DNA replication. These results were broadly in line with our analysis on DEGs. To further understand the role of these 68 genes in CC development, we analyzed the expression level of these genes and overall survival of CC patients via GEPIA database. We found that the expression level of RRM2, CDC45, GINS2, HELLS, KNTC1, MCM2, MYBL2, PCNA, RAD54 L, RFC4, RFC5, TK1, TOP2A, and TYMS in CC tissues was higher than that in the normal tissues. CC patients with high expression of RRM2 had a lower survival rate than those with low expression, and high expression levels of CDC45, GINS2, HELLS, KNTC1, MCM2, MYBL2, PCNA, RAD54 L, RFC4, RFC5, TK1, TOP2A, and TYMS were related to high survival rate of CC patients. To further clarified the key role of these genes, we verified the qualitative data of protein in CC by HPA too. We found differential positive expression rate of key genes protein in CC sample, both the antibody staining and intensity of key genes protein in CC samples were higher than normal tissue. These results suggested that the expression of these genes was different from that of normal tissues at both mRNA and protein levels, and should play a key role in the development of CC. RRM2 is for dNDP synthesis by ribonucleotide reductase (RNR).[27] RRM2 is strongly linked to cancer biological pathways. High expression of RRM2 is linked to the invasive ability of cell, tumor angiogenesis, metastasis, and poor prognosis.[28,29] RRM2 has been found to activate a variety of oncogenes, including NF-κB, c-Myc, and v-fes.[30,31] Wang et al reported that the upregulation of RRM2 expression could promote the development of CC by ROS-ERK1/2-HIF-1α-VEGF-induced angiogenesis.[32] Su et al found that over-expression of RRM2 could lead to poor survival of CC patients.[33] In our study, we found that the patients with high expression of RRM2 had a low survival rate than those with low expression; thus, RRM2 could act as a new therapeutic target for developing treatments of CC. CDC45, MCM2, and GINS2, a part of the Cdc45-MCM-GINS complex, are important DNA replication genes in oncogenesis.[34] Over-expression of CDC45, MCM2, and GINS2 is reported to occur in the progression of many tumors, including CC.[35-38] HELLS, a member of the SNF2 family, is essential for DNA replication, methylation, and repair because of its ATPase activity, and has been found to be conspicuously up-regulated in different type of cancers.[39-41] KNTC1 is a gene involved in cell division but has not yet been studied in cancer.[42] MYBL2 (B-Myb) belongs to the MYB family and is a regulator of cell proliferation, cell survival and differentiation, and can be induced by expression of HPV16 E7 protein.[43,44] TOP2A is an isoenzyme, which mediates the catalytic activity of type II topoisomerases and can be transactivated by MYBL2.[45] Both TOP2A and MYBL2 were found to be involved in the pathogenesis of CC in different CC cell lines.[46,47] PCNA is widely recognized as a cell proliferation marker and is known to be activated by HPV infection in CC.[48] Its main function is DNA replication and repair, and it is considered to be a sign of poor prognosis for a variety of tumors.[49-51] RFC4 and RFC5 are members of the RFC family and play important roles in DNA replication and repair.[52] RFC family together with PCNA is an important part of the large loop DNA repair synthesis pathway.[53] RFC4 and RFC5 have been reported to be deregulated in cancer (for example, CC, head, and neck squamous cell carcinomas and prostate cancer) cells and are considered to be a predictor of poor overall survival.[54-58] RAD54 L, plays a key role in DNA repair in some cancers; however, this has not been proven in CC.[59,60] TK1 has a key function in DNA synthesis and repair.[61] Increased expression of TK1 is found in solid malignancies, and it has been associated with a poor outcome.[62-64] TYMS encodes for thymidylate synthase (TS), an enzyme involved in DNA replication and repair.[65] Some studies have proven that TYMS plays a role in cancer, and its expression indicates poor survival rate, but its role has not yet been studied in CC.[66,67] The 14 genes identified in this study are considered oncogenes in most tumors, and are indicators of poor prognosis of tumor patients. Some of these genes have also indicated poor prognosis of CC patients. But in our study, high expression of these genes was significantly related to high survival rate of CC patients. The mechanism responsible for this discrepancy in results is unclear; however, the following reasons could be responsible: 1. Zhang et al found that HPV infection might increase DNA repair mechanisms, which could limit the build-up of fatal cell mutations in head and neck carcinomas patients and lead to a better prognosis.[68] However, 2 important KEGG pathways that were found related to the key genes in our study were mismatch repair and nucleotide excision repair. Therefore, we speculated that over-activation of DNA repair mechanisms may responsible for the high survival rate of CC patients. 2. Like the expression of PCNA in breast cancer, high expression of certain genes may have affected the sensitivity to chemotherapy and reduced the risk of death.[69,70] 3. Small datasets and difference in ethnicity of the patients could also be one of the reasons. Therefore, further exploration is required to elucidate the mechanisms of these genes in CC pathogenesis. In addition, assisted by the CMAP database, we identified 10 small molecule candidates that could potentially reverse the expression of the overlapping genes in CC patients. PL (enrichment = −0.973) is a natural product extracted from the plant species Piper longum L[71]; it has a marked cytotoxic effect in numerous cancers, such as breast, colon, head and neck, ovarian, and prostate via multiple signaling pathways.[72-76] Seber et al demonstrated that PL could induce the apoptosis of HeLa cells.[77] A previous research suggested that PL induced cell death/apoptosis in breast cancer cells by regulating the members involved in apoptotic and survival pathways, including p53 and its targets.[78] One of the GO enrichment terms for the DEGs identified in this study was the p53 signaling pathway; thus, we speculated that the p53 signaling pathway could be one of the mechanisms used by PL against CC. Further experiments should be performed on CC tissues to test this hypothesis. Chrysin (enrichment = −0.934) is a bioactive flavone widely distributed in plants, and it is used as an herbal medicine in China. Zhang et al demonstrated that chrysin inhibited the growth of HeLa cells by apoptosis induction.[79] Another study indicated that chrysin could induce p38 and activate NF-κB/p65 in the HeLa cells; the MAPK p38 pathway is involved in cell cycle arrest and apoptosis.[80] Another important GO enrichment term found associated with the DEGs was the cell cycle pathway. Therefore, the drugs identified from the CMAP database were consistent with the results of the GO enrichment analysis. These drugs could be used in therapeutic regimen against CC in future. Experiments should be carried out to explore the anti-tumor mechanism of these molecules, which could be a new option for tumor therapy. Although several articles have been published based on GSE63514, GSE9750, and GSE7803, each article used different tools and explored different issues. Dai et al focused on microRNA and its target gene, Gong et al focused on the effect of propranolol in CC, and the research focus of Kori et al is co expression network. Our major research focus was on the genes that affect the prognosis of CC and the potential therapeutic drugs, and we did find some new genes and drugs. However, this study has several limitations. First, it was a study only based on TCGA database, we should validate the key genes with our own cohort in the future. Second, we should validate which pathways do these key genes work through on the development of cervical cancer. Third, further experiments with piperlongumine and chrysin should perform to confirm which pathway they work with on the treatment of cervical cancer.

Conclusions

In this study, we identified 14 key genes that were related to the survival rate of CC. We found potential drugs, such as PL and chrysin, which may act effectively against CC. However, the mechanism of these genes and drugs in CC patients is still unclear; further experiments are required to elucidate these mechanisms.
  79 in total

1.  Inhibition of Aurora A promotes chemosensitivity via inducing cell cycle arrest and apoptosis in cervical cancer cells.

Authors:  Jian-Ming Sun; Li-Na Yang; Han Xu; Bin Chang; Hua-Ying Wang; Gong Yang
Journal:  Am J Cancer Res       Date:  2015-02-15       Impact factor: 6.166

2.  Functional sites of human PCNA which interact with p21 (Cip1/Waf1), DNA polymerase delta and replication factor C.

Authors:  T Oku; S Ikeda; H Sasaki; K Fukuda; H Morioka; E Ohtsuka; H Yoshikawa; T Tsurimoto
Journal:  Genes Cells       Date:  1998-06       Impact factor: 1.891

3.  Identification of a novel telomerase repressor that interacts with the human papillomavirus type-16 E6/E6-AP complex.

Authors:  Lindy Gewin; Hadley Myers; Tohru Kiyono; Denise A Galloway
Journal:  Genes Dev       Date:  2004-09-15       Impact factor: 11.361

4.  Tumor-specific exon creation of the HELLS/SMARCA6 gene in non-small cell lung cancer.

Authors:  Masaaki Yano; Mamoru Ouchida; Hisayuki Shigematsu; Noriyoshi Tanaka; Koichi Ichimura; Kazuyasu Kobayashi; Yasuhiko Inaki; Shinichi Toyooka; Kazunori Tsukuda; Nobuyoshi Shimizu; Kenji Shimizu
Journal:  Int J Cancer       Date:  2004-10-20       Impact factor: 7.396

5.  The value of TOP2A gene copy number variation as a biomarker in breast cancer: Update of DBCG trial 89D.

Authors:  Kirsten Vang Nielsen; Bent Ejlertsen; Susanne Møller; Jan Trøst Jørgensen; Ann Knoop; Helle Knudsen; Henning T Mouridsen
Journal:  Acta Oncol       Date:  2008       Impact factor: 4.089

6.  Cdc45-MCM-GINS, a new power player for DNA replication.

Authors:  Tomás Aparicio; Arkaitz Ibarra; Juan Méndez
Journal:  Cell Div       Date:  2006-08-24       Impact factor: 5.130

7.  Integrative Genomics and Transcriptomics Analysis Reveals Potential Mechanisms for Favorable Prognosis of Patients with HPV-Positive Head and Neck Carcinomas.

Authors:  Wensheng Zhang; Andrea Edwards; Zhide Fang; Erik K Flemington; Kun Zhang
Journal:  Sci Rep       Date:  2016-04-25       Impact factor: 4.379

8.  Integrating TYMS, KRAS and BRAF testing in patients with metastatic colorectal cancer.

Authors:  Anastasios Ntavatzikos; Aris Spathis; Paul Patapis; Nikolaos Machairas; George Peros; Stefanos Konstantoudakis; Danai Leventakou; Ioannis G Panayiotides; Petros Karakitsos; Anna Koumarianou
Journal:  World J Gastroenterol       Date:  2017-08-28       Impact factor: 5.742

9.  The prognostic value of p53 expression for patients with cervical cancer: a meta analysis.

Authors:  Ruoyu Zhou; Chuanyu Wei; Jing Liu; Ying Luo; Wenru Tang
Journal:  Eur J Obstet Gynecol Reprod Biol       Date:  2015-10-23       Impact factor: 2.831

10.  Increased expression of RRM2 by human papillomavirus E7 oncoprotein promotes angiogenesis in cervical cancer.

Authors:  N Wang; T Zhan; T Ke; X Huang; D Ke; Q Wang; H Li
Journal:  Br J Cancer       Date:  2014-01-14       Impact factor: 7.640

View more
  8 in total

1.  KNTC1 knockdown suppresses cell proliferation of colon cancer.

Authors:  Zhang Zhengxiang; Tao Yunxiang; Liu Zhiping; Yang Zhimin
Journal:  3 Biotech       Date:  2021-05-11       Impact factor: 2.893

2.  Kinetochore-associated protein 1 promotes the invasion and tumorigenicity of cervical cancer cells via matrix metalloproteinase-2 and matrix metalloproteinase-9.

Authors:  Caimei Wang; Yiyuan Wang; Congrong Liu; Xiaoyu Meng; Zhongxia Hang
Journal:  Bioengineered       Date:  2022-04       Impact factor: 6.832

Review 3.  Emerging cellular and molecular mechanisms underlying anticancer indications of chrysin.

Authors:  Marjan Talebi; Mohsen Talebi; Tahereh Farkhondeh; Jesus Simal-Gandara; Dalia M Kopustinskiene; Jurga Bernatoniene; Saeed Samarghandian
Journal:  Cancer Cell Int       Date:  2021-04-15       Impact factor: 5.722

4.  lncRNA IGF2-AS Regulates Nucleotide Metabolism by Mediating HMGA1 to Promote Pyroptosis of Endothelial Progenitor Cells in Sepsis Patients.

Authors:  Guibin Liang; Menghao Zeng; Min Gao; Wei Xing; Xin Jin; Qianlu Wang; Longtian Deng; Hao Ou; Zhihui He
Journal:  Oxid Med Cell Longev       Date:  2022-01-17       Impact factor: 6.543

5.  Bioinformatics Screening of Potential Biomarkers from mRNA Expression Profiles to Discover Drug Targets and Agents for Cervical Cancer.

Authors:  Md Selim Reza; Md Harun-Or-Roshid; Md Ariful Islam; Md Alim Hossen; Md Tofazzal Hossain; Shengzhong Feng; Wenhui Xi; Md Nurul Haque Mollah; Yanjie Wei
Journal:  Int J Mol Sci       Date:  2022-04-02       Impact factor: 5.923

6.  Metadata analysis to explore hub of the hub-genes highlighting their functions, pathways and regulators for cervical cancer diagnosis and therapies.

Authors:  Md Selim Reza; Md Alim Hossen; Md Harun-Or-Roshid; Mst Ayesha Siddika; Md Hadiul Kabir; Md Nurul Haque Mollah
Journal:  Discov Oncol       Date:  2022-08-22

7.  Bioinformatics analysis of differentially expressed genes and pathways in the development of cervical cancer.

Authors:  Baojie Wu; Shuyi Xi
Journal:  BMC Cancer       Date:  2021-06-26       Impact factor: 4.430

8.  Histone Deacetylation Regulated by KDM1A to Suppress DACT1 in Proliferation and Migration of Cervical Cancer.

Authors:  Lingjuan Zeng; Chunyan Chen; Chanjiao Yao
Journal:  Anal Cell Pathol (Amst)       Date:  2021-07-24       Impact factor: 2.916

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.