Literature DB >> 35117850

Identification of biomarkers in colon cancer based on bioinformatic analysis.

Ying Zhu1, Leitao Sun2, Jieru Yu3, Yuying Xiang1, Minhe Shen2, Harpreet S Wasan4, Shanming Ruan2, Shengliang Qiu2.   

Abstract

BACKGROUND: Colon cancer is one of the most common cancers in the world. Targeting biomarkers is helpful for the diagnosis and treatment of colon cancer. This study aimed to identify biomarkers in colon cancer, in addition to those that have already been reported, using microarray datasets and bioinformatics analysis.
METHODS: We downloaded two mRNA microarray datasets (GSE44076 and GSE47074) for colon cancer from the Gene Expression Omnibus (GEO) database and the most recent colon cancer data (COAD) from The Cancer Genome Atlas (TCGA) database. The differentially expressed genes (DEGs) between colon cancer and adjacent normal tissues were determined based on these three datasets. Additionally, we performed Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses, and protein-protein interaction (PPI) network analysis. The hub genes in the PPI network were then selected and analysed.
RESULTS: We identified 150 DEGs and the GO enrichment analysis revealed that these DEGs were enriched in functions related to accelerating the cell cycle, promoting tumour cell accumulation, promoting cell division, positively regulating cell division, and negatively regulating apoptosis. The KEGG pathway analysis indicated that the DEGs were also involved in the cell cycle pathway. In the PPI network, 34 hub genes were found to be enriched in cell division. Prognostic analysis of the 34 hub genes revealed that eight genes (CCNB1, CHEK1, DEPDC1, ECT2, GINS2, HMMR, KIF14, and KIF18A) were associated with the prognosis of colon cancer. And our qRT-PCR results confirmed that DEPDC1, ECT2, GINS2, HMMR and KIF18A were highly expressed in colon cancer cells.
CONCLUSIONS: The genes DEPDC1, ECT2, GINS2, HMMR, and KIF18A could serve as novel diagnostic biomarkers of colon cancer. 2020 Translational Cancer Research. All rights reserved.

Entities:  

Keywords:  Colon cancer; biomarkers; computational biology

Year:  2020        PMID: 35117850      PMCID: PMC8797703          DOI: 10.21037/tcr-20-845

Source DB:  PubMed          Journal:  Transl Cancer Res        ISSN: 2218-676X            Impact factor:   1.241


Introduction

Globally, colon cancer is one of the most common cancers affecting the digestive tract and is the fourth most common cause of cancer-related deaths. Estimates in 2020 revealed 104,610 new colon cancer cases and 53,200 colorectal cancer-related deaths in the United States (1). Although some patients can initially respond to anticancer therapies, such as chemotherapy and targeted therapy, patients with advanced colon cancer eventually succumb to this disease (2). Thus, identifying the molecular mechanisms underlying colon cancer will facilitate the development of novel treatment strategies. The diagnosis and treatment of colon cancer are aided by certain biomarkers, ranging from the early microsatellite instability (MSI) status to the recently discovered Ras family, B-Raf proto-oncogene, and serine/threonine kinase (BRAF) (3). Recent studies have markedly advanced the characterization of genetic changes in the malignant transformation process. Caudal-type homeobox 2 (CDX2) was identified as a prognostic biomarker of stage II and stage III colon cancer (4). Aldehyde dehydrogenase 1B1 (ALDH1B1) is a potential biomarker of human colon cancer (5). The epidermal growth factor receptor (EGFR) signal transduction pathway is one of the major pathways involved in the pathogenesis of colon cancer and is now widely targeted for the effective treatment of colon cancer. The initiation and progression of colon cancer are associated with the continuous activation of downstream Ras-RAF-MAPK and PI3K-AKT-mTOR pathways, which are mediated by EGFR (6). Thus, there is a need to identify novel biomarkers of colon cancer. The objective of this study was to identify novel biomarkers of colon cancer through bioinformatics analysis of microarray data. We downloaded two mRNA microarray datasets for colon cancer from the Gene Expression Omnibus (GEO) database. Additionally, we obtained the most recent colon cancer dataset (COAD) in The Cancer Genome Atlas (TCGA) database. The differentially expressed genes (DEGs) between colon cancer and adjacent normal tissues were identified in these three datasets. Further, the DEGs were examined using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses and protein-protein interaction (PPI) network analysis. Finally, we investigated the key molecular events and functional pathways in colon cancer, which may provide the theoretical basis for the development of improved treatment strategies for colon cancer.

Methods

Data screening from the GEO database

The GEO database (http://www.ncbi.nlm.nih.gov/geo) (7) in the National Center for Biotechnology Information (NCBI) platform was utilized for screening the gene expression data, chip-based data, and microarray data. The inclusion criteria for the target data set are as follows: (I) the selected dataset included samples of both colon cancer tissue and adjacent normal tissue; (II) the dataset contains gene expression information; (III) study conducted over the last 10 years. Exclusion criteria: (I) study data from cell or animal experiments rather than from colon cancer patient tissues; (II) studies on colorectal cancer (one study includes both colon cancer and rectal cancer); (III) studies of rectal cancer or caecum cancer. According to the above standards, we qualified the search strategy as “colon AND normal” in the GEO Datasets sub-database, selected “entry type” as “series” and limited “Publication dates” to after January 1, 2010. Then the retrieval results were screened one by one. Finally, we downloaded two gene expression datasets (GSE44076 and GSE47074) from the GEO database that satisfied the screening criteria. The probes were converted to the corresponding gene symbols based on the annotation information available on the platform. The GSE44076 dataset comprised 98 colon cancer and 98 adjacent normal tissues, while the GSE47074 dataset comprised 4 colon cancer and 4 adjacent normal tissues.

Data screening from the TCGA database

The mRNA array gene expression data from the colon cancer dataset (COAD) were downloaded from the TCGA database (https://tcga-data.nci.nih.gov/tcga/) using the R package Bioconductor/RTCGAToolbox (8). The TCGA COAD database (2016-01-28) contains 153 colon samples and 19 normal samples for analysis. The mRNA expression profile data in GEO and TCGA databases are publicly available for downloading. So that this study does not require the ethics approval.

Identification of DEGs

The GSE44076 and GSE47074 datasets were used to identify the DEGs between colon cancer and adjacent normal tissues based on the interactive web tool GEO2R (http://www.ncbi.nlm.nih.gov/geo/http://www.ncbi.nih.gov/geo/geo2r/). The differences were considered statistically significant when |log fold change (FC)| was greater than 1 and P value was less than 0.01. In addition, the genes that were found to be dysregulated in the TCGA COAD datasets were loaded using the get DiffExpressedGene function package in R. The cut-off values were adjusted to obtain P value <0.05 and log FC >2. The analysis identified 889 upregulated genes and 550 downregulated genes in the colon cancer tissues. Finally, the common DEGs among the three datasets were selected for further analysis.

KEGG and GO enrichment analyses of DEGs

The KEGG (9) database contains information on genomes, biological pathways, diseases, and chemical compounds. GO (10) enrichment analysis is mainly used for annotating the genes and gene products. GO terms are grouped into three categories: biological processes (BP), cellular composition (CC), and molecular function (MF). We used the Database for Annotation, Visualization, and Integrated Discovery (DAVID; http://david.ncifcrf.gov) (version 6.8) (11) to analyze the KEGG pathways and GO terms associated with the identified DEGs. The difference was considered statistically significant when the adjusted P value was less than 0.05. The results were visualized using the ggplot2 package in R to generate the bubble map of the KEGG pathway analysis and the histogram of the GO enrichment analysis.

PPI network construction and visualization

The functional interactions between DEGs were evaluated using the protein interaction database (STRING) (12). The interactions with a combined score of greater than 0.4 were considered statistically significant. Cytoscape (version 3.7.0) was used to visualize the PPI network (13) based on the STRING results, and MCODE was used to identify the densely connected regions (14). The following parameters were used in the analysis: MCODE scores >5, degree cut-off =2, node score cut-off =0.2, max depth =100, and k-score =2. Additionally, KEGG and GO enrichment analyses were performed on the selected meaningful models. The difference was considered statistically significant when the P value was less than 0.05.

Selection and analysis of hub genes

The genes with degree ≥10 were defined as hub genes (15). We used cBioportal (http://www.cbioportal.org) (16,17) to analyze the genes that were co-expressed with the hub genes. Next, hierarchical clustering of the hub genes was performed using the UCSC Cancer Genomics Browser (http://genome-cancer.ucsc.edu) (18). Using the starBase (http://starbase.sysu.edu.cn/) (19) Pan-Cancer Analysis platform, the prognostic values of the hub genes were analyzed to identify the genes whose survival curves were statistically significant and those that exhibit potential as biomarkers of colon cancer. Finally, the Oncomine database (http://www.oncomine.com) was used to verify the correlation between expression patterns and tumor grades, tumor histological type and other characteristics.

Validation by quantitative real-time polymerase chain reaction real-time fluorescence quantitative PCR (qRT-PCR)

We evaluated the expression levels of hub genes using 3 groups of the colon cancer cells (HCT-8, SW620, and SW480) and a normal colon cell (HT-29). The colon cancer cells and normal colon cell were obtained from Cell Bank of the Typical Culture Preservation Committee of the Chinese Academy of Sciences (Shanghai, China). Total RNA was extracted from the samples using the Trizol reagent (Invitrogen, Carlsbad, CA, USA), following the manufacturer’s instructions. The reverse transcription of mRNA was performed using the FastQuant RT Kit (TaKaRa, Beijing China), following the manufacturer’s instructions. qRT-PCR was performed using the ABI 7500 Real-time PCR Detection System. Relative gene expression was analyzed using 2−ΔCt and 2−ΔCtΔCt method.

Results

Differential expression analysis identified 2,677 and 517 DEGs in the GSE44076 and GSE44074 datasets respectively, as well as 1,439 DEGs in TCGA datasets. We obtained 150 genes that were common among the three datasets, which are presented as a Venn diagram ().
Figure 1

Venn diagram for the differentially expressed genes (DEGs) in GSE44076, GSE47074 and TCGA COAD datasets. DEGs were selected based on the cut-off values [Log FC (fold change) >1 or <−1 and P value <0.01] across the different mRNA expression profiles of GSE44076 and GSE47074 database. And from the TCGA COAD dataset, DEGs were identified based on the cut-off values (adjusted P value <0.05 and Log FC >2). In total, 150 genes were common between the three datasets.

Venn diagram for the differentially expressed genes (DEGs) in GSE44076, GSE47074 and TCGA COAD datasets. DEGs were selected based on the cut-off values [Log FC (fold change) >1 or <−1 and P value <0.01] across the different mRNA expression profiles of GSE44076 and GSE47074 database. And from the TCGA COAD dataset, DEGs were identified based on the cut-off values (adjusted P value <0.05 and Log FC >2). In total, 150 genes were common between the three datasets.

GO term and KEGG pathway enrichment

The top ten terms with P<0.05 in the BP and CC categories were selected for visualization. All terms with P<0.05 in the MF category are shown in . The BP terms were predominantly related to cell division, DNA damage repair, and positive regulation of RNA transcription. The MF terms were mainly related to ATP binding. The CC terms were predominantly associated with the cytoplasm (). KEGG pathway analysis revealed that the DEGs were mainly related to the cell cycle, microRNAs in cancer, and other signaling pathways, such as the p53 signaling pathway ().
Figure 2

Gene Ontology (GO) enrichment analysis of the differentially expressed genes common to all three datasets using the DAVID database, and the results were visualized using the ggplot2 package in R. BP, biological processes (top 10 terms with P<0.05); CC, cellular composition (top ten terms with P<0.05); MF, molecular function (all terms with P<0.05).

Figure 3

KEGG pathway analysis of the differentially expressed genes common to all three datasets using KEGG database, and the results were visualized using the ggplot2 package in R. The DEGs were mainly associated with cell cycle, microRNAs, and other signalling pathways.

Gene Ontology (GO) enrichment analysis of the differentially expressed genes common to all three datasets using the DAVID database, and the results were visualized using the ggplot2 package in R. BP, biological processes (top 10 terms with P<0.05); CC, cellular composition (top ten terms with P<0.05); MF, molecular function (all terms with P<0.05). KEGG pathway analysis of the differentially expressed genes common to all three datasets using KEGG database, and the results were visualized using the ggplot2 package in R. The DEGs were mainly associated with cell cycle, microRNAs, and other signalling pathways.

PPI network

In total, 148 proteins were identified from the STRING database based on the DEGs, which involved 807 interaction pairs. The average node degree was 10.9. Next, the PPI network was constructed using Cytoscape (). The most important module in the network, which comprised 34 nodes, was extracted using MCODE (). Functional analysis of the genes involved in this module revealed that these genes were predominantly enriched in cell division, ATP binding, and cell cycle-associated functions and pathways ().
Figure 4

(A) Protein-protein interaction (PPI) network constructed based on the differentially expressed genes (DEGs) common to all three datasets using Cytoscape; (B) The most important module extracted from the PPI network using MCODE comprises 34 genes with 537 edges; (C) Hub genes and their co-expression network were constructed using cBioPortal. Nodes with bold black outlines represent hub genes. Nodes with thin black outlines represent the co-expressed genes.

Table 1

Functional enrichment analysis of genes in the most important module of PPI network

TermDescriptionCountP value
BP
   GO:0051301Cell division111.86E-13
   GO:0032467Positive regulation of cytokinesis41.88E-05
   GO:0007080Mitotic metaphase plate congression43.22E-05
   GO:0031536Positive regulation of exit from mitosis33.94E-05
   GO:0000281Mitotic cytokinesis30.0011
   GO:0000086G2/M transition of mitotic cell cycle30.0013
   GO:0006302Double-strand break repair30.0036
   GO:0007018Microtubule-based movement30.0055
   GO:0046602Regulation of mitotic centrosome separation20.0081
CC
   GO:0005634Nucleus152.39E-04
   GO:0030496Midbody52.83E-05
   GO:0000776Kinetochore42.30E-04
   GO:0051233Spindle midzone35.65E-04
   GO:0005876Spindle microtubule30.0023
   GO:0005871Kinesin complex30.0049
   GO:0097149Central spindlin complex20.0067
MF
   GO:0005524ATP binding125.90E-06
   GO:0019901Protein kinase binding51.48E-05
   GO:0004842Ubiquitin-protein transferase activity40.0053
KEGG
   04110Cell cycle82.91E-09
   04914Progesterone-mediated oocyte maturation47.68E-04
   04114Oocyte meiosis40.0016
   04115p53 signaling pathway30.0096

BP, biological processes; CC, cellular composition; MF, molecular function; KEGG, pathways in Kyoto Encyclopedia of Genes and Genomes (KEGG) database.

(A) Protein-protein interaction (PPI) network constructed based on the differentially expressed genes (DEGs) common to all three datasets using Cytoscape; (B) The most important module extracted from the PPI network using MCODE comprises 34 genes with 537 edges; (C) Hub genes and their co-expression network were constructed using cBioPortal. Nodes with bold black outlines represent hub genes. Nodes with thin black outlines represent the co-expressed genes. BP, biological processes; CC, cellular composition; MF, molecular function; KEGG, pathways in Kyoto Encyclopedia of Genes and Genomes (KEGG) database.

Hub genes

In the most important module, the degree value of the 34 genes was ≥25, which indicated close interaction between the nodes. Thus, the 34 genes were considered as the hub genes for preliminary analysis. A co-expression network of the hub genes was constructed using the cBioPortal online platform (). The co-expression network revealed the connection between the hub genes and some relevant genes in colon cancer. The results of hierarchical clustering of the hub genes using UCSC are shown in . The hub genes were weakly expressed in the normal tissues and strongly expressed in the primary colon cancer tissues. However, we did not observe significant correlation between the expression levels of the hub genes and gender. Moreover, for the MSI entry, although most samples did not provide MSI detection data, it was observed that more samples with MSI appeared in samples with strong expression of hub genes.
Figure 5

Hierarchical clustering of the hub genes was analyzed using the UCSC database. The samples under the brown bar are normal tissues and the samples under the blue bar are colon cancer samples. Upregulation of genes is marked in red, while downregulation of genes is marked in blue. In the microsatellite instability entry, red for “YES”, blue for “NO”, and grey for lack of information.

Hierarchical clustering of the hub genes was analyzed using the UCSC database. The samples under the brown bar are normal tissues and the samples under the blue bar are colon cancer samples. Upregulation of genes is marked in red, while downregulation of genes is marked in blue. In the microsatellite instability entry, red for “YES”, blue for “NO”, and grey for lack of information. Next, the prognostic values of these hub genes were analyzed using the starBase Pan-Cancer Analysis Platform. As shown in , the survival curves of eight genes were statistically significant (P value <0.05). These eight genes were CCNB1, CHEK1, DEPDC1, ECT2, GINS2, HMMR, KIF14, and KIF18A. However, the generated predictions using starBase suggested that the weak expression of these eight genes was associated with poor prognosis.
Figure 6

The prognostic values of these hub genes were analyzed using the starBase Pan-Cancer Analysis Platform. Survival curves of genes CCNB1, CHEK1, DEPDC1, ECT2, GINS2, HMMR, KIF14, and KIF18A were significantly associated with prognosis in colon cancer. The green curve indicates enhanced gene expression, and the brown curve indicates low expression.

The prognostic values of these hub genes were analyzed using the starBase Pan-Cancer Analysis Platform. Survival curves of genes CCNB1, CHEK1, DEPDC1, ECT2, GINS2, HMMR, KIF14, and KIF18A were significantly associated with prognosis in colon cancer. The green curve indicates enhanced gene expression, and the brown curve indicates low expression. Finally, validation analysis using the Oncomine database revealed that whether in a single study or in the meta-analysis results of multiple studies, the expression levels of the eight genes were upregulated in the cancer samples when compared to those in the non-cancer samples (). In Oncomine database, data from Kaiser et al. reported that the expression of GINS2 was strongly upregulated in the colon small cell carcinoma and colon signet ring cell adenocarcinoma when compared to that in other types of colon cancers, whereas the other seven genes were strongly upregulated in the colon small cell carcinoma (). Furthermore, the expression levels of these eight genes were higher in patients with grade 1 cancer than those in patients with grades 2-3 cancer ().
Figure 7

Validation of differentially expressed genes (DEGs) which have prognostic values in normal and colon cancer tissues using the Oncomine database. The results of the Skrzypczak’s experiment included in the database showed that CCNB1, DEPDC1, ECT2, GINS2, HMMR, KIF14, and KIF18A were up-regulated in colon cancer tissue relative to normal colon tissue, respectively. And in Zou’s experiment result, CHEK1 also has a higher expression in colon cancer tissue than normal colon tissue.

Figure 8

Meta-analysis of all researches in Oncomine database involving the expression of eight prognostic genes in colon cancer versus normal tissue. The rank of gene was defined as its median rank across all analyses. The number of red bars indicates the number of studies in which the target gene was upregulated in colon cancer, and the depth of the red colon indicates its expression level.

Figure 9

Eight gene expression across different cancer tissue types. The study from Kaiser et al. in Oncomine database reported that the tissue type of colon cancer where the gene is highly expressed was primarily colon adenocarcinoma, colon mucinous adenocarcinoma, colon signet ring cell adenocarcinoma, and colon small cell carcinoma.

Figure 10

Eight gene expression across different tumor pathological grades. The expression levels of these 8 genes in Grade 1 (well differentiated) patients were higher than those in Grade 2 (moderately differentiated) and Grade 3 (poorly differentiated) cancer patients.

Validation of differentially expressed genes (DEGs) which have prognostic values in normal and colon cancer tissues using the Oncomine database. The results of the Skrzypczak’s experiment included in the database showed that CCNB1, DEPDC1, ECT2, GINS2, HMMR, KIF14, and KIF18A were up-regulated in colon cancer tissue relative to normal colon tissue, respectively. And in Zou’s experiment result, CHEK1 also has a higher expression in colon cancer tissue than normal colon tissue. Meta-analysis of all researches in Oncomine database involving the expression of eight prognostic genes in colon cancer versus normal tissue. The rank of gene was defined as its median rank across all analyses. The number of red bars indicates the number of studies in which the target gene was upregulated in colon cancer, and the depth of the red colon indicates its expression level. Eight gene expression across different cancer tissue types. The study from Kaiser et al. in Oncomine database reported that the tissue type of colon cancer where the gene is highly expressed was primarily colon adenocarcinoma, colon mucinous adenocarcinoma, colon signet ring cell adenocarcinoma, and colon small cell carcinoma. Eight gene expression across different tumor pathological grades. The expression levels of these 8 genes in Grade 1 (well differentiated) patients were higher than those in Grade 2 (moderately differentiated) and Grade 3 (poorly differentiated) cancer patients. Finally, we selected five specific genes (DEPDC1, ECT2, GINS2, HMMR, KIF18A) to verify the analysis. qRT-PCR was used to detect the expression differences between colon cancer cells (HCT-8, SW620, SW480) and normal colon cells (HT-29) in each group. The results of qRT-PCR showed that the expression levels of DEPDC1, ECT2, GINS2, HMMR, and KIF18A were increased in colon cancer cells compared with normal colon tissues ( and ), which was consistent with the analysis results of GEO and TCGA data sets, as well as the Oncomine and UCSC database validation results.
Figure 11

Quantitative real-time polymerase chain reaction (qRT-PCR) verified the expression of five genes (DEPDC1, ECT2, GINS2, HMMR, and KIF18A) in the colon cancer cells and the normal colon cell. Three cell lines of the colon cancer (HCT-8, SW620, and SW480) and a normal colon cell line (HT-29) were used in the study. The expression levels of DEPDC1, ECT2, GINS2, HMMR, and KIF18A were upregulated in the colon cancer cells when compared to those in the normal colon cell.

Table 2

qRT-PCR verification result of five differentially expressed genes obtained from bioinformatics analysis

GeneGroup2–△CT (mean ± SD) △CT 2–△△CTP value
GINS2 HCT-80.000539±1.51E-0510.9185.220<0.0001
SW6200.000271±6.984E-0611.8492.737<0.0001
SW4800.000248±6.173E-0611.9772.505<0.0001
HT-290.000099±1.152E-0513.302
ECT2 HCT-80.000863±2.076E-0510.1782.185<0.0001
SW6200.000615±1.885E-0510.6671.5580.0019
SW4800.0006798±1.359E-0510.5221.722<0.0001
HT-290.000394±3.391E-0511.310
HMMR HCT-80.000294±6.119E-0611.7323.292<0.0001
SW6200.000164±4.807E-0612.5741.8370.0041
SW4800.000150±4.359E-0612.7121.6690.0005
HT-298.93E-05±1.658E-0513.451
DEPDC1 HCT-87.979E-05±1.924E-0613.6135.5820.0012
SW6205.116E-05±1.269E-0614.2533.5810.0004
SW4804.109E-05±1.079E-0614.5712.8750.0805
HT-291.43E-05±4.302E-0616.094
KIF18A HCT-89.902E-05±1.768E-0613.3023.8230.0004
SW6206.97E-05±1.539E-0613.8082.6920.0008
SW4807.628E-05±2.16E-0613.6782.9470.0024
HT-292.59E-05±6.362E-0615.237
Quantitative real-time polymerase chain reaction (qRT-PCR) verified the expression of five genes (DEPDC1, ECT2, GINS2, HMMR, and KIF18A) in the colon cancer cells and the normal colon cell. Three cell lines of the colon cancer (HCT-8, SW620, and SW480) and a normal colon cell line (HT-29) were used in the study. The expression levels of DEPDC1, ECT2, GINS2, HMMR, and KIF18A were upregulated in the colon cancer cells when compared to those in the normal colon cell.

Discussion

Recent studies have demonstrated the importance of targeting key molecules in a network for the diagnosis and treatment of colon cancer. Therefore, there is an urgent need to identify novel biomarkers of colon cancer. Bioinformatics analysis of gene expression profiles is widely employed to identify DEGs as biomarkers for the occurrence and progression of cancer. This has facilitated the development of effective diagnostic and therapeutic strategies. In this study, we analyzed two mRNA microarray datasets from the GEO database and one mRNA microarray dataset from the TCGA database. Our analysis identified 150 DEGs between colon cancer and adjacent normal tissues. GO enrichment analysis revealed that these DEGs were enriched in functions related to accelerating the cell cycle, promoting tumor cell accumulation, promoting cell division, positively regulating cell division, and negatively regulating apoptosis. Enhanced cell division and inhibition of apoptosis of cancer cells are generally recognized as important markers of colon cancer (20). KEGG pathway analysis revealed that the DEGs were also involved in the cell cycle pathway, which concurred with the results of GO enrichment analysis. Furthermore, we identified 34 hub genes in the PPI network, which were enriched in the cell cycle pathway. These findings indicated the critical role of the cell cycle in colon cancer progression. As shown in , the expression of hub genes is not affected by gender factors. However, the expression levels of the hub genes were correlated with the MSI status, which is one of the important mechanisms for the formation of colon cancer. MSI induces carcinogenesis by activating the oncogenes or by inactivating the tumour suppressor genes (21). Our results indicate that high expression of hub genes is associated with MSI status, suggesting that hub genes may play a synergistic role with MSI in the development of colon cancer. Studies have shown that colorectal cancer patients with MSI status benefit less from chemotherapy and are more likely to acquire 5-FU resistance (22). But for immunotherapy, colon cancer patients with MSI-H have a higher response to immune checkpoint inhibitors (23). In our study, more samples with MSI appeared in samples with strong expression of hub genes, suggesting that these hub genes may guide the treatment of colon cancer in adjuvant chemotherapy and immunotherapy. Prognostic analysis of the 34 hub genes revealed that the low expression levels of the eight genes (CCNB1, CHEK1, DEPDC1, ECT2, GINS2, HMMR, KIF14, and KIF18A) were significantly correlated with the poor prognosis of colon cancer (P value <0.05). The role of these eight genes in colon cancer was validated using relevant studies from the Oncomine database. The results of the Skrzypczak’s and Zou’s experiments included in the database showed that these genes were up-regulated in colon cancer tissue relative to normal colon tissue, respectively. In addition, a meta-analysis of all colon cancer related studies in Oncomine database was performed online to demonstrate the result again. For the tissue type of colon cancer, the eight genes were strongly expressed in small cell carcinoma of colon cancer. Small cell carcinoma of the gastrointestinal tract is rare (24) and shares many characteristics with squamous cell carcinoma of the lungs, including neuroendocrine features, regional lymph node involvement, high metastatic potential, and poor prognosis (25). However, because of the limited number of studies in the Oncomine database that involve these 8 genes and colon cancer tissue type, the sample size of various tissue types involved is relatively small only through the analysis of Kaiser’s study. So more large-sample studies are still needed to support this conclusion. Furthermore, we observed that in the study of tumor pathological grading, the expression levels of these 8 genes in Grade 1 (well differentiated) patients were higher than those in Grade 2 (moderately differentiated) and Grade 3 (poorly differentiated) cancer patients. This suggested that the upregulation of the eight hub genes was not associated with the malignancy of colon cancer. Hence, these eight genes might be potential biomarkers of early colon cancer. Among the eight genes, checkpoint kinase 1 (CHEK1) and DEP domain containing 1 (DEPDC1) exhibited the most statistically significant survival curves (P<0.01). CHEK1 is strongly expressed in many tumors, such as colon cancer (26), breast cancer (27), and liver cancer (28). CHEK1, a checkpoint kinase of the G2/M phase during cell division, helps repair DNA damage in cells and plays a crucial role in the regulation of tumorigenesis. In colon cancer, CHEK1 not only promotes the cell proliferation, but also attenuates the DNA damage of tumor cells induced by radiotherapy and chemotherapy (29). Hence, treatment with CHEK1 inhibitors blocks the colon cancer cell cycle at the G2 phase and enhances the sensitivity of tumor cells to various chemotherapy drugs by inhibiting the DNA damage repair (30). CHEK1 inhibitors are currently being tested as novel anti-tumor drugs for several types of tumors, including colon cancer (30). In recent years, DEPDC1 was reported as a novel cell cycle-related gene that regulates mitotic progression in several human cancers, such as bladder cancer (31), nasopharyngeal cancer (32), gastric cancer (33), glioblastoma (34), and liver cancer (35). Additionally, several studies suggest that DEPDC1 promotes cell proliferation and inhibits apoptosis of cancer cells by binding to the zinc finger protein 224 and activating the NF-κB signaling pathway (36). Therefore, the expression of DEPDC1 may be associated with the rapid proliferation of colon cancer cells. To validate our bioinformatics findings, we selected DEPDC1, ECT2, GINS2, HMMR, and KIF18A gene for qRT-PCR analysis. Our analysis revealed that the expression levels of all five genes in the colon cancer cells were upregulated when compared to those in the normal colon cell. The results obtained from qRT-PCR analysis were concurred with those obtained from the Oncomine and UCSC databases validation results. Epithelial cell transforming sequence 2 (ECT2) is highly evolutionarily conserved and is an oncogene known to be closely associated with cell proliferation and apoptosis. Additionally, ECT2 can induce malignant transformation of epithelial cells (37). ECT2 is a Rho guanine nucleotide exchange factor (RhoGEF) that can promote the dissociation of GDP and the replacement of GDP with GTP, which activates the Ras analogue Rho GTPase in the cell signal transduction pathway. Therefore, ECT2 can affect the cytoskeletal dynamics, cell morphogenesis, and regulation of cell movement in colon cancer. Additionally, ECT2 can affect malignant transformation, growth, invasion, and metastasis of tumor cells (38). Our protein co-expression data suggest that ECT2 acts on RHOBTB2, which was identified recently as a member of the Rho family, and inhibits tumor cell proliferation (39). ECT2 may promote the proliferation of colon cancer by inhibiting the expression of RHOBTB2. A recent study analyzed the RNA samples extracted from tumor cells in the peripheral blood circulation of 90 patients with colorectal cancer and 151 healthy donors. The analysis revealed that ECT2 had a high detection rate, even in patients with carcinoembryonic antigen level lower than 5 ng/mL. This indicated that ECT2 is a potential diagnostic marker of colorectal cancer and exhibits high sensitivity for the detection of circulating tumor cells in blood compared to that for the detection of carcinoembryonic antigen (40). GINS complex subunit 2 (GINS2) is a helicase that plays an important role in various BP, such as DNA replication, cross-chain repair, and maintenance of chromosome condensation (41). GINS2 promotes the growth of breast cancer cells by regulating the cell cycle progression to G2/M phase (42). In ovarian cancer, GINS2 knockdown resulted in the induction of cell cycle arrest at the S phase and not at the G2 phase (43). Based on our bioinformatics analysis, we hypothesized that the enhanced expression of GINS2 in colon cancer is also involved in cell cycle regulation. However, this hypothesis must be experimentally verified. Previous studies have demonstrated that GINS2 expression levels are downregulated after exposure to 5-fluorouracil (5-FU) in a colorectal adenocarcinoma cell line. This suggested that GINS2 can be a novel potential biomarker of 5-FU chemosensitivity/resistance for human colon cancer as they inhibit cell proliferation (44). Hyaluronan-mediated motility receptor (HMMR) is located on chromosome 5. The HMMR gene is downstream of IC-1 and is overexpressed in various cancers, such as breast cancer (45), lung adenocarcinoma (46), and epithelial ovarian cancer (47). HMMR affects the tumor microenvironment in the tumor budding cells and promotes single-cell invasion and metastatic dissemination of colon cancer. The enhanced expression levels of HMMR is correlated with the high tumor grade, lymphatic infiltration, and lymph node metastasis in colorectal cancer. This suggested that HMMR expression may drive the progression of cancer in the early phase. Therefore, targeting the HMMR receptor could be a novel therapeutic strategy for colorectal cancer (48). Kinesin family member 18A (KIF18A) is a member of the kinesin superfamily proteins, which belong to the microtubules depolymerase Kip3 family. During mitosis, KIF18A is directly involved in mid plate assembly and separation of sister chromatids (49). Dysregulated protein expression of KIF18A leads to abnormal separation of sister chromatids during mitosis resulting in aneuploidy, which promotes tumorigenesis. KIF18A is highly expressed in breast cancer and is associated with poor prognosis (50). In colon cancer, the enhanced expression of KIF18A induces proliferation and inhibits apoptosis of colon cancer cells. Additionally, KIF18A induces Akt phosphorylation, mediates PI3K-Akt signaling pathway, and promotes colon cancer progression (51).The co-expression network analysis using the cBioPortal database revealed that KIF18A was associated with the expression of several genes, including BIRC5, DSN1, CENPF, CENPA, CDCA5 and BUB1B. Additionally, the analysis revealed that CCNB1 can act on KIF18A.These genes were associated with proliferation and apoptosis of colon cancer.

Conclusions

In conclusion, DEPDC1, ECT2, GINS2, HMMR, and KIF18A were upregulated in colon cancer, and promoted the progression of colon cancer by regulating the cell cycle, which was consistent with our GO enrichment and KEGG pathway analyses. The results of our bioinformatics analysis and those of other studies indicate that DEPDC1, ECT2, GINS2, HMMR, and KIF18A may serve as biomarkers for the early diagnosis and progression of colon cancer. However, further studies are needed to elucidate the specific mechanism of action of these genes in colon cancer.
  50 in total

Review 1.  The KEGG database.

Authors:  Minoru Kanehisa
Journal:  Novartis Found Symp       Date:  2002

2.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

3.  Multiple therapeutic peptide vaccines for patients with advanced gastric cancer.

Authors:  Yoshiyuki Fujiwara; Kaoru Okada; Takeshi Omori; Keijiro Sugimura; Hiroshi Miyata; Masayuki Ohue; Shogo Kobayashi; Hidenori Takahashi; Hiroyuki Nakano; Chie Mochizuki; Katsuji Shimizu; Masahiko Yano; Yusuke Nakamura; Masaki Mori; Yuichiro Doki
Journal:  Int J Oncol       Date:  2017-04-06       Impact factor: 5.650

4.  Colorectal cancer statistics, 2020.

Authors:  Rebecca L Siegel; Kimberly D Miller; Ann Goding Sauer; Stacey A Fedewa; Lynn F Butterly; Joseph C Anderson; Andrea Cercek; Robert A Smith; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2020-03-05       Impact factor: 508.702

5.  GINS2 regulates matrix metallopeptidase 9 expression and cancer stem cell property in human triple negative Breast cancer.

Authors:  Liang Peng; Zhigang Song; Demeng Chen; Ruixia Linghu; Yingzhe Wang; Xingyang Zhang; Xiaoxue Kou; Junlan Yang; Shunchang Jiao
Journal:  Biomed Pharmacother       Date:  2016-11-06       Impact factor: 6.529

6.  Expression of the hyaluronan-mediated motility receptor RHAMM in tumor budding cells identifies aggressive colorectal cancers.

Authors:  Viktor Hendrik Koelzer; Bettina Huber; Valentina Mele; Giandomenica Iezzi; Mafalda Trippel; Eva Karamitopoulou; Inti Zlobec; Alessandro Lugli
Journal:  Hum Pathol       Date:  2015-07-29       Impact factor: 3.466

7.  STRING v10: protein-protein interaction networks, integrated over the tree of life.

Authors:  Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

8.  Early Assessment of Colorectal Cancer by Quantifying Circulating Tumor Cells in Peripheral Blood: ECT2 in Diagnosis of Colorectal Cancer.

Authors:  Chih-Jung Chen; Wen-Wei Sung; Hung-Chang Chen; Yi-Jye Chern; Hui-Ting Hsu; Yueh-Min Lin; Shu-Hui Lin; Konan Peck; Kun-Tu Yeh
Journal:  Int J Mol Sci       Date:  2017-03-31       Impact factor: 5.923

9.  DEP domain containing 1 suppresses apoptosis via inhibition of A20 expression, which activates the nuclear factor κB signaling pathway in HepG2 cells.

Authors:  Aili Li; Qingqing Wang; Gaofeng He; Junfei Jin; Guojin Huang
Journal:  Oncol Lett       Date:  2018-05-22       Impact factor: 2.967

10.  DEPDC1 is required for cell cycle progression and motility in nasopharyngeal carcinoma.

Authors:  Xuefei Feng; Chundong Zhang; Ling Zhu; Lian Zhang; Hongxia Li; Longxia He; Yan Mi; Yitao Wang; Jiang Zhu; Youquan Bu
Journal:  Oncotarget       Date:  2017-06-29
View more
  1 in total

1.  FOXO3a‑modulated DEPDC1 promotes malignant progression of nephroblastoma via the Wnt/β‑catenin signaling pathway.

Authors:  Geng Geng; Qinghao Li; Xingqing Guo; Qingbin Ni; Yongtao Xu; Zhaolong Ma; Yongjin Wang; Ming Ming
Journal:  Mol Med Rep       Date:  2022-07-07       Impact factor: 3.423

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.