Literature DB >> 31612015

Four targeted genes for predicting the prognosis of colorectal cancer: A bioinformatics analysis case.

Qinglai Bian1, Jiaxu Chen1,2, Wenqi Qiu1, Chenxi Peng1, Meifang Song1, Xuebin Sun1, Yueyun Liu1, Fengmin Ding3, Jianbei Chen1, Liqing Zhang4.   

Abstract

The molecular mechanisms underlying the development and progression of colorectal cancer (CRC) have not been clarified. The purpose of the present study was to identify key genes that may serve as novel therapeutic targets or prognostic predictors in patients with CRC using bioinformatics analysis. Four gene expression datasets were downloaded from the Gene Expression Omnibus database, which revealed 19 upregulated and 34 downregulated differentially expressed genes (DEGs). The downregulated DEGs were significantly enriched in eight pathways according to Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis. A protein-protein interaction network was constructed with 52 DEGs and 458 edges. Ten key genes were identified according to the degree value, betweenness centrality and closeness centrality. Survival analysis revealed that low expression of four of the ten genes, carcinoembryonic antigen related cell adhesion molecule 7 (CEACAM7), solute carrier family 4 member 4 (SLC4A4), glucagon (GCG) and chloride channel accessory 1 (CLCA1) genes, were associated with unfavorable prognosis in CRC. Furthermore, gene set enrichment analysis revealed that two pathways were significantly enriched in the CEACAM7 low-expression group. Thus, CEACAM7, SLC4A4, GCG and CLCA1 may be prognostic markers or therapeutic targets of CRC. Low CEACAM7 expression may be associated with the activation of glycosaminoglycan biosynthesis-chondroitin sulfate and extracellular matrix receptor interaction pathways and may affect the prognosis of CRC. Copyright: © Bian et al.

Entities:  

Keywords:  bioinformatics; colorectal cancer; genes; prognosis

Year:  2019        PMID: 31612015      PMCID: PMC6781647          DOI: 10.3892/ol.2019.10866

Source DB:  PubMed          Journal:  Oncol Lett        ISSN: 1792-1074            Impact factor:   2.967


Introduction

Colorectal cancer (CRC), which includes colon cancer and rectum cancer, is the third most common type of cancer and was the second leading cause of cancer-associated mortality for men and women worldwide in 2018 (1). It was estimated that there were more than 1.8 million new cases and over 0.8 million mortalities worldwide due to CRC in 2018 (1). In the USA, the number of newly diagnosed CRC cases in 2019 was ~145,600, accounting for 8.3% of all new cancer cases (2). Furthermore, the number of CRC-associated deaths in 2019 was ~51,020 in the USA, accounting for 8.4% of all cancer-associated deaths (2). The survival and prognosis of patients with CRC are closely associated with the staging of the tumor. If the tumors are diagnosed early and removed, the disease may be curable. The 5-year survival rate of patients with localized CRC was ~90% in the USA between 2001 and 2007 (3). However, in CRC cases at regional and distant stages, the 5-year survival rates are only ~70 and ~11%, respectively (3). Unfortunately, in developed countries including the USA, only ~40% of patients with CRC are diagnosed at early stages (4). Therefore, the identification of diagnostic and prognostic molecular markers for early detection and the prediction of prognosis for patients with CRC is clinically important. The development of CRC involves interconnections between environmental and genetic factors. In recent decades, great progress has been made in understanding the molecular pathogenesis of CRC, which includes four main mechanisms of molecular pathogenesis: Adenoma-carcinoma sequence, inherited forms, mismatch repair deficiency, and high-level microsatellite instability (5). However, the precise molecular mechanisms underlying the development of CRC have not been fully elucidated. With the rapid development of bioinformatics and high-throughput platforms for detecting gene expression, screening key genes for CRC based on publicly available databases provides a strategy to clarify the molecular mechanisms of CRC. Large numbers of gene expression datasets for CRC are available in the Gene Expression Omnibus (GEO) database, and numerous studies have used these datasets for the identification of differentially expressed genes (DEGs) in CRC (6–9). Previous studies have revealed the prognostic value of certain DEGs in CRC (10–13). However, the results of these individual studies varied and the studies demonstrated differences in sample collection, platform types and analysis methods. Furthermore, large-scale studies on the prognostic value of the DEGs in CRC are lacking. In addition, the enrichment pathways, gene set enrichment analysis, Gene Ontology (GO) functions and the interaction network involved in the DEGs remain to be clarified. In order to overcome these shortcomings, the present study integrated and reanalyzed four online GEO datasets of CRC using bioinformatics analysis methods. DEGs between CRC samples and noncancerous samples were determined, and the interaction network among these DEGs was constructed. Enrichment analysis for these DEGs was conducted using the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways and GO functions. Through network analysis, gene expression confirmation and overall survival analysis, four key genes that were associated with the prognosis of CRC were identified. These four genes may provide valuable information for the identification of potential prognostic markers of CRC and to elucidate the molecular mechanism of CRC.

Materials and methods

Data source and identification of DEGs

In order to compensate for the limitation of small sample size and result offset in a single cohort study, four gene expression profiles (GSE113513, GSE87211, GSE35279 and GSE24551) were acquired from the GEO database (http://www.ncbi.nlm.nih.gov/geo/) between January 1 2010 and August 31 2018. These profiles included both CRC samples and noncancerous samples, and all datasets contained at least five samples in each group (Table I). GSE113513 (unpublished, 2018; http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE113513) was based on the platform GPL15207 [(PrimeView) Affymetrix Human Gene Expression Array], GSE87211 was based on the platform GPL13497 (Agilent-026652 Whole Human Genome Microarray 4×44K v2) (6), GSE35279 was based on the platform GPL6480 (Agilent-014850 Whole Human Genome Microarray 4×44K G4112F) (7), and GSE24551 was based on the platform GPL5175 [(HuEx-1_0-st) Affymetrix Human Exon 1.0 ST Array] [transcript (gene) version] (8).
Table I.

Sample numbers in the four GSE datasets.

Dataset IDCRCNoncancerous tissuesTotal
GSE113513141428
GSE87211203160363
GSE3527974579
GSE2455116013173

CRC, colorectal cancer.

The DEGs between CRC samples and noncancerous samples in the four GEO series were filtered using the online GEO2R tool (https://www.ncbi.nlm.nih.gov/geo/geo2r/) (14,15). Genes that satisfied the threshold [|log2fold change (FC)| ≥2.0; adjusted P<0.05] were classified as DEGs. Statistical analysis was performed for each dataset. The overlapping genes among the four profiles were determined and presented by the online tool Venny 2.1.0 (http://bioinfogp.cnb.csic.es/tools/venny/index.html) (16). The expression of the upregulated and downregulated DEGs in each dataset was visualized using a volcano plot generated by Sanger Box (version 0.0.9; http://sangerbox.com/).

GO and KEGG enrichment analysis

GO analysis is commonly performed for functional enrichment analysis, in which gene function is classified into biological process (BP), molecular function (MF) and cellular component (CC) terms (17,18). KEGG is frequently used for exploring the advanced functions and mechanisms involved in the biological system at the molecular level (19). In the present study, GO and KEGG analysis was completed with the Database for Annotation, Visualization and Integrated Discovery (DAVID) platform (version 6.8; http://david.ncifcrf.gov/) (20,21). Statistical significance was set as P<0.05.

Construction of a protein-protein interaction (PPI) network and identification of key genes

The GeneMANIA (http://genemania.org/) prediction server was designed to assess the PPI network (22). The network was analyzed and visualized by Cytoscape 3.6.1 (http://www.cytoscape.org/). In the network, a high degree value indicated a more essential role for that gene. The degree value of each gene was calculated by the network analyzer tool that was built in the Cytoscape software. The genes whose degree value, closeness centrality and betweenness centrality were greater than the median value were identified as key genes.

Confirmation of key genes and overall survival analysis

The expression of key genes in CRC samples and noncancerous samples were further examined using the Gene Expression Profiling Interactive Analysis (GEPIA) platform (http://gepia.cancer-pku.cn) (23). The expression profiles of key genes between tumor samples and adjacent normal samples of colon adenocarcinoma (COAD) and rectum adenocarcinoma (READ) were obtained from The Cancer Genome Atlas (TCGA) database by GEPIA (http://gepia.cancer-pku.cn) (23). Student's unpaired t-test was used to determine the statistical significance of the calculated differential expression. The fold change was defined as 2 and the P-value of significance was set at 0.01. Overall survival analysis was performed in GEPIA by log-rank test based on gene expression. The overall survival analysis plot also contained the Cox proportional hazard ratio and the 95% confidence interval. The patients with CRC were divided into low-expression and high-expression groups, based on the median value of mRNA expression of the ten key genes. The differences between the two groups were evaluated separately for each of the ten key genes. P<0.05 was considered to indicate a statistically significant result.

Gene set enrichment analysis (GSEA) of prognosis-associated key genes

The pre-processed level 3 RNA-seq data and corresponding clinical information of patients with CRC were collected from the TCGA-COAD and TCGA-READ datasets (https://cancergenome.nih.gov/, updated in September 2018). GSEA (http://www.broadinstitute.org/gsea/index.jsp) (24,25) for the prognosis-associated key genes was performed on the TCGA datasets. The c2.cp.kegg.v6.0.symbols.gmt dataset was obtained from the molecular signatures database (MSigDB) v6.0 on the GSEA website. The CRC samples obtained from the TCGA database were divided into high- and low-expression groups according to the median expression level of prognosis-associated genes. The samples were analyzed by default weighted enrichment statistics using the GSEA 3.0 software. In the present study, the gene sets satisfying nominal P<0.05 and false discovery rate (FDR)<0.25 were considered to be significantly enriched. The enrichment analysis was carried out by default weighted enrichment statistics, and the number of random permutations was set to 1,000 times.

Results

Identification of DEGs

A flow chart of the present study design is presented in Fig. 1. Four GEO datasets (GSE113513, GSE87211, GSE35279 and GSE24551) were downloaded. The numbers of CRC and noncancerous samples in each dataset are presented in Table I. GSE113513 consisted of 14 CRC samples and 14 noncancerous tissues samples; GSE87211 included 203 CRC samples and 160 noncancerous tissues samples; GSE35279 contained 74 CRC samples and 5 noncancerous tissues samples; and GSE24551 included 160 CRC samples and 13 noncancerous tissues samples. GSE113513 comprised 340 DEGs, 258 of which were upregulated genes and 82 were downregulated genes. GSE87211 comprised 971 DEGs, including 573 upregulated genes and 398 downregulated genes. GSE35279 included 1371 DEGs, with 222 upregulated genes and 1149 downregulated genes. GSE24551 comprised 213 DEGs, with 109 upregulated and 104 downregulated genes. The intersection of the DEGs is presented in Venn diagrams (Fig. 2). The DEGs in each dataset are presented in volcano plots (Fig. 3). In total, 53 common DEGs (19 upregulated and 34 downregulated) were identified among all four datasets.
Figure 1.

Flow chart of the present study. GEO, Gene Expression Omnibus; GO, Gene Ontology; PPI, protein-protein interaction; KEGG, Kyoto Encyclopedia of Genes and Genomes; DEGs, differentially expressed genes.

Figure 2.

Venn diagram of DEGs in datasets GSE113513, GSE87211, GSE35279 and GSE24551. (A) A total of 19 upregulated common DEGs were extracted from datasets GSE113513, GSE87211, GSE35279 and GSE24551 with a threshold of |log2FC|≥2.0 and adjusted P<0.05. (B) In total, 34 downregulated common DEGs were extracted from datasets GSE113513, GSE87211, GSE35279 and GSE24551 with a threshold of |log2FC|≥2.0 and adjusted P<0.05. FC, fold change; DEGs, differentially expressed genes.

Figure 3.

Volcano plot of DEGs in datasets GSE113513, GSE87211, GSE35279 and GSE24551. Red dots represent upregulated DEGs and blue dots represent downregulated DEGs with a threshold of |log2FC|≥2.0 and adjusted P<0.05 in datasets (A) GSE113513, (B) GSE87211, (C) GSE35279 and (D) GSE24551. DEGs, differentially expressed genes; FC, fold change.

GO and KEGG analyses of the DEGs were performed using DAVID and the results are summarized in Table II. BP analysis indicated that upregulated genes were enriched in proteolysis, response to tumor necrosis factor, and positive regulation of gene expression, whereas the downregulated genes were enriched in seven terms comprised ‘bicarbonate transport’, ‘one-carbon metabolic process’, ‘regulation of chloride transport’, ‘positive regulation of cellular pH reduction’, ‘chloride transmembrane transport’, ‘ethanol oxidation’ and ‘positive regulation of synaptic transmission’. Analysis of the CC function indicated that the main enriched functions of upregulated genes involved the proteinaceous extracellular matrix and extracellular space, whereas the downregulated genes involved the seven terms ‘apical plasma membrane’, ‘basolateral plasma membrane’, ‘extracellular exosome’, ‘anchored component of membrane’, ‘plasma membrane’, ‘zymogen granule membrane’ and ‘integral component of membrane’. Analysis of the MF, identified calcium ion binding as a significantly enriched term of upregulated genes, whereas nine terms were significantly enriched in downregulated genes, including ‘carbonate dehydratase activity’, ‘chloride channel activity’, ‘zinc-dependent alcohol dehydrogenase activity’, ‘arylesterase activity’, ‘alcohol dehydrogenase (NAD) activity’, ‘zinc ion binding’, ‘intracellular calcium activated chloride channel activity’, ‘retinol dehydrogenase activity’ and ‘carbohydrate binding’. Furthermore, KEGG analysis revealed that downregulated genes were significantly enriched in nitrogen metabolism, proximal tubule bicarbonate reclamation and six other pathways.
Table II.

Enriched GO terms and KEGG pathways of DEGs.

A, Upregulated expression

CategoryDescriptionP-value
BPGO:0006508-proteolysis1.10×10−2
BPGO:0034612-response to tumor necrosis factor2.36×10−2
BPGO:0010628-positive regulation of gene expression2.52×10−2
CCGO:0005578-proteinaceous extracellular matrix2.18×10−3
CCGO:0005615-extracellular space8.32×10−3
MFGO:0005509-calcium ion binding3.87×10−2

B, Downregulated expression

CategoryDescriptionP-value

BPGO:0015701-bicarbonate transport1.11×10−8
BPGO:0006730-one-carbon metabolic process1.91×10−7
BPGO:2001225-regulation of chloride transport3.45×10−3
BPGO:0032849-positive regulation of cellular pH reduction6.89×10−3
BPGO:1902476-chloride transmembrane transport1.12×10−2
BPGO:0006069-ethanol oxidation2.05×10−2
BPGO:0032230-positive regulation of synaptic transmission, GABAergic2.05×10−2
CCGO:0016324-apical plasma membrane1.61×10−3
CCGO:0016323-basolateral plasma membrane3.81×10−3
CCGO:0070062-extracellular exosome6.42×10−3
CCGO:0031225-anchored component of membrane1.67×10−2
CCGO:0005886-plasma membrane1.71×10−2
CCGO:0042589-zymogen granule membrane1.92×10−2
CCGO:0016021-integral component of membrane4.47×10−2
MFGO:0004089-carbonate dehydratase activity8.01×10−9
MFGO:0005254-chloride channel activity4.13×10−3
MFGO:0004024-alcohol dehydrogenase activity, zinc-dependent1.06×10−2
MFGO:0004064-arylesterase activity1.06×10−2
MFGO:0004022-alcohol dehydrogenase (NAD) activity1.24×10−2
MFGO:0008270-zinc ion binding1.54×10−2
MFGO:0005229-intracellular calcium activated chloride channel activity2.81×10−2
MFGO:0004745-retinol dehydrogenase activity3.15×10−2
MFGO:0030246-carbohydrate binding4.72×10−2
KEGGhsa00910-nitrogen metabolism1.49×10−7
KEGGhsa04964-proximal tubule bicarbonate reclamation4.18×10−5
KEGGhsa04972-pancreatic secretion2.67×10−3
KEGGhsa00830-retinol metabolism1.60×10−2
KEGGhsa00010-glycolysis/Gluconeogenesis1.74×10−2
KEGGhsa00982-drug metabolism-cytochrome P4501.79×10−2
KEGGhsa00980-metabolism of xenobiotics by cytochrome P4502.10×10−2
KEGGhsa05204-chemical carcinogenesis2.43×10−2

GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; DEGs, differentially expressed genes; BP, biological process; CC, cellular component; MF, molecular function; GABA, gamma aminobutyric acid.

PPI network and key genes

The PPI network of the 53 DEGs was generated with the GeneMANIA platform (22) and visualized by Cytoscape (26) (Fig. 4). Following the removal of the single epoxide hydrolase 4 gene that had no connections in the network, the network contained 52 DEGs and 458 edges. The connections between the DEGs included physical interactions, co-expression and co-localization. Among the DEGs, ten genes were selected as key genes associated with CRC. The carbonic anhydrase 2 (CA2) gene was the most connected gene (degree, 44), followed by the guanylate cyclase activator 2A gene (GUCA2A; degree, 35), and carcinoembryonic antigen-related cell adhesion molecule 7 gene (CEACAM7; degree, 34), among others (Table III). Only one gene, matrix metalloprotease 7 (MMP7), was upregulated in the CRC samples; the other nine genes were downregulated.
Figure 4.

Protein-protein interaction network among the DEGs. Red nodes indicate upregulated DEGs while blue nodes indicate downregulated DEGs. The grey lines represent the connections between the DEGs. DEGs, differentially expressed genes.

Table III.

Degree value of ten key genes.

Gene symbolDescriptionDegreeBetweenness centralityCloseness centrality
CA2Carbonic anhydrase 2447.61×10−26.80×10−1
GUCA2AGuanylate cyclase activator 2A358.12×10−26.80×10−1
CEACAM7Carcinoembryonic antigen-related cell adhesion molecule 7344.07×10−26.30×10−1
MMP7Matrix metalloproteinase-7334.37×10−26.07×10−1
SLC4A4Solute carrier family 4 member 4262.90×10−25.86×10−1
CA12Carbonic anhydrase 12253.27×10−26.14×10−1
GCGGlucagon251.79×10−26.00×10−1
MS4A12Membrane-spanning 4-domains subfamily A member 12241.04×10−25.93×10−1
CA1Carbonic anhydrase 1249.63×10−35.93×10−1
CLCA1Calcium-activated chloride channel regulator 1248.05×10−35.73×10−1

CA2, carbonic anhydrase 2; GUCA2A, guanylate cyclase activator 2A; CEACAM7, carcinoembryonic antigen-related cell adhesion molecule 7; MMP7, matrix metalloproteinase-7; SLC4A4, solute carrier family 4 member 4; CA, carbonic anhydrase 12; GCG, glucagon; MS4A12, membrane-spanning 4-domains subfamily A member 12; CLCA1, calcium-activated chloride channel regulator 1.

Confirmation and survival analysis of ten key genes

GEPIA (23) was used to compare the expression levels of the ten key genes between CRC tumor samples and adjacent normal samples in COAD and READ obtained from the TCGA database. A total of 275 tumor samples and 41 adjacent normal samples in COAD and 92 tumor samples and ten adjacent normal samples in READ were available for analyses. Among the ten genes, only MMP7 was significantly upregulated in tumor samples compared with control samples, whereas the other nine genes were significantly downregulated in tumor samples (Fig. 5). The differential expression of the ten key genes in the present study was confirmed in GEPIA.
Figure 5.

Boxplots of the expression of ten key genes in COAD and READ. Gene expression in COAD and READ of (A) CA2, (B) GUCA2A, (C) CEACAM7, (D) MMP7, (E) SLC4A4, (F) CA12, (G) GCG, (H) MS4A12, (I) CA1 and (J) CLCA1. *P<0.01 vs. the noncancerous group. COAD, colon adenocarcinoma; READ, rectum adenocarcinoma; T, tumor; N, normal; CA2, carbonic anhydrase 2; GUCA2A, guanylate cyclase activator 2A; CEACAM7, carcinoembryonic antigen-related cell adhesion molecule 7; MMP7, matrix metalloproteinase-7; SLC4A4, solute carrier family 4 member 4; CA2, carbonic anhydrase 2; GCG, glucagon; MS4A12, membrane-spanning 4-domains subfamily A member 12; CLCA1, calcium-activated chloride channel regulator 1; TPM, transcripts per million.

The overall survival analysis of the ten genes was also obtained from GEPIA. Among these key genes, low expression of CEACAM7, solute carrier family 4 member 4 (SLC4A4), glucagon (GCG) and chloride channel accessory 1 (CLCA1) genes were significantly associated with an unfavorable outcome of CRC (Fig. 6).
Figure 6.

Survival curves of overall survival analysis of ten key genes. Survival curves of overall survival time of (A) CA2, (B) GUCA2A, (C) CEACAM7, (D) MM7, (E) SLC4A4, (F) CA12, (G) GCG, (H) MS4A12, (I) CA1 and (J) CLCA1. CA2, carbonic anhydrase 2; GUCA2A, guanylate cyclase activator 2A; CEACAM7, carcinoembryonic antigen-related cell adhesion molecule 7; MMP7, matrix metalloproteinase-7; SLC4A4, solute carrier family 4 member 4; GCG, glucagon; MS4A12, membrane-spanning 4-domains subfamily A member 12; CA2, carbonic anhydrase 2; CLCA1, calcium-activated chloride channel regulator 1; TPM, transcripts per million.

GSEA of the four prognosis-associated key genes

The mechanism of the four prognosis-associated genes (CEACAM7, SLC4A4, GCG and CLCA1) was further investigated by examining the associated pathways via GSEA. The expression matrix of CRC from the TCGA database was divided into high-expression (323 samples) and low-expression (324 samples) groups, according to the median expression level of CEACAM7, SLC4A4, GCG and CLCA1. In the low CEACAM7 expression group, two significantly enriched KEGG pathways at nominal P<0.05 and FDR<0.25 were identified, including glycosaminoglycan biosynthesis-chondroitin sulfate (nominal P=0.002; FDR, 0.101) and extracellular matrix-receptor interaction (nominal P=0.022; FDR, 0.151) (Fig. 7). However, no pathway was significantly enriched in the low SLC4A4, GCG and CLCA1 expression groups.
Figure 7.

Gene set enrichment analysis of CEACAM7. Two significantly enriched pathways were identified in the low CEACAM7 expression group of patients with CRC at nominal P<0.05 and FDR<0.25, including (A) glycosaminoglycan biosynthesis-chondroitin sulfate (nominal P=0.002; FDR, 0.101) and (B) ECM receptor interaction (nominal P=0.022; FDR, 0.151). CEACAM7, carcinoembryonic antigen-related cell adhesion molecule 7; CRC, colorectal cancer; NES, normalized enrichment score; FDR, false discovery rate; ECM, extracellular matrix.

Discussion

In the present study, 53 DEGs were identified in CRC samples compared with noncancerous tissues samples, including 19 upregulated genes and 34 downregulated genes. The downregulated DEGs were significantly enriched in eight KEGG pathways, including nitrogen metabolism, proximal tubule bicarbonate reclamation and six other pathways. The upregulated DEGs were associated with six GO terms: Proteolysis, response to tumor necrosis factor, positive regulation of gene expression, proteinaceous extracellular matrix, extracellular space and calcium ion binding. The downregulated DEGs were associated with 23 GO terms such as bicarbonate transport, one-carbon metabolic process and regulation of chloride transport, among others. A PPI network was constructed, consisting of 52 nodes and 458 edges, to evaluate the interactions among these DEGs. The ten key genes identified were CA2, GUCA2A, CEACAM7, MMP7, SLC4A4, CA12, GCG, membrane spanning 4-domains A12, CA1 and CLCA1. Only the MMP7 gene was upregulated in patients with CRC, whereas the other nine genes were downregulated. Confirmation and survival analyses of these genes were performed using GEPIA. Survival analysis revealed that low expressions of CEACAM7, SLC4A4, GCG and CLCA1 genes were significantly associated with unfavorable prognosis in patients with CRC. Furthermore, GSEA results showed that the glycosaminoglycan biosynthesis-chondroitin sulfate and extracellular matrix-receptor interaction pathways were significantly enriched in the low CEACAM7 expression group of patients with CRC. Genetic factors serve a critical role in the pathogenesis of CRC (27). CEACAM7, also termed CGM2, is a member of the carcinoembryonic antigen family and is expressed on highly differentiated colorectal epithelial cells and within ducts of pancreas epithelial cells (28). CEACAM7 sequences were detected only in human cDNA libraries of pancreas, pancreatic islets, colonic tumors and colon (29). The very narrow expression spectrum of CEACAM7 in pancreatic and colonic epithelial cells indicated a highly specialized function. Thompson et al (30) reported that CEACAM7 was downregulated in colorectal carcinoma. Messick et al (31) showed that CEACAM7 was significantly decreased in rectal cancer and considered a predictor for the recurrence of rectal cancer. Schölzel et al (28) identified that CEACAM7 was downregulated in hyperplastic polyps as well as early adenomas, which indicated early detected subtleties at the molecular level that lead to CRC. The present study demonstrated that the glycosaminoglycan biosynthesis-chondroitin sulfate and extracellular matrix receptor interaction pathways were significantly enriched in the group with low expression of CEACAM7. The extracellular matrix components closely interact with cell surface receptors, growth factors and cytokines, supporting a substantial role for the extracellular matrix in the morphogenesis of tissues and organs and in maintaining the structure and function of cells and tissues. In addition, the functional macromolecules of extracellular matrix are involved in regulating the properties and function of cells. Notably, surface molecules of matrix such as synaptophysin itself can also act as cell receptors or co-receptors. Consequently, the components of the extracellular matrix were closely associated with the cellular and molecular mechanisms of malignant cells (32). Recent studies have reported various roles of matrix molecules in tissue development, homeostasis and pathological processes (33–35). Glycosaminoglycans, a type of matrix molecule, affect the growth and progression of tumors by interacting with growth factors, cytokines and growth factor receptors (36). Furthermore, chondroitin sulfate is also a critical molecule in cancer progression (37). However, the association between CEACAM7 and glycosaminoglycan biosynthesis-chondroitin sulfate and extracellular matrix receptor interaction pathways has not yet been investigated. Further reports are required to clarify this potential connection. SLC4A4 may regulate bicarbonate influx and efflux in the basolateral membrane of cells and regulate intracellular pH (38–42). Certain studies have shown that SLC4A4 is significantly downregulated in CRC (43–45). However, the mechanism of SLC4A4 in affecting the prognosis of CRC is not well studied, and future studies should examine the potential function of SLC4A4 in CRC. GCG serves a key role in glucose metabolism and homeostasis. Much attention has been drawn to its low expression in CRC tissues (44,46–48). GCG is cleaved into glucagon-like peptide (GLP)-1, GLP-2 and other small peptides in intestinal endocrine cells and brain neurons (49). Moreover, GLP-1 and its analogs have become an effective therapeutic strategy for numerous patients with type 2 diabetes (50). Notably, CRC is more common in diabetic patients than in the non-diabetic population (51–54). Furthermore, Zanders et al (55) demonstrated that diabetes affects the presentation, treatment and outcome of CRC. Patients with both CRC and diabetes are likely to have a lower survival rate compared with patients with CRC without diabetes. In a study by Koehler et al (27), GLP-1 receptor activation decreased proliferation and survival of CT26 colon cancer cells that expressed the endogenous classical GLP-1 receptor. Hence, more studies on the associations between GLP-1 and CRC are required, particularly for patients with both diabetes and CRC. GLP-2, a nutrient-responsive neuropeptide and intestinal hormone, functions in promoting cell proliferation and survival (56,57). Previous studies have demonstrated the therapeutic potential of GLP-2 in surgical resections and ulcerative colitis (58,59). However, the function of GLP2 was shown to be controversial. A histopathological analysis in one study showed a significant increase in tumor load of mice treated with Gly2-GLP-2, which indicated that GLP2 promoted the development of CRC (60). These findings appear to be inconsistent with the present study, which revealed shorter overall survival time associated with low expression of GCG in patients with CRC. Therefore, further investigation is required to elucidate whether GLP2 promotes intestinal healing or accelerates the development of CRC. CLCA1 is the first reported member of the CLCA family and is mainly expressed in the colon, small intestine and appendix (61). Yang et al (62) revealed that CLCA1 is expressed in differentiated, growth-arrested mammalian epithelial cells but is downregulated during tumor progression. CLCA1 has been identified as a regulator of the transition from proliferation to differentiation in Caco-2 cells. Further investigations demonstrated that low expression levels of CLCA1 predicted lower survival in patients with CRC (63). Li et al (64) demonstrated that increased expression levels of CLCA1 could suppress the aggressiveness of CRC via inhibiting the epithelial-mesenchymal transition process and the Wnt/β-catenin signaling pathway. Hence, CLCA1 is associated with CRC prognosis and may be a tumor suppressor in CRC. Overall, the present study identified four prognosis-associated key genes, CEACAM7, SLC4A4, GCG and CLCA1, in CRC using bioinformatics analysis. All four genes were downregulated in patients with CRC. Differential expressions of these genes were also observed in CRC tumor samples. Low expression of these genes appeared to be associated with adverse clinical outcome in patients with CRC. These four genes may be potential prognosis markers or therapeutic targets of CRC. Low expression of CEACAM7 may affect the prognosis of patients with CRC via activating glycosaminoglycan biosynthesis-chondroitin sulfate and extracellular matrix receptor interaction pathways. It is speculated that CLCA1 is a potential prognosis predictor and therapeutic target of CRC. Further study is required to verify and investigate the molecular mechanisms of these genes in CRC in vitro and in vivo.
  61 in total

1.  Proximal renal tubular acidosis and ocular pathology: a novel missense mutation in the gene (SLC4A4) for sodium bicarbonate cotransporter protein (NBCe1).

Authors:  F Yesim K Demirci; Min-Hwang Chang; Tammy S Mah; Michael F Romero; Michael B Gorin
Journal:  Mol Vis       Date:  2006-04-10       Impact factor: 2.367

Review 2.  Chondroitin sulfate as a key molecule in the development of atherosclerosis and cancer progression.

Authors:  A D Theocharis; I Tsolakis; G N Tzanakakis; Nikos K Karamanos
Journal:  Adv Pharmacol       Date:  2006

3.  Transcriptional gene expression profiles of colorectal adenoma, adenocarcinoma, and normal tissue examined by oligonucleotide arrays.

Authors:  D A Notterman; U Alon; A J Sierk; A J Levine
Journal:  Cancer Res       Date:  2001-04-01       Impact factor: 12.701

4.  Glucagon-like peptide-2-stimulated protein synthesis through the PI 3-kinase-dependent Akt-mTOR signaling pathway.

Authors:  Xuemei Shi; Xiaojie Li; Yi Wang; Keying Zhang; Fuguo Zhou; Lawrence Chan; Defa Li; Xinfu Guan
Journal:  Am J Physiol Endocrinol Metab       Date:  2010-12-21       Impact factor: 4.310

5.  Cloning and characterization of a human electrogenic Na+-HCO-3 cotransporter isoform (hhNBC).

Authors:  I Choi; M F Romero; N Khandoudi; A Bril; W F Boron
Journal:  Am J Physiol       Date:  1999-03

Review 6.  Life in the crypt: a role for glucagon-like peptide-2?

Authors:  Katherine J Rowland; Patricia L Brubaker
Journal:  Mol Cell Endocrinol       Date:  2008-02-29       Impact factor: 4.102

Review 7.  Review article: a comparison of glucagon-like peptides 1 and 2.

Authors:  P Janssen; A Rotondo; F Mulé; J Tack
Journal:  Aliment Pharmacol Ther       Date:  2012-11-05       Impact factor: 8.171

8.  Modeling oncogenic signaling in colon tumors by multidirectional analyses of microarray data directed for maximization of analytical reliability.

Authors:  Magdalena Skrzypczak; Krzysztof Goryca; Tymon Rubel; Agnieszka Paziewska; Michal Mikula; Dorota Jarosz; Jacek Pachlewski; Janusz Oledzki; Jerzy Ostrowski; Jerzy Ostrowsk
Journal:  PLoS One       Date:  2010-10-01       Impact factor: 3.240

9.  GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

Authors:  Zefang Tang; Chenwei Li; Boxi Kang; Ge Gao; Cheng Li; Zemin Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

10.  The transition from proliferation to differentiation in colorectal cancer is regulated by the calcium activated chloride channel A1.

Authors:  Bo Yang; Lin Cao; Bin Liu; Colin D McCaig; Jin Pu
Journal:  PLoS One       Date:  2013-04-12       Impact factor: 3.240

View more
  14 in total

1.  Analyzing large scale gene expression data in colorectal cancer reveals important clues; CLCA1 and SELENBP1 downregulated in CRC not in normal and not in adenoma.

Authors:  Fariborz Asghari Alashti; Bahram Goliaei; Zarrin Minuchehr
Journal:  Am J Cancer Res       Date:  2022-01-15       Impact factor: 6.166

2.  Fusobacterium nucleatum promotes colon cancer progression by changing the mucosal microbiota and colon transcriptome in a mouse model.

Authors:  Na Wu; Yu-Qing Feng; Na Lyu; Di Wang; Wei-Dong Yu; Yong-Fei Hu
Journal:  World J Gastroenterol       Date:  2022-05-14       Impact factor: 5.374

3.  Prognostic and clinicopathological insights of phosphodiesterase 9A gene as novel biomarker in human colorectal cancer.

Authors:  Tasmina Ferdous Susmi; Atikur Rahman; Md Moshiur Rahman Khan; Farzana Yasmin; Md Shariful Islam; Omaima Nasif; Sulaiman Ali Alharbi; Gaber El-Saber Batiha; Mohammad Uzzal Hossain
Journal:  BMC Cancer       Date:  2021-05-20       Impact factor: 4.430

4.  Colon Cancer Progression Is Reflected to Monotonic Differentiation in Gene Expression and Pathway Deregulation Facilitating Stage-specific Drug Repurposing.

Authors:  Marilena M Bourdakou; George M Spyrou; George Kolios
Journal:  Cancer Genomics Proteomics       Date:  2021 Nov-Dec       Impact factor: 4.069

5.  SLC1A1, SLC16A9, and CNTN3 Are Potential Biomarkers for the Occurrence of Colorectal Cancer.

Authors:  Jie Zhou; Zhiman Xie; Ping Cui; Qisi Su; Yu Zhang; Lijia Luo; Zhuoxin Li; Li Ye; Hao Liang; Jiegang Huang
Journal:  Biomed Res Int       Date:  2020-05-23       Impact factor: 3.411

Review 6.  The Neuropeptide System and Colorectal Cancer Liver Metastases: Mechanisms and Management.

Authors:  Aldona Kasprzak; Agnieszka Adamek
Journal:  Int J Mol Sci       Date:  2020-05-15       Impact factor: 5.923

7.  Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data.

Authors:  Mohanad Mohammed; Innocent B Mboya; Henry Mwambi; Murtada K Elbashir; Bernard Omolo
Journal:  PLoS One       Date:  2021-12-29       Impact factor: 3.240

8.  The Regulatory Role of Neuropeptide Gene Glucagon in Colorectal Cancer: A Comprehensive Bioinformatic Analysis.

Authors:  Wenfeng Du; Yue Miao; Guoqing Zhang; Guangcai Luo; Peng Yang; Fei Chen; Benhua Zhang; Chenggang Yang; Gang Li; Jin Chang
Journal:  Dis Markers       Date:  2022-03-18       Impact factor: 3.434

9.  Identification of DEGs and transcription factors involved in H. pylori-associated inflammation and their relevance with gastric cancer.

Authors:  Honghao Yin; Aining Chu; Songyi Liu; Yuan Yuan; Yuehua Gong
Journal:  PeerJ       Date:  2020-06-03       Impact factor: 2.984

10.  Development and validation of prognostic model based on the analysis of autophagy-related genes in colon cancer.

Authors:  Yongfeng Wang; Kaili Lin; Tianchun Xu; Liuli Wang; Liangyin Fu; Guangming Zhang; Jing Ai; Yajun Jiao; Rongrong Zhu; Xiaoyong Han; Hui Cai
Journal:  Aging (Albany NY)       Date:  2021-07-27       Impact factor: 5.682

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.