| Literature DB >> 35300114 |
Ming Li1, Ziming Liu2, Jia Song1, Tian Wang2, Hongjie Wang1,3, Yanan Wang4, Jiguang Guo1.
Abstract
Colorectal cancer (CRC) is the second most deadly cancer in the whole world, with the underlying mechanisms largely indistinct. Therefore, we aimed to identify significant pathways and genes involved in the initiation, formation and poor prognosis of CRC using bioinformatics methods. In this study, we compared gene expression profiles of CRC cases with those from normal colorectal tissues from three chip datasets (GSE33113, GSE23878 and GSE41328) to identify 105 differentially expressed genes (DEGs) that were common to the three datasets. Gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses showed that the highest proportion of up-regulated DEGs was involved in extracellular region and cytokine-cytokine receptor interaction pathways. Integral components of membrane and bile secretion pathways were identified as containing down-regulated DEGs. 13 hub DEGs were chosen and their expression were further validated by GEPIA. Only four DEGs (ADH1C, CLCA4, CXCL8 and GUCA2A) were associated with a significantly lower overall survival after the prognosis analysis. Lower ADH1C protein level and higher CXCL8 protein level were verified by immunohistochemical staining and western blot in clinical CRC and normal colorectal tissues. In conclusion, our study indicated that the extracellular tumor microenvironment and bile metabolism pathways play critical roles in the formation and progression of CRC. Furthermore, we confirmed ADH1C being down-regulated in CRC and reported ADH1C as a prognostic predictor for the first time.Entities:
Keywords: ADH1C; bioinformatics analysis; colorectal cancer; differentially expressed gene; gene expression Omnibus database
Year: 2022 PMID: 35300114 PMCID: PMC8921497 DOI: 10.3389/fmolb.2022.791249
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
The 105 common DEGs were identified from the three gene expression profile datasets.
| DEGs | Gene names |
|---|---|
| Upregulated | ADAM12, NFE2L3, CEMIP, GDF15, CTHRC1, TRIB3, TACSTD2, FOXQ1, MMP3, TESC, CXCL5, ASCL2, BGN, INHBA, AJUBA, MMP1, WISP1, CXCL8, CRNDE, CLDN1, MMP11, SLC O 4A1 |
| Downregulated | LGALS2, NR3C2, SPIB, HSD17B2, ABCG2, HMGCS2, ZG16, GUCA2B, UGT2B17, CHP2, SCARA5, CLCA4, DHRS11, AKR1B10, TUBAL3, ARL14, CA4, TRPM6, NXPE4, IGH, PTGDR, PYY, UGT2B15, SCIN, SLC26A3, B3GALT5, TSPAN7, HHLA2, CA2, DPP10-AS1, FCGBP, CHGA, SLC26A2, PKIB, ANPEP, CEACAM7, PADI2, C10orf99, ADTRP, NR1H4, KLF4, ISX, ABCA8, MUC2, BEST2, SLC51B, ADH1B, EDN3, AQP8, GCG, LYPD8, CD177, GBA3, MS4A12, PCK1, VSIG2, ADH1C, TMEM72, HEPACAM2, UGT2A3, GCNT2, LRRC19, SST, SCNN1B, NXPE1, C2orf88, HPGD, LAMA1, CWH43, BEST4, CA1, STMN2, LOC100506558///MATN2, MUC4, SLC4A4, MOGAT2, CA12, SI, SLC51A, GUCA2A,UGT1A3///UGT1A1///UGT1A4///UGT1A9///UGT1A5///UGT1A6///UG, T1A7///UGT1A8///, GT1A10, DHRS9, CA7 |
FIGURE 1Identification of DEGs in the three datasets using the Draw Venn diagram software. Three datasets (GSE33113, GSE23878, and GSE41328) were used and indicated by different colors. The overlapping area indicated common DEGs. (A) Twenty-two DEGs were up-regulated in all three datasets (logFC> 0). (B) Eighty-three DEGs were down-regulated in three datasets (logFC <0).
Gene ontology analysis results of up-regulated DEGs in CRC.
| Category | Term | Count | P-Value | FDR |
|---|---|---|---|---|
| GOTERM_BP_DIRECT | GO:0030574∼collagen catabolic process | 3 | 0.002599 | 3.222424 |
| GOTERM_BP_DIRECT | GO:0007267∼cell-cell signaling | 4 | 0.003223 | 3.981161 |
| GOTERM_BP_DIRECT | GO:0022617∼extracellular matrix disassembly | 3 | 0.003643 | 4.489588 |
| GOTERM_BP_DIRECT | GO:0006469∼negative regulation of protein kinase activity | 3 | 0.006101 | 7.412011 |
| GOTERM_BP_DIRECT | GO:0032461∼positive regulation of protein oligomerization | 2 | 0.016552 | 18.94565 |
| GOTERM_CC_DIRECT | GO:0005578∼proteinaceous extracellular matrix | 6 | 8.57E-06 | 0.007978 |
| GOTERM_CC_DIRECT | GO:0005576∼extracellular region | 10 | 2.19E-05 | 0.020359 |
| GOTERM_CC_DIRECT | GO:0005615∼extracellular space | 7 | 0.002534 | 2.333393 |
| GOTERM_CC_DIRECT | GO:0031012∼extracellular matrix | 3 | 0.041212 | 32.40586 |
| GOTERM_MF_DIRECT | GO:0004222∼metalloendopeptidase activity | 4 | 3.06E-04 | 0.30983 |
| GOTERM_MF_DIRECT | GO:0004860∼protein kinase inhibitor activity | 3 | 0.001707 | 1.714626 |
| GOTERM_MF_DIRECT | GO:0004252∼serine-type endopeptidase activity | 3 | 0.036114 | 31.09485 |
| GOTERM_MF_DIRECT | GO:0005160∼transforming growth factor-beta receptor binding | 2 | 0.048628 | 39.63498 |
Gene ontology analysis results of down-regulated DEGs in CRC.
| Category | Term | Count | p-Value | FDR |
|---|---|---|---|---|
| GOTERM_BP_DIRECT | GO:0015701∼bicarbonate transport | 8 | 4.43E-10 | 6.17E-07 |
| GOTERM_BP_DIRECT | GO:0006730∼one-carbon metabolic process | 5 | 6.23E-06 | 0.008676 |
| GOTERM_BP_DIRECT | GO:0051453∼regulation of intracellular pH | 4 | 4.12E-04 | 0.572874 |
| GOTERM_BP_DIRECT | GO:1902476∼chloride transmembrane transport | 5 | 5.48E-04 | 0.760588 |
| GOTERM_BP_DIRECT | GO:0052695∼cellular glucuronidation | 3 | 0.001869 | 2.574031 |
| GOTERM_CC_DIRECT | GO:0005887∼integral component of plasma membrane | 14 | 0.006041 | 6.366143 |
| GOTERM_CC_DIRECT | GO:0016324∼apical plasma membrane | 6 | 0.007776 | 8.124723 |
| GOTERM_CC_DIRECT | GO:0031225∼anchored component of membrane | 4 | 0.012178 | 12.4538 |
| GOTERM_CC_DIRECT | GO:0016021∼integral component of membrane | 32 | 0.015945 | 16.01086 |
| GOTERM_MF_DIRECT | GO:0004089∼carbonate dehydratase activity | 5 | 2.34E-07 | 2.78E-04 |
| GOTERM_MF_DIRECT | GO:0005254∼chloride channel activity | 5 | 6.54E-05 | 0.077743 |
| GOTERM_MF_DIRECT | GO:0005179∼hormone activity | 5 | 5.59E-04 | 0.662534 |
| GOTERM_MF_DIRECT | GO:0005215∼transporter activity | 6 | 0.001321 | 1.559094 |
| GOTERM_MF_DIRECT | GO:0015020∼glucuronosyltransferase activity | 3 | 0.006051 | 6.960171 |
The KEGG pathways involved with the common DEGs in CRC.
| Pathway id | Name | Count | P-Value | Genes |
|---|---|---|---|---|
| Up-regulated | ||||
| hsa05323 | Rheumatoid arthritis | 4 | 6.82E-05 | CXCL5, CXCL8, MMP3, MMP1 |
| hsa04060 | Cytokine-cytokine receptor interaction | 3 | 0.023207 | INHBA, CXCL5, CXCL8 |
| hsa05219 | Bladder cancer | 2 | 0.041 | CXCL8, MMP1 |
| Down-regulated | ||||
| hsa00910 | Nitrogen metabolism | 5 | 2.97E-06 | CA12, CA7, CA4, CA2, CA1 |
| hsa04976 | Bile secretion | 7 | 3.72E-06 | AQP8, SLC51B, CA2, SLC51A, SLC4A4, NR1H4, ABCG2 |
| hsa00830 | Retinol metabolism | 6 | 4.36E-05 | UGT2B17, ADH1C, DHRS9, ADH1B, UGT2A3, UGT2B15 |
| hsa04964 | Proximal tubule bicarbonate reclamation | 4 | 3.69E-04 | CA4, CA2, SLC4A4, PCK1 |
| hsa00982 | Drug metabolism - cytochrome P450 | 5 | 8.06E-04 | UGT2B17, ADH1C, ADH1B, UGT2A3, UGT2B15 |
FIGURE 2PPI network of 105 common DEGs built by STRING and Cytoscape module analysis. (A) 73 out of the 105 DEGs were contained in the PPI network complex. The 73 nodes and 122 edges represent the interaction of proteins. Red circles indicate the up-regulated DEGs and green circles indicate the down-regulated DEGs; (B) The rankings of the top 15 hub genes identified by the Cytoscape plugin cytoHubba MCC. 9 different colors from black to white was chosen to show the rank of the hub genes.
The hub genes with top15 ranking of the PPI network.
| Hub genes | Protein name | Rank |
|---|---|---|
| GUCA2B | Guanylate cyclase activator 2B | 1 |
| CLCA4 | Chloride channel accessory 4 | 1 |
| ZG16 | Zymogen granule protein 16 | 3 |
| MS4A12 | Membrane spanning 4-domains A12 | 3 |
| GUCA2A | Guanylate cyclase activator 2A | 5 |
| SLC26A3 | Solute carrier family 26 member 3 | 5 |
| CXCL8 | C-X-C motif chemokine ligand 8 | 7 |
| GCG | Glucagon | 8 |
| UGT2B15 | UDP glucuronosyltransferase family 2 member B15 | 8 |
| PYY | Peptide YY | 10 |
| CXCL5 | C-X-C motif chemokine ligand 5 | 11 |
| ADH1C | Alcohol dehydrogenase 1C (class I), gamma polypeptide | 12 |
| ADH1B | Alcohol dehydrogenase 1B (class I), beta polypeptide | 12 |
| SST | Somatostatin | 12 |
| MMP1 | Matrix metallopeptidase 1 | 15 |
Validation of 13 hub genes using GEPIA.
| Category | Genes |
|---|---|
| Genes with differential expression between the CRC and normal tissues ( | ADH1B, ADH1C, CLCA4, CXCL5, CXCL8, GCG, GUCA2A, GUCA2B, MMP1, MS4A12, PYY, SST, ZG16 |
| Genes without differential expression between the CRC and normal tissues ( | SLC26A3, UGT2B15 |
FIGURE 3The expression levels of the hub genes were validated using the GEPIA website. Thirteen of the 15 hub genes showed significant differential expression in CRC samples, when compared with the normal samples (*P < 0.05). Red represents tumor tissues and grey represents normal tissues.
The prognostic analysis results of the 13 hub validated genes.
| Category | Genes |
|---|---|
| Genes with significantly worse survival ( | ADH1C, CLCA4, CXCL8, GUCA2A |
| Genes without significantly worse survival ( | ADH1B, GCG, GUCA2B, MMP1, MS4A12, PYY, SST, ZG16, CXCL5 |
FIGURE 4The prognostic analysis and the protein level of the hub genes. (A) UALCAN and OncoLnc online resources were applied to obtain the prognostic results of the 13 hub genes. Only four of the 13 hub genes had significantly lower overall survival rates (P < 0.05). (B,C) Immunohistochemistry (IHC) analysis of CRC tissue and adjacent normal tissue. (B) showed the representative DAB IHC staining with ADH1C, CXCL8 antibody, and cell nuclei stained by hematoxylin of cancer and normal tissue pathology slides. (C) showed the quantitative analysis results by ImageJ and IHC Toolbox plugin. The magnification of the field is ×100, and the scale bar is 75 μm. (D) Real-time PCR results confirmed the dysregulation of the mRNA expression of ADH1C and CXCL8. (E) The protein level of ADH1C and CXCL8 were detected by Western blot. T, tumor tissue; N, normal tissue. n = 8. **P < 0.01 or *P < 0.05 by paired t-test.