| Literature DB >> 32194683 |
Xiaoqun Liu1, Xiangdong Liu2, Tiankui Qiao1, Wei Chen1.
Abstract
Colorectal cancer (CRC) is a prevalent malignant tumour type arising from the colon and rectum. The present study aimed to explore the molecular mechanisms of the development and progression of CRC. Initially, differentially expressed genes (DEGs) between CRC tissues and corresponding non-cancerous tissues were obtained by analysing the GSE15781 microarray dataset. The Database for Annotation, Visualization and Integrated Discovery was then utilized for functional and pathway enrichment analysis of the DEGs. Subsequently, a protein-protein interaction (PPI) network was created using the Search Tool for the Retrieval of Interacting Genes and Proteins database and visualized by Cytoscape software. Furthermore, CytoNCA, a Cytoscape plugin, was used for centrality analysis of the PPI network to identify crucial genes. Finally, UALCAN was employed to validate the expression of the crucial genes and to estimate their effect on the survival of patients with colon cancer by Kaplan-Meier curves and log-rank tests. A total of 1,085 DEGs, including 496 upregulated and 589 downregulated genes, were screened out. The DEGs identified were enriched in various pathways, including 'metabolic pathway', 'cell cycle', 'DNA replication', 'nitrogen metabolism', 'p53 signalling' and 'fatty acid degradation'. PPI network analysis suggested that interleukin-6, MYC, NOTCH1, inhibin subunit βA (INHBA), CDK1, cyclin (CCN)B1 and CCNA2 were crucial genes, and their expression levels were markedly upregulated. Survival analysis suggested that upregulated INHBA significantly decreased the survival probability of patients with CRC. Conversely, upregulation of CCNB1 and CCNA2 expression levels were associated with increased survival probabalities. The identified DEGs, particularly the crucial genes, may enhance the current understanding of the genesis and progression of CRC, and certain genes, including INHBA, CCNB1 and CCNA2, may be candidate diagnostic and prognostic markers, as well as targets for the treatment of CRC. Copyright: © Liu et al.Entities:
Keywords: bioinformatics analysis; colorectal cancer; differentially expressed genes
Year: 2020 PMID: 32194683 PMCID: PMC7039150 DOI: 10.3892/ol.2020.11278
Source DB: PubMed Journal: Oncol Lett ISSN: 1792-1074 Impact factor: 2.967
Figure 1.Identification of DEGs. (A) Pearson's correlation (signal) map. The correlation coefficient was close to 1.0, indicating high repeatability or similar distribution. The key on the right indicates the correlation coefficient. (B) Relative signal boxplot map. The red line is the baseline; a more similar distribution implies higher repeatability of the data. (C) Volcano plot comparing all DEGs. The red squares represent DEGs that were significant at |log2 (FC)|≥1. The red squares on the upper left and upper right represent down- and upregulated genes in colorectal cancer, respectively. T, tumour tissues; N, normal tissues; DEG, differentially expressed gene; FC, fold change; RMA, Robust Multichip Averaging.
Top 10 upregulated and downregulated genes in colorectal cancer tumor vs. normal tissues.
| Gene symbol | Log2 FC | Adjusted P-value | Expression |
|---|---|---|---|
| MMP7 | 5.39 | 7.39×10−4 | Up |
| DEFA6 | 5.33 | 1.90×10−3 | Up |
| FABP6 | 5.25 | 1.10×10−3 | Up |
| KRT23 | 5.24 | 9.90×10−3 | Up |
| CXCL8 | 5.06 | 7.60×10−3 | Up |
| INHBA | 5.05 | 3.20×10−3 | Up |
| DEFA5 | 4.97 | 2.40×10−2 | Up |
| TCN1 | 4.96 | 8.60×10−3 | Up |
| REG3A | 4.92 | 3.50×10−2 | Up |
| MMP3 | 4.04 | 3.50×10−2 | Up |
| AQP8 | −6.90 | 1.80×10−3 | Down |
| CA1 | −5.92 | 1.20×10−3 | Down |
| INSL5 | −5.90 | 1.80×10−3 | Down |
| GUCA2A | −5.61 | 2.40×10−3 | Down |
| CARD14 | −5.59 | 1.40×10−3 | Down |
| SLC26A3 | −5.54 | 3.50×10−3 | Down |
| MS4A12 | −5.50 | 8.10×10−3 | Down |
| PYY | −5.40 | 2.30×10−3 | Down |
| CLCA4 | −5.39 | 1.40×10−2 | Down |
| CA4 | −5.33 | 9.30×10−3 | Down |
FC, fold change; MMP7, matrix metalloproteinase-7; DEFA6, defensin alpha 6; FABP6, fatty acid binding protein 6; KRT23, keratin 23; CXCL8, C-X-C motif chemokine ligand 8; INHBA, inhibin βA; DEFA5, defensing alpha 5; TCN1, transcobalamin 1; REG3A, regenerating islet-derived 3 alpha; MMP3, matrix metalloproteinase-3; AQP8, aquaporin 8; CA1, carbonic anhydrase 1; INSL5, insulin-like 5; GUCA2A, guanylate cyclase activator 2a; CARD14, caspase recruitment domain family member 14; SLC26A3, solute carrier family 26 member 3; MS4A12, membrane-spanning 4-domains subfamily A member 12; PYY, peptide YY; CLCA4, chloride channel accessory 4; CA4, carbonic anhydrase 4.
Figure 2.Hierarchical clustering analysis of the 1,085 differentially expressed genes. Red and green indicate upregulated and downregulated gene expression, respectively. T, tumor tissues; N, normal tissues.
GO enrichment analysis of the differentially expressed genes.
| Category | Term/gene function | Gene counts | Percentage | P-value |
|---|---|---|---|---|
| GOTERM_BP_DIRECT | GO:0007067~mitotic nuclear division | 37 | 2.2 | 5.10×10−7 |
| GO:0008284~positive regulation of cell proliferation | 53 | 3.2 | 7.25×10−6 | |
| GO:0006730~one-carbon metabolic process | 10 | 0.6 | 3.88×10−5 | |
| GO:0051301~cell division | 41 | 2.5 | 4.75×10−5 | |
| GO:0006955~immune response | 46 | 2.8 | 6.21×10−5 | |
| GOTERM_MF_DIRECT | GO:0005179~hormone activity | 18 | 1.1 | 2.70×10−5 |
| GO:0017080~sodium channel regulator activity | 10 | 0.6 | 6.21×10−5 | |
| GO:0051287~NAD binding | 10 | 0.6 | 2.63×10−4 | |
| GO:0042803~protein homodimerization activity | 66 | 4.1 | 4.18×10−4 | |
| GO:0005515~protein binding | 563 | 34.8 | 5.65×10−4 | |
| GOTERM_CC_DIRECT | GO:0005615~extracellular space | 153 | 9.5 | 1.90×10−16 |
| GO:0070062~extracellular exosome | 248 | 15.3 | 3.86×10−13 | |
| GO:0005576~extracellular region | 135 | 8.3 | 4.92×10−6 | |
| GO:0016324~apical plasma membrane | 37 | 2.3 | 1.08×10−5 | |
| GO:0005829~cytosol | 241 | 14.9 | 2.92×10−5 |
The top 5 GO terms in every category are presented. CC, cellular component; BP, biological processes; MF, molecular function; NAD, nicotinamide adenine dinucleotide; GO, Gene Ontology.
Figure 3.Top 10 most significantly enriched pathways for differentially expressed genes associated with colorectal cancer as determined by Kyoto Encyclopedia of Genes and Genomes pathway analysis.
Top 10 nodes with highest degrees of interaction in colorectal cancer.
| Gene | Node degree | Betweenness centrality | Closeness centrality | Stress centrality | Clustering coefficient |
|---|---|---|---|---|---|
| IL6 | 135 | 0.085 | 0.437 | 976,462 | 0.111 |
| MYC | 127 | 0.084 | 0.445 | 1,307,298 | 0.127 |
| CDK1 | 118 | 0.021 | 0.397 | 354,950 | 0.387 |
| CCNB1 | 111 | 0.015 | 0.397 | 301,564 | 0.427 |
| CCNA2 | 102 | 0.009 | 0.387 | 200,222 | 0.475 |
| TOP2A | 102 | 0.012 | 0.397 | 236,116 | 0.496 |
| AURKA | 101 | 0.016 | 0.393 | 246,324 | 0.476 |
| CXCL8 | 97 | 0.029 | 0.416 | 408,590 | 0.202 |
| BUB1 | 94 | 0.005 | 0.369 | 106,422 | 0.567 |
| MAD2L1 | 94 | 0.009 | 0.366 | 135,590 | 0.530 |
IL6, interleukin 6; CCNB1, cyclin B1; CCNA2, cyclin A2; TOP2A, DNA topoisomerase IIα; AURKA, aurora kinase A; CXCL8, C-X-C motif chemokine ligand 8; BUB1, BUB1 mitotic checkpoint serine/threonine kinase; MAD2L1, mitotic arrest deficient 2 like 1.
Top 3 genes ranked by the node centrality of the protein-protein interaction network.
| Degree centrality | Betweenness centrality | Closeness centrality | Eigenvector centrality | |||||
|---|---|---|---|---|---|---|---|---|
| Rank | Gene symbol | Expression in CRC | Gene symbol | Expression in CRC | Gene symbol | Expression in CRC | Gene symbol | Expression in CRC |
| 1 | IL6 | Upregulated | MYC | Upregulated | IL6 | Upregulated | CDK1 | Upregulated |
| 2 | MYC | Upregulated | IL6 | Upregulated | MYC | Upregulated | CCNB1 | Upregulated |
| 3 | NOTCH1 | Upregulated | INHBA | Upregulated | CDK1 | Upregulated | CCNA2 | Upregulated |
CRC, colorectal cancer; IL6, interleukin 6; INHBA, inhibin βA; CCNB1, cyclin B1; CCNA2, cyclin A2.
Figure 4.Crucial genes identified are upregulated in human CRC specimens. (A) Boxplots indicating the expression levels of the crucial genes in CRC tissues and normal tissues. (B) Immunohistochemical staining results (magnification, ×4) for the protein expression levels of the crucial genes in normal tissues and CRC tissues. IL6, interleukin 6; INHBA, inhibin βA; CCNB1, cyclin B1; CCNA2, cyclin A2.
Figure 5.Kaplan-Meier plots for the association between the expression levels of the crucial genes and the overall survival of patients with CRC. IL6, interleukin 6; INHBA, inhibin βA; CCNB1, cyclin B1; CCNA2, cyclin A2.