| Literature DB >> 31127138 |
Li Li1, Zuan Zhu1, Yanchao Zhao1, Qi Zhang1, Xiaoting Wu1, Bei Miao1, Jiang Cao2, Sujuan Fei3.
Abstract
Gastric adenocarcinoma (GAC), also known as stomach adenocarcinoma (STAD), is one of the most lethal malignancies in the world. It is vital to classify and detect the hub genes and key pathways participated in the initiation and progression of GAC. In this study, we collected and sequenced 15 pairs of GAC tumor tissues and the adjacent normal tissues. Differentially expressed genes (DEGs) were analyzed and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway and Gene Ontology (GO) analysis were used to annotate the unique biological significance and important pathways of enriched DEGs. Moreover, we constructed the protein-protein interaction (PPI) network by Cytoscape and conducted KEGG enrichment analysis of the prime module. We further applied the TCGA database to start the survival analysis of these hub genes by Kaplan-Meier estimates. Finally, we obtained total 233 DEGs consisted of 64 up-regulated genes and 169 down-regulated genes. GO enrichment analysis found that DEGs most significantly enriched in single organism process, extracellular region, and extracellular region part. KEGG pathway enrichment analysis suggested that DEGs most significantly enriched in Protein digestion and absorption, Gastric acid secretion, and ECM-receptor interaction. Furthermore, the PPI network showed that the top 10 hub genes in GAC were IL8, COL1A1, MMP9, SST, COL1A2, TIMP1, FN1, SPARC, ALDH1A1, and SERPINE1 respectively. The prime gene interaction module in PPI network was enriched in protein digestion and absorption, ECM receptor interaction, the PI3K-Akt signaling pathway, and pathway in cancer. Survival analysis based on the TCGA database found that the expression of the FN1, SERPINE1, and SPARC significantly predicted poor prognosis of GAC. Collectively, we identified several hub genes and key pathways associated with GAC initiation and progression by analyzing the microarray data on DEGs, which provided a detailed molecular mechanism underlying GAC occurrence and progression.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31127138 PMCID: PMC6534579 DOI: 10.1038/s41598-019-43924-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
The demographic and clinical features of the patient.
|
| |
| Male | 20 (~67%) |
| Female | 10 (~33%) |
|
| |
| Median | 43 |
| Range | 23–66 |
| Race | EthnicHan |
|
| |
| I | 5 (~17%) |
| II | 8 (~27%) |
| III | 11 (~37%) |
| IV | 6 (~20%) |
Figure 1Overview of differentially expressed genes (DEGs) in gastric adenocarcinoma. (A) Principal components analysis (PCA) plot for total of 30 samples. (B) the pearson correlation matrix among all samples by calculating pairwise correlation coefficient. The redder the color, the higher the correlation coefficient, while the bluer the color, the lower the correlation coefficient. (C) Volcano plot of the DEGs. Each circle represents a gene, and red circle represents. (D) Heatmap of the 233 DEGs. Each column in the heatmap represents a sample, and each row represents the expression level of a gene. The color scale beside the heatmap represents the raw Z-score ranging from green (low expression) to red (high expression).
The top10 upregulated DEGs and downregulated DEGs.
| prob | Gene_Symbol |
|---|---|
|
| |
| 11723174_a_at | FNDC1 |
| 11726339_s_at | MAGEA3 |
| 11721212_a_at | THBS4 |
| 11755955_a_at | FAP |
| 11740290_a_at | HOXC6 |
| 11720251_at | CCL18 |
| 11715393_a_at | C3 |
| 11735643_a_at | RARRES1 |
| 11728809_a_at | COL8A1 |
| 11742712_a_at | THBS2 |
|
| |
| prob | Gene_Symbol |
| 11733660_a_at | CHIA |
| 11728308_at | KRT20 |
| 11732742_at | ATP4B |
| 11756545_a_at | GKN2 |
| 11723302_a_at | CHGA |
| 11734596_a_at | ATP4A |
| 11744246_at | KCNE2 |
| 11715481_a_at | SST |
| 11723940_at | GKN1 |
| 11728126_x_at | PHGR1 |
KEGG pathway enrichment analysis of DEGs associated with GAC.
| Gene count | Background number | Rich factor | P-Value | Genes |
|---|---|---|---|---|
| 10 | 90 | 11.11% | 3.59E-10 | COL1A2, CPA2, COL10A1, COL11A1, KCNQ1, COL12A1, PGA3, COL1A1, COL6A3, COL2A1 |
| 9 | 74 | 12.16% | 1.34E-09 | KCNJ16, CCKBR, KCNQ1, SST, SLC26A7, ATP4A, ATP4B, KCNE2, CA2 |
| 8 | 82 | 9.76% | 5.48E-08 | COL1A2, THBS2, COL2A1, COL1A1, COL6A3, THBS4, SPP1, FN1 |
| 25 | 1243 | 2.01% | 9.98E-08 | ALDH1A1, CKMT2, ADH1C, GCNT1, ALDH3A1, ADH7, HDC, RDH12, UGT2B15, CYP2C18, HGD, PIK3C2G, LIPF, ACER2, ALDOB, PLA2G7, PLA2G2A, HMGCS2, AGXT2L1, ACSM3, FBP2, NNMT, GLUL, CHIA, CYP2C8 |
| 7 | 65 | 10.77% | 2.04E-07 | ALDH1A1, CYP2C8, UGT2B15, ADH7, ADH1C, CYP2C18, RDH12 |
| 7 | 73 | 9.59% | 4.23E-07 | AKR1C1, CYP1B1, AKR7A3, ALDH3A1, ADH7, UGT2B15, ADH1C |
| 7 | 82 | 8.54% | 8.77E-07 | CYP1B1, CYP2C8, ALDH3A1, ADH7, UGT2B15, ADH1C, CYP2C18 |
| 6 | 69 | 8.70% | 4.90E-06 | FMO5, CYP2C8, ALDH3A1, ADH7, UGT2B15, ADH1C |
| 8 | 203 | 3.94% | 3.32E-05 | COL1A2, THBS2, COL2A1, COL1A1, COL6A3, THBS4, SPP1, FN1 |
| 6 | 101 | 5.94% | 3.78E-05 | COL1A2, SELE, FN1, IL8, COL1A1, SERPINE1 |
Figure 2Statistics of functional and pathway enrichment. (A) Scatter plot of top 20 enriched KEGG pathways. (B) Scatter plot of top 10 enriched GO terms of molecular function (MF), biological process (BP) and cellular component (CC) separately.
Figure 3The networks of the top 5 mostly enrichment function or pathways. (A) The network of top 5 mostly enriched KEGG pathways. (B) The network of top 5 mostly enriched GO terms.
GO enrichment analysis of DEGs associated with GAC.
| GO term | Description | Category | Gene count | Background number | Rich factor | P-Value |
|---|---|---|---|---|---|---|
| GO:0044699 | single-organism process | BP | 186 | 12728 | 1.46% | 3.66E-52 |
| GO:0005576 | extracellular region | CC | 114 | 4424 | 2.58% | 6.13E-47 |
| GO:0044421 | extracellular region part | CC | 104 | 3743 | 2.78% | 7.18E-45 |
| GO:0005615 | extracellular space | CC | 64 | 1330 | 4.81% | 3.28E-39 |
| GO:0005623 | cell | CC | 188 | 15802 | 1.19% | 1.04E-38 |
| GO:0044464 | cell part | CC | 187 | 15776 | 1.19% | 5.27E-38 |
| GO:0009987 | cellular process | BP | 177 | 14647 | 1.21% | 4.54E-35 |
| GO:0032501 | multicellular organismal process | BP | 120 | 6781 | 1.77% | 1.24E-33 |
| GO:0044763 | single-organism cellular process | BP | 155 | 11494 | 1.35% | 2.90E-33 |
| GO:0044707 | single-multicellular organism process | BP | 110 | 5822 | 1.89% | 1.50E-32 |
Figure 4The protein–protein interaction (PPI) network and hub genes. (A) PPI network of DEGs. Only experimentally validated interactions with a combined score >0.4 were selected as significant. The nodes were colored according to whether it belongs to up- or down-regulated genes. The thicknesses of those edges were associated with the combined scored. The size of each node is proportional to the number of connections, that is, the degree. (B) The expression heatmap of TOP10 hub genes.
Figure 5The prime module from the PPI network. (A) The sub-network of the main module. (B) The enriched pathway of the module.
Figure 6Overall survival analysis in GAC based on the TCGA data as determined by Kaplan-Meier estimates. 411 GAC cases with full data of both clinical and 10 hub gene expression were downloaded from TCGA database. Kaplan-Meier estimates (log-rank test) were made and found the expression of (A), FN1; (B), SERPINE1 and (C), SPARC was significantly affect the prognosis of GAC in overall survival (p < 0.05).
Figure 7The expression level of FN2, SERPINE1, and SPARC between the normal tissues and tumor tissues. 32 normal tissues and 375 tumor tissues both containing the three gene expression values were collected from the TCGA database.