| Literature DB >> 34321959 |
Shui Liu1, Yan Cai2, E Changyong3, Jiyao Sheng1, Xuewen Zhang1.
Abstract
Pancreatic cancer is a digestive system malignant tumor with high mortality and poor prognosis, but the mechanisms of progression remain unclear in pancreatic cancer. It's necessary to identify the hub genes in pancreatic cancer and explore the novel potential predictors in the prognosis of pancreatic cancer. We downloaded two mRNA expression profiles from Gene Expression Omnibus and The Cancer Genome Atlas Pancreatic Cancer (TCGA-PAAD) datasets to screen the commonly differentially expressed genes in pancreatic cancer by limma package in R. Subsequently, measurement of the functional similarity among the 38 DEGs in common was performed to identify the hub genes using GOSemSim package. Then, survival analysis and Cox regression were applied to explore prognosis-related hub genes using the survival package. Statistics analysis by two-tailed Student's t-test or one-way based on TCGA-PAAD datasets and qPCR detection in clinical samples were performed to explore the correlations between expression of hub genes in pancreatic cancer tissues and clinical parameters. Based on integrated analysis of TCGA and GEO datasets, we screened 38 DEGs in common, which were all up-regulated. The functional similarity results showed that 10 DEGs including TSPAN1, MSLN, C1orf116, PKP3, CEACAM6, BAIAP2L1, PPL, RAB25, ERBB3, and AP1M2 in the DEGs in common, which had the higher average functional similarity, were considered as the hub genes. Survival analysis results and Cox regression analysis showed that TSPAN1, CEACAM6, as well as ERBB3 were all associated with poor overall survival of PC. qPCR results showed that the expression levels of TSPAN1 and ERBB3 were significantly upregulated in the PC tissues. The statistical analysis results revealed that TSPAN1 expression correlated significantly with histologic grade, T stage, clinical stage, and vital status by two-tailed Student's t-test or one-way ANOVA; ERBB3 expression correlated significantly with T stage, clinical stage, and vital status by two-tailed Student's t-test or one-way ANOVA. We found that TSPAN1 and ERBB3 could be independent predictors of poor survival in pancreatic cancer.Entities:
Keywords: erb-b2 receptor tyrosine kinase 3; gene expression omnibus; pancreatic cancer; prognosis; tetraspanin-1; the cancer genome atlas
Mesh:
Substances:
Year: 2021 PMID: 34321959 PMCID: PMC8310909 DOI: 10.3389/pore.2021.1609868
Source DB: PubMed Journal: Pathol Oncol Res ISSN: 1219-4956 Impact factor: 3.201
FIGURE 1Flow chart of the study.
The Co-upregulated DEGs in mRNA expression profiling datasets GSE16515, GSE32676, and TCGA-PAAD.
| Gene symbol | |
|---|---|
| Co-upregulated DEGs | S100P, CEACAM5, CEACAM6, TMPRSS4, SERPINB5, MSLN, SDR16C5, AGR2, TSPAN1, NQO1, NMU, FXYD3, EPS8L3, GALNT5, TMEM45B, MLPH, MUC13, FOXQ1, STYK1, KCNK1, CAMK2N1, BAIAP2L1, RAB25, PLS1, C1orf116, B3GNT3, PKP3, CGN, USH1C, TUFT1, PERP, TNS4, ERBB3, CDS1, PPL, MYH14, RNF128, AP1M2 |
FIGURE 2Identification of common DEGs in mRNA expression profiling datasets GSE16515, GSE32676, and TCGA-PAAD. Volcano plots and Venn diagrams of differentially expressed genes. (A) Venn diagram of upregulated DEGs between GSE16515, GSE32676, and TCGA-PAAD datasets; (B) Venn diagram of downregulated DEGs between GSE16515, GSE32676, and TCGA-PAAD datasets; (C) Volcano plot of DEGs of GSE16515 datasets; (D) Volcano plot of DEGs of GSE32676 datasets; (E) Volcano plot of DEGs of TCGA-PAAD datasets. Upregulated genes are marked in red; downregulated genes are marked in light green.
FIGURE 3Clustering of the 38 DEGs in PAAD tissues vs. adjacent non-tumor tissues across each independent dataset. (A–C) A hierarchical clustering heat map showing the DEGs in common in PC tissues compared to adjacent non-tumor tissues.
FIGURE 4Summary of functional similarities of the DEGs in common in PC. The rainclouds were performed to reflect the distributions of functional similarities of 10 genes with top 10 scores, which were considered as the hub genes in PC. The dashed line represents the cutoff value.
FIGURE 5Kaplan–Meier curves for the survival of PAAD patients according to hub gene expression based on TCGA-PAAD datasets. Patients were divided into high and low hub gene expression groups using the median value of hub gene expression as the cutoff. Survival analysis and subgroup analysis according to grade, and TNM stage were performed based on Kaplan–Meier curves.
Univariate and multivariate analyses of clinicopathological characteristics, hub genes with overall survival in TCGA PAAD cohort.
| Parameters | Univariate analysis | Multivariate analysis | ||
|---|---|---|---|---|
| TCGA PAAD set | HR (95% CI) | P-value | HR (95% CI) | P-value |
| TSPAN1 | 1.485 (1.106–1.992) |
| 1.364 (1.014–1.835) |
|
| MSLN | 1.267 (0.948–1.694) | 0.11 | ||
| C1orf116 | 1.228 (0.919–1.642) | 0.166 | ||
| PKP3 | 1.332 (0.995–1.784) | 0.054 | ||
| CEACAM6 | 1.363 (1.016–1.827) |
| 1.104 (0.817–1.49) | 0.52 |
| BAIAP2L1 | 1.106 (0.828–1.478) | 0.494 | ||
| PPL | 1.288 (0.962–1.725) | 0.089 | ||
| RAB25 | 1.287 (0.962–1.722) | 0.089 | ||
| ERBB3 | 1.446 (1.078–1.938) |
| 1.35 (1.005–1.814) |
|
| AP1M2 | 1.105 (0.827–1.475) | 0.499 | ||
| Gender | 0.872 (0.653–1.163) | 0.35 | ||
| Age | 1.208 (0.904–1.614) | 0.202 | ||
| Histologic grade | 1.343 (0.988–1.824) | 0.06 | ||
| Pathologic T stage | 1.663 (1.062–2.604) |
| 1.147 (0.724–1.816) | 0.560 |
| Pathologic N Stage | 1.698 (1.177–2.449) |
| 1.647 (1.16–2.431) |
|
| Pathologic M stage | 1.035 (0.377–2.846) | 0.947 | ||
| Tumor stage | 0.805 (0.356–1.822) | 0.603 | ||
Bold values of p ≤ 0.05 indicate statistically significant correlations.
FIGURE 6Validation of the expression of the 10 hub genes in 10 paired PC and para-carcinoma tissues by qPCR. All samples were normalized to the expression of GAPDH. The relative expression level of each gene was analyzed using the 2−ΔΔCt method. *p < 0.05, **p < 0.01, # p > 0.05.
FIGURE 7Boxplots for differential expression of TSPAN1 and ERBB3 according to age, gender, grade, clinical stage of cancer, TNM stage of cancer, and survival status based on TCGA-PAAD datasets. P-values were calculated using a two-tailed Student’s t-test or one-way ANOVA.