| Literature DB >> 35117820 |
Lan-Er Shi1, Xin Shang1, Ke-Chao Nie1, Zhi-Qin Lin1, Miao Wang1, Yin-Ying Huang1, Zhang-Zhi Zhu1.
Abstract
BACKGROUND: Pancreatic adenocarcinoma (PC), is a type of digestive tract cancer with the highest mortality all over the word, and its exact pathogenesis is not clear. Therefore, it is of great significance to search for genes related to PC and elucidate its molecular mechanism.Entities:
Keywords: Pancreatic adenocarcinoma (PC); bioinformatics analysis; differentially expressed genes (DEGs); microarray
Year: 2020 PMID: 35117820 PMCID: PMC8799009 DOI: 10.21037/tcr-19-2873
Source DB: PubMed Journal: Transl Cancer Res ISSN: 2218-676X Impact factor: 1.241
Characteristics of datasets in this study
| Dataset | Platform | Number of samples (tumor/control) | Tumor type |
|---|---|---|---|
| GSE15471 | GPL570[(HG-U133_Plus_2) Affymetrix Human Genome U133 Plus 2.0 Array] | 78 (39/39) | Pancreatic cancer |
| GSE28735 | GPL6244[(HuGene-1_0-st) Affymetrix Human Gene 1.0 ST Array] | 90 (45/45) | Pancreatic cancer |
| GSE32676 | GPL570[(HG-U133_Plus_2) Affymetrix Human Genome U133 Plus 2.0 Array] | 32 (25/7) | Pancreatic cancer |
| GSE39751 | GPL5936 (HEEBO Human oligo array) | 24 (12/12) | Pancreatic cancer |
| GSE43795 | GPL10558(Illumina HumanHT-12 V4.0 expression beadchip) | 31 (26/5) | Pancreatic cancer |
| GSE55643 | GPL6480 (Agilent-01485 0 Whole Human Genome Microarray 4x44K G4112F) | 53 (45/8) | Pancreatic cancer |
| GSE62165 | GPL13667[(HG-U219) Affymetrix Human Genome U219 Array] | 131 (118/13) | Pancreatic cancer |
| GSE62452 | GPL6244[(HuGene-1_0-st) Affymetrix Human Gene 1.0 ST Array] | 130 (61/69) | Pancreatic cancer |
| TCGA | Illumina HiSeq | 182 (178/4) | Pancreatic cancer |
Figure 1Identification of DEGs in tumor tissues and adjacent nontumor tissues from PC patients. (A) Volcano plots of DEGs in GEO database. (B) Volcano plots of DEGs in TCGA database. The red dots represent the upregulated genes based on an adjusted P<0.05 and log FC >2; the green dots represent the downregulated genes based on an adjusted P<0.05 and log FC <−2; the black spots represent genes with no significant difference in expression. FC, fold change; GEO, Gene Expression Omnibus. (C) Venn diagrams of common DEGs of GEO and TCGA PC dataset.
Identification of 41 DEGs from eight profile in GEO, including 19 upregulated genes and 22 downregulated genes
| DEGs | Gene names |
|---|---|
| Upregulated |
|
| Downregulated |
|
GEO, Gene Expression Omnibus; DEGs, differentially expressed genes.
Identification of DEGs in TCGA
| DEGs | Gene names |
|---|---|
| Upregulated |
|
| Downregulated |
|
A total of 446 DEGs were identified, including 26 upregulated genes and 420 downregulated genes. TCGA, The Cancer Genome Atlas; DEGs, differentially expressed genes.
Figure 2Expression of TSPAN1 and its effect on OS. (A) The gene expression profile across 33 kinds of tumor samples and paired normal tissues. Each dot represents expression of samples. Red indicates high expression and green indicates low expression. (B) relative expression of TSPAN1 in normal tissues and PC tissues (*, P<0.05). (C) Kaplan-Meier survival curves of TSPAN1 in PC patients
Figure 3The relative expression of TSPAN1 in normal tissues and PC tissues of PC patients. (A) Patient’s gender; (B) patient’s age; (C) tumor grade; (D) individual cancer stage. *, P<0.05; ***, P<0.001.
Figure 4GSEA was used to perform hallmark analysis in TSPAN1, results suggested that TSPAN1 significantly involved in the pathway of O glycan biosynthesis, TJ, glycerophospholipid metabolism, p53 signaling pathway, and glycolysis gluconeogenesis.
GSEA analysis of TSPAN1
| Gene set | ES | NES | P | FDR |
|---|---|---|---|---|
| O Glycan Biosynthesis | 0.699 | 1.876 | 0.002 | 0.088 |
| Tight Junction | 0.457 | 1.643 | 0.004 | 0.161 |
| Glycerophospholipid Metabolism | 0.451 | 1.567 | 0.006 | 0.193 |
| P53 Signaling Pathway | 0.552 | 1.750 | 0.011 | 0.100 |
| Glycolysis Gluconeogenesis | 0.494 | 1.618 | 0.019 | 0.152 |
ES, enrichment score; NES, normalized enrichment score.
43 genes significantly related to survival time identified from univariate Cox regression model
| Gene | HR | Z | P value |
|---|---|---|---|
|
| 0.703639506 | −3.75906501 | 0.00017055 |
|
| 1.438868912 | 3.588130084 | 0.000333058 |
|
| 0.776088386 | −3.57570495 | 0.000349285 |
|
| 1.363523877 | 3.508204535 | 0.000451142 |
|
| 0.756703186 | −3.503799083 | 0.000458671 |
|
| 1.345648032 | 3.502539144 | 0.000460846 |
|
| 0.66083542 | −3.443526972 | 0.000574179 |
|
| 1.400354154 | 3.357058121 | 0.000787766 |
|
| 1.298382346 | 3.197651888 | 0.001385514 |
|
| 1.191687987 | 3.149919639 | 0.001633154 |
|
| 1.179596007 | 3.091184704 | 0.001993596 |
|
| 1.155409242 | 3.071293664 | 0.002131334 |
|
| 1.203202385 | 2.998880938 | 0.002709732 |
|
| 0.835728686 | −2.993100039 | 0.002761592 |
|
| 1.140309517 | 2.776945264 | 0.005487242 |
|
| 0.832208745 | −2.736346206 | 0.006212563 |
|
| 0.749149411 | −2.711469636 | 0.006698568 |
|
| 0.825456763 | −2.703393517 | 0.006863544 |
|
| 0.777298851 | −2.692655943 | 0.007088538 |
|
| 0.833587002 | −2.692401593 | 0.007093947 |
|
| 0.755658934 | −2.561809842 | 0.010412831 |
|
| 1.181711465 | 2.488663846 | 0.012822414 |
|
| 1.182505672 | 2.451743156 | 0.01421661 |
|
| 0.867313614 | −2.405796106 | 0.016137268 |
|
| 1.145824966 | 2.391443842 | 0.016782251 |
|
| 1.187844904 | 2.343005963 | 0.019129077 |
|
| 1.144437783 | 2.326983781 | 0.019966128 |
|
| 0.878285206 | −2.261701774 | 0.023715835 |
|
| 0.796884219 | −2.261683447 | 0.023716968 |
|
| 0.827171237 | −2.24864359 | 0.024535181 |
|
| 0.740642374 | −2.234866064 | 0.025426143 |
|
| 1.130492663 | 2.210796342 | 0.027049943 |
|
| 0.828394267 | −2.20007862 | 0.027801317 |
|
| 0.72930414 | −2.171415905 | 0.029899748 |
|
| 0.795211505 | −2.16555869 | 0.030344926 |
|
| 0.82872131 | −2.134770338 | 0.032779774 |
|
| 1.124525193 | 2.116437642 | 0.034307603 |
|
| 0.901979278 | −2.053300195 | 0.040043472 |
|
| 0.823390068 | −2.053220159 | 0.04005123 |
|
| 0.812586746 | −2.029752065 | 0.042381748 |
|
| 0.861947085 | −2.003268123 | 0.045148517 |
|
| 0.887148466 | −1.993301358 | 0.046228458 |
|
| 0.837601622 | −1.980687455 | 0.047626334 |
Prognostic value of the four genes in the PAAD patients of the TCGA dataset
| Gene symbol | Univariate analysis | Multivariate analysis | ||||
|---|---|---|---|---|---|---|
| HR (95%CI) | P value | HR (95%CI) | P value | Coefficient | ||
| AIM2 | 1.1458 | 0.0168 | 1.2555 | 0.0056 | 0.2275 | |
| B3GNT3 | 1.4389 | 0.0003 | 1.4298 | 0.0291 | 0.3576 | |
| MATK | 0.8126 | 0.0424 | 0.7087 | 0.0321 | −0.3443 | |
| BEND4 | 0.8783 | 0.0237 | 0.8084 | 0.0326 | −0.2127 | |
HR, hazard ratio; CI, confidence interval.
Figure 5Prognostic gene signature of the four genes in patients with PC from TCGA dataset. (A) Distribution of risk scores in low-risk and high-risk groups; (B) survival status distribution; (C) the heatmap of the four genes for low- and high-risk group; (D) Kaplan-Meier curves for low-and high-risk groups; (E) receiver operating characteristic curve (ROC) curve of OS in PC patients was predicted according to risk score.