| Literature DB >> 30588191 |
Meng-Yu Zhang1, Xiao-Xia Liu1, Hao Li1, Rui Li1, Xiao Liu1, Yi-Qing Qu1.
Abstract
Background and aim: Adenocarcinoma is a very common pathological subtype for lung cancer. We aimed to identify the gene signature associated with the prognosis of smoking related lung adenocarcinoma using bioinformatics analysis.Entities:
Keywords: Kaplan-Meier analysis; biomarkers; differentially expressed genes; gene ontology; lung adenocarcinoma
Mesh:
Substances:
Year: 2018 PMID: 30588191 PMCID: PMC6299412 DOI: 10.7150/ijms.28728
Source DB: PubMed Journal: Int J Med Sci ISSN: 1449-1907 Impact factor: 3.738
Clinical characteristics and correlations with mRNA expression of AURKA, TPX2 and CDC20.
| Characteristic | n=497 | AURKA | TPX2 | CDC20 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Lown=248 | Highn=249 | P value | Lown=248 | Highn=249 | P value | Lown=248 | Highn=249 | P value | ||
| Age (years) | 0.043 | 0.029 | <0.001 | |||||||
| <65 | 214 | 96 | 118 | 96 | 118 | 86 | 128 | |||
| >=65 | 264 | 143 | 121 | 145 | 119 | 156 | 108 | |||
| Not given | 19 | 9 | 10 | 7 | 12 | 6 | 13 | |||
| Gender | <0.001 | 0.005 | 0.005 | |||||||
| Female | 230 | 95 | 135 | 99 | 131 | 99 | 131 | |||
| Male | 267 | 153 | 114 | 149 | 118 | 149 | 118 | |||
| Smoking history | 0.002 | 0.008 | 0.004 | |||||||
| Smoker | 422 | 198 | 224 | 200 | 222 | 199 | 223 | |||
| Non-smoker | 75 | 50 | 25 | 48 | 27 | 49 | 26 | |||
| New tumor event | 0.114 | 0.008 | 0.013 | |||||||
| YES | 118 | 53 | 65 | 47 | 71 | 48 | 70 | |||
| NO | 257 | 138 | 119 | 140 | 117 | 140 | 117 | |||
| Not given | 122 | 57 | 65 | 61 | 61 | 60 | 62 | |||
| Pathological T stage | 0.062 | 0.001 | 0.001 | |||||||
| T1 | 164 | 94 | 70 | 102 | 64 | 100 | 64 | |||
| T2 | 267 | 122 | 145 | 115 | 152 | 115 | 152 | |||
| T3 + T4 | 64 | 31 | 33 | 30 | 34 | 32 | 32 | |||
| unknown | 2 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| Therapy outcome | 0.002 | <0.001 | 0.014 | |||||||
| *CR+PR | 232 | 128 | 104 | 131 | 101 | 127 | 105 | |||
| *SD+PD | 71 | 24 | 47 | 23 | 48 | 27 | 44 | |||
| unknown | 194 | 96 | 98 | 94 | 100 | 94 | 100 | |||
*CR+PR:Complete Remission+Partial Remission.
*SD+PD:Stable Disease+Progressive Disease.
Figure 1Identification of DEGs. A-E. Volcano plots of the different mRNA expression analysis. X-axis: log 2 fold change; Y-axis: -log10 p-value for each probes; A: There were 829 genes identified to be differentially expressed in GSE31210, including 274 up-regulated and 555 down-regulated genes. B: 3564 genes (1682 up-regulated and 1882 down-regulated genes) identified to be differentially expressed in GSE32863. C: 10896 genes (5064 up-regulated and 5832 down-regulated genes) differentially expressed in GSE40791. D: 831 genes (195 up-regulated and 636 down-regulated genes) in GSE43458. E: 7726 genes (3771 up-regulated and 3955 down-regulated genes) in GSE75037. F-H. Overlap analysis between different datasets. F: A total of 2226 genes were significantly differentially expressed in three lung adenocarcinoma GEO datasets. G: 140 genes were overlapped in two smoking related lung adenocarcinoma GEO datasets. H: There were 58 overlapping genes significantly differentially expressed between smokers and non-smoers of lung adenocarcinoma in five GEO datasets.
Figure 2Functional enrichment analysis of 58 DEGs. A: The significantly enriched GO categories were calculated. Blue: Molecular function; Green: Cellular component; Red: Biological process. B: Gene networks identified through KEGG analysis of the DEGs.
Figure 3Validation of the gene expression in five GEO datasets. A: Validation of mRNA expression of AURKA in five GEO datasets. B: Validation of mRNA expression of CDC20 in five GEO datasets. C: Validation of mRNA expression of TPX2 in five GEO datasets.
Figure 4A-C. Validation of the gene expression between non-smoker lung adenocarcinoma and smoker lung adenocarcinoma. A: Validation of mRNA expression of AURKA in TCGA database. B: Validation of mRNA expression of CDC20 in TCGA database. C: Validation of mRNA expression of TPX2 in TCGA database. D-F. Gene expression of smoking related lung adenocarcinoma according to clinical stage in TCGA database. D: AURKA mRNA expression of smoking related lung adenocarcinoma according to clinical stage of TCGA database. E: CDC20 mRNA expression of smoking related lung adenocarcinoma according to clinical stage in TCGA database. F: TPX2 mRNA expression of smoking related lung adenocarcinoma according to clinical stage in TCGA database. G-I. Gene expression of smoking related lung adenocarcinoma according to smoking history in TCGA database. G: AURKA mRNA expression of smoking related lung adenocarcinoma according to smoking history in TCGA database. H: CDC20 mRNA expression of smoking related lung adenocarcinoma according to smoking history in TCGA database. I: TPX2 mRNA expression of smoking related lung adenocarcinoma according to smoking history in TCGA database.
Figure 5Kaplan-Meier survival curves by different mRNA expression levels of AURKA, CDC20 and TPX2 of 497 smoking related lung adenocarcinoma in TCGA database. A: OS between low and high AURKA mRNA expression. B: OS between low and high CDC20 mRNA expression. C: OS between low and high TPX2 mRNA expression. D: RFS between low and high AURKA mRNA expression. E: RFS between low and high CDC20 mRNA expression. F: RFS between low and high TPX2 mRNA expression.
Figure 6Kaplan-Meier survival curves by different groups of 497 smoking related lung adenocarcinoma in TCGA database. A: OS among three different groups. B: RFS among three different groups.