Jian Xiao1, Xiaoxiao Lu1, Xi Chen2, Yong Zou1, Aibin Liu3, Wei Li4, Bixiu He1, Shuya He5, Qiong Chen1.
Abstract
Lung adenocarcinoma (LADC) and squamous cell carcinoma (LSCC) are the most common non-small cell lung cancer histological phenotypes. Accurate diagnosis distinguishing between these two lung cancer types has clinical significance. For this study, we analyzed four Gene Expression Omnibus (GEO) datasets (GSE28571, GSE37745, GSE43580, and GSE50081). We then imported the datasets into the Gene-Cloud of Biotechnology Information online platform to identify genes differentially expressed in LADC and LSCC. We identified DSG3 (desmoglein 3), KRT5 (keratin 5), KRT6A (keratin 6A), KRT6B (keratin 6B), NKX2-1 (NK2 homeobox 1), SFTA2 (surfactant associated 2), SFTA3 (surfactant associated 3), and TMC5 (transmembrane channel-like 5) as potential biomarkers for distinguishing between LADC and LSCC. Receiver operating characteristic curve analysis suggested that KRT5 had the highest diagnostic value for discriminating between these two cancer types. Using the PrognoScan online survival analysis tool and the Kaplan-Meier Plotter, we found that high KRT6A or KRT6B levels, or low NKX2-1, SFTA3, or TMC5 levels correlated with unfavorable prognoses in LADC patients. Further studies will be needed to verify our findings in additional patient samples, and to elucidate the mechanisms of action of these potential biomarkers in non-small cell lung cancer.
Keywords: adenocarcinoma; biomarker; lung cancer; prognosis; squamous cell carcinoma
Year: 2017 PMID: 29069744 PMCID: PMC5641087 DOI: 10.18632/oncotarget.17606
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1. Study design diagram
LADC: lung adenocarcinoma; LSCC: squamous cell carcinoma; DEGs: differentially expressed genes; GCBI: Gene-Cloud of Biotechnology Information.
Figure 2. Potential DEGs between LADC and LSCC
Heat maps for potential DEGs in GSE28571 (total n=243; LADC n=50; LSCC n=28) (A), GSE37745 (total n=210; LADC n=106; LSCC n=66) (B), GSE43580 (total n=118; LADC n=77; LSCC n=73) (C), and GSE50081 (total n=101; LADC n=128; LSCC n=43) (D).
Top 10 down- or upregulated DEGs between LADC and LSCC in lung cancer dataset, GSE28571
| Probe set ID | Gene symbol | Gene description | Gene feature | Fold change |
|---|---|---|---|---|
| 209125_at | KRT6A | keratin 6A | downregulation | −176.148978 |
| 206165_s_at | CLCA2 | chloride channel accessory 2 | downregulation | −90.443266 |
| 235075_at | DSG3 | desmoglein 3 | downregulation | −88.129812 |
| 201820_at | KRT5 | keratin 5 | downregulation | −82.362516 |
| 217272_s_at | SERPINB13 | serpin peptidase inhibitor, clade B (ovalbumin), member 13 | downregulation | −64.457025 |
| 213680_at | KRT6B | keratin 6B | downregulation | −52.540652 |
| 204455_at | DST | dystonin | downregulation | −46.258579 |
| 209863_s_at | TP63 | tumor protein p63 | downregulation | −45.820729 |
| 206032_at | DSC3 | desmocollin 3 | downregulation | −43.549951 |
| 204855_at | SERPINB5 | serpin peptidase inhibitor, clade B (ovalbumin), member 5 | downregulation | −39.535047 |
| 244056_at | SFTA2 | surfactant associated 2 | upregulation | 31.032507 |
| 228979_at | SFTA3 | surfactant associated 3 | upregulation | 27.153369 |
| 211024_s_at | NKX2-1 | NK2 homeobox 1 | upregulation | 15.422392 |
| 219580_s_at | TMC5 | transmembrane channel-like 5 | upregulation | 11.725501 |
| 229105_at | GPR39 | G protein-coupled receptor 39 | upregulation | 6.443132 |
| 214033_at | ABCC6 | ATP-binding cassette, sub-family C (CFTR/MRP), member 6 | upregulation | 6.288185 |
| 212328_at | LIMCH1 | LIM and calponin homology domains 1 | upregulation | 6.28786 |
| 225822_at | TMEM125 | transmembrane protein 125 | upregulation | 5.919894 |
| 230875_s_at | ATP11A | ATPase, class VI, type 11A | upregulation | 5.787312 |
| 228806_at | RORC | RAR-related orphan receptor C | upregulation | 5.335111 |
Top 10 down- or upregulated DEGs between LADC and LSCC in lung cancer dataset, GSE37745
| Probe set ID | Gene symbol | Gene description | Gene feature | Fold change |
|---|---|---|---|---|
| 209125_at | KRT6A | keratin 6A | downregulation | −140.927 |
| 235075_at | DSG3 | desmoglein 3 | downregulation | −86.646 |
| 206165_s_at | CLCA2 | chloride channel accessory 2 | downregulation | −84.9649 |
| 201820_at | KRT5 | keratin 5 | downregulation | −62.2157 |
| 213680_at | KRT6B | keratin 6B | downregulation | −53.2072 |
| 206032_at | DSC3 | desmocollin 3 | downregulation | −47.29 |
| 209863_s_at | TP63 | tumor protein p63 | downregulation | −44.3825 |
| 204455_at | DST | dystonin | downregulation | −38.1615 |
| 213796_at | SPRR1A | small proline-rich protein 1A | downregulation | −36.8294 |
| 217272_s_at | SERPINB13 | serpin peptidase inhibitor, clade B (ovalbumin), member 13 | downregulation | −36.3898 |
| 228979_at | SFTA3 | surfactant associated 3 | upregulation | 33.59706 |
| 244056_at | SFTA2 | surfactant associated 2 | upregulation | 27.97213 |
| 216623_x_at | TOX3 | TOX high mobility group box family member 3 | upregulation | 21.41014 |
| 206239_s_at | SPINK1 | serine peptidase inhibitor, Kazal type 1 | upregulation | 17.47105 |
| 211024_s_at | NKX2-1 | NK2 homeobox 1 | upregulation | 16.6846 |
| 223806_s_at | NAPSA | napsin A aspartic peptidase | upregulation | 14.23227 |
| 37004_at | SFTPB | surfactant protein B | upregulation | 12.19793 |
| 240304_s_at | TMC5 | transmembrane channel-like 5 | upregulation | 11.27782 |
| 204424_s_at | LMO3 | LIM domain only 3 (rhombotin-like 2) | upregulation | 10.23422 |
| 219612_s_at | FGG | fibrinogen gamma chain | upregulation | 9.826917 |
Top 10 down- or upregulated DEGs between LADC and LSCC in lung cancer dataset, GSE43580
| Probe set ID | Gene symbol | Gene description | Gene feature | Fold change |
|---|---|---|---|---|
| 209125_at | KRT6A | keratin 6A | downregulation | −53.2466 |
| 235075_at | DSG3 | desmoglein 3 | downregulation | −45.44 |
| 206165_s_at | CLCA2 | chloride channel accessory 2 | downregulation | −38.0985 |
| 209863_s_at | TP63 | tumor protein p63 | downregulation | −28.6096 |
| 213796_at | SPRR1A | small proline-rich protein 1A | downregulation | −27.828 |
| 201820_at | KRT5 | keratin 5 | downregulation | −26.5195 |
| 206032_at | DSC3 | desmocollin 3 | downregulation | −25.687 |
| 213680_at | KRT6B | keratin 6B | downregulation | −25.5837 |
| 217272_s_at | SERPINB13 | serpin peptidase inhibitor, clade B (ovalbumin), member 13 | downregulation | −22.7939 |
| 209351_at | KRT14 | keratin 14 | downregulation | −21.4751 |
| 216623_x_at | TOX3 | TOX high mobility group box family member 3 | upregulation | 12.48837 |
| 228979_at | SFTA3 | surfactant associated 3 | upregulation | 9.698342 |
| 244056_at | SFTA2 | surfactant associated 2 | upregulation | 9.34222 |
| 220393_at | LGSN | lengsin, lens protein with glutamine synthetase domain | upregulation | 7.272057 |
| 223806_s_at | NAPSA | napsin A aspartic peptidase | upregulation | 6.387242 |
| 211024_s_at | NKX2-1 | NK2 homeobox 1 | upregulation | 6.235382 |
| 240304_s_at | TMC5 | transmembrane channel-like 5 | upregulation | 5.886752 |
| 229030_at | CAPN8 | calpain 8 | upregulation | 5.558286 |
| 209016_s_at | KRT7 | keratin 7 | upregulation | 5.197863 |
| 206239_s_at | SPINK1 | serine peptidase inhibitor, Kazal type 1 | upregulation | 5.028636 |
Top 10 down- or upregulated DEGs between LADC and LSCC in lung cancer dataset, GSE50081
| Probe set ID | Gene symbol | Gene description | Gene feature | Fold change |
|---|---|---|---|---|
| 209125_at | KRT6A | keratin 6A | downregulation | −57.006103 |
| 213680_at | KRT6B | keratin 6B | downregulation | −39.001783 |
| 201820_at | KRT5 | keratin 5 | downregulation | −37.082683 |
| 207935_s_at | KRT13 | keratin 13 | downregulation | −23.955773 |
| 210020_x_at | CALML3 | calmodulin-like 3 | downregulation | −22.527441 |
| 235075_at | DSG3 | desmoglein 3 | downregulation | −21.167905 |
| 213796_at | SPRR1A | small proline-rich protein 1A | downregulation | −20.461997 |
| 221854_at | PKP1 | plakophilin 1 (ectodermal dysplasia/skin fragility syndrome) | downregulation | −18.214428 |
| 205157_s_at | JUP | junction plakoglobin | downregulation | −17.594235 |
| 209351_at | KRT14 | keratin 14 | downregulation | −16.96603 |
| 228979_at | SFTA3 | surfactant associated 3 | upregulation | 13.36924 |
| 244056_at | SFTA2 | surfactant associated 2 | upregulation | 13.198138 |
| 211024_s_at | NKX2-1 | NK2 homeobox 1 | upregulation | 11.03073 |
| 240304_s_at | TMC5 | transmembrane channel-like 5 | upregulation | 8.335526 |
| 206239_s_at | SPINK1 | serine peptidase inhibitor, Kazal type 1 | upregulation | 7.171856 |
| 209016_s_at | KRT7 | keratin 7 | upregulation | 6.780702 |
| 204124_at | SLC34A2 | solute carrier family 34 (sodium phosphate), member 2 | upregulation | 6.362828 |
| 204437_s_at | FOLR1 | folate receptor 1 (adult) | upregulation | 6.138674 |
| 229177_at | C16orf89 | chromosome 16 open reading frame 89 | upregulation | 6.035951 |
| 204424_s_at | LMO3 | LIM domain only 3 (rhombotin-like 2) | upregulation | 5.987309 |
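The signed fold changes in the four tables above are consistent with a convention in which LADC is compared against LSCC and ratios below 1 are reported as negative reciprocals: squamous markers such as KRT5 are negative, adenocarcinoma markers such as NKX2-1 are positive. The source does not state how the GCBI platform computes the value, so the Python sketch below is an assumption. The function name and the input means are hypothetical.

```python
def signed_fold_change(mean_ladc: float, mean_lscc: float) -> float:
    """Signed fold change of LADC relative to LSCC on a linear scale.

    Assumed convention (not confirmed by the source): a ratio >= 1 is
    reported as is, a ratio < 1 as the negative reciprocal.
    """
    if mean_ladc <= 0 or mean_lscc <= 0:
        raise ValueError("mean expression values must be positive")
    if mean_ladc >= mean_lscc:
        return mean_ladc / mean_lscc
    return -(mean_lscc / mean_ladc)


# Hypothetical mean intensities, for illustration only.
up = signed_fold_change(800.0, 100.0)    # gene higher in LADC
down = signed_fold_change(50.0, 2000.0)  # gene higher in LSCC
print(up, down)
```

Under this convention no value falls strictly between −1 and 1, which matches the ranges seen in the tables.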
Figure 3. Venn diagram showing downregulated DEGs common to all four GEO datasets
Figure 4. Venn diagram showing upregulated DEGs common to all four GEO datasets
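The overlap that the Venn diagrams summarize can be approximated with a set intersection. The sketch below uses only the top-10 gene symbols listed in the four tables above. The study may have intersected longer DEG lists, so this is an illustration, not the authors' procedure. The variable and function names are hypothetical.

```python
# Top-10 gene symbols per dataset, copied from the four tables above.
top10_down = {
    "GSE28571": {"KRT6A", "CLCA2", "DSG3", "KRT5", "SERPINB13",
                 "KRT6B", "DST", "TP63", "DSC3", "SERPINB5"},
    "GSE37745": {"KRT6A", "DSG3", "CLCA2", "KRT5", "KRT6B",
                 "DSC3", "TP63", "DST", "SPRR1A", "SERPINB13"},
    "GSE43580": {"KRT6A", "DSG3", "CLCA2", "TP63", "SPRR1A",
                 "KRT5", "DSC3", "KRT6B", "SERPINB13", "KRT14"},
    "GSE50081": {"KRT6A", "KRT6B", "KRT5", "KRT13", "CALML3",
                 "DSG3", "SPRR1A", "PKP1", "JUP", "KRT14"},
}
top10_up = {
    "GSE28571": {"SFTA2", "SFTA3", "NKX2-1", "TMC5", "GPR39",
                 "ABCC6", "LIMCH1", "TMEM125", "ATP11A", "RORC"},
    "GSE37745": {"SFTA3", "SFTA2", "TOX3", "SPINK1", "NKX2-1",
                 "NAPSA", "SFTPB", "TMC5", "LMO3", "FGG"},
    "GSE43580": {"TOX3", "SFTA3", "SFTA2", "LGSN", "NAPSA",
                 "NKX2-1", "TMC5", "CAPN8", "KRT7", "SPINK1"},
    "GSE50081": {"SFTA3", "SFTA2", "NKX2-1", "TMC5", "SPINK1",
                 "KRT7", "SLC34A2", "FOLR1", "C16orf89", "LMO3"},
}


def common_genes(per_dataset):
    """Return the gene symbols present in every dataset's list."""
    return set.intersection(*per_dataset.values())


common_down = common_genes(top10_down)
common_up = common_genes(top10_up)
print(sorted(common_down))
print(sorted(common_up))
```

On these lists the two intersections together give the eight genes named in the abstract.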
Figure 5. ROC curves for downregulated (A) and upregulated DEGs (B) in distinguishing between LADC and LSCC. TPR: true positive rate; FPR: false positive rate; AUC: area under the curve.
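As a reminder of how the quantities in this caption relate, the following minimal Python sketch sweeps a threshold over one gene's expression values, records (FPR, TPR) pairs, and integrates them with the trapezoidal rule to obtain the AUC. It is not the authors' analysis. The expression values and class labels are hypothetical, and treating one histology as the positive class is an assumption made only for the example.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) pairs from sweeping a threshold over the scores.

    labels: 1 = positive class, 0 = negative class.
    A sample is called positive when its score is >= the threshold.
    """
    pos = sum(labels)
    neg = len(labels) - pos
    if pos == 0 or neg == 0:
        raise ValueError("both classes must be present")
    points = [(0.0, 0.0)]
    for thr in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= thr and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= thr and y == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc(points):
    """Trapezoidal area under a list of (FPR, TPR) points."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Hypothetical expression values for a squamous marker;
# label 1 = LSCC, label 0 = LADC (illustrative choice only).
scores = [9.1, 8.7, 8.2, 7.9, 5.0, 4.8, 6.0, 3.9]
labels = [1, 1, 1, 0, 0, 0, 1, 0]
print(auc(roc_points(scores, labels)))
```

An AUC of 0.5 corresponds to no discrimination and 1.0 to perfect separation of the two groups.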
DSG3, KRT5, KRT6A, and KRT6B prognostic values in LADC and LSCC as assessed by PrognoScan
| Gene symbol | LADC dataset | LADC cases | LADC HR (95% CI) | LADC P value | LSCC dataset | LSCC cases | LSCC HR (95% CI) | LSCC P value |
|---|---|---|---|---|---|---|---|---|
| DSG3 | MICHIGAN-LC | 86 | 2.54 (1.22–5.32) | 0.013244 | - | - | - | >0.05 |
| KRT5 | - | - | - | >0.05 | - | - | - | >0.05 |
| KRT6A | jacob-00182-HLM | 79 | 1.24 (1.06–1.45) | 0.006974 | - | - | - | >0.05 |
| | jacob-00182-MSK | 104 | 1.28 (1.06–1.53) | 0.008562 | | | | |
| | GSE31210 | 204 | 1.39 (1.18–1.63) | 0.000083 | | | | |
| KRT6B | jacob-00182-MSK | 104 | 1.26 (1.07–1.47) | 0.005120 | - | - | - | >0.05 |
| | GSE31210 | 204 | 1.47 (1.23–1.75) | 0.000017 | | | | |
NKX2-1, SFTA2, SFTA3, and TMC5 prognostic values in LADC and LSCC as assessed by PrognoScan
| Gene symbol | LADC dataset | LADC cases | LADC HR (95% CI) | LADC P value | LSCC dataset | LSCC cases | LSCC HR (95% CI) | LSCC P value |
|---|---|---|---|---|---|---|---|---|
| NKX2-1 | jacob-00182-CANDF | 82 | 0.78 (0.64–0.96) | 0.020132 | GSE17710 | 56 | 0.71 (0.52–0.97) | 0.029764 |
| | jacob-00182-HLM | 79 | 0.78 (0.63–0.97) | 0.027745 | | | | |
| | MICHIGAN-LC | 86 | 0.56 (0.36–0.87) | 0.009902 | | | | |
| | GSE31210 | 204 | 0.62 (0.43–0.88) | 0.008218 | | | | |
| | jacob-00182-UM | 178 | 0.81 (0.68–0.97) | 0.021112 | | | | |
| SFTA2 | - | - | - | >0.05 | - | - | - | - |
| SFTA3 | GSE13213 | 117 | 0.89 (0.79–1.00) | 0.048445 | - | - | - | - |
| | GSE31210 | 204 | 0.62 (0.46–0.85) | 0.003019 | | | | |
| TMC5 | jacob-00182-HLM | 79 | 0.45 (0.24–0.84) | 0.012012 | - | - | - | >0.05 |
| | GSE31210 | 204 | 0.30 (0.13–0.68) | 0.004014 | | | | |
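The HR cells in the PrognoScan tables above can be read mechanically: an interval lying entirely above 1 points to worse survival with high expression, and one lying entirely below 1 points to better survival with high expression. This matches the direction of the findings stated in the abstract. The Python sketch below parses a cell and applies that reading. The helper names are hypothetical and it is only an aid for reading the tables, not part of the study's method.

```python
import re


def parse_hr(cell: str):
    """Parse 'HR (lower-upper)' into three floats.

    Accepts either a hyphen or an en dash between the interval bounds.
    """
    m = re.fullmatch(r"\s*([\d.]+)\s*\(([\d.]+)\s*[-–]\s*([\d.]+)\)\s*", cell)
    if m is None:
        raise ValueError(f"unrecognized HR cell: {cell!r}")
    hr, lower, upper = (float(g) for g in m.groups())
    return hr, lower, upper


def interpret(cell: str) -> str:
    """Classify an HR cell by where its 95% interval lies relative to 1."""
    _, lower, upper = parse_hr(cell)
    if lower > 1:
        return "high expression associated with worse survival"
    if upper < 1:
        return "high expression associated with better survival"
    return "interval includes 1"


print(interpret("2.54 (1.22-5.32)"))
print(interpret("0.30 (0.13–0.68)"))
```

Intervals are printed to two decimals, so a bound shown as 1.00 (as in the GSE13213 row) can belong to a result whose reported P value is still below 0.05. For such borderline rows the P value column is the better guide.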
Figure 6. Kaplan-Meier survival curves for KRT6A and KRT6B expression in LADC patients
Verification of potential prognostic indicators via Kaplan-Meier Plotter
| Gene symbol | LADC cases | LADC HR (95% CI) | LADC P value | LSCC cases | LSCC HR (95% CI) | LSCC P value |
|---|---|---|---|---|---|---|
| DSG3 | 673 | 1.09 (0.86–1.39) | 0.48 | 271 | 0.86 (0.63–1.18) | 0.35 |
| | 720 | | | 524 | 0.99 (0.78–1.25) | 0.92 |
| | 720 | | | 524 | 0.94 (0.75–1.20) | 0.63 |
| | 720 | | | 524 | 0.82 (0.65–1.04) | 0.11 |
| | 673 | | | 271 | 0.82 (0.60–1.11) | 0.20 |
| | 720 | | | 524 | 1.02 (0.8–1.29) | 0.88 |
Figure 7. Kaplan-Meier survival curves for NKX2-1, SFTA3, and TMC5 expression in LADC patients
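For readers unfamiliar with the curves in Figures 6 and 7, the following minimal Python sketch implements the Kaplan-Meier product-limit estimator for a single group. The study itself used the Kaplan-Meier Plotter web tool, so this is not the authors' code. The function name and the follow-up data are hypothetical.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time with at least one event.

    times:  follow-up durations
    events: 1 = death observed, 0 = censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all subjects sharing the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        # Deaths and censored subjects both leave the risk set.
        at_risk -= removed
    return curve


# Hypothetical follow-up times (months) and event indicators.
curve = kaplan_meier([5, 8, 8, 12, 15], [1, 1, 0, 1, 0])
for t, s in curve:
    print(t, round(s, 3))
```

Comparing a high-expression and a low-expression group means running the estimator once per group and plotting the two step curves together.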