| Literature DB >> 30147367 |
Qixing Mao1,2,3,4, Louqian Zhang1,2,3, Yi Zhang1, Gaochao Dong1,3, Yao Yang4, Wenjie Xia1,2,3,4, Bing Chen1,2,3, Weidong Ma1,2,3, Jianzhong Hu4, Feng Jiang1,3, Lin Xu1,3.
Abstract
BACKGROUND: A substantial increase in the number of non-smoking lung adenocarcinoma (LAC) patients has been drawing extensive attention in the past decade. However, effective biomarkers, which could guide the precise treatment, are still limited for identifying high-risk patients. Here, we provide a network-based signature to predict the survival of non-smoking LAC.Entities:
Keywords: LAC; WGCNA; co-expressing; lung adenocarcinoma; prognostic signature; weighted gene co-expression network analysis
Year: 2018 PMID: 30147367 PMCID: PMC6101016 DOI: 10.2147/CMAR.S163918
Source DB: PubMed Journal: Cancer Manag Res ISSN: 1179-1322 Impact factor: 3.989
Figure 1Weighted gene co-expression network analysis identified co-expression gene modules of LAC.
Notes: (A) Clustering dendrogram of different expression genes. Hierarchical cluster analysis dendrogram used to detect co-expression clusters. Each color is assigned to 1 module (gray represented unassigned genes). (B) Network heatmap plot. Genes were sorted in the rows and columns by the clustering tree. Light colors denoted low adjacency and darker colors denoted higher adjacency. (C) Correlation values of different module-trait relationships with different clinical records. (D) Correlation values of module-trait relationships of non-smoking related modules across 4 training datasets.
Abbreviation: ME, module eigengene.
P-values of module-trait relationships of two nonsmoking related modules across 4 training datasets
| Datasets | Blue modules ( | Turquoise modules ( |
|---|---|---|
| 0.041 | <0.001 | |
| 0.07 | 0.004 | |
| 0.038 | <0.001 | |
| <0.001 | <0.001 |
Note:
P-value is significant.
Figure 2GO analysis and co-expression network of non-smoking related modules.
Notes: GO enrichment analysis of the blue and turquoise modules (A and B); visual representation of co-expression networks in the blue and turquoise modules. The width of the lines represented co-expression correlation value (C and D); different sizes of nodes indicated different Module Membership values.
Abbreviation: GO, Gene Ontology.
The top 25 hub genes of the blue and turquoise modules in 4 training datasets
| Rank | GSE10072 | GSE40419 | GSE31210 | GSE68465 |
|---|---|---|---|---|
| 1 | CDK1 | SPAG5 | ASPM | CCNA2 |
| 2 | ||||
| 3 | TOP2A | CDCA5 | PRC1 | |
| 4 | MAD2L1 | K1F2C | ||
| 5 | CDC20 | |||
| 6 | B1RC5 | |||
| 7 | PRC1 | K1F2C | ||
| 8 | ZW1NT | K1F2C | NCAPG | |
| 9 | KIF15 | CENPA | ||
| 10 | ECT2 | K1F20A | CENPF | MAD2L1 |
| 11 | CEP55 | CCNB1 | CEP55 | K1F4A |
| 12 | K1F11 | TOP2A | CHEK1 | |
| 13 | ASPM | TTK | K1F4A | AURKA |
| 14 | NCAPG | K1AA0101 | ||
| 15 | RRM2 | ESPL1 | CDC20 | |
| 16 | KPNA2 | K1F14 | CDC6 | |
| 17 | CENPA | |||
| 18 | CDKN3 | UHRF1 | BIRC5 | HJURP |
| 19 | TTK | CDK1 | ||
| 20 | NCAPH | NEK2 | FEN1 | |
| 21 | CDCA8 | HMMR | BIRC5 | |
| 22 | CENPU | CAV1 | KIAA0101 | |
| 23 | CKS1B | POLQ | FOXM1 | BUB1 |
| 24 | CENPA | KIFC1 | ZWINP | KIF18B |
| 25 | RFC4 | DLGAP5 | ORC6 | CDKN3 |
| 1 | GRK5 | TAL1 | HLA-DRA | |
| 2 | FHL1 | FHL1 | ||
| 3 | EDNRB | ARHGAP6 | TEK | HLA-DQA1 |
| 4 | GRK5 | CAV1 | MMRN2 | C1QA |
| 5 | TEK | NXPH3 | PTPRB | HLA-DPA1 |
| 6 | FAM107A | ABCA8 | FHL5 | HLA-DQB1 |
| 7 | FIGF | CDO1 | S1PR1 | HLA-DRB1 |
| 8 | PECAM1 | KANK3 | CD14 | |
| 9 | JAM2 | ABI3BP | EDNRB | HLA-DRB1 |
| 10 | AGER | CCBE1 | ARHGAP6 | HLA-DQB1 |
| 11 | GPM6A | ERG | ||
| 12 | CLEC3B | ADH1B | SASH1 | CSF1R |
| 13 | ABCA8 | EMCN | GIMAP4 | |
| 14 | FGD5 | TGFBR3 | ||
| 15 | CA4 | AOC3 | PKNOX2 | SERPING |
| 16 | HIGD1B | RADIL | FAM107A | CD163 |
| 17 | FOXF1 | SCUBE1 | AOC3 | HLA-DPB1 |
| 18 | TACC1 | CLEC3B | PECAM1 | |
| 19 | STARD13 | LTBP4 | ADAMTSL3 | ENTPD1 |
| 20 | RAMP2 | TGFBR3 | GIMAP6 | |
| 21 | AOC3 | GRIA1 | ASPA | FCER1G |
| 22 | TGFBR3 | KANK3 | RASIP1 | CD4 |
| 23 | ADH1B | CAV2 | DACH1 | HLA-DQB1 |
| 24 | VWF | ACVRL1 | CDO1 | SPARCL1 |
| 25 | SIPR1 | ADAMTSL3 | SLC7A7 | |
Notes:Bold font: Overlapped hub genes in 4 datasets.
Figure 3Cluster analysis of 17 candidate genes selected by penalized Cox regression in the training group (A). Penalized Cox regression analysis to select survival-associated genes in the training group. (B) The optimal λ value is 14.
Figure 4ROC curves for 17 genes to predict the survival of non-smoking LAC (A). Multi-variable Cox analysis indicated that risk score was an independent prognostic risk factor by adjusting other variables (B). The performance of the prognostic signature in stratifying the high-risk and low-risk groups. (C) training cohort (TCGA), (D) external testing cohort 1 (GSE50081), (E) external testing cohort 2 (GSE50081).
Abbreviations: EGFR, epidermal growth factor receptor; HR, hazard ratio; KRAS, Kirsten ras; LAC, lung adenocarcinoma; TGCA, The Cancer Genome Atlas.
Information of training and validation GEO datasets
| Datasets | Platform | Sample size | Smoking status (never/smoker) | Stage (I/II/III/IV) | Gender (female/male) |
|---|---|---|---|---|---|
| Affymetrix Human Genome U133A Array | 107 | 30/77 | 45/35/21/6 | 38/69 | |
| Illumina Hiseq 2000 | 164 | 70/94 | 109/24/23/8 | 67/97 | |
| Affymetrix Human Genome U133 Plus 2.0 Array | 246 | 123/123 | 168/58 | 130/116 | |
| Affymetrix Human Genome U133A Array | 440 | 49/391 | 276/102/50/12 | 220/220 | |
| Illumina Hiseq | 524 | 214/310 | 283/125/84/27 | 277/243 | |
| Affymetrix Human Genome U133 Plus 2.0 Array | 181 | 103/58 | 127/54 | 84/97 | |
| Affymetrix Human Genome U133 Plus 2.0 Array | 246 | 123/123 | 168/58 | 130/116 |