| Literature DB >> 33195688 |
Yingqing Zhang1, Xiaoping Zhang2, Xiaodong Lv1, Ming Zhang1, Xixi Gao1, Jialiang Liu1, Yufen Xu3, Zhixian Fang1, Wenyu Chen1.
Abstract
BACKGROUND: Prognosis is a main factor affecting the survival of patients with lung adenocarcinoma (LUAD), yet no robust prognostic model of high effectiveness has been developed. This study is aimed at constructing a stable and practicable gene signature-based model via bioinformatics methods for predicting the prognosis of LUAD sufferers.Entities:
Mesh:
Year: 2020 PMID: 33195688 PMCID: PMC7641279 DOI: 10.1155/2020/1836542
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Candidate gene selection. (a) Volcano plot of the DEGs in the TCGA-LUAD dataset; (b) regression coefficients in Lasso regression analysis; (c) selection of lambda in the Lasso regression model through 10-fold crossvalidation method.
Top 20 genes associated with survival.
| ID | HR | HR.95L | HR.95H |
|
|---|---|---|---|---|
| FAM83A | 1.00004 | 1.000026 | 1.000053 | 8.36 |
| RHOV | 1.000144 | 1.000093 | 1.000196 | 4.16 |
| TNS4 | 1.000136 | 1.000083 | 1.00019 | 5.85 |
| C1QTNF6 | 1.000485 | 1.000288 | 1.000683 | 1.51 |
| COL7A1 | 1.000145 | 1.000079 | 1.000211 | 1.86 |
| TRPA1 | 1.002849 | 1.001522 | 1.004177 | 2.53 |
| CREG2 | 1.000977 | 1.000508 | 1.001445 | 4.34 |
| UNC5D | 1.000503 | 1.000261 | 1.000746 | 4.69 |
| INHA | 1.000112 | 1.000056 | 1.000167 | 8.79 |
| AHNAK2 | 1.000072 | 1.000035 | 1.000108 | 0.000107 |
| OLFM4 | 1.000025 | 1.000012 | 1.000037 | 0.00011 |
| KLK8 | 1.000654 | 1.000312 | 1.000996 | 0.000177 |
| FAM83B | 1.001885 | 1.000896 | 1.002875 | 0.000186 |
| KRT6A | 1.000015 | 1.000007 | 1.000022 | 0.000285 |
| MUCL1 | 1.000166 | 1.000076 | 1.000255 | 0.000305 |
| MFI2 | 1.000104 | 1.000047 | 1.000162 | 0.000381 |
| ERO1L | 1.000042 | 1.000019 | 1.000065 | 0.000388 |
| PSG5 | 1.053914 | 1.023725 | 1.084994 | 0.000398 |
| NTSR1 | 1.000394 | 1.000171 | 1.000616 | 0.00054 |
| TMPRSS11E | 1.000327 | 1.000141 | 1.000513 | 0.000562 |
The 7 genes in the optimal multivariate COX regression model.
| ID | Coefficient | HR | HR.95L | HR.95H |
|
|---|---|---|---|---|---|
| NTSR1 | 0.000244865 | 1.000244895 | 0.999995 | 1.000495 | 0.05509 |
| RHOV | 7.13 | 1.000071315 | 0.999989 | 1.000153 | 0.08795 |
| KLK8 | 0.000505343 | 1.000505471 | 1.000028 | 1.000983 | 0.037875 |
| TNS4 | 7.01 | 1.000070108 | 0.999986 | 1.000154 | 0.103366 |
| C1QTNF6 | 0.000287673 | 1.000287714 | 1.000022 | 1.000553 | 0.033585 |
| IVL | 0.000440486 | 1.000440583 | 1.000203 | 1.000678 | 0.000272 |
| B4GALNT2 | 0.000161186 | 1.000161199 | 1.000032 | 1.00029 | 0.014511 |
Figure 2Evaluation of the 7-gene signature-based model in predicting the prognosis of LUAD patients. (a, b) The survival time between the patients of the high- and low-risk groups within the training set and testing set was analyzed by Kaplan-Meier analysis; (c–e) heat map for the 7 genes, as well as the risk score distribution and survival of patients in the training set; (f, g, h) heat map for the 7 genes, as well as the risk score distribution and survival of patients in the testing set; (i, j) ROC curves of the 7-gene signature-based model in two sets.
Figure 3Verification of stability and validity of the prognostic model for LUAD with an independent dataset GSE26939. (a) Survival difference between the patients in the high- and low-risk groups was revealed by Kaplan-Meier analysis; (b) ROC curves of the 7-gene model using the independent dataset.
Clinical information of LUAD patients in the training set.
| Low risk ( | High risk ( |
| |
|---|---|---|---|
|
| |||
| <65 | 51 (44.0%) | 59 (50.4%) | 0.392 |
| >65 | 65 (56.0%) | 58 (49.6%) | |
|
| |||
| Yes | 17 (14.7%) | 40 (34.2%) | <0.001 |
| No | 99 (85.3%) | 77 (65.8%) | |
|
| |||
| Female | 61 (52.6%) | 65 (55.6%) | 0.746 |
| Male | 55 (47.4%) | 52 (44.4%) | |
|
| |||
| T1 | 53 (45.7%) | 33 (28.2%) | 0.0246 |
| T2 | 52 (44.8%) | 71 (60.7%) | |
| T3 | 8 (6.9%) | 12 (10.3%) | |
| T4 | 3 (2.6%) | 1 (0.9%) | |
|
| |||
| N0 | 85 (73.3%) | 56 (47.9%) | <0.001 |
| N1 | 16 (13.8%) | 32 (27.4%) | |
| N2 | 15 (12.9%) | 29 (24.8%) | |
|
| |||
| Stage I | 75 (64.7%) | 48 (41.0%) | 0.0022 |
| Stage II | 22 (19.0%) | 35 (29.9%) | |
| Stage III | 15 (12.9%) | 31 (26.5%) | |
| Stage IV | 4 (3.4%) | 3 (2.6%) | |
|
| |||
| ≤40 pack years of smoke | 71 (61.2%) | 75 (64.1%) | 0.748 |
| >40 pack years of smoke | 45 (38.8%) | 42 (35.9%) |
Clinical information of LUAD patients in the testing set.
| Low risk ( | High risk ( |
| |
|---|---|---|---|
|
| |||
| <65 | 41 (36.6%) | 53 (46.1%) | 0.189 |
| >65 | 71 (63.4%) | 62 (53.9%) | |
|
| |||
| Yes | 22 (19.6%) | 34 (29.6%) | 0.114 |
| No | 90 (80.4%) | 81 (70.4%) | |
|
| |||
| Female | 62 (55.4%) | 59 (51.3%) | 0.632 |
| Male | 50 (44.6%) | 56 (48.7%) | |
|
| |||
| T1 | 42 (37.5%) | 30 (26.1%) | 0.00148 |
| T2 | 62 (55.4%) | 57 (49.6%) | |
| T3 | 3 (2.7%) | 20 (17.4%) | |
| T4 | 5 (4.5%) | 8 (7.0%) | |
|
| |||
| N0 | 94 (83.9%) | 71 (61.7%) | <0.001 |
| N1 | 13 (11.6%) | 23 (20.0%) | |
| N2 | 5 (4.5%) | 21 (18.3%) | |
|
| |||
| Stage I | 77 (68.8%) | 51 (44.3%) | 0.00205 |
| Stage II | 20 (17.9%) | 32 (27.8%) | |
| Stage III | 9 (8.0%) | 23 (20.0%) | |
| Stage IV | 6 (5.4%) | 9 (7.8%) | |
|
| |||
| ≤40 pack years of smoke | 75 (67.0%) | 61 (53.0%) | 0.045 |
| >40 pack years of smoke | 37 (33.0%) | 54 (47.0%) |
Figure 4Prognostic impact of the model in clinical characteristics. The correlation of the 7-gene signature-based risk score with the TNM stage was analyzed in the (a–c) training set and (d–f) testing set. (g) OS curves were plotted for validation of the predictive ability of the 7-gene signature-based risk score in patients with different age and TNM stage.