| Literature DB >> 36211815 |
Yisheng Huang1,2,3, Liling Qiu4, Xiaoye Liang2, Jing Zhao3, Haoting Chen5, Zhiqiang Luo6, Wanzhen Li2, Xiaohua Lin2, Jingjie Jin3, Jian Huang6, Gong Zhang3.
Abstract
The occurrence of lung adenocarcinoma (LUAD) is a complicated process, involving the genetic and epigenetic changes of proto-oncogenes and oncogenes. The objective of this study was to establish new predictive signatures of lung adenocarcinoma based on copy number variations (CNVs) and gene expression data. Next-generation sequencing was implemented to obtain gene expression and CNV information. According to univariate, multivariate survival Cox regression analysis, and LASSO analysis, the expression profiles of lung adenocarcinoma patients were screened and a risk score formula was established and experimentally validated in a local cohort. The model was evaluated by three independent cohorts (TCGA-LUAD, GSE31210, and GSE30219), and then validated by clinical samples from LUAD patients. A total of 844 CNV-related differentially expressed genes (CNV-related DEGs) were identified. These genes are significantly associated with the imbalance of various oxidative stress pathways. A CNV-associated-six gene signature was dramatically linked to overall survival in lung adenocarcinoma samples from both training and validation groups. Functional enrichment analysis further revealed involvement of genes in p53 signaling pathway and cell cycle as well as the mismatch repair pathway. Risk score is an independent marker considering clinical parameters and had better prediction in clinical subpopulation. The same signature also classified tumor tissues of clinical patients with CNV detected from their corresponding nontumorous tissues with an accuracy of 0.92. In conclusion, we identified a new class of 6 CNV-related gene markers that may act as efficient prognostic predictors of lung adenocarcinoma, thus contributing to individualized treatment decisions in patients.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36211815 PMCID: PMC9535135 DOI: 10.1155/2022/6962163
Source DB: PubMed Journal: Oxid Med Cell Longev ISSN: 1942-0994 Impact factor: 7.310
Clinical sample information for three datasets.
| Clinical features | TCGA-LUAD | GSE31210 | GSE30219 |
|---|---|---|---|
| PFS | |||
| 0 | 294 | 162 | 56 |
| 1 | 206 | 64 | 27 |
| T stage | |||
| T1 | 167 | ||
| T2 | 267 | ||
| T3 | 45 | ||
| T4 | 18 | ||
| TX | 3 | ||
| N stage | |||
| N0 | 324 | ||
| N1 | 94 | ||
| N2 | 69 | ||
| N3 | 2 | ||
| NX | 11 | ||
| M stage | |||
| M0 | 332 | ||
| M1 | 24 | ||
| MX | 144 | ||
| Stage | |||
| I | 268 | ||
| II | 119 | ||
| III | 80 | ||
| IV | 25 | ||
| X | 8 | ||
| Gender | |||
| Male | 230 | ||
| Female | 270 | ||
| Chemotherapy | |||
| YES | 175 | ||
| NO | 325 | ||
| Radiation_therapy | |||
| YES | 58 | ||
| NO | 361 | ||
| Unknown | 81 | ||
| Age | |||
| ≤65 | 237 | ||
| >65 | 253 | ||
| Unknown | 10 | ||
| Smoking∗ | |||
| 1 | 71 | ||
| 2 | 119 | ||
| 3 | 129 | ||
| 4 | 163 | ||
| 5 | 4 | ||
| 7 | 14 |
∗Lifelong nonsmoker (less than 100 cigarettes smoked in lifetime) = 1; current smoker (includes daily smokers and nondaily smokers or occasional smokers) = 2; current reformed smoker for >15 years (greater than 15 years) = 3; current reformed smoker for ≤15 years (less than or equal to 15 years) = 4; current reformed smoker, duration not specified = 5; smoking History not documented = 7.
Figure 1The workflow of this study.
Comparison of TCGA training set and validation set sample information.
| Clinical features | TCGA-train | TCGA-test |
|
|---|---|---|---|
| PFS | |||
| 0 | 210 | 84 | 0.4632 |
| 1 | 140 | 66 | |
| T stage | |||
| T1 | 106 | 61 | 0.001 |
| T2 | 199 | 68 | |
| T3 | 36 | 9 | |
| T4 | 9 | 9 | |
| TX | 0 | 3 | |
| N stage | |||
| N0 | 224 | 100 | 0.9006 |
| N1 | 69 | 25 | |
| N2 | 48 | 21 | |
| N3 | 1 | 1 | |
| NX | 8 | 3 | |
| M stage | |||
| M0 | 242 | 90 | 0.1104 |
| M1 | 14 | 10 | |
| MX | 94 | 50 | |
| Stage | |||
| I | 187 | 81 | 0.1149 |
| II | 92 | 27 | |
| III | 53 | 27 | |
| IV | 14 | 11 | |
| X | 4 | 4 | |
| Gender | |||
| Male | 161 | 69 | 1 |
| Female | 189 | 81 | |
| Chemotherapy | |||
| YES | 118 | 57 | 0.4131 |
| NO | 232 | 93 | |
| Radiation_therapy | |||
| YES | 41 | 17 | 0.9876 |
| NO | 252 | 109 | |
| Unknown | 57 | 24 | |
| Age | |||
| ≤65 | 156 | 81 | 0.1485 |
| >65 | 187 | 66 | |
| Unknown | 7 | 3 |
Clinical information of 20 LUAD patients.
| No. of patients | Samples | Classification | Age | Gender | T stage | N stage | M stage | Tumor stage | Smoking | Family history |
|---|---|---|---|---|---|---|---|---|---|---|
| P2-41-13 | P2-41-13-H3 | Tumor | 57 | Male | 1c | 2 | 1a | IVa | Yes | No |
| P2-41-13 | P2-41-13-H6 | Adjacent normal | 57 | Male | ||||||
| P2-41-13 | P2-41-13-K1 | Tumor | 55 | Male | 1 | 0 | 0 | I | Yes | No |
| P2-41-13 | P2-41-13-K4 | Adjacent normal | 55 | Male | ||||||
| P2-41-14 | P2-41-14-A1 | Tumor | 61 | Female | 1c | 0 | 0 | Ia3 | No | No |
| P2-41-14 | P2-41-14-A4 | Adjacent normal | 61 | Female | ||||||
| P2-41-14 | P2-41-14-J9 | Tumor | 41 | Female | 2b | 2 | 0 | IIIa | No | No |
| P2-41-14 | P2-41-14-K2 | Adjacent normal | 41 | Female | ||||||
| P2-41-31 | P2-41-31-A1 | Tumor | 57 | Male | 1c | 0 | 0 | Ia3 | Yes | No |
| P2-41-31 | P2-41-31-A4 | Adjacent normal | 57 | Male | ||||||
| P2-41-31 | P2-41-31-B10 | Tumor | 69 | Female | 2a | 0 | 0 | IB | No | No |
| P2-41-31 | P2-41-31-C3 | Adjacent normal | 69 | Female | ||||||
| P2-41-31 | P2-41-31-F3 | Tumor | 61 | Female | 1b | 0 | 0 | Ia2 | Yes | No |
| P2-41-31 | P2-41-31-F6 | Adjacent normal | 61 | Female | ||||||
| P2-41-31 | P2-41-31-G2 | Tumor | 55 | Male | 2a | 0 | 0 | IB | Yes | No |
| P2-41-31 | P2-41-31-G5 | Adjacent normal | 55 | Male | ||||||
| P2-41-32 | P2-41-32-A10 | Tumor | 36 | Female | Tis | 0 | 0 | No | No | |
| P2-41-32 | P2-41-32-B2 | Adjacent normal | 36 | Female | ||||||
| P2-41-32 | P2-41-32-H8 | Tumor | 32 | Female | Breast cancer lung metastasis | |||||
| P2-41-32 | P2-41-32-J1 | Adjacent normal | 32 | Female | ||||||
| P2-41-32 | P2-41-32-J2 | Tumor | 64 | Female | 1c | 0 | 0 | Ia3 | No | No |
| P2-41-32 | P2-41-32-J5 | Adjacent normal | 64 | Female | ||||||
| P2-41-32 | P2-41-32-K1 | Tumor | 72 | Female | 1b | 0 | 0 | Ia2 | No | No |
| P2-41-32 | P2-41-32-K4 | Adjacent normal | 72 | Female | ||||||
| P2-41-33 | P2-41-33-B9 | Tumor | 76 | Male | 1b | 2 | 0 | IIIa | Yes | No |
| P2-41-33 | P2-41-33-C2 | Adjacent normal | 76 | Male | ||||||
| P2-41-33 | P2-41-33-C8 | Tumor | 51 | Male | 2a | 1 | 0 | IIB | No | No |
| P2-41-33 | P2-41-33-D1 | Adjacent normal | 51 | Male | ||||||
| P2-41-33 | P2-41-33-D10 | Adjacent normal | 60 | Female | ||||||
| P2-41-33 | P2-41-33-D7 | Tumor | 60 | Female | 1c | 0 | 0 | Ia3 | No | No |
| P2-42-31 | P2-42-31-D8 | Tumor | 55 | Male | 1 | 0 | 0 | I | Yes | No |
| P2-42-31 | P2-42-31-E1 | Adjacent normal | 55 | Male | ||||||
| P2-42-31 | P2-42-31-E10 | Adjacent normal | 63 | Male | ||||||
| P2-42-31 | P2-42-31-E7 | Tumor | 63 | Male | 1b | 0 | 0 | Ia2 | Yes | No |
| P2-42-31 | P2-42-31-H6 | Tumor | 56 | Male | 2a | 0 | 0 | IB | Yes | No |
| P2-42-31 | P2-42-31-H7 | Adjacent normal | 56 | Male | ||||||
| P2-42-31 | P2-42-31-K4 | Tumor | 53 | Female | 1b | 2 | 0 | IIIa | Yes | No |
| P2-42-31 | P2-42-31-K5 | Adjacent normal | 53 | Female | ||||||
| P2-42-32 | P2-42-32-A3 | Tumor | 74 | Female | 2a | 0 | 0 | IB | No | No |
| P2-42-32 | P2-42-32-A6 | Adjacent normal | 74 | Female |
Figure 2Identification and functional analysis of differentially expressed genes. (a) Volcano plot of differentially grouped genes between Tumor and Normal. (b) Venn diagram of CNV and differentially expressed genes. (c) The top 10 most significantly enriched biological processes enriched by differential genes. (d) The top 10 KEGG pathways enriched by differential genes. (e) Differential distribution of enrichment scores of 14 oxidative stress-related pathways in cancer and adjacent samples. (f) Number distribution of oxidative stress-related pathways significantly related to different genes.
Figure 3Prognostic model construction. (a) The trajectory of each independent variable, the log of lambda on the horizontal axis and the coefficient on the vertical axis. (b) The confidence interval under each lambda. (c) Risk score, survival time and survival status and expression of 6-gene signature in the TCGA training set; (d) ROC curve and AUC of 6-gene signature in the TCGA training set; and (e) KM survival curve distribution of 6-gene signature in the TCGA training set.
Figure 4The construction of the nomogram. (a) Clinical features and the nomogram constructed by risk score; (b) correction chart of survival rate of the Nomogram; and (c) DCA curve.
Figure 5Gene expressions of six CNV-related prognostic genes in RNA-seq. Some expression of the prognostic genes showed increased pattern in tumor tissues (CNV group). Compare with samples in Non-CNV group, expression in HMMR, GOLM1, and POSTN in tumor tissues (CNV group) were detected increases. p: p value using single-tailed t- test for paired data.
Figure 6Risk score distribution based on six prognostic genes in clinical validation cohort. (a) Comparison of risk scores between cancer and adjacent tissues. (b) ROC analysis of the six CRDS genes in the cancer and adjacent tissues. (c) Comparison of risk scores between cancer and adjacent tissues in different groups. (d) ROC analysis of the six prognostic genes in CNV group and Non-CNV group. p: p value using single-tailed t-test for paired data.