| Literature DB >> 35372012 |
Longjun Yang1,2, Guangran Guo1,2, Xiangyang Yu3, Yingsheng Wen1,2, Yongbin Lin1,2, Rusi Zhang1,2, Dechang Zhao1,2, Zirui Huang1,2, Gongming Wang1,2, Yan Yan1,4, Xuewen Zhang1,5, Dongtai Chen1,5, Wei Xing1,5, Weidong Wang6, Weian Zeng1,5, Lanjun Zhang1,2.
Abstract
Background: Genomic instability is one of the representative features of cancer evolution. Recent research has revealed that long noncoding RNAs (lncRNAs) play a critical role in maintaining genomic instability. Our work proposed a gene signature (GILncSig) based on genomic instability-derived lncRNAs to probe the possibility of lncRNA signatures as an index of genomic instability, providing a potential new approach to identify genomic instability-related cancer biomarkers.Entities:
Keywords: GILncSig; TCGA; TP53; genome instability; lung adenocarcinoma (LUAD)
Year: 2022 PMID: 35372012 PMCID: PMC8965709 DOI: 10.3389/fonc.2022.780631
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Clinical information for three LUAD patients sets in this study.
| Covariates | Type | Train | Test | Total | Pvalue |
|---|---|---|---|---|---|
| age | <=65 | 108 (48.87%) | 105 (48.17%) | 213 (48.52%) | 0.9585 |
| age | >65 | 113 (51.13%) | 113 (51.83%) | 226 (51.48%) | |
| gender | FEMALE | 114 (51.58%) | 127 (58.26%) | 241 (54.9%) | 0.1905 |
| gender | MALE | 107 (48.42%) | 91 (41.74%) | 198 (45.1%) | |
| stage | Stage I-II | 177 (80.09%) | 169 (77.52%) | 346 (78.82%) | 0.5882 |
| stage | Stage III-IV | 44 (19.91%) | 49 (22.48%) | 93 (21.18%) | |
| T | T1-2 | 196 (88.69%) | 189 (86.7%) | 385 (87.7%) | 0.6244 |
| T | T3-4 | 25 (11.31%) | 29 (13.3%) | 54 (12.3%) | |
| M | M0 | 212 (95.93%) | 207 (94.95%) | 419 (95.44%) | 0.7947 |
| M | M1 | 9 (4.07%) | 11 (5.05%) | 20 (4.56%) | |
| N | N0 | 142 (64.25%) | 149 (68.35%) | 291 (66.29%) | 0.4199 |
| N | N1-3 | 79 (35.75%) | 69 (31.65%) | 148 (33.71%) |
Figure 1The recognition of genomic instability-associated lncRNAs in patients with LUAD. (A) On the basis of the expression patterns of 123 candidate lncRNAs associated with genomic instability, unsupervised clustering was performed in 497 LUAD patients. The blue cluster on the left represents the GS-like group, the red cluster on the right represents the GU-like group. (B) Boxplots of somatic mutations in the GS-like group and GU-like group. The accumulative mutation count of somatic cells in the GS-like group is dramatically lower than that in the GU-like group. (C) Boxplots of MYC expression level in the GS-like group and GU-like group. The expression level of MYC in the GS-like group is dramatically lower than that in the GS-like group.
Five prognosis-related genome instability-associated lncRNAs by multivariate Cox regression analysis.
| LncRNA | coef | HR | HR.95L | HR.95H | p-value |
|---|---|---|---|---|---|
| PTCSC3 | -0.08497 | 0.918544 | 0.828557 | 1.018303 | 0.106273 |
| LINC02555 | -0.2622 | 0.76936 | 0.559437 | 1.058056 | 0.106782 |
| LINC01671 | 0.040781 | 1.041624 | 1.014932 | 1.069017 | 0.002077 |
| LINC01116 | 0.098516 | 1.103532 | 0.996312 | 1.222292 | 0.058877 |
| LINC01214 | 0.06491 | 1.067063 | 1.006868 | 1.130858 | 0.028452 |
Correlations between GILncSig and clinical characteristics of LUAD patients in TCGA.
| Characteristics | No. of patients | GILncSig | ||
|---|---|---|---|---|
| Low (n = 204) | High (n = 230) | |||
| ≤65 | 211 (48.6%) | 110 (47.8%) | 101 (49.5%) | 0.726 |
| >65 | 223 (51.4%) | 120 (52.2%) | 103 (50.5%) | |
| Female | 238 (54.8%) | 127 (55.2%) | 111 (54.4%) | 0.866 |
| male | 196 (45.2%) | 103 (44.8%) | 93 (45.6%) | |
| T1-2 | 381 (87.8%) | 199 (86.5%) | 182 (89.2%) | 0.392 |
| T3-4 | 53 (12.2%) | 31 (13.5%) | 22 (10.8%) | |
| N0 | 287 (66.1%) | 138 (60%) | 149 (73%) | |
| N1-3 | 147 (33.9%) | 92 (40%) | 55 (27%) | |
| M0 | 414 (95.4%) | 216 (93.9%) | 198 (97.1%) | 0.119 |
| M1 | 20 (4.6%) | 14 (6.1%) | 6 (2.9%) | |
| I-II | 342 (78.8%) | 168 (73%) | 174 (85.3%) | |
| III-IV | 92 (21.2%) | 62 (27%) | 30 (14.7%) | |
| High | 93 (21.4%) | 62 (27%) | 31 (15.2%) | |
| Low | 341 (78.6%) | 168 (73%) | 173 (84.8%) | |
| Alive | 286 (65.9%) | 137 (59.6%) | 149 (73%) | |
| Dead | 148 (34.1%) | 93 (40.4%) | 55 (27%) | |
The bold number means statistical significance.
Figure 2The genomic instability-derived lncRNA signature (GILncSig) related to overall survival (OS) of LUAD in training set. (A) Log-rank test was used to draw a Kaplan Meier survival curve to predict the overall survival of GILncSig in low-risk or high-risk patients in training set. (B) Receiver Operating Characteristic (ROC) analysis of GILncSig in training set. (C) Heat map of 5 genomic instability-derived lncRNA expression patterns in training set. (D) The survival status of LUAD patients. The dotted line represents the median of the risk score; the number of deaths in the right part of the patient is greater than the left part, indicating that as the risk score increases, the risk of death increases. (E) Risk score distribution of LUAD patients.
Figure 3The GILncSig associated with OS of LUAD in testing set. (A) Log-rank test was performed to draw a Kaplan Meier survival curve to predict the overall survival of GILncSig in low-risk or high-risk patients in testing set. (B) ROC analysis of the GILncSig in testing set. Heat map of 5 genomic instability-derived lncRNA expression patterns in testing set. The survival status of LUAD patients. The dotted line represents the median of the risk score; the number of deaths in the right part of the patient is greater than the left part, indicating that as the risk score increases, the risk of death increases. (E) Risk score distribution of LUAD patients.
Figure 4The GILncSig associated with OS of LUAD and property assessment of the GILncSig in the TCGA set. (A) Log-rank test was performed to draw a Kaplan Meier survival curve to predict the overall survival of GILncSig in low- or high-risk patients in the TCGA set. (B) ROC analysis of GILncSig in the TCGA set. (C) Heat map of 5 genomic instability-derived lncRNA expression patterns in TCGA set. (D) The somatic mutation count distribution in the TCGA set. (E) Comparison analysis of somatic mutation count for patients in low- and high-risk groups in the TCGA set. (F) The distribution of MYC expression in the TCGA set. (G) Comparison analysis of MYC expression in low- and high-risk groups in the TCGA set.
Figure 5Functional annotation of lncRNA related to genomic instability and construction of lncRNA-mRNA co-expression network. (A) Co-expression network of genomic instability-associated lncRNAs and mRNAs. LncRNAs were represented by the blue circles, and mRNAs were represented by the red circles. (B) Gene Ontology (GO) enrichment analysis for mRNAs co-expressed lncRNAs. (C) Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis for mRNAs co-expressed lncRNAs.
Univariate and Multivariate Cox regression analysis of the GILncSig and overall survival in different patient sets.
| Variables | Univariable model | Multivariable model | ||||
|---|---|---|---|---|---|---|
| HR | 95% CI | p-value | HR | 95% CI | p-value | |
| Training set | ||||||
| age | 1.01639 | 0.99105-1.04238 | 0.206967 | |||
| gender | 0.971905 | 0.61358-1.53948 | 0.903347 | |||
| T | 1.282301 | 0.94688-1.73655 | 0.10802 | |||
| N | 1.512145 | 1.16412-1.96421 | 0.001944 | 1.448928 | 1.10816-1.89449 | 0.006714 |
| M | 1.726692 | 0.62629-4.76052 | 0.291146 | |||
| GILncSig | 1.182901 | 1.11023-1.26033 | 2.08E-07 | 1.168063 | 1.09414-1.24699 | 3.21E-06 |
| Testing set | ||||||
| age | 1.00906 | 0.98679-1.03183 | 0.428217 | |||
| gender | 1.288724 | 0.82256-2.01907 | 0.268176 | |||
| T | 1.883562 | 1.42049-2.49759 | 1.09E-05 | 1.518812 | 1.11782-2.06364 | 0.007537 |
| N | 2.107405 | 1.62118-2.73946 | 2.55E-08 | 1.862951 | 1.41147-2.45884 | 1.11E-05 |
| M | 1.92918 | 0.92450-4.02569 | 0.079982 | |||
| GILncSig | 1.097863 | 1.04963-1.14831 | 4.64E-05 | 1.060405 | 1.01205-1.11107 | 0.013783 |
| TCGA set | ||||||
| age | 1.012345 | 0.99562-1.02935 | 0.148942 | |||
| gender | 1.119877 | 0.81228-1.54396 | 0.48956 | |||
| T | 1.533885 | 1.25392-1.87636 | 3.17E-05 | 1.204072 | 0.97288-1.49021 | 0.087785 |
| N | 1.7257 | 1.44021-2.06778 | 3.35E-09 | 1.578085 | 1.29653-1.92078 | 5.36E-06 |
| M | 1.848907 | 1.02319-3.34098 | 0.04176 | 1.577883 | 0.86367-2.88272 | 0.137995 |
| GILncSig | 1.114489 | 1.07825-1.15195 | 1.30E-10 | 1.09146 | 1.05451-1.12971 | 6.35E-07 |
Figure 6Explore the performance of GILncSig in the overall survival of various clinical factors in LUAD patients by using the Kaplan-Meier survival curve. Groups classified by age (A, B); TMB (C, D); tissue involvement (E, F); lymphatic involvement (G, H); metastasis (I, J) and stages (K, L).
Figure 7Identification of the relationship between TP53 somatic mutation and GILncSig. (A) Comparison of the mutation ratio of TP53 in the high-risk and low-risk groups in the training set. (B) Comparison of the mutation ratio of TP53 in the high-risk and low-risk groups in the testing set. (C) Comparison of the mutation ratio of TP53 in the high-risk and low-risk groups in the TCGA set. (D) Survival analysis of LUAD patients was categorized on the basis of the GILncSig and the TP53 mutation status.
Figure 8Nomogram and calibration charts of LUAD patients based on the GILncSig. (A) Nomogram for predicting LUAD prognosis. (B) Calibration plots for the 1- year nomogram. (C) Calibration plots for the 3- year nomogram. (D) Calibration plots for the 5- year nomogram.
Figure 9Performance evaluation of LINC01214 in GSE50081 dataset. (A) Boxplots for LINC01214 expression among patients with different T stage (tumor size) in GSE50081 dataset. (B) Boxplots for LINC01214 expression among patients with different N stage (lymph node metastasis) in GSE50081 dataset. (C) Boxplots for LINC01214 expression among patients with different gender in GSE50081 dataset. (D) Kaplan Meier curve analysis was used to verify the overall survival rate of LINC01214 expression level in GEO database in different groups based on GILncSig.
Figure 10Representative lncRNAs expression in tumor and normal tissues. (A) PTCSC3; (B) LINC02555; (C), LINC01671; (D) LINC01116; (E) LINC01214. *P < 0.05; **P < 0.01; ns, P < 0.05.