| Literature DB >> 34094983 |
Huiying Yang1, Xiaoling Xiong1, Hua Li1.
Abstract
BACKGROUND: Clear cell renal cell carcinoma (ccRCC) is a kind of frequently diagnosed cancer, leading to high death rate in patients. Genomic instability (GI) is regarded as playing indispensable roles in tumorigenesis and impacting the prognosis of patients. The aberrant regulation of long non-coding RNAs (lncRNAs) is a main cause of GI. We combined the somatic mutation profiles and expression profiles to identify GI derived lncRNAs (GID-lncRNAs) in ccRCC and developed a GID-lncRNAs based risk signature for prognosis prediction and medication guidance.Entities:
Keywords: clear cell renal cell carcinoma (ccRCC); genomic instability (GI); long non-coding RNA (IncRNA); prognosis predicting; risk signature; somatic mutation profile; therapeutic scheme deciding
Year: 2021 PMID: 34094983 PMCID: PMC8176022 DOI: 10.3389/fonc.2021.678253
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Flow chart of the design and overall procedures of our research.
Figure 2(A) A heatmap of all GID-lncRNAs between GS-group and GU-group. Each cell represents the expression level of a lncRNA (left) in a sample (above). Red means high expression and blue means low expression. The expression values were log2 transferred before mapping. (B) OS-related GID-lncRNAs recognized by time-dependent univariate Cox regression.
Clinical information of patients in training and testing dataset and chi-square test between two groups.
| Covariates | Total | Training dataset | Testing dataset | p-value | |
|---|---|---|---|---|---|
| Survival status, no (%) | Alive | 340(69.25) | 258(68.44) | 82(71.93) | 0.55 |
| Dead | 151(30.75) | 119(31.56) | 32(28.07) | ||
| Age, no (%) | <=65 | 328(66.8) | 251(66.58) | 77(67.54) | 0.94 |
| >65 | 163(33.2) | 126(33.42) | 37(32.46) | ||
| Gender, no(%) | Female | 168(34.22) | 133(35.28) | 35(30.7) | 0.43 |
| Male | 323(65.78) | 244(64.72) | 79(69.3) | ||
| Grade, no(%) | G1 | 10(2.04) | 9(2.39) | 1(0.88) | 0.73 |
| G2 | 212(43.18) | 159(42.18) | 53(46.49) | ||
| G3 | 196(39.92) | 150(39.79) | 46(40.35) | ||
| G4 | 67(13.65) | 54(14.32) | 13(11.4) | ||
| Gx | 6(1.22) | 5(1.33) | 1(0.88) | ||
| Stage, no(%) | Stage I | 246(50.1) | 188(49.87) | 58(50.88) | 0.45 |
| Stage II | 53(10.79) | 39(10.34) | 14(12.28) | ||
| Stage III | 111(22.61) | 82(21.75) | 29(25.44) | ||
| Stage IV | 78(15.89) | 66(17.51) | 12(10.53) | ||
| Stage x | 3(0.61) | 2(0.53) | 1(0.88) | ||
| T group, no(%) | T1 | 252(51.32) | 193(51.19) | 59(51.75) | 0.65 |
| T2 | 65(13.24) | 47(12.47) | 18(15.79) | ||
| T3 | 165(33.6) | 129(34.22) | 36(31.58) | ||
| T4 | 9(1.83) | 8(2.12) | 1(0.88) | ||
| Metastasis, no(%) | M0 | 390(79.43) | 296(78.51) | 94(82.46) | 0.08 |
| M1 | 75(15.27) | 64(16.98) | 11(9.65) | ||
| Mx | 26(5.3) | 17(4.51) | 9(7.89) |
Information of the seven GID-lncRNAs composing the risk signature.
| Gene Symbol | Ensembl ID | Genomic location | Coefficient in the risk signature |
|---|---|---|---|
| LINC00460 | ENSG00000233532 | chr13:106,374,477-106,384,411 | 0.018688449 |
| AL139351.1 | ENSG00000276923 | chr20:48,024,788-48,074,227 | 0.163927492 |
| AC156455.1 | ENSG00000256546 | chr12:122,063,306-122,068,616 | 0.145677628 |
| AL035446.1 | ENSG00000234147 | chr6:140,807,603-140,898,430 | 0.081318043 |
| LINC02471 | ENSG00000223914 | chr12:40,155,757-40,211,419 | -0.01015238 |
| AC022509.2 | ENSG00000256234 | chr12:26,211,164-26,335,856 | 0.004539367 |
| LINC01606 | ENSG00000253301 | chr8:57,142,659-57,244,924 | 0.019535979 |
Figure 3Landscape of mutation profiles of ccRCC patients. (A) Classification of mutations by their effects. (B) Classification of mutations by different patterns. (C) Top 10 frequent mutations in ccRCC patients. (D) Mutation profiles of high-risk and low-risk groups divided by our risk signature.
Figure 4Validation of the GID-lncRNAs based risk signature. (A) Survival analysis in training dataset. (B) Survival analysis in testing dataset. (C) ROC analysis for evaluating the predictive efficiency of the risk signature in the training dataset. (D) ROC analysis for evaluating the predictive efficiency of the risk signature in the testing dataset. (E) Results of Univariate Cox regression. (F) Results of Multivariate Cox regression.
Clinical features of patients in high-risk and low-risk groups and chi-square test between two groups.
| Covariates | Total | High-risk | Low-risk | p-value | |
|---|---|---|---|---|---|
| Survival status, no (%) | Alive | 310(67.83) | 126(55.51) | 184(80.00) | 3.70E-8 |
| Dead | 147(32.17) | 101(44.49) | 46(20.00) | ||
| Age, no (%) | <=65 | 303(66.3) | 145(63.60) | 158(69.00) | 0.23 |
| >65 | 154(33.7) | 83(36.40) | 71(31.00) | ||
| Gender, no(%) | Female | 153 (33.5) | 48(21.15) | 105(50.7) | 5.00E-8 |
| Male | 304(66.5) | 179(78.85) | 125(49.3) | ||
| Grade, no(%) | G1 | 7(1.53) | 1(0.44) | 6(2.61) | 4.27E-8 |
| G2 | 198(43.33) | 75(33.04) | 123(53.48) | ||
| G3 | 187(40.92) | 100(44.05) | 87(37.83) | ||
| G4 | 65(14.22) | 51(22.47) | 14(6.09) | ||
| Stage, no(%) | Stage I | 225(49.23) | 80(35.23) | 145(63.04) | 2.41E-10 |
| Stage II | 47(10.28) | 19(8.37) | 28(12.17) | ||
| Stage III | 109(23.85) | 75(33.04) | 34(14.78) | ||
| Stage IV | 76(16.63) | 53(23.35) | 23(10.00) | ||
| T group, no(%) | T1 | 231(50.55) | 84(37.00) | 147(63.91) | 1.00E-8 |
| T2 | 58(12.69) | 28(12.33) | 30(13.04) | ||
| T3 | 159(34.79) | 109(48.02) | 50(21.74) | ||
| T4 | 9(1.97) | 6(2.64) | 3(1.30) | ||
| Metastasis, no(%) | M0 | 383(83.81) | 175(77.09) | 208(90.43) | 1.81E-4 |
| M1 | 74(16.19) | 52(22.91) | 22(9.57) |
Figure 5Results of KEGG enrichment analysis on genes co-expressed with GID-lncRNAs in the risk signature. The size of the dots means the count of genes enriched in the term and the color is corresponding to the statistical significance. Gene ratio means the ratio of genes enriched in the term and all genes involved in the analysis. (A) KEGG enrichment analysis on genes co-expressed with LINC00460. (B) KEGG enrichment analysis on genes co-expressed with AL139351.1. (C) KEGG enrichment analysis on genes co-expressed with AC156455.1. (D) KEGG enrichment analysis on genes co-expressed with AL035446.1. (E) KEGG enrichment analysis on genes co-expressed with LINC02471. (F) KEGG enrichment analysis on genes co-expressed with AC022509.2. (G) KEGG enrichment analysis on genes co-expressed with LINC01606.
Figure 6Effectiveness prediction of precise treatments with our risk signature. (A) Differentially expressed targets for precise treatments. (B) TMB analysis of ccRCC patients and significant differences between high-risk and low-risk group.