| Literature DB >> 35205391 |
Shaohua Xu1, Jiahui Xie1, Yanjie Zhou1, Hui Liu1, Yirong Wang2, Zhaoyong Li1,3.
Abstract
Long non-coding RNAs (lncRNAs) have been well known for their multiple functions in the tumorigenesis, development, and prognosis of breast cancer (BC). Mechanistically, their production, function, or stability can be regulated by RNA binding proteins (RBPs), which were also involved in the carcinogenesis and progression of BC. However, the roles and clinical implications of RBP-related lncRNAs in BC remain largely unknown. Therefore, we herein aim to construct a prognostic signature with RBP-relevant lncRNAs for the prognostic evaluation of BC patients. Firstly, based on the RNA sequencing data of female BC patients from The Cancer Genome Atlas (TCGA) database, we screened out 377 differentially expressed lncRNAs related to RBPs. The univariate, least absolute shrinkage and selection operator (LASSO), and multivariate Cox regression analyses were then performed to establish a prognostic signature composed of 12-RBP-related lncRNAs. Furthermore, we divided the BC patients into high- and low-risk groups by the prognostic signature and found the overall survival (OS) of patients in the high-risk group was significantly shorter than that of the low-risk group. Moreover, the 12-lncRNA signature exhibited independence in evaluating the prognosis of BC patients. Additionally, a functional enrichment analysis revealed that the prognostic signature was associated with some cancer-relevant pathways, including cell cycle and immunity. In summary, our 12-lncRNA signature may provide a theoretical reference for the prognostic evaluation or clinical treatment of BC patients.Entities:
Keywords: RNA binding protein; breast cancer; long non-coding RNA; prognostic signature
Mesh:
Substances:
Year: 2022 PMID: 35205391 PMCID: PMC8872055 DOI: 10.3390/genes13020345
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Identification of differential expressed lncRNAs related to RBPs and establishment of a 12-lncRNA prognostic signature. (A) The flowchart for identification of RBP-related lncRNAs. (B) Forest plot of univariate Cox regression analysis for the 22 RBP-related lncRNAs correlated with the OS of BC patients in the training dataset. (C) The LASSO coefficient profiles of the 22 prognosis-associated lncRNAs. The upper abscissa represents the number of lncRNAs with non-zero coefficients under the corresponding lambda. (D) The cross-validation graph shows the optimal parameter selection with minimum criteria in the LASSO model. The first black dashed line shows the best parameter (lambda). The upper abscissa represents the number of lncRNAs with non-zero coefficients under the corresponding lambda. (E) Forest plot of the multivariate Cox regression analysis for the 12 RBP-related lncRNAs. (F) The coefficients of the 12 lncRNAs from multivariate Cox regression analysis.
The coefficients of 12 RBP-related lncRNAs based on the multivariable Cox regression analysis in the training dataset.
| Gene Symbol | Ensembl ID | Genomic Location | Coefficient |
|---|---|---|---|
| LINC02408 | ENSG00000203585 | Chr12:67,443,105–67,590,771 | 0.007062056 |
| AL121790.2 | ENSG00000259087 | Chr14:37,556,158–37,567,095 | 0.00683141 |
| AL589765.4 | ENSG00000249602 | Chr1:151,763,384–151,769,501 | 0.003727625 |
| LINC00460 | ENSG00000233532 | Chr13:106,374,477–106,384,315 | 0.001867941 |
| YTHDF3-AS1 | ENSG00000270673 | Chr8:63,167,725–63,168,442 | 0.00146521 |
| CYTOR | ENSG00000222041 | Chr2:87,454,781–87,636,740 | 0.000252682 |
| LINC01016 | ENSG00000249346 | Chr6:33,867,506–33,896,914 | −0.000429308 |
| CD2BP2-DT | ENSG00000260219 | Chr16:30,354,665–30,357,116 | −0.001513116 |
| LINC00987 | ENSG00000237248 | Chr12:9,240,003–9,257,960 | −0.003223765 |
| U73166.1 | ENSG00000230454 | Chr3:50,260,303–50,263,358 | −0.004631155 |
| USP30-AS1 | ENSG00000256262 | Chr12:109,052,349–109,053,984 | −0.015207547 |
| AC068473.4 | ENSG00000267409 | Chr18:79,610,747–79,612,303 | −0.048280801 |
Note: the reference genome version used for the genomic location was GRCh38. Chr: chromosome.
Figure 2Evaluation and verification for the prognostic value of the RBP-related lncRNA signature. (A) Risk scores for BC patients of the high- and low-risk groups in the training dataset. (B) The scatterplot of overall survival time and status of BC patients in the high- and low-risk groups from the training dataset. (C) The OS curve for BC patients in the high- and low-risk groups of the training dataset. (D) The time-dependent ROC curves at 1-, 3-, and 5-year OS of the 12-lncRNA prognostic signature in the training dataset. (E) Risk scores for BC patients of the high- and low-risk groups in the validation dataset. (F) The scatterplot of overall survival time and status of BC patients in the high- and low-risk groups from the validation dataset. (G) The OS curve for BC patients in the high- and low-risk groups of the validation dataset. (H) The time-dependent ROC curves at 1-, 3-, and 5-year OS of the 12-lncRNA prognostic signature in the validation dataset.
Figure 3Kaplan–Meier survival analyses for BC patients of the high- and low-risk groups in the different clinical subgroups. The OS curve for BC patients of the high- and low-risk groups in the different subgroups stratified by clinical features based on the entire cohort, including age < 60 years old (A), age ≥ 60 years old (B), stage I–II (C), stage III–IV (D), T1–2 (E), T3–4 (F), N0 (G), N1–3 (H), and M0 (I).
Figure 4Assessment of the lncRNA signature as an independent prognostic factor for OS. Forest plots of the univariate (left) and multivariate (right) Cox regression analyses of risk score and several clinical features based on the OS in the training dataset (A,B), validation dataset (C,D), and entire cohort (E,F). The ROC curves at 5-year OS of the risk score and clinical features in the training dataset (G), validation dataset (H), and entire cohort (I).
Figure 5The co-expression network and GO enrichment analysis. (A) A co-expression network comprised of 12 lncRNAs in the prognostic signature and 33 RBP genes correlated to them. (B) The top 5 GO biological processes with the significant enrichment of the 33 RBP genes. (C) The top 20 GO biological processes with the significant enrichment of the differentially expressed genes between the high- and low-risk groups in the entire cohort.
Figure 6The correlation between the risk score and immune checkpoint genes or TMB. (A) The box plot shows the expression levels of some representative immune checkpoint genes between the high- and low-risk groups in the entire cohort. (B) The OS curve for BC patients in the high- and low-TMB groups based on the mutation data. (C) The correlation analysis between TMB and risk score. (D) The violin plot shows the difference in TMB between the high- and low-risk groups. *** P.adjust < 0.001.