Literature DB >> 36193491

Development and Validation of a Liquid-Liquid Phase Separation-Related Gene Signature as Prognostic Biomarker for Low-Grade Gliomas.

Lidong Ning1, Guanyan Zhao1, Changji Xie1, Huan Lan1, Jiefei Chen1, Hu Tan1, ChengCong Wei1, Zhiyu Zhou1.   

Abstract

Aim: To explore whether the liquid-liquid phase separation- (LLPS-) related genes were potential prognostic markers that could contribute to the further classification of low-grade gliomas (LGGs).
Methods: The LLPS-related genes were subjected to functional enrichment analysis. The univariable, least absolute shrinkage and selection operator, and multivariable stepwise Cox regression analyses were performed to develop an LLPS-related gene signature (GS) in the discovery data set. The biological characteristics of the high-risk LGG were explored using gene set enrichment analysis. Two independent external data sets were used to validate the LLPS-related GS.
Results: LLPS-related genes are involved in multiple important cancer-related biological processes and pathways in LGG. Nine LLPS-related genes were identified to construct the LLPS-related GS, which was significantly associated with the prognosis of LGG patients. The LLPS-related GS could successfully divide patients with LGG into high- and low-risk groups, and the high-risk group showed a poorer prognosis than the low-risk group. Furthermore, the LLPS-related GS was independent of IDH and 1p19q status. Several cancer-related pathways may be more active in high-risk LGGs, such as IL6 JAK STAT3 signaling pathway. The LLPS-related GS was successfully validated with two independent external data sets.
Conclusion: We developed and validated a novel LLPS-related GS for risk stratification of LGG. Our findings may provide more precise management for LGGs and a useful reference for LLPS mechanism to link LGG studies.
Copyright © 2022 Lidong Ning et al.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 36193491      PMCID: PMC9525737          DOI: 10.1155/2022/1487165

Source DB:  PubMed          Journal:  Dis Markers        ISSN: 0278-0240            Impact factor:   3.464


1. Introduction

Low-grade gliomas (LGGs) account for 10–20% of all primary brain tumors [1]. Following the identification of key molecular alterations (e.g., IDH mutation and 1p/19q codeletion) [2, 3] of LGG, the World Health Organization (WHO) updated the central nervous system (CNS) tumor classification in 2016 [4], and the understanding of LGG behavior has rapidly evolved. Although the conventional WHO grade of I–IV still remains, the term of LGG is now often used to refer to both grade II and III gliomas. Nonetheless, LGG remains a highly heterogeneous disease [5]. Hence, there is much interest in finding other molecular signatures for further classification for identifying the ideal management of LGG. In the conventional WHO grading system, the morphology of the nucleoli of tumor cells is one of the critical bases. Nucleoli, like other membraneless condensates inside eukaryotic cells, are mainly formed through a liquid-liquid phase separation (LLPS) mechanism [6]. The LLPS represents the dynamic concentration of biomolecules from a homogeneous environment into a relatively dense phase to form a sparse phase and a dense phase [7]. The proteins involved in LLPS aggregates usually have intrinsically disordered regions (IDRs). These IDRs may mediate weak affinity and nonspecific interactions of multiple targets to trigger LLPS [8-10]. Various critical biological processes, including chromatin organization, transcription, autophagy, DNA damage response, and tumorigenesis, have been reported to use LLPS to generate the corresponding membraneless condensates and play their specific functions [11-13]. It was shown that the aberration of liquid-liquid phase separation involves in many human diseases, including neurodegenerative diseases [14] and cancer [15]. Previous studies suggested that LLPS-related gene signature can be used as a prognostic marker for hepatocellular carcinoma [16] and ovarian cancer [17]. Thus, we hypothesized that LLPS-related genes may be potential prognostic markers that could contribute to the further classification of LGG. In the present study, to confirm our hypothesis, we developed an LLPS-related gene signature independent of known features to identify high-risk LGG.

2. Materials and Methods

2.1. Data Processing

The gene expression profiles and clinical information of LGG were downloaded from the Chinese Glioma Genome Atlas (http://www.cgga.org.cn/) [18]. This study included three data sets, namely, mRNAseq_693, mRNAseq_325, and mRNA-array_301. The inclusion criteria were as follows: (1) the sample included both gene expression profiles and prognosis information, and (2) the sample was primary LGG (WHO grade II and III glioma). The processed gene expression profiles based on RNA-sequencing technology were normalized used log2 (expression value + 0.01). The LLPS-related genes were obtained from LLPSDB v2.0 [19], and the only the unambiguous genes were included in our present study. As all data of the present are publicly available, ethical approval from the ethics committee of National Hospital of Guangxi Zhuang Autonomous Region was not necessary for the present study. The workflow of the present study is shown Figure 1.
Figure 1

The workflow of the present study.

2.2. Functional Enrichment Analysis

The 131 genes both in the unambiguous system of LLPSDB v2.0 and mRNAseq_693 data set were identified and performed to functional enrichment analysis. This would help us understand the potential biological functions of LLPS-related genes. The functional enrichment analysis was performed using the clusterProfiler [20] package.

2.3. Development of an LLPS-Related Gene Signature for Risk Stratification

Firstly, the univariable Cox regression analysis was performed for screening the prognosis-related genes. Secondly, least absolute shrinkage and selection operator (LASSO) Cox regression was performed for variable selection and shrinkage using glmnet [21] package. The LASSO Cox regression can select the features with a strong predictive value and low correlation between each other to prevent overfitting. In the LASSO analysis, the relevant parameters were set to “family = ‘cox′,” “maxit = 1000,” and “nfolds = 10.” Thirdly, the multivariable stepwise Cox regression analysis was applied to develop the LLPS-related gene signature (GS). The LLPS-related GS score for each individual was calculated using the following formula: score = expresiongene1∗βgene1 + expresiongene2∗βgene2 + expresiongene3∗βgene3 + ⋯+expresion∗β. The prognostic value of the LLPS-related GS score was evaluated using the univariable Cox regression and time-dependent ROC (tROC) curve analysis. The patients with LGG were divided into high-risk and low-risk groups according to the best cutoff, which was identified using the survminer package (https://CRAN.R-project.org/package=survminer). The prognostic value of the novel risk stratification system and other known prognostic features were included in multivariable Cox regression analysis to confirm whether it is an independent prognostic factor.

2.4. Gene Set Enrichment Analysis (GSEA)

The GSEA [22, 23] was performed to preliminarily reveal the biological mechanism underlying high-risk LGG. Hallmark gene sets [24] and Kyoto Encyclopedia of Genes and Genomes (KEGG) [25] canonical pathway gene sets were used as reference gene sets. Gene sets with false discovery rate < 0.2 were considered significant enrichment. The GSEA was performed using the GSEA Java software.

2.5. Validation of the LLPS-Related GS

The patients of LGG from data sets of mRNAseq_325 and mRNA-array_301 were used as the test sets to validate the LLPS-related GS. Each patient was assigned an LLPS-related GS score according to the above formula. Then, the patients were divided into low- or high-risk groups based on to the best cutoff.

2.6. Statistical Analysis

All these analyses were performed using R (version 4.0.2) software (https://www.r-project.org/). The overall survival (OS) between the low- and high-risk groups were compared using the Kaplan-Meier survival curve with log-rank method. We considered P values < 0.05 to be statistically significant, unless otherwise stated.

3. Results

3.1. LLPS-Related Genes Are Involved in Multiple Important Biological Processes and Pathways

The results of functional enrichment analysis indicated that the 131 LLPS-related genes are involved in multiple important biological processes and pathways. In the cellular component perspective (Figure 2(a)), the LLPS-related genes were significantly involved in cytoplasmic ribonucleoprotein granule, ribonucleoprotein granule, and cytoplasmic stress granule. In the biological process perspective (Figure 2(b)), the LLPS-related genes were significantly involved in RNA metabolic- and cell cycle-related processes. Molecular functions of the LLPS-related genes include but are not limited to transcription coregulator activity, modification-dependent protein binding, and ubiquitin-like protein ligase binding (Figure 2(c)). The LLPS-related genes involve in multiple cancer-related pathways (Figure 2(d)), such as FGFR2 alternative splicing and mRNA splicing.
Figure 2

The results of the enrichment analysis of the liquid-liquid phase separation-related genes. (a) Cellular component; (b) biological process; (c) molecular function; (d) reactome pathway.

3.2. LLPS-Related GS as a Novel and Independent Risk Stratification System for LGG

A total of 271 patients with LGG from mRNAseq_693 data set were included in the development of the LLPS-related GS according to our inclusion criteria. In the univariable Cox regression analysis, thirty-seven LLPS-related genes were considered prognosis-related genes (Table 1). Thirteen LLPS-related genes were identified as nonzero features in the LASSO Cox regression (Table 1 and Figure 3(a)). Finally, nine LLPS-related genes (ABL1, AR, CDK1, DAXX, ELN, KMT2D, POU5F1, SH3KBP1, and SYN1) were identified and used to construct the LLPS-related GS through the multivariable stepwise Cox regression analysis (Table 1). The LLPS-related GS was significantly associated with prognosis (hazard ratio (HR) = 2.718, 95% confidence interval (CI) for HR = 2.185 − 3.382, P < 0.001). The tROC curve analysis indicated that the LLPS-related GS may possess a high prognostic value with an area under curve (AUC) of 0.756, 0.793, and 0.775 for 1, 3, and 5 years, respectively (Figure 3(b)). The high-risk patients had significantly shorter OS than the low-risk patients (Figure 3(c)). Furthermore, compared to some other known prognostic factors, the novel risk stratification based on the LLPS-related GS remained a significant prognostic factor (Figure 3(d)). Moreover, the LLPS-related GS could identify high-risk IDH-mutant LGGs (Figure 4). This suggested that our LLPS-related GS could further risk stratifying IDH-mutant LGGs.
Table 1

The prognosis-related phase separation-related genes.

PredictorUnivariable Cox regression analysisLASSO Cox coefficientMultivariable stepwise Cox regression analysis
β HR95% CI for HR P value β HR95% CI for HR P value
ABL10.2121.2371.027-1.490.025-0.001241-0.3760.6860.457-1.0310.07
AKAP80.4511.5711.16-2.1260.0030
AR0.3221.381.213-1.5700.117610.1711.1871.006-1.3990.042
CCNT10.2191.2451.038-1.4940.0180
CDK10.241.2711.133-1.42700.1484980.3191.3761.179-1.6070
CPLX1-0.2270.7970.729-0.8710-0.035502
DAXX0.3231.3811.02-1.8710.037-0.201314-0.5670.5670.307-1.050.071
DNAJB10.2321.2611.009-1.5740.0410
ELN0.2271.2541.081-1.4550.0030.0776780.1541.1670.963-1.4140.115
EN20.1561.1691.048-1.3030.0050.044444
G3BP10.2361.2661.068-1.50.0060
GOLGA20.4221.5251.174-1.9810.0020
HNRNPA2B0.2590.2591.043-1.610.0190
HNRNPAB0.351.4191.117-1.8040.0040
HNRNPF0.3470.3471.152-1.7390.0010
HNRNPH10.3171.3731.146-1.6460.0010
HTR1A-0.1220.8850.828-0.9460-0.042494
KMT2D0.2931.3411.122-1.6020.0010.1209850.3441.411.082-1.8370.011
KPNA20.2051.2281.02-1.4780.030
KPNB10.1961.2171.023-1.4470.0270
MLLT10.1631.1771.029-1.3480.0180
NUP980.2831.3271.049-1.680.0180
PMEL0.2141.2391.072-1.4310.0040
POU5F10.1341.1431.015-1.2880.0280.0675970.1171.1240.976-1.2940.105
PRNP-0.1550.8560.75-0.9780.0220
PTBP10.5941.8111.344-2.4400
RAD23B0.2341.2641.012-1.5780.0390
SH3KBP10.4161.5161.18-1.9460.0010.0982870.3311.3920.983-1.9720.063
SNCB-0.1780.8370.778-0.900
SOX20.1691.1841.016-1.3790.030
SPOP0.2091.2321.014-1.4970.0360
SYN1-0.2790.7560.686-0.8340-0.13603-0.2740.760.676-0.8540
SYN2-0.1880.8280.754-0.9100
TNPO10.1941.2141.027-1.4360.0230
USH1C-0.1540.8570.796-0.9220-0.07669
WWTR10.2321.2611.102-1.4430.0010
YAP10.2461.2791.114-1.46800

Abbreviations: HR: hazard ratio; CI: confidence interval; LASSO: least absolute shrinkage and selection operator.

Figure 3

The development of the liquid-liquid phase separation- (LLPS-) related gene signature (GS) in the mRNAseq_693 data sets. (a) Thirteen genes were identified using the least absolute shrinkage and selection operator Cox regression analysis. (b) The time-dependent receiver operating characteristic curve (ROC) analysis. (c) The patients with high LLPS-related GS score had significantly shorter overall survival than those with low LLPS-related GS score. (d) The LLPS-related GS-based risk stratification is a prognostic factor independent of routine clinicopathological characteristics.

Figure 4

The liquid-liquid phase separation-related gene signature could further risk stratifying IDH-mutant low-grade gliomas.

3.3. High-Risk LGG-Specific Gene Sets

GSEA showed that the hallmark gene set of “angiogenesis,” “epithelial mesenchymal transition,” “IL6 JAK STAT3 signaling,” “inflammatory response,” and “interferon gamma response” is significantly enriched in the high-risk LGGs (Figure 5(a)). Several cancer-related pathways may be more active in high-risk LGGs (Figure 5(b)), such as IL6 JAK STAT3 signaling pathway and ECM receptor interaction.
Figure 5

The results of gene set enrichment analysis. (a) The top five (ranked by false discovery rate) hallmark and (b) Kyoto Encyclopedia of Genes and Genomes gene sets were significantly enriched in the high-risk samples.

3.4. The LLPS-Related GS Was Confirmed in Two Independent External Data Sets

In the data set of mRNAseq_325, 137 patients with LGG were included and validated the LLPS-related GS with HR = 2.390, 95% CI for HR = 1.841 − 3.103, and P < 0.001. The tROC curve analysis obtained an AUC of 0.817, 0.845, and 0.834 for 1, 3, and 5 years, respectively (Figure 6(a)). The high-risk patients had significantly shorter OS than the low-risk patients (Figure 6(b)). In the data set of mRNA-array_301, 140 patients with LGG were included and validated the LLPS-related GS with HR = 1.868, 95% CI for HR = 1.120 − 3.114, and P < 0.017. The tROC curve analysis obtained an AUC of 0.711, 0.659, and 0.618 for 1, 3, and 5 years, respectively (Figure 6(c)). The high-risk patients had significantly shorter OS than the low-risk patients (Figure 6(d)).
Figure 6

Validation of the liquid-liquid phase separation- (LLPS-) related gene signature (GS). The time-dependent receiver operating characteristic curve (ROC) analysis in (a) mRNAseq_325 and (c) mRNA-array_301. The patients with high LLPS-related GS score had significantly shorter overall survival than those with low LLPS-related GS score both in (b) mRNAseq_325 and (d) mRNA-array_301.

4. Discussion

Prediction of prognosis in gliomas was considered much more challenging than with other malignancies. Signs or biomarkers of progression in other malignancies, such as serum CEA and CA199 in colorectal cancer [26], are not present in gliomas. Several clinicopathological features have been considered demonstrating a “higher risk” for progression and poorer prognosis in various studies, including age ≥ 40 years, tumor size > 6 cm, neurological deficits prior to surgery, and tumor that crosses the midline [27, 28]. In recent years, molecular features have been found more reliable than these routine prognostic features. For instance, the isocitrate dehydrogenase (IDH) mutation has been recognized to be correlated to a better prognosis in LGG [29]. The 1p19q codeletion predicts a longer progression-free survival and overall survival [30, 31]. In our present study, we developed and validated a novel molecular signature for predicting the prognosis of LGG. The LLPS-related GS was also independent to IDH and 1p19q status, which can provide more precise management for IDH-mutant or 1p19q codeletion LGGs. LLPS has emerged as a novel concept to elaborate the organization of living cells [6]. The potential prognostic value of LLPS-related molecular has also received increasing attention. Qiu et al. constructed a LLPS-related GS as a prognostic tool for epithelial ovarian cancer [17]. Our analysis might firstly develop the novel LLPS-related GS for predicting the prognosis of LGG. The LLPS-related GS was constructed using nine genes (ABL1, AR, CDK1, DAXX, ELN, KMT2D, POU5F1, SH3KBP1, and SYN1). Actually, functional studies of these nine genes in LGG are few. This suggests that it is helpful for our discovery of candidate key molecules of LGG from the perspective of LLPS. The associations of these genes to other cancers may help us study their functions in LGG. It was reported that ABL1 promotes cancer cell growth, survival, adhesion, and migration depending on the cellular context [32]. AR has been found to be associated with the occurrence, progression, prognosis, and drug resistance of ovarian cancer, endometrial cancer, and cervical cancer [33]. The upregulation of CDK1 can promote the growth and the proliferation of melanoma tumor cells [34]. SH3KBP1 was considered promoting glioblastoma tumorigenesis by activating EGFR signaling [35]. Deregulated expression of SYN1 may maintain a cancer stem-like phenotype that contributes to the development of gliomas [36]. POU5F1 was reported play a carcinogenic role in liver hepatocellular carcinoma [37]. Whether these genes affect the prognosis of LGG through a similar mechanism or just through the LLPS mechanism remains to be further investigated. According to the results, the high-risk groups may have more active angiogenesis and may benefit from antiangiogenic treatment, such as bevacizumab [38]. We also found that the inflammatory response was more active in the high-risk LGG. Whether immunotherapy is more effective in these subgroups deserves further study [39]. We also found that other cancer-related pathways were enriched in the high-risk group. The more active epithelial-mesenchymal transition in the high-risk group may contribute to its biological behavior more similar to other cancers. Not surprisingly, we found case reports of patients benefiting from other unconventional treatments. Our analysis may provide a reference for the identification of these patients. Although our present study may provide a new insight into the LLPS and LGG and the LLPS-related GS may improve the management of LGG, there were several notable limitations. Firstly, the LLPS-related GS was proposed based a retrospective study; it requires prospective studies to validate or even improve before going into clinical decision-making. Secondly, our study is mainly focused on Chinese LGG; whether the results are applicable to other races needs further study. Thirdly, molecular function experiments were lacking in the present study; thus, it remains elusive whether these LLPS-related genes are causal or merely prognostic markers in LGG.

5. Conclusion

In conclusion, we developed and validated a novel LLPS-related GS for risk stratification of LGG. Our findings may provide more precise management for LGGs and a useful reference for LLPS mechanism to link LGG studies.
  39 in total

Review 1.  Liquid phase condensation in cell physiology and disease.

Authors:  Yongdae Shin; Clifford P Brangwynne
Journal:  Science       Date:  2017-09-22       Impact factor: 47.728

2.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

3.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

4.  The Molecular Signatures Database (MSigDB) hallmark gene set collection.

Authors:  Arthur Liberzon; Chet Birger; Helga Thorvaldsdóttir; Mahmoud Ghandi; Jill P Mesirov; Pablo Tamayo
Journal:  Cell Syst       Date:  2015-12-23       Impact factor: 10.304

5.  Expression profiling of RE1-silencing transcription factor (REST), REST corepressor 1 (RCOR1), and Synapsin 1 (SYN1) genes in human gliomas.

Authors:  Musteyde Yucebas; Sunde Yilmaz Susluer; Hasan Onur Caglar; Tugce Balci; Z Ozlem Dogan Sigva; Taner Akalin; Nezih Oktar; Tayfun Dalbasti; Cigir Biray Avci; Cumhur Gunduz
Journal:  J BUON       Date:  2016 Jul-Aug       Impact factor: 2.533

6.  CDK1 Interacts with Sox2 and Promotes Tumor Initiation in Human Melanoma.

Authors:  Dinoop Ravindran Menon; Yuchun Luo; John J Arcaroli; Sucai Liu; Lekha Nair KrishnanKutty; Douglas G Osborne; Yang Li; Jenny Mae Samson; Stacey Bagby; Aik-Choon Tan; William A Robinson; Wells A Messersmith; Mayumi Fujita
Journal:  Cancer Res       Date:  2018-10-08       Impact factor: 12.701

Review 7.  Nuclear microenvironment in cancer: Control through liquid-liquid phase separation.

Authors:  Ryu-Suke Nozawa; Tatsuro Yamamoto; Motoko Takahashi; Hiroaki Tachiwana; Reo Maruyama; Toru Hirota; Noriko Saitoh
Journal:  Cancer Sci       Date:  2020-07-21       Impact factor: 6.716

8.  Diagnostic and Prognostic Value of CEA and CA19-9 in Colorectal Cancer.

Authors:  Leilani Lakemeyer; Silvia Sander; Mathias Wittau; Doris Henne-Bruns; Marko Kornmann; Johannes Lemke
Journal:  Diseases       Date:  2021-03-17

9.  Temozolomide chemotherapy versus radiotherapy in high-risk low-grade glioma (EORTC 22033-26033): a randomised, open-label, phase 3 intergroup study.

Authors:  Brigitta G Baumert; Monika E Hegi; Martin J van den Bent; Andreas von Deimling; Thierry Gorlia; Khê Hoang-Xuan; Alba A Brandes; Guy Kantor; Martin J B Taphoorn; Mohamed Ben Hassel; Christian Hartmann; Gail Ryan; David Capper; Johan M Kros; Sebastian Kurscheid; Wolfgang Wick; Roelien Enting; Michele Reni; Brian Thiessen; Frederic Dhermain; Jacoline E Bromberg; Loic Feuvret; Jaap C Reijneveld; Olivier Chinot; Johanna M M Gijtenbeek; John P Rossiter; Nicolas Dif; Carmen Balana; Jose Bravo-Marques; Paul M Clement; Christine Marosi; Tzahala Tzuk-Shina; Robert A Nordal; Jeremy Rees; Denis Lacombe; Warren P Mason; Roger Stupp
Journal:  Lancet Oncol       Date:  2016-09-27       Impact factor: 54.433

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.