Literature DB >> 30405776

Identification of novel long non-coding RNA in diffuse intrinsic pontine gliomas by expression profile analysis.

Yuehui Liu1, Haiping Liu2, Dongwei Zhang1.   

Abstract

Diffuse intrinsic pontine glioma (DIPG) is one of the most devastating types of pediatric cancer. Accumulating evidence suggests that the dysregulated expression of long non-coding (lnc)-RNAs is associated with various pathologies of the CNS. However, the expression patterns and prognostic roles of lncRNAs in DIPG have not yet been systematically determined. In the present study, lncRNA expression profiles were obtained from the Gene Expression Omnibus (GEO) database using the lncRNA-mining approach and a differential expression analysis for lncRNAs was performed between DIPG and low-grade brainstem glioma and DIPG and normal pediatric brainstem tissue. Using a two-tailed t-test, 58 and 197 lncRNAs were found to be significantly deferentially expressed (Fold change >2 or <0.5, FDR adjusted P<0.05). To identify the prognostic value of these 255 differentially expressed lncRNAs, univariate and multivariate Cox proportional hazards regression analysis were performed and a 9-lncRNA signature as a potential biomarker for predicting the prognosis of DIPG was constructed. Kaplan-Meier curve analysis showed that patients in the high-risk group exhibited a reduced survival time compared with patients in the low-risk group (median survival of 230 vs. 460 days, log-rank test P<0.001). Moreover, this lncRNA-signature could be used as an independent prognostic marker for DIPG patient survival. The present study provided novel candidates for the investigation of potential diagnostic or prognostic biomarkers and/or therapeutic targets of DIPG, as well as a novel insight into the underlying mechanisms of DIPG.

Entities:  

Keywords:  diffuse intrinsic pontine gliomas; expression profiles; long non-coding RNAs; prognosis

Year:  2018        PMID: 30405776      PMCID: PMC6202498          DOI: 10.3892/ol.2018.9461

Source DB:  PubMed          Journal:  Oncol Lett        ISSN: 1792-1074            Impact factor:   2.967


Introduction

Diffuse intrinsic pontine glioma (DIPG) is one of the most devastating pediatric cancers, and accounts for 10–15 of pediatric brain and central nervous system (CNS) tumors (1,2). The standard treatment for DIPG currently includes neurosurgery, radiotherapy and chemotherapy. However, the prognosis for DIPG remains poor, due to high relapse rates and rapid progression (3). The 1-, 2- and 5-year survival rates of patients with DIPG are approximately 30%, <10 and <1%, respectively (2,4). Therefore, there is an urgent need to identify novel DIPG-related molecular factors and therapeutic targets for the treatment of DIPG. It is now commonly accepted that at least 90% of the human genome is actively transcribed, whereas only <2% encodes proteins; the majority of the genome can be transcribed into non-coding RNAs (ncRNAs) (5). ncRNAs can be classified into two major classes based on transcript size: Small ncRNAs (such as microRNAs) and long non-coding RNAs (lncRNAs). To date, thousands of lncRNAs have been identified in humans and other species (6). lncRNAs are commonly defined as RNA molecules longer than 200 nucleotides that are not necessarily translated into proteins (7). Accumulating evidence suggests that lncRNAs play important roles in various biological processes by negatively or positively regulating gene expression at the epigenetic, transcriptional and post-transcriptional levels (8–11). With advances in transcriptome profiling, aberrant lncRNA expression has been observed in various human diseases, including cancer. These dysregulated lncRNAs have been implicated in cancer pathogenesis and development (12–17). Recently, the regulatory roles of lncRNAs have been demonstrated in the nervous system function, and their dysregulated expression is involved in various pathologies of the CNS (18,19). However, the expression patterns and prognostic roles of lncRNAs in DIPG have not yet been systematically determined. This study aimed to identify lncRNA expression patterns in DIPG compared with brainstem low-grade glioma and normal pediatric brainstem tissue, and identify the lncRNAs associated with the survival of patients with DIPG.

Materials and methods

Datasets

The human microarray dataset GSE26576 (1) was downloaded from the NCBI Gene Expression Omnibus (GEO) database (www.ncbi.nlm.nih.gov/geo/). The GSE26576 microarray dataset was generated with the GPL570 platform (Affymetrix Human Genome U133 Plus 2.0 Array) and included 26 DIPG samples, 6 brainstems low-grade glioma samples and 2 normal pediatric brainstem samples.

lncRNA expression profiles

lncRNA expression profiles included in the GSE26576 dataset were obtained by repurposing microarray probes using GATExplorer software, as previously described (20,21). Briefly, a series of R packages in GATExplorer software were used to map the data and annotate the lncRNA microarray probes. lncRNA probes that mapped to the human and mouse genomes (derived from the RNAdb database) (22) were retained. Finally, 5635 lncRNAs were identified for further analysis.

Preprocessing and analysis of expression profiles

The raw microarray dataset (CEL file) was obtained from the GEO database and normalized using the Robust Multichip Average (RMA) method, which involved three main steps: Background correction, quantile normalization and log2-transformation. For determination of lncRNA differential expression profiles, a two-tailed T-test was used to identify differentially expressed lncRNAs between patients with DIPG and normal controls, and between patients with DIPG and low-grade glioma. lncRNAs with an adjusted P<0.05 after FDR correction and a fold change of >2 or <0.5 were considered as differentially expressed lncRNAs. Hierarchical clustering analysis was performed for the expression data of the differentially expressed lncRNAs using the R package ‘pheatmap’.

Statistical analysis

The association between the lncRNA gene expression and patient survival was assessed by univariate Cox regression analysis. The Kaplan-Meier method and two-sided log-rank test were used to compare survival differences between low- and high-risk groups. Multivariate Cox analysis was used to test whether the lncRNA expression signature was independent of other clinical features. Time-dependent receiver operating characteristic (ROC) curves were used to compare the sensitivity and specificity of the lncRNA expression signature for survival prediction.

Functional enrichment analysis

Expression correlation between protein-coding genes and lncRNAs was measured using Pearson correlation coefficients. Functional enrichment analysis was conducted for the protein-coding genes co-expressed with the lncRNAs in GO and KEGG using the ClueGO plugin (version 2.3.3) in Cytoscape (23), and DAVID (david.ncifcrf.gov/, version 6.8) (24). GO terms and KEGG pathways were considered significantly enriched when P<0.05.

Results

Identification of differentially expressed lncRNAs between patients with DIPG and normal controls

To identify differentially expressed lncRNAs between patients with DIPG and normal controls, we performed differential expression analysis for lncRNAs using student's t-test. A total of 58 lncRNAs were identified as differentially expressed between patients with DIPG and normal controls (Fold change >2 or <0.5, P<0.05 after FDR adjustment). Of these, 41 lncRNAs were upregulated, and 17 downregulated, in patients with DIPG.

Identification of differentially expressed lncRNAs between patients with DIPG and low-grade glioma

We performed a differential expression analysis of lncRNA expression profiles between patients with DIPG and low-grade glioma, and identified 197 differentially expressed lncRNAs using student's t-test. (Fold change >2 or <0.5, P<0.05 after FDR adjustment). Among the differentially expressed lncRNAs, 125 were upregulated and 72 were downregulated in patients with DIPG. To demonstrate the significance of the dysregulated lncRNAs in discriminating between patients with DIPG, normal controls and patients with low-grade glioma, we performed an unsupervised hierarchical clustering analysis for all samples according to the expression values of the identified differentially expressed lncRNAs. As shown in Fig. 1, three distinct sample clusters were obtained by hierarchical clustering analysis, suggesting that the 255 differentially expressed lncRNAs were closely associated with DIPG, and could be used to distinguish DIPG from normal brainstem tissue and low-grade glioma.
Figure 1.

Hierarchical clustering analysis of DIPG, normal control and LGG samples based on 255 differentially expressed lncRNAs. DIPG, diffuse intrinsic pontine glioma; LGG, low-grade glioma; NM, normal brainstem tissue.

Identification of an lncRNA expression signature for survival prediction in patients with DIPG

To identify survival-related lncRNAs, we performed univariate Cox proportional hazards regression analysis for the aforementioned 255 differentially expressed lncRNAs. A total of 14 lncRNAs were significantly associated with DIPG patient survival. We conducted a multivariate Cox regression analysis for the 14 survival-related lncRNAs, and identified a set of 9 lncRNAs that were independently associated with the DIPG patient survival time (Table I). We constructed a lncRNA expression signature as a classifier for survival prediction according to the expression of the 9 lncRNAs weighted by the multivariate Cox regression coefficient, as follows: Risk Score=(−3.92)*AF086127 + 1.52*AF086217 + 2.42*AF086391 + (−4.06)*AF119852 + 0.80*AK021535 + (−0.82)*AK022370 + 1.38*AL050068 + (−0.89)*BC012548 + (−2.39)*BC041658. The risk score for each patient was calculated based on the lncRNA gene expression signature. Using the median risk score as the cutoff point (−32.36), 26 DIPG patients were classified into the high- and low-risk groups. Patients in the high-risk group exhibited a poorer overall survival time than patients in the low-risk group (median survival of 230 vs. 460 days, log-rank test P<0.001). Kaplan-Meier curves for the high- and low-risk groups are shown in Fig. 2A. The heatmap shows that five protective lncRNAs exhibit a high expression level in the low-risk group, while four risk lncRNAs exhibit a high expression level in the high-risk group (Fig. 2B). Analysis of time-dependent ROC demonstrated that the AUC value for the lncRNA expression signature was 0.935 for 12-month survival (Fig. 2C).
Table I.

Nine long non-coding RNAs significantly associated with the survival in the diffuse intrinsic pontine glioma dataset.

Gene symbolCoefficientHazard ratioZ-scoreP-value
AF0862170.9692.6352.5170.012
AF0863911.4444.2362.4390.015
AF119852−1.9590.141−2.4140.016
AK0215350.5951.8132.6500.008
AK022370−0.6840.505−2.0110.044
AL050068−0.9470.388−2.0420.041
BC0125480.4281.5342.0610.039
BC0416580.6341.8851.9830.047
AF086127−0.7690.464−2.2470.025
Figure 2.

Prognostic performance of the nine-lncRNA signature. (A) Kaplan-Meier curves for the high-risk and low-risk groups. (B) Time-dependent receiver operating characteristic curve analysis of the lncRNA expression signature. (C) Distribution of the risk score and expression heatmap of the nine-lncRNA signature. lncRNA, long non-coding RNA.

The 1-year survival rate in the high-risk group was 7.69%, whereas the corresponding rate in the low-risk group was 84.62%. The results of the univariate analysis indicated that the hazard ratio of the high-risk score vs. the low-risk score for survival was 2.72 [P<0.001; 95% confidence interval (CI)=1.76–4.21] (Table II). According to the multivariate analysis, including age, the hazard ratio for the high-risk vs. the low-risk score for survival was 2.69 (P<0.001; 95% CI, 1.74–4.18) (Table II), indicating that the lncRNA expression signature maintained an independent association with survival.
Table II.

Univariate and multivariate Cox regression analysis of survival in the DIPG dataset.

Univariate analysisMultivariate analysis


Variable [DIPG dataset (n=27)]HR95% CI of HRP-valueHR95% CI of HRP-value
Risk score2.721.76–4.217.15×10−62.691.74–4.181.01×10−5
Age1.040.93–1.160.531.010.90–1.140.84

HR, hazard ratio; CI, confidence interval; DIPG, diffuse intrinsic pontine glioma.

Functional analysis of the lncRNA expression signature

We performed GO and KEGG enrichment analyses for the protein-coding genes which were co-expressed with the 9 lncRNAs in the gene expression signature using DAVID and clueGO. The results of GO enrichment analysis revealed four enriched GO functional clusters, including ‘protein folding’, ‘cell proliferation’, ‘epithelial cell migration’ and ‘regulation of nucleocytoplasmic transport’ (Fig. 3A). The results of the KEGG enrichment analysis revealed eight enriched KEGG pathways, including ‘terpenoid backbone biosynthesis’, ‘protein processing in the endoplasmic reticulum’, ‘biosynthesis of antibiotics’, ‘HTLV-I infection’, ‘PI3K-Akt signaling pathway’, ‘melanoma’, ‘metabolic pathways’ and ‘Ras signaling pathway’ (Fig. 3B).
Figure 3.

Functional enrichment analysis. (A) Enriched Gene Ontology terms. (B) Enriched Kyoto Encyclopedia of Genes and Genomes pathways.

Discussion

DIPGs, representing 75–80% of pediatric brainstem tumors, are the most common brainstem tumors in children (25). Previous studies have investigated the molecular heterogeneity between DIPGs and adult high-grade gliomas (HGGs) and between DIPGs and low-grade brainstem gliomas, to improve our understanding of the molecular mechanisms and molecular expression signatures underlying DIPG. Using polymerase chain reaction-single strand polymorphism and nucleotide analyses, Zhang et al (26) reported a p53 gene mutation in some DIPGs and inferred that DIPGs might be associated with mutagenic or carcinogenic agents. Paugh et al (1) performed genome-wide analyses and demonstrated significantly different frequencies of specific large-scale and local imbalances in gene expression between DIPGs and nonbrainstem pediatric glioblastomas. Another study performed by Lulla et al (27) studied the miRNA expression pattern in TPG, and identified two distinct subgroups with differentially expressed microRNAs. A recent study indicated that H3K27M-mutant gliomas share similar histological features and an adverse prognosis in adults and children (28). However, the above studies have focused on genomic mutations, mRNAs or miRNAs. Recent studies have suggested that lncRNAs, a new class of ncRNAs, is an important component of disease biology, and the dysregulated expression of lncRNAs has been observed in various human diseases (29). However, the expression patterns of lncRNAs and their functional roles in DIPGs have not been systematically studied yet. In this study, we first obtained lncRNA expression profiles of DIPG, brainstem low-grade glioma and normal pediatric brainstem using the lncRNA-mining approach. We subsequently performed differential expression analysis and identified 58 and 197 significantly differentially expressed lncRNAs between patients with DIPG and normal controls, and between patients with DIPG and low-grade glioma, respectively. To the best of our knowledge, our study is the first to attempt to identify the dysregulated lncRNA expression pattern in patients with DIPG compared with normal controls and patients with low-grade glioma. We hypothesize that these differentially expressed lncRNAs in patients with DIPG may be involved in the pathogenesis and development of DIPG, and could be used as candidates for the investigation of potential diagnostic or prognostic biomarkers and/or therapeutic targets for the treatment of DIPG. During the initial phase of marker discovery, we performed univariate and multivariate Cox proportional hazards regression analysis for these 255 differentially expressed lncRNAs, and constructed a 9-lncRNA signature as a potential biomarker for prognosis of DIPG. Kaplan-Meier curve analysis also demonstrated that patients with the high-risk lncRNA signature had much poorer survival than those with the low-risk lncRNA signature. As radiotherapy and chemotherapy affect the prognosis of patients with DIPG, whether the 9-lncRNA signature is affected by radiotherapy and chemotherapy needs to be investigated in future studies. Although a large number of lncRNAs have been discovered in humans and animals, few lncRNAs have been functionally characterized. It has been reported that it is an effective method to infer the function of lncRNAs based on coding genes that are co-expressed with these lncRNAs (30). Based on this assumption, we first identified coding genes that are co-expressed with lncRNAs using Pearson correlation coefficients. Functional enrichment analysis was conducted for the protein-coding genes co-expressed with lncRNAs to predict the functions of the 9-lncRNA signature. Functional analysis suggested that the 9-lncRNA signature may be involved in known cancer-related biological pathways and processes. For example, altered Ras signaling has been detected in a variety of cancers, including CNS tumors (31). The PI3K-Akt signaling pathway is well known to be involved in various cellular functions, including nutrient uptake, cell proliferation, growth, autophagy, apoptosis and migration (32), and the dysregulation of the PI3K-Akt signaling pathway is associated with neurodevelopmental disorders (33). In conclusion, we have identified some novel differentially expressed lncRNAs in DIPG using previously generated microarray data and identified a lncRNA signature comprising nine lncRNAs (AF086127, AF086217, AF086391, AF119852, AK021535, AK022370, AL050068, BC012548 and BC041658), which can be collectively used as an independent prognostic marker of DIPG patient survival. Our study provided basis for the further investigation of the mechanisms underlying DIPG.
  32 in total

1.  Human genome: end of the beginning.

Authors:  Lincoln D Stein
Journal:  Nature       Date:  2004-10-21       Impact factor: 49.962

Review 2.  Treatment of newly diagnosed diffuse brain stem gliomas in children: in search of the holy grail.

Authors:  David N Korones
Journal:  Expert Rev Anticancer Ther       Date:  2007-05       Impact factor: 4.512

3.  Diffuse intrinsic pontine glioma: poised for progress.

Authors:  Katherine E Warren
Journal:  Front Oncol       Date:  2012-12-28       Impact factor: 6.244

4.  RNAdb--a comprehensive mammalian noncoding RNA database.

Authors:  Ken C Pang; Stuart Stephen; Pär G Engström; Khairina Tajul-Arifin; Weisan Chen; Claes Wahlestedt; Boris Lenhard; Yoshihide Hayashizaki; John S Mattick
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

5.  Characterization of long non-coding RNA-associated ceRNA network to reveal potential prognostic lncRNA biomarkers in human ovarian cancer.

Authors:  Meng Zhou; Xiaojun Wang; Hongbo Shi; Liang Cheng; Zhenzhen Wang; Hengqiang Zhao; Lei Yang; Jie Sun
Journal:  Oncotarget       Date:  2016-03-15

6.  A novel lncRNA-focus expression signature for survival prediction in endometrial carcinoma.

Authors:  Meng Zhou; Zhaoyue Zhang; Hengqiang Zhao; Siqi Bao; Jie Sun
Journal:  BMC Cancer       Date:  2018-01-05       Impact factor: 4.430

Review 7.  Brain Development and Akt Signaling: the Crossroads of Signaling Pathway and Neurodevelopmental Diseases.

Authors:  Long Wang; Kai Zhou; Zhi Fu; Di Yu; Hesuyuan Huang; Xiaodong Zang; Xuming Mo
Journal:  J Mol Neurosci       Date:  2016-12-26       Impact factor: 3.444

8.  Discovery and validation of immune-associated long non-coding RNA biomarkers associated with clinically molecular subtype and prognosis in diffuse large B cell lymphoma.

Authors:  Meng Zhou; Hengqiang Zhao; Wanying Xu; Siqi Bao; Liang Cheng; Jie Sun
Journal:  Mol Cancer       Date:  2017-01-19       Impact factor: 27.401

Review 9.  Gene regulation by the act of long non-coding RNA transcription.

Authors:  Aleksandra E Kornienko; Philipp M Guenzl; Denise P Barlow; Florian M Pauler
Journal:  BMC Biol       Date:  2013-05-30       Impact factor: 7.431

10.  A long non-coding RNA signature to improve prognosis prediction of colorectal cancer.

Authors:  Ye Hu; Hao-Yan Chen; Chen-Yang Yu; Jie Xu; Ji-Lin Wang; Jin Qian; Xi Zhang; Jing-Yuan Fang
Journal:  Oncotarget       Date:  2014-04-30
View more
  1 in total

1.  Noncoding RNAs as potential biomarkers for DIPG diagnosis and prognosis: XIST and XIST-210 involvement.

Authors:  M Á Velázquez-Flores; J M Rodríguez-Corona; J E López-Aguilar; G Siordia-Reyes; G Ramírez-Reyes; G Sánchez-Rodríguez; R Ruiz Esparza-Garrido
Journal:  Clin Transl Oncol       Date:  2020-07-13       Impact factor: 3.405

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.