| Literature DB >> 32802843 |
Jingwei Zhao1, Le Wang2, Bo Wei1.
Abstract
Energy metabolic processes play important roles for tumor malignancy, indicating that related protein-coding genes and regulatory upstream genes (such as long noncoding RNAs (lncRNAs)) may represent potential biomarkers for prognostic prediction. This study will develop a new energy metabolism-related lncRNA-mRNA prognostic signature for lower-grade glioma (LGG) patients. A GSE4290 dataset obtained from Gene Expression Omnibus was used for screening the differentially expressed genes (DEGs) and lncRNAs (DELs). The Cancer Genome Atlas (TCGA) dataset was used as the prognosis training set, while the Chinese Glioma Genome Atlas (CGGA) was for the validation set. Energy metabolism-related genes were collected from the Molecular Signatures Database (MsigDB), and a coexpression network was established between energy metabolism-related DEGs and DELs to identify energy metabolism-related DELs. Least absolute shrinkage and selection operator (LASSO) analysis was performed to filter the prognostic signature which underwent survival analysis and nomogram construction. A total of 1613 DEGs and 37 DELs were identified between LGG and normal brain tissues. One hundred and ten DEGs were overlapped with energy metabolism-related genes. Twenty-seven DELs could coexpress with 67 metabolism-related DEGs. LASSO regression analysis showed that 9 genes in the coexpression network were the optimal signature and used to construct the risk score. Kaplan-Meier curve analysis showed that patients with a high risk score had significantly worse OS than those with a low risk score (TCGA: HR = 3.192, 95%CI = 2.182-4.670; CGGA: HR = 1.922, 95%CI = 1.431-2.583). The predictive accuracy of the risk score was also high according to the AUC of the ROC curve (TCGA: 0.827; CGGA: 0.806). Multivariate Cox regression analyses revealed age, IDH1 mutation, and risk score as independent prognostic factors, and thus, a prognostic nomogram was established based on these three variables. The excellent prognostic performance of the nomogram was confirmed by calibration and discrimination analyses. In conclusion, our findings provided a new biomarker for the stratification of LGG patients with poor prognosis.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32802843 PMCID: PMC7403901 DOI: 10.1155/2020/3708231
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Identification of energy metabolism-related differentially expressed RNAs between lower-grade gliomas and normal brain tissues in the GSE4290 dataset. (a) Volcano plot to display the distribution of differentially expressed RNAs, which was performed by ggplot2. Green dots were downregulated RNAs; red dots were upregulated RNAs; black dots were RNAs not differentially expressed. FC: fold change; FDR: false discovery rate. (b) Heat map of differentially expressed RNAs, which was created by Pheatmap. Red indicated high expression; green indicated low expression. (c) Venn diagram to display the overlap between differentially expressed protein-coding genes (DEGs) and energy metabolism-related genes obtained from Molecular Signatures Database.
Figure 2Identification of energy metabolism-related differentially expressed lncRNAs based on their coexpression with differentially expressed protein-coding mRNAs. Upregulated lncRNAs (square) and mRNAs (circle) were in red; downregulated lncRNAs (square) and mRNAs (circle) were in green. The coexpression pairs between lncRNAs and mRNAs were selected by calculation of Pearson correlation coefficients (PCC) by cor.test function. Only coexpression pairs with PCC > 0.6 were visualized.
Figure 3Function enrichment analysis for genes in the coexpression network by the Database for Annotation, Visualization and Integrated Discovery database. (a) Gene Ontology (GO) biological process terms; (b) Kyoto Encyclopedia of Genes and Genomes (KEGG). These plots were generated using the ggplot2 package.
Function enrichment.
| Category | Term |
| Genes |
|---|---|---|---|
| Biology process | GO:0006661~phosphatidylinositol biosynthetic process | 3.380 | INPP5J, ARF3, SYNJ1, PI4KA, PIP4K2A, MTMR7 |
| GO:0030203~glycosaminoglycan metabolic process | 5.100 | B3GAT1, GPC2, CHST9, BCAN, VCAN | |
| GO:0030206~chondroitin sulfate biosynthetic process | 1.310 | CHSY3, CHST9, BCAN, VCAN | |
| GO:0009725~response to hormone | 6.240 | ME1, OXCT1, FHL2, DHCR24 | |
| GO:0035338~long-chain fatty-acyl-CoA biosynthetic process | 6.240 | ACOT7, ELOVL4, SLC25A1, ACOT4 | |
| GO:0030208~dermatan sulfate biosynthetic process | 1.009 | UST, BCAN, VCAN | |
| GO:0055114~oxidation-reduction process | 2.379 | ME1, TYRP1, CYP46A1, CYP2C8, QDPR, CBR4, SRD5A1, CRYM, DHCR24 | |
| GO:0006699~bile acid biosynthetic process | 3.136 | CYP46A1, STAR, SLC27A2 | |
| GO:0046856~phosphatidylinositol dephosphorylation | 4.090 | INPP5J, SYNJ1, MTMR7 | |
| GO:0008652~cellular amino acid biosynthetic process | 4.792 | GLS2, ASPA, GOT1 | |
| GO:0042493~response to drug | 7.278 | STAR, BCHE, OXCT1, FABP3, SRD5A1, AACS | |
| GO:0019083~viral transcription | 1.015 | RPLP0, NUP93, TPR, RPS21 | |
| GO:0006486~protein glycosylation | 1.040 | B3GAT1, B3GALT2, B4GALT6, LRP2 | |
| GO:0070859~positive regulation of bile acid biosynthetic process | 1.192 | NR1D1, STAR | |
| GO:0006024~glycosaminoglycan biosynthetic process | 1.218 | GPC2, HAS1, HS3ST2 | |
| GO:0006094~gluconeogenesis | 1.332 | GOT1, ENO2, SLC25A1 | |
| GO:0006533~aspartate catabolic process | 1.587 | ASPA, GOT1 | |
| GO:0071320~cellular response to cAMP | 1.829 | STAR, RPLP0, SRD5A1 | |
| GO:0016101~diterpenoid metabolic process | 1.979 | STAR, SRD5A1 | |
| GO:0071394~cellular response to testosterone stimulus | 1.979 | SRD5A1, AACS | |
| GO:0006629~lipid metabolic process | 2.485 | CPNE6, AACS, LRP2, FABP6 | |
| GO:0060992~response to fungicide | 2.760 | STAR, SRD5A1 | |
| GO:0044539~long-chain fatty acid import | 2.760 | FABP3, SLC27A2 | |
| GO:0044321~response to leptin | 2.760 | NR1D1, STAR | |
| GO:0046951~ketone body biosynthetic process | 3.148 | HMGCLL1, AACS | |
| GO:0007584~response to nutrient | 3.523 | STAR, OXCT1, AACS | |
| GO:0035457~cellular response to interferon-alpha | 3.535 | STAR, TPR | |
| GO:0032869~cellular response to insulin stimulus | 3.788 | GOT1, STAR, SRD5A1 | |
| GO:0046855~inositol phosphate dephosphorylation | 4.304 | SYNJ1, MTMR7 | |
| GO:0046488~phosphatidylinositol metabolic process | 4.304 | PITPNM3, SYNJ1 | |
| GO:0051292~nuclear pore complex assembly | 4.304 | NUP93, TPR | |
| GO:0071872~cellular response to epinephrine stimulus | 4.686 | STAR, SRD5A1 | |
|
| |||
| KEGG pathway | hsa01100:metabolic pathways | 9.150 | ME1, PLD3, TYRP1, HMGCLL1, B3GALT2, CYP2C8, SYNJ1, PI4KA, QDPR, HK1, AGMAT, RIMKLA, ACOT4, GLS2, ASPA, CKMT1A, GOT1, CHSY3, INPP5J, ENO2, B4GALT6, MTMR7, DHCR24 |
| hsa00562:inositol phosphate metabolism | 1.555 | INPP5J, SYNJ1, PI4KA, PIP4K2A, MTMR7 | |
| hsa00250:alanine, aspartate, and glutamate metabolism | 1.894 | GLS2, ASPA, GOT1, RIMKLA | |
| hsa04070:phosphatidylinositol signaling system | 5.025 | INPP5J, SYNJ1, PI4KA, PIP4K2A, MTMR7 | |
| hsa00330:arginine and proline metabolism | 5.263 | CKMT1A, GOT1, AGMAT, CARNS1 | |
| hsa00062:fatty acid elongation | 1.343 | ACOT7, ELOVL4, ACOT4 | |
| hsa00650:butanoate metabolism | 1.558 | HMGCLL1, OXCT1, AACS | |
| hsa00280:valine, leucine, and isoleucine degradation | 4.385 | HMGCLL1, OXCT1, AACS | |
| hsa01200:carbon metabolism | 4.608 | ME1, GOT1, ENO2, HK1 | |
The optimal signature combination.
| Symbol | Type | Expression (LGG vs. CK) | Expression (GBM vs. LGG) | Multivariate Cox regression analysis | LASSO coefficient | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| logFC | FDR (GSE4290) |
|
|
| HR | 95% CI |
| |||
| GABPB1-AS1 | lncRNA | 1.07 | 2.72 | 7.59 | 6.72 | 8.83 | 0.9946 | 0.9899-0.9994 | 2.846 | -0.05468 |
| HAR1A | lncRNA | -1.51 | 7.65 | 4.31 | 1.65 | 1.62 | 0.9938 | 0.988-0.9996 | 3.580 | -0.64387 |
| LINC00599 | lncRNA | -1.3 | 4.80 | 3.64 | 2.36 | 2.62 | 1.0035 | 1.0001-1.007 | 4.628 | 0.00904 |
| SNAI3-AS1 | lncRNA | -1.76 | 9.51 | 7.96 | 3.12 | 1.71 | 0.9817 | 0.9701-0.9934 | 2.310 | -1.81399 |
| SNHG1 | lncRNA | 1.3 | 6.06 | 1.28 | 1.28 | 8.19 | 1.0054 | 1.0004-1.0104 | 3.328 | 0.29846 |
| FABP6 | mRNA | -2.18 | 1.13 | 2.94 | 4.34 | 1.42 | 1.0062 | 1.004-1.0085 | 6.270 | 0.55872 |
| MBOAT7 | mRNA | -1.27 | 3.92 | 8.01 | 1.11 | 0 | 1.0084 | 1.0017-1.0152 | 1.463 | 0.77305 |
| SLC25A1 | mRNA | 1.03 | 2.39 | 4.05 | 5.96 | 0 | 0.9955 | 0.9911-0.9999 | 4.934 | -0.40797 |
| UST | mRNA | 1.47 | 5.50 | 5.80 | 7.57 | 2.41 | 0.9951 | 0.9914-0.9989 | 1.177 | -0.55501 |
LGG: lower-grade gliomas; GBM: glioblastoma multiforme; CK: normal control; CGGA: Chinese Glioma Genome Atlas; TCGA: The Cancer Genome Atlas; FC: fold change; FDR: false discovery rate; HR: hazard ratio; CI: confidence interval; LASSO: least absolute shrinkage and selection operator.
Figure 4The prognostic performance assessment for the risk score model. (a, c) Kaplan-Meier survival curve analysis to show the overall survival difference between the high- and low-risk group of the training (a) and validation (c) datasets; (b, d) receiver operator characteristic (ROC) curves to demonstrate the predictive accuracy for the overall survival of patients in the training (b) and validation (d) datasets. CGGA: Chinese Glioma Genome Atlas; TCGA: The Cancer Genome Atlas; HR: hazard ratio; AUC: area under the ROC curve.
Figure 5Stratification analysis based on age (a) and IDH1 mutation (d) which were also independent prognostic factors for overall survival in addition to the risk score. The Kaplan-Meier curve showed significant differences in overall survival between the high-risk group and the low-risk group in different ages (b, c) and IDH1 mutation status (e, f). HR: hazard ratio; y: year; IDH: isocitrate dehydrogenase.
Univariate and multivariate Cox regression analyses of clinical pathologic features for overall survival.
| Variables | TCGA ( | Univariate analysis | Multivariate analysis | ||||
|---|---|---|---|---|---|---|---|
| HR | 95% CI |
| HR | 95% CI |
| ||
| Age (years, mean ± SD) | 42.84 ± 13.39 | 1.057 | 1.043-1.072 | 2.78 | 1.049 | 1.003-1.096 | 3.59 |
| Gender (male/female) | 286/234 | 1.145 | 0.811-1.617 | 4.40 | — | — | — |
| Animal insect allergy history (yes/no/-) | 16/289/215 | 0.831 | 0.202-3.428 | 7.92 | — | — | — |
| Asthma history (yes/no/-) | 22/346/152 | 0.894 | 0.409-1.956 | 7.76 | — | — | — |
| Food allergy history (yes/no/-) | 20/290/210 | 1.008 | 0.363-2.801 | 9.87 | — | — | — |
| Hay fever history (yes/no/-) | 38/304/178 | 0.488 | 0.177-1.347 | 1.24 | — | — | — |
| Headache history (yes/no/-) | 174/297/49 | 0.841 | 0.576-1.227 | 3.64 | — | — | — |
| Histological type (astrocytoma/oligoastrocytoma/oligodendroglioma) | 194/132/194 | 0.756 | 0.620-0.921 | 5.25 | 0.529 | 0.244-1.149 | 1.08 |
| IDH1 mutation (yes/no/-) | 91/34/395 | 0.181 | 0.0675-0.484 | 4.14 | 0.226 | 0.0729-0.701 | 9.98 |
| Neoplasm histologic grade (G2/G3/-) | 254/265/1 | 3.416 | 2.341-4.985 | 1.84 | 0.949 | 0.294-3.063 | 9.30 |
| Radiation therapy (yes/no/-) | 294/181/45 | 1.847 | 1.209-2.820 | 2.83 | 1.981 | 0.584-6.722 | 2.73 |
| Targeted molecular therapy | 268/200/52 | 1.389 | 0.955-2.019 | 8.08 | — | — | — |
| Risk score (high/low) | 260/260 | 3.192 | 2.182 -4.670 | 1.99 | 5.041 | 1.272-19.97 | 2.13 |
| Death (dead/alive) | 133/387 | — | — | — | — | — | — |
| Overall survival days (months, mean ± SD) | 32.97 ± 32.78 | — | — | — | — | — | — |
HR: hazard ratio; CI: confidence interval; SD: standard deviation; IDH: isocitrate dehydrogenase; TCGA: The Cancer Genome Atlas. Bold indicated the factors with statistical difference (p < 0.05).
Figure 6Establishment and assessment of a prognostic nomogram based on age, IDH1 mutation status, and the risk score. (a) A prognostic nomogram; (b) the calibration assessment by calibration curves; (c) the discrimination assessment by receiver operator characteristic curve. RS: risk score; IDH: isocitrate dehydrogenase.
The performance of the nomogram assessed by discrimination parameters.
| Model | AUC | C-index |
| Specificity | Sensitivity |
|---|---|---|---|---|---|
| Age | 0.564 | 0.743 | 0 | 0.889 | 0.278 |
| IDH1 mutation | 0.646 | 0.733 | 8.138 | 0.849 | 0.442 |
| Multiclinical | 0.779 | 0.8 | 1.278 | 0.877 | 0.632 |
| lncRNA alone | 0.747 | 0.702 | 9.97 | 0.832 | 0.662 |
| mRNA alone | 0.739 | 0.737 | 0 | 0.778 | 0.662 |
| Multi-RNA based | 0.827 | 0.812 | 0 | 0.798 | 0.827 |
| Multi-RNA combined | 0.845 | 0.928 | 0 | 0.863 | 0.782 |
AUC: area under the curve of receiver operating characteristic curve; C-index: concordance index; IDH: isocitrate dehydrogenase.