| Literature DB >> 34211824 |
Haiyan Qi1, Long Chi2, Xiaogang Wang1, Xing Jin1, Wensong Wang1, Jianping Lan1.
Abstract
Abnormal expressions of long noncoding RNAs (lncRNAs) and protein-encoding messenger RNAs (mRNAs) are important for the development of childhood acute lymphoblastic leukemia (ALL). This study developed an lncRNA-mRNA integrated classifier for the prediction of recurrence and prognosis in relapsed childhood ALL by using several transcriptome data. Weighted gene coexpression network analysis revealed that green, turquoise, yellow, and brown modules were preserved across the TARGET, GSE60926, GSE28460, and GSE17703 datasets, and they were associated with clinical relapse and death status. A total of 184 genes in these four modules were differentially expressed between recurrence and nonrecurrence samples. Least absolute shrinkage and selection operator analysis showed that seven genes constructed a prognostic signature (including one lncRNA: LINC00652 and six mRNAs: INSL3, NIPAL2, REN, RIMS2, RPRM, and SNAP91). Kaplan-Meier curve analysis observed that patients in the high-risk group had a significantly shorter overall survival than those of the low-risk group. Receiver operating characteristic curve analysis demonstrated that this signature had high accuracy in predicting the 5-year overall survival and recurrence outcomes, respectively. LINC00652 may function by coexpressing with the above prognostic genes (INSL3, SNAP91, and REN) and lipid metabolism-related genes (MIA2, APOA1). Accordingly, this lncRNA-mRNA-based classifier may be clinically useful to predict the recurrence and prognosis for childhood ALL. These genes represent new targets to explain the mechanisms for ALL.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34211824 PMCID: PMC8208884 DOI: 10.1155/2021/6692022
Source DB: PubMed Journal: Anal Cell Pathol (Amst) ISSN: 2210-7177 Impact factor: 2.916
Figure 1The correlation between any two datasets (TARGET, GSE60926, GSE28460, and GSE17703) in the RNA expression levels (a) and connectivity (b).
Figure 2Selection graphs of the soft-thresholding power β in the adjacency matrix (a) and schematic diagram of the mean connectivity of RNA under various power values (b).
Stable modules screened using weighted gene coexpression network analysis.
| ID | Color | Module size | Number of mRNAs | Number of lncRNAs | Preservation | Module annotation |
|---|---|---|---|---|---|---|
| Module 1 | Black | 138 | 136 | 2 | 4.6749 | Neurological system process |
| Module 2 | Blue | 424 | 418 | 6 | 1.0193 | Cell motion |
| Module 3 | Brown | 302 | 302 | 0 | 14.1644 | Defense response |
| Module 4 | Green | 193 | 191 | 2 | 5.3994 | Cell-cell signaling |
| Module 5 | Grey | 1658 | 1640 | 18 | 2.1860 | Cell adhesion |
| Module 6 | Red | 174 | 172 | 2 | 2.7514 | Ion transport |
| Module 7 | Turquoise | 2008 | 1980 | 28 | 8.1713 | Cell-cell signaling |
| Module 8 | Yellow | 239 | 236 | 3 | 5.1719 | Mitosis |
Figure 3Clustering dendrograms of gene modules screened from the datasets TARGET (a), GSE28460 (b), GSE17703 (c), and GSE60926 (d) and the association between functional modules of RNAs in the TARGET dataset and the clinical characteristics of acute lymphoblastic leukemia patients (e). In the module-trait heat map, each column corresponds to clinical parameters and each row corresponds to a module eigengene. The correlation coefficients are shown at the top of each row. The corresponding p values for each module are displayed at the bottom of each row within parentheses. WBC: white blood cell; MLL: mixed lineage leukemia.
Figure 4Identification of differentially expressed module genes. (a) Heat map of differentially expressed RNAs; (b) Venn diagram to show the overlap between differentially expressed RNAs and module genes.
The optimal prognostic signature.
| Symbol | Module | Expression | Type | Multivariate Cox regression analysis | LASSO coefficient | ||
|---|---|---|---|---|---|---|---|
| HR | 95% CI |
| |||||
| LINC00652 | Turquoise | Downregulated | lncRNA | 0.980 | 0.949-0.993 | 2.30 | -0.00432 |
| INSL3 | Green | Downregulated | mRNA | 1.008 | 1.004-1.012 | 2.41 | 0.00739 |
| NIPAL2 | Yellow | Upregulated | mRNA | 1.004 | 1.001-1.007 | 1.20 | 0.00362 |
| REN | Turquoise | Downregulated | mRNA | 1.024 | 1.004-1.045 | 1.75 | 0.02222 |
| RIMS2 | Turquoise | Upregulated | mRNA | 1.011 | 1.005-1.033 | 3.49 | 0.00769 |
| RPRM | Turquoise | Upregulated | mRNA | 1.020 | 1.006-1.035 | 6.60 | 0.01680 |
| SNAP91 | Turquoise | Downregulated | mRNA | 1.050 | 1.001-1.101 | 4.80 | 0.02990 |
HR: hazard ratio; CI: confidence interval; LASSO: least absolute shrinkage and selection operator.
Figure 5The prognostic performance of the risk score model established by the seven-lncRNA-mRNA signature genes. (a) Kaplan-Meier survival curve of the training dataset, TARGET; (b) Kaplan-Meier survival curve of validation dataset 1, E-MTAB-1216; (c) Kaplan-Meier survival curve of validation dataset 2, E-MTAB-1205; (d) ROC of the training dataset, TARGET; (e) ROC curve of validation dataset 1, E-MTAB-1216; (f) ROC curve of validation dataset 2, E-MTAB-1205. HR: hazard ratio; ROC: receiver operator characteristic curve; AUC: area under the ROC curve.
Univariate and multivariate Cox regression analyses of overall survival.
| Clinical characteristics | TARGET ( | Univariate Cox | Multivariate Cox | ||
|---|---|---|---|---|---|
| HR (95% CI) |
| HR (95% CI) |
| ||
| Age (years, mean ± SD) | 8.53 ± 5.48 | 0.967 (0.639-0.993) | 2.36 | 0.965 (0.903-0.989) | 3.05 |
| Sex (male/female) | 61/44 | 0.779 (0.429-1.413) | 4.10 | — | — |
| WBC at diagnosis (IU, mean ± SD) | 96.28 ± 158.33 | 1.001 (0.999-1.003) | 1.44 | 0.999 (0.996-1.002) | 5.75 |
| Relapse (yes/no) | 10/95 | 1.271 (0.534-3.021) | 5.87 | — | — |
| Immunophenotype (T/B/mixture) | 45/37/23 | 1.903 (1.260-2.875) | 1.78 | 1.779 (0.941-2.851) | 1.68 |
| MLL rearrangement (yes/no) | 8/97 | 3.160 (1.231-8.113) | 1.16 | 1.664 (0.350-7.913) | 5.22 |
| Prognostic score model status (high/low) | 53/52 | 3.409 (1.788-6.501) | 8.28 | 3.787 (1.837-7.808) | 3.10 |
| Death (dead/alive) | 44/61 | — | — | — | — |
| Overall survival (months, mean ± SD) | 41.86 ± 39.65 | — | — | — | — |
SD: standard deviation; WBC: white blood cell; TARGET: Therapeutically Applicable Research to Generate Effective Treatments; HR: hazard ratio; CI: confidence interval; MLL: mixed lineage leukemia.
Figure 6The predictive performance of the seven-lncRNA-mRNA signature for recurrence outcomes in different datasets: (a) ROC curve of TARGET; (b) receiver operator characteristic curve of GSE60926; (c) ROC of GSE28460; (d) ROC curve of GSE17703; (e) ROC curve of E-MTAB-1216; (f) ROC curve of E-MTAB-1205. ROC: receiver operating characteristic curve; AUC: area under the ROC curve.
Validation of the prognostic classifier models for recurrence prediction.
| Accuracy (%) | Positive predicted value (%) | Negative predicted value (%) | Sensitivity (%) | Specificity (%) | |
|---|---|---|---|---|---|
| TARGET | 93.3 | 94.7 | 80.0 | 97.8 | 61.5 |
| GSE60926 | 84 | 72.7 | 92.9 | 88.9 | 81.3 |
| GSE28460 | 71.4 | 67.3 | 75.5 | 73.3 | 69.8 |
| GSE17703 | 90.1 | 93.3 | 63.6 | 95.5 | 53.8 |
| E-MTAB-1216 | 78.8 | 85.7 | 62.5 | 84.2 | 65.2 |
| E-MTAB-1205 | 78 | 82.1 | 72.7 | 79.3 | 76.2 |
Figure 7A coexpression network between prognostic LINC00652 and its differentially expressed mRNAs. Triangle indicates the upregulated RNAs; inverted triangle indicates the downregulated RNAs; the color of nodes is corresponding to the module color (turquoise, green, yellow, and brown); the nodes with larger font are prognostic signature RNAs.
Function enrichment results.
| Category | Term |
| Genes |
|---|---|---|---|
| GO BP | GO:0070328~triglyceride homeostasis | 5.11 | MIA2, LPL, GIP, APOA1 |
| GO BP | GO:0006508~proteolysis | 3.38 | ADGB, CASP5, CELA3B, CLCA2, REN, THSD4, MEP1A, MMP26, TLL1, HABP2 |
| GO BP | GO:0042632~cholesterol homeostasis | 6.93 | MIA2, LPL, APOA1, MTTP |
| GO BP | GO:0007200~phospholipase C-activating G-protein coupled receptor signaling pathway | 7.55 | GALR1, P2RY2, LHCGR, NMBR |
| GO BP | GO:0001662~behavioral fear response | 1.33 | DRD1, GABRA5, NR2E1 |
| GO BP | GO:0060291~long-term synaptic potentiation | 2.23 | DRD1, GIP, NR2E1 |
| GO BP | GO:0007189~adenylate cyclase-activating G-protein coupled receptor signaling pathway | 3.71 | DRD1, GALR1, LHCGR |
| GO BP | GO:0019233~sensory perception of pain | 3.98 | GIP, ALOXE3, SCN10A |
| GO BP | GO:0007186~G-protein coupled receptor signaling pathway | 4.64 | OR5I1, RAMP2, FZD10, APOA1, LHCGR, CCL8, RGS7, OR11A1, BDKRB2, NMBR, OR2W1 |
| KEGG | hsa04080: neuroactive ligand-receptor interaction | 4.50 | GRM5, DRD1, GALR1, P2RY2, GABRA5, LHCGR, BDKRB2, NMBR |
| KEGG | hsa04950: maturity onset diabetes of the young | 1.62 | ONECUT1, GCK, SLC2A2 |
| KEGG | hsa04911: insulin secretion | 2.61 | GIP, GCK, SLC2A2, RIMS2 |
| KEGG | hsa04020: calcium signaling pathway | 4.56 | GRM5, DRD1, TNNC1, LHCGR, BDKRB2 |
| REACTOME | R-HSA-174800: chylomicron-mediated lipid transport | 1.12 | LPL, APOA1, MTTP |
| REACTOME | R-HSA-418555: G alpha (s) signaling events | 1.54 | RAMP2, INSL3, DRD1, GIP, LHCGR |
| REACTOME | R-HSA-416476: G alpha (q) signaling events | 2.473 | GRM5, P2RY2, RGSL1, BDKRB2, NMBR |
| REACTOME | R-HSA-975634: retinoid metabolism and transport | 3.36 | LPL, APOA1, BCO1 |
KEGG: Kyoto Encyclopedia of Genes and Genomes; GO BP: Gene Ontology biological process.