| Literature DB >> 30378276 |
Ying Wu1, Peng-Shuo Wang2, Ben-Gang Wang3, Lu Xu4,5, Wan-Xia Fang4,5, Xiao-Fang Che4,5, Xiu-Juan Qu4,5, Yun-Peng Liu4,5, Zhi Li4,5.
Abstract
The current prognostic long noncoding RNA (lncRNA) signatures for hepatocellular carcinoma (HCC) are still controversial and need to be optimized by systematic bioinformatics analyses with suitable methods and appropriate patients. Therefore, we performed the study to establish a credible lncRNA signature for HCC outcome prediction and explore the related mechanisms. Based on the lncRNA profile and the clinical data of carefully selected HCC patients (n = 164) in TCGA, six of 12727 lncRNAs, MIR22HG, CTC-297N7.9, CTD-2139B15.2, RP11-589N15.2, RP11-343N15.5, and RP11-479G22.8 were identified as the independent predictors of patients' overall survival in HCC by sequential univariate Cox and 1000 times Cox LASSO regression with 10-fold CV, and multivariate Cox analysis with 1000 times bootstrapping. In the Kaplan-Meier analysis with patients trichotomized by the six-lncRNA signature, high-risk patients showed significantly shorter survival than mid- and low-risk patients (log-rank test P < 0.0001). According to the ROCs, the six-lncRNA signature showed superior predictive capacity than the two existing four-lncRNA combinations and the traditional prognostic clinicopathological parameter TNM stage. Furthermore, low MIR22HG and CTC-297N7.9, but high CTD-2139B15.2, RP11-589N15.2, RP11-343N15.5, and RP11-479G22.8, were, respectively, demonstrated to be related with the malignant phenotypes of HCC. Functionally, the six lncRNAs were disclosed to involve in the regulation of multiple cell cycle and stress response-related pathways via mediating transcription regulation and chromatin modification. In conclusion, our study identified a novel six-lncRNA signature for resectable HCC prognosis prediction and indicated the underlying mechanisms of HCC progression and the potential functions of the six lncRNAs awaiting further elucidation.Entities:
Keywords: LncRNA; biomarker; hepatocellular carcinoma; prognosis
Mesh:
Substances:
Year: 2018 PMID: 30378276 PMCID: PMC6308084 DOI: 10.1002/cam4.1854
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
Figure 1Analysis of flowchart illustrates the exploration procedure for the HCC prognostic lncRNAs and the related mechanisms
Figure 2The seed lncRNAs were extracted by 1000 times Cox LASSO regression. A, Highly consistency was demonstrated in the lncRNAs among the 11 extracted lncRNA sets. The left ordinate indicates the seed lncRNA set and the number of seed lncRNAs found by every single iteration of LASSO. The right ordinate is the frequency of the seed lncRNA set disclosed through the 1000 times Cox LASSO regression. The horizontal ordinate is the lncRNA name. The yellow block represents the occurrence of the particular lncRNA in the specific lncRNA set; B, Totally 14 seed lncRNAs with >600 occurrences in the most common lncRNA set were filtered out for further analysis. The blue column indicates the frequency of each lncRNA occurs in the most common lncRNA set
Descriptions of the six lncRNAs
| Hg19_name | Lnc ID | Ensembl_ID | Havana_gene | Gene_type | Chr | Start | End | Strand |
|---|---|---|---|---|---|---|---|---|
| MIR22HG | MIR22HG | ENSG00000186594.8 | OTTHUMG00000132197.5 | lincRNA | chr17 | 1614805 | 1620468 | − |
| CTD‐2139B15.2 | lnc‐BASP1‐1 | ENSG00000248223.1 | OTTHUMG00000161870.1 | lincRNA | chr5 | 17354019 | 17355008 | + |
| CTC‐297N7.9 | lnc‐TMEM220‐1 | ENSG00000264016.2 | OTTHUMG00000178048.3 | lincRNA | chr17 | 10644584 | 10672333 | − |
| RP11‐589N15.2 | lnc‐DEFB136‐2 | ENSG00000269899.1 | OTTHUMG00000183972.1 | sense_intronic | chr8 | 11703663 | 11703900 | − |
| RP11‐343N15.5 | lnc‐FCGR1B‐12 | ENSG00000269996.1 | OTTHUMG00000184096.1 | lincRNA | chr1 | 121133256 | 121134581 | + |
| RP11‐479G22.8 | lnc‐ITGB1‐1 | ENSG00000273038.1 | OTTHUMG00000186205.1 | lincRNA | chr10 | 33176189 | 33178239 | − |
The correlations of the six lncRNAs with patients' overall survival in HCC based on TCGA dataset using uni‐ and multivariate Cox analysis
| Gene | Univariate Cox | Multivariate Cox | |||
|---|---|---|---|---|---|
| HR (95% CI) |
| HR (95% CI) |
| Bootstrapping 95% CI | |
| MIR22HG | 0.80 (0.68‐0.94) | 0.0069 | 0.79 (0.68‐0.92) | 0.0023 | 0.65‐0.91 |
| CTD‐2139B15.2 | 1.06 (1.03‐1.09) | <0.0001 | 1.05 (1.02‐1.08) | 0.0022 | 1.02‐1.10 |
| CTC‐297N7.9 | 0.72 (0.58‐0.90) | 0.0032 | 0.73 (0.58‐0.93) | 0.0101 | 0.58‐0.94 |
| RP11‐589N15.2 | 1.15 (1.04‐1.27) | 0.0059 | 1.16 (1.03‐1.31) | 0.0166 | 1.05‐1.32 |
| RP11‐343N15.5 | 1.09 (1.03‐1.16) | 0.0047 | 1.10 (1.03‐1.17) | 0.0065 | 1.03‐1.18 |
| RP11‐479G22.8 | 1.18 (1.09‐1.28) | <0.0001 | 1.12 (1.03‐1.23) | 0.0119 | 1.03‐1.30 |
Figure 3The prognostic significance and superiority of the novel six‐lncRNA signature were, respectively, illustrated by the Kaplan‐Meier (K‐M) analysis and the time‐dependent ROC curve analysis. A, K‐M analysis identified the prognosis significance of the six‐lncRNA signature for OS of patients in HCC. B, The six‐lncRNA signature was not only superior to the two published four‐lncRNA groups, but also increased the prediction accuracy of traditional prognostic TNM stage for OS. 6‐LNC: our six‐lncRNA model; CM_4‐LNC: the four‐lncRNA group published in Cancer Medicine; PJ_4‐LNC: the four‐lncRNA signature disclosed in Peer Journal; Stage and 6‐LNC: the prognostic model established based on both our six‐lncRNA signature and TNM stage
The relationships of the 6 lncRNA level with the CPPs of 164 patients in TCGA
| Variable | N (%) | MIR22HG | CTD‐2139B15.2 | CTC‐297N7.9 | RP11‐589N15.2 | RP11‐343N15.5 | RP11‐479G22.8 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Med (IQR) |
| Med (IQR) |
| Med (IQR) |
| Med (IQR) |
| Med (IQR) |
| Med (IQR) |
| ||
| Sex | — | — | — | 0.0317 (0.2536) | — | — | |||||||
| Female | 56 (34.1) | — | — | — | 1.88 (0.59‐2.32) | — | — | ||||||
| Male | 108 (65.9) | — | — | — | 1.65 (0.89‐2.78) | — | — | ||||||
| PS | — | 0.0023 (0.1110) | 0.0444 (0.2839) | — | — | — | |||||||
| 0 | 60 (49.2) | — | 6.06 (4.36‐9.46) | 1.63 (1.01‐2.82) | — | — | — | ||||||
| 1 | 39 (32.0) | — | 7.20 (5.80‐10.98) | 1.89 (1.33‐3.09) | — | — | — | ||||||
| 2 | 15 (12.3) | — | 9.05 (8.16‐13.57) | 0.76 (0.67‐1.44) | — | — | — | ||||||
| ≥2 | 8 (6.6) | — | 11.94 (6.19‐15.26) | 0.75 (0.42‐1.39) | — | — | — | ||||||
| CPG | 0.0135 (0.1848) | — | — | — | — | — | |||||||
| A | 78 (47.6) | 2.33 (1.58‐3.34) | — | — | — | — | — | ||||||
| B | 17 (10.4) | 1.37 (1.08‐2.83) | — | — | — | — | — | ||||||
| Grade | — | 0.0004 (0.0371) | 0.0059 (0.1678) | — | — | 0.0430 (0.2839) | |||||||
| 1 | 32 (19.5) | — | 6.61 (3.91‐8.23) | 1.98 (1.17‐3.17) | — | — | 1.07 (0.57 ‐ 1.54) | ||||||
| 2 | 78 (47.6) | — | 7.11 (4.91‐9.49) | 1.48 (0.90‐2.34) | — | — | 1.50 (0.92‐2.43) | ||||||
| 3 | 50 (30.5) | — | 9.12 (6.69‐13.35) | 1.29 (0.64‐2.15) | — | — | 1.57 (0.97‐2.63) | ||||||
| T stage | 0.0381 (0.2811) | — | 0.0216 (0.1882) | — | 0.0196 (0.1882) | 0.0160 (0.1882) | |||||||
| T1 | 2 (1.2) | 2.52 (1.67‐4.09) | — | 1.71 (1.19‐2.77) | — | 3.67 (2.54‐5.19) | 1.19 (0.81‐2.06) | ||||||
| T2 | 43 (26.2) | 2.29 (1.32‐3.04) | — | 1.09 (0.60‐1.93) | — | 4.38 (3.04‐6.52) | 1.37 (0.97‐2.27) | ||||||
| T3‐4 | 53 (32.3) | 1.95 (1.27‐2.80) | — | 1.51 (0.76‐2.18) | — | 4.55 (3.16‐7.92) | 1.74 (1.00‐3.59) | ||||||
| Stage | — | — | 0.0087 (0.1678) | — | 0.0087 (0.1678) | 0.0211 (0.1882) | |||||||
| I | 64 (39.0) | — | — | 1.71 (1.20‐2.90) | — | 3.67 (2.51‐5.21) | 1.19 (0.80‐2.11) | ||||||
| II | 40 (24.4) | — | — | 1.07 (0.61‐2.03) | — | 4.02 (2.94‐6.11) | 1.36 (0.92‐2.31) | ||||||
| III | 51 (31.1) | — | — | 1.40 (0.73‐2.15) | — | 4.80 (3.31‐8.05) | 1.70 (1.01‐3.36) | ||||||
| ALB | — | — | — | — | −0.05 (−0.18‐0.03) | 0.0121 (0.1848) | — | — | — | — | — | — | |
CPG, Child‐Pugh grade; CPPs, clinicopathological characteristics; IQR, interquartile range; Med, median; PS, performance status.
Figure 4The significant trends in the associations of MIR22HG, CTD‐2139B15.2, and CTC‐297N7.9 expression with clinicopathological characteristics of the cohort. Low level of MIR22HG significantly indicated advanced T classification (A) and higher Child‐Pugh score (B) of HCC patients. Higher CTD‐2139B15.2 predicted higher ECOG PS score (C) and tumor grade (D). CTC‐297N7.9 was associated with low ECOG PS score of patients (E), the low grade (F), and early T classification (G) and TNM stage (H) of tumors, and the low serum levels of ALB (I). For categorical variables, the data distributions of each lncRNA were illustrated by the box plots. In the plot, the upper and lower hinge and the inner line of the boxes indicate the first and third quartile and the median value of the data, respectively, and the black dots represent the outlier values. For continuous variables, we used scatter plot with trend line illustrating their relationships with the level of each lncRNA
Figure 5The analysis for the significant trends in the relationships between the levels of RP11‐589N15.2, RP11‐343N15.5, and RP11‐479G22.8 and the clinicopathological parameters of the HCC cohort. RP11‐589N15.2 was higher in male patients (A). RP11‐343N15.5 significantly predicted advanced T classification (B) and TNM stage (C) of HCC. Patients with advanced tumor grade (D), T classification (E), and TNM stage (F) expressed higher RP11‐479G22.8. For categorical variables, the data distributions of each lncRNA were illustrated by the box plots. In the plot, the upper and lower hinge and the inner line of the boxes indicate the first and third quartile and the median value of the data, respectively, and the black dots represent the outlier values.
Gene set enrichment analysis and leading‐edge gene assays according to the levels of the 6 lncRNAs in TCGA
| Size | ES |
| FDR.q. | Rankmax | Leading edge | ||
|---|---|---|---|---|---|---|---|
| MIR22HG | DNA_REPAIR | 149 | −0.51 | 0.0043 | 0.1123 | 3741 | tags = 48%, list = 18%, signal = 59% |
| MYC_TARGETS_V1 | 195 | −0.59 | 0.0142 | 0.0791 | 3847 | tags = 54%, list = 19%, signal = 66% | |
| E2F_TARGETS | 193 | −0.68 | 0.0122 | 0.0527 | 2780 | tags = 56%, list = 14%, signal = 64% | |
| MYC_TARGETS_V2 | 58 | −0.62 | 0.0354 | 0.1012 | 4306 | tags = 60%, list = 21%, signal = 76% | |
| CTD‐2139B15.2 | MITORC1_SIGNALING | 198 | 0.56 | <0.0001 | 0.0039 | 3572 | tags = 46%, list = 17%, signal = 56% |
| UV_RESPONSE_UP | 158 | 0.45 | 0.0017 | 0.0717 | 4141 | tags = 41%, list = 20%, signal = 50% | |
| MYC_TARGETS_V1 | 195 | 0.65 | 0.0077 | 0.0711 | 3519 | tags = 59%, list = 17%, signal = 70% | |
| UNFOLDED_PROTEIN_REPONSE | 113 | 0.49 | 0.0037 | 0.0658 | 4531 | tags = 49%, list = 22%, signal = 62% | |
| GLYCOLYSIS | 199 | 0.44 | 0.0107 | 0.0615 | 3943 | tags = 38%, list = 19%, signal = 46% | |
| MYC_TARGETS_V2 | 58 | 0.65 | 0.0277 | 0.1314 | 3324 | tags = 60%, list = 16%, signal = 72% | |
| P53_PATHWAY | 196 | 0.40 | 0.0086 | 0.1377 | 4259 | tags = 38%, list = 21%, signal = 47% | |
| REACTIVE_OXIGEN_SPECIES_PATHWAY | 48 | 0.55 | 0.0383 | 0.1401 | 4553 | tags = 50%, list = 22%, signal = 64% | |
| DNA_REPAIR | 149 | 0.46 | 0.0302 | 0.1549 | 5511 | tags = 54%, list = 27%, signal = 73% | |
| CTC‐297N7.9 | E2F_TARGETS | 193 | −0.73 | <0.0001 | 0.0106 | 3063 | tags = 70%, list = 15%, signal = 81% |
| G2M_CHECKPOINT | 194 | −0.70 | 0.0021 | 0.0057 | 3596 | tags = 68%, list = 18%, signal = 81% | |
| MITOTIC_SPINDLE | 198 | −0.54 | 0.0022 | 0.0560 | 3759 | tags = 42%, list = 18%, signal = 51% | |
| RP11‐589 N15.2 | CHOLESTEROL_HOMEOSTASIS | 74 | 0.70 | <0.0001 | <0.0001 | 1266 | tags = 43%, list = 6%, signal = 46% |
| MITORC1_SIGNALING | 198 | 0.55 | 0.0038 | 0.0140 | 3797 | tags = 48%, list = 18%, signal = 59% | |
| REACTIVE_OXIGEN_SPECIES_PATHWAY | 48 | 0.62 | 0.0115 | 0.0954 | 3769 | tags = 60%, list = 18%, signal = 74% | |
| PEROXISOME | 102 | 0.49 | 0.0218 | 0.1735 | 5318 | tags = 56%, list = 26%, signal = 75% | |
| RP11‐343N15.5 | G2M_CHECKPOINT | 194 | 0.69 | 0.0057 | 0.0746 | 4034 | tags = 69%, list = 20%, signal = 85% |
| MITOTIC_SPINDLE | 198 | 0.54 | 0.0056 | 0.1026 | 4703 | tags = 53%, list = 23%, signal = 67% | |
| E2F_TARGETS | 193 | 0.70 | 0.0114 | 0.0854 | 3367 | tags = 67%, list = 16%, signal = 79% |
Figure 6Gene set enrichment analysis for the six prognostic lncRNAs screened out multiple hallmark gene sets related with cell cycle, DNA repair, cell homeostasis, or stress response. The network (A) and the heat map (B) illustrate the significant gene sets enriched by the six lncRNAs. In the network (A), the brown nodes represent the enriched hallmark gene sets, the purple nodes represent the lncRNAs related with poorer prognosis, and the green nodes represent the lncRNAs related with better prognosis of patients. In the heat map (B), the yellow block indicates the particular hallmark gene set enriched by the specific lncRNA
Figure 7GO‐MF analysis for the coexpression mRNAs of RP11‐479G22.8 and RP11‐343N15.5. The cnetplot illustration was used to visualize the GO‐MF terms enriched by the coexpressed mRNAs of RP11‐479G22.8 (A) and RP11‐343N15.5 (B). The brown node represents the enriched GO‐MF term, with the size indicating the overall number of its included mRNAs. The other smaller nodes are the enriched mRNAs, and the node colors changing from green to red indicate the increased associations of the mRNAs with the specific lncRNA
GO‐MF assays according to the levels of the six‐lncRNA‐related mRNAs in TCGA
| ID | Description | GeneRatio | BgRatio |
|
|
| Count | |
|---|---|---|---|---|---|---|---|---|
| CTD‐2139B15.2 | GO:0004812 | Aminoacyl‐tRNA ligase activity | 4/46 | 44/16982 | 5.91E−06 | 0.00041374 | 0.0003102 | 4 |
| GO:0016875 | Ligase activity, forming carbon‐oxygen bonds | 4/46 | 44/16982 | 5.91E−06 | 0.00041374 | 0.0003102 | 4 | |
| GO:0016876 | Ligase activity, forming aminoacyl‐tRNA and related compounds | 4/46 | 44/16982 | 5.91E−06 | 0.00041374 | 0.0003102 | 4 | |
| GO:0016874 | Ligase activity | 6/46 | 177/16982 | 7.81E−06 | 0.00041374 | 0.0003102 | 6 | |
| GO:0000287 | Magnesium ion binding | 5/46 | 196/16982 | 0.0001816 | 0.00770244 | 0.00577491 | 5 | |
| CTC‐297N7.9 | GO:0048037 | Cofactor binding | 50/714 | 264/16982 | 1.72E−19 | 1.26E−16 | 1.10E−16 | 50 |
| GO:0016614 | Oxidoreductase activity, acting on CH‐OH group of donors | 27/714 | 137/16982 | 1.55E−11 | 3.80E−09 | 3.29E−09 | 27 | |
| GO:0003682 | Chromatin binding | 46/714 | 482/16982 | 1.91E−07 | 2.50E−05 | 2.16E−05 | 46 | |
| GO:0051287 | NAD binding | 13/714 | 53/16982 | 2.04E−07 | 2.50E−05 | 2.16E−05 | 13 | |
| GO:0008094 | DNA‐dependent ATPase activity | 15/714 | 76/16982 | 5.10E−07 | 5.34E−05 | 4.63E−05 | 15 | |
| RP11‐343N15.5 | GO:0015631 | Tubulin binding | 12/214 | 289/16982 | 0.00031283 | 0.04698072 | 0.04389422 | 12 |
| GO:0031491 | Nucleosome binding | 5/214 | 49/16982 | 0.00036839 | 0.04698072 | 0.04389422 | 5 | |
| GO:0003777 | Microtubule motor activity | 6/214 | 77/16982 | 0.00042041 | 0.04698072 | 0.04389422 | 6 | |
| GO:0008017 | Microtubule binding | 10/214 | 217/16982 | 0.0004401 | 0.04698072 | 0.04389422 | 10 | |
| RP11‐479G22.8 | GO:0016645 | Oxidoreductase activity, acting on the CH‐NH group of donors | 7/170 | 28/16982 | 8.83E−09 | 3.72E−06 | 3.18E−06 | 7 |
| GO:0048037 | Cofactor binding | 14/170 | 264/16982 | 4.39E−07 | 9.24E−05 | 7.90E−05 | 14 | |
| GO:0016614 | Oxidoreductase activity, acting on CH‐OH group of donors | 10/170 | 137/16982 | 1.19E−06 | 0.00012535 | 0.00010719 | 10 | |
| GO:0016408 | C‐acyltransferase activity | 4/170 | 20/16982 | 4.14E−05 | 0.00249223 | 0.00213112 | 4 | |
| GO:0016903 | Oxidoreductase activity, acting on the aldehyde or oxo group of donors | 5/170 | 45/16982 | 8.38E−05 | 0.00440768 | 0.00376904 | 5 |
The first five GO‐MF terms based on P values were listed.