Literature DB >> 32474975

A four-long noncoding RNA signature predicts survival of hepatocellular carcinoma patients.

Haitao Jiang1,2,3, Lianhe Zhao4, Yunjie Chen1,2,3, Liang Sun4.   

Abstract

BACKGROUND: Hepatocellular carcinoma (HCC) is a common neoplasm located in the liver. Accumulating evidence has highlighted that long noncoding RNAs (lncRNAs) are correlated with the survival of HCC patients. This study focuses on finding a lncRNA signature to predict the prognostic risk of HCC patients.
METHODS: Statistical and machine learning analyses were conducted to analyze the lncRNA expression data and corresponding clinical data of 180 HCC patients collected from the public online Tanric and The Cancer Genome Atlas (TCGA) databases.
RESULTS: From the training dataset, we obtained the four-lncRNA model comprising RP11-495K9.6, RP11-96O20.2, RP11-359K18.3, and LINC00556 which can divide HCC patients into two different groups with significantly different prognosis (n = 90, median 1.81, 95% confidence interval [CI]: 1.50-4.91 vs 8.56 years, 95% CI: 6.96-9.97, log-rank test P < .001). The test dataset confirmed the prognostic ability of the signature (n = 90, median 1.95, 95% CI: 1.14-4.08 vs 5.80 years, 95% CI: 3.11-6.82, log-rank test P = .007). Receiver operating characteristic curve displayed the better prediction efficiency of the four-lncRNA signature than the tumor/node/metastasis stage. Cox analysis showed the four-lncRNA signature was an independent predictor of HCC prognosis.
CONCLUSION: The four-lncRNA signature can be used as an independent biomarker for HCC patients to predict the prognostic risk.
© 2020 The Authors. Journal of Clinical Laboratory Analysis published by Wiley Periodicals LLC.

Entities:  

Keywords:  biomarker; hepatocellular carcinoma; lncRNA; prognostic; signature

Mesh:

Substances:

Year:  2020        PMID: 32474975      PMCID: PMC7521318          DOI: 10.1002/jcla.23377

Source DB:  PubMed          Journal:  J Clin Lab Anal        ISSN: 0887-8013            Impact factor:   2.352


confidence interval hepatocellular carcinoma long noncoding RNA The Cancer Genome Atlas

INTRODUCTION

Hepatocellular carcinoma (HCC) is a refractory tumor that kills 746 000 people every year, , ranked as the third cause of cancer‐induced death. The main reasons for the high mortality of HCC are the following two points. First, the disease is insidious and difficult to be detected early; thus, most of the HCC patients are diagnosed at advanced stages when they are in poor physical condition and miss the opportunity of surgery; second, there are few effective treatments for patients with advanced HCC who are not only insensitive to radiotherapy but also poorly responsive to conventional chemotherapy drugs. In recent years, it has been recognized that molecular characteristics are closely related to the prognosis and therapeutic effectiveness of HCC patients. Therefore, identifying molecular indicators will result in more accurate prognostic judgments and improved treatments, which are urgently needed for HCC patients. Long noncoding RNAs (lncRNAs) are a group of noncoding RNAs with the length more than 200 bp. , Recent studies have found that lncRNAs play important roles in the regulation of important biological processes in various types of cancer, especially the oncogenic or onco‐suppressive role, , implying the potential of lncRNAs as biomarkers and therapeutic targets for cancer. , In addition, the prognostic role of lncRNA in HCC has been reported in many studies. For instance, lncRNA PTTG3P was found to be associated with short survival in HCC patients and could be used as an unfavorable prognostic predictor. LncRNA ASB16AS1 was demonstrated to promote the malignant behavior of HCC through regulating miR‐1827/FZD4/Wnt/β‐catenin pathway and has the prognostic value. CTC‐297N7.9 was observed to be high expressed in HCC patients with good prognosis, indicating its protective role. Subsequently, due to better prediction performance than a single lncRNA molecule, lncRNA signatures for HCC prognosis prediction are being discovered. , , In the present study, we aimed to identify lncRNAs that could predict outcomes of HCC patients and construct a prognostic lncRNA signature based on lncRNA expression profile data of HCC from the The Cancer Genome Atlas (TCGA) and Tanric databases.

MATERIALS AND METHODS

Construction process of the lncRNA risk score model

LncRNA transcriptome expression data of 180 HCC patients were downloaded from the Tanric database (https://www.tanric.org/home). Corresponding clinical information of 180 HCC patients was downloaded from TCGA database (https://xenabrowser.net/datapages/). We omitted lncRNAs expressing value with coefficient of variance >0.1 and selected survival‐related lncRNAs from training samples by performing Cox analysis (P < .05). Then, we used the random survival forests‐variable hunting algorithm to further filter nodes until nine lncRNAs were screened out. We developed risk score models to estimate prognosis risk as follows , : , where N represents the lncRNAs number in the model, lncRNAexp is the lncRNAs expression value, and coefficientCOXi is the coefficient of lncRNAs in the Cox analysis. We selected signatures which predicted the HCC OS with AUC > 0.7 and log‐rank P < .05 from all 29‐1 = 511 signatures.

Statistical analysis

We used R program, including pROC, TimeROC, Survival, and RandomForestSRC (from Bioconductor: http://www.bioconductor.org/) to perform statistics and machine learning analysis. Using the receiver operating characteristic (ROC) and the Time ROC analysis, , we compared the prognostic performance of tumor/node/metastasis (TNM) stage and the lncRNA signature. Cox analysis was performed on the data processing to identify the prognostic factors with significance defined as P < .05. Pearson's test with P < .05 and the Pearson coefficient >0.2 <−0.2 were used to select co‐expressed protein‐coding genes with lncRNAs which were visualized by Cytoscape (3.2.3). We performed Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) enrichment analysis by the R package clusterProfiler.

RESULTS

Constructing the lncRNA signature for predicting HCC prognosis in the training group

Table 1 displayed the detailed clinical information of the 180 HCC patients. The median age of the enrolled patients was 63 years (20‐90 years) including 67 female and 113 male patients. A total of 165 HCC patients were categorized as TNM stage I to IV. These 180 HCC patients were randomly divided into two groups, one as the training (n = 90) group and one as the test group (n = 90). We constructed prognostic lncRNA signature from the training group and then verified its predictive power in the test group.
TABLE 1

Clinicopathological parameters of hepatocellular carcinoma patients in each cohort

CharacteristicTraining setTesting set
Age (y)
>634844
≤634246
Sex
Female2839
Male6251
Vital status
Living5947
Dead3143
Tumor/node/metastasis stage
I3734
II2222
III2621
IV12
Unknown411
Clinicopathological parameters of hepatocellular carcinoma patients in each cohort First, we selected 9683 lncRNAs with coefficient of variance <0.1 based on their expression value from 12 727 lncRNAs. Then, we used univariate Cox regression analysis and got a 642‐lncRNA set associated with HCC patient OS (Figure 1A, P < .05). Finally, through random survival forests analysis, we obtained 9 prognostic lncRNAs according to importance score (Figure 1A,B).
FIGURE 1

Constructing the prognostic long noncoding RNA (lncRNA) signature in the training dataset. A, The process of selecting the survival‐related lncRNAs. B, Based on the associated expression score, random survival forests‐variable hunting analysis was performed to filter lncRNAs. C, Receiver operating characteristic analysis of the selected signature

Constructing the prognostic long noncoding RNA (lncRNA) signature in the training dataset. A, The process of selecting the survival‐related lncRNAs. B, Based on the associated expression score, random survival forests‐variable hunting analysis was performed to filter lncRNAs. C, Receiver operating characteristic analysis of the selected signature Kaplan‐Meier and ROC analyses were performed on 29‐1 = 511 signatures. The lncRNA combination including RP11‐495K9.6, RP11‐96O20.2, RP11‐359K18.3, and LINC00556 was considered as the final lncRNA signature since its AUC value was the largest (AUC > 0.70) and log‐rank P < .001 (Figure 1C). The lncRNA signature risk score (Table 2) = (1.13 × RP11‐495K9.6 expression value) + (1.35 × RP11‐96O20.2 expression value) + (1.42 × RP11‐359K18.3 expression value) + (2.17 × LINC00556 expression value).
TABLE 2

The feature of the long noncoding RNAs (lncRNAs) in the prognostic expression signature

lncRNA nameEnsembl IDCoefficient a P value a Gene expression level association with poor prognosis
RP11‐495K9.6ENSG000002499261.13.01High
RP11‐96O20.2ENSG000002596811.35.01High
RP11‐359K18.3ENSG000002597881.42<.001High
LINC00556ENSG000002601312.17<.001High

Derived from the univariable Cox analysis in the training set.

The feature of the long noncoding RNAs (lncRNAs) in the prognostic expression signature Derived from the univariable Cox analysis in the training set.

The predictive performance of the four‐lncRNA signature

Based on the four‐lncRNA signature, HCC patients obtained their risk scores. We used the median risk score as a cutoff point for Kaplan‐Meier analysis, and HCC patients in the training group (n = 90) were subgrouped into two risk groups with significantly different survival. The median survival of the high‐risk group was shorter than that of the low‐risk group (median survival time: 1.81 years, 95% confidence interval [CI]: 1.50‐4.91 vs 8.56, 95% CI: 6.96‐9.97, log‐rank test P < .001; Figure 2A). Then, we test the survival predictive performance of the signature in the test set. Kaplan‐Meier result revealed the outcome of high‐risk patients were significantly different from low‐risk patients (median survival time: 1.95, 95% CI: 1.14‐4.08 vs 5.80 years, 95% CI: 3.11‐6.82, P = .007; Figure 2B). At last, we tested the risk identification ability of the signature in the entire TCGA dataset (n = 180) and the Kaplan‐Meier result showed that the HCC patients of the low‐risk group (n = 90) outlived ones in high‐risk group (n = 90) in Figure 2C (log‐rank P < .001).
FIGURE 2

The performance of the four‐long noncoding RNAs (lncRNA) signature in Hepatocellular carcinoma prognosis prediction. A‐C, Kaplan‐Meier analysis of the SIGNATURE in the training, test, and entire The Cancer Genome Atlas datasets. D‐F, Comparing the survival prediction power between the lncRNA signature and tumor/node/metastasis stage by receiver operating characteristic in the training, test, and entire datasets

The performance of the four‐long noncoding RNAs (lncRNA) signature in Hepatocellular carcinoma prognosis prediction. A‐C, Kaplan‐Meier analysis of the SIGNATURE in the training, test, and entire The Cancer Genome Atlas datasets. D‐F, Comparing the survival prediction power between the lncRNA signature and tumor/node/metastasis stage by receiver operating characteristic in the training, test, and entire datasets

Prognostic independence test of the four‐lncRNA signature

Chi‐square test found there was no correlation between the signature and other clinical features (Table 3). We further performed univariable and multivariable Cox analysis to evaluate the prognostic independence of the four‐lncRNA signature. As shown in Table 4, the four‐lncRNA signature was proved to be an independent indicator in the training group (high‐risk vs low‐risk, HR = 3.95, 95% CI 3.65‐8.90, P < .001, n = 90). The test group and the entire TCGA set verified the accuracy of the independence test (HR = 2. 38, 95% CI 1.14‐4.96, P = .02, n = 90; HR = 3.82, 95% CI 2.17‐6.71, P < .001, n = 180).
TABLE 3

Association of the long noncoding RNA signature with clinicopathological characteristics in the hepatocellular carcinoma patients

VariablesTrain group P Test group P Entire group P
Low risk a High risk a Low risk a High risk a Low risk a High risk a
Age (y)
>631725.142123.833848.18
≤63282024225242
Sex
Female1018.112316.203334.35
Male352722295756
M stage
M03932.163129.277061.21
M1012021
N stage
N02831.372923.315754.62
N1200121
N2141416213035
T stage
T12217.091422.223639.17
T21491793118
T38179101727
T4025355
Tumor/node/metastasis stage
I2017.221420.203437.29
II1391482717
III91711102027
IV012021

Low risk ≤ median of risk score; high risk > median of risk score; the chi‐squared test; P value < .05 was considered significant.

TABLE 4

Univariable and multivariable Cox regression analysis of the lncRNA signature with survival of hepatocellular carcinoma patients in the training group, test group, and entire group

VariablesThe training set (n = 90)The Test set (n = 90)The TCGA dataset (n = 180)
HR95% CI of HR P HR95% CI of HR P HR95% CI of HR P
LowerUpperLowerUpperLowerUpper
Univariable analysis
Age
>63 vs ≤630.760.371.55.441.510.792.88.221.090.681.74.73
Sex
Male vs female1.600.733.50.241.150.622.13.651.260.782.03.34
TNM stage
IV + III vs I + II1.360.902.06.151.240.851.81.271.300.981.71.07
lncRNA signature
High risk vs low risk3.343.237.03<.0012.031.083.84.033.562.116.00<.001
Multivariable analysis
Age
>63 vs ≤630.930.432.01.851.450.712.97.311.180.711.98.52
Sex
Male vs female2.591.096.15.031.130.552.32.731.340.802.22.27
TNM stage
IV + III vs I + II1.100.711.70.681.400.942.08.101.351.021.78.04
lncRNA signature
High risk vs low risk3.953.658.90<.0012.381.144.96.023.822.176.71<.001

Abbreviation: TNM, tumor/node/metastasis.

Association of the long noncoding RNA signature with clinicopathological characteristics in the hepatocellular carcinoma patients Low risk ≤ median of risk score; high risk > median of risk score; the chi‐squared test; P value < .05 was considered significant. Univariable and multivariable Cox regression analysis of the lncRNA signature with survival of hepatocellular carcinoma patients in the training group, test group, and entire group Abbreviation: TNM, tumor/node/metastasis.

Comparison of the lncRNA signature with TNM stage system

Receiver operating characteristic analyses found that the AUC value of the lncRNA signature was greater than that of the TNM stage system in the training, test, and entire datasets (n = 90/90/180), (lncRNA model‐AUC = 0.73/0.62/0.67 vs TNM‐AUC = 0.60/0.60/0.60, Figure 2D‐F), demonstrating the lncRNA signature had better survival predictive performance. Combining the lncRNA signature and the TNM stage had the largest AUC value, indicating the signature could be used as an auxiliary prognostic marker (Both‐AUC = 0.76/0.65/0.71, Figure 2D‐F). On the other hand, the result of TimeROC demonstrated that the predictive ability of lncRNA signature outperformed that of the TNM stage. The AUCs of the four‐lncRNA signature in the training group were 0.75/0.75/0.72/0.78 at 2/3/4/5 years, greater than the corresponding AUC values of TNM stage (Figure 3A,B). Similar results were also visible in the entire TCGA dataset (signature‐AUC training = 0.67/0.65/0.62/0.69 at 2/3/4/5 years vs TNM‐AUC training = 0.50/0.57/0.58/0.61 at 2/3/4/5 years, Figure 3C,D).
FIGURE 3

TimeROC analysis of the signature and tumor/node/metastasis stage for the survival prediction at 2, 3, 4, and 5 y in the training (A, B) and test dataset (C, D)

TimeROC analysis of the signature and tumor/node/metastasis stage for the survival prediction at 2, 3, 4, and 5 y in the training (A, B) and test dataset (C, D)

Stratified analysis for TNM stage

Combined the TNM stage with lncRNA signature risk scores, we stratified the HCC patients into different subgroups. HCC patients with TNM I + II stage were stratified into high‐risk and low‐risk subgroups. Kaplan‐Meier analysis showed there was a significant difference in survival time between the two subgroups (log‐rank test P < .001, Figure 4A). HCC patients with TNM III + IV stage were also divided into two risk subgroups with different survival (log‐rank test P = .0043, Figure 4B).
FIGURE 4

The four‐long noncoding RNA signature stratified tumor/node/metastasis low/high stage into two groups with different survival in the entire dataset (A, B)

The four‐long noncoding RNA signature stratified tumor/node/metastasis low/high stage into two groups with different survival in the entire dataset (A, B)

Function prediction of the four lncRNAs in the signature

First, we used Pearson's test to compute the co‐expressed mRNAs with the four lncRNAs in the entire TCGA dataset (n = 180). A total of 749 mRNAs were selected which were co‐expressed with at least one of the four lncRNAs (coefficient >0.2/<−0.2, P < .05, Table S1, Figure 5A). Then, we used those co‐expressed genes to predict the biological function of the four lncRNAs. We found the four lncRNAs were enriched in 27 GO terms and KEGG pathways and the top 20 pathways were visualized in Figure 5B, such as DNA replication and cell cycle checkpoint (P < .05 Figure 5B).
FIGURE 5

Function prediction of the four long noncoding RNAs (lncRNAs). A, The co‐expression network. B, Function analysis of the four prognostic lncRNAs

Function prediction of the four long noncoding RNAs (lncRNAs). A, The co‐expression network. B, Function analysis of the four prognostic lncRNAs

DISCUSSION

A vast amount of research suggests that lncRNAs might serve as biomarkers in the diagnosis and prognosis of various tumors, including HCC. In addition, lncRNA has the advantage of being a marker because it is easy to detect in body fluids. Thus, there have been many articles on the prognostic lncRNA markers of HCC. Based on high throughput sequencing data, lncRNAs associated with the HCC prognosis have been identified, such as ASB16AS1, LINC01138, and CTC‐297N7.9. , , These lncRNAs were found play important roles in HCC carcinogenesis through regulating tumor proliferation and migration. Because of its better predictive efficacy, lncRNA signatures have been developed for prognostic prediction in many cancers such as esophageal squamous cell carcinoma, glioblastoma, lung adenocarcinoma, and pancreatic ductal adenocarcinoma, among others. , , , In this study, we collected and downloaded the expression data and clinical information of HCC cohort from Tanric and TCGA. Using statistical and machine learning analysis, we found 642 lncRNAs significantly correlated with overall survival and constructed a four‐lncRNA signature which was proved to be a reliable indicator of HCC survival in 180 samples. The independence test detected the survival prediction ability of the four‐lncRNA signature in HCC was not affected by age, gender, and TNM stage. In addition, stratification analysis discovered the four‐lncRNA signature or the four‐lncRNA– based risk score model can further subdivide HCC patients at same TNM stage into different risk groups with significantly different outcomes, suggesting that the four‐lncRNA signature can be used as an assistant prognostic model for TNM stage in HCC. Moreover, we found high expression of RP11‐495K9.6, RP11‐96O20.2, RP11‐359K18.3, and LINC00556 was correlated with poor prognosis of HCC patients (HR > 1, P < .05). Since the function of these four lncRNAs has not been reported yet, we performed Go and KEGG analysis and found that the coding genes co‐expressed with the four lncRNAs were enriched in terms related to DNA replication and repair, indicating that the four lncRNAs in the signature may participate in the HCC progression through DNA replication and repair related pathways. The specific mechanism of these lncRNAs regulates the prognosis of HCC remains to be elucidated. In summary, using statistical and machine learning analyses, we constructed a four‐lncRNA signature including RP11‐495K9.6, RP11‐96O20.2, RP11‐359K18.3, and LINC00556 which could be used effectively to predict clinical outcome of HCC patients. The four‐lncRNA signature exerts great applicable value in prognosis prediction, therapy selection, and disease recognition.

CONFLICT OF INTEREST

The authors declare that they have no competing interests.

AUTHORS' CONTRIBUTIONS

Haitao Jiang contributed to data analysis, interpretation, and drafting. Lianhe Zhao contributed to data collection. Yunjie Chen and Liang Sun involved in study design, study supervision, and final approval of the article. All authors read and approved the final article. TblS1 Click here for additional data file.
  28 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

Review 2.  Non-coding RNAs in human disease.

Authors:  Manel Esteller
Journal:  Nat Rev Genet       Date:  2011-11-18       Impact factor: 53.242

Review 3.  Long Noncoding RNA and Cancer: A New Paradigm.

Authors:  Arunoday Bhan; Milad Soleimani; Subhrangsu S Mandal
Journal:  Cancer Res       Date:  2017-07-12       Impact factor: 12.701

Review 4.  Guide for diagnosis and treatment of hepatocellular carcinoma.

Authors:  Magdy Hamed Attwa; Shahira Aly El-Etreby
Journal:  World J Hepatol       Date:  2015-06-28

5.  TANRIC: An Interactive Open Platform to Explore the Function of lncRNAs in Cancer.

Authors:  Jun Li; Leng Han; Paul Roebuck; Lixia Diao; Lingxiang Liu; Yuan Yuan; John N Weinstein; Han Liang
Journal:  Cancer Res       Date:  2015-07-24       Impact factor: 12.701

Review 6.  The emergence of lncRNAs in cancer biology.

Authors:  John R Prensner; Arul M Chinnaiyan
Journal:  Cancer Discov       Date:  2011-10       Impact factor: 39.397

7.  Identification of a five-long non-coding RNA signature to improve the prognosis prediction for patients with hepatocellular carcinoma.

Authors:  Qiu-Jie Zhao; Jiao Zhang; Lin Xu; Fang-Feng Liu
Journal:  World J Gastroenterol       Date:  2018-08-14       Impact factor: 5.742

8.  CNIT: a fast and accurate web tool for identifying protein-coding and long non-coding transcripts based on intrinsic sequence composition.

Authors:  Jin-Cheng Guo; Shuang-Sang Fang; Yang Wu; Jian-Hua Zhang; Yang Chen; Jing Liu; Bo Wu; Jia-Rui Wu; En-Min Li; Li-Yan Xu; Liang Sun; Yi Zhao
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

9.  LncRNA ASB16-AS1 Promotes Growth And Invasion Of Hepatocellular Carcinoma Through Regulating miR-1827/FZD4 Axis And Activating Wnt/β-Catenin Pathway.

Authors:  Xiaoxiao Yao; Guangqiang You; Chen Zhou; Dan Zhang
Journal:  Cancer Manag Res       Date:  2019-11-07       Impact factor: 3.989

10.  Long Noncoding RNA PTTG3P Expression Is an Unfavorable Prognostic Marker for Patients With Hepatocellular Carcinoma.

Authors:  Hansong Bai; Xing Luo; Dongxu Liao; Wei Xiong; Ming Zeng; Bo Zheng
Journal:  Technol Cancer Res Treat       Date:  2019 Jan-Dec
View more
  4 in total

1.  Artificial intelligence method to predict overall survival of hepatocellular carcinoma.

Authors:  Cem Simsek; Deniz Can Guven; Taha Koray Sahin; Ibrahim Emir Tekin; Ozlem Sahan; Hatice Yasemin Balaban; Suayib Yalcin
Journal:  Hepatol Forum       Date:  2021-05-21

2.  A four-long noncoding RNA signature predicts survival of hepatocellular carcinoma patients.

Authors:  Haitao Jiang; Lianhe Zhao; Yunjie Chen; Liang Sun
Journal:  J Clin Lab Anal       Date:  2020-05-30       Impact factor: 2.352

3.  Systematic Construction and Validation of an RNA-Binding Protein-Associated Model for Prognosis Prediction in Hepatocellular Carcinoma.

Authors:  Siyuan Tian; Jingyi Liu; Keshuai Sun; Yansheng Liu; Jiahao Yu; Shuoyi Ma; Miao Zhang; Gui Jia; Xia Zhou; Yulong Shang; Ying Han
Journal:  Front Oncol       Date:  2021-01-26       Impact factor: 6.244

4.  Serum exosomal long noncoding RNA CRNDE as a prognostic biomarker for hepatocellular carcinoma.

Authors:  Ting Wang; Hengkai Zhu; Min Xiao; Shao Zhou
Journal:  J Clin Lab Anal       Date:  2021-10-06       Impact factor: 2.352

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.