| Literature DB >> 28187432 |
Jiang-Hui Zeng1, Liang Liang2, Rong-Quan He3, Rui-Xue Tang1, Xiao-Yong Cai2, Jun-Qiang Chen4, Dian-Zhong Luo1, Gang Chen1.
Abstract
Growing evidence has shown that long non-coding RNAs (lncRNAs) can serve as prospective markers for survival in patients with colorectal adenocarcinoma. However, most studies have explored a limited number of lncRNAs in a small number of cases. The objective of this study is to identify a panel of lncRNA signature that could evaluate the prognosis in colorectal adenocarcinoma based on the data from The Cancer Genome Atlas (TCGA). Altogether, 371 colon adenocarcinoma (COAD) patients with complete clinical data were included in our study as the test cohort. A total of 578 differentially expressed lncRNAs (DELs) were observed, among which 20 lncRNAs closely related to overall survival (OS) in COAD patients were identified using a Cox proportional regression model. A risk score formula was developed to assess the prognostic value of the lncRNA signature in COAD with four lncRNAs (LINC01555, RP11-610P16.1, RP11-108K3.1 and LINC01207), which were identified to possess the most remarkable correlation with OS in COAD patients. COAD patients with a high-risk score had poorer OS than those with a low-risk score. The multivariate Cox regression analyses confirmed that the four-lncRNA signature could function as an independent prognostic indicator for COAD patients, which was largely mirrored in the validating cohort with rectal adenocarcinoma (READ) containing 158 cases. In addition, the correlative genes of LINC01555 and LINC01207 were enriched in the cAMP signaling and mucin type O-Glycan biosynthesis pathways. With further validation in the future, our study indicates that the four-lncRNA signature could serve as an independent biomarker for survival of colorectal adenocarcinoma.Entities:
Keywords: COAD; READ; lncRNA; prognostic biomarker; survival
Mesh:
Substances:
Year: 2017 PMID: 28187432 PMCID: PMC5370003 DOI: 10.18632/oncotarget.15161
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Differentially expressed lncRNAs (DELs) analysis
(A) DELs identified using the edgeR package; (B) DELs identified using the DESeq package; (C) Overlapping DELs.
Figure 2The 578 differentially expressed lncRNAs (DELs) in COAD
A heatmap was drawn to show the DELs.
Prognostic value of the DELs by univariate cox regression analysis
| Estimate | StdErr | ChiSq | ProbChiSq* | HazardRatio | |
|---|---|---|---|---|---|
| AC016831.7 | –0.209 | 0.065 | 10.521 | 0.001 | 0.811 |
| LINC01555 | –0.177 | 0.056 | 9.985 | 0.002 | 0.838 |
| RP11-610P16.1 | –0.306 | 0.104 | 8.592 | 0.003 | 0.736 |
| AC006273.5 | 0.239 | 0.083 | 8.412 | 0.004 | 1.271 |
| RP11-108K3.1 | 0.252 | 0.091 | 7.760 | 0.005 | 1.287 |
| RP1-193H18.2 | –0.217 | 0.082 | 7.078 | 0.008 | 0.805 |
| LINC00675 | –0.179 | 0.069 | 6.753 | 0.009 | 0.836 |
| CTD-2619J13.17 | –0.226 | 0.093 | 5.925 | 0.015 | 0.797 |
| RP11-449D8.1 | –0.154 | 0.064 | 5.838 | 0.016 | 0.857 |
| AF064858.6 | –0.154 | 0.064 | 5.791 | 0.016 | 0.857 |
| RP11-150O12.3 | –0.138 | 0.059 | 5.504 | 0.019 | 0.871 |
| TP53TG1 | –0.234 | 0.101 | 5.419 | 0.020 | 0.791 |
| LINC00959 | –0.248 | 0.111 | 4.983 | 0.026 | 0.780 |
| SUCLG2-AS1 | –0.189 | 0.088 | 4.647 | 0.031 | 0.827 |
| LINC01315 | –0.180 | 0.086 | 4.408 | 0.036 | 0.836 |
| RP11-474D1.3 | –0.066 | 0.032 | 4.340 | 0.037 | 0.936 |
| MAFTRR | –0.184 | 0.088 | 4.331 | 0.037 | 0.832 |
| KBTBD11-OT1 | –0.172 | 0.084 | 4.174 | 0.041 | 0.842 |
| LINC01207 | –0.120 | 0.059 | 4.099 | 0.043 | 0.887 |
| LINC01132 | –0.214 | 0.106 | 4.055 | 0.044 | 0.808 |
* ProbChiSq equal P-value.
Figure 3Clinical significance of four differentially expressed lncRNAs (DELs)
(A) The expression levels of four DELs in the tumor group compared with that in the normal group; (B) ROC curves of the four DELs to distinguish COAD tissue from normal colon tissue; (C) The Kaplan-Meier curves showing the relationship between the four DELs and the overall survival. The cases were divided into a high and low expression group by the mean DEL level. *P < 0.001.
Figure 4Risk score analysis of the differentially expressed lncRNA (DEL) signature of COAD
(A) Patient survival status and time distributed by risk score; (B) Risk score curve of the four-DEL-signature; (C) Heatmap of four DELs from COAD patients. Color from blue to red indicates the expression level from low to high. The dotted line represents the individual inflection point of the risk score curve, by which the COAD patients were classified in the low-risk or high-risk group.
Figure 5The prognostic performance of the four-differentially expressed lncRNA (DEL) signature of COAD
(A) The prognostic performance of the risk score showed by the time-dependent receiver operating characteristic (ROC) curve for predicting the 5-years survival. (B) The Kaplan-Meier test of the risk score for the overall survival.
The predictive values of related clinical parameters and risk score
| Variables | Patient | Univariate analysis | Multivariate analysis | |||
|---|---|---|---|---|---|---|
| Sex | Female | 170 | 1 (reference) | |||
| Male | 201 | 1.180 (0.761–1.831) | 0.459 | |||
| Age | <= 65 years | 162 | 1 (reference) | |||
| > 65 years | 209 | 1.284 (0.816–2.022) | 0.280 | |||
| Disease stage | I | 67 | 1 (reference) | 1 (reference) | ||
| II | 146 | 2.187 (0.759–6.304) | 0.147 | 1.225E4 (0.000–1.976E86) | 0.922 | |
| III | 99 | 3.904 (1.354–11.257) | 8.180E4 (0.000–1.328E87) | 0.907 | ||
| IV | 52 | 9.645 (3.351–27.758) | 2.394E5 (0.000–3.875E87) | 0.989 | ||
| T stage | T1 | 10 | 1 (reference) | |||
| T2 | 68 | 0.452 (0.087–2.352) | 0.345 | |||
| T3 | 249 | 1.052 (0.256–4.332) | < 0.944 | |||
| T4 | 44 | 4.078 (0.937–17.747) | < 0.061 | |||
| N stage | N0 | 227 | 1 (reference) | 1 (reference) | ||
| N1 | 82 | 1.827 (1.059–3.151) | 0.624 (0.096–4.049) | 0.621 | ||
| N2-N3 | 61 | 4.348 (2.577–7.337) | 0.868 (0.136–5.555) | 0.881 | ||
| M stage | M0 | 269 | 1 (reference) | |||
| M1 | 51 | 4.601 (2.773–7.636) | ||||
| Lymphatic invasion | NO | 211 | 1 (reference) | 1 (reference) | ||
| YES | 120 | 1.960 (1.219–3.150) | 0.380 (0.121–1.189) | 0.096 | ||
| Venous invasion | NO | 248 | 1 (reference) | 1 (reference) | ||
| YES | 74 | 2.490 (1.547–4.006) | 1.331 (0.438–4.048) | 0.614 | ||
| Treatment outcome | CR+PR | 123 | 1 (reference) | |||
| SD+PD | 29 | 7.320 (3.491–15.347) | ||||
| Radiotherapy | NO | 308 | 1 (reference) | |||
| YES | 8 | 0.740 (0.102–5.355) | 0.765 | |||
| Neoplasm recurrence | NO | 256 | 1 (reference) | 1 (reference) | ||
| YES | 60 | 2.990 (1.855–4.819) | 3.030 (1.234–7.440) | |||
| Residual tumor | R0 | 263 | 1 (reference) | 1 (reference) | ||
| R1+R2 | 20 | 3.953 (1.933–8.082) | 1.660 (0.607–4.537) | 0.323 | ||
| Dimession | <=10 mm | 137 | 1 (reference) | |||
| > 10 mm | 124 | 0.944 (0.537–1.659) | 0.840 | |||
| Risk score | Low | 170 | 1 (reference) | 1 (reference) | ||
| High | 201 | 2.948 (1.779 – 4.886) | 9.389 (2.737–32.213) |
HR: hazard ratio; CI: confidence interval; SD: stable disease; PD: progressive disease; CR: complete response; PR: partial response.
Figure 6The prognostic value of different parameters for survival of COAD patients
Kaplan-Meier curves of nine independent prognostic indictors, including pathologic tumor stage, pathologic node stage, pathologic metastasis, pathologic stage, lymphatic invasion, venous invasion, residual tumor, tumor recurrence and treatment outcome. CR, complete response; PR, partial response; SD, stable disease; PD, progressive disease.
Figure 7The predictive value of the risk score for the clinical status
characteristic (ROC) curve predicting the different clinical parameters: pathologic tumor stage, pathologic node stage, pathologic metastasis, pathologic stage, lymphatic invasion, venous invasion, residual tumor, tumor recurrence and treatment outcome.
Figure 8The expression level of the four lncRNAs in the low- and high-risk groups
The difference in the expression level of LINC01555, RP11-610P16.1, RP11-108K3.1 and LINC01207 between the low-risk group and high-risk group. *P < 0.001.
Figure 9Validation of the four-DEL-signature in READ
(A) READ patients were divided into high- and low-risk groups using the risk score generated from the four-DEL-signature. The relationship between survival and risk score is presented (top); The risk score curve is drawn to classify the READ patients into low- and high- risk groups (middle); The risk scores of the READ patients are shown in a heatmap (bottom). (B) ROC curve for predicting 5-year survival in READ patients by the risk score. (C) Kaplan-Meier curves of the four-lncRNA signature for READ patients.
Figure 10Gene network of the correlative genes of LINC01207
The network of genes co-expressed with LINC01207. LncRNAs are shown as green nodes and mRNAs as red nodes. We measured LINC01207 co-expression using the Multi Experiment Matrix (MEM) software. The Affymetrix GeneChip Human Genome U133 Plus 2.0 platform type was selected for further analysis. The top 100 genes were used to draw the network schematic. The bottom box displays the protein-protein interaction (PPI) network of related genes.
Figure 11Gene network of the correlative genes of LINC01555
The network of genes co-expressed with LINC01555. LncRNAs are shown as green nodes and mRNAs are shown as red nodes. We detected the co-expression of LINC01555 using Multi Experiment Matrix (MEM) software. The Affymetrix GeneChip Human Genome U133 Plus 2.0 platform type was applied for further examination. The top 100 genes were used to draw the network schematic. The bottom box displays the protein-protein interaction (PPI) network of related genes.
GO terms and pathway analyses with the correlative genes of LINC01555 and LINC01207
| Category | Term | Genes | |
|---|---|---|---|
| KEGG pathway | cAMP signaling pathway | 0.016 | GIPR, ADCY10, GLP1R |
| Neuroactive ligand-receptor interaction | 0.029 | SSTR3, GIPR, GLP1R | |
| Biological process | Meiotic nuclear division | 0.034 | TUBGCP6, FKBP6 |
| Regulation of heart contraction | 0.043 | TNNT2, GLP1R | |
| Cellular response to estradiol stimulus | 0.049 | SSTR3, MYOG | |
| Cellular component | Basal part of cell | 0.021 | OTOF, ADCY10 |
| Molecular function | Structural molecule activity | 0.047 | KRT74, SNTG1, KRT35 |
| KEGG pathway | Mucin type O-Glycan biosynthesis | < 0.001 | GCNT3, GALNT7, POC1B-GALNT4, ST6GALNAC1 |
| Glycosphingolipid biosynthesis - lacto and neolacto series | 0.003 | FUT6, FUT3, FUT2 | |
| Metabolic pathways | 0.007 | GCNT3, CYP2J2, GALNT7, GMDS, POC1B-GALNT4, FUT6, FUT3, HGD, FUT2, ST6GALNAC1 | |
| Biological process | Protein glycosylation | < 0.001 | GALNT7, POC1B-GALNT4, FUT6, FUT3, FUT2, ST6GALNAC1 |
| L-fucose catabolic process | 0.001 | FUT6, FUT3, FUT2 | |
| Fucosylation | 0.001 | FUT6, FUT3, FUT2 | |
| Maintenance of gastrointestinal epithelium | 0.001 | TFF1, MUC13, PBLD | |
| O-glycan processing | 0.002 | GCNT3, GALNT7, POC1B-GALNT4, MUC13 | |
| Carbohydrate metabolic process | 0.005 | GCNT3, GALNT7, POC1B-GALNT4, TFF1, FUT2 | |
| Positive regulation of establishment of protein localization to plasma membrane | 0.006 | PLS1, AGR2, CIB1 | |
| Lung goblet cell differentiation | 0.012 | SPDEF, AGR2 | |
| Oligosaccharide biosynthetic process | 0.042 | FUT3, ST6GALNAC1 | |
| Cellular component | Extracellular exosome | < 0.001 | GCNT3, TSPAN1, CYP2J2, GALNT7, CLDN3, SLC44A4, POC1B-GALNT4, FUT6, KIAA1324, TSPAN8, PBLD, CANT1, ZG16B, ANG, PLS1, FUT3, TFF3, CEACAM5, FUT2, MUC13, GOLM1, CIB1, BCAS1, TMPRSS2, CAPN5, SLC12A2, GMDS, TMC5, TMC4, HGD, GPA33 |
| Golgi cisterna membrane | 0.003 | FUT6, FUT3, FUT2, CANT1 | |
| Golgi apparatus | 0.006 | SH3RF1, POC1B-GALNT4, FUT6, KIAA1324, ATP8B1, FUT3, FUT2, GOLM1, ST6GALNAC1, CIB1 | |
| Integral component of plasma membrane | 0.008 | TMPRSS2, ATP2C2, TNFRSF11A, TSPAN1, SLC12A2, CLDN3, SLC22A23, KIAA1324, ATP8B1, GPA33, TSPAN8, CEACAM5, GOLM1 | |
| Integral component of membrane | 0.009 | GCNT3, GPR160, FAM3D, GALNT7, CLDN3, SLC44A4, POC1B-GALNT4, FUT6, KIAA1324, CLDN12, CANT1, TMEM144, STX19, ATP8B1, FUT3, CREB3L1, FUT2, MUC13, TMPRSS2, TMEM45B, SLC12A2, FA2H, TMC5, SLC22A23, TMC4, ST6GALNAC1, STYK1, ATP2C2, CDH17, SMIM22, MGST2 | |
| Bicellular tight junction | 0.010 | CLDN3, POF1B, CLDN12, TJP3 | |
| Golgi membrane | 0.028 | ATP2C2, GCNT3, GALNT7, POC1B-GALNT4, FUT6, FUT3, ST6GALNAC1 | |
| Apical plasma membrane | 0.030 | SHROOM3, SLC12A2, ATP8B1, MUC13, CIB1 | |
| Perinuclear region of cytoplasm | 0.040 | SH3RF1, ATP2C2, TSPAN1, RASEF, POC1B-GALNT4, RHPN2, CIB1 | |
| Organelle membrane | 0.044 | CYP2J2, FA2H, MGST2 | |
| Molecular function | Fucosyltransferase activity | < 0.001 | FUT6, FUT3, FUT2 |
| 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase activity | 0.011 | FUT6, FUT3 | |
| Alpha-(1->3)-fucosyltransferase activity | 0.030 | FUT6, FUT3 | |
| Dystroglycan binding | 0.034 | AGR3, AGR2 | |
| carbohydrate binding | 0.040 | ZG16B, GALNT7, LGALS4, POC1B-GALNT4 | |
Figure 12GO and KEGG term analysis of potential genes related to LINC01555 and LINC01207
The Rich factor shows the degree of enrichment, which was calculated by the formula: (the number of selected genes in a term/total number of selected genes)/(the total number of genes in a term of the database/ the total number of genes in the database). The Node size represents the number of selected genes, and color represents the p-value of the enrichment analysis. CC, cellular component; MF, molecular function; BP, biological process.