| Literature DB >> 24522499 |
Jiagen Li1, Zhaoli Chen1, Liqing Tian2, Chengcheng Zhou1, Max Yifan He1, Yibo Gao1, Suya Wang1, Fang Zhou1, Susheng Shi3, Xiaoli Feng3, Nan Sun1, Ziyuan Liu1, Geir Skogerboe2, Jingsi Dong1, Ran Yao1, Yuda Zhao1, Jian Sun1, Baihua Zhang1, Yue Yu1, Xuejiao Shi1, Mei Luo1, Kang Shao1, Ning Li1, Bin Qiu1, Fengwei Tan1, Runsheng Chen2, Jie He1.
Abstract
BACKGROUND: Oesophageal cancer is one of the most deadly forms of cancer worldwide. Long non-coding RNAs (lncRNAs) are often found to have important regulatory roles.Entities:
Keywords: Oesophageal Cancer
Mesh:
Substances:
Year: 2014 PMID: 24522499 PMCID: PMC4215280 DOI: 10.1136/gutjnl-2013-305806
Source DB: PubMed Journal: Gut ISSN: 0017-5749 Impact factor: 23.059
Figure 1Identification of the long non-coding RNA (lncRNA) signature in the training set. (A) After microarray processing, the microarray data was described by an 60×8900 matrix with a ‘good’ or ‘poor’ label column. (B) After two filtering procedures, 909 lncRNAs remained for further analysis. (C) Selection process for the nine lncRNAs with highest classification power for patient survival. A random Forest supervised classification algorithm was used to narrow down the number of lncRNAs by several iterative steps, in which one-third of the least important lncRNAs were discarded at each step according to their importance score. (D) Development of prognostic classifier for all combinations (N=29−1=511) of the nine lncRNAs using the nearest shrunken centroid algorithm. Vg and Vp are the mean expression profiles of the lncRNA combination (g1 g3 g4 g6) for good-prognostic samples and poor-prognostic samples, respectively. Vi is the expression profile of sample i. The Euclid distances d(Vi,Vg) and d(Vi,Vp) are used to classify sample i into a low- or high-risk group. (E) The procedure for identifying the final signature. The accuracies of all 511 signatures were calculated and the nine highest accuracies for k=1, 2, …, 9 are shown in the plot. The signature containing three lncRNAs was selected as the final signature.
Figure 2Unsupervised hierarchical clustering of the 119 pairs of tissues. The normalised expression data of the 6389 lncRNAs with coefficient of variance >0.10 was used for clustering analysis. Hierarchical clustering clearly separated tumour (blue bar) and normal (yellow bar) samples. Only six tumour samples and six normal samples were misclassified.
Figure 3The three-lncRNA signature predicts overall survival of patients with OSCC. Heat maps (A–C) of the relative expression level (tumour minus normal) after z-score transformation for each lncRNA, and Kaplan–Meier survival curves (D–F) of patients classified into high- and low-risk groups using the three-lncRNA signature. p Values were calculated by log-rank test. (A, D) Training set, 60 patients. (B, E) Test set, 59 patients. (C, F) Independent cohort, 60 patients. OSCC, oesophageal squamous cell carcinoma.
Clinical and pathological characteristics of patients with OSCC with high- or low-risk lncRNA signature in the three datasets
| Training set (n=60) | Test set (n=59) | Independent set (n=60) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Characteristics | High-risk group (n=33) | Low-risk group (n=27) | p Value | High-risk group (n=25) | Low-risk group (n=34) | p Value | High-risk group (n=37) | Low-risk group (n=23) | p Value |
| Age, median (IQR) | 59.0 (11.0) | 55.0 (17.5) | 0.7976* | 62.0 (12.0) | 59.0 (9.5) | 0.3834* | 62.0 (13.0) | 58.0 (11.0) | 0.8231 |
| Gender, male | 26 (78.8) | 23 (85.2) | 0.7391 | 21 (84.0) | 28 (82.4) | 1.0000 | 28 (75.7) | 20 (87.0) | 0.3404 |
| Tobacco use, yes | 19 (57.6) | 20 (74.1) | 0.1825 | 18 (72.0) | 23 (67.6) | 0.7197 | 18 (48.6) | 16 (69.6) | 0.1119 |
| Alcohol use, yes | 20 (60.6) | 16 (59.3) | 0.9156 | 16 (64.0) | 22 (64.7) | 0.9554 | 18 (48.6) | 14 (60.9) | 0.3562 |
| Tumour location | 0.2460 | 0.5411 | 0.3780 | ||||||
| Upper | 7 (21.2) | 2 (7.4) | 1 (4.0) | 4 (11.8) | 5 (13.5) | 1 (4.3) | |||
| Middle | 15 (45.5) | 17 (63.0) | 17 (68.0) | 20 (58.8) | 18 (48.6) | 10 (43.5) | |||
| Lower | 11 (33.3) | 8 (29.6) | 7 (28.0) | 10 (29.4) | 14 (37.8) | 12 (52.2) | |||
| Tumour grade | 0.5977 | 0.3126 | 0.4270 | ||||||
| Well differntiated | 8 (24.2) | 6 (22.2) | 4 (16.0) | 5 (14.7) | 4 (10.8) | 5 (21.7) | |||
| Moderately differentiated | 17 (51.5) | 17 (63.0) | 10 (40.0) | 20 (58.8) | 21 (56.8) | 13 (56.5) | |||
| Poorly differentiated | 8 (24.2) | 4 (14.8) | 11 (44.4) | 9 (26.5) | 12 (32.4) | 5 (21.7) | |||
| T stage | 0.2524 | 0.1632 | 0.2271 | ||||||
| T1 | 1 (3.0) | 2 (7.4) | 1 (4.0) | 4 (11.8) | 1 (2.7) | 3 (13.0) | |||
| T2 | 3 (9.1) | 2 (7.4) | 4 (16.0) | 11 (32.4) | 5 (13.5) | 2 (8.7) | |||
| T3 | 17 (51.5) | 19 (70.4) | 15 (60.0) | 11 (32.4) | 31 (83.8) | 17 (73.9) | |||
| T4 | 12 (36.4) | 4 (14.8) | 5 (20.0) | 8 (23.5) | 0 | 1 (6.3) | |||
| N stage | 0.1350 | 0.0290 | 0.7255 | ||||||
| N0 | 11 (33.3) | 16 (59.3) | 6 (24.0) | 21 (61.8) | 16 (43.2) | 13 (56.5) | |||
| N1 | 18 (54.5) | 8 (29.6) | 9 (36.0) | 7 (20.6) | 14 (37.8) | 6 (26.1) | |||
| N2 | 1 (3.0) | 2 (7.4) | 7 (28.0) | 3 (8.8) | 6 (16.2) | 3 (13.0) | |||
| N3 | 3 (9.1) | 1 (3.7) | 3 (12.0) | 3 (8.8) | 1 (2.7) | 1 (4.3) | |||
| TNM stage | 0.1106 | 0.0378 | 0.5552 | ||||||
| I | 0 | 2 (7.4) | 0 | 4 (11.8) | 2 (5.4) | 2 (8.7) | |||
| II | 10 (30.3) | 12 (44.4) | 8 (32.0) | 17 (50.0) | 17 (45.9) | 13 (56.5) | |||
| III | 23 (69.7) | 13 (48.1) | 17 (68.0) | 13 (38.2) | 18 (48.6) | 8 (34.8) | |||
| Tumour clearance | N/A | N/A | N/A | ||||||
| R0 | 33 (100) | 27 (100) | 25 (100) | 34 (100) | 37 (100) | 23 (100) | |||
| R1/R2 | 0 | 0 | 0 | 0 | 0 | 0 | |||
| Postoperative complication | |||||||||
| Pneumonia | 1 (3.0) | 1 (3.7) | 1.0000 | 6 (24.0) | 4 (11.8) | 0.2970 | 2 (5.4) | 1 (4.3) | 1.0000 |
| Anastomotic leak | 3 (9.1) | 1 (3.7) | 0.6199 | 3 (12.0) | 4 (11.8) | 1.0000 | 1 (2.7) | 0 | 1.0000 |
| Arrhythmia | 11 (33.3) | 5 (18.5) | 0.2481 | 9 (36.0) | 2 (5.9) | 0.0055 | 10 (27.0) | 6 (26.1) | 1.0000 |
| Adjuvant therapy | 0.6209 | 0.2585 | 0.5196 | ||||||
| Yes | 20 (60.6) | 13 (48.1) | 16 (64.0) | 20 (58.8) | 23 (62.2) | 12 (52.2) | |||
| No | 8 (24.2) | 9 (33.3) | 1 (4.0) | 6 (17.6) | 11 (29.7) | 10 (43.5) | |||
| Unknown | 5 (15.2) | 5 (18.5) | 16 (27.1) | 8 (35.5) | 3 (8.1) | 1 (4.3) | |||
| Median survival (months) | 19.2 | >60 | <0.0001† | 21.5 | >60 | 0.0030† | 25.8 | >48 | 0.0187† |
Data are shown as n (%). p Values are calculated by χ2 test or Fisher's exact test, unless otherwise stated.
*Student's t test.
†Log-rank test. N/A: p values are not calculated because all patients received R0 resection.
OSCC, oesophageal squamous cell carcinoma; TNM, tumour node metastasis.
Univariable and multivariable Cox regression analysis of the lncRNA signature and survival in the training set (n=60) and in the combined test and independent cohort (n=119)
| Univariable analysis | Multivariable analysis | ||||
|---|---|---|---|---|---|
| HR (95% CI) | p Value | HR (95% CI) | p Value | ||
| Training set | |||||
| Age | >60/≤60 | 1.595 (0.821 to 3.098) | 0.1680 | 2.366 (1.191 to 4.701) | 0.0140 |
| Gender | Female/male | 1.233 (0.561 to 2.707) | 0.6022 | ||
| Tobacco use | Y/N | 0.693 (0.357 to 1.346) | 0.2790 | ||
| Alcohol use | Y/N | 0.896 (0.464 to 1.732) | 0.7445 | ||
| Tumour location | Upper, middle/lower | 1.249 (0.602 to 2.591) | 0.5504 | ||
| Tumour grade | Moderately differentiated, poorly/well differentiated | 1.569 (0.685 to 3.592) | 0.2863 | ||
| T | T3, T4/T1, T2 | 0.767 (0.319 to 1.845) | 0.5540 | ||
| N | N1, N2, N3/N0 | 1.960 (0.974 to 3.943) | 0.0592 | ||
| TNM | III/I, II | 2.506 (1.202 to 5.226) | 0.0143 | ||
| Pneumonia | Y/N | 1.050 (0.144 to 7.672) | 0.9614 | ||
| Anastomotic leak | Y/N | 2.716 (0.829 to 8.892) | 0.0987 | 5.805 (1.605 to 21.000) | 0.0073 |
| Arrhythmia | Y/N | 1.416 (0.706 to 2.837) | 0.3271 | ||
| Adjuvant therapy | Y/N | 1.501 (0.849 to 2.652) | 0.1625 | ||
| LncRNA signature | High risk/low risk | 6.578 (2.837 to 15.252) | <0.0001 | 8.486 (3.550 to 20.284) | <0.0001 |
| Test+independent cohort | |||||
| Age | >60/≤60 | 1.724 (1.072 to 2.774) | 0.0246 | 1.674 (1.033 to 2.713) | 0.0365 |
| Gender | Female/male | 1.283 (0.714 to 2.306) | 0.4045 | ||
| Tobacco use | Y/N | 0.788 (0.488 to 1.272) | 0.3295 | ||
| Alcohol use | Y/N | 0.866 (0.539 to 1.390) | 0.5501 | ||
| Tumour location | Upper, middle/lower | 1.184 (0.719 to 1.951) | 0.5065 | ||
| Tumour grade | moderately differentiated, poorly/well differentiated | 0.982 (0.502 to 1.919) | 0.9571 | ||
| T | T3, T4/T1, T2 | 1.237 (0.716 to 2.183) | 0.4458 | ||
| N | N1, N2, N3/N0 | 2.214 (1.346 to 3.640) | 0.0017 | ||
| TNM | III/I, II | 2.031 (1.258 to 3.278) | 0.0037 | ||
| Pneumonia | Y/N | 1.507 (0.721 to 3.152) | 0.2759 | ||
| Anastomotic leak | Y/N | 0.942 (0.343 to 2.589) | 0.9085 | ||
| Arrhythmia | Y/N | 0.976 (0.558 to 1.705) | 0.9311 | ||
| Adjuvant therapy | Y/N | 2.227 (1.241 to 3.997) | 0.0073 | 2.328 (1.299 to 4.172) | 0.0045 |
| LncRNA signature | High risk/low risk | 2.412 (1.464 to 3.975) | 0.0005 | 2.203 (1.330 to 3.649) | 0.0022 |
TNM, tumour node metastasis.
Figure 4Survival prediction in stage II and III patients. Kaplan–Meier survival curves of stage II and III patients with OSCC classified into high- and low-risk groups based on the three-lncRNA signature. (A) Stage II patients, training set (n=22). (B) Stage II patients, combined test set and independent cohort (n=55). (C) Stage III patients, training set (n=36). (D) Stage III patients, combined test set and independent cohort (n=56). OSCC, oesophageal squamous cell carcinoma.
Figure 5Comparison of sensitivity and specificity for survival prediction by the three-lncRNA signature, TNM stage and combination of the two factors. The three receiver operating characteristics (ROC) curves in the training set (A) and test set (B). p Values show the area under the ROC (AUROC) of TNM stage versus the AUROC of the three-lncRNA signature, or the combination of signature and TNM. TNM, tumour node metastasis.