| Literature DB >> 27352941 |
Stylianos Serghiou1,2, Aikaterini Kyriakopoulou3, John P A Ioannidis4,5,6,7.
Abstract
BACKGROUND: Expression of various long noncoding RNAs (lncRNAs) may affect cancer prognosis. Here, we aim to gather and examine all evidence on the potential role of lncRNAs as novel predictors of survival in human cancer.Entities:
Keywords: Cancer; Cancer biomarkers; Excess significance; LncRNA; Prognosis; Selective reporting biases; Small-study effects; Survival analysis
Mesh:
Substances:
Year: 2016 PMID: 27352941 PMCID: PMC4924330 DOI: 10.1186/s12943-016-0535-1
Source DB: PubMed Journal: Mol Cancer ISSN: 1476-4598 Impact factor: 27.401
Fig. 1A flow diagram demonstrating the study selection process. Out of 397 identified records, 111 were chosen for systematic review and 85 for overall survival meta-analysis. Duplicate exclusion refers to the process of asserting that each paper is only represented once within our set of records. Initially, 111 records with titles seemingly irrelevant to the objectives of our study were excluded, following which another 81 records were excluded after reading through the remaining 192 papers, for the reasons identified within the diagram. This process led to the identification of 111 papers eligible for systematic review. We then applied our meta-analysis eligibility criteria to these papers, on the basis of which 26 were excluded, for the reasons identified in the diagram. This process led to the identification of 85 papers eligible for meta-analysis
Descriptive statistics of eligible studies
| Characteristic | Subgroups | Frequency (%) |
|---|---|---|
| Year | 2000–2013 | 13 (12 %) |
| 2013–2014 | 23 (21 %) | |
| 2014–2014 | 0 (0 %) | |
| 2014–2015 | 75 (68 %) | |
| N | 111 (100 %) | |
| Cancer site | Gastric | 16 (13 %) |
| Colorectal | 15 (12 %) | |
| Lung | 15 (12 %) | |
| Brain | 12 (9 %) | |
| Esophageal | 10 (8 %) | |
| Prostate | 10 (8 %) | |
| Hepatic | 9 (7 %) | |
| Breast | 6 (5 %) | |
| Pancreatic | 6 (5 %) | |
| Cervical | 5 (4 %) | |
| Head and neck | 4 (3 %) | |
| Ovarian | 4 (3 %) | |
| Renal | 4 (3 %) | |
| Urinary bladder | 4 (3 %) | |
| Any | 2 (2 %) | |
| Hematologic | 2 (2 %) | |
| Endometrial | 1 (1 %) | |
| GIST | 1 (1 %) | |
| Neuroblastoma | 1 (1 %) | |
| N | 127 (100 %) | |
| Agnostic a | No | 85 (77 %) |
| Yes | 21 (19 %) | |
| Agnostic parent | 5 (5 %) | |
| N | 111 (100 %) | |
| Survival analysis | OS | 92 (83 %) |
| RFS | 36 (32 %) | |
| DSS | 10 (9 %) | |
| MFS | 9 (8 %) | |
| PFS | 6 (5 %) | |
| N | 111 (100 %) | |
| Quantification method | qRT-PCR | 84 (66 %) |
| ISH | 28 (22 %) | |
| qPCR | 11 (9 %) | |
| qRT-PCR or ISH | 2 (2 %) | |
| RT-qPCR | 1 (1 %) | |
| Unreported | 1 (1 %) | |
| N | 127 (100 %) | |
| Continent | Asia | 94 (74 %) |
| Europe | 15 (12 %) | |
| America | 13 (10 %) | |
| Multiple | 3 (2 %) | |
| Unreported | 2 (2 %) | |
| N | 127 (100 %) | |
| Study design | Unreported | 106 (83 %) |
| Retrospective | 19 (15 %) | |
| Prospective | 2 (2 %) | |
| N | 127 (100 %) | |
| Sampling method | Unreported | 113 (89 %) |
| Consecutive | 5 (4 %) | |
| Population | 5 (4 %) | |
| Random | 4 (3 %) | |
| N | 127 (100 %) | |
| Tissue preservation b | L | 66 (52 %) |
| Unreported | 34 (27 %) | |
| P | 18 (14 %) | |
| LP | 6 (5 %) | |
| L + RNALater | 3 (2 %) | |
| N | 127 (100 %) | |
| Pre-biopsy treatment | No | 77 (61 %) |
| Unreported | 46 (36 %) | |
| Yes | 4 (3 %) | |
| N | 127 (100 %) | |
| Post-biopsy treatment | Unreported | 98 (77 %) |
| Yes | 27 (21 %) | |
| No | 2 (2 %) | |
| N | 127 (100 %) | |
| Total number of lncRNAs studied | 1 | 87 (78 %) |
| 2–10 | 4 (4 %) | |
| 11–45033 | 15 (14 %) | |
| Unreported | 5 (5 %) | |
| N | 111 (100 %) | |
| Outcomes | Clinical and Non-clinical | 76 (68 %) |
| Clinical only | 35 (32 %) | |
| N | 111 (100 %) | |
| Use of validation method for survival | Unreported | 99 (89 %) |
| External | 5 (5 %) | |
| Internal | 4 (4 %) | |
| Both | 2 (2 %) | |
| Yes | 1 (1 %) | |
| N | 111 (100 %) |
These data are based on 111 studies of 127 datasets. N refers to the total number of observations for each characteristic
a Agnostic studies are those in which no prior knowledge is assumed regarding the choice of lncRNA to be studied
b Tissue preservation: L (liquid nitrogen), P (paraffin-embedded), LP (liquid nitrogen and/or paraffin-embedded
Fig. 2The covariates used within the multivariable models fitted by each paper. This is a data microarray in which the studies run along the Y-axis and the covariates run along the X-axis. Only the factors used three or more times are shown in this figure for convenience; refer to Additional file 3: Figure S1 for a data microarray illustrating all covariates studied. Rows and columns are ordered in descending order, based on how many times each covariate was included in the multivariable models fitted by each study. Where patterns were similar between studies or covariates, those papers or covariates were placed next to each other. It is evident that very few studies included the same covariates within their models and that less than half of the studies included both Stage and Grade within those models. Interestingly, according to Additional file 3: Figure S1, the majority of studies included at least one covariate within their model that had not been included in any other study. Green = Included in the multivariable model; Red = Not included in the multivariable model. LNM = Lymph node metastasis; T = Depth of invasion; M = Metastasis; KPS score = Karnofsky Performance Status score (a measure of functional impairment); LVM = Lymphovascular metastasis
Details of the lncRNAs studied
| LncRNA | Times studied | Number of cancer types (sample size) | Median (IQR) | Times significant (%) |
|---|---|---|---|---|
| HOTAIR | 29 | 13 (3886) | 100 (69) | 28 (97 %) |
| MALAT1 | 8 | 7 (1135) | 136 (53) | 5 (62 %) |
| H19 | 4 | 2 (440) | 77 (57) | 2 (50 %) |
| PVT1 | 4 | 4 (420) | 87 (24) | 4 (100 %) |
| GAS5 | 4 | 4 (369) | 96 (19) | 4 (100 %) |
| SChLAP1 | 3 | 1 (1396) | 357 (440) | 3 (100 %) |
| 6 lncRNA risk score | 3 | 1 (281) | 42 (94) | 1 (33 %) |
| CCAT2 | 2 | 2 (1226) | 613 (384) | 1 (50 %) |
| LncR1 vs LncR2 vs LncR3 | 2 | 1 (759) | 380 (96) | 2 (100 %) |
| ENSG00000261582 | 2 | 2 (576) | 288 (199) | 2 (100 %) |
| MVIH | 2 | 2 (257) | 128 (86) | 2 (100 %) |
| LOC285194 | 2 | 2 (227) | 114 (28) | 2 (100 %) |
| PCAT1 | 2 | 2 (212) | 106 (2) | 2 (100 %) |
| SPRY4-IT1 | 2 | 2 (190) | 95 (3) | 2 (100 %) |
| UCA1 | 2 | 2 (170) | 85 (5) | 2 (100 %) |
| GHET1 | 2 | 2 (122) | 61 (19) | 2 (100 %) |
| MEG3 | 2 | 2 (116) | 58 (14) | 2 (100 %) |
The following lncRNAs were studied once and found statistically significant: LINC00968, LINC01234, LINC00476, FLG-AS1, HOTTIP, TC0101686, TC0100223. The following lncRNAs were studied once and were not found significant: linc-UBC1, KIAA0495, PART1, MGC21881, MIAT, PAR5, ADAMTS9-AS2, BCAR4, XLOC_010588, FOXCUT, 3 lncRNA risk score, FENDRR, HIF1A-AS2, ANRIL, GAPLINC, MRUL, HEIH, HOXA13, 48 lncRNA risk score, BANCR, ZXF1, CARLo-5, GAS6-AS1, Sox2ot, TUG1, NAG7 - LINC00312, CAI2, TC0101441, ENST00000480739, BC008363, 80-gene SChLAP1 signature risk score, 167-gene SChLAP1 signature risk score, CADM1-AS1, RCCRT1, CCAT1. Significance in the table refers to p-value < 0.05, as this is what had been used by these studies
The ‘Times studied’ column refers to how many studies investigated each lncRNA. The ‘Number of cancer types’ column indicates in how many different cancer types each lncRNA was studied, with the total number of participants used to study each lncRNA in brackets. The ‘Median’ column indicates the median sample size for each cohort used to study each lncRNA, with the interquartile range (IQR) in brackets. The last column indicates how many times each lncRNA was found to be statistically significantly associated to prognosis and in brackets the relation of how many times it was found to be significant versus how many times it was studied
Fig. 3Forest plots for OS meta-analyses. We hereby illustrate the lncRNAs for which three or more studies reported OS. Each panel a-g corresponds to the meta-analysis of a different lncRNA: (a) HOTAIR, (b) MALAT1, (c) PVT1, (d) 6 lncRNA risk score, (e) SChLAP1, (f) H19 and (g) GAS5. The effect size for the estimate of each study is presented as a blue square proportional in size to the weight of that study. The confidence interval around that effect size is presented as a horizontal line. Where the confidence interval exceeds the range of our plot, an arrow has been placed. The vertical line across these estimates represents HR = 1 and any horizontal line crossing this vertical line represents a non-statistically significant result. The summary effect size is presented as a rhombus, the center of which represents the summary effect size and the width of which represents its confidence interval. It is evident that almost all studies quoted statistically significant results and that according to the available data, all meta-analyzed lncRNAs, apart from GAS5 (panel g), are statistically significantly associated to prognosis of OS in cancer. However, high between-study heterogeneity (based on the range of I2 estimates) indicates that these summary effect sizes are unreliable
The results of our meta-analysis for each lncRNA using ‘primarily multivariable’ data
| LncRNA | Studies | HR (95 % CI) |
| I2 (95 % CI) | Observed (Expected, |
|---|---|---|---|---|---|
| HOTAIR | 26 | 2.22 (1.86–2.65) | 0.0000 | 49 % (14–79 %) | 25 (18.2, |
| MALAT1 | 7 | 2.03 (1.64–2.52) | 0.0000 | 0 % (0–85 %) | 5 (4.1, |
| 6 lncRNA risk score | 4 | 1.57 (1.29–1.92) | 0.0000 | 0 % (0–47 %) | 2 (2.1, |
| GAS5 | 4 | 0.81 (0.33–2.00) | 0.6479 | 94 % (80–100 %) | 4 (2.1, |
| H19 | 4 | 1.16 (1.04–1.29) | 0.0100 | 0 % (0–98 %) | 2 (0.8, |
| PVT1 | 4 | 2.99 (2.10–4.27) | 0.0000 | 0 % (0–56 %) | 4 (2.7, |
| SChLAP1 | 3 | 1.98 (1.55–2.54) | 0.0000 | 0 % (0–98 %) | 3 (2.9, |
‘Studies’ refers to the number of studies included in the meta-analysis of each lncRNA. HR = Hazard Ratio, 95 % CI = 95 % Confidence Interval. I2 is a measure of between-study heterogeneity. The last column refers to how many statistically significant results had been reported by the included studies (Observed, O), how many were expected to be reported on the basis of each study’s power (Expected, E) and whether O and E are statistically significantly different from each other for each meta-analysis (p-value). Please refer to Additional file 1: Table S2 for a table illustrating all meta-analyses done (not only the one for primarily multivariable data) with all of the measures calculated
Fig. 4Funnel plot for the OS meta-analysis of HOTAIR. The meta-analysis for HOTAIR was analyzed with a funnel plot because it exceeded the pre-requisite of 10 studies. The Y-axis represents the Standard Error (SE), which serves as a measure of precision, where the higher the SE, the less precise the study. The HR has been plotted along the X-axis. The black dots map the effect size of HOTAIR on OS as this has been identified by each study. The light grey and dark grey areas respectively denote the 95 % and 99 % CI around the summary effect size. According to this plot, it is clear that the least precise studies tend to overestimate the effect size of HOTAIR on OS, skewing the summary effect size to the right (i.e. leading to a more strongly positive summary effect size)