
Development and validation of a three-long noncoding RNA signature for predicting prognosis of patients with gastric cancer.

Jun Zhang1, Hai-Yan Piao2, Yue Wang1, Mei-Yue Lou3, Shuai Guo1, Yan Zhao4.   

Abstract

BACKGROUND: Gastric cancer (GC) is one of the most frequently diagnosed gastrointestinal cancers throughout the world. Novel prognostic biomarkers are required to predict the prognosis of GC. AIM: To identify a multi-long noncoding RNA (lncRNA) prognostic model for GC.
METHODS: Transcriptome data and clinical data were downloaded from The Cancer Genome Atlas. COX and least absolute shrinkage and selection operator regression analyses were performed to screen for prognosis associated lncRNAs. Receiver operating characteristic curve and Kaplan-Meier survival analyses were applied to evaluate the effectiveness of the model.
RESULTS: The prediction model was established based on the expression of AC007991.4, AC079385.3, and AL109615.2. Based on the model, GC patients were divided into "high risk" and "low risk" groups to compare the differences in survival. The model was re-evaluated with the clinical data of our center.
CONCLUSION: The 3-lncRNA combination model is an independent prognostic factor for GC. ©The Author(s) 2020. Published by Baishideng Publishing Group Inc. All rights reserved.

Keywords:  Gastric cancer; Least absolute shrinkage and selection operator; Long noncoding RNA; Prognosis; Survival analysis

Year:  2020        PMID: 33311941      PMCID: PMC7701940          DOI: 10.3748/wjg.v26.i44.6929

Source DB:  PubMed          Journal:  World J Gastroenterol        ISSN: 1007-9327            Impact factor:   5.742


Core Tip: A model to predict survival of patients with gastric cancer was developed from RNA sequencing data and validated by real-time reverse transcription-polymerase chain reaction assays. The model performed well: The areas under the curves for 3-year and 5-year survival were 0.78 and 0.75, respectively. The C-index was 0.72 (se = 0.022, 95% confidence interval: 0.67-0.76). A "high risk" classification by the model was an independent predictor of poor disease-free survival and poor overall survival.

INTRODUCTION

Gastric cancer (GC), the fourth most frequently diagnosed malignant tumor, has the second highest cancer-related mortality rate worldwide, with high morbidity in Asia[1]. Moreover, GC is still the second most prevalent cancer in China[2]. Most patients are diagnosed at an advanced stage and are prone to chemoresistance and recurrence, which results in a 5-year overall survival rate of less than 25%[3]. Therefore, developing a novel and reliable prognostic stratification system that could be applied to clinical risk assessment would be of great significance for the treatment and follow-up of GC patients. Long noncoding RNAs (lncRNAs) are transcripts longer than 200 nucleotides with limited protein-coding ability[4]. Emerging evidence indicates that lncRNAs are involved in tumor initiation and progression through gene expression regulation from epigenetic to post-transcriptional levels[5-7]. Not surprisingly, some studies have shed light on the role of lncRNAs in GC. GClnc1 promoted tumorigenesis and metastasis by acting as a “scaffold” that recruits the WDR5/KAT2A complex in GC[8]. HOXC-AS3 could be regulated by abnormal histone modification and functioned as an essential element in GC proliferation and migration[9]. Therefore, the abnormal expression of lncRNAs may indirectly reflect the occurrence and development of GC. As genes do not usually act alone, it is necessary to select suitable lncRNAs and establish a multi-lncRNA prediction model, which may play a pivotal role in evaluating the prognosis of GC. In the present research, we analyzed the expression of lncRNAs associated with GC patients’ survival in The Cancer Genome Atlas (TCGA). We aimed to develop and validate a multi-lncRNA combination prediction model for GC survival by performing COX regression and least absolute shrinkage and selection operator (LASSO) regression[10].

MATERIALS AND METHODS

Data collection

RNA sequencing data and matching clinical data were acquired from TCGA (https://portal.gdc.cancer.gov/). Raw data were obtained through the “RTCGAToolbox” package (R platform). Overall, 407 samples, including 375 GC and 32 normal tissues, were analyzed. The expression of lncRNAs was acquired from Illumina HiSeq-RNASeq platforms. Moreover, the clinical data of GC patients were also downloaded. All analyses were performed in R.

Data preprocessing

The R package “edgeR” was used to identify differentially expressed lncRNAs (DELs), with P < 0.01 and |logFC| ≥ 2 as the cutoff points. The R package “survival” was used to perform univariate COX regression analysis. LncRNAs that were significant in univariate COX regression were entered into least absolute shrinkage and selection operator (LASSO) regression (package “glmnet”) to minimize the overfitting caused by univariate COX regression. In addition, multivariate COX analysis was performed to screen for independent risk factors associated with GC patients’ prognosis. The R package “survminer” was used for visualization. As the time-dependent receiver operating characteristic (ROC) curve is an essential measure of the predictive power of a prognostic model, we used the R package “survivalROC” to assess the performance of the lncRNA combination prediction model in predicting 3- and 5-year survival. Kaplan-Meier survival analysis was performed to explore correlations between the lncRNA combination model and overall survival (OS). This study was carried out according to the flow chart (Figure 1A).
Figure 1

Differentially expressed long noncoding RNAs in The Cancer Genome Atlas stomach adenocarcinoma (TCGA-STAD) cohort. A: The workflow of the study; B: Volcano plots showing the differentially expressed long noncoding RNAs (DELs) screened with edgeR. The 772 up-regulated DELs are marked in red, and the 220 down-regulated DELs are marked in green; C: Heatmap showing the top 50 DELs in 375 gastric cancer and 32 para-carcinoma tissues. LncRNAs: Long noncoding RNAs; DELs: Differentially expressed long noncoding RNAs; TCGA: The Cancer Genome Atlas; STAD: Stomach adenocarcinoma; COX: Cox proportional hazards regression; LASSO: Least absolute shrinkage and selection operator; ROC: Receiver operating characteristic.


Ethical statement and tissue samples

Overall, 200 GC tissues and paired non-tumorous adjacent tissues were acquired from patients undergoing surgery at the Liaoning Province Cancer Hospital and Institute from 2012 to 2014. All patients signed an informed consent form before the operation. The hospital’s Ethics Committee approved the study. No preoperative chemotherapy or radiotherapy was performed on enrolled patients. Gastrectomy and D2 lymph node dissection were performed in all patients. Total RNA was extracted from patient tissue samples. Tumor staging was based on the tumor-node-metastasis (TNM) staging system (8th edition). One hundred and forty-two men and 58 women were enrolled in this study; their average age was 65 years (range, 42-78 years). Postoperative adjuvant chemotherapy was given to patients with stage IIA disease and above. Table 1 shows the clinicopathological parameters of the patients. Follow-up was the same as that in previous studies[9], and the follow-up deadline was December 31, 2019.
Table 1

Patient characteristics and univariate analysis

                                Disease-free survival        Overall survival
Characteristic            n     Time (mo)  P value  F        Time (mo)  P value  F
Age, median, yr                            0.14     2.17                0.20     1.72
  ≥ 60                    107   42.46                        49.39
  < 60                    93    50.48                        55.79
Gender                                     0.15     2.10                0.12     2.42
  Male                    142   43.86                        50.23
  Female                  58    49.58                        54.82
Bormann type                               0.00     44.98               0.00     43.62
  I                       15    69.93                        69.93
  II                      88    45.57                        51.73
  III                     94    29.35                        36.99
  IV                      3     15.33                        25.00
Tumor size                                 0.00     28.46               0.00     27.42
  ≥ 5 cm                  87    31.51                        39.48
  < 5 cm                  113   57.46                        62.07
Location                                   0.09     4.91                0.13     4.11
  Upper                   44    35.22                        43.33
  Middle                  56    49.33                        54.92
  Low                     100   48.92                        54.31
Lauren type                                0.62     0.96                0.63     0.93
  Intestinal              95    45.22                        51.26
  Mixed                   45    41.08                        47.06
  Diffuse                 60    50.23                        56.12
Tumor differentiation                      0.16     2.00                0.19     1.71
  Moderate and well       66    50.80                        56.28
  Poor                    134   44.57                        50.74
Vessel invasion                            0.00     16.30               0.00     14.73
  Yes                     59    31.07                        38.83
  No                      141   52.08                        57.54
Perineural invasion                        0.00     11.24               0.00     11.52
  Yes                     51    28.01                        37.39
  No                      149   52.14                        57.41
TNM stage                                  0.00     335.89              0.00     360.28
  I                       42    66.45                        66.50
  II                      47    64.36                        68.85
  III                     109   18.70                        28.50
  IV                      2     5.00                         8.50
AC007991.4                                 0.05     3.77                0.05     3.95
  Overexpression          68    53.02                        58.07
  Weak expression         132   42.70                        49.24
AC079385.3                                 0.00     14.23               0.00     14.71
  Overexpression          95    36.26                        43.94
  Weak expression         105   55.12                        59.92
AL109615.2                                 0.00     10.48               0.02     9.87
  Overexpression          113   39.04                        46.05
  Weak expression         87    56.27                        60.90
Model                                      0.00     21.76               0.00     21.34
  High risk               95    37.66                        44.46
  Low risk                105   53.91                        59.27

TNM: Tumor-node-metastasis.

Cell culture

The gastric epithelial cell line GES-1 and gastric cancer cell lines AGS and MKN45 were obtained from China Medical University (Shenyang, China). Cells were maintained in RPMI 1640 supplemented with 10% fetal bovine serum, penicillin, and streptomycin (Invitrogen, United States) at 37 °C at 5% CO2/1% O2. Analyses were performed at least three times.

Real-time reverse transcription-polymerase chain reaction

TRIzol, the Promega cDNA core kit, and SYBR Master Mixture were used to synthesize cDNA and conduct real-time reverse transcription-polymerase chain reaction (RT-PCR) as described in previous studies[6,11,12].

Statistical analysis

Data are shown as the mean ± standard deviation. Student’s t-test, the Wilcoxon signed-rank test, and ANOVA were used for statistical analyses in SPSS 23.0 (IBM, NY, United States). The relationship between clinical data and expression of biomarkers was tested by the χ² test or Fisher’s exact test. The log-rank test and COX proportional hazards model were used for survival analysis. P < 0.05 represented statistical significance.
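As an illustration of the survival estimate that underlies the Kaplan-Meier curves and the log-rank comparison, the following is a minimal Python sketch of the product-limit estimator. It is not the authors' SPSS or R workflow; the function name `kaplan_meier` and the toy follow-up data are hypothetical and chosen only for illustration.

```python
def kaplan_meier(times, events):
    """Product-limit (Kaplan-Meier) estimate of the survival function.

    times  -- follow-up times (e.g. months), one per patient
    events -- 1 if the event (death or recurrence) was observed, 0 if censored
    Returns a list of (time, survival probability) at each distinct event time.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Group all patients whose follow-up ends at the same time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / n_at_risk
            curve.append((t, survival))
        # Both deaths and censored patients leave the risk set after t.
        n_at_risk -= leaving
    return curve


# Hypothetical toy data: five patients, two of them censored.
toy_times = [5, 8, 8, 12, 20]
toy_events = [1, 1, 0, 1, 0]
toy_curve = kaplan_meier(toy_times, toy_events)
```

For the toy data the estimated survival drops to about 0.8, 0.6, and 0.3 at 5, 8, and 12 mo; censored patients reduce the number at risk without lowering the curve.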

RESULTS

Identification of differentially expressed lncRNAs

RNA sequencing involved 375 GC and 32 para-carcinoma tissues. EdgeR was used to determine the DELs (P < 0.01 and |logFC| ≥ 2). A total of 992 DELs were obtained, of which 772 were up-regulated, and 220 were down-regulated (Figure 1B and Supplementary Table 1). Moreover, the heatmap elucidated the top 50 DELs (Figure 1C).
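The screening rule can be sketched in a few lines of Python. This is only an illustration of the stated cutoffs (P < 0.01 and |logFC| ≥ 2), not the edgeR analysis itself; the `DEResult` record, the `classify_del` helper, and the four toy rows are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DEResult:
    gene: str        # lncRNA identifier (toy names below)
    log_fc: float    # log fold change, tumor vs normal
    p_value: float   # P value from the differential expression test


def classify_del(result: DEResult, p_cut: float = 0.01,
                 lfc_cut: float = 2.0) -> Optional[str]:
    """Return 'up' or 'down' for a DEL, or None if the cutoffs are not met."""
    if result.p_value >= p_cut or abs(result.log_fc) < lfc_cut:
        return None
    return "up" if result.log_fc > 0 else "down"


# Hypothetical rows; real input would be the edgeR result table.
rows = [
    DEResult("lncA", 3.1, 0.0004),   # passes both cutoffs, up-regulated
    DEResult("lncB", -2.5, 0.002),   # passes both cutoffs, down-regulated
    DEResult("lncC", 1.2, 0.0001),   # fold change too small
    DEResult("lncD", 2.8, 0.03),     # P value too large
]
calls = {r.gene: classify_del(r) for r in rows}
```

Applied to the full result table, counting the "up" and "down" calls would correspond to the 772 and 220 DELs reported above.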

Prognostic model based on expression levels of 3-lncRNA combination

Sixty-three lncRNAs were associated with the survival of GC by univariate COX regression analysis (Supplementary Table 2). The lncRNA-seq expression profile of the 63 lncRNAs and the clinical data were extracted for LASSO regression (Figure 2A). As a result of the LASSO penalized uni-Cox model, 22 lncRNAs were finally identified (Figure 2B) and incorporated into the multivariate COX model to identify independent prognostic factors associated with survival. As a result of LASSO and COX model, AC007991.4 (ENSG00000254287, antisense), AC079385.3 (ENSG00000257918, antisense), and AL109615.2 (ENSG00000231881, lincRNA) were finally identified (Figure 2C and D).
Figure 2

Least absolute shrinkage and selection operator and COX regression screened prognosis associated long noncoding RNAs. A: Least absolute shrinkage and selection operator coefficient values of the 22 prognosis-related long noncoding RNAs in The Cancer Genome Atlas cohort; B: L1-penalty of least absolute shrinkage and selection operator-COX regression; C: Forest plot showing the correlations between the 22 long noncoding RNAs and the survival of gastric cancer patients in The Cancer Genome Atlas; D: AC007991.4, AC079385.3, and AL109615.2 are all independent prognostic risk factors for gastric cancer.

According to the expression levels of the 3-lncRNA combination, all samples were divided into a high-risk group (red, Figure 3A) and a low-risk group (green, Figure 3A). The survival time in years is shown in Figure 3B (red dots indicate death, and green dots indicate alive). The heatmap shows the expression of the three lncRNAs according to the risk level (Figure 3C). ROC analysis assessed the performance of the 3-lncRNA combination prediction model in predicting survival. The areas under the curves for 3-year and 5-year survival were 0.78 and 0.75, respectively (Figure 3D). The C-index was 0.72 (se = 0.022, 95% confidence interval [CI]: 0.67-0.76), indicating that the model was a good predictor of patient survival. The low-risk group had a longer survival time than the high-risk group (Figure 3E, P < 0.001). Based on this, we obtained a 3-lncRNA combination prognostic model for GC patients, “risk score = -0.92 × AC007991.4 + 1.18 × AC079385.3 + 1.17 × AL109615.2” (cutoff value = 6.58), which was carried forward for further verification.
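The risk score is a simple weighted sum, so it can be sketched directly in Python. The coefficients and the cutoff of 6.58 are taken from the text; everything else is an assumption made for illustration: the helper names, the example expression values, and the convention that a score strictly above the cutoff counts as "high risk" (the article does not state how ties are handled, nor the expression scale to be used).

```python
# Coefficients and cutoff as reported in the article.
COEFFICIENTS = {
    "AC007991.4": -0.92,
    "AC079385.3": 1.18,
    "AL109615.2": 1.17,
}
CUTOFF = 6.58


def risk_score(expression):
    """Weighted sum of the three lncRNA expression values."""
    return sum(coef * expression[gene] for gene, coef in COEFFICIENTS.items())


def risk_group(expression, cutoff=CUTOFF):
    """Assumed rule: scores strictly above the cutoff are 'high risk'."""
    return "high risk" if risk_score(expression) > cutoff else "low risk"


# Hypothetical expression values for two patients (arbitrary units).
patient_a = {"AC007991.4": 1.0, "AC079385.3": 3.0, "AL109615.2": 4.0}
patient_b = {"AC007991.4": 2.0, "AC079385.3": 1.0, "AL109615.2": 1.0}

score_a = risk_score(patient_a)   # -0.92 + 3.54 + 4.68, about 7.30
score_b = risk_score(patient_b)   # -1.84 + 1.18 + 1.17, about 0.51
```

The negative coefficient means that higher AC007991.4 expression lowers the score, whereas higher AC079385.3 or AL109615.2 expression raises it.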
Figure 3

Characteristics of the 3-long noncoding RNA combination in The Cancer Genome Atlas cohort. A: The Cancer Genome Atlas samples arranged according to risk score (the low-risk group, green; the high-risk group, red); B: The Cancer Genome Atlas samples arranged according to survival time in years (red, death; green, alive); C: Heatmap showing the expression of three long noncoding RNAs in samples according to the risk score (blue, low-risk group; pink, high-risk group); D: The receiver operating characteristic curve for evaluating the predictive effectiveness of the model; E: The high-risk group in this model has a worse overall survival. AUC: Area under the curve.


Expression of AC007991.4, AC079385.3, and AL109615.2 in GC

We then tested the expression of AC007991.4, AC079385.3, and AL109615.2 in GC cells and tissues. The results were consistent with those of bioinformatics prediction: AC007991.4 was weakly expressed in both GC cells and tissues (Figure 4A and D), while AC079385.3 and AL109615.2 were overexpressed in GC (Figure 4B, C, E, and F).
Figure 4

Expression of AC007991.4, AC079385.3, and AL109615.2 in gastric cancer tissues and cells. A and D: AC007991.4 is weakly expressed in gastric cancer (GC) cells and tissues; B and E: AC079385.3 is overexpressed in GC cells and tissues; C and F: AL109615.2 is overexpressed in GC cells and tissues. bP < 0.01. GC: Gastric cancer.


Validation of prognostic performance of the 3-lncRNA combination

Clinical data of 200 patients were used to verify the effectiveness of the 3-lncRNA combination prediction model. The disease-free survival (DFS) ranged from 5-91 mo, OS ranged from 7-91 mo, and 126 patients died before the end of follow-up. We then measured the expression of AC007991.4, AC079385.3, and AL109615.2 in these 200 tissue samples and evaluated the correlation between their expression and survival. As shown in Table 1, the Bormann type, tumor size, vessel invasion, perineural invasion, TNM stage, AC079385.3, and AL109615.2 were all associated with poor DFS and OS (P < 0.05). Not surprisingly, “high risk” in the 3-lncRNA combination prediction model was also a poor prognostic factor for DFS (37.66 mo vs 53.91 mo, P < 0.01, Figure 5) and OS (44.46 mo vs 59.27 mo, P < 0.01, Figure 5).
Figure 5

Kaplan-Meier curves for disease-free survival and overall survival. A and B: Disease-free survival (DFS) and overall survival (OS) curves of 200 gastric cancer (GC) patients stratified by AC007991.4 expression (P = 0.05). Overexpression of AC007991.4 was associated with better survival; C and D: DFS and OS curves of 200 GC patients stratified by AC079385.3 expression (P = 0.00). Overexpression of AC079385.3 was associated with poor survival; E and F: DFS and OS curves of 200 GC patients stratified by AL109615.2 expression (P = 0.00 and P = 0.02). Overexpression of AL109615.2 was associated with poor survival; G and H: DFS and OS curves of 200 GC patients stratified by the 3-long noncoding RNA model (P = 0.00). A high risk score in the model was associated with poor survival. DFS: Disease-free survival; OS: Overall survival.

The parameters with P < 0.05 in the univariate analysis were included in the COX proportional hazards model. TNM stage remained a strong independent prognostic factor [DFS: P = 0.00, hazard ratio (HR) = 106.50, 95%CI: 30.76-368.77; OS: P = 0.00, HR = 94.08, 95%CI: 30.14-293.65, Table 2]. “High risk” was also an independent predictor of poor prognosis for both DFS (P = 0.03, HR = 2.38, 95%CI: 1.34-4.23) and OS (P = 0.02, HR = 2.62, 95%CI: 1.44-4.77). In addition, overexpression of AC079385.3 also predicted poor survival in GC (DFS: P = 0.04, HR = 1.67, 95%CI: 1.46-1.99; OS: P = 0.04, HR = 1.72, 95%CI: 1.49-1.96). Its biological function in GC warrants further study.
Table 2

Multivariate analysis of significant prognostic factors for survival in gastric cancer patients

                      Disease-free survival            Overall survival
Variable              P value  HR       95%CI          P value  HR       95%CI
Bormann type          0.20     1.26     0.90-1.77      0.13     1.29     0.93-1.91
Tumor size            0.41     0.86     0.59-1.24      0.40     0.85     0.59-1.24
Vessel invasion       0.06     0.70     0.48-1.01      0.16     0.76     0.53-1.11
Perineural invasion   0.24     0.80     0.56-1.16      0.09     0.73     0.50-1.05
TNM stage             0.00     106.50   30.76-368.77   0.00     94.08    30.14-293.65
Model                 0.04     2.27     1.46-3.99      0.03     2.64     1.74-4.27

HR: Hazard ratio; CI: Confidence interval; TNM: Tumor-node-metastasis.

Correlations between clinicopathological characteristics and 3-lncRNA combination prediction model

According to the 3-lncRNA combination prediction model and the expression of lncRNAs, we obtained the risk score (risk score = -0.92 × AC007991.4 + 1.18 × AC079385.3 + 1.17 × AL109615.2). Patients were classified as “high risk” or “low risk” according to the cutoff value. The risk group was associated with the Bormann type (P = 0.00, χ² = 14.29) and TNM stage (P = 0.01, χ² = 10.85) of GC (Table 3).
Table 3

Three-long noncoding RNA model and clinicopathologic parameters

                          Model
Characteristic            High risk (%)   Low risk (%)    P value  χ²
Age, median, yr                                           0.11     0.74
  ≥ 60                    52 (48.6)       55 (51.4)
  < 60                    43 (46.2)       50 (53.8)
Gender                                                    0.27     1.23
  Male                    71 (50.0)       71 (50.0)
  Female                  24 (41.4)       34 (58.6)
Bormann type                                              0.00     14.29
  I                       2 (13.3)        13 (86.7)
  II                      36 (40.9)       52 (59.1)
  III                     56 (59.6)       38 (40.4)
  IV                      1 (33.3)        2 (66.7)
Tumor size                                                0.06     3.64
  ≥ 5 cm                  48 (55.2)       39 (44.8)
  < 5 cm                  47 (41.6)       66 (58.4)
Location                                                  0.77     0.52
  Upper                   23 (52.3)       21 (47.7)
  Middle                  26 (46.4)       30 (53.6)
  Low                     46 (46.0)       54 (54.0)
Lauren type                                               0.82     0.39
  Intestinal              45 (47.4)       50 (52.6)
  Mixed                   23 (51.1)       22 (48.9)
  Diffuse                 27 (45.0)       33 (55.0)
Tumor differentiation                                     0.20     1.72
  Moderate and well       27 (40.9)       39 (59.1)
  Poor                    68 (50.7)       66 (49.3)
Vessel invasion                                           0.54     0.38
  Yes                     30 (50.8)       29 (49.2)
  No                      65 (46.1)       76 (53.9)
Perineural invasion                                       0.06     3.52
  Yes                     30 (58.8)       21 (41.2)
  No                      65 (43.6)       84 (56.4)
TNM stage                                                 0.01     10.85
  I                       13 (31.0)       29 (69.0)
  II                      18 (38.3)       29 (61.7)
  III                     63 (57.8)       46 (42.2)
  IV                      1 (50.0)        1 (50.0)

TNM: Tumor-node-metastasis.
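To show how a χ² value in Table 3 relates to the underlying counts, the following Python sketch computes the Pearson χ² statistic (without continuity correction) for a contingency table and applies it to the Gender row. The function name `pearson_chi2` is hypothetical, and whether the authors' software applied exactly this variant of the test is an assumption; for the Gender counts it gives the tabulated value of 1.23.

```python
def pearson_chi2(table):
    """Pearson chi-square statistic for an r x c table of observed counts.

    No continuity correction is applied (an assumption about the original
    analysis, which does not state the variant used).
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    return chi2


# Gender row of Table 3: [high risk, low risk] for male and female patients.
gender_counts = [[71, 71], [24, 34]]
gender_chi2 = pearson_chi2(gender_counts)
print(round(gender_chi2, 2))  # 1.23
```

The same function accepts tables with more than two rows, such as the four-level Bormann type or TNM stage rows.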

DISCUSSION

LncRNAs are a relatively recently characterized class of molecules that regulate gene expression. With the expansion of research on lncRNA function, growing evidence demonstrates that many lncRNAs are aberrantly expressed in cancer and are involved in regulating cancer progression. Moreover, their known functions are increasingly diverse. As “scaffolds”, they promote interactions between proteins. As “guides”, lncRNAs direct proteins to specific genes[13]. As “enhancers”, lncRNAs control transcription of nearby genes. As “decoys”, lncRNAs bind to microRNAs or proteins[14,15]. Because of the critical regulatory role of lncRNAs in tumorigenesis and development, more and more lncRNAs have been recognized as biomarkers for GC treatment and prognosis[12,16]. MAFG-AS1 promoted GC cell proliferation and invasion and might be a valuable prognostic biomarker[16]. LncRNA pcsk2-2:1 could be packaged into serum exosomes and acted as a diagnostic biomarker for GC[17]. Fattahi et al[18] summarized the role of well-known lncRNAs such as H19, HOTAIR, UCA1, and PVT1 as molecular markers in GC. However, these studies only discussed the predictive value of a single biomarker, which may not be sufficient to predict the prognosis of GC. In the present study, clinical data and RNA-seq data of GC patients were obtained from TCGA as a training set. The patients’ data of our center served as the verification set. P < 0.01 and |logFC| ≥ 2 were used as the cutoff points to obtain the DELs. To improve the regression model’s prediction accuracy, LASSO and COX regression analyses were applied to examine the correlation between the expression of lncRNAs and GC patients’ survival. Compared with previous studies[17], this is, to our knowledge, the first application of this method to GC-related lncRNA markers; it effectively minimizes the overfitting caused by univariate COX regression[19,20]. LASSO is a regression method used in statistics and machine learning[21,22].
Compared with the traditional model, LASSO improves prediction accuracy and interpretability through variable selection and regularization. LASSO is closely related to ridge regression and best subset selection, and its coefficient estimates can be described in terms of soft thresholding[23,24]. Using this approach, we obtained a 3-lncRNA combination prediction model: AC007991.4, AC079385.3, and AL109615.2. They were all potential independent prognostic factors for GC. According to the expression levels of the 3-lncRNA combination, all samples were divided into a high-risk group and a low-risk group. The model predicted poor prognosis for patients in the high-risk group in both the training and validation sets. Subsequently, we verified the expression of these three lncRNAs in GC tissues and cells. AC007991.4 was weakly expressed in GC. However, both AC079385.3 and AL109615.2 were potential onco-lncRNAs. AC007991.4 and AC079385.3 are both antisense lncRNAs, located at chromosome 8: 39918076-39920890 and chromosome 12: 106714924-106733066, respectively. Their molecular functions have not been reported. AL109615.2, a long intergenic non-coding RNA located at chromosome 6: 44058792-44089288, could competitively bind miR-133b with vascular endothelial growth factor C to induce colorectal cancer cell metastasis[25]. According to the expression of AC007991.4, AC079385.3, and AL109615.2, GC patients were divided into “high risk” and “low risk” groups. Both the area under the ROC curve and the C-index suggested that the model has appropriate prediction performance. Moreover, the verification set confirmed that the prediction model was an independent risk factor for the prognosis of GC. In addition, it was positively correlated with Bormann type and TNM stage in GC. It is also worth pointing out that AC079385.3 was a prognostic risk factor for GC in both the training and validation sets, suggesting that it might play a pivotal role in the model. Its molecular biological function and position in the development of GC require further study.
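The link between LASSO and soft thresholding mentioned in the discussion can be illustrated with a short Python sketch. For an orthonormal design, the LASSO coefficient is the soft-thresholded least-squares coefficient, and coordinate-descent solvers apply the same operator in each update. The function name `soft_threshold`, the penalty value, and the toy coefficients below are hypothetical; this is not the glmnet implementation used in the study.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam; values within [-lam, lam] become 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


# Hypothetical unpenalized coefficients for three candidate lncRNAs.
unpenalized = [1.25, -0.25, -2.0]
penalty = 0.5
shrunk = [soft_threshold(z, penalty) for z in unpenalized]
print(shrunk)  # [0.75, 0.0, -1.5]
```

The second coefficient is set exactly to zero, which is how the L1 penalty performs variable selection, whereas a ridge penalty would only shrink it.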

CONCLUSION

In conclusion, we present a 3-lncRNA model for evaluating survival in GC patients, which may be an independent prognostic factor. Clinicians can obtain the expression levels of AC007991.4, AC079385.3, and AL109615.2 in tissue samples by RT-PCR and calculate the corresponding risk values (risk score = -0.92 × AC007991.4 + 1.18 × AC079385.3 + 1.17 × AL109615.2, cutoff value = 6.58). The risk value can then be used to evaluate the prognosis of patients. In addition, the model can be visualized together with other related risk factors to assess its effectiveness in future clinical work.

ARTICLE HIGHLIGHTS

Research background

Gastric cancer (GC) is one of the most frequently diagnosed gastrointestinal cancers throughout the world. It is necessary to identify a multi-long noncoding RNA (lncRNA) prognostic model for GC.

Research motivation

Abnormal expression of lncRNAs may indirectly reflect the occurrence and development of GC. As genes do not usually act alone, it is necessary to select suitable lncRNAs and establish a multi-lncRNA prediction model.

Research objectives

To construct a multi-lncRNA combination model to predict the prognosis of gastric cancer patients.

Research methods

The RNA-seq dataset and clinical dataset of GC in The Cancer Genome Atlas were used in this study. The least absolute shrinkage and selection operator and COX models were used to identify prognosis-associated lncRNAs. Clinical data of 200 patients were used to evaluate the clinical significance of the multi-lncRNA combination model via survival analysis.

Research results

We found a 3-lncRNA combination prediction model: AC007991.4, AC079385.3, and AL109615.2. It could effectively predict the prognosis of GC. AC079385.3 was found to be a prognostic risk factor for GC, and it may play an important role in the development of GC. Least absolute shrinkage and selection operator improved prediction accuracy and interpretability through variable selection and regularization.

Research conclusions

The 3-lncRNA combination model (risk score = -0.92 × AC007991.4 + 1.18 × AC079385.3 + 1.17 × AL109615.2) is an independent prognostic factor for GC.

Research perspectives

Clinicians can obtain the expression levels of AC007991.4, AC079385.3, and AL109615.2 in tissue samples by real-time reverse transcription-polymerase chain reaction and calculate the corresponding risk values.
  23 in total

1.  Discovery and annotation of long noncoding RNAs.

Authors:  John S Mattick; John L Rinn
Journal:  Nat Struct Mol Biol       Date:  2015-01       Impact factor: 15.369

2.  Lasso Proteins: Modular Design, Cellular Synthesis, and Topological Transformation.

Authors:  Yajie Liu; Wen-Hao Wu; Sumin Hong; Jing Fang; Fan Zhang; Geng-Xin Liu; Jongcheol Seo; Wen-Bin Zhang
Journal:  Angew Chem Int Ed Engl       Date:  2020-08-24       Impact factor: 15.336

3.  Hypoxia-induced LncRNA PCGEM1 promotes invasion and metastasis of gastric cancer through regulating SNAI1.

Authors:  J Zhang; H Y Jin; Y Wu; Z C Zheng; S Guo; Y Wang; D Yang; X Y Meng; X Xu; Y Zhao
Journal:  Clin Transl Oncol       Date:  2019-01-28       Impact factor: 3.405

4.  Cancer statistics, 2020.

Authors:  Rebecca L Siegel; Kimberly D Miller; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2020-01-08       Impact factor: 508.702

Review 5.  The rise of regulatory RNA.

Authors:  Kevin V Morris; John S Mattick
Journal:  Nat Rev Genet       Date:  2014-04-29       Impact factor: 53.242

6.  LncRNA GClnc1 Promotes Gastric Carcinogenesis and May Act as a Modular Scaffold of WDR5 and KAT2A Complexes to Specify the Histone Modification Pattern.

Authors:  Tian-Tian Sun; Jie He; Qian Liang; Lin-Lin Ren; Ting-Ting Yan; Ta-Chung Yu; Jia-Yin Tang; Yu-Jie Bao; Ye Hu; Yanwei Lin; Danfeng Sun; Ying-Xuan Chen; Jie Hong; Haoyan Chen; Weiping Zou; Jing-Yuan Fang
Journal:  Cancer Discov       Date:  2016-05-04       Impact factor: 39.397

7.  LNMAT1 promotes lymphatic metastasis of bladder cancer via CCL2 dependent macrophage recruitment.

Authors:  Changhao Chen; Wang He; Jian Huang; Bo Wang; Hui Li; Qingqing Cai; Feng Su; Junming Bi; Hongwei Liu; Bin Zhang; Ning Jiang; Guangzheng Zhong; Yue Zhao; Wen Dong; Tianxin Lin
Journal:  Nat Commun       Date:  2018-09-20       Impact factor: 14.919

8.  Elevated DKK1 expression is an independent unfavorable prognostic indicator of survival in head and neck squamous cell carcinoma.

Authors:  Haihe Gao; Lisha Li; Mang Xiao; Yongwei Guo; Yi Shen; Lixin Cheng; Ming Tang
Journal:  Cancer Manag Res       Date:  2018-10-30       Impact factor: 3.989

9.  Prognostic value of hypoxia-inducible factor-1 alpha and prolyl 4-hydroxylase beta polypeptide overexpression in gastric cancer.

Authors:  Jun Zhang; Yue Wu; Yu-Hang Lin; Shuai Guo; Pei-Fang Ning; Zhi-Chao Zheng; Yue Wang; Yan Zhao
Journal:  World J Gastroenterol       Date:  2018-06-14       Impact factor: 5.742

10.  Exosomal Long Non-Coding RNA CEBPA-AS1 Inhibits Tumor Apoptosis and Functions as a Non-Invasive Biomarker for Diagnosis of Gastric Cancer.

Authors:  Hai-Yan Piao; Shuai Guo; Yue Wang; Jun Zhang
Journal:  Onco Targets Ther       Date:  2020-02-14       Impact factor: 4.147
