Literature DB >> 28389671

Identification of a six-lncRNA signature associated with recurrence of ovarian cancer.

Kai Yang1, Yan Hou1,2, Ang Li1, Zhenzi Li1, Wenjie Wang1, Hongyu Xie1, Zhiwei Rong1, Ge Lou3, Kang Li4.   

Abstract

Ovarian cancer (OvCa) is the leading cause of death among all gynecological malignancies, and recurrent OvCa is almost always incurable. In this study, we developed a signature based on long non-coding RNAs (lncRNAs) associated with OvCa recurrence to facilitate personalized OvCa therapy. lncRNA expression data were extracted from GSE9891 and GSE30161. LASSO (least absolute shrinkage and selection operator) penalized regression was used to identify an lncRNA-based signature using the GSE9891 training cohort. The signature was then validated in GSE9891 internal and GSE30161 external validation cohorts. The Database for Annotation, Visualization and Integrated Discovery (DAVID) was used to explore the possible functions of identified lncRNAs. A six-lncRNA signature (RUNX1-IT1, MALAT1, H19, HOTAIRM1, LOC100190986 and AL132709.8) was identified in the training cohort and validated in internal and external validation cohorts using the LASSO method (P < 0.05). This signature was also independent of other clinical factors according to multivariate and sub-group analyses. The identified lncRNAs are involved in cancer-related biological processes and pathways. We selected a highly reliable signature based on six lncRNAs associated with OvCa recurrence. This six-lncRNA signature is a promising method to personalize ovarian cancer therapy and may improve patient quality of life quality according to patients' condition in the future.

Entities:  

Year:  2017        PMID: 28389671      PMCID: PMC5429632          DOI: 10.1038/s41598-017-00763-y

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Ovarian cancer (OvCa) is a common gynecological malignancy and the commonest cause of gynecological cancer-associated deaths in developed countries[1]. It is estimated that there will be 22,280 new cases and 14,240 deaths attributed to OvCa in the United States in 2016[2]. Although there is a high initial response rate to standard surgery and chemotherapy, most OvCa patients will develop recurrence within 18 months after standard first-line treatment[3]. More seriously, recurrent OvCa usually develops into platinum-resistant disease and is almost always incurable[4]. Therefore, stratification of patients to identify high-risk patients may provide more effective treatment strategies and personalized therapies. Long non-coding RNA (lncRNA) is a class of non-coding RNAs that are longer than 200 nucleotides in length[5]. Increasing studies have showed that abnormal expression of lncRNAs is associated with human cancers and that some play important roles in a variety of biological processes in cancer. Currently, several lncRNA-based signatures have been identified as prediction of patient survival in several cancers, such as gastric cancer[6], lung cancer[7], breast cancer[8], colorectal cancer[9] and esophageal squamous cell cancer[10]. Recent studies also indicated that lncRNAs were associated with OvCa recurrence and survival[11]. For example, lncRNAs CCAT2, HOTAIR, AB073614, and ANRIL have been demonstrated to be associated with poor prognosis of OvCa[12-15]. A recent study identified an eight-lncRNA signature associated with overall survival (OS) of OvCa based on The Cancer Genome Atlas (TCGA)[16]. In this study, we used LASSO (least absolute shrinkage and selection operator) penalized regression to identify a six-lncRNA signature associated with OvCa recurrence based on a training cohort. Then we validated it in internal and external validation cohorts, analyzed it in sub-groups of OvCa patients, and demonstrated this signature was independent from other clinical factors. We also analyzed the correlation between the signature and OS of OvCa patients. Furthermore, we found that these lncRNAs were involved in biological processes (e.g., cell adhesion, inflammatory response and immune response) and pathways (e.g., ECM-receptor interaction, focal adhesion and cell adhesion molecules) in cancers. Thus, our six-lncRNA signature may be a promising method to stratify OvCa patients and identify those at high-risk in the future.

Results

Demographic and clinical characteristics

The detailed demographic and clinical characteristics are listed in Supplementary Table S1. A total of 311 OvCa patients were included in our study, including 100 in the GSE9891 training cohort, 157 in the GSE9891 internal validation cohort and 54 in the GSE30161 external validation cohort. The median ages (ranges) of the three cohorts were 58 (23–80), 60 (33–80), and 62 (38–84) years, respectively. Seventy-four (74%), 111 (71%), and 48 (89%) patients relapsed during follow-up, respectively. The tumor stage, tumor grade and histology type are also summarized in Supplementary Table S1.

Identification of lncRNA signature and generation of risk score

To identify lncRNAs associated with OvCa recurrence, LASSO penalized regression was performed using lncRNA expression data. This method can select an optimal subset of lncRNAs without collinearity by imposing a penalty and shrinking most regression coefficients to zero. After 100 times of 10-fold cross validation, the optimal tuning parameter lambda1 was 16.9841 in our study. As a result, the regression coefficients of six lncRNAs were not zero when lambda1 was 16.9841, and we selected these six lncRNAs as signatures associated with OvCa recurrence (Table 1). We also conducted univariate cox regression for the six lncRNAs, in which the regression coefficients were consistent with that in LASSO penalized regression, and all six lncRNAs were statistically significant (P < 0.05). Of these six lncRNAs, five were positively associated with DFS (high expression of these lncRNAs led to a high-risk score and shorter survival) and one was negatively associated with DFS (the high expression of this lncRNA led to a low-risk score and longer survival).
Table 1

Overall information of six prognostic lncRNAs associated with DFS in GSE9891 training cohort (n = 100).

ProbeGene symbolChromosomal locationCa Cb Pb HRb
220918_atRUNX1-IT1Chr21q22.120.10780.5589<0.00011.749
223940_x_atMALAT1Chr11q13.10.07510.15140.04101.163
224646_x_atH19Chr11p15.50.10830.16250.00041.176
228642_atHOTAIRM1Chr7p15.20.11100.27760.00111.320
235167_atLOC100190986Chr16p12.2−0.0155−0.32170.00610.725
242856_atAL132709.8Chr14q32.310.01950.29750.00161.346

aDerived from the LASSO penalized regression in 100 patients of GSE9891 training cohort. bDerived from the univariate Cox proportional hazards regression in 100 patients of GSE9891 training cohort.

Abbreviations: C Coefficient, P P value, HR Hazard Ratio.

Overall information of six prognostic lncRNAs associated with DFS in GSE9891 training cohort (n = 100). aDerived from the LASSO penalized regression in 100 patients of GSE9891 training cohort. bDerived from the univariate Cox proportional hazards regression in 100 patients of GSE9891 training cohort. Abbreviations: C Coefficient, P P value, HR Hazard Ratio. To identify low- and high-risk patients for OvCa recurrence, we developed a prognostic model based on the expression of six lncRNAs and their regression coefficient in LASSO penalized regression as follows: Risk score = (0.1078*RUNX1-IT1) + (0.0751*MALAT1) + (0.1083*H19) + (0.111*HOTAIRM1) − (0.0155*LOC100190986) + (0.0195*AL132709.8). The patients were then divided into low- and high-risk groups according to the median risk score value (2.9232). As a result, the patients in the low-risk group had a better survival outcome than the high-risk group, as shown in Fig. 1a (P < 0.0001). The area under the curve (AUC) of time-dependent ROC curves for the risk score was 0.813 at three years (Fig. 1c). These results demonstrated a good performance of our six-lncRNA signature in predicting DFS for OvCa patients. Risk scores and relative expression levels of all patients are shown in Fig. 1b,d.
Figure 1

Association between the six-lncRNA signature and DFS of OvCa patients in GSE9891 training cohort. (a) K-M curve of DFS between low- and high-risk patients. (b) Risk scores of each patient in the GSE9891 training cohort (sorted by risk score). (c) Time-dependent ROC curve analysis of the DFS prediction based on the risk score with three years as the time point. (d) Expression heat map of the six lncRNAs in OvCa patients in the GSE9891 training cohort (sorted by risk score).

Association between the six-lncRNA signature and DFS of OvCa patients in GSE9891 training cohort. (a) K-M curve of DFS between low- and high-risk patients. (b) Risk scores of each patient in the GSE9891 training cohort (sorted by risk score). (c) Time-dependent ROC curve analysis of the DFS prediction based on the risk score with three years as the time point. (d) Expression heat map of the six lncRNAs in OvCa patients in the GSE9891 training cohort (sorted by risk score).

Validation of the six-lncRNA signature in the GSE9891 internal validation and entire cohorts

To confirm the ability of the six-lncRNA signature in predicting DFS for OvCa patients, we validated it in the GSE9891 internal validation and entire cohorts. The same risk formula and cutoff value were used to calculate risk scores and divide the patients into low- and high-risk groups. The results of the two cohorts were consistent with the GSE9891 training cohort. Patients with higher risk scores had poorer DFS. The differences of survival curves between the two groups were statistically significant in two cohorts (Fig. 2a and Supplementary Fig. S1a). The AUCs of the two cohorts were 0.665 and 0.697, respectively (Fig. 2c and Supplementary Fig. S1c). Risk scores and relative expression levels of all patients in the two cohorts are separately shown in Fig. 2b,d and Supplementary Fig. S1b,d.
Figure 2

Association between the six-lncRNA signature and DFS of OvCa patients in GSE9891 internal validation cohort. (a) K-M curve of DFS between low- and high-risk patients. (b) Risk scores of each patient in the GSE9891 internal validation cohort (sorted by risk score). (c) Time-dependent ROC curve analysis of the DFS prediction based on the risk score with three years as the time point. (d) Expression heat map of six lncRNAs in OvCa patients in the GSE9891 internal validation cohort (sorted by risk score).

Association between the six-lncRNA signature and DFS of OvCa patients in GSE9891 internal validation cohort. (a) K-M curve of DFS between low- and high-risk patients. (b) Risk scores of each patient in the GSE9891 internal validation cohort (sorted by risk score). (c) Time-dependent ROC curve analysis of the DFS prediction based on the risk score with three years as the time point. (d) Expression heat map of six lncRNAs in OvCa patients in the GSE9891 internal validation cohort (sorted by risk score).

Further validation of the six-lncRNA signature in the GSE30161 external validation cohort

We further validated our findings in GSE30161 external validation cohort. The same risk formula was used to calculate risk scores for every OvCa patient. The median risk score value in GSE30161 (2.2672) was used to divide the patients into low- and high-risk scores. Similar to the result in GSE9891, the patients in low-risk group also had a better survival outcome than those in the high-risk group, as shown in Fig. 3a (P = 0.0114). The AUC of time-dependent ROC curves for the risk score was 0.711 at three years (Fig. 3c). Risk scores and relative expression levels of all patients are also shown in Fig. 3b,d.
Figure 3

Association between six-lncRNA signature and DFS of OvCa patients in GSE30161 external validation cohort. (a) K-M curve of DFS between low- and high-risk patients. (b) Risk scores of each patient in GSE9891 internal validation cohort (sorted by risk score). (c) Time-dependent ROC curve analysis of the DFS prediction based on the risk score with three years as the time point. (d) Expression heat map of six lncRNAs in OvCa patients in the GSE30161 external validation cohort (sorted by risk score).

Association between six-lncRNA signature and DFS of OvCa patients in GSE30161 external validation cohort. (a) K-M curve of DFS between low- and high-risk patients. (b) Risk scores of each patient in GSE9891 internal validation cohort (sorted by risk score). (c) Time-dependent ROC curve analysis of the DFS prediction based on the risk score with three years as the time point. (d) Expression heat map of six lncRNAs in OvCa patients in the GSE30161 external validation cohort (sorted by risk score).

Sub-group analysis of six-lncRNA signature in GSE9891 and GSE30161

We then analyzed the six-lncRNA signature in sub-groups of OvCa patients. As shown in Fig. 4, the differences of K-M survival curves between the low- and high-risk patients were statistically significant (P < 0.05) for late-stage (Fig. 4b), low-grade (Fig. 4c), high-grade (Fig. 4d) OvCa patients in the GSE9891 and low-grade (Fig. 4e) OvCa patients in the GSE30161. For the early-stage patients in GSE9891 (Fig. 4a) and high-grade patients in GSE30161 (Fig. 4f), the differences of K-M survival curves were not statistically significant, but patients in the low-risk group still tended to have a better DFS. The ROCs of these sub-groups were in Supplementary Fig. S2.
Figure 4

Sub-group analysis of association between six-lncRNA signature and DFS of OvCa patients. (a) K-M curve of early-stage OvCa patients in GSE9891. (b) K-M curve of late-stage OvCa patients in GSE9891. (c) K-M curve of low-grade OvCa patients in GSE9891. (d) K-M curve of high-grade OvCa patients in GSE9891. (e) K-M curve of low-grade OvCa patients in GSE30161. (f) K-M curve of high-grade OvCa patients in GSE30161.

Sub-group analysis of association between six-lncRNA signature and DFS of OvCa patients. (a) K-M curve of early-stage OvCa patients in GSE9891. (b) K-M curve of late-stage OvCa patients in GSE9891. (c) K-M curve of low-grade OvCa patients in GSE9891. (d) K-M curve of high-grade OvCa patients in GSE9891. (e) K-M curve of low-grade OvCa patients in GSE30161. (f) K-M curve of high-grade OvCa patients in GSE30161.

Independence of six-lncRNA signature and other clinical factors

To assess whether the six-lncRNA signature was independent of other clinical factors, univariate and multivariate cox regression analyses were conducted in each patient cohort including risk scores for the six lncRNAs, age, tumor stage, tumor grade and histology type. In univariate cox regression, risk score and tumor stage were significantly associated with DFS. After adjusting for age, tumor stage, tumor grade and histology type, risk score still maintained a significant correlation with DFS in all GSE9891 and GSE30161 cohorts (Table 2).
Table 2

Univariate and multivariate cox regression analyses of DFS in GSE9891 and GSE30161.

Univariate analysisMultivariate analysis
VariableCPHR95% CI of HRCPHR95% CI of HR
GSE9891 training cohort (N = 100)
Risk score1.667 <0.0001 5.2962.925–9.5891.6849 <0.0001 5.3922.84–10.237
Age−0.00620.62230.9940.970–1.019−0.00150.91290.9990.973–1.025
Stage1.72760.01705.6271.362–23.2511.44650.05544.2480.967–18.656
Grade−0.00520.98280.9950.620–1.5950.16330.53081.1770.707–1.962
Histology subtype0.50310.28111.6540.663–4.128−0.11930.80880.8880.338–2.332
GSE9891 internal validation cohort (N = 157)
Risk score0.79 0.0016 2.2031.351–3.5940.5924 0.0255 1.8081.075–3.041
Age0.01830.06991.0180.999–1.0390.01520.15421.0150.994–1.037
Stage1.70950.00025.5262.246–13.5971.55780.00094.7481.899–11.871
Grade0.20270.30861.2250.829–1.809−0.11770.56330.8890.596–1.325
Histology subtype1.50480.01034.5031.426–14.220.8460.16062.330.715–7.597
GSE9891 entire cohort (N = 257)
Risk score1.0096 <0.0001 2.7451.902–3.9620.9028 <0.0001 2.4661.689–3.601
Age0.00870.26181.0090.994–1.0240.0060.45871.0060.99–1.022
Stage1.7306<0.00015.6442.642–12.0581.6227<0.00015.0672.337–10.985
Grade0.12450.41511.1330.839–1.528−0.05240.73840.9490.698–1.291
Histology subtype0.97680.0072.6561.305–5.4040.31330.39981.3680.66–2.837
GSE30161 entire cohort (N = 54)
Risk score1.1528 0.0309 3.1671.112–9.0221.5104 0.0114 4.5281.405–14.597
Age0.00370.80291.0040.975–1.0340.00640.70461.0060.974–1.041
Stage
Grade0.43460.17651.5440.822–2.90.70090.04292.0151.023–3.972
Histology subtype0.31310.44561.3680.612–3.0571.27880.02833.5921.146–11.26

Abbreviations: C Coefficient, P P value, HR Hazard Ratio, CI Confidence Interval.

Univariate and multivariate cox regression analyses of DFS in GSE9891 and GSE30161. Abbreviations: C Coefficient, P P value, HR Hazard Ratio, CI Confidence Interval.

Correlation between the six-lncRNA signature and OS

In addition to the analyses of association between the signature and OvCa recurrence, we also explored the correlation between the signature and OS. In accordance with the results of DFS, patients with higher risk scores had poorer OS. The differences of survival curves between the two groups were statistically significant in GSE9891 training, GSE9891 entire and GSE30161 external validation cohorts (Supplementary Fig. S3a,c,d). The differences of K-M survival curves were not statistically significant in GSE9891 internal validation cohort (P = 0.0612), but patients in the low-risk group tended to have a better OS (Supplementary Fig. S3b). After adjusting for age, tumor stage, tumor grade and histology subtype, risk score still maintained a significant correlation with OS in GSE9891 training and entire cohorts. The patients in the low-risk group in GSE9891 internal validation and GSE30161 external validation cohorts still tended to have a better OS (P < 0.08) (Supplementary Table S2).

Functional characteristics of six lncRNAs

We further analyzed the possible functions associated with the six identified lncRNAs in OvCa by GO and KEGG functional enrichment analysis using the DAVID tool. Using Satterthwaite t-test and FDR correction, 3814 genes were found to be differentially expressed between the low- and high-risk groups. These DEGs were considered as genes associated with six lncRNAs. After functional annotation in DAVID, 68 GO biological process (BP) terms, 27 GO cellular component (CC) terms, 18 GO molecular function (MF) terms, and 6 KEGG pathways were significantly enriched with FDR < 0.05 (Supplementary Tables S3–S6). The enriched BP terms were involved in some important biological processes in cancer, such as blood vessel development, inflammatory response and immune response (Table 3). The enriched KEGG pathways were also involved in cancer-related pathways, such as ECM-receptor interaction, focal adhesion and cell adhesion molecules (Table 3). An interaction network of significant BP terms with similar function showed the six lncRNAs were mainly associated with inflammatory response, immune system process, cell migration, cell adhesion, angiogenesis and extracellular matrix organization (Fig. 5). Most of the enriched GO terms and KEGG pathways have been found to be closely associated with OvCa initiation and progression, which could increase the credibility of the six-lncRNA signature from a biological perspective.
Table 3

Top 10 significant GO BP terms and KEGG pathways enriched with DEGs in OvCa.

RankBiological ProcessFDRKEGG pathwayFDR
1Cell adhesion1.10E-24ECM-receptor interaction7.30E-10
2Biological adhesion1.40E-24Focal adhesion4.00E-09
3Response to wounding2.80E-15Cell adhesion molecules (CAMs)7.50E-04
4Vasculature development2.10E-13Leukocyte transendothelial migration2.60E-03
5Blood vessel development2.30E-12Cytokine-cytokine receptor interaction2.70E-03
6Inflammatory response5.10E-10Chemokine signaling pathway4.50E-02
7Immune response6.30E-10
8Extracellular matrix organization1.40E-09
9Blood vessel morphogenesis3.00E-09
10Regulation of response to external stimulus3.30E-09
Figure 5

Interaction network of significant GO biological process terms with similar functions associated with six lncRNAs. Red nodes represent GO biological process terms. Node size is proportional to the total number of DEGs in that term. Thickness of green lines is proportional to the shared DEGs of two connected terms.

Top 10 significant GO BP terms and KEGG pathways enriched with DEGs in OvCa. Interaction network of significant GO biological process terms with similar functions associated with six lncRNAs. Red nodes represent GO biological process terms. Node size is proportional to the total number of DEGs in that term. Thickness of green lines is proportional to the shared DEGs of two connected terms.

Discussion

OvCa is the sixth commonest cause of female cancer death in developed countries[1]. Approximately 75% of women with OvCa present with late-stage disease, most (78.79% in this study) of whom will develop recurrence[4, 17]. Treatment of OvCa is based on a combination of surgery and chemotherapy, and standard treatment for late-stage OvCa combined platinum-based chemotherapy with cytoreductive surgery can achieve a good result[18]. However, for most late-stage patients, cancer recurrence seems to be inevitable. Moreover, recurrent OvCa is often insensitive to chemotherapy and is generally incurable[4]. Thus, it is of great importance to identify low- and high-risk patients to enable improved treatment. Once we can identify high-risk patients, we can adopt a more effective personalized treatment strategy, such as novel drugs and targeted therapies, to delay the recurrence of cancer and improve patient quality of life[18]. In our study, LASSO penalized regression was used to identify six lncRNAs associated with OvCa recurrence. LASSO penalized regression is optimized to high-dimensional data, which can select a few of the most influential variables and avoid collinearity of variables at the same time. These properties are not available in many other univariate and multivariate methods. lncRNA may be superior biomarkers in cancer as it is a new area of molecular biology, does not encode proteins, and may have better specificity[19]. In addition to the inherent virtues of LASSO and lncRNAs, we also validated the six-lncRNA signature in internal and external cohorts, and certified its credibility from a biological perspective. The signature also showed a great ability to stratify OvCa patients into low- and high-risk subgroup with significantly different overall survival. Further analysis, including sub-group analysis and adjustment of clinical factors, indicated that the six-lncRNA signature could still predict recurrence in most sub-groups and were independent of other clinical factors, including age, tumor stage, tumor grade and histology type. It is well known that the prognosis of late-stage OvCa patients is worse than early-stage OvCa patients (Fig. 4a,b). In our study, the six-lncRNA signature could identify all high-risk patients with late-stage OvCa in different cohorts (Figs 3a and 4b). This could be of great use to improve the prognosis of these severe cancer patients. We also explored the possible functions of these genes. Although the exact mechanism of most of the six lncRNAs was unclear, the lncRNAs were still closely related with cancer, as described in the literature, which strengthened the reliability and possibility of our six-lncRNA signature as a predictor of OvCa recurrence. In addition to the reports in the literature, our functional enrichment analysis of the six lncRNA-related genes may also shed new light on the possible functions of these lncRNAs in OvCa. RUNX1-IT1, MALAT1, H19, and HOTAIRM1 have been widely associated with cancer in recent years. RUNX1-IT1 is an oncogenic lncRNA that can promote tumor progression and metastasis[20]. RUNX1-IT1 was overexpressed in non-responder chronic myeloid leukemia[21]. MALAT1 was found to be upregulated in a variety of human cancers, such as lung cancer, breast cancer, prostate cancer, colon cancer, and liver cancer[22-24]. Some studies showed this lncRNA was involved in the regulation of cell mobility[25], which was consistent with our findings about cell migration (Fig. 5). H19 has been widely linked to oncogenesis, although the exact mechanism remains unclear[26]. H19 is a precursor of mir-675, which down-regulates tumor suppressor genes in cancer[27]. H19 is also up-regulated in basal cell cancer compared with normal skin specimens[28], and is associated with poor prognosis in breast cancer[29]. However, it should be note that H19 is down-regulated in high-risk patients in our study, which is inconsistent with these studies. HOTAIRM1 is a kind of lncRNA that plays an important role in the development of immune cells[30]. HOTAIRM1 is also reported to be a tumor suppressor by affecting a series of genes related to cell proliferation in colon cancer[31]. There were still relatively few studies describing LOC100190986 and AL132709.8 in cancer research. LOC100190986 is associated with HCV genotype 1b transfection in the HepG2 cell line[32]. AL132709.8 was up-regulated in neural precursor cells from patients of lethal congenital contracture syndrome[33]. The selected six lncRNAs were highly reliable from the perspective of both statistics and biology, since we conducted rigorous internal validation, external validation, and biological explanation. However, our study has some limitations. First, we used different cutoff values for GSE9891 and GSE30161. The overall expression levels of lncRNAs in GSE30161 were lower than that of GSE9891. The main reason for this phenomenon was the different experimental conditions in different labs. We can solve this problem by using a rigorous experimental procedure and adopting a unique cut-off value in the future. Second, the differences of K-M survival curves between low- and high-risk patients were not statistically significant for early-stage patients in GSE9891 (Fig. 4a) and high-grade patients in GSE30161 (Fig. 4f), although patients in the low-risk group still tended to have better DFS. This may be because the sample sizes of these sub-groups were too small (28 cases and 29 cases, respectively). Hence further studies are needed to validate the signature in these groups. We applied the LASSO method to high-dimensional lncRNA expression data and identified a six-lncRNA signature that was highly associated with OvCa recurrence. This signature was validated in internal and external validation cohorts and was independent of other clinical factors. Furthermore, from a literature review and our functional analysis, we found that the six lncRNAs were closely related with cancer. Thus, this six-lncRNA signature may be a promising method to personalize OvCa therapy and improve patient quality of life according to patients’ condition in the future.

Methods

OvCa patient dataset and clinical information

Microarray data for GSE9891 and GSE30161 were downloaded from the Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/geo/)[34, 35]. Clinical information for OvCa patients in these data were extracted from the R curatedOvarianData Bioconductor package[36]. The microarray data were measured by the Affymetrix Human Genome U133A Plus 2.0 Array microarray platform. Borderline tumor patients and patients without days to tumor recurrence were excluded from this study. As a result, 257 patients in GSE9891 and 54 patients in GSE30161 were enrolled in this study. The OvCa patients in GSE9891 were randomly divided into a training cohort (n = 100) and an internal validation cohort (n = 157). Additionally, the OvCa patients in GSE30161 were analyzed as an external validation cohort (n = 54).

Microarray data preprocessing and lncRNA acquisition

The downloaded microarray data for GSE9891 and GSE30161 were normalized using the robust multi-array average (RMA) method[37]. The probe set IDs of lncRNA were acquired from the study of Zhang et al.[38]. Briefly, the probe set IDs were mapped to the NetAffx Annotation Files. Based on the Refseq transcripts ID and/or Ensembl gene ID in the annotation files, non-coding RNAs were retained and other types of non-coding RNA except lncRNA were then filtered. Finally, 2446 lncRNAs with corresponding probe set IDs were generated in our study.

Signature generation and statistical analysis

LASSO penalized regression was conducted to select the lncRNAs associated with OvCa recurrence[39]. The optimal tuning parameter lambda1 was chosen after 100 times of 10-fold cross validation. The function for the selection of lambda1 was “optL1” (fold = 10), and the function for LASSO penalized regression was “penalized” (lambda1 = lambda1). Other parameters of the functions were set to default values. A risk score was generated using the sum of lncRNA expression values weighted by the coefficients from LASSO penalized regression[40]. The OvCa patients were then divided into low- and high-risk groups according to the median risk score. The association of risk score, clinical factors, disease-free survival (DFS) and overall survival (OS) were assessed by univariate and multivariate cox regression. Kaplan-Meier (K-M) survival curves were used to estimate DFS and OS for patients in the low- and high-risk groups, and the DFS and OS differences between the two groups were assessed using the log-rank test. Time-dependent receiver operating characteristic (ROC) curve analysis with three years as the time point was used to compare the sensitivity and specificity of the DFS prediction based on the risk score[41]. A heat map was used to present the relative expression levels of lncRNAs in this study. Satterthwaite t-test was performed to determine the significance of each gene, and the corresponding false discovery rate (FDR) value was estimated for correcting multiple comparisons. Differentially expressed genes (DEGs) were selected as FDR < 0.05. Functional annotation of DEGs for Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways was performed using the Database for Annotation, Visualization and Integrated Discovery (version 6.7, DAVID, https://david.ncifcrf.gov/) tool[42]. Significant GO biological process terms with similar function were visualized as interaction networks using the Enrichment Map plugin in Cytoscape[43, 44]. LASSO penalized regression, time-dependent ROC curve analysis and heat map analyses were conducted on penalized[39], survivalROC[41] and gplots[45] packages, respectively, in the R platform. Univariate and multivariate cox regression and log-rank test were performed using SAS (version 9.3, SAS Institute, Cary, NC, USA). K-M survival curves and scatterplots of risk score were performed in GraphPad Prism (version 5.0, Graphpad Software, San Diego, CA, USA)[46]. All other statistical analyses were performed in the R platform (version 3.3.2). All statistical tests were two-sided and a P value of less than 0.05 was considered statistically significant. Supplemental information revision
  43 in total

1.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data.

Authors:  Rafael A Irizarry; Bridget Hobbs; Francois Collin; Yasmin D Beazer-Barclay; Kristen J Antonellis; Uwe Scherf; Terence P Speed
Journal:  Biostatistics       Date:  2003-04       Impact factor: 5.899

2.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

Review 3.  The H19 locus: role of an imprinted non-coding RNA in growth and development.

Authors:  Anne Gabory; Hélène Jammes; Luisa Dandolo
Journal:  Bioessays       Date:  2010-06       Impact factor: 4.345

4.  Long Noncoding RNA C21orF96 Promotes the Migration, Invasion and Lymph Node Metastasis in Gastric Cancer.

Authors:  Ziguo Yang; Qiaoming Zhi; Dan Wang; Li Zhang; Burnley Preston; Coder Brandon; Yuting Kuang; Ruizhen Miao; Yulong Shi; Xiaobo Guo
Journal:  Anticancer Agents Med Chem       Date:  2016       Impact factor: 2.505

5.  Cancer statistics, 2016.

Authors:  Rebecca L Siegel; Kimberly D Miller; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2016-01-07       Impact factor: 508.702

6.  Neural precursor cells from a fatal human motoneuron disease differentiate despite aberrant gene expression.

Authors:  Niklas Pakkasjärvi; Laura Kerosuo; Heidi Nousiainen; Massimiliano Gentile; Juha Saharinen; Satu Suhonen; Hannu Sariola; Leena Peltonen; Marjo Kestilä; Kirmo Wartiovaara
Journal:  Dev Neurobiol       Date:  2007-02-15       Impact factor: 3.964

Review 7.  An overview of long non-coding RNAs in ovarian cancers.

Authors:  Matthieu Meryet-Figuière; Bernard Lambert; Pascal Gauduchon; Nicolas Vigneron; Emilie Brotin; Laurent Poulain; Christophe Denoyelle
Journal:  Oncotarget       Date:  2016-07-12

8.  Long non-coding RNA in cancer.

Authors:  Nina Hauptman; Damjan Glavač
Journal:  Int J Mol Sci       Date:  2013-02-26       Impact factor: 5.923

Review 9.  Non-protein coding RNA biomarkers and differential expression in cancers: a review.

Authors:  Massimo Mallardo; Palmiro Poltronieri; Oscar Fernando D'Urso
Journal:  J Exp Clin Cancer Res       Date:  2008-07-16

10.  Comprehensive analysis of lncRNA expression profiles reveals a novel lncRNA signature to discriminate nonequivalent outcomes in patients with ovarian cancer.

Authors:  Meng Zhou; Yanying Sun; Yifan Sun; Wanying Xu; Zhaoyue Zhang; Hengqiang Zhao; Zhaohua Zhong; Jie Sun
Journal:  Oncotarget       Date:  2016-05-31
View more
  24 in total

1.  Advancing Pan-cancer Gene Expression Survial Analysis by Inclusion of Non-coding RNA.

Authors:  Bo Ye; Jianxin Shi; Huining Kang; Olufunmilola Oyebamiji; Deirdre Hill; Hui Yu; Scott Ness; Fei Ye; Jie Ping; Jiapeng He; Jeremy Edwards; Ying-Yong Zhao; Yan Guo
Journal:  RNA Biol       Date:  2019-10-18       Impact factor: 4.652

Review 2.  Long Noncoding RNAs as Innovative Urinary Diagnostic Biomarkers.

Authors:  Giulia Brisotto; Roberto Guerrieri; Francesca Colizzi; Agostino Steffan; Barbara Montico; Elisabetta Fratta
Journal:  Methods Mol Biol       Date:  2021

3.  LncRNA RUNX1-IT1 affects the differentiation of Th1 cells by regulating NrCAM transcription in Graves' disease.

Authors:  Feng-Jiao Huang; Yan-Ling Liu; Jiao Wang; Ying-Ying Zhou; Shui-Ying Zhao; Gui-Jun Qin
Journal:  Cell Cycle       Date:  2022-02-27       Impact factor: 5.173

Review 4.  Role of lncRNAs in ovarian cancer: defining new biomarkers for therapeutic purposes.

Authors:  Manish K Tripathi; Kyle Doxtater; Fatemeh Keramatnia; Chidi Zacheaus; Murali M Yallapu; Meena Jaggi; Subhash C Chauhan
Journal:  Drug Discov Today       Date:  2018-04-23       Impact factor: 7.851

5.  CTD-2020K17.1, a Novel Long Non-Coding RNA, Promotes Migration, Invasion, and Proliferation of Serous Ovarian Cancer Cells In Vitro.

Authors:  Linfei Zhu; Qixuan Guo; Xinxin Lu; Junhua Zhao; Jinxin Shi; Zhenning Wang; Xin Zhou
Journal:  Med Sci Monit       Date:  2018-03-05

6.  Developing a genetic signature to predict drug response in ovarian cancer.

Authors:  Stephen Hyter; Jeff Hirst; Harsh Pathak; Ziyan Y Pessetto; Devin C Koestler; Rama Raghavan; Dong Pei; Andrew K Godwin
Journal:  Oncotarget       Date:  2017-12-26

Review 7.  MALAT1: a druggable long non-coding RNA for targeted anti-cancer approaches.

Authors:  Nicola Amodio; Lavinia Raimondi; Giada Juli; Maria Angelica Stamato; Daniele Caracciolo; Pierosandro Tagliaferri; Pierfrancesco Tassone
Journal:  J Hematol Oncol       Date:  2018-05-08       Impact factor: 17.388

8.  Expression profiles analysis of long non-coding RNAs identified novel lncRNA biomarkers with predictive value in outcome of cutaneous melanoma.

Authors:  Xu Ma; Zhijuan He; Ling Li; Daping Yang; Guofeng Liu
Journal:  Oncotarget       Date:  2017-09-08

9.  A six-long non-coding RNAs signature as a potential prognostic marker for survival prediction of ER-positive breast cancer patients.

Authors:  Lei Zhong; Ge Lou; Xinglu Zhou; Youyou Qin; Lin Liu; Wenqian Jiang
Journal:  Oncotarget       Date:  2017-07-01

10.  Expression profiles analysis reveals an integrated miRNA-lncRNA signature to predict survival in ovarian cancer patients with wild-type BRCA1/2.

Authors:  Liyuan Guo; Yan Peng; Yuanyuan Meng; Yunduo Liu; Shangshang Yang; Hong Jin; Qi Li
Journal:  Oncotarget       Date:  2017-07-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.