Literature DB >> 30594555

Development and validation of an immune gene-set based Prognostic signature in ovarian cancer.

Sipeng Shen1, Guanrong Wang2, Ruyang Zhang1, Yang Zhao1, Hao Yu3, Yongyue Wei1, Feng Chen4.   

Abstract

BACKGROUND: Ovarian cancer (OV) is the most lethal gynecological cancer in women. We aim to develop a generalized, individualized immune prognostic signature that can stratify and predict overall survival for ovarian cancer.
METHODS: The gene expression profiles of ovarian cancer tumor tissue samples were collected from 17 public cohorts, including 2777 cases totally. Single sample gene set enrichment (ssGSEA) analysis was used for the immune genes from ImmPort database to develop an immune-based prognostic score for OV (IPSOV). The signature was trained and validated in six independent datasets (n = 519, 409, 606, 634, 415, 194).
FINDINGS: The IPSOV significantly stratified patients into low- and high-immune risk groups in the training set and in the 5 validation sets (HR range: 1.71 [95%CI: 1.32-2.19; P = 4.04 × 10-5] to 2.86 [95%CI: 1.72-4.74; P = 4.89 × 10-5]). Further, we compared IPSOV with nine reported ovarian cancer prognostic signatures as well as the clinical characteristics including stage, grade and debulking status. The IPSOV achieved the highest mean C-index (0.625) compared with the other signatures (0.516 to 0.602) and clinical characteristics (0.555 to 0.583). Further, we integrated IPSOV with stage, grade and debulking, which showed improved prognostic accuracy than clinical characteristics only.
INTERPRETATION: The proposed clinical-immune signature is a promising biomarker for estimating overall survival in ovarian cancer. Prospective studies are needed to further validate its analytical accuracy and test the clinical utility. FUND: This work was supported by National Key Research and Development Program of China, National Natural Science Foundation of China and Natural Science Foundation of the Jiangsu Higher Education Institutions of China.
Copyright © 2019 The Authors. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Gene expression; Immune; Ovarian cancer; Prognostic signature

Mesh:

Substances:

Year:  2018        PMID: 30594555      PMCID: PMC6412087          DOI: 10.1016/j.ebiom.2018.12.054

Source DB:  PubMed          Journal:  EBioMedicine        ISSN: 2352-3964            Impact factor:   8.143


List of Abbreviations

Introduction

Ovarian cancer (OV) is the most lethal gynecological cancer among women, with >14,000 estimated new deaths in United States, 2018 [1]. Biomarkers especially gene expression in tumor tissues are reliably related to cancer prognosis and survival [[2], [3], [4]]. Thus, identification of the subset of patients with worse survival and higher mortality is needed for additional clinical therapy. The availability of large-scale public cohorts with gene expression data and the well-developed biological database bring the opportunity to identify a more generalized prognostic signature with biological background for ovarian cancer. Immune system has been shown to be a determining factor during cancer initiation and progression [5,6]. Various evidence have proved that ovarian cancer are immunogenic tumors [[7], [8], [9]] and immunotherapy is strongly pursued through targeting on the immune checkpoints [10,11]. In addition, previous study has preliminary confirmed the prognostic value of immune system in ovarian cancer [12]. Thus, the immune based prognostic signature remains a potential to be applied in ovarian cancer. In this study, we integrated multiple cohorts with gene expression which contained 2777 cases totally to develop and validate an individualized gene-set based prognostic signature for ovarian cancer from immune-related genes. With sufficient validation of 5 independent datasets, we proved the model stability and reliability. Further, to leverage the complementary value of clinical characteristics, we performed an integrated analysis to improve the predicted accuracy for overall survival.

Methods

Study population and eligibility criteria

We retrospectively collected the ovarian cancer gene expression profiles of frozen primary tumor tissue samples from 17 public microarray datasets, including 16 cohorts from the Gene Expression Omnibus (GEO) repository and 1 from The Cancer Genome Atlas ovarian cancer (TCGA-OV) cohort. All patients had received primary debulking surgery. Only patients with available follow-up time, status and gene expression data were included. The main outcome of our study was overall survival. Staging was assessed in accordance to the International Federation of Gynecology and Obstetrics (FIGO) stage system. Optimal debulking was defined as ≤1 cm of gross residual disease while sub-optimal debulking was defined as >1 cm of residual disease. We did not limit histology type, FIGO stage, grade, debulking and neoadjuvant chemotherapy status for sample collection. The sample quality control details were described in Table S1. The study characteristics of the 17 public ovarian cancer datasets were described in Table S1. Finally, 2777 ovarian cancer cases were included in our study. The study design and workflow were provided in Fig. 1.
Fig. 1

Flowchart of the study. 17 public ovarian cancer datasets containing 2777 cases were included and categorized into 6 independent datasets according to the microarray platform. We developed the IPSOV in the training set and validated in the other 5 datasets. Further, we integrated IPSOV with stage, grade and debulking status to improve the prognostic value.

Flowchart of the study. 17 public ovarian cancer datasets containing 2777 cases were included and categorized into 6 independent datasets according to the microarray platform. We developed the IPSOV in the training set and validated in the other 5 datasets. Further, we integrated IPSOV with stage, grade and debulking status to improve the prognostic value.

Gene expression data preprocessing

For GEO datasets, all microarray data and clinical information were download from GEO repository (https://www.ncbi.nlm.nih.gov/geo/). To increase the statistical power and leverage cohorts with small sample size sufficiently, we combined the cohorts with the same microarray platform. Only the platforms with sample size ≥ 150 were included in the study. For TCGA cohort, due to the samples with available RNA-Sequencing gene expression quantification were relatively small (n = 374), we collected Affymetrix Human Genome U133A Array microarray data instead (n = 519). All level-2 data were download from the Genomic Data Commons Data Portal (https://portal.gdc.cancer.gov/). Finally, six independent datasets were included according to the different platforms from GEO and TCGA (Table 1). Entrez IDs were used to represent genes across different platform. In the independent validation phase, if multiple probe sets correspond to the same Entrez ID in the validation sets, the one with the highest mean signal was selected as the expression level of the corresponding gene [13].
Table 1

Summary of the eight datasets included in the study.

DatasetPlatformSample sizeIncluded cohortsa
Training setAffymetrix Human Genome U133A Array519TCGA
Validation set 1Affymetrix Human Genome U133A Array409GSE14764, GSE23554, GSE26712, GSE3149
Validation set 2Affymetrix Human Genome U133 Plus 2.0 Array606GSE18520, GSE19829, GSE26193, GSE30161, GSE63885, GSE9891
Validation set 3Agilent-014850 Whole Human Genome Microarray 4x44K G4112F634GSE17260, GSE32062, GSE53963, GSE73614
Validation set 4Operon human v3 ~35 K 70-mer two-color oligonucleotide microarrays415GSE13876
Validation set 5ABI Human Genome Survey Microarray Version 2194GSE49997

We used ComBat to adjust the batch effects between different cohorts within the same platform. Gene expression values of all probes were adjusted in each dataset, respectively.

Summary of the eight datasets included in the study. We used ComBat to adjust the batch effects between different cohorts within the same platform. Gene expression values of all probes were adjusted in each dataset, respectively. The batch effects between different datasets within the same platform were adjusted by ComBat method [14]. Gene expression values of all probes were adjusted in each dataset, respectively. We also adjusted the batch effects within the TCGA cohort according to the tissue source site code. All the gene expression levels had been logarithmic transformed before batch effects adjustment.

Immune-related genes definition

We constructed a prognostic signature from the immune-related genes, which were downloaded from the ImmPort database (http://www.immport.org) [15]. It includes 17 immune categories according to different molecular function, such as antimicrobials, cytokine, interleukins, T-cell receptor signaling pathway, B-cell receptor signaling pathway, TNF family receptors.

Development of the immune-based prognostic signature for OV (IPSOV)

We performed a three-stage strategy to develop the signature. First, we used the TCGA cohort as the training set to screen the survival related probes. 1905 microarray probes for 1109 immune genes were included totally. The Cox proportional-hazards model was used to assess their association with overall survival with adjustment for age, stage, grade and debulking status. We excluded the probes with adjusted P values larger than 0.05. To estimate the population specific immune infiltration, we used single sample gene set enrichment analysis (ssGSEA) that define a enrichment score to represent the degree of absolute enrichment of a gene set in each sample within a given dataset [16]. Normalized enrichment scores could be calculated for each immune category. The ssGSEA analysis were performed in R package GSVA. In the last step, we develop a prognostic signature named the immune-based prognostic signature for OV (IPSOV) to combine the effects of each immune category in the training set. The coefficients of each category were determined by multivariable Cox regression model: , where S is the ssGSEA score for ith immune category.

Validation of the IPSOV

To get a uniform cutoff value to categorize the patients into low-risk and high-risk group, we performed a normalization for the gene expression values in each dataset with mean value = 0 and standard deviation (SD) = 1. After developing the signature, the prognostic score of IPSOV was further calculated in the 5 validation datasets, respectively. In the multivariable Cox regression, age, FIGO stage, grade, debulking status and study site were included as covariates.

Pathway enrichment analysis for the molecular function

To further understand the gene function in IPSOV, we performed a pathway enrichment analysis for the genes based on Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) database including biological process (BP), molecular function (MF) and cellular component (CC). The P values were adjusted by False-Discovery Rate (FDR) method for multiple comparison. All the analysis were performed by R package clusterProfiler [17].

Existing prognostic signatures for comparison

To evaluate the survival classification and prediction ability of IPSOV, we retrospectively collected 9 published prognostic signatures for comparison, which included from 7 genes to 300 genes (Table S7). Continuous prognostic scores were calculated for each signature. The P values of continuous scores in univariable Cox model and the overall concordance statistic (C-index) were compared in the six datasets.

Statistical analysis

Continuous variables were summarized as mean ± SD, and categorized variables were described by frequency (n) and proportion (%). In the survival analysis, associations between characteristics and overall survival were evaluated by Cox proportional hazard models. Kaplan-Meier survival curves were drawn and compared among subgroups using log-rank tests. The C-index and restricted mean survival (RMS) curve were estimated using R package survival and survRM2, while C-index was compared by R package compareC [18]. Statistical analyses were performed using R version 3.4.0 (The R Foundation). P values were two-sided, and P < 0.05 was considered statistically significant.

Results

Development and definition of the IPSOV

A total of 2777 patients with ovarian cancer (mean age ± SD, 59.7 ± 11.8; 12.8% early-stage [I, II], 87.2% late-stage [III, IV]; 43.5% optimal debulking, 56.5% sub-optimal debulking) from 6 independent datasets were included in the analysis (Table 2). In the training set, 156 probes of 129 genes from 15 immune categories were associated with overall survival among the 1905 immune-related microarray probes. Further, we used these probes to perform the ssGSEA analysis for each patient to calculate the immune enrichment score of each immune category. The IPSOV was defined as the combined effect of scores in different categories using the coefficients generated from multivariable Cox regression model (Table S3). The median score in the training set (−0.23) was used as the cutoff to stratify patients into low- and high-immune risk group across all the datasets.
Table 2

Demographic and clinic characteristic descriptions for ovarian cancer patients in different datasets.

CharacteristicsaTraining setValidation set 1Validation set 2Validation set 3Validation set 4Validation set 5
Number of samples519409606634415194
Median survival time (month) (95% CI)44.5 (41–48.3)51 (45–65)41.9 (37.8–47.8)53 (49–62)24 (21−30)>49b
Number of Death (%)284 (54.9)232 (56.7)346 (57.1)364 (57.1)302 (72.7)57 (29.3)
Age (years)59.8 ± 11.462.9 ± 10.662.6 ± 11.257.9 ± 12.257.6 ± 11.8



Histology type (%)
Serous520 (100)96 (88.8)516 (85.5)534 (83.8)415 (100)171 (88.1)
Endometroid06 (5.5)41 (6.8)66 (10.3)00
Clear cell02 (1.8)20 (3.3)37 (5.8)00
Mucinous009 (1.4)000
Other04 (3.7)17 (2.8)0023 (11.8)



FIGO stage (%)
I15 (2.9)8 (4.1)41 (7.4)32 (5.0)0
II25 (4.8)2 (1.0)111 (20.0)23 (3.6)9 (4.6)
III396 (76.7)164 (84.1)364 (65.8)462 (72.6)154 (79.3)
IV80 (15.5)21 (10.7)37 (6.6)119 (18.7)31 (15.9)



Grade (%)
15 (0.9)9 (4.0)32 (5.8)26 (4.0)0
264 (12.5)86 (38.9)80 (14.6)204 (32.0)0
3439 (86.2)125 (56.5)394 (72.0)324 (50.8)143 (100)
41 (0.1)1 (0.2)41 (7.4)83 (13.0)0



Debulking status (%)
Optimal337 (73.1)193 (54.0)209 (66.7)271 (51.4)
Sub-optimal124 (26.9)164 (45.9)104 (33.2)256 (48.5)

Sum of frequency numbers may not equal to the total sample size due to missing values.

The median survival time is incalculable due to the mortality at the last follow-up time is <50%.

Demographic and clinic characteristic descriptions for ovarian cancer patients in different datasets. Sum of frequency numbers may not equal to the total sample size due to missing values. The median survival time is incalculable due to the mortality at the last follow-up time is <50%. The IPSOV significantly stratified patients into low- and high-immune risk groups in the training set (hazard ratio [HR] = 1.82; 95% confidence interval [CI]: 1.43–2.30; P = 9.19 × 10−7) and in the 5 validation sets (HR range: 1.66 [95%CI: 1.27–2.15; P = 1.84 × 10−4] to 2.36 [95%CI: 1.38–4.00; P = 0.001]) (Table S4). In addition, it remained as an independent prognostic factor in the multivariable Cox model, after adjusting for age, stage, grade and debulking status (HR range: 1.71 [95%CI: 1.32–2.19; P = 4.04 × 10−5] to 2.86 [95%CI: 1.72–4.74; P = 4.89 × 10−5]) (Fig. 2). In the meta-analysis for all datasets, the high-immune risk group showed a 1.96-fold higher risk than the low-immune risk group (HR = 1.96; 95%CI: 1.73–2.23; P = 2.37 × 10−25) (Fig. S1). The IPSOV distribution with survival status in the combined dataset was shown in Fig. 3. Significant RMS time ratio (1.24 to 1.88) was observed in the 6 datasets, respectively (Table 3).
Fig. 2

Kaplan-Meier survival analyses of IPSOV. (a) Patients are stratified into low- (red) and high-immune (blue) risk groups with a cutoff of the median value in the training set. (b-f) Further, the prognostic signature IPSOV is validated in five independent validation sets. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Fig. 3

IPSOV distribution with survival status in the combined dataset. Upper half panel: IPSOV distribution with patient survival status. The X axis is sorted by IPSOV values. Red color indicates the patients are dead while blue color indicates survive. Lower half panel: Heatmap showing the corresponding 15 immune categories enrichment scores. The score of each immune category is normalized to mean = 0 and standard deviation = 1. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Table 3

Restricted mean survival (RMS) time ratio between low- and high-IPSOV groups in different data sets.

DatasetNLow-IPSOVNHigh-IPSOVRMSLow-IPSOV (95%CI)aRMSHigh-IPSOV (95%CI)aRMS ratio (95%CI)bP
Training set25825857.58 (52.61–62.55)42.73 (39.02–46.45)1.35 (1.19–1.52)<0.001
Validation set 119621384.51 (72.62–96.41)61.82 (52.53–71.11)1.37 (1.11–1.67)0.003
Validation set 218541260.52 (54.77–66.28)46.40 (42.95–49.84)1.31 (1.15–1.47)<0.001
Validation set 341322392.42 (83.49–101.35)60.77 (51.99–69.56)1.52 (1.27–1.81)<0.001
Validation set 417923670.18 (60.61–79.75)37.17 (31.17–43.18)1.88 (1.52–2.33)<0.001
Validation set 51098540.50 (37.80–43.20)32.75 (29.00–36.51)1.24 (1.08–1.41)0.002

RMS time: months.

RMS ratio = RMSLow-IPSOV/ RMSHigh-IPSOV.

Kaplan-Meier survival analyses of IPSOV. (a) Patients are stratified into low- (red) and high-immune (blue) risk groups with a cutoff of the median value in the training set. (b-f) Further, the prognostic signature IPSOV is validated in five independent validation sets. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) IPSOV distribution with survival status in the combined dataset. Upper half panel: IPSOV distribution with patient survival status. The X axis is sorted by IPSOV values. Red color indicates the patients are dead while blue color indicates survive. Lower half panel: Heatmap showing the corresponding 15 immune categories enrichment scores. The score of each immune category is normalized to mean = 0 and standard deviation = 1. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) Restricted mean survival (RMS) time ratio between low- and high-IPSOV groups in different data sets. RMS time: months. RMS ratio = RMSLow-IPSOV/ RMSHigh-IPSOV.

Subgroup and sensitivity analysis for IPSOV

To evaluate the model stability in different clinical subgroups, we performed a sensitivity analysis according to age, histology, stage, grade and debulking status. IPSOV was significant in most subgroups and had a relatively stable HRs except endometrioid and early-stage patients, which might due to the insufficient sample size (Fig. S2). This indicated IPSOV was potentially independent of clinical characteristics. Further, we found the 129 genes were mostly involved in antimicrobials (33.3%), cytokine receptors (32.6%) and cytokines (29.5%) immune process (Table S2). We calculated the corresponding subgroup immune scores based on ssGSEA method in each immune process. The patients with low immune scores had a significant longer median survival time in each process (Fig. 4A).
Fig. 4

(a) Immune scores are calculated based on the coefficients of IPSOV in antimicrobials, cytokines and cytokine receptors immune processes. We dichotomize the scores into low- and high-immune risk group by the median value. Median survival time is compared using log-rank test. (b) Mean C-index of IPSOV, stage, grade, debulking and 9 reported signatures. (c) P value comparison of IPSOV and 9 reported signatures. Red block indicates the model is significant (P ≤ 0.05) while black indicates unsignificant (P > 0.05). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

(a) Immune scores are calculated based on the coefficients of IPSOV in antimicrobials, cytokines and cytokine receptors immune processes. We dichotomize the scores into low- and high-immune risk group by the median value. Median survival time is compared using log-rank test. (b) Mean C-index of IPSOV, stage, grade, debulking and 9 reported signatures. (c) P value comparison of IPSOV and 9 reported signatures. Red block indicates the model is significant (P ≤ 0.05) while black indicates unsignificant (P > 0.05). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) To test the robustness of IPSOV, we randomly resampled 500 cases for 10,000 times from the combined dataset. Only 0.17% of P values failed to pass the 0.05 threshold (Fig. S3A). The median C-index was 0.624 with a SD of 0.017, which showed its stable predictive power (Fig. S3B).

Pathway enrichment analysis

Enrichment analysis for the 129 unique genes identified 85 significant KEGG pathways (Padjust < 0.05), such as Cytokine-cytokine receptor interaction, MAPK signaling pathway, Ras signaling pathway, B/T cell receptor signaling pathway and PI3K-Akt signaling pathway (Table S6). Further, pathway analysis based on Gene Ontology identified various significant pathways, including 1034 biological process (BP) pathways, 44 molecular function (MF) pathways and 77 cellular component (CC) pathways, which revealed the abundant biological function background.

Comparison with other signatures and clinical characteristics

We compared model predictive accuracy of IPSOV with clinical characteristics including stage, grade and debulking as well as 9 reported ovarian cancer prognostic signatures. Continuous prognostic scores were calculated from each signature to compare among different datasets. Of the 13 survival predicted factors, IPSOV had a highest mean C-index (0.625) compared with stage (0.564), grade (0.555), debulking (0.583) and other signatures (0.516 to 0.602) (Table S5, Fig. 4B). In addition, we also compared the P-values of continuous prognostic scores in the univariable Cox model. IPSOV was significant in all datasets, outperforming other signatures (from 0/6 to 5/6) (Fig. 4C).

Integrate IPSOV with clinical characteristics

In addition to IPSOV, clinical characteristics including tumor stage, grade and debulking status were also independent prognostic factors, suggesting their complementary value. To further improve predict accuracy, we integrated IPSOV (continuous score) with them using the coefficients generated from the multivariable Cox regression model in the training set as: integrated model = 1.119 × IPSOV+0.335 × stage+0.368 × grade-0.501 × debulking. The integrated model was further validated in validation set 1, 2, 3 where full clinical information in the model was available. Significantly improved estimation of survival was achieved by the continuous score of integrated model relative to clinical characteristics only (C-index: 0.65 vs 0.59 in training set, P = 0.002; 0.73 vs 0.62 in validation set 1, P = 0.047; 0.70 vs 0.64 in validation set 2, P = 0.017; 0.71 vs 0.62 in validation set 3, P = 0.019) (Fig. 5).
Fig. 5

Integration of IPSOV and clinical characteristics. (a) Training set. (b) Validation set 1. (c) Validation set 2. (d) Validation set 3. Restricted mean survival (RMS) curve for IPSOV and integrated model. In addition, we provided the C-index comparison of the model with clinical characteristics only and the integrated model. P-value represents the difference between the two models in term of C-index.

Integration of IPSOV and clinical characteristics. (a) Training set. (b) Validation set 1. (c) Validation set 2. (d) Validation set 3. Restricted mean survival (RMS) curve for IPSOV and integrated model. In addition, we provided the C-index comparison of the model with clinical characteristics only and the integrated model. P-value represents the difference between the two models in term of C-index.

Discussion

Patients suffering from ovarian cancer often display a heterogeneous clinical outcome spanning <5 months to beyond 10 years [19]. In this respect, treatment options are lacking with all the patients being treated with similar drugs. Thus, a prognostic signature with a broad scope of application is needed to accurately identify those cases with refractory disease and worse survival. In this study, using multiple well-established public ovarian cancer cohorts, we developed a prognostic signature based on gene set enrichment and validated it in 5 independent validation datasets which covered most common microarray platforms. The IPSOV could further stratify clinically defined groups of patients (e.g. debulking status, FIGO stage and age) into subgroups with different survival outcomes. To date, numerous studies have developed gene expression signatures to stratify ovarian cancer survival based on different cohorts. However, none of them has been incorporated into clinical practice might due to several limitations. First, the sample size was usually small that lacked sufficient validation to prove model stability. Second, most of them used specific genes to generate the prognostic score, which easily leaded to overfitting and ignored other casual genes [20]. In this study, we performed ssGESA analysis to determine the population specific immune infiltration based on immune pathway gene set level, which is more generic than the normal signature. Accumulating evidence indicates innate and adaptive immune systems make a crucial contribution to cancer initiation and development [[21], [22], [23]]. Moreover, cancer immunotherapy has made a great process and is moving fast that IPSOV may hold great promise for identifying novel molecular targets for it [24]. From the ImmPort database, we found most genes of IPSOV were involved in antimicrobials, cytokines and cytokine receptors processes. Recent pre-clinical and clinical trials demonstrate that adjuvant antimicrobial therapy is beneficial in cancer treatment. Antimicrobial agents can benefit cancer patients by killing oncogenic-related microorganisms and protecting from recurring immunosuppression-induced infection through their direct antiproliferative and cytotoxic effects [25]. Cytokines and cytokine receptors processes play important roles in inflammatory, angiogenesis and chemotaxis processes [26]. They can function to inhibit tumor development and progression [27] and also are proved to be effective in the treatment of cancer [28]. Further, the pathway enrichment analysis adds the evidence for its association with cancer and the clinical application potential. Ras-MAPK signaling pathway is considering playing an important role in multiple cancers and inhibitors for it have shown effectiveness in clinical trials [29]. Inhibitor of PI3K-Akt signaling pathway prevent the growth of ovarian cancer xenografts and potentiate the cytotoxic effects of paclitaxel and cisplatin [30]. In addition, other pathways such as kinase activity [31,32], growth factor [33,34] and ERK1/ERK2 cascade [35] were also involved in cancer cell growth, proliferation, invasion and metastasis and played important roles in ovarian cancer. Our study has several strengths. First, the sample size is much larger than the studies before, which gives the signature sufficient validation and makes it more robust and reliable. Second, compared with the models screened from the whole genome that contain many genes with unknown function or a single candidate gene which shows less prognostic value than an aggregated model [36,37], the immune-related genes of IPSOV have a strong biological background, making it practicable to be applied in clinical adjuvant treatment. Third, IPSOV has a promising survival prediction ability through comparing with other signatures and clinical characteristics. How high of a C-index is needed for a useful prognostic tool depend on the clinical context [38]. Although the absolute value of IPSOV C-index is not very high, it outperforms the traditional clinical factors and improves significantly in the integrated model. Thus, the utility of these tools could better estimate patient prognostic information and stratify into different subgroups to benefit from different treatment. We acknowledge some limitations. First, we tried to include as many cohorts as possible to get a more sufficient validation of our biomarkers. However, gene expression values are subject to sampling bias due to different platforms. Second, the missing rate for the clinical characteristics was high, which decreased the statistical power in multivariable Cox regression analysis and the integrated prognostic model. Third, IPSOV is developed by numerous genes, where further biological experiments are warranted to validate their functions in ovarian cancer.

Conclusion

IPSOV is a promising prognostic biomarker in ovarian cancer which could be used to distinguish and predict patients' survival outcome. Prospective studies are needed to further validate its analytical accuracy for estimating prognoses and to test its clinical utility in individualized management of ovarian cancer.

Ethics approval and consent to participate

This study has been proved by the Nanjing Medical University institutional committee.

Availability of data and material

GEO datasets: https://www.ncbi.nlm.nih.gov/gds/. TCGA dataset: https://portal.gdc.cancer.gov/. The data analysis source code is provided in the github: https://github.com/fennudepenpen/IPSOV.

Consent for publication

All authors have reviewed the manuscript and consented for publication.

Conflicts of interest statements/Financial Disclosure statement

None.
OVovarian cancer
IPSOVimmune-based prognostic signature for OV
HRhazard ratio
CIconfidence interval
GEOGene Expression Omnibus
TCGAThe Cancer Genome Atlas ovarian cancer
FIGOInternational Federation of Gynecology and Obstetrics
ssGSEAsingle sample gene set enrichment analysis
KEGGKyoto Encyclopedia of Genes and Genomes
GOGene Ontology
BPbiological process
MFmolecular function
CCcellular component
FDRFalse-Discovery Rate
C-indexconcordance statistic
RMSrestricted mean survival
  92 in total

1.  Identification of a prognostic immune signature for cervical cancer to predict survival and response to immune checkpoint inhibitors.

Authors:  Si Yang; Ying Wu; Yujiao Deng; Linghui Zhou; Pengtao Yang; Yi Zheng; Dai Zhang; Zhen Zhai; Na Li; Qian Hao; Dingli Song; Huafeng Kang; Zhijun Dai
Journal:  Oncoimmunology       Date:  2019-10-03       Impact factor: 8.110

2.  A prognostic risk model for ovarian cancer based on gene expression profiles from gene expression omnibus database.

Authors:  Wei Fan; Xiaoyun Chen; Ruiping Li; Rongfang Zheng; Yunyun Wang; Yuzhen Guo
Journal:  Biochem Genet       Date:  2022-06-27       Impact factor: 1.890

3.  A transcriptional metabolic gene-set based prognostic signature is associated with clinical and mutational features in head and neck squamous cell carcinoma.

Authors:  Lu Xing; Mingzhu Guo; Xiaoqi Zhang; Xiaoqian Zhang; Feng Liu
Journal:  J Cancer Res Clin Oncol       Date:  2020-02-17       Impact factor: 4.553

4.  Immune-related genes and gene sets for predicting the response to anti-programmed death 1 therapy in patients with primary or metastatic non-small cell lung cancer.

Authors:  Bolin Chen; Min Yang; Kang Li; Jia Li; Li Xu; Fang Xu; Yan Xu; Dandan Ren; Jiao Zhang; Liyu Liu
Journal:  Oncol Lett       Date:  2021-05-19       Impact factor: 3.111

5.  A novel comprehensive immune-related gene signature as a promising survival predictor for the patients with head and neck squamous cell carcinoma.

Authors:  Ruihua Fang; Muhammad Iqbal; Lin Chen; Jing Liao; Jierong Luo; Fanqin Wei; Weiping Wen; Wei Sun
Journal:  Aging (Albany NY)       Date:  2021-04-17       Impact factor: 5.682

6.  A prognostic model based on seven immune-related genes predicts the overall survival of patients with hepatocellular carcinoma.

Authors:  Qian Yan; Wenjiang Zheng; Boqing Wang; Baoqian Ye; Huiyan Luo; Xinqian Yang; Ping Zhang; Xiongwen Wang
Journal:  BioData Min       Date:  2021-05-07       Impact factor: 2.522

7.  Identification and Validation of a Six Immune-Related Genes Signature for Predicting Prognosis in Patients With Stage II Colorectal Cancer.

Authors:  Xianzhe Li; Minghao Xie; Shi Yin; Zhizhong Xiong; Chaobin Mao; Fengxiang Zhang; Huaxian Chen; Longyang Jin; Ping Lan; Lei Lian
Journal:  Front Genet       Date:  2021-05-04       Impact factor: 4.599

8.  Immune signature-based risk stratification and prediction of immune checkpoint inhibitor's efficacy for lung adenocarcinoma.

Authors:  Ming Yi; Anping Li; Linghui Zhou; Qian Chu; Suxia Luo; Kongming Wu
Journal:  Cancer Immunol Immunother       Date:  2021-01-02       Impact factor: 6.968

9.  Establishment and experimental validation of an immune miRNA signature for assessing prognosis and immune landscape of patients with colorectal cancer.

Authors:  Zaoqu Liu; Taoyuan Lu; Yanli Wang; Dechao Jiao; Zhaonan Li; Libo Wang; Long Liu; Chunguang Guo; Yanan Zhao; Xinwei Han
Journal:  J Cell Mol Med       Date:  2021-06-07       Impact factor: 5.310

10.  Bioinformatic Analyses Identify a Prognostic Autophagy-Related Long Non-coding RNA Signature Associated With Immune Microenvironment in Diffuse Gliomas.

Authors:  Shengchao Xu; Lu Tang; Zhixiong Liu; Kui Yang; Quan Cheng
Journal:  Front Cell Dev Biol       Date:  2021-06-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.