Platinum is a commonly used drug for the treatment of ovarian cancer (OC). The aim of the current study was to design and construct a risk score system for predicting the prognosis of patients with OC receiving platinum chemotherapy. The mRNA sequencing data and copy number variation (CNV) information (training set) of patients with OC were downloaded from The Cancer Genome Atlas database. A validation set, GSE63885, was obtained from Gene Expression Omnibus database. The differentially expressed genes (DEGs) and CNV genes (DECNs) between platinum-resistant and platinum-sensitive groups were identified using the limma package. The intersection between DEGs and DECNs were selected. Cox regression analysis was used to identify the genes and clinical factors associated with prognosis. Risk score system assessment and nomogram analysis were performed using the survival and rms packages in R. Gene Set Enrichment Analysis was used to identify the enriched pathways in high and low risk score groups. From 1,144 DEGs and 1,864 DECNs, 48 genes that occurred in the two datasets were selected. A total of six independent prognostic genes (T-box transcription factor T, synemin, tektin 5, growth differentiation factor 3, solute carrier family 22 member 3 and calcium voltage-gated channel subunit α1 C) and platinum response status were revealed to be associated with prognosis. Based on the six independent prognostic genes, a risk score system was constructed and assessed. Nomogram analysis revealed that the patients with the sensitive status and low risk scores had an improved prognosis. Furthermore, the current study revealed that the 574 DEGs identified were involved in eight pathways, including chemokine signaling pathway, toll-like receptor signaling pathway, cytokine-cytokine receptor interaction, RIG I like receptor signaling pathway, natural killer cell mediated cytotoxicity, apoptosis, T cell receptor signaling pathway and Fc ε receptor 1 signaling pathway. The six-mRNA risk score system designed in the present study may be used as prognosis predictor in patients with OC, whereas the nomogram may be valuable for identifying patients with OC who may benefit from platinum chemotherapy.
Platinum is a commonly used drug for the treatment of ovarian cancer (OC). The aim of the current study was to design and construct a risk score system for predicting the prognosis of patients with OC receiving platinum chemotherapy. The mRNA sequencing data and copy number variation (CNV) information (training set) of patients with OC were downloaded from The Cancer Genome Atlas database. A validation set, GSE63885, was obtained from Gene Expression Omnibus database. The differentially expressed genes (DEGs) and CNV genes (DECNs) between platinum-resistant and platinum-sensitive groups were identified using the limma package. The intersection between DEGs and DECNs were selected. Cox regression analysis was used to identify the genes and clinical factors associated with prognosis. Risk score system assessment and nomogram analysis were performed using the survival and rms packages in R. Gene Set Enrichment Analysis was used to identify the enriched pathways in high and low risk score groups. From 1,144 DEGs and 1,864 DECNs, 48 genes that occurred in the two datasets were selected. A total of six independent prognostic genes (T-box transcription factor T, synemin, tektin 5, growth differentiation factor 3, solute carrier family 22 member 3 and calcium voltage-gated channel subunit α1 C) and platinum response status were revealed to be associated with prognosis. Based on the six independent prognostic genes, a risk score system was constructed and assessed. Nomogram analysis revealed that the patients with the sensitive status and low risk scores had an improved prognosis. Furthermore, the current study revealed that the 574 DEGs identified were involved in eight pathways, including chemokine signaling pathway, toll-like receptor signaling pathway, cytokine-cytokine receptor interaction, RIG I like receptor signaling pathway, natural killer cell mediated cytotoxicity, apoptosis, T cell receptor signaling pathway and Fc ε receptor 1 signaling pathway. The six-mRNA risk score system designed in the present study may be used as prognosis predictor in patients with OC, whereas the nomogram may be valuable for identifying patients with OC who may benefit from platinum chemotherapy.
Ovarian cancer (OC) affects the ovaries and may spread to the lungs, peritoneum, liver and lymph nodes (1,2). Early OC may be asymptomatic, while advanced OC is characterized by a loss of appetite, pelvic pain, bloating and abdominal swelling (3). As OC may be difficult to diagnose in the early stages, patients with OC are often diagnosed in the advanced stages of the disease and, as a result, have an unfavorable prognosis (4). Globally, OC affects 1.2 million females and was responsible for 161,100 mortalities in 2015 (5). Among gynecological tumors, OC is the third most common and has the highest mortality (6). The mechanisms underlying OC remain to be elucidated.Phospho-H2AX expression is negatively associated with the disease-free interval in epithelial OC (EOC), and its expression may allow early diagnosis of the disease as well as the identification of patients at high risk of recurrence (7). As a potential oncogene, actinin α 4 may serve as predictive marker of poor prognosis and tumor chemoresistance in patients with advanced OC (8). Excision repair cross-complementation group 5 was revealed to be a prognostic biomarker of OC and a candidate target for the response to platinum chemotherapy treatment of patients with OC (9). Cyclin D1 (CCND1) overexpression is significantly associated with chemosensitivity and results in poor prognosis in patients with advanced serous EOC, therefore, CCND1 may serve as a novel target for treating chemoresistant tumors (10,11). Copy number variations (CNVs) of neurotrophic receptor tyrosine kinase 3 (NTRK3) were associated with platinum-resistant and platinum-sensitive relapses, and NTRK3 amplification predicted the platinum-sensitive recurrence of OC (12). The upregulation and copy number (CN) gain of cyclin E1 were detected in patients with ovarian clear cell carcinoma, and resulted in poor patient outcomes (13). Although the aforementioned genes may influence the prognosis of patients with OC, the pathways affecting OC prognosis remain to be fully elucidated.Platinum-based drugs are effective anticancer agents and are widely used in the treatment of various solid tumors (14). Up to 80% of patients with EOC initially respond while the rest exhibit primary resistance (15). However, ≥80% of patients with EOC receiving platinum-based chemotherapy eventually develop resistance (15). The pathways underlying resistance to platinum drugs in OC have not been fully elucidated (16). An increased understanding of these pathways may serve to prevent or reverse resistance, improving the effect of treatment and the survival rate of patients with OC. The current study analyzed single nucleotide polymorphisms (SNPs) and CNVs in patients with OC using bioinformatics analyses. This resulted in the design of a risk score system, which was associated with platinum response status. The expression levels of several genes were associated with the survival prognosis of patients with OC. The results obtained the current study may provide novel and accurate markers for the diagnosis, prognosis and clinical treatment of patients with OC.
Materials and methods
Microarray data
The mRNA sequencing data from 419 OC tissue samples were analyzed using an Illumina HiSeq 2000 RNA Sequencing platform. SNP and CNV data from 481 OC tissue samples were analyzed using an Affymetrix Genome-Wide SNP Array 6.0 (Affymetrix; Thermo Fisher Scientific, Inc., Waltham, MA, USA). The datasets were downloaded from The Cancer Genome Atlas (TCGA; cancergenome.nih.gov) database. After the barcodes of the samples in the datasets were matched, a total of 230 OC samples with platinum response status and survival time information were obtained. These samples included 69 platinum-resistant samples and 161 platinum-sensitive samples and were used as the training set. The GSE63885 dataset (17) was downloaded from the Gene Expression Omnibus (GEO; http://www.ncbi.nlm.nih.gov/geo) database and analyzed using a GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array platform. This dataset consisted of 101 OC tissue samples, including 75 samples with known platinum response status (34 resistant samples and 41 sensitive samples), and was used as the validation set. The clinical information of the patients from whom the samples in the training and validation sets were obtained are presented in Table I.
Table I.
Clinical information of the ovarian cancer samples in the training set and the validation set.
Training set (n=230)
Validation set (n=75)
Variable
Resistance (n=69)
Sensitive (n=161)
P-value
Resistance (n=34)
Sensitive (n=41)
P-value
Age (years, mean ± SD)
61.77±11.47
58.55±11.32
5.24×10−2
–
–
–
Neoplasm subdivision (bilateral/left/right/-)
51/7/7/4
109/27/14/11
4.18×10−1
–
–
–
Stage (II/III/IV)
1/60/8
10/127/24
2.41×10−1
–
–
–
Histological grade (G2/G3/G4/-)
7/60/1/1
25/133/0/3
1.88×10−1
–
–
–
Lymphatic invasion (yes/no/-)
17/7/45
35/30/96
2.25×10−1
–
–
–
Recurrence (yes/no)
61/8
122/5/34
6.70×10−2
–
–
–
Mortality (dead/alive)
58/11
82/79
1.67×10−6
33/1
33/8
3.50×10−2
Overall survival time (months, mean ± SD)
29.71±14.34
48.59±25.33
1.34×10−11
27.01±14.51
55.89±31.34
2.13×10−6
‘−’ indicates information unavailable; G, grade; SD, standard deviation; n, number of patients.
Data preprocessing and differential expression analysis
The preprocessCore package (18) (version 1.40.0, http://bioconductor.org/packages/release/bioc/html/preprocessCore.html) in R version 3.4.1 (https://www.r-project.org) was used to normalize the mRNA sequencing data downloaded from TCGA database. The PICNIC software (19) (the ‘predict integral copy numbers in cancer’ algorithm, ftp.sanger.ac.uk/pub/cancer) was used to convert the SNP data downloaded from the TCGA database from Cel format to segment format and to obtain CN segment data (the zero segment value indicated no CNV, and the other non-zero signal value indicated loss or amplification mutation in the detection area). Subsequently, a human gene annotation file (GRCh38.p10) (20), was downloaded from the GENCODE database (release 27; http://www.gencodegenes.org/human) and genes were annotated according to the CN area coordinates of each sample. For GSE63885, format conversion, missing value supplement, background correction and data normalization were conducted using the R package oligo (21) (version 1.42.0; www.bioconductor.org/packages/release/bioc/html/oligo.html).Subsequent to dividing the samples in the TCGA dataset into resistant and sensitive, the differentially expressed genes (DEGs) and differentially expressed CNV genes (DECNs) were identified using the R package limma (22) (version 3.34.7; bioconductor.org/packages/release/bioc/html/limma.html) The results with false discovery rate (FDR)<0.05 and |log fold change (FC)|>0.263 were selected. The overlapping genes between DEGs and DECNs were selected. The genes with CNVs [including single nucleotide polymorphism (SNP), insertion (INS) and deletion (DEL)] were chosen for further analyses.
Identification of prognosis-associated genes and clinical factors
The aforementioned selected genes and clinical factors independently associated with prognosis were analyzed using univariate and multivariate Cox regression analysis in the R package survival (23) (version 2.41.1; cran.r-project.org/web/packages/survival/index.html). P<0.05 was considered to indicate a statistically significant difference.
Construction and assessment of the risk score system
Based on the coefficients of the independent prognostic factors in the multivariate Cox regression analysis, a gene-based risk score system was constructed for OC. The risk score of each sample was calculated according to the following formula: Risk score=Σ(coefmRNA × ExpmRNA), where coefmRNA is the prognostic coefficient of the mRNA in the multivariate Cox regression analysis and ExpmRNA is the expression level of the mRNA.Using the median of the risk scores as a cut-off point (24,25), the samples in the training set were classified into high and low risk groups. The association between the risk score and prognosis was evaluated using the Kaplan-Meier (KM) survival curve (26) in the R package survival, and then validated in the validation set. To reveal the association of the risk score and platinum response status, the associations between the risk score and prognosis were evaluated in the resistant and sensitive group separately using univariate and multivariate Cox regression analysis.The association between clinical factors and prognosis was also analyzed in the high and low risk groups. The clinical factors which were significantly associated with prognosis in high and low risk groups were analyzed with KM survival analysis (26).
Association between chemotherapy response and prognosis
The clinical factors significantly associated with prognosis were identified from the samples with chemotherapy response information using the Cox regression analysis in the R package survival (23). A nomogram was subsequently constructed using the R package rms (27) (version 5.1–2; cran.r-project.org/web/packages/rms/index.html). The association between the platinum response status and prognosis of patients with OC was assessed using the probabilities acquired from the nomogram.
Differential expression and pathway enrichment analysis
Using the R package limma (28), the DEGs in the high and low risk groups were analyzed. The genes with FDR<0.05 and |logFC|>0.263 were defined as DEGs. The correlation coefficient of DEGs and risk scores of samples were calculated using Cor function in R. Pathway enrichment analysis for the DEGs was subsequently performed using Gene Set Enrichment Analysis software 3.0 (29) (software.broadinstitute.org/gsea/index.jsp). P<0.05 was considered to indicate a statistically significant difference.
Results
Differential expression analysis
The mRNA expression density distribution and CN signal chromosome display of the TCGA OC dataset are presented in Fig. 1A and B. A total of 1,144 DEGs (620 upregulated and 524 downregulated) and 1,864 DECNs (1,137 upregulated and 727 downregulated) were identified in the resistant and sensitive samples (Fig. 2A). After the DEGs and DECNs were compared, a total of 108 overlapping genes were obtained (Fig. 2B). Among the overlapping genes, 48 overlapping genes with CNVs (including 94 SNPs, 1 INS and 1 DEL) were selected and performed with the following analyses.
Figure 1.
mRNA expression density distribution and CN signal chromosome display for The Cancer Genome Atlas dataset. (A) The mRNA expression density distribution. The horizontal and vertical axes present the expression level and density, respectively. (B) The CN signal chromosome display. The horizontal and vertical axes present the detection areas on each chromosome and the ovarian cancer samples. Red and blue indicate log2 (CN signal) higher and lower than 0, respectively. CN, copy number.
Figure 2.
Scatter and Venn diagrams. (A) The scatter diagrams for the mRNA-sequencing data (left) and the single nucleotide polymorphisms and CNVs (right). Red presents the genes with upregulated expression or copy number signal values in the sensitive sample. (B) The Venn diagram presents the DEGs and DECNs and the overlapping genes. (C) The Venn diagram for comparing the prognosis-associated DEGs and DECNs. CNVs, copy number variations; DEGs, differentially expressed genes; DECNs, differentially expressed CNV genes.
Combined with the expression levels and CN signal levels of the 48 genes in the TCGA dataset, 34 prognosis-associated genes in expression level and 29 prognosis-associated genes in CN signal level were screened using univariate multivariate Cox regression analysis. A total of 20 overlapping genes were identified after comparing the two types of genes which were significantly associated with prognosis in expression level and CN signal level (Fig. 2C), including calcium voltage-gated channel subunit α1 C (CACNA1C), CD274 molecule, CECR2 histone acetyl-lysine reader, contactin 6, DEAQ-box RNA dependent ATPase 1, FERM domain-containing 1, growth differentiation factor 3 (GDF3), gap junction protein α8, neuroligin 1, NTRK3, PARN-like ribonuclease domain-containing 1, seizure-related 6 homolog-like, solute carrier family 22 member 3 (SLC22A3), solute carrier family 5 member 1, synemin (SYNM), synaptotagmin 9, T-box transcription factor T (TBXT), tektin 5 (TEKT5), V-set and transmembrane domain containing 2-like and ZFP57 zinc finger protein.The 20 overlapping genes were subjected to multivariate Cox regression analysis, and a total of six independent prognostic genes (TBXT, SYNM, TEKT5, GDF3, SLC22A3 and CACNA1C) were identified (P<0.05; Table II). The CNV data of the six independent prognostic genes are presented in Table III.
Table II.
Multivariate analysis of the six independent prognostic genes.
Gene
Coefficient
Hazard ratio (95% confidence interval)
P-value
T-box transcription factor T
1.113
3.046 (1.677–5.531)
0.0003
Synemin
−0.625
0.535 (0.367–0.779)
0.0011
Tektin 5
0.567
1.763 (1.174–2.6452)
0.0062
Growth differentiation factor 3
0.374
1.453 (1.0701–1.973)
0.0166
Solute carrier family 22 member 3
0.652
1.919 (1.036–3.555)
0.0384
Calcium voltage-gated channel subunit α1 C
0.56
1.752 (1.022–3.002)
0.0415
Table III.
CNV data of the six independent prognostic genes.
Gene
Chromosome
CNV position
Strand
Variant classification
Variant type
Reference allele
Tumor seq allele 1
Tumor seq allele 2
Validation status
T-box transcription factor T
6
166571861
−1
Missense mutation
SNP
G
G
A
Valid
Solute carrier family 22 member 3
6
160872083
1
Missense mutation
SNP
C
C
T
Unknown
Growth differentiation factor 3
12
7848219
−1
Frame shift del
DEL
C
C
–
Valid
12
7848321
−1
Missense mutation
SNP
G
G
T
Unknown
Calcium voltage-gated channel subunit α1 C
12
2224552
1
Missense mutation
SNP
C
C
A
Valid
12
2760846
1
Missense mutation
SNP
G
G
T
Valid
12
2788613
1
Missense mutation
SNP
G
G
T
Valid
12
2622145
1
Missense mutation
SNP
G
G
A
Unknown
12
2774154
1
Missense mutation
SNP
C
C
G
Valid
12
2788812
1
Missense mutation
SNP
C
C
A
Valid
12
2788717
1
Silent
SNP
G
G
A
Valid
12
2797734
1
Missense mutation
SNP
T
T
C
Valid
12
2613667
1
Intron
SNP
C
C
A
Valid
12
2716201
1
Missense mutation
SNP
A
A
T
Valid
12
2774755
1
Silent
SNP
C
C
T
Valid
Synemin
15
99672483
1
Silent
SNP
A
A
G
Unknown
15
99669768
1
Silent
SNP
G
G
A
Unknown
15
99671917
1
Missense mutation
SNP
G
G
C
Valid
15
99672905
1
Missense mutation
SNP
C
C
G
Valid
15
99653864
1
Silent
SNP
C
C
T
Valid
15
99672085
1
Missense mutation
SNP
G
G
T
Valid
Tektin 5
16
10721464
−1
Silent
SNP
G
G
A
Valid
16
10721462
−1
Missense mutation
SNP
G
G
A
Valid
SNP, single nucleotide polymorphism; DEL, deletion; CNV, copy number variation; G, guanine; C, cytosine; A, adenine; T, tyrosine; ‘−’ represents a nucleotide deletion.
Based on the multivariate Cox regression coefficients of the six independent prognostic genes, the following risk score system was developed (24,25): Risk score=(1.113) × ExpTBXT + (−0.625) × ExpSYNM + (0.567) × ExpTEKT5 + (0.374) × ExpGDF3 + (0.652) × ExpSLC22A3 + (0.560) × ExpCACNA1C, where Exp is the relative expression level of the genes. The risk scores of TCGA samples were calculated using the above formula. The median was used as the cut-off point to divide the samples in the training set into high and low risk groups. The association between the risk groups and prognosis was evaluated using the KM survival curve (Fig. 3A), and subsequently validated in the validation set (Fig. 3B). The association between the predictive results of the risk score system and platinum response status was further analyzed (Fig. 4). The platinum response status of the samples in the training set (P=5.55×10−16; Fig. 4A) and the validation set (P=1.16×10−07; Fig. 4C) were significantly associated with the actual prognosis. In the platinum sensitive samples, the predictive results of the training set (P=0.0203; Fig. 4B) and the validation set (P=0.0132; Fig. 4D) were significantly associated with survival prognosis. In the platinum resistant samples, the association between the predictive results and prognosis was not significant (Fig. 4B and D). Therefore, the risk score system could accurately predict the prognosis of the sensitive samples.
Figure 3.
KM survival curves and the AUC for the training and validation sets. (A) The KM survival curve (left) and AUC (right) for the training set. (B) The KM survival curve (left) and AUC (right) for the validation set. Red and green in KM survival curves present high and low risk groups, respectively. HR, hazard ratio; KM, Kaplan-Meier; AUC, area under the curve.
Figure 4.
KM survival curves for the training and validation sets. (A) The KM survival curve (left) for the entire training set. Red and green present sensitive and resistant samples, respectively. (B) The KM survival curves for the sensitive (left) and resistant (right) samples in the training set. Red and green present high and low risk groups, respectively. (C) The KM survival curve (left) for the entire validation set. Red and green present sensitive and resistant samples, respectively. (D) The KM survival curves for the sensitive (left) and resistant (right) samples in the validation set. Red and green present high and low risk groups, respectively. HR, hazard ratio; KM, Kaplan-Meier.
The samples in the training dataset were divided into high and low risk groups according to the median of the risk scores. The association between clinical factors and prognosis was determined. The platinum response status and predicted status based on the risk scores had a significant association with prognosis [platinum response status, P=1.05×10−12; hazard ratio (HR)=0.246; predicted status, P=0.0298; HR=1.489], which suggested that response to platinum treatment improves prognosis and patients with platinum sensitivity may have an improved prognosis compared with platinum-resistant patients (Table IV). Statistical analysis revealed that the platinum response status was significantly associated with the prognosis of low (P=5.39×10−09; Fig. 5A) and high (P=9.98×10−08; Fig. 5B) risk groups (Table V). The results obtained in the current study indicated that patients with platinum sensitivity in the low and high risk groups had improved survival prognosis compared with patients with platinum resistance.
Table IV.
Results of the Cox regression analysis for identification of the clinical factors significantly associated with prognosis.
Univariate analysis
Multivariate analysis
Variable
HR
95% CI
P-value
HR
95% CI
P-value
Age
1.008
0.993–1.024
0.2970
–
–
–
Platinum response status
0.217
0.149–0.314
5.55×10−16
0.246
0.168–0.362
1.05×10−12
Neoplasm subdivision
0.844
0.632–1.127
0.2490
–
–
–
Stage
1.052
0.727–1.524
0.7870
–
–
–
Lymphatic invasion
0.959
0.529–1.736
0.8890
–
–
–
Histological grade
1.272
0.805–2.009
0.3020
–
–
–
Recurrence
0.695
0.352–1.372
0.2910
–
–
–
Predicted status
2.036
1.449–2.861
2.95×10−05
1.489
1.039–2.132
0.0298
HR, hazard ratio; CI, confidence interval; ‘−’ indicates that these parameters were not included in the multivariate analysis.
Figure 5.
KM survival curves for the training and validation sets. (A) The KM survival curve demonstrating the association between the platinum response status of the samples in low risk group and actual prognosis. (B) The KM survival curve demonstrating the association between the platinum response status of the samples in high risk group and actual prognosis. Red and green present sensitive and resistant samples, respectively. HR, hazard ratio; KM, Kaplan-Meier.
Table V.
Results of stratified analysis of clinical factors in high and low risk groups.
Variable
Low risk (n=115) HR (95% CI)
P-value
High risk (n=115) HR (95% CI)
P-value
Age
0.996 (0.974–1.02)
0.7900
1.016 (0.996–1.037)
0.1180
Platinum response status
0.188 (0.102–0.351)
5.39×10−09
0.293 (0.182–0.472)
9.98×10−08
Neoplasm subdivision
0.692 (0.446–1.075)
0.0984
1.227 (0.832–1.811)
0.3001
Pathological Stage
0.953 (0.562–1.615)
0.8590
1.174 (0.668–2.062)
0.5770
Histological grade
1.569 (0.670–3.671)
0.2950
1.312 (0.743–2.317)
0.3480
Recurrence
1.412 (1.195–2.25)
0.7310
0.757 (0.362–1.585)
0.4590
HR, hazard ratio; CI, confidence interval; n, number of patients.
Cox regression analysis was performed for the platinum response status and risk scores, revealing that the sensitive status improved prognosis in OC (HR=0.217) and that the risk score worsened the prognosis (the higher the risk score, the worse the prognosis; Fig. 6A). Subsequently, nomogram analysis was performed based on the platinum response status and risk score. The platinum response status and risk score were used as independent prognostic factors to predict the 3-year survival probability and 5-year survival probability, and the results demonstrated that patients with the sensitive status and low risk scores had improved prognosis compared with patients with platinum resistance and high risk scores (Fig. 6B). The calibration curve for the predicted 5-year survival probability and the actual survival probability revealed that the predicted curve and the ideal curve were similar. The predicted results were therefore consistent with the actual survival status according to the clinical information in the dataset (Fig. 6C).
Figure 6.
Results of the Cox regression analysis. (A) The result of multivariate Cox regression analysis for platinum response status and risk score. (B) The nomogram predicts the 3-year survival probability and 5-year survival probability based on platinum response status and risk score. (C) The calibration diagram between the predicted 5-year survival probability and the actual survival probability. The blue line represents predicted survival probability; the gray line represents the actual survival probability. HR, hazard ratio; CI, confidence interval; ideal, the actual survival rate according to the dataset.
Differential expression and pathway enrichment analyses
The OC samples in the TCGA set were divided into high and low risk groups. A total of 574 DEGs (297 upregulated and 277 downregulated) were identified in the high and low risk groups. The volcano plot of the 574 DEGs is presented in Fig. 7A. The correlation coefficient of DEGs and risk scores of samples were calculated using Cor function in R. The top 100 DEGs (the top 50 for positive and negative association, respectively) were selected for analysis based on the association of the DEGs with the risk scores. The expression heatmap is presented in Fig. 7B. Pathway enrichment analysis revealed that DEGs were significantly enriched in eight pathways, including chemokine signaling pathway, toll-like receptor signaling pathway, cytokine-cytokine receptor interaction, RIG I-like receptor signaling pathway, natural killer cell-mediated cytotoxicity, apoptosis, T cell receptor signaling pathway and Fc ε receptor 1 signaling pathway (Table VI).
Figure 7.
Differential expression analysis and association heatmap. (A) The volcano plot of the DEGs between high and low risk groups. Red, green and grey dots represent upregulated genes, downregulated genes and non-DEGs, respectively. (B) The expression heatmap of the top 100 DEGs. FDR, false discovery rate; FC, fold change; DEGs, differentially expressed genes; IL6R, interleukin 6 receptor; IL2RB, interleukin 2 receptor subunit beta; IL2RG interleukin 2 receptor subunit gamma; CXCL11, C-X-C motif chemokine ligand 11.
Table VI.
Eight significantly enriched pathways between high and low risk groups.
In the present study, 1,144 DEGs and 1,864 DECNs in platinum resistant samples and platinum sensitive samples were screened. A total of 108 overlapping genes between the DEGs and DECNs were identified, from which 48 genes with CNVs were selected for subsequent analyses. There were 20 genes significantly associated with prognosis in the expression level and CN signal level, from which six independent prognostic genes (T, SYNM, TEKT5, GDF3, SLC22A3 and CACNA1C) were selected. The platinum response status had a significant association with prognosis and platinum sensitivity resulted in improved prognosis. Subsequently, a risk score system based on the six independent prognostic factors was constructed and evaluated. Nomogram analysis indicated that patients with sensitive status and low risk scores had improved prognosis compared with patients with a resistant status and high risk scores. A total of 574 DEGs were identified in high and low risk groups, and these genes were enriched in eight pathways.The Tektin (TEKT) family consists of several filament-forming proteins in male germ cells, including TEKT1, TEKT2, TEKT3, TEKT4 and TEKT5 (30). TEKT5 was reported to be a cancer/testis antigen, suggesting that TEKT5 may be used for the diagnosis and immunotherapy of patients with testicular cancer (31). SYNM is a potential tumor suppressor gene in breast cancer, and its promoter methylation status may be used to predict the risk of recurrence in patients with breast cancer (32). SYNM is an intermediate filament-associated protein that may function in the formation of hepatocellular carcinoma, and its alteration may result in the pleomorphism of tumor cells (33). Since TEKT5 and SYNM were revealed to be involved in the pathogenesis of different types of cancer in the aforementioned studies, they may also be implicated in the pathogenesis of OC.The plasma concentration of GDF15 is increased in OC, and increased plasma concentration is negatively associated with survival time and is an independent prognostic predictor (34,35). GDF15 upregulation contributes to the growth and invasion of rapamycin-sensitive OC cells; therefore, rapamycin inhibition may be effective for treating patients with GDF15-overexpressing OC (36). GDF3 is a member of the transforming growth factor β family and is expressed in several types of tumors (37). GDF3 blocking combined with retinoic acid may be a promising therapeutic strategy for different types of solid cancer (37). GDF3 is markedly downregulated in breast cancer tissues, and its reconstitution may be a applied for inhibiting the aggressive growth of the tumor (38). Therefore, upregulation of GDF3 may also be associated with the poor prognosis of patients with OC.SLC22A16 is expressed in ovarian clear-cell adenocarcinoma and it can promote the uptake of doxorubicin compounds directed against the cancer cells (39,40). SLC22A3 affects the absorption of numerous basic drugs and endogenous amines in multiple tissues (41). SLC22A3 is downregulated in aggressive prostate cancer and is a key risk factor for the disease (41). SLC22A3 is essential for the cytotoxicity induced by oxaliplatin as it facilitates its uptake into tumors (42). SLC22A3 expression may therefore serve as a marker for the efficacy of chemotherapy in different types of cancer (42). The results obtained in the aforementioned studies suggest that SLC22A3 may affect the efficacy of chemotherapy in patients with OC.A limitation of the current study was a lack of experimental validation (43,44). Although bioinformatics reveals genes and pathways involved in platinum-based chemotherapy for ovarian cancer, future experiments are required to verify these results.In conclusion, a six-mRNA risk score system was constructed to predict the prognosis of patients with OC. The risk score system-based nomogram may be applied for identifying patients with OC who will benefit from platinum chemotherapy.
Authors: Christine S Walsh; Seishi Ogawa; Hisae Karahashi; Daniel R Scoles; James C Pavelka; Hang Tran; Carl W Miller; Norihiko Kawamata; Charles Ginther; Judy Dering; Masashi Sanada; Yasuhito Nannya; Dennis J Slamon; H Phillip Koeffler; Beth Y Karlan Journal: J Clin Oncol Date: 2008-06-20 Impact factor: 44.544