Literature DB >> 27189279

Estimation of Prognostic Marker Genes by Public Microarray Data in Patients with Ovarian Serous Cystadenocarcinoma.

San Duk Yang1, Se Song Jang1, Jeong A Han2, Hyun Seok Park3, Jong Il Kim1,4,5.   

Abstract

PURPOSE: Lymphatic invasion (LI) is regarded as a predictor of the aggressiveness of ovarian cancer (OC). However, LI is not always the major determinant of long-term patient survival. To establish proper diagnosis and treatment for OC, we analyzed differentially expressed genes (DEGs) for patients with serous epithelial OC, with or without LI, who did or did not survive for 5 years.
MATERIALS AND METHODS: Gene expression data from 63 patients with OC and LI, and 35 patients with OC but without LI, were investigated using an Affymetrix Human Genome U133 Array and analyzed using The Cancer Genome Atlas (TCGA) database. Among these 98 patients, 16 survived for 5 years or more. DEGs were identified using the Bioconductor R package, and their functions were analyzed using the DAVID web tool.
RESULTS: We found 55 significant DEGs (p<0.01) from the patients with LI and 20 highly significant DEGs (p<0.001) from those without it. Pathway analysis showed that DEGs associated with carbohydrate metabolism or with renal cell carcinoma pathways were enriched in the patients with and without LI, respectively. Using the top five prognostic marker genes, we generated survival scores that could be used to predict the 5-year survival of patients with OC without LI.
CONCLUSION: The DEGs identified in this study could be used to elucidate the mechanism of tumor progression and to guide the prognosis and treatment of patients with serous OC but without LI.

Entities:  

Keywords:  Gene expression; microarray analysis; ovarian cancer; prognosis

Mesh:

Year:  2016        PMID: 27189279      PMCID: PMC4951462          DOI: 10.3349/ymj.2016.57.4.872

Source DB:  PubMed          Journal:  Yonsei Med J        ISSN: 0513-5796            Impact factor:   2.759


INTRODUCTION

According to data released by the Centers for Disease Control and Prevention, ovarian cancer (OC) was ranked the fifth cause of cancer-associated death in women in the USA,1 and 14270 cases of deaths from OC were reported in that country in 2014.2 Many prognostic methods have been evaluated in attempts to identify the most reliable diagnostic strategies for treating women with OC.3,4 Among these, tumor stage and the degree of lymphatic invasion (LI) have been considered to be determinants for overall survival.4,5,6 However, there is a need for better prognostic indicators, because the tumor stage merely indicates how the cancer cells have spread in the body but does not predict their response to chemotherapy. The improvement of methods for analyzing a large set of gene expression data has provided useful insights for cancer treatment.7,8 Although there have been several relevant studies for OC,9 there have been only a limited number of gene expression studies to suggest a clinical subclassification associated with survival or treatment strategies. A recent study using The Cancer Genome Atlas (TCGA) described prognostic gene expression profiles in patients with OC.10 Among 193 suggested prognostic marker genes, only four common genes were validated in three different sets (p<0.05), and none of them was validated with more strict statistical criteria (p<0.01).10 This reflects the difficulty of finding representative marker genes for overall survival among patients with OC. Subgroup analysis may show either consistent or largely different results among different categories of patients.11 We hypothesized that prognostic indicators might vary between patients with and without LI, because different sets of co-regulated genes were induced with the activation of different pathways related to invasion.12 In this study, we divided patients with OC into subgroups with or without LI and analyzed those significantly differentially expressed genes (DEGs) that were linked with 5-year survival.

MATERIALS AND METHODS

Patients

We analyzed data from 98 patients with serous OC using the TCGA datasets. Gene expression profiles were produced on an Affymetrix Human Genome U133 Array (Affymetrix, Inc., Santa Clara, CA, USA). Between 1995 and 2010, 489 clinically annotated patients with high-grade serous OC at tumor stages II–IV underwent surgery before systemic treatment. All patients received a platinum-based agent. Among 489 patients, the clinical annotation of 309 patients did not contain information on LI, and no information on 5-year survival was available for 82 patients. For patients with LI phenotypes described as 'lymphatic invasion' and 'lymphovascular invasion indicator' in the TCGA database, only 98 had LI, and for those we can trace possible 5-year survival, only 63 had LI and 35 did not. All patients were classified as having grade II or IV tumors. The patients were followed until 25 August 2010.

Gene expression analysis

We performed microarray gene expression analysis for these patients using the TCGA database with Bioconductor R.13 Background adjustment was carried out employing the Robust Multi-Array Analysis algorithm from the 'affy' package for preprocessing, and the 'limma' package for differential expression analysis.

Statistical analysis

We identified significant DEGs between patients with and without 5-year survival and those with or without LI. The p-values were adjusted using the Benjamini-Hochberg method. Two-sided Student's t tests were applied, and statistical criteria of p<0.01 or p<0.001 were used as thresholds to determine the significance of any differences in gene expression.

Hierarchical clustering and heat maps

Hierarchical clustering (HC) analysis was performed based on Pearson correlation coefficients in the expression pattern of genes. For gene clustering, we used 'pvclust' in the R package to assess any uncertainty of HC analysis.14 For each HC result, p-values were calculated using multiple bootstrap resampling.

Function analysis

We performed Gene Ontology annotations and Pathway Mapping of DEGs using the DAVID program.15 We used the Pathway category provided by DAVID for analysis of pathways that were overrepresented by three genes (EPAS1, WNT6, and PIK3R1) as 5-year survival-related DEGs for patients with OC without LI.

Regression analysis

Statistical analyses were performed using the R package (above). Multiple linear regression analyses were considered in modeling predictions of the 5-year survival score by the expression of five marker genes among patients with serous OC without LI. The five marker genes were selected by a combination of pathway-associated DEGs and rank correlation analysis. Pearson's correlation coefficients were also calculated to assess the associations between five marker genes (WNT6, EPAS1, SWAP70, PIK3R1, and IGFBP2) and 5-year survival outcomes. Receiver operating characteristics (ROC) curves were analyzed using IBM SPSS Statistics (version 23, IBM Corp., Armonk, NY, USA).

RESULTS

In total, 180 patients (116 with and 64 without LI) were enrolled in the study using the TCGA database. Among the enrolled patients, 82 (53 with and 29 without LI) were excluded from the analysis because their follow-up period was less than 5 years. Finally, 98 patients (63 with and 35 without LI) were analyzed in this study (Table 1).
Table 1

TCGA Data for Patients with Ovarian Cancer, with or without Lymphatic Invasion, in Terms of 5-Year Survival Outcomes

5 yr death5 yr survivalTotal
With lymphatic invasion55 patients8 patients63
Without lymphatic invasion27 patients8 patients35
Total82 patients16 patients98

TCGA, the Cancer Genome Atlas.

We divided these 98 patients into two groups: those who survived for 5 years and those who did not. We identified 133 DEGs (Supplementary Table 1 and 2, only online) between the survival and nonsurvival groups. As shown in the heat map in Fig. 1A, clustering of these DEGs could not clearly distinguish the survival from the nonsurvival groups.
Fig. 1

Heat map of gene expression profiles from patients with OC. The rows represent genes, and columns represent individual patients. Red indicates a high, and green indicates a low expression level. Differentially expressed genes between 5-year survival and nonsurvival groups were selected from all patients with OC (p<0.01) (A), patients with lymphatic invasion (p<0.01) (B), and patients without lymphatic invasion (p<0.001) (C). OC, ovarian cancer.

We tried to identify DEGs in subgroups of respective OCs with and without LI. As shown in the heat map (Fig. 1B), 55 DEGs (Supplementary Table 1 and 3, only online) identified from the patients with LI could fairly well divide the survival from the nonsurvival groups. However, measurements of Manhattan distance and Pvclust dendrograms indicated that the patients with LI could not be clearly divided into survival and nonsurvival groups by the expression pattern of these 55 DEGs (data not shown). We conducted the same analysis for 35 patients with OC without LI, and 20 DEGs were identified (Table 2, Supplementary Table 1, only online). The heat map (Fig. 1C) showed an obvious difference in the expression patterns between the survival and nonsurvival groups. Results from a Manhattan distance plot (Fig. 2A) and HC dendrogram (Fig. 2B) confirmed that patients without LI could be clearly classified into 5-year survival and nonsurvival groups by these DEGs.
Table 2

DEG List for Patients with Ovarian Cancer, without Lymphatic Invasion

Gene symbolGene titleLog FCp value
INTS3Integrator complex subunit 30.7867751.67E-05
EMID1EMI domain containing 11.3730292.56E-05
SMUG1Single-strand-selective monofunctional uracil-DNA glycosylase 10.8287313.43E-05
PSMD4Proteasome (prosome, macropain) 26S subunit, non-ATPase, 40.5906975.11E-05
ZNF643Zinc finger protein 6430.892569.06E-05
WNT6Wingless-type MMTV integration site family, member 60.9287010.000195
IGFBP2Insulin-like growth factor binding protein 2, 36 kDa1.6538660.00022
PEX14Peroxisomal biogenesis factor 140.4729710.000223
LOC728855Hypothetical LOC7288550.8277330.00025
ZBTB48Zinc finger and BTB domain containing 480.7920630.000371
KCNMB2Potassium large conductance calcium-activated channel, subfamily M, beta member 21.2870170.000421
FAM86AFamily with sequence similarity 86, member A0.6252780.000488
PIWIL1Piwi-like 1 (Drosophila)0.4930270.000521
EPAS1Endothelial PAS domain protein 1-0.909940.000545
SWAP70SWAP switching B-cell complex 70 kDa subunit-0.686060.000657
MPPE1Metallophosphoesterase 10.7313740.000681
FLAD1FAD1 flavin adenine dinucleotide synthetase homolog (S. cerevisiae)0.6386190.000703
LIMA1LIM domain and actin binding 1-1.004530.000811
PIK3R1Phosphoinositide-3-kinase, regulatory subunit 1 (alpha)-0.957530.000894
SCNM1Sodium channel modifier 10.7162350.00093

DEG, differentially expressed genes.

Fig. 2

Clustering of 5-year survival and nonsurvival groups of patients with OC without lymphatic invasion. (A) Manhattan distance plot of gene expression profiles in 20 survival-related genes and their association with patients without lymphatic invasion. (B) Cluster dendrogram of gene expression profiles and their association with patients without lymphatic invasion. OC, ovarian cancer.

Pathway analysis showed that among the 20 DEGs, EPAS1, WNT6, and PIK3R1 were associated with cancer and renal cell carcinoma pathways (Table 3). To gain further insight, we selected five genes (WNT6, EPAS1, SWAP70, PIK3R1, and IGFBP2) among 20 DEGs whose expression showed strong correlation with the 5-year survival pattern. After normalization to the expression of five housekeeping genes (ACTB, GAPDH, RPLP0, GUSB, and TFRC) found in both tumor and normal tissues in equivalent amounts, we formulated one equation to predict the 5-year survival scores for five marker genes using multiple linear regression analysis as follows.
Table 3

Pathway Analysis of DEGs in Patients with Ovarian Cancer, without Lymphatic Invasion

CategoryTermCount%GenesList totalPop hitsPop totalFold enrichment
KEGG_PATHWAYhsa05200: pathways in cancer315EPAS1, WNT6, PIK3R1832850855.813643
KEGG_PATHWAYhsa05211: renal cell carcinoma210EPAS1, PIK3R1370508518.16071

DEGs, differentially expressed genes.

Y=0.946094×(χ1)-1.42646×(χ2)-0.73215×(χ3)-1.14288×(χ4)+1.378633×(χ5)-2.19895 Using H as the mean expression value of the five housekeeping genes, the following factors were derived. χ1=logχ2=logχ3=logχ4=logχ5=log As shown in Fig. 3, all six patients who survived for 5 years showed positive score values, while all 27 nonsurviving patients showed negative values, based on this equation. When comparing the ROC curve estimates, we found high precision for our formula in estimating patient survival (Supplementary Fig. 1, only online)
Fig. 3

Survival score values calculated by multiple linear regression analysis with the five prognostic marker genes. The Y-axis indicates the score values; the 5-year survival group showed positive values, while the nonsurvival group showed negative values.

DISCUSSION

LI has been considered an important predictive component for many cancers. However, our TCGA-based analysis showed that LI alone could not efficiently predict the 5-year survival of these patients with OC. If we could find prognostic markers for the OCs, we could then select patients who would gain an advantage from adjuvant therapy. Therefore, we used expression profiles to detect prognostic markers that could predict the 5-year survival rates, and found that information about the presence of LI alone was not sufficient, but it still affected the efficiency of prediction using gene expression profiling. The clustering of survival and nonsurvival groups from all patients with OC using DEGs was not as clear as the subgroups with or without LI, suggesting that the cancers from each subgroup might have had a different pathophysiology that influenced the survival of the patients. Most previous research on OC has focused on LI, however, none of these studies have attempted to separately analyze the factors influencing survival between patients with or without LI. To our best knowledge, the present study is the first to use OC samples to analyze survival by subgroup with or without LI. The subgroup of patients without LI could be divided into 5-year survival and nonsurvival groups using the 20 DEGs more clearly and significantly than among all patients with OC or subgroups with LI. The survival score values using five prognostic marker genes (WNT6, EPAS1, SWAP70, PIK3R1, and IGFBP2) among 20 DEGs could efficiently predict the 5-year survival of patients with OC but no LI. Patients with low expression levels of three genes (PIK3R1, EPAS1, and SWAP70) and high expression of two genes (WNT6 and IGFBP2) showed longer survival. PIK3R1 encodes the 85 kDa regulatory subunit of phosphatidylinositol 3-kinase, which is an important player in cancer development and progression. The PI3K/AKT/mTOR signaling pathway is frequently activated and has been considered as a possible therapeutic target for OC.16,17 A recent meta-analysis showed that high PI3K levels were associated with poor survival in patients with epithelial OC.18 Lower expression of PIK3R1 might lead to downregulation of the PI3K/AKT/mTOR pathway, thereby contributing to the increased survival of patients with OC. EPAS1, also known as HIF2A, encodes a transcription factor and is induced by hypoxic stress. There have been many publications about the link between hypoxia and cancers. Thus, the level of expression of EPAS1 and genes of its downstream signaling pathways affect the aggressiveness of OC;19 higher expressions of those genes were significantly correlated with poor prognosis, similar to our findings. Better adaption to the hypoxic microenvironment common in solid tumors might explain those findings. SWAP70 is expressed in various cell types, including activated B-lymphocytes and mast cells.20 Although its role in cancer has not been reported frequently, one recent study showed increased expression of SWAP70 in malignant gliomas and a strong correlation between high expression and poor patient survival.21 Our data also showed that its expression was upregulated in poorly surviving patients with OC but no LI, suggesting a similar mechanism. WNT6 is a member of the WNT family of genes, which encode highly conserved glycoproteins secreted during embryonic development. The WNT signaling pathway is altered in many types of cancers and is regarded as a good candidate for cancer therapy.21 WNT family members have been generally regarded as poor prognostic factors for patients with cancer because of their effects on cell proliferation, migration, and survival.22 However, there is also a report that elevated levels of the Wnt-5a protein was associated with better outcomes for patients with prostate cancer, similar to our findings.23 IGFBP2 encodes one of the members of the insulin-like growth factor binding (IGFBP) proteins, and its oncogene-like action by activation of Akt signaling has been reported.24 There have been contradictory reports about the prognostic value of circulating and intratumorous IGFBP2 levels.25 A recent report showed that expression of IGFBP2 was associated with better survival in a specific group (body mass index ≤25 kg/m2) of patients with breast cancer.26 The above and our present findings suggest that the prognostic efficacy of gene expression patterns can be useful in specific subclasses of patients, however, further validation studies are needed. Our results suggest that the expression level of these five marker genes might be useful for deciding a prognosis for patients with OC and in making decisions on adjuvant hormonal therapy. There were not enough data on patients that contained both LI and 5-year survival rates within the open serous OC dataset. Nonetheless, validation of the equation using an independent cohort can be done in the future. We believe that this topic is worthy of future studies, and that our results would be helpful for researchers as well as clinicians in the OC field.
  25 in total

Review 1.  Bioconductor: an open source framework for bioinformatics and computational biology.

Authors:  Mark Reimers; Vincent J Carey
Journal:  Methods Enzymol       Date:  2006       Impact factor: 1.600

2.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

Review 3.  Can we safely target the WNT pathway?

Authors:  Michael Kahn
Journal:  Nat Rev Drug Discov       Date:  2014-07       Impact factor: 84.694

Review 4.  Ovarian cancer: epidemiology, biology, and prognostic factors.

Authors:  C H Holschneider; J S Berek
Journal:  Semin Surg Oncol       Date:  2000 Jul-Aug

5.  Wnt5a expression is associated with the tumor proliferation and the stromal vascular endothelial growth factor--an expression in non-small-cell lung cancer.

Authors:  Cheng-Long Huang; Dage Liu; Jun Nakano; Shinya Ishikawa; Keiichi Kontani; Hiroyasu Yokomise; Masaki Ueno
Journal:  J Clin Oncol       Date:  2005-12-01       Impact factor: 44.544

Review 6.  IGF binding proteins in cancer: mechanistic and clinical insights.

Authors:  Robert C Baxter
Journal:  Nat Rev Cancer       Date:  2014-04-10       Impact factor: 60.716

Review 7.  Exploiting the therapeutic potential of the PI3K-AKT-mTOR pathway in enriched populations of gynecologic malignancies.

Authors:  Ramez N Eskander; Krishnansu S Tewari
Journal:  Expert Rev Clin Pharmacol       Date:  2014-10-10       Impact factor: 5.045

8.  Sox9 and Hif-2α regulate TUBB3 gene expression and affect ovarian cancer aggressiveness.

Authors:  Giuseppina Raspaglio; Marco Petrillo; Enrica Martinelli; Domenica Donatella Li Puma; Marisa Mariani; Marta De Donato; Flavia Filippetti; Simona Mozzetti; Silvia Prislei; Gian Franco Zannoni; Giovanni Scambia; Cristiano Ferlini
Journal:  Gene       Date:  2014-03-21       Impact factor: 3.688

9.  IGFBP2 and IGFBP3 protein expressions in human breast cancer: association with hormonal factors and obesity.

Authors:  Nicole M Probst-Hensch; Julia H B Steiner; Peter Schraml; Zsuzsanna Varga; Ursina Zürrer-Härdi; Martina Storz; Dimitri Korol; Mathias K Fehr; Daniel Fink; Bernhard C Pestalozzi; Urs M Lütolf; Jean-Philippe Theurillat; Holger Moch
Journal:  Clin Cancer Res       Date:  2010-01-26       Impact factor: 12.531

10.  Integrated genomic analyses of ovarian carcinoma.

Authors: 
Journal:  Nature       Date:  2011-06-29       Impact factor: 49.962

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.