| Literature DB >> 27270322 |
Haiyue Zhao1, Ensong Guo1, Ting Hu1, Qian Sun1, Jianli Wu1, Xingguang Lin1, Danfeng Luo1, Chaoyang Sun1, Changyu Wang1, Bo Zhou1, Na Li1, Meng Xia1, Hao Lu1, Li Meng1, Xiaoyan Xu1, Junbo Hu1, Ding Ma1, Gang Chen1, Tao Zhu1.
Abstract
Approximately 50-75% of patients with serous ovarian carcinoma (SOC) experience recurrence within 18 months after first-line treatment. Current clinical indicators are inadequate for predicting the risk of recurrence. In this study, we used 7 publicly available microarray datasets to identify gene signatures related to recurrence in optimally debulked SOC patients, and validated their expressions in an independent clinic cohort of 127 patients using immunohistochemistry (IHC). We identified a two-gene signature including KCNN4 and S100A14 which was related to recurrence in optimally debulked SOC patients. Their mRNA expression levels were positively correlated and regulated by DNA copy number alterations (CNA) (KCNN4: p=1.918e-05) and DNA promotermethylation (KCNN4: p=0.0179; S100A14: p=2.787e-13). Recurrence prediction models built in the TCGA dataset based on KCNN4 and S100A14 individually and in combination showed good prediction performance in the other 6 datasets (AUC:0.5442-0.9524). The independent cohort supported the expression difference between SOC recurrences. Also, a KCNN4 and S100A14-centered protein interaction subnetwork was built from the STRING database, and the shortest regulation path between them, called the KCNN4-UBA52-KLF4-S100A14 axis, was identified. This discovery might facilitate individualized treatment of SOC.Entities:
Keywords: KCNN4; S100A14; prognosis; recurrence; serous ovarian cancer
Mesh:
Substances:
Year: 2016 PMID: 27270322 PMCID: PMC5190068 DOI: 10.18632/oncotarget.9721
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
general information of involved 7 public datasets
| Datasets | Platform | Sample number | Screened Samples | Age (year) | Recurrence status | Days to recurrence | Vital status | Days to death | ||
|---|---|---|---|---|---|---|---|---|---|---|
| no recurrence | recurrence | deceased | living | |||||||
| TCGA | hthgu133a | 481 | 305 | 57 (30 - 84) | 131 | 174 | 453 (92 - 3378) | 156 | 149 | 981 (92 - 4623) |
| TCGA RNASeq | RNASeqV2 | 242 | 156 | 56 (34 - 84) | 68 | 88 | 423 (92 - 2648) | 83 | 73 | 913 (92 - 4623) |
| GSE17260 | hgug4112a | 84 | 37 | − | 19 | 18 | 690 (120 - 2250) | 10 | 27 | 990 (270 - 2250) |
| GSE26193 | hgu133plus2 | 79 | 77 | − | 16 | 61 | 595 (121 – 7386) | 58 | 19 | 1136 (194 - 7386) |
| GSE30161 | hgu133plus2 | 45 | 17 | 55 (47 - 75) | 3 | 14 | 566 (162 - 4208) | 9 | 8 | 1846 (377 - 4208) |
| GSE49997 | ABI | 171 | 120 | 56 (27 - 85) | 51 | 69 | 533 (122 - 1461) | 30 | 90 | 776 (122 - 1491) |
| GSE9891 | hgu133plus2 | 239 | 126 | 59.5 (39 - 80) | 42 | 84 | 540 (120 - 3060) | 50 | 76 | 900 (180 - 6420) |
Samples screened according to the criteria mentioned in the method section: optimal debulking and days_to_tumor_recurrence > 90d.
summarized according to screened samples.
Figure 1A. The correlation of gene expression between KCNN4 and S100A14 in 7 public datasets. In each dataset, recurrent samples and norecurrent samples were marked in red and blue, respectively. Some statistical results are also listed. The black solid line represents the linear regression; B. the correlation of the expression profiles of KCNN4 and S100A14 with CNA status as well as methylation values. For CNA status, −2 = homozygous deletion, −1 = hemizygous deletion, 0 = neutral/no change, 1 = gain, and 2 = high level amplification. The total significance was estimated from the null distribution constructed by the asymptotic K-sample permutation test. If significant, pairwise comparisons were then performed via TukeyHSD test, and the p values were adjusted with the BH method. The p values are also labeled. For methylation status, the red lines represent the linear regression between the expression values and methylation values.
Expression difference of KCNN4 and S100A14 on SOC recurrence
| Dataset | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| OR | 95%CI | P | OR | 95%CI | P | OR | 95%CI | P | P | |
| 1.79 | 1.08 – 2.98 | 2.12 | 1.13 – 4.13 | 1.87 | 1.11 – 3.18 | |||||
| 3.18 | 1.52 – 6.79 | 2.91 | 1.33 - 6.68 | 4.13 | 1.87 - 9.51 | |||||
| 14.23 | 1.58 – 705.97 | 3.28 | 0.59 – 24.07 | 0.151 | 3.28 | 0.59 – 24.07 | 0.151 | |||
| 5.34 | 1.17 – 24.84 | 9.18 | 1.86 – 90.23 | 1.53 | 0.44 – 5.41 | 0.5704 | ||||
| Inf | 1.98 – Inf | Inf | 1.98 – Inf | Inf | 1.98 – Inf | |||||
| 4.11 | 0.82 – 40.35 | 0.0691 | 7.28 | 0.78 – 353.95 | 0.0821 | Inf | 0.02 - Inf | 1 | ||
| 1.82 | 0.76 – 4.58 | 0.1689 | 6.28 | 0.49 – 338.42 | 0.1075 | 2.06 | 0.85 – 5.35 | 0.111 | 0.0878 | |
This comparison were done between KCNN4(−)S100A14(−) and KCNN4(+)S100A14(+)/KCNN(+)S100A14(−)/KCNN(−)S100A14(+)
Comparisons among 4 groups such as KCNN4(−)S100A14(−), KCNN4(+)S100A14(+), KCNN(+)S100A14(−) and KCNN(−)S100A14(+)
Figure 2A. The KM plot of recurrence with different KCNN4 expression states. (−) means lower expression and (+) means higher expression; B. The KM plot of recurrence with different S100A14 expression states; C. The KM plot of recurrence with combined KCNN4 and S100A14 states; D. The prediction power of the recurrence prediction model built with TCGA dataset in the other 6 datasets. The model was built with linear kernel SVM via 5-repeats of 10-fold cross validation, coupled with internal parameter selection procedures. For each dataset, the AUCs and 95%CIs are also listed.
Figure 3A. The KCNN4 and S100A14 centered interaction subnetwork. The minimum network connected KCNN4 and S100A14 constructed from a high quality STRING database (combined score >= 600) when restricted to 1-NN of KCNN4 and S100A14. Four communities were detected using fast greedy searching and are colored differently. The node sizes are proportional to the degrees of each gene; B. The regulation frameworks of the KCNN4-UBA52-KLF4-S100A14 axis determined by the Bayesian network based on hill-climbing scoring. The arrows are the regulation directions; C. Pairwise correlations of expression profiles of 4 genes in the TCGA dataset. The upper triangle showed the paired expression in all TCGA samples and the red lines represent the linear regression results. The lower triangle illustrated the pairwise Pearson's correlation coefficients; D. Hypothesized regulation modes of the KCNN4-UBA52-KLF4-S100A14 axis. The frames were colored according to the colors of the communities to which they belong. The arrows mean stimulation and the blocked arrow means inhibition. To the right is the proof for the regulation hypothesis; E. Word cloud representation of enriched GO and pathway terms on genes in the subnetwork (adjusted p<0.05). Their significance are illustrated with different font size and gray scale.
Figure 4Representative patterns of KCNN4 and S100A14 IHC staining in the SOC cohort
A. KCNN4 staining in SOCs with recurrence (R); B. KCNN4 staining in SOCs with norecurrence (NR); C. S100A14 staining in SOCs with recurrence (R); D. S100A14 staining in SOCs with norecurrence (NR). Original magnification: ×200 and ×400. Scale bars, 100 μm, 50 μm.
Relationship of KCNN4 and S100A14 expression with clinicopathological characteristics in SOC cohort
| clinicopathological characteristics | No. | KCNN4 | S100A14 | ||||
|---|---|---|---|---|---|---|---|
| high(+) | χ2 | p value | high(+) | χ2 | p value | ||
| ≥60 | 36 | 25(69.4%) | 0.7655 | 0.3816 | 31(86.1%) | 0.0110 | 0.9166 |
| <60 | 91 | 70(76.9%) | 79(86.8%) | ||||
| I~II | 37 | 21(56.8%) | 9.0215 | 33(89.2%) | 0.2986 | 0.5848 | |
| III~IV | 90 | 74(82.2%) | 77(85.6%) | ||||
| Grade1 | 8 | 5(62.5%) | 0.692 | 0.7075 | 7(87.5%) | 4.6933 | 0.0957 |
| Grade2 | 24 | 18(75.0%) | 24(100%) | ||||
| Grade3 | 95 | 72(75.8%) | 79(83.2%) | ||||
| ≥35 | 32 | 25(78.1%) | 0.2505 | 0.6168 | 24(75.0%) | 4.9771 | |
| <35 | 95 | 70(73.7%) | 86(90.5%) | ||||
| Positive | 77 | 54(70.1%) | 2.2662 | 0.1322 | 67(95.7%) | 0.02683 | 0.8699 |
| Negative | 50 | 41(82.0%) | 43(86.0%) | ||||
| ≥5 | 87 | 66(75.9%) | 0.1643 | 0.6852 | 72(82.8%) | 0.9981 | 0.3178 |
| <5 | 40 | 29(72.5%) | 38(95.0%) | ||||
| Positive | 26 | 19(73.1%) | 0.0517 | 0.8202 | 19(73.1%) | 5.1675 | |
| Negative | 101 | 76(75.2%) | 91(90.1%) | ||||
| Positive | 73 | 62(84.9%) | 9.3443 | 69(94.5%) | 9.2566 | ||
| Negative | 54 | 33(61.1%) | 41(75.9%) | ||||
Univariate and multivariate Cox regression on recurrence in SOC cohort
| Clinical factors | Univariate analysis | Multivariate analysis | ||||
|---|---|---|---|---|---|---|
| OR | 95%CI | Pr(>|z|) | OR | 95%CI | Pr(>|z|) | |
| 0.9994 | 0.7791 - 1.282 | 0.996 | 1.0227 | 0.7903 – 1.3234 | 0.8646 | |
| 1.279 | 0.7905 - 2.068 | 0.316 | 1.1443 | 0.6615 – 1.9792 | 0.6298 | |
| 0.8208 | 0.5856 - 1.15 | 0.252 | 0.7624 | 0.5155 – 1.1276 | 0.1743 | |
| 1.081 | 0.8237 - 1.42 | 0.573 | 0.9872 | 0.7233 – 1.3473 | 0.9345 | |
| 0.9758 | 0.7704 - 1.236 | 0.839 | 1.0479 | 0.8181 – 1.3423 | 0.7109 | |
| 0.9477 | 0.745 - 1.206 | 0.662 | 0.8903 | 0.6716 – 1.1802 | 0.4192 | |
| 1.188 | 0.9086 - 1.553 | 0.208 | 1.1910 | 0.8922 – 1.5900 | 0.2356 | |
| 0.4711 | 0.2461 - 0.9018 | 0.4807 | 0.2402 – 0.9623 | |||
| 0.3551 | 0.1292 - 0.9755 | 0.2955 | 0.1020 – 0.8566 | |||
Figure 5The KM plot of SOC recurrence with KCNN4 and S100A14 expression status in our cohort
The p values from log-rank tests are also presented.