| Literature DB >> 29321541 |
Beihe Wang1,2, Fangning Wan1,2, Haoyue Sheng1,2, Yiping Zhu1,2, Guohai Shi1,2, Hailiang Zhang1,2, Bo Dai1,2, Yijun Shen3,4, Yao Zhu5,6, Dingwei Ye7,8.
Abstract
We found two deviant groups that were unpredictable with clinical models predicting bladder cancer metastasis. The group G consists of patients at high risk of pN+ , but they have pN0. The group P consists of patients at low risk of pN+ , but they have pN+ . We aimed to determine the genetic differences between these two groups. 1603 patients from SEER database were enrolled to build a multivariate model. This model was applied to patients from the TCGA database to distinguish groups G and P. Differentially expressed genes between the two groups were identified. RT-qPCR was used to validate the results in a cohort from FUSCC. Two deviant groups were identified both in the SEER population and the TCGA population. Expression of 183 genes was significantly different between the two groups. 18 genes achieved significant statistical power in predicting lymph node metastasis excluding these two deviant groups. The 18-gene signature outperformed 3 other bladder cancer lymph node prediction tools in 2 external GEO datasets. RT-qPCR results of our own cohort identified NECTIN2 (P = 0.036) as the only gene that could predict metastasis. Our study showed a novel gene screening method and proposed an 18-gene signature highly predictive of bladder cancer metastasis.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29321541 PMCID: PMC5762631 DOI: 10.1038/s41598-017-18773-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flowchart showing the experimental design. 1603 patients from SEER database were used to build the prediction model and this model was applied in 248 patients from TCGA database to identify group G and P, as well as the predictable population. 18 of the 183 genes that differentially expressed genes between group G and P achieved statistical power for predicting metastases in the predictable population. Predictive ability of the 18-gene signature was further compared with 3 published signatures in 2 GEO datasets. The 18 genes were also validated in 130 patients from the FUSCC cohort, and NECTIN2 was identified.
Demographic characteristics of patients in SEER, TCGA and FUSCC cohorts.
| Characteristic | SEER | TCGA | FUSCC | |||
|---|---|---|---|---|---|---|
| NO | % | NO | % | NO | % | |
| Age, year | ||||||
| Median(IQR) | 69(61–76) | 70(61–77) | 63(57–70) | |||
| Gender | ||||||
| Male | 1185 | 73.9 | 177 | 71.4 | 108 | 11.1 |
| Female | 418 | 26.1 | 71 | 28.6 | 19 | 88.9 |
| Grade | ||||||
| Low grade | NA | NA | 0 | 0 | 6 | 4.7 |
| High grade | NA | NA | 248 | 100 | 121 | 95.3 |
| 1 | 2 | 0.1 | NA | NA | NA | NA |
| 2 | 31 | 1.9 | NA | NA | NA | NA |
| 3 | 563 | 35.1 | NA | NA | NA | NA |
| 4 | 1007 | 62.8 | NA | NA | NA | NA |
| Pathologic T stage | ||||||
| 2 | 606 | 37.8 | 66 | 26.6 | 61 | 48.0 |
| 3 | 732 | 45.7 | 139 | 56.0 | 41 | 32.3 |
| 4 | 265 | 16.5 | 43 | 17.3 | 25 | 18.7 |
| Tumor size, mm | ||||||
| Median(IQR) | 40(25–52) | NA | NA | |||
| Regional nodes examined count | ||||||
| <15 | 1054 | 65.8 | 94 | 37.9 | 96 | 75.6 |
| ≥15 | 549 | 34.2 | 154 | 62.1 | 31 | 24.4 |
| Lymph nodes status | ||||||
| Negative | 1170 | 69.1 | 152 | 61.3 | 85 | 66.9 |
| Positive | 496 | 30.9 | 96 | 38.7 | 42 | 33.1 |
| AJCC stage | ||||||
| II | 456 | 28.5 | 54 | 21.8 | 55 | 43.3 |
| III | 603 | 37.6 | 96 | 38.7 | 40 | 31.5 |
| IV | 544 | 33.9 | 98 | 39.5 | 32 | 25.2 |
NA: not available, AJCC: The American Joint Committee on Cancer.
Univariate and multivariate analysis with pN1 as the endpoint.
| Characteristic | Univariate analysis | Multivariate analysis | ||||
|---|---|---|---|---|---|---|
| HR | 95%CI | P | HR | 95%CI | P | |
| Gender | ||||||
| Male | ref. | |||||
| Female | 1.105 | 0.870–1.404 | 0.412 | |||
| Age | 0.991 | 0.981–1.001 | 0.09 | |||
| Races | ||||||
| White | ref. | |||||
| Black | 1.176 | 0.755–1.832 | 0.474 | |||
| Other | 0.687 | 0.429–1.101 | 0.119 | |||
| Year of diagnosis | ||||||
| 2000–2005 | ref. | |||||
| 2006–2012 | 0.907 | 0.730–1.128 | 0.381 | |||
| Grade | ||||||
| 1 | ref. | |||||
| 2 | 0.292 | 0.016–5.284 | 0.404 | |||
| 3 | 0.478 | 0.030–7.680 | 0.602 | |||
| 4 | 0.437 | 0.027–7.002 | 0.558 | |||
| T stage | ||||||
| 2 | ref. | ref. | ||||
| 3 | 2.844 | 2.174–3.720 | <0.001 | 2.918 | 2.227–3.823 | <0.001 |
| 4 | 7.653 | 5.507–10.636 | <0.001 | 7.721 | 5.547–10.748 | <0.001 |
| Tumor size | 1.01 | 1.006–1.015 | <0.001 | |||
| Regional lymph nodes examined count | 1.019 | 1.008–1.030 | 0.001 | 1.021 | 1.009–1.033 | <0.001 |
Figure 2Cumulative curves showing the distribution of the two deviant groups (group G: red area in the high-risk interval; group P: blue area in the low-risk interval) in (a) the SEER database and (b) the TCGA database.
Figure 3Association of gene expression with outcome in the TCGA. Heatmap showing expression of the 183 genes that were significantly (P < 0.05) different between group G (n = 37) and group P (n = 17). Genes are arranged by linkage distance, using unsupervised hierarchical clustering of average expression across genes as illustrated by dendrograms. Groups G and P are represented by black and red squares within the sidebar, respectively.
18 genes that could predict lymph node metastasis in a common population.
| Gene symbol | Description | P value | OR | 95%CI |
|---|---|---|---|---|
| KDELR1 | KDEL endoplasmic reticulum protein retention receptor 1 | 0.006 | 2.579 | 1.318–5.048 |
| NECTIN2 | nectin cell adhesion molecule 2 | 0.011 | 1.731 | 1.132–2.647 |
| DHCR7 | 7-dehydrocholesterol reductase | 0.020 | 1.533 | 1.070–2.917 |
| C1orf198 | chromosome 1 open reading frame 198 | 0.019 | 1.930 | 1.114–3.346 |
| GATA1 | GATA binding protein 1 | 0.018 | 1.540 | 1.079–2.205 |
| TOMM40 | translocase of outer mitochondrial membrane 40 | 0.038 | 0.597 | 0.368–0.971 |
| TACC3 | transforming acidic coiled-coil containing protein 3 | 0.009 | 0.623 | 0.436–0.890 |
| EIF3C | eukaryotic translation initiation factor 3 subunit C | 0.039 | 0.808 | 0.660–0.989 |
| CENPE | centromere protein E | 0.029 | 0.716 | 0.531–0.966 |
| TRMT1 | tRNA methyltransferase 1 | 0.021 | 0.572 | 0.356–0.919 |
| LOC100128076 | protein tyrosine phosphatase pseudogene | 0.050 | 0.710 | 0.504–0.999 |
| RCC2 | regulator of chromosome condensation 2 | 0.043 | 0.526 | 0.282–0.981 |
| MYBL2 | MYB proto-oncogene like 2 | 0.040 | 0.777 | 0.061–0.989 |
| KIFC1 | kinesin family member C1 | 0.049 | 0.725 | 0.527–0.998 |
| COPG | coatomer protein complex subunit gamma | 0.014 | 2.322 | 1.184–4.555 |
| FAM135B | family with sequence similarity 135 member B | 0.047 | 1.300 | 1.003–1.685 |
| TICRR | TOPBP1 interacting checkpoint and replication regulator | 0.037 | 0.725 | 0.535–0.981 |
| TROAP | trophinin associated protein | 0.014 | 0.681 | 0.502–0.924 |