| Literature DB >> 28979134 |
Jianli Zhao1,2, Ying Wang1,2, Zengding Lao3, Siting Liang3, Jingyi Hou4, Yunfang Yu1,2, Herui Yao1,2, Na You3, Kai Chen1,2.
Abstract
Breast cancer, the most common cancer among women, is a clinically and biologically heterogeneous disease. Numerous prognostic tools have been proposed, including gene signatures. Unlike proliferation-related prognostic gene signatures, many immune-related gene signatures have emerged as principal biology-driven predictors of breast cancer. Diverse statistical methods and data sets were used for building these immune-related prognostic models, making it difficult to compare or use them in clinically meaningful ways. This study evaluated successfully published immune-related prognostic gene signatures through systematic validations of publicly available data sets. Eight prognostic models that were built upon immune-related gene signatures were evaluated. The performances of these models were compared and ranked in ten publicly available data sets, comprising a total of 2,449 breast cancer cases. Predictive accuracies were measured as concordance indices (C-indices). All tests of statistical significance were two-sided. Immune-related gene models performed better in estrogen receptor-negative (ER-) and lymph node-positive (LN+) breast cancer subtypes. The three top-ranked ER- breast cancer models achieved overall C-indices of 0.62-0.63. Two models predicted better than chance for ER+ breast cancer, with C-indices of 0.53 and 0.59, respectively. For LN+ breast cancer, four models showed predictive advantage, with C-indices between 0.56 and 0.61. Predicted prognostic values were positively correlated with ER status when evaluated using univariate analyses in most of the models under investigation. Multivariate analyses indicated that prognostic values of the three models were independent of known clinical prognostic factors. Collectively, these analyses provided a comprehensive evaluation of immune-related prognostic gene signatures. By synthesizing C-indices in multiple independent data sets, immune-related gene signatures were ranked for ER+, ER-, LN+, and LN- breast cancer subtypes. Taken together, these data showed that immune-related gene signatures have good prognostic values in breast cancer, especially for ER- and LN+ tumors.Entities:
Keywords: breast cancer; immune-related gene; prognostic models
Year: 2017 PMID: 28979134 PMCID: PMC5602680 DOI: 10.2147/OTT.S144015
Source DB: PubMed Journal: Onco Targets Ther ISSN: 1178-6930 Impact factor: 4.147
Figure 1Flow chart for model selection process.
Features of eight published models that prognosticate breast cancer outcomes
| First author; publication year | Clinical subtype | Training data (n) | Validation data (n) | Gene signature |
|---|---|---|---|---|
| Teschendorff et al | ER− | 713 | 343 | 7-gene panel: |
| Teschendorff et al | All | 1,223 | 830 | |
| Oh et al | LN− | 684 | 616 | |
| Bianchini et al | ER+, HER2−, HER2+ | 684 | 233 | 15-gene panel: |
| Nagalla et al | All | 977 | 977 | B/P metagenes, M/D metagenes, T/NK metagenes |
| Yau et al | ER− | 199 | 75 | 14-gene panel: |
| Ursini-Siegel et al | HER2+, basal | 179 | 2,481 | 43-gene SRIS |
| Cheng et al | ER− | 1,981 | 184 | Lymphocyte-specific immune recruitment panel: |
Notes:
43 gene ShcA-regulated immune signature (SRIS). See supplementary material for additional information.
Abbreviations: ER, estrogen receptor; HER2, human epidermal growth factor 2; LN, lymph node.
Features of ten publicly available microarray data sets used to validate published risk scores
| Data sets | Microarray platform | Sample size (n) | Information about ER | Information about LN |
|---|---|---|---|---|
| ETABM158 | U133A | 130 | Yes | Yes |
| GSE11121 | U133A | 200 | Yes | Yes |
| GSE2034 | U133A | 286 | Yes | Yes |
| GSE25066 | U133A | 508 | Yes | Yes |
| GSE2603 | U133A | 121 | Yes | Yes |
| GSE3494-GPL96 | U133A | 251 | Yes | Yes |
| GSE45255 | U133A | 139 | Yes | Yes |
| GSE4922-GPL96 | U133A | 289 | Yes | Yes |
| GSE6532-GPL96 | U133A | 327 | Yes | Yes |
| GSE7390 | U133A | 198 | Yes | Yes |
| Total | 2,449 | 10 | 10 |
Abbreviations: ER, estrogen receptor; LN, lymph node.
Heterogeneity analyses for eight publicly available models
| First author; publication year | All data | ER− | ER+ | LN+ | LN− |
|---|---|---|---|---|---|
| Nagalla et al | 0.13 | 0.01 | 0.08 | 0.63 | 0.25 |
| Teschendorff et al | 0.01 | 0.05 | 0.07 | 0.91 | 0.14 |
| Oh et al | 0.12 | 0.02 | 0.07 | 0.68 | 0.01 |
| Ursini-Siegel et al | 0.18 | 0.13 | 0.30 | 0.47 | 0.01 |
| Teschendorff et al | 0.01 | 0.38 | 0.03 | 0.47 | 0.08 |
| Bianchini et al | 0.07 | 0.01 | 0.04 | 0.02 | 0.01 |
| Cheng et al | 0.01 | 0.08 | 0.33 | 0.67 | 0.97 |
| Yau et al | 0.15 | 0.29 | 0.31 | 0.99 | 0.16 |
Note: P-values of heterogeneity analyses in validation data sets are presented when analyzing all the data, ER−, ER+, LN+, and LN− cohorts, respectively.
Abbreviations: ER, estrogen receptor; LN, lymph node.
Figure 2Performance assessment of published immune-related risk scores.
Notes: (A) C-indices are given for predictions of overall survival for each of eight models in each of ten microarray data sets (left panel). Data sets used as training data sets during model development are shown in black. Darker shades of orange corresponded to higher prediction levels; 0.5= random risk score, 1.0= perfect prediction. Models are ordered from the highest to the lowest summary C-index (top to bottom). Summary C-indices are given for each model with training data sets excluded (orange boxes; right panel); 95% CIs (gray lines) were obtained by resampling cases. C-indices are presented for predictions of overall survival in ER−, ER+, LN+, and LN− data sets (left panels of B–E, respectively). Summary C-indices in ER−, ER+, LN+, and LN− data sets are given for each model with training data sets excluded (orange boxes; right panels of B–E, respectively).
Abbreviations: C-indices, concordance indices; ER, estrogen receptor; LN, lymph node.