| Literature DB >> 32241272 |
Chuanpeng Dong1,2, Jiannan Liu2, Steven X Chen1, Tianhan Dong3, Guanglong Jiang1,2, Yue Wang1, Huanmei Wu2, Jill L Reiter4, Yunlong Liu5,6.
Abstract
BACKGROUND: While several multigene signatures are available for predicting breast cancer prognosis, particularly in early stage disease, effective molecular indicators are needed, especially for triple-negative carcinomas, to improve treatments and predict diagnostic outcomes. The objective of this study was to identify transcriptional regulatory networks to better understand mechanisms giving rise to breast cancer development and to incorporate this information into a model for predicting clinical outcomes.Entities:
Keywords: Breast cancer; Prognostic model; Transcription regulators
Mesh:
Substances:
Year: 2020 PMID: 32241272 PMCID: PMC7118819 DOI: 10.1186/s12920-020-0688-z
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Workflow of the study. a. Estimation of transcription regulator activities workflow. Co-expression analysis of each regulator with all genes in TCGA breast cancer cohort was performed using GRNBoost2 and regulator binding site information was retrieved from ENCODE (step 1). The intersection of the top 5% of co-expressed genes with regulator target genes was identified (step 2) and used to calculate the regulator activity with a rank-based approach (step 3). b. Overall workflow of the study. Abbreviations: TCGA, The Cancer Genome Atlas; GEO, Gene Expression Omnibus
Fig. 2Correlation of transcription regulator activities with breast cancer subtypes. The distribution of transcription regulator activity and breast cancer subtype with hormone receptor status and PAM50 molecular classification. The transcription regulator activity profile was generated by unsupervised cluster analysis, in which the rows represented the regulator activities and the columns represented the samples. The transcription regulator activity score varies between − 10 and + 10 as indicated by a gradient from green to red color
Transcriptional regulator activity associated with breast cancer overall survival
| Regulator | log-rank p | HR | 95% CI | |
|---|---|---|---|---|
| BATF | 3.00E-06 | 0.45 | (0.32–0.64) | 5.26E-06 |
| ESR1 | 0.002 | 0.60 | (0.43–0.83) | 0.0023 |
| IRF1 | 0.0034 | 0.62 | (0.44–0.85) | 0.0037 |
| THAP1 | 0.0099 | 1.53 | (1.10–2.12) | 0.0105 |
| KDM1A | 0.0158 | 0.67 | (0.48–0.93) | 0.0165 |
| TBP | 0.0167 | 1.49 | (1.07–2.06) | 0.0173 |
| JUNB | 0.0199 | 0.68 | (0.49–0.94) | 0.0206 |
| ATF3 | 0.021 | 0.69 | (0.50–0.95) | 0.0218 |
| CHD7 | 0.0236 | 1.45 | (1.05–2.01) | 0.0243 |
| STAT2 | 0.032 | 0.70 | (0.51–0.97) | 0.0329 |
| GTF2B | 0.04 | 0.71 | (0.52–0.99) | 0.0409 |
| IKZF1 | 0.0406 | 0.71 | (0.52–0.99) | 0.0416 |
| FOXA1 | 0.0459 | 0.72 | (0.52–1.00) | 0.0469 |
| MAX | 0.046 | 0.72 | (0.52–1.00) | 0.0469 |
| REST | 0.0493 | 1.38 | (1.00–1.91) | 0.0502 |
*HR Hazard ratio, CI confidence interval
Fig. 3Regulator based risk model of TCGA breast cancer patients. a The distribution of the significant regulator activities, patients’ survival status and gene expression signature were analyzed in the TCGA breast cancer patients. (i) Hormone receptor status and PAM50 molecular classification of breast cancer patients. (ii) Heatmap of the selected regulator activities profile. (iii) Patients overall survival status and time. Rows represent genes, and columns represent patients. The red dashed line represents the risk score median cutoff dividing patients into low-risk and high-risk groups. b Kaplan-Meier estimates of patient survival in high- and low-risk groups based on transcriptional regulator activities
Fig. 4Validation of the transcriptional regulator model with GEO datasets. Kaplan-Meier estimates of the survival of independent breast cancer datasets using the regulator activity signature. a Kaplan-Meier curves of overall survival for a GSE25066, b GSE2034, c GSE3494, d GSE20685, e GSE21653 and f GSE86166. The low-risk and high-risk groups of patients was determined on the basis of the median risk score for each validation dataset. The tick marks on the Kaplan-Meier curves represent censored subjects. The statistically significant differences between the two curves were determined by the two-sided log-rank test. The vertical red dashed line indicates the 10-year mark that was used for interpreting survival difference between groups
Fig. 5Association of the transcription regulator-based risk score with pathological and molecular features. The regulator risk score is shown for breast cancer stages in the TCGA cohort (a) and in a selected validation set GSE21653 (b). One-way ANOVA p values are provided. Cancer related pathways that were significantly altered in patients with high regulator risk scores included cell cycle (c) and PLK1 signaling pathways (d). Normalized enrichment score (NES) was used to evaluate the enrichment results