| Literature DB >> 28000771 |
Matias M Falco1, Marta Bleda2, José Carbonell-Caballero1, Joaquín Dopazo1,3,4.
Abstract
Dysregulation of the normal gene expression program is the cause of a broad range of diseases, including cancer. Detecting the specific perturbed regulators that have an effect on the generation and the development of the disease is crucial for understanding the disease mechanism and for taking decisions on efficient preventive and curative therapies. Moreover, detecting such perturbations at the patient level is even more important from the perspective of personalized medicine. We applied the Transcription Factor Target Enrichment Analysis, a method that detects the activity of transcription factors based on the quantification of the collective transcriptional activation of their targets, to a large collection of 5607 cancer samples covering eleven cancer types. We produced for the first time a comprehensive catalogue of altered transcription factor activities in cancer, a considerable number of them significantly associated to patient's survival. Moreover, we described several interesting TFs whose activity do not change substantially in the cancer with respect to the normal tissue but ultimately play an important role in patient prognostic determination, which suggest they might be promising therapeutic targets. An additional advantage of this method is that it allows obtaining personalized TF activity estimations for individual patients.Entities:
Mesh:
Substances:
Year: 2016 PMID: 28000771 PMCID: PMC5175166 DOI: 10.1038/srep39709
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Cancer samples available for any cancer type selected.
| Cancer type | Tumour | Normal | Stage I | Stage II | Stage III | Stage IV | Alive | Deceased |
|---|---|---|---|---|---|---|---|---|
| 294 | 17 | 1 | 95 | 99 | 98 | 221 | 73 | |
| 1039 | 113 | 177 | 591 | 237 | 17 | 937 | 98 | |
| 428 | 41 | 73 | 168 | 120 | 58 | 374 | 53 | |
| 480 | 42 | 26 | 74 | 72 | 245 | 320 | 158 | |
| 517 | 72 | 256 | 56 | 125 | 81 | 358 | 159 | |
| 222 | 32 | 138 | 16 | 43 | 13 | 199 | 23 | |
| 294 | 48 | 132 | 66 | 71 | 5 | 222 | 72 | |
| 473 | 55 | 255 | 116 | 81 | 24 | 355 | 118 | |
| 426 | 45 | 217 | 128 | 75 | 6 | 290 | 136 | |
| 500 | 58 | 282 | 54 | 110 | 52 | 481 | 14 | |
| 508 | 23 | 318 | 49 | 114 | 27 | 464 | 43 |
Figure 1Change of TF activity in the different cancers studied.
Cells in red indicate a significant increased activity of the TF in the cancer with respect to the corresponding normal tissue, according to the TFTEA, cells in blue indicate a significant decreased activity and cells in grey indicate that no significant change in activity was detected. Columns correspond to cancers and rows to TFs.
Figure 2Change of activity in all TF included in this study across cancer stages in the different cancers studied.
Each panel corresponds to a single TF, with stages in rows and cancers in columns. The colour scale in the figure ranges from red, indicating a significant increased activity of the TF in the stage of the cancer with respect to the corresponding normal tissue, according to the TFTEA, to blue, indicating a significantly decreased activity. The colour scale represents −log10 (adjusted p-value). Cells in grey indicate that no significant change in activity was detected. Cells in white correspond to stages in cancers with very few individuals (see Table 1) in which the analysis could not be carried out.
Figure 3K-M plots representing TF activities significantly associated to patient survival in all the cancers analysed.
TFs in bold present a significant association with adjusted p-values < 0.05 and TFs in italics have nominal p-values < 0.05.
TFs significantly associated to survival.
| Cancer type | Variables (TFs and PURITY) selected by the Cox model | Total | TFs with individual effect on survival (K-M) | Total |
|---|---|---|---|---|
| 22 | 4 | |||
| 18 | 5 | |||
| 21 | 7 | |||
| 9 | 17 | |||
| 17 | 38 | |||
| — | 0 | 1 | ||
| 20 | 1 | |||
| 12 | 4 | |||
| 19 | 4 | |||
| — | 0 | 6 | ||
| 22 | 6 |
The first column denoted the cancer type analysed. The second column contains the variables included in the Cox multiple regression model, which can be TFs and tumour purity. The third column contains the total number of TFs included in the Cox model. The fourth column shows TFs that show a significant association to survival by themselves. The fifth column contains the number of TFs significant in the K-M analysis. TFs in bold are significant with an adjusted p-value < 0.05. TFs in grey and in italic are significant with a nominal p-value < 0.05.
Figure 4K-M plots representing TF activities significantly associated to patient survival.
Survival curves are represented as solid lines and their corresponding confidence intervals as dotted lines. (A) High activity (green curve) of SPI1 in KIRC is significantly associated to patient survival (FDR-adjusted p-value = 2.09 × 10−6); (B) High activity of JUND in HNSC is significantly associated to bad prognostic (FDR-adjusted p-value = 3.42 × 10−5); (C) Low activity of CTCF in KIRC is significantly associated to bad prognostic (FDR-adjusted p-value = 2.81 × 10−4); (D) Low activity of MEF2C in KIRC is significantly associated to bad prognostic (FDR-adjusted p-value = 1.84 × 10−5).
Figure 5Combinations of TFs significantly associated to patient survival in the different cancers when a Cox model is applied.
Cancers are represented in columns and TFs in rows. For each cancer, several TFs and sometimes tumour purity were included in a cox model. The colour intensity is related to the significance of this association (p-value).
Figure 6Schema of the TFTEA method to obtain TFs differentially activated between two conditions compared.
The method uses gene expression values and compares two conditions (A and B) by means of any test to obtain a rank of differentially expressed genes (Rank DE) based on the statistic. Then, for each TF, a logistic regression78 is applied to discover associations of the TF targets to high or low values of the rank (lower panel). Thus, targets of TF1 show a clear association to high values of the statistic, meaning that have significantly higher expression in condition (A) than in condition ( ), which demonstrated the differential activity of TF1. TF2 is the opposite case, in which the TF is significantly less active in (B) than in (A). TF3 have their targets active or inactive in both conditions, meaning that these activities are not a collective property and consequently are not due to TF3, but maybe to other regulators.
Figure 7Schema of the TFTEA method to obtain personalized values of survival.
The method uses gene expression values and compares two conditions (A and B). However, in this case all the samples in the (B) condition are used to produce an average expression value for any of the genes in this condition (m vector). Then, each sample in the (A) condition (vc) can be compared to the average expression in the (B) condition and a rank of fold change (Fcg) is generated for each sample. Then, this ranking is used in the same way that the rank of differential expression was used in Fig. 6 to find differentially activated TFs in samples from the condition (A) with respect to the average (B) condition. Is samples are paired the Fcg value can be derived from the direct comparison between them.