| Literature DB >> 19399471 |
Eike Staub1, Joern Groene, Maya Heinze, Detlev Mennerich, Stefan Roepcke, Irina Klaman, Bernd Hinzmann, Esmeralda Castanos-Velez, Christian Pilarsky, Benno Mann, Thomas Brümmendorf, Birgit Weber, Heinz-Johannes Buhr, André Rosenthal.
Abstract
Wiskott-Aldrich syndrome (WAS) predisposes patients to leukemia and lymphoma. WAS is caused by mutations in the protein WASP which impair its interaction with the WIPF1 protein. Here, we aim to identify a module of WIPF1-coexpressed genes and to assess its use as a prognostic signature for colorectal cancer, glioma, and breast cancer patients. Two public colorectal cancer microarray data sets were used for discovery and validation of the WIPF1 co-expression module. Based on expression of the WIPF1 signature, we classified more than 400 additional tumors with microarray data from our own experiments or from publicly available data sets according to their WIPF1 signature expression. This allowed us to separate patient populations for colorectal cancers, breast cancers, and gliomas for which clinical characteristics like survival times and times to relapse were analyzed. Groups of colorectal cancer, breast cancer, and glioma patients with low expression of the WIPF1 co-expression module generally had a favorable prognosis. In addition, the majority of WIPF1 signature genes are individually correlated with disease outcome in different studies. Literature gene network analysis revealed that among WIPF1 co-expressed genes known direct transcriptional targets of c-myc, ESR1 and p53 are enriched. The mean expression profile of WIPF1 signature genes is correlated with the profile of a proliferation signature. The WIPF1 signature is the first microarray-based prognostic expression signature primarily developed for colorectal cancer that is instrumental in other tumor types: low expression of the WIPF1 module is associated with better prognosis.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19399471 PMCID: PMC2688022 DOI: 10.1007/s00109-009-0467-y
Source DB: PubMed Journal: J Mol Med (Berl) ISSN: 0946-2716 Impact factor: 4.599
Fig. 1Discovery and validation of the WIPF1 coexpression module. a The histogram shows the distribution of p values resulting from tests for correlation of the WIPF1 expression profile with expression profiles of each other gene in the Aronow data set. b The histogram shows the distribution of 430 p values resulting from correlation of expression profiles in the Ayers data set of the WIPF1 gene with each discovered gene in the Aronow data set. Note the strong deviation from the uniform distribution, indicating a high rate of successful validation. c A scatter plot of the correlation coefficients in Aronow and Ayers data of each of the 112 successfully validated genes reveals that most validated genes are positively correlated with the WIPF1 profile and only six genes are anti-correlated with WIPF1. All correlation coefficients are consistent in both data sets, i.e., they have the same sign
Fig. 2Two-way hierarchical clustering of colorectal cancer expression data from WIPF1-coexpressed genes. The data set results from fusing the Aronow and Ayers data sets and comprises 112 genes and 159 patients. Top: The dendrogram results from hierarchical clustering of tumors. Light gray indicates low expression, dark gray indicates high expression (log-transformed mean-centered). Five clusters of tumors are evident: cluster A is colored in red, all other clusters are colored in black. Left: The dendrogram results from hierarchical clustering of genes. Two main clusters of genes are evident: the larger cluster X is colored in red, the smaller cluster Y in black. Note the characteristic down-regulation of genes from cluster X in tumors from cluster A. Other tumors have anti-correlated or indifferent expression profiles
Fig. 3Differences in survival between colorectal cancer patients of the “cluster B” type in an independent microarray data set. Tumors that resembled the cluster “A” type were predicted by machine learning in an independent data set of 62 tumors with microarray expression profiles. Nine cluster “A”-like patients all survived until the study end. The difference in survival compared with 53 remaining patients is significant with p = 0.011 in the logrank test
Summary of WIPF1 correlation and survival/relapse association for 38 core genes of the WIPF1 module and WIPF1 itself
| Gene | WIPF1 correlation | WIPF1 correlation | WIPF1 correlation | Cox regression Wald test | Cox regression Wald test |
|---|---|---|---|---|---|
| Aronow data | Ayers data | Staub data | Wang data | Phillips data | |
| AGGF1 | 1.01E−09 | 6.40E−06 | 6.66E−03 | 4.74E−01 | 7.74E−01 |
| AVEN | 1.87E−10 | 1.04E−08 | 9.21E−08 | 1.23E−04 | 6.37E−04 |
| AXL | 3.15E-07 | 4.18E-06 | 1.27E-04 | 9.10E-02 | 2.23E-05 |
| BIN1 | 9.45E-08 | 4.09E-08 | 1.57E-04 | 8.26E-02 | 1.01E-02 |
| C12orf29 | 9.28E-07 | 4.99E-07 | 9.31E-04 | 4.75E-02 | 7.31E-06 |
| C12orf43 | 9.05E-12 | 1.61E-09 | 3.27E-03 | 2.46E-02 | 1.35E-04 |
| C1QB | 4.44E-16 | 6.73E-07 | 9.96E−04 | 1.76E−02 | 1.56E−02 |
| CCL16 | 8.44E−15 | 9.04E−10 | 3.02E-04 | 4.16E−03 | 9.65E−04 |
| CHRDL1 | 1.22E−09 | 1.17E−07 | 2.10E−03 | 1.34E−03 | 5.83E−03 |
| CYLC1 | 9.27E−12 | 1.91E−06 | 5.33E−04 | 1.16E−03 | 2.38E−01 |
| DDX50 | 3.69E−07 | 3.60E−09 | 1.35E−08 | 5.63E−04 | 7.08E−03 |
| DECR2 | 4.89E−12 | 4.55E−07 | 8.30E−07 | 8.97E−04 | 1.46E−04 |
| DEDD | 7.73E−12 | 4.91E−08 | 6.10E−11 | 1.48E−03 | 2.74E−02 |
| DTX4 | 4.00E−09 | 3.28E−07 | 7.26E−04 | 7.73E−01 | 5.25E−05 |
| EFHD2 | 3.97E−08 | 2.55E−06 | 3.42E−03 | 2.05E−01 | 2.42E−02 |
| GTPBP2 | 4.64E−12 | 3.01E−10 | 1.26E−05 | 1.55E−05 | 1.63E−02 |
| HSPA14 | 1.18E−12 | 2.92E-06 | 1.84E-03 | 6.09E-01 | 6.07E-03 |
| IPW | 9.49E-10 | 2.44E-07 | 1.64E-05 | 2.38E-03 | 3.69E-02 |
| MADCAM1 | 1.39E-08 | 2.19E-06 | 1.16E-03 | 3.89E-01 | 1.22E-02 |
| MINA | 0.00E+00 | 6.98E-06 | 8.23E-05 | 9.65E-02 | 5.60E-03 |
| MOBKL2B | 8.13E-12 | 4.70E-06 | 5.86E-03 | 1.61E−01 | 9.03E−01 |
| MRPL16 | 1.48E−09 | 9.52E−10 | 8.58E−04 | 1.36E−01 | 4.49E−02 |
| MS4A12 | 1.36E−13 | 2.77E−10 | 2.02E−04 | 4.64E−01 | 1.02E−03 |
| NF1 | 1.31E−14 | 2.16E−07 | 6.94E−03 | 3.51E−01 | 6.15E−02 |
| NIP7 | 1.92E−09 | 2.03E−08 | 7.09E−03 | 4.60E−01 | 8.45E−01 |
| NLK | 0.00E+00 | 3.29E−09 | 6.99E−05 | 7.86E−03 | 3.52E−02 |
| NT5E | 3.61E−12 | 4.34E−09 | 1.05E−04 | 5.92E−06 | 4.19E−03 |
| PGM1 | 3.93E−07 | 3.17E−07 | 4.95E−03 | 4.04E−01 | 8.73E−01 |
| PLA2G2E | 1.99E−11 | 9.69E−06 | 6.35E−04 | 2.24E−01 | 1.04E−03 |
| SCUBE2 | 4.93E−08 | 4.20E−06 | 1.94E−03 | 7.52E−04 | 2.49E−03 |
| SH2B1 | 1.29E−11 | 2.68E−08 | 1.28E−06 | 5.99E−03 | 6.13E−03 |
| SLC24A1 | 1.06E−09 | 5.12E−08 | 4.70E−03 | 2.26E−05 | 1.92E−04 |
| SLC39A7 | 1.17E−07 | 4.49E−06 | 1.72E−04 | 3.57E−03 | 9.01E−05 |
| SPBC25 | 1.28E−11 | 5.89E−06 | 2.90E−05 | 1.48E−03 | 2.39E−03 |
| THBD | 1.02E−14 | 1.78E−06 | 3.81E−03 | 1.79E−03 | 8.01E−04 |
| TXNIP | 8.99E−08 | 1.85E−07 | 9.15E−04 | 2.85E−01 | 1.25E−01 |
| UBE2E3 | 1.55E−15 | 1.31E−07 | 5.94E−04 | 4.08E−04 | 3.28E−02 |
| ZNF230 | 1.60E−09 | 3.87E−09 | 1.90E−09 | 6.37E−04 | 1.03E−03 |
| WIPF1 | – | – | – | 6.92E−06 | 7.80E−02 |
A core set of 38 genes of the WIPF1 module (discovered in the Aronow data set) for which WIPF1 coexpression could be double validated in the Ayers and Staub data sets at stringent thresholds are listed here together with the p values of their WIPF1 correlation tests and of their survival/relapse association tests. A list of results for the full set of 112 genes of the WIPF1 module is given in Supplementary Table 2.
Fig. 4The WIPF1 module identifies breast cancer and glioma patients with better prognosis. Histogram (a) visualizes the distribution of p values resulting from logrank tests for the association of profiles of single genes of the WIPF1 module with relapse in the Wang data on breast cancers. b The Kaplan–Meier curves for breast cancer patients assigned to the groups cluster “A” (upper curve) or cluster “B”. Histogram (c) visualizes the distribution of p values resulting from logrank tests for the association of single expression profiles of the WIPF1 coexpressed genes with relapse in the Phillips data on gliomas. d The Kaplan–Meier curves for glioma patients assigned to the groups “A” (upper curve) or “B”. Note the strong deviation from the uniform distribution (that would result from chance association) in plots (a) or (c) and the consistently higher fractions of survivors among “A” patients in (b) and (d). These plots visualize the high fraction of genes of the WIPF1 module that are individually associated with survival in breast and brain cancers and the survival significance of the complete module
Fig. 5Similarity of the WIPF1 and proliferation signatures. We determined average profiles (signature centroids) of the WIPF1 and the proliferation signatures in three data sets. Only 107 of 112 WIPF1 module genes with concordant lower expression in cluster X of the WIPF1 signature were considered for this analysis. Signature centroids were determined using averaging over all genes for each patient. The scatter plots visualize the strength of the correlation between proliferation signature and WIPF1 signature. Coordinates of each data point correspond to a single patient’s averaged mean-centered expression values for both signatures, a for the Staub colorectal cancer data set, b for the Wang breast cancer data set, and c for the Phillips glioma expression data set
Summaries of top literature subnetworks enriched with genes from the WIPF1 coexpression module
| No | General Molecular Network | GO Processes | Total nodes | Root nodes | |
| 1 | ADAM19, SLC25A10, CDC14a, UBE2E3, TXNIP (VDUP1), ... | Sulfate transport (8.1%; 7.943e−06), cell division (18.9%; 1.334e−05), mitosis (16.2%; 1.414e−05), M phase of mitotic cell cycle (16.2%; 1.598e−05), M phase (18.9%; 1.854e−05) | 50 | 13 | 4.38E−26 |
| 2 | REA, NLK, Chordin-like 1, Copine-1, ... | BMP signaling pathway (11.6%; 8.328e−08), positive regulation of osteoblast differentiation (9.3%; 1.255e−06), regulation of osteoblast differentiation (9.3%; 7.241e−06), transmembrane receptor protein serine/threonine kinase signaling pathway (11.6%; 1.533e−05), developmental process (67.4%; 2.095e−05) | 50 | 12 | 1.49E−23 |
| 3 | Neurofibromin, TXNIP (VDUP1), REA, DEDD, DEDD2, ... | Regulation of apoptosis (45.5%; 2.153e−13), regulation of programmed cell death (45.5%; 2.710e−13), regulation of developmental process (52.3%; 2.099e−12), Ras protein signal transduction (22.7%; 1.044e−11), negative regulation of cellular process (54.5%; 1.944e−11) | 50 | 10 | 9.99E−19 |
| No | Transcriptional Regulation Network | GO Processes | Total nodes | Root nodes | |
| 4 | c-Myc | Positive regulation of mitotic cell cycle (25.0%; 1.015e−05), cell cycle (62.5%; 3.803e−05), regulation of mitotic cell cycle (37.5%; 5.661e−05), regulation of cell cycle (50.0%; 5.920e−05), positive regulation of cell cycle (25.0%; 3.432e−04) | 9 | 8 | 4.02E−22 |
| 5 | ESR1 | Response to hormone stimulus (57.1%; 2.915e−05), response to endogenous stimulus (57.1%; 3.184e−05), response to organic nitrogen (28.6%; 1.001e−04), response to steroid hormone stimulus (42.9%; 2.113e−04), negative regulation of hydrolase activity (28.6%; 2.580e−04) | 8 | 7 | 2.31E−19 |
| 6 | p53 | Response to organic nitrogen (28.6%; 1.001e−04), positive regulation of cell cycle (28.6%; 2.580e−04), regulation of apoptosis (57.1%; 4.295e−04), nucleic acid–protein covalent cross-linking (14.3%; 4.446e−04), RNA–protein covalent cross-linking (14.3%; 4.446e−04) | 8 | 7 | 2.31E−19 |
Here we show information about the top literature subnetworks with significant enrichment for genes of the WIPF1 module using two modes of analysis in the metacore software. Networks 1, 2, and 3 were derived using a large literature network considering all types of molecular interactions. Networks 4, 5, and 6 resulted from an enrichment analysis of subnetworks centered around transcription factors. Column “GO processes” shows Gene Ontology (GO) categories that are enriched in a subnetwork. Significance of enrichment of gene groups (be it WIPF1 coexpressed genes or genes associated with GO categories) in subnetworks was assessed using hypergeometric tests
Fig. 6Fused transcriptional regulation network of genes of the WIPF1 module that are regulated by c-myc, ESR1 or p53. Here we show the fused network of WIPF1 co-expressed genes that are linked to c-myc, ESR1 or p53 according to literature evidence. The legend on the left depicts the graphical symbols that describe the type of protein in the network. The arrows indicate the direction of the regulation. The colors of circles around gene names represent the coupling to transcription factors: green for c-myc, red for p53, blue for ESR1