| Literature DB >> 23819917 |
James L Chen1, Alexander Hsu, Xinan Yang, Jianrong Li, Younghee Lee, Gurunadh Parinandi, Haiquan Li, Yves A Lussier.
Abstract
MOTIVATION: Gene expression-based prostate cancer gene signatures of poor prognosis are hampered by lack of gene feature reproducibility and a lack of understandability of their function. Molecular pathway-level mechanisms are intrinsically more stable and more robust than an individual gene. The Functional Analysis of Individual Microarray Expression (FAIME) we developed allows distinctive sample-level pathway measurements with utility for correlation with continuous phenotypes (e.g. survival). Further, we and others have previously demonstrated that pathway-level classifiers can be as accurate as gene-level classifiers using curated genesets that may implicitly comprise ascertainment biases (e.g. KEGG, GO). Here, we hypothesized that transformation of individual prostate cancer patient gene expression to pathway-level mechanisms derived from automated high throughput analyses of genomic datasets may also permit personalized pathway analysis and improve prognosis of recurrent disease.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23819917 PMCID: PMC3654873 DOI: 10.1186/1755-8794-6-S2-S4
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Prostate cancer datasets analyzed
| Author | Phenotype | Samples Analyzed | Usage | Source |
|---|---|---|---|---|
| [ | Normal vs Tumor | Authors examined 61 microdissected prostate cancer specimens along with 63 normal prostate tissues samples adjacent to tumor of patients. | Feature Selection | GDS2547 |
| [ | Normal vs Tumor | Authors examined 33 African-American and 36 European-American patients with prostate cancer. Also profiled 18 non-tumor prostate tissues from 7 African-American and 11 European-American patients. | Feature Selection | GSE6956 |
| [ | Normal vs Tumor | Comprehensive set from Memorial Sloan Ketting tumor bank in prostate tumors from 53 patients with primary or metastatic prostate cancer and 29 normal controls. | Feature Selection | GSE21032 |
| [ | Disease free interval survival | 79 tumors were obtained from the Memorial Sloan Kettering tumor bank. The authors identified three signatures of early recurrence (within one year). | Validation Set | From author |
The first three datasets are used for mechanism feature selection reported in Table 2 and Figure 1. The fourth dataset serves as independent validation as depicted in Figure 3.
Evaluation of the overlap among different approaches to prioritize classifier's mechanisms
| Approaches to Prioritize Classifier's Mechanisms | ||||
|---|---|---|---|---|
| Yu ∩ Wallace | 462 | 15%; 38% | 6.9 x10-6 | |
| Yu ∩ Taylor | 824 | 26%;34% | 3.8 x10-3 | |
| Wallace ∩ Taylor | 360 | 29%;15% | 2.2 x10-5 | |
| Yu ∩ Wallace ∩ Taylor | 137 | 4%;11%;6% | ||
| Yu ∩ Wallace | 41 | 15%;49% | 1.9x10-4 | |
| Yu ∩ Taylor | 77 | 27%;46% | 5.7 x10-6 | |
| Wallace ∩ Taylor | 25 | 30%;15% | 4.8 x10-4 | |
| Yu ∩ Wallace ∩ Taylor | 14 | 5%;17%;8% | ||
| Yu ∩ Wallace | 33 | 11%;6% | 6.1 x10-9 | |
| Yu ∩ Taylor | 65 | 22%;24% | <2 x10-16 | |
| Wallace ∩ Taylor | 20 | 3%;8% | 1.5 x10-3 | |
| Yu ∩ Wallace ∩ Taylor | 7 | 2%;1%;3% | ||
| Yu ∩ Wallace | 1 | 5%;10% | 4.2 x10-2 | |
| Yu ∩ Taylor | 0 | NA | NS | |
| Wallace ∩ Taylor | 0 | NA | NS | |
| Yu ∩ Wallace ∩ Taylor | 0 | NA | NS | |
Legend: NA = not applicable; NS = not significant; #= count of significant mechanisms overlap
Comparison of FAIME-prioritized mechanisms against controls (standard bioinformatics)
| FAIME-Derived Mechanisms | Controls: Standard Bioinformatics Approaches* | ||||
|---|---|---|---|---|---|
| 0.01 | |||||
| 0.05 | |||||
| 0.01 | |||||
| 0.05 | |||||
| 0.01 | |||||
| 0.05 | |||||
*As a comparison, differentially expressed genes (SAM, FDR ≤ 0.05) were enriched for Gene Ontology (GO) Terms using the DAVID tool. Numbers in parentheses indicate significant terms and the universe of terms from which they were drawn to generate the percentage indicated.
Figure 1FAIME transformation of gene expression arrays into Gene Ontology space (Panel A) and Cancer Module Space . In this analysis, we compared 3 different prostate cancer datasets (Yu, Wallace, Taylor) for differences between tumor and normal prostate gene expression in 4 different analysis spaces: Gene Ontology (Panel A), Cancer Modules (Panel B), standard differential genes (Panel C), and Gene Ontology terms derived from standard differentially expressed genes (Panel D). Each Venn diagram displays how well how each independent mechanism set overlaps in each of these spaces. The bold percentage in each panel provides the percentage of overlapping terms as a percentage of the average mechanism set length. Panel C demonstrates conventionally generated differentially expressed genes using the Significance Analysis of Microrarrays with a FDR of 5%. In panels A and B we first transform the gene expression arrays into either Biological Process Gene Ontology space or Cancer Module space, respectively. Individual pathway terms were analyzed standard t-test and adjusted for multiplicity. Terms with a FDR ≤ 5% were retained. The Gene Ontology terms in Panel D were generated by enriching the differentially expressed genes in Panel C using the DAVID tool and retaining terms with a FDR ≤ 5%. Legend: *Only concordantly deregulated mechanisms across datasets are counted in FAIME (e.g. significantly up-regulated ones in cancer against each other, then significantly down-regulated ones, then union of the two groups).
Figure 2FAIME-derived Gene Ontology and Cancer Module tumorigenesis mechanisms overlap significantly when we analyze their constituent genes by a bootstrap method to generate an empiric p-value. 127 genes overlap between the two of them and are highly enriched for known and novel deregulated prostate cancer pathways.
Figure 3Kaplan-Meier plots of recurrent prostate cancer using the FAIME-derived Cancer Module overlap set (Panel A) and the FAIME-derived Gene Ontology overlap set (Panel B). PAM analysis was used to divide patients into two cohorts. Statistically significant differences in time to recurrence are consistently observed based on log-rank statistic (p = 0.0186 and 0.0392, respectively).