| Literature DB >> 19208157 |
Abstract
BACKGROUND: The gene shaving algorithm and many other clustering algorithms identify gene clusters showing high variation across samples. However, gene expression in many signaling pathways show only modest and concordant changes that fail to be identified by these methods. The increasingly available signaling pathway prior knowledge provide new opportunity to solve this problem.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19208157 PMCID: PMC2648790 DOI: 10.1186/1471-2105-10-S1-S54
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The schematic diagram of the proposed algorithm. "Enrichment test" means to determine the PC(s) that are most enriched by a prior knowledge gene set. α% is set to 10% following Hastie et al [4].
Figure 2Demonstration of the claimed advantages of our algorithm using the "ground truth" reported in [17]. (a) Plots of expression profiles of high-amplitude and low-amplitude gene sets. (b) Evaluating the capability of our algorithm to recover a complete low-amplitude gene set. The gene shaving shaving algorithm [4] fails in this case because it exclusively works with the leading PC. X-axis represents the increasing sizes of the subsets, and Y-axis represents the -log2P of the enrichment, indicating increased capacity of recovering a complete gene set. (c) Evaluating the capability of gene shaving algorithm [4] to recover a complete high-amplitude gene set.
Figure 3SVD analysis without splitting negative and positive PC's. WNT and NOTCH genes are maximally enriched (P-value: E-06) in the second PC (red lines), not the leading PC.
Figure 4SVD analysis with splitting negative and positive PC's. Further, WNT and NOTCH genes are maximally enriched (P-value: E-10) in the second negative PC and the second positive PC (red lines), and the level of enrichment is dramatically increased because the sizes of negative and positive PC's decrease.
Figure 5Algorithm comparisons. Horizontal axis represents the number of iterations in both upper or lower panels. The vertical axis of the upper panel corresponds to the -log2P-value of the enrichment of prior knowledge. The vertical axis of the lower panel corresponds to the number of genes in the cluster (upper) and size of the cluster (lower). (a) The performance of the original gene shaving algorithm gauged by prior knowledge enrichment over iterations [4]. (b) The performance of our semi-supervised gene shaving algorithm without splitting positive and negative PC's. (c) The performance of our semi-supervised gene shaving algorithm with splitting positive and negative PC's.
Figure 6The predicted NOTCH cluster. Highlighted genes are prior knowledge. Genes that are pointed by red arrows correspond to experimentally validated NOTCH genes, and genes pointed by blue arrows correspond to potentially interesting genes by expert opinion and literature search. The whole list of prior knowledge and prediction are available in supplemental tables.
Biological function enrichment analysis and transcription factor association analysis.[]
| Gene Set | Size | GO Annotation | Transcription Factors |
| WNT | 45 | embryonic development (1.13E-04) | MyoG_Myotubes (9.47E-03) [ |
| cytosol (9.15E-06) | MyoD_Growing cells (1.99E-05) [ | ||
| cytosolic part (4.48E-08) | |||
| iron ion binding (3.92E-06) | |||
| tube development (3.86E-04) | |||
| branching morphogenesis of a tube (9.88E-06) | |||
| tube morphogenesis (7.26E-05) | |||
| patterning of blood vessels (3.57E-05) | |||
| embryonic pattern specification (1.11E-04) | |||
| oxygen binding (6.89E-14) | |||
| gas transport (4.60E-14) | |||
| hemoglobin complex (1.12E-14) | |||
| NOTCH | 36 | developmental maturation (3.86E-04) | MyoG_Myotubes (9.47E-03) [ |
| negative regulation of cell differentiation (3.01E-04) | MyoD_Growing cells (1.99E-05) [ | ||
| ectoderm development (1.91E-05) | |||
| cell maturation (1.94E-04) | |||
| tissue morphogenesis (1.12E-05) | |||
| epidermis morphogenesis (2.00E-06) | |||
| hair cell differentiation (5.26E-06) | |||
| mechanoreceptor differentiation (7.56E-06) | |||
| negative regulation of neuron differentiation (3.49E-06) | |||
| regulation of neuron differentiation (3.93E-05) | |||
| cell fate determination (9.65E-06) | |||
| auditory receptor cell fate commitment (3.78E-08) | |||
The third column contains biological functions significantly enriched in WNT and NOTCH pathways, and the fourth column contains transcription factors significantly associated with WNT and NOTCH pathways. The analysis was done through the web-server of the Segal lab: [23]
Figure 7Leave-one-out and Leave-two-out Jackknife estimations and confidence intervals of the enrichment. (a) Accessing cluster sensitivity to perturbation of prior knowledge using leave-one-out approach (b) Accessing cluster sensitivity to perturbation of prior knowledge using leave-two-out approach.