| Literature DB >> 18570644 |
Florent Baty1, Daniel Jaeger, Frank Preiswerk, Martin M Schumacher, Martin H Brutsche.
Abstract
BACKGROUND: Multivariate ordination methods are powerful tools for the exploration of complex data structures present in microarray data. These methods have several advantages compared to common gene-by-gene approaches. However, due to their exploratory nature, multivariate ordination methods do not allow direct statistical testing of the stability of genes.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18570644 PMCID: PMC2441634 DOI: 10.1186/1471-2105-9-289
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Assessment of gene contributions by orthogonal projections. The contribution of gene j toward the three classes of samples is measured by the distance α0 from the center of the BGA axes to the orthogonal projections onto the vectors of class centroids.
Figure 2Stability of gene contributions using bootstrapping. Uncertainty plots in the upper panels display for each data set the coordinates of the 10 most discriminating genes after partial bootstrap (500 repetitions) in the first two axes of BGA. Convex hulls containing 25%, 50%, 75% and 100% of the points are used to represent the spread of gene coordinates. The directions of class centroids are represented by arrows. In the lower panels, sensitivity boxplots show the distributions of gene contributions. Genes are ranked from left to right according to their discriminating power. The zero threshold is depicted as a dashed line. Gene distributions where more than 5% of values are below 0, are represented as plain boxplots.
Figure 3Detection of influential observations and outliers by jackknifing. Stability plots in the upper panels show the shifts of sample coordinates induced by jackknifing in the first two axes of BGA. The dashed ellipse delineate 2 standard deviations of the sample coordinates on the displayed axes. Barplots in the lower panels show how many times samples were declared as significantly influential.
Functional analysis of genes obtained by bootstrapped BGA, bootstrapped CA and ANOVA
| Functional category of genes | Benjamini-Hochberg adjusted | ||
| Bootstrapped BGA | Bootstrapped CA | ANOVA | |
| Response to stress | 24% (p = 0.01) | 18% ( | 17% ( |
| Defense response | 25% ( | 16% ( | 15% ( |
| Immune response | 24% ( | 15% ( | 13% ( |
| Humoral immune response | 11% ( | 4% ( | 5% ( |
| Response to biotic stimulus | 25% ( | 16% ( | 16% ( |
| Response to stimulus | 37% ( | 24% ( | 26% ( |
| Response to pest, pathogen or parasite | 16% ( | 12% ( | 11% ( |
| Response to other organism | 16% ( | 12% ( | 11% ( |
| Gas transport | 5% ( | 0% (NS) | 3% ( |
| Oxygen transport | 5% ( | 0% (NS) | 3% ( |
| Humoral defense mechanism | 8% ( | 0% (NS) | 5% ( |
The Gene Ontology analysis is based on the 11 most significant GO categories (adjusted p-values < 0.05) obtained after bootstrapped BGA. The results present the proportion of genes that belong to these GO categories (%) and the enrichment significance (p-values are adjusted using the Benjamini-Hochberg method [30]), after bootstrapped BGA, bootstrapped CA and ANOVA.