| Literature DB >> 12095422 |
David M Mutch1, Alvin Berger, Robert Mansourian, Andreas Rytz, Matthew-Alan Roberts.
Abstract
BACKGROUND: The biomedical community is developing new methods of data analysis to more efficiently process the massive data sets produced by microarray experiments. Systematic and global mathematical approaches that can be readily applied to a large number of experimental designs become fundamental to correctly handle the otherwise overwhelming data sets.Entities:
Mesh:
Substances:
Year: 2002 PMID: 12095422 PMCID: PMC117238 DOI: 10.1186/1471-2105-3-17
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The relationship between absolute value, limit fold change (LFC), and variance across the absolute expression range. A) The x-axis threshold indicates those genes that have a minimum ADI of 20. Genes in bins of 200 are examined for the top 5% highest fold changes (red horizontal lines indicate the 95th percentile for each bin). The line of best fit, drawn through each bin in blue, identifies the overall LFC cut-off and is described by the simple equation 5% LFC = 1.74 + 91.55/min ADI. B) Identifying the top 1% (black line) or 10% (red line) highest fold changes in each bin shifts the LFC curve, when compared to the 5% LFC model (blue line), and alters the severity for the selection of differentially expressed genes (1% LFC = 2.43 + 166.12/min ADI; 10% LFC = 1.59 + 69.47/min ADI). C) The upper 99.9% confidence limit (CL) of a robust estimation of the coefficient of variance (CV) for replicates (within-treatment variability) has been modeled as a function of absolute minimum expression of all treatments, as indicated by the blue line. Overlaying the 99.9% CL on the data selected by the 5% LFC model (red dots) ensures high confidence in the selected genes.
Concordance data between an Affymetrix 11MuK microarray and RT-PCR.
The fold changes observed with microarray and RT-PCR analysis are indicated, where a positive value indicates an increase in gene expression and a negative value a decrease in gene expression. Through the coloring scheme, validation (confirmation by RT-PCR of the direction of fold change seen with microarrays) of low RV genes is not achieved; however, as RV increases, concordance increases (red = genes with no concordance across the 3 diets; blue = genes with either one or two measurements in agreement; green = genes with 100% concordance). Overall concordance with the 5% LFC model was 77.7%, which includes measurements found to be both significant and non-significant by microarray analysis. Underlined numbers indicate the HFC that resulted in this gene being selected as significantly different by the 5% LFC model (77.7% concordance with RT-PCR results). Starred-numbers indicate significant fold changes, determined by a student's t-test using α = 0.05, seen by RT-PCR. § indicates those pairwise comparisons (treatment vs. control) that meet the 5% LFC model criteria. 85.7% concordance is seen when comparing significant fold changes by RT-PCR with significant fold changes using the 5% LFC model.
Figure 2Schematic representation of the cyclical nature of the limit fold change (LFC) model. Selecting an initial X% LFC model (1) provides a starting point for the identification of those genes differentially regulated. Genes can then be ranked (2) by a calculation combining fold change and absolute expression in order to assign a degree of importance. Validation of the chosen LFC model by a complementary technique such as RT-PCR (3) and/or the characterization of variance (4) enables the analyst to reexamine the initial LFC model and determine the confidence level for the results. Depending on the data set, one could redefine the LFC model and repeat the cycle.