| Literature DB >> 15921534 |
Ola Larsson1, Claes Wahlestedt, James A Timmons.
Abstract
BACKGROUND: Users of microarray technology typically strive to use universally acceptable data analysis strategies to determine significant expression changes in their experiments. One of the most frequently utilised methods for gene expression data analysis is SAM (significance analysis of microarrays). The impact of selection thresholds, on the output from SAM, may critically alter the conclusion of a study, yet this consideration has not been systematically evaluated in any publication.Entities:
Mesh:
Year: 2005 PMID: 15921534 PMCID: PMC1173086 DOI: 10.1186/1471-2105-6-129
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1FC effects on the endurance training data set: SAM analysis was used at various fold changes studying the exercise data set while scoring genes with a q-value of <0.05 and FC>1.5. This was done to asses the effect of the fold change option in the SAM Excel addin on genes reported as significant at a higher fold change. The figure shows the number of scored genes using 4 different chip and sample combinations: (A) All eight subjects before and after training (totally 16 arrays) were compared in a paired analysis using the U95A chips. (B) All eight subjects before and after training (totally 16 arrays) were compared in a paired analysis using the U95B chips. (C) The reduced group consisting of low four low responders (totally 8 arrays) were compared in a paired analysis using the U95B chips. (D) The reduced group consisting of low four low responders (totally 8 arrays) were compared in a paired analysis using the U95D chips.
Figure 2FC effects on the senescence data set: SAM analysis was used at various fold changes using the senescence data set while scoring genes with a q-value of <0.01 and FC>1.5. A comparison between non-senescent cells and senescent cells was used (two replicates of the senescent cells and four replicates of the non-senescent cells).
Figure 3FC effects on the brain aging data set: SAM analysis at various fold changes using the brain aging data set (11 old samples and 10 young samples) while scoring genes with a q-value of <0.01 and FC>1.5. We used both the full data set and the reduced data set suggested by the authors. (A) The full data set. (B) The reduced data set (20% "present" calls by D-chip").
Figure 4FC effects individual q-values: q-values of all genes scored as significant in the brain aging study (reduced data set) at FC 1.51 at other FC settings. Genes only acquires discrete q-values and all 538 genes are shown, but overlap. (B) Running SAM with different FC settings changes the biological interpretation: Venn diagram comparing the number of significantly overrepresented classifications (EASE score <0.05) using the reduced brain aging data set analysed either with a 1.0 FC setting (314 genes) or a 1.5 FC setting (538 genes).