Ashok Sharma, Jieping Zhao, Robert Podolsky, Richard A McIndoe.
Abstract
MOTIVATION: Significance analysis of microarrays (SAM) is a widely used permutation-based approach to identifying differentially expressed genes in microarray datasets. While SAM is freely available as an Excel plug-in and as an R-package, analyses are often limited for large datasets due to very high memory requirements.
Year: 2010 PMID: 20400455 PMCID: PMC2872005 DOI: 10.1093/bioinformatics/btq161
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Comparison of time (minutes) taken to complete the SAM analyses by R and ParaSAM (7 nodes) on two datasets (22 283 genes and 44 760 genes)
| Permutations | R-SAM, 20 arrays | R-SAM, 60 arrays | R-SAM, 117 arrays | ParaSAM, 20 arrays | ParaSAM, 60 arrays | ParaSAM, 117 arrays |
|---|---|---|---|---|---|---|
| 22 283 genes | | | | | | |
| 200 | 3.57±0.08 | 5.46±0.06 | 8.26±0.25 | 0.96±0.01 | 1.54±0.03 | 2.50±0.06 |
| 400 | 7.21±0.22 | 10.81±0.13 | 16.54±0.34 | 1.35±0.03 | 1.98±0.03 | 2.89±0.06 |
| 1000 | PF | PF | PF | 2.04±0.03 | 3.02±0.06 | 4.48±0.11 |
| 44 760 genes | | | | | | |
| 200 | 7.60±0.19 | 12.97±0.50 | 20.76±0.38 | 2.29±0.05 | 3.37±0.11 | 5.45±0.15 |
| 400 | 14.22±0.33 | PF | PF | 2.71±0.07 | 4.30±0.17 | 6.23±0.10 |
| 1000 | PF | PF | PF | 4.35±0.09 | 6.32±0.09 | 10.32±0.31 |
PF, program failed. All times are in minutes; N = 12 runs per configuration.
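To make the comparison in the table easier to read, the ratio of R-SAM time to ParaSAM time can be computed for every configuration in which R-SAM completed. The following is a minimal sketch, not part of the original work: the values are the mean times copied from the table, and the names `r_sam`, `para_sam`, `ARRAYS` and `speedups` are hypothetical. The ± terms are ignored, so the ratios are only point estimates.

```python
# Mean run times in minutes, copied from the table above.
# None marks "PF" (program failed). Keys are (genes, permutations);
# each list holds the times for 20, 60 and 117 arrays, in that order.
ARRAYS = [20, 60, 117]

r_sam = {
    (22283, 200): [3.57, 5.46, 8.26],
    (22283, 400): [7.21, 10.81, 16.54],
    (22283, 1000): [None, None, None],
    (44760, 200): [7.60, 12.97, 20.76],
    (44760, 400): [14.22, None, None],
    (44760, 1000): [None, None, None],
}

para_sam = {
    (22283, 200): [0.96, 1.54, 2.50],
    (22283, 400): [1.35, 1.98, 2.89],
    (22283, 1000): [2.04, 3.02, 4.48],
    (44760, 200): [2.29, 3.37, 5.45],
    (44760, 400): [2.71, 4.30, 6.23],
    (44760, 1000): [4.35, 6.32, 10.32],
}


def speedups():
    """Return {(genes, permutations, arrays): R-SAM time / ParaSAM time}.

    Configurations where R-SAM failed (PF) are skipped, because no
    ratio can be formed for them.
    """
    ratios = {}
    for (genes, perms), r_times in r_sam.items():
        p_times = para_sam[(genes, perms)]
        for n_arrays, r_t, p_t in zip(ARRAYS, r_times, p_times):
            if r_t is not None:
                ratios[(genes, perms, n_arrays)] = r_t / p_t
    return ratios


if __name__ == "__main__":
    for (genes, perms, n_arrays), ratio in sorted(speedups().items()):
        print(f"{genes} genes, {perms} permutations, {n_arrays} arrays: {ratio:.2f}x")
```

Configurations marked PF for R-SAM produce no ratio. For those, the table shows only that ParaSAM finished where R-SAM did not.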
Fig. 1. Execution time of ParaSAM and ‘samr’ with increasing size of datasets and permutations. Each series indicates the program and number of permutations used.