| Literature DB >> 15533245 |
Caimiao Wei1, Jiangning Li, Roger E Bumgarner.
Abstract
BACKGROUND: Microarray experiments are often performed with a small number of biological replicates, resulting in low statistical power for detecting differentially expressed genes and concomitant high false positive rates. While increasing sample size can increase statistical power and decrease error rates, with too many samples, valuable resources are not used efficiently. The issue of how many replicates are required in a typical experimental system needs to be addressed. Of particular interest is the difference in required sample sizes for similar experiments in inbred vs. outbred populations (e.g. mouse and rat vs. human).Entities:
Mesh:
Substances:
Year: 2004 PMID: 15533245 PMCID: PMC533874 DOI: 10.1186/1471-2164-5-87
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
cDNA microarray data sets used in the study
| Data set | Reference | # Rep | # Genes | Tissue type | Description | Hybridization |
| A | Smith et al. 2003 | 20 | 15,592 | Human liver | Paired HCC tumor vs adjacent non-tumor | Direct hyb between tumor and non tumor |
| B | Lapointe et al. 2004 | 41 | 38627 | Human prostate | Paired prostate tumor vs adjacent non-tumor | Indirect hyb using common reference |
| C | Chen et al 2002 | 48 | 22618 | Human liver | Paired HCC+HBV vs HBV | Indirect hyb using common reference |
| D | Pritchard et al. 2001 | 6 | 5281 | Mouse liver and kidney | Paired liver vs kidney | Indirect hyb using common reference |
| E | Zhao et al. 2004 | 36 ductal + 21 lobular | 44549 | Human breast | lobular and ductal tumor tissue | Indirect hyb using common reference |
| F | NA | 6 | 13, 056 | Mouse liver | One third vs two thirds hepatectomy | Indirect hyb using individual baseline |
| G | Callow et al. 2000 | 8 | 5548 | Mouse liver | ApoAI knock-out vs normal | Indirect hyb using common reference (pool) |
Figure 1Histogram of standard deviation The X axis is the standard deviation, and the Y axis is the percentage of genes that has standard deviation below the value of X. All data sets were normalized by spatial lowess; (A) Data set A-standard deviation of log ratio of two groups (direct hybridization); data set B-D standard deviation of the difference of log (sample/reference) of the two groups (indirect hybridization); (B) Data sets E-G common standard deviation of (sample/reference) of the two independent groups (indirection hybridization).
Figure 2Standard deviation versus log intensity Standard deviations are based on one sample t-test (data set A), paired t-test (data sets B-D), or two independent t-test (data sets E-G).
Significant genes/ESTs/probes called by methods used in the studies using different criteria (combination of significance level and fold changes)
| Data set | Reference | P <= 0.001 and the estimated fold change >=2 | P <= 0.001 only | P <= 0.01 only |
| A | Smith et al. 2003 | 183 | 1783 | 3590 |
| B | Lapointe et al. 2004 | 609 | 6549 | 10153 |
| C | Chen et al 2002 | 1253 | 4187 | 6197 |
| D | Pritchard et al. 2001 | 479 | 1557 | 1845 |
| E | Zhao et al. 2004 | 270 | 1050 | 3821 |
| F | NA | 16 | 145 | 723 |
| G | Callow et al. 2000 | 6 | 11 | 77 |