| Literature DB >> 16545131 |
Richard E Kennedy1, Kellie J Archer, Michael F Miles.
Abstract
BACKGROUND: Current methods of analyzing Affymetrix GeneChip microarray data require the estimation of probe set expression summaries, followed by application of statistical tests to determine which genes are differentially expressed. The S-Score algorithm described by Zhang and colleagues is an alternative method that allows tests of hypotheses directly from probe level data. It is based on an error model in which the detected signal is proportional to the probe pair signal for highly expressed genes, but approaches a background level (rather than 0) for genes with low levels of expression. This model is used to calculate relative change in probe pair intensities that converts probe signals into multiple measurements with equalized errors, which are summed over a probe set to form the S-Score. Assuming no expression differences between chips, the S-Score follows a standard normal distribution, allowing direct tests of hypotheses to be made. Using spike-in and dilution datasets, we validated the S-Score method against comparisons of gene expression utilizing the more recently developed methods RMA, dChip, and MAS5.Entities:
Mesh:
Year: 2006 PMID: 16545131 PMCID: PMC1550434 DOI: 10.1186/1471-2105-7-154
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Comparison of S-Score and RMA. Plot of absolute value of S-Score vs absolute value of difference in RMA expression summaries, comparing the specified concentration to the baseline chip. X- and Y-axis projections are added to show separation of spike-in probes more clearly.
Figure 3Comparison of S-Score and MAS5. Plot of absolute value of S-Score vs MAS5 p-values, comparing the specified concentration to the baseline chip. MAS5 p-values were transformed so that significantly up- and down-regulated genes will have p-values approaching 0. X- and Y-axis projections are added to show separation of spike-in probes more clearly.
Observed and expected ranks. Observed and expected ranks from the Latin Square dataset for each of the four comparative methods, with linear correlation (R2) of MAS5 intensity vs concentration as quality control data.
| Chip | ||||||||||
| Probe Set | Rank | Observed Rank | ||||||||
| S-Score | RMA | dChip | MAS5 | |||||||
| BioB-5 | 7 | 11826 | 9395 | 125 | 539 | |||||
| BioB-M | 8 | 9 | 11023 | 136 | 61 | |||||
| BioB-3 | 9 | 7 | 11810 | 145 | 3 | |||||
| BioC-5 | 1 | 3 | 2 | 2 | 1 | |||||
| BioC-3 | 2 | 4 | 3 | 4 | 1 | |||||
| BioDn-3 | 9 | 5 | 12560 | 179 | 1 | |||||
| DapX-5 | 4 | 2 | 4 | 3 | 1 | |||||
| DapX-M | 9 | 64 | 9497 | 132 | 96 | |||||
| DapX-3 | 6 | 6 | 12443 | 155 | 1 | |||||
| CreX-5 | 3 | 1 | 1 | 1 | 1 | |||||
| CreX-3 | 5 | 1080 | 4811 | 92 | 251 | |||||
| Chip | 92564 (R2 = 0.047) | |||||||||
| Probe Set | Rank | Observed Rank | ||||||||
| S-Score | RMA | dChip | MAS5 | |||||||
| BioB-5 | 8 | 9628 | 4393 | 96 | 513 | |||||
| BioB-M | 10 | 5 | 213 | 147 | 1 | |||||
| BioB-3 | 9 | 6 | 47 | 140 | 1 | |||||
| BioC-5 | 4 | 2 | 12626 | 1 | 1 | |||||
| BioC-3 | 5 | 4 | 2119 | 112 | 1 | |||||
| BioDn-3 | 3 | 8 | 359 | 138 | 1 | |||||
| DapX-5 | 2 | 1 | 12625 | 2 | 1 | |||||
| DapX-M | 1 | 1656 | 7093 | 105 | 513 | |||||
| DapX-3 | 6 | 3 | 1233 | 122 | 1 | |||||
| CreX-5 | 1 | 7 | 320 | 145 | 1 | |||||
| CreX-3 | 7 | 22 | 11569 | 51 | 16 | |||||
| Chip | 92560 (R2 = 0.745) | |||||||||
| Probe Set | Rank | Observed Rank | ||||||||
| S-Score | RMA | dChip | MAS5 | |||||||
| BioB-5 | 2 | 5 | 4 | 4 | 1 | |||||
| BioB-M | 7 | 9 | 189 | 74 | 1 | |||||
| BioB-3 | 3 | 1 | 2 | 2 | 1 | |||||
| BioC-5 | 6 | 4 | 5 | 5 | 1 | |||||
| BioC-3 | 9 | 6 | 6 | 6 | 1 | |||||
| BioDn-3 | 8 | 10 | 881 | 87 | 1 | |||||
| DapX-5 | 6 | 3 | 3 | 3 | 1 | |||||
| DapX-M | 4 | 265 | 7339 | 115 | 130 | |||||
| DapX-3 | 1 | 7 | 7 | 9 | 1 | |||||
| CreX-5 | 9 | 2 | 1 | 1 | 1 | |||||
| CreX-3 | 5 | 670 | 2511 | 95 | 126 | |||||
| Chip | 92554 (R2 = 0.874) | |||||||||
| Probe Set | Rank | Observed Rank | ||||||||
| S-Score | RMA | dChip | MAS5 | |||||||
| BioB-5 | 3 | 1 | 1 | 1 | 1 | |||||
| BioB-M | 6 | 4 | 5 | 6 | 1 | |||||
| BioB-3 | 5 | 3 | 3 | 4 | 1 | |||||
| BioC-5 | 1 | 8 | 12 | 37 | 1 | |||||
| BioC-3 | 2 | 7 | 8 | 9 | 1 | |||||
| BioDn-3 | 7 | 6 | 14 | 27 | 1 | |||||
| DapX-5 | 4 | 10 | 6 | 13 | 1 | |||||
| DapX-M | 4 | 2 | 2 | 2 | 1 | |||||
| DapX-3 | 9 | 5 | 4 | 5 | 1 | |||||
| CreX-5 | 4 | 11 | 7 | 14 | 1 | |||||
| CreX-3 | 8 | 19 | 178 | 64 | 4 | |||||
| Chip | 92557 (R2 = 0.756) | |||||||||
| Probe Set | Rank | Observed Rank | ||||||||
| S-Score | RMA | dChip | MAS5 | |||||||
| BioB-5 | 1 | 2 | 2 | 2 | 1 | |||||
| BioB-M | 4 | 4 | 3 | 1 | 1 | |||||
| BioB-3 | 2 | 3 | 1 | 3 | 1 | |||||
| BioC-5 | 3 | 14 | 373 | 54 | 1 | |||||
| BioC-3 | 9 | 5 | 300 | 56 | 1 | |||||
| BioDn-3 | 5 | 1 | 4 | 4 | 1 | |||||
| DapX-5 | 10 | 9 | 9 | 10 | 1 | |||||
| DapX-M | 11 | 10 | 87 | 78 | 1 | |||||
| DapX-3 | 6 | 7 | 7596 | 107 | 1 | |||||
| CreX-5 | 8 | 8 | 16 | 19 | 1 | |||||
| CreX-3 | 7 | 6 | 1151 | 72 | 1 | |||||
Number and proportion of spike-in clones detected using chip 92561 as baseline
| 92562 | 4 (0.36) | 4 (0.36) | 4 (0.36) | 4 (0.36) |
| 92563 | 8 (0.72) | 4 (0.36) | 4 (0.36) | 7 (0.64) |
| 92564 | 8 (0.72) | 0 (0.00) | 2 (0.18) | 8 (0.72) |
| 92558 | 10 (0.90) | 10 (0.90) | 8 (0.72) | 11 (1.00) |
| 92559 | 11 (1.00) | 9 (0.81) | 9 (0.81) | 11 (1.00) |
| 92560 | 9 (0.81) | 7 (0.63) | 7 (0.63) | 9 (0.81) |
| 92554 | 10 (0.90) | 8 (0.72) | 6 (0.54) | 11 (1.00) |
| 92555 | 11 (1.00) | 9 (0.81) | 9 (0.81) | 11 (1.00) |
| 92556 | 10 (0.90) | 9 (0.81) | 7 (0.63) | 11 (1.00) |
| 92557 | 10 (0.90) | 5 (0.45) | 5 (0.45) | 11 (1.00) |
Comparison of S-Score vs. RMA, p < 0.001; vs. dChip, p < 0.001; vs. MAS5, p = 0.40