| Literature DB >> 16430768 |
Kyoungmi Kim1, Grier P Page, T Mark Beasley, Stephen Barnes, Katherine E Scheirer, David B Allison.
Abstract
BACKGROUND: High-density microarray technology is increasingly applied to study gene expression levels on a large scale. Microarray experiments rely on several critical steps that may introduce error and uncertainty in analyses. These steps include mRNA sample extraction, amplification and labeling, hybridization, and scanning. In some cases this may be manifested as systematic spatial variation on the surface of microarray in which expression measurements within an individual array may vary as a function of geographic position on the array surface.Entities:
Mesh:
Year: 2006 PMID: 16430768 PMCID: PMC1373606 DOI: 10.1186/1471-2105-7-35
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Descriptive statistics of measurements of individual chips in experiment A and summaries of resulting GEODEX. Five chips (9, 12, 14, 15, and 23) in boldface below consistently have the lowest GEODEX irrespective of the number of blocks.
| 1Chip ID | 2Mean | 3SD | 4CV | 5GEODEX[1] | 6GEODEX[2] | ||
| NB = 9 | NB = 36 | NB = 144 | |||||
| 1 | 298.24 | 562.15 | 188.48 | 0.9948 | 0.9932 | 0.9901 | 0.9863 |
| 2 | 306.96 | 588.32 | 191.65 | 0.9962 | 0.9950 | 0.9927 | 0.9896 |
| 3 | 387.64 | 799.63 | 206.27 | 0.9934 | 0.9920 | 0.9890 | 0.9853 |
| 4 | 309.45 | 581.16 | 187.79 | 0.9951 | 0.9935 | 0.9906 | 0.9868 |
| 5 | 398.48 | 847.48 | 212.67 | 0.9943 | 0.9929 | 0.9901 | 0.9867 |
| 6 | 364.01 | 822.76 | 226.02 | 0.9943 | 0.9925 | 0.9892 | 0.9853 |
| 7 | 403.05 | 866.67 | 215.02 | 0.9925 | 0.9907 | 0.9873 | 0.9831 |
| 8 | 302.11 | 854.81 | 282.94 | 0.9965 | 0.9957 | 0.9944 | 0.9922 |
| 9 | 372.76 | 818.37 | 219.54 | ||||
| 10 | 482.55 | 961.08 | 199.16 | 0.9932 | 0.9916 | 0.9885 | 0.9847 |
| 11 | 290.04 | 563.84 | 194.39 | 0.9948 | 0.9934 | 0.9905 | 0.9869 |
| 12 | 288.50 | 687.20 | 238.19 | ||||
| 13 | 343.23 | 663.09 | 193.18 | 0.9941 | 0.9929 | 0.9902 | 0.9868 |
| 14 | 290.12 | 610.07 | 210.27 | ||||
| 15 | 236.84 | 482.26 | 203.61 | ||||
| 16 | 261.13 | 833.62 | 319.22 | 0.9976 | 0.9969 | 0.9958 | 0.9940 |
| 17 | 501.95 | 1014.49 | 202.10 | 0.9909 | 0.9893 | 0.9863 | 0.9824 |
| 18 | 309.42 | 624.82 | 201.93 | 0.9938 | 0.9923 | 0.9893 | 0.9856 |
| 19 | 261.58 | 473.18 | 180.88 | 0.9938 | 0.9926 | 0.9898 | 0.9861 |
| 20 | 328.96 | 668.14 | 203.10 | 0.9962 | 0.9946 | 0.9917 | 0.9879 |
| 21 | 281.70 | 543.55 | 192.95 | 0.9952 | 0.9939 | 0.9914 | 0.9881 |
| 22 | 356.03 | 692.10 | 194.39 | 0.9953 | 0.9939 | 0.9912 | 0.9878 |
| 23 | 367.10 | 840.50 | 228.95 | ||||
| 24 | 395.99 | 776.34 | 196.05 | 0.9959 | 0.9948 | 0.9922 | 0.9891 |
| 25 | 293.60 | 617.81 | 210.42 | 0.9943 | 0.9930 | 0.9903 | 0.9867 |
1. Chip ID is the identification number of chip.
2. MEAN is the average of intensity measurements of gene expression for each chip.
3. SD is the standard deviation of intensity measurements of gene expression for each chip.
4. CV is the coefficient of variation of intensity measurements of gene expression for each chip.
5. GEODEX[1] is a geography index calculated by model (1).
6. GEODEX[2] is a geography index calculated by model (2) (NB = the number of defining blocks per chip).
Figure 1Distributions of geography index (GEODEX) values of individual chips of experiment A. a. Black circles and white upper triangles represent GEODEX computed by model (1) and model (2), respectively. Black rectangles and white diamonds represent GEODEX computed by model (2), using measurements in the log scale on base 2 and trimmed measurements after removing four blocks on the corners, respectively. b. For comparisons between different measures of gene expression, black circles, white circles, and black upper triangles represent GEODEX of difference (PM-MM) measurements of pairs of the perfect match (PM) and mismatch (MM) probes, of PM-only measurements, and of MM-only measurements, respectively. GEODEX were computed by model (2) (NB = 36). X-axis indicates a chip identification number and Y-axis indicates the degree of GEODEX.
Figure 2Image of Affymetrix chip 12 of experiment A. The bottom area that was damaged by technical mistakes during scanning on the right corner of chip 12 is shown enlarged.
Figure 3Distributions of blockwise GEODEX. a. A boxplot of 25 chips: Each box shows the distribution of blockwise GEODEX for 36 blocks per chip (i.e., NB = 36). The X-axis indicates a chip identification number. b. A boxplot of 36 blocks: Each box shows the distribution of blockwise GEODEX for 25 chips per block. The X-axis indicates a block identification number within the chip. Crosses represent outliers.
Figure 4Assessing the validity of geography index. Scatter plot a (c) shows the relationship between GEODEX and the coefficient of variation (CV) of measurements for experiment A (a simulation study). Scatter plot b (d) presents the relationship between pairwise GEODEX computed by equation (5) and the degree of inter-chip discrepancy computed by equation (4) for every pair of technical replicates for biological samples in experiment A (for every pair of the sample array and simulated replicates for a simulation study). GEODEX was computed by model (2).
Descriptive statistics and resulting GEODEX of a simulation study.
| 1Chip ID | 2MEAN | 3SD | 4CV | 5GEODEX[1] | 6GEODEX[2] |
| Sample | 298.2445 | 562.1518 | 1.88 | 0.99493965 | 0.99015619 |
| Scenario 1 | 298.245 | 562.1518 | 1.88 | 0.9949455 | 0.99015956 |
| Scenario 2 | 295.4852 | 562.2858 | 1.90 | 0.99422959 | 0.98896773 |
| Scenario 3 | 305.9073 | 574.8667 | 1.87 | 0.99588081 | 0.9852998 |
| Scenario 4 | 367.2101 | 934.5144 | 2.54 | 0.97747889 | 0.8120757 |
| Scenario 5 | 1056.8657 | 7701.3605 | 7.28 | 0.95547486 | 0.67372248 |
| Scenario 6 | 4609.2052 | 43392.8549 | 9.41 | 0.95409713 | 0.66822548 |
| Scenario 7 | 8927.8288 | 86801.9967 | 9.72 | 0.95396446 | 0.66776258 |
| Scenario 8 | 13246.4524 | 130212.341 | 9.82 | 0.95392095 | 0.66761398 |
1. Chip ID is the identification number of simulated chip.
2. MEAN is the average of intensity measurements of gene expression for each chip.
3. SD is the standard deviation of intensity measurement of gene expression for each chip.
4. CV is the coefficient of variation of intensity measurements of gene expression for each chip.
5. GEODEX[1] is a geography index calculated by model (1).
6. GEODEX[2] is a geography index calculated by model (2) (NB = 36).
Figure 5Distributions of GEODEX of individual chips of experiments B and C. a. Each shape stands for each biological sample: circles for biological sample 1 and triangles for biological sample 2. b. For a comparison between different microarray platforms, circles and triangles represent Affymetrix oligonucleotide arrays (chips 1–3) and cDNA arrays (chips 4–6), respectively, and orange, green, and red represent Cy5/Cy3, Cy3-only, and Cy5-only measures, respectively. GEODEX was computed by model (2).