| Literature DB >> 15907200 |
Sacha A F T van Hijum1, Anne de Jong, Richard J S Baerends, Harma A Karsens, Naomi E Kramer, Rasmus Larsen, Chris D den Hengst, Casper J Albers, Jan Kok, Oscar P Kuipers.
Abstract
BACKGROUND: In research laboratories using DNA-microarrays, usually a number of researchers perform experiments, each generating possible sources of error. There is a need for a quick and robust method to assess data quality and sources of errors in DNA-microarray experiments. To this end, a novel and cost-effective validation scheme was devised, implemented, and employed.Entities:
Mesh:
Year: 2005 PMID: 15907200 PMCID: PMC1166551 DOI: 10.1186/1471-2164-6-77
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The validation procedure. It consists of 4 steps: (i) cell culturing, (ii) cell pelleting and RNA isolation, (iii) cDNA labeling, and (iv) hybridization, scanning, image- and data analysis.
General statistics on data obtained from the validation experiments (Figure 1 and supplementary Table S1 [21]).
| Validation | Validation slide | 5 % low spot filter | 40 % low spot filter | |||
| 1 | 2 | 3 | CV (%) | Spots (%) | CV (%) | |
| A-I | x | x | x | 27.3 | 89.5 | 17.4 |
| A-I | x | 21.6 | 88.7 | 13.4 | ||
| A-I | x | 26.1 | 88.9 | 17.3 | ||
| A-I | x | 24.6 | 91.1 | 16.5 | ||
| A | x | x | x | 16.4 | 86 | 12.5 |
| B | x | x | x | 13 | 84.3 | 9.4 |
| C | x | x | x | 14 | 94.5 | 8.6 |
| D | x | x | x | 16.7 | 92.1 | 11.1 |
| E | x | x | x | 9.2 | 90.4 | 6.7 |
| F | x | x | x | 27.5 | 87.3 | 20.2 |
| G | x | x | x | 16.5 | 88.5 | 10.3 |
| H | x | x | x | 23.8 | 91 | 15.1 |
| I | x | x | x | 18.1 | 92.2 | 12.4 |
Figure 2Sammon projection of the clustering of validation data using a self-organizing Kohonen map. Validation experiments (A-I) are shown as well as the clusters (1 – 4; consisting of 761, 230, 227, 886 genes, respectively). Operon names, the number of members, and their (putative) functions are listed to the right of the corresponding clusters. The minimum number of genes in an operon of which all members should be in a certain cluster was determined at a probability of 0.02 or lower for clusters 1 (4 genes), 2 (2 genes), 3 (2 genes), and 4 (5 genes).
Figure 3Plot of percentage of maximal intensity versus CV values calculated for the expression levels of genes in the 9 validation datasets (dark-blue small squares). Purple solid triangles show the top 40 genes with highest variability in ratio and signals (supplementary Table S2 [21]). Functionally related genes showing validation experiment-dependent SDs (Figure 2) are indicated by cluster 1 (solid yellow circles), cluster 2 (open light-blue triangles), cluster 3 (open red squares), and cluster 4 (open green circles).
Figure 4ANOVA results. Each bar represents averages (with error bars signifying the standard deviations for the respective interactions) for 10 random samples of ratio data obtained for the indicated slide combinations (1, 2, and 3; Figure 1). E.g. S = 1v2 indicates a comparison of data from slides 1 with data from slides 2. The interactions (indicated by the colored bars as detailed in the inset) and "Error" (residual variance) amount to 100 % (the total variance present in the data).
Contribution of sources to the variance estimated for the nine validation experiments (Figure 4) and contribution of individual factors to the VG interactiona.
| Variance source | Contribution to the variance (%) |
| Gene ( | 5.0 |
| Dye ( | 4.2 |
| Gene × Arrayb | 7.8 |
| RNA isolation and labelingc | 1.5 |
| Sampling | 7.1 |
| 26.9 | |
| Day × Gene | 19.7e |
| Experimenter × Gene | 17.3e |
| Array batch × Gene | 14.9e |
| Spot pins × Gene | 4.5f |
a The degrees of freedom results in the separate ANOVAs are listed in the supplementary web-site [21].
b Assumed to consist of hybridization effects and signal-to-noise differences per slide.
c Derived from the variance observed between Ba and Bb cultures (Figure 1).
d Variances that are dependent on the validation experiment performed and due to day-to-day differences, identity of the experimenter, and DNA microarray batch differences.
e Due to overlap in levels, the contribution of these interactions were individually determined.
f A change from 8 to 12 spot-pins used for array spotting coincided with a switch in the RNA isolation method.