| Literature DB >> 18676452 |
Matthew N McCall1, Rafael A Irizarry.
Abstract
As the number of users of microarray technology continues to grow, so does the importance of platform assessments and comparisons. Spike-in experiments have been successfully used for internal technology assessments by microarray manufacturers and for comparisons of competing data analysis approaches. The microarray literature is saturated with statistical assessments based on spike-in experiment data. Unfortunately, the statistical assessments vary widely and are applicable only in specific cases. This has introduced confusion into the debate over best practices with regards to which platform, protocols and data analysis tools are best. Furthermore, cross-platform comparisons have proven difficult because reported concentrations are not comparable. In this article, we introduce two new spike-in experiments, present a novel statistical solution that enables cross-platform comparisons, and propose a comprehensive procedure for assessments based on spike-in experiments. The ideas are implemented in a user friendly Bioconductor package: spkTools. We demonstrated the utility of our tools by presenting the first spike-in-based comparison of the three major platforms--Affymetrix, Agilent and Illumina.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18676452 PMCID: PMC2553586 DOI: 10.1093/nar/gkn430
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Empirical densities. These plots depict the empirical density of the average (across arrays) expression values for the background RNA. The tick marks on the x-axis show the average expression at each nominal concentration. The dotted lines represent the cut points for low, medium and high ALE values (defined in text).
Nominal concentration to ALE mapping
| Reported nominal concentration | Average expression value | Proportion of genes below | ALE strata | SD |
|---|---|---|---|---|
| 0.125 | 5.1 | 0.35 | Low | 0.87 |
| 0.250 | 5.2 | 0.38 | Low | 0.90 |
| 0.500 | 5.3 | 0.40 | Low | 0.74 |
| 1.000 | 5.7 | 0.48 | Low | 0.72 |
| 2.000 | 6.4 | 0.62 | Med | 0.82 |
| 4.000 | 7.1 | 0.73 | Med | 0.79 |
| 8.000 | 7.8 | 0.82 | Med | 0.68 |
| 16.000 | 8.4 | 0.89 | Med | 0.67 |
| 32.000 | 9.3 | 0.94 | Med | 0.79 |
| 64.000 | 10.2 | 0.97 | Med | 0.72 |
| 128.000 | 11.2 | 0.99 | Med | 0.54 |
| 256.000 | 12.0 | 0.99 | High | 0.49 |
| 512.000 | 12.6 | 1.00 | High | 0.51 |
| Reported nominal concentration | ||||
| 0.20 | 4.6 | 0.34 | Low | 1.35 |
| 2.00 | 4.8 | 0.36 | Low | 1.40 |
| 20.00 | 6.3 | 0.49 | Low | 0.63 |
| 200.00 | 9.1 | 0.73 | Med | 0.47 |
| 666.67 | 10.9 | 0.86 | Med | 0.36 |
| 2000.00 | 12.5 | 0.94 | Med | 0.3 |
| 6666.67 | 14.2 | 0.97 | Med | 0.21 |
| 20000.00 | 15.6 | 0.99 | Med | 0.16 |
| 66666.67 | 17.1 | 1.00 | High | 0.31 |
| 200000.00 | 18.0 | 1.00 | High | 0.25 |
| Reported nominal concentration | ||||
| 0.01 | 2.9 | 0.38 | Low | 0.39 |
| 0.03 | 3.0 | 0.49 | Low | 0.46 |
| 0.10 | 3.9 | 0.68 | Med | 0.83 |
| 0.30 | 5.4 | 0.76 | Med | 1.51 |
| 1.00 | 7.5 | 0.85 | Med | 1.26 |
| 3.00 | 9.5 | 0.94 | Med | 1.05 |
| 10.00 | 11.4 | 0.98 | Med | 1.01 |
| 30.00 | 12.8 | 1.00 | High | 0.81 |
| 100.00 | 14.1 | 1.00 | High | 0.65 |
| 300.00 | 14.8 | 1.00 | High | 0.34 |
| 1000.00 | 15.0 | 1.00 | High | 0.26 |
This table contains summary measures specific to each nominal spike-in level. The first column shows the nominal concentrations as originally reported. The second column shows the average of all observed expression values associated with the row's; nominal concentration. The third column shows the proportion of background RNA with expression values less than the average expression value. The fourth column shows the ALE strata (defined in text) associated with the row's; nominal concentration. Finally, the fifth column shows the SD of all observed expression values associated with the row's; nominal concentration.
Description of data sets
| Affymetrix | Agilent | Illumina | |
|---|---|---|---|
| Background RNA | HeLa complex cRNA | Human Osteosarcoma (MG-63) purchased from Ambion (Cat. # 7868) | Human Liver purchased from Ambion (Cat. # 7960) |
| Spike-in production | 30 cDNA clones isolated from a lymphoblast cell line, eight artificially engineered, four eukaryotic controls from the polyA spike control kit | ||
| Background correction | RMA | Background subtraction and spatial-detrending | Local background subtraction |
| Normalization | RMA | Normalized to the 75th percentile on each microarray | Quantile normalization |
This table gives a brief comparison of the three data sets used in this analysis.
Figure 2.Observed versus nominal values. For each of the three platforms, expression values are plotted against the log (base 2) of the reported nominal concentration. The regression slope obtained utilizing all the data and the regression slopes obtain within each ALE value strata are shown. The slope of each line is reported in the legend. The vertical lines divide the ALE strata.
Assessment results
| Accuracy | Precision | Performance | ||||
|---|---|---|---|---|---|---|
| Platform | Slope (SD) | SD | 99.5% | SNR | POT | GNN |
| Affymetrix (RMA) | 0.20 (0.31) | 0.10 | 0.36 | 2.00 | 0.30 | 13 |
| Affymetrix (MAS5) | 0.58 (0.73) | 0.64 | 3.54 | 0.91 | 0.00 | 581 |
| Agilent | 0.26 (0.90) | 0.40 | 2.74 | 0.65 | 0.00 | 246 |
| Illumina | 0.11 (0.39) | 0.35 | 1.18 | 0.31 | 0.00 | 506 |
| Affymetrix (RMA) | 0.79 (0.35) | 0.09 | 0.40 | 8.78 | 0.87 | 19 |
| Affymetrix (MAS5) | 0.80 (0.38) | 0.18 | 0.95 | 4.44 | 0.35 | 24 |
| Agilent | 0.99 (0.17) | 0.11 | 0.86 | 9.00 | 0.78 | 20 |
| Illumina | 1.15 (0.37) | 0.25 | 1.48 | 4.60 | 0.19 | 25 |
| Affymetrix (RMA) | 0.57 (0.15) | 0.06 | 0.22 | 9.50 | 0.99 | 10 |
| Affymetrix (MAS5) | 0.48 (0.19) | 0.13 | 0.42 | 3.69 | 0.62 | 10 |
| Agilent | 0.61 (0.29) | 0.10 | 0.38 | 6.10 | 0.79 | 10 |
| Illumina | 0.42 (0.32) | 0.15 | 0.62 | 2.80 | 0.27 | 15 |
For each of the ALE strata, we report summary assessments for accuracy, precision and overall performance. The first column shows the signal detection slope, which can be interpreted as the expected observed difference when the true difference is a fold change of 2. In parenthesis is the SD of the log-ratios associated with nonzero nominal log-ratios. The second column shows the standard deviation of null log-ratios. The SD can be interpreted as the expected range of observed log-ratios for genes that are not differentially expressed. The third column shows the 99.5th percentile of the null distribution. It can be interpreted as the expected minimum value that the top 100 nondifferentially expressed genes will reach. The fourth column shows the ratio of the values in column 1 and column 2. It is a rough measure of signal to noise ratio. The fifth column shows the probability that, when comparing two samples, a gene with a true log-fold change of 2 will appear in a list of the 100 genes with the highest log-ratios. The sixth column shows the size of gene list necessary to obtain 10 true positives when one considers a list of genes with the highest fold change.
Figure 3.Log-ratio distributions. These plots depict the distribution of observed log ratios for various nominal fold changes. In each case, the log ratios are stratified by the ALE values into which the two nominal concentrations fall. For example, HL means that one fell in the high stratum and one fell in the medium stratum. The null distributions' log-ratios are divided into background RNA (Bg-Null) and spike-ins at the same nominal concentration (S-Null), for each bin. The dotted horizontal lines represent the expected or nominal log-ratios: zero for the null distribution and Δ for the other comparisons (Δ =log2 4 for Affymetrix and Δ =log2 3 for Agilent and Illumina).
ANOVA results
| Platform | Affymetrix (RMA) | Affymetrix (MAS5) | Agilent | Illumina |
|---|---|---|---|---|
| Concentration effect | 2.48 | 2.77 | 4.53 | 2.19 |
| Probe effect | 0.54 | 0.55 | 0.44 | 0.38 |
| Array effect | 0.17 | 0.17 | 0.19 | NA |
| Measurement error | 0.47 | 0.72 | 0.69 | 0.54 |
| Probe imbalance | 0 | 0 | 3.60 | 0 |
| Array imbalance | 0 | 0 | 0 | 1059.67 |
To understand the variability contributed by differences in nominal concentrations, probe effect and array, we fitted a three-way ANOVA model containing only main effects to the expression values from the spike-in transcripts. The estimated SD of each effect is shown in the first three rows. The fourth row shows the SD of the error term. Finally, a measure of the amount of confounding between nominal concentration and the other two effects is included in rows five and six. We use the measure presented by Wu (17). An optimal design, such as a Latin Square, will have a measure of 0 for each imbalance. The more confounding the larger these values. Note, the large imbalance due to array in the Illumina design. In this experiment array and nominal concentration were completely confounded. However, because the array effect is small (the arrays are normalized) this was not as much of a problem. In the Agilent experiment, there is a small amount of confounding between probe and concentration because a Latin Square design was used with a single concentration/gene combination missing.