| Literature DB >> 16237126 |
Michael Barnes1, Johannes Freudenberg, Susan Thompson, Bruce Aronow, Paul Pavlidis.
Abstract
The growth in popularity of RNA expression microarrays has been accompanied by concerns about the reliability of the data especially when comparing between different platforms. Here, we present an evaluation of the reproducibility of microarray results using two platforms, Affymetrix GeneChips and Illumina BeadArrays. The study design is based on a dilution series of two human tissues (blood and placenta), tested in duplicate on each platform. The results of a comparison between the platforms indicate very high agreement, particularly for genes which are predicted to be differentially expressed between the two tissues. Agreement was strongly correlated with the level of expression of a gene. Concordance was also improved when probes on the two platforms could be identified as being likely to target the same set of transcripts of a given gene. These results shed light on the causes or failures of agreement across microarray platforms. The set of probes we found to be most highly reproducible can be used by others to help increase confidence in analyses of other data sets using these platforms.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16237126 PMCID: PMC1258170 DOI: 10.1093/nar/gki890
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Schematic representation of the experimental design. See Materials and Methods for details. The key features of the design are the use of a single pair of RNA samples for all analyses, mixed together in varying proportions and analyzed in technical replicates on each platform. Unlike the Affymetrix platform, each Illumina BeadArray slide contains multiple arrays, allowing us to analyze a complete dilution series on one slide. Note that the BeadArray slides actually contain eight arrays per slide, but we only used six for the data described here.
Summary statistics for array designs studied, comparing two annotation methods
| Array | Probes or probe sets | Probes for ‘known’ genes | Known genes assayed | Unassigned probes | Probes/gene (average) |
|---|---|---|---|---|---|
| Illumina (Mfg) | 24 114 | 17 143 | 14 420 | 6971 | 1.19 |
| Illumina (BLAT) | 19 924 | 16 847 | 2978 | 1.18 | |
| Affymetrix (Mfg) | 54 675 | 34 089 | 16 610 | 20 586 | 2.05 |
| Affymetrix (BLAT) | 33 922 | 18 417 | 20 600 | 1.84 |
aMfg, Manufacturer's annotations; BLAT, our own annotations computed using BLAT alignments to the genomic sequence.
b‘Known genes’ are genes identified in the GoldenPath ‘refGene’ or ‘knownGene’ tables, including transcript information from the ‘all_mrna’ table to determine exon overlaps. We designate all other potential transcripts ‘unassigned probes’.
cIncludes probes not yielding BLAT results.
dIncludes 14 334 probes where no gene name is listed by the manufacturer.
Figure 2Hierarchical clustering results of all 36 024 comparable pairs of probes. The dilution step is shown as a graph at the top of the figure (Blood/Placenta). Black bars at the side indicate large clusters of genes that appear to show clear dilution effects in both platforms. Gray bars indicate examples of clusters that appear to show dilution effects in one platform but not consistently in the other. Lighter colors indicate higher relative levels of expression on an arbitrary scale. Note that in this figure, if a gene occurs multiple times on one platform, it is shown in all possible valid comparisons with matching probes on the other platform.
Figure 3Distributions of correlations for the Illumina HumanRef-8 BeadArrays (A) and the Affymetrix HG-U133 Plus 2 arrays (B). Correlations near −1 and 1 reflect probes whose targets are differentially expressed between the two samples. Correlations near zero reflect probes which do not show a dilution effect. The dilution effect is more pronounced for probes targeted at ‘known’ genes. This effect is stronger for Illumina (C) than for Affymetrix (D), though the Illumina platform has fewer probes which cannot be assigned to known genes (Table 1). For complete data see the Supplementary Data.
Figure 4Distributions of correlations stratified by high and low expression levels (log2) for the Illumina HumanRef-8 BeadArrays (A) and the Affymetrix HG-U133 Plus 2.0 arrays (B). On both platforms, the probes not showing dilution effects tend to express at low levels, whereas highly expressed probes show strong dilution effects. For complete data see the Supplementary Data.
Figure 5Cross-platform agreement for all ‘known’ genes (A), stratified by differential expression (B) and for placenta/blood specific genes (C). For complete data see the Supplementary Data.
Figure 6Cross-platform agreement measured by the rank correlation of expression levels as a function of expression level (log2) (A) and distance between probes in base pairs (B). For complete data see the Supplementary Data.