| Literature DB >> 15876355 |
David Petersen1, G V R Chandramouli, Joel Geoghegan, Joanne Hilburn, Jonathon Paarlberg, Chang Hee Kim, David Munroe, Lisa Gangi, Jing Han, Raj Puri, Lou Staudt, John Weinstein, J Carl Barrett, Jeffrey Green, Ernest S Kawasaki.
Abstract
BACKGROUND: Microarrays for the analysis of gene expression are of three different types: short oligonucleotide (25-30 base), long oligonucleotide (50-80 base), and cDNA (highly variable in length). The short oligonucleotide and cDNA arrays have been the mainstay of expression analysis to date, but long oligonucleotide platforms are gaining in popularity and will probably replace cDNA arrays. As part of a validation study for the long oligonucleotide arrays, we compared and contrasted expression profiles from the three formats, testing RNA from six different cell lines against a universal reference standard.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15876355 PMCID: PMC1140753 DOI: 10.1186/1471-2164-6-63
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Overlapping gene sets represented in 3 microarray platforms
| UniGem2 & Operon V2 | UniGEM2 & HG-U133A | Operon V2 & HG-U133A | |
| Genes in common | 7082 | 6593 | 12999 |
| Genes in common in all arrays | 6430 | 6430 | 6430 |
Figure 1Venn diagram with number of genes present in each platform, genes in common between platforms, and genes in common among all three platforms.
Proportion of genes expressed in same direction between platforms ignoring expression level. The numbers of genes in matched and mismatched directions are given for each cell line and each platform pair, and agreement is given in percentages.
| GEM2 & Operon V2 | Jurkat | 4597 | 1373 | 5970 | 77 % |
| L428 | 3535 | 1251 | 4786 | 74 % | |
| SUDHL | 4608 | 1525 | 6133 | 75 % | |
| Ocily3 | 4094 | 1368 | 5462 | 75 % | |
| LNCaP | 4189 | 1333 | 5522 | 76 % | |
| MCF10A | 4469 | 1429 | 5898 | 76 % | |
| Total | 25492 | 8279 | 33771 | 75 % | |
| GEM2 & HG-U133A | Jurkat | 3657 | 906 | 4563 | 80 % |
| L428 | 3203 | 918 | 4121 | 78 % | |
| SUDHL | 3541 | 809 | 4350 | 81 % | |
| Ocily3 | 3826 | 984 | 4810 | 80% | |
| LNCaP | 3747 | 1031 | 4778 | 78 % | |
| MCF10A | 3584 | 950 | 4534 | 79 % | |
| Total | 21558 | 5598 | 27156 | 79 % | |
| Operon V2 & HG-U133A | Jurkat | 3688 | 795 | 4483 | 82 % |
| L428 | 2909 | 738 | 3647 | 80 % | |
| SUDHL | 3520 | 776 | 4296 | 82 % | |
| Ocily3 | 3509 | 947 | 4456 | 79 % | |
| LNCaP | 3607 | 860 | 4467 | 81 % | |
| MCF10A | 3511 | 898 | 4409 | 80 % | |
| Total | 20744 | 5014 | 25758 | 81 % | |
Concordance between platforms using statistically significant expression ratios at p-value < 0.05 and at 1.5 and 2-fold threshold levels.1
| Jurkat | 1471 | 1678 | 1740 | 1096 | 1088 | 1220 | 588 | 614 | 749 |
| L428 | 651 | 1394 | 750 | 580 | 1051 | 632 | 430 | 678 | 469 |
| SUDHL | 1400 | 1527 | 1707 | 1081 | 1177 | 1277 | 698 | 741 | 831 |
| OCI-Ly3 | 1484 | 1834 | 1698 | 1285 | 1428 | 1333 | 814 | 900 | 903 |
| LNCaP | 1167 | 1170 | 2062 | 737 | 751 | 1252 | 369 | 404 | 696 |
| MCF10A | 1555 | 1429 | 1880 | 899 | 812 | 1118 | 405 | 409 | 559 |
| Jurkat | 97% | 96% | 98% | 98% | 99% | 100% | 99% | 99% | 100% |
| L428 | 96% | 96% | 98% | 98% | 98% | 99% | 99% | 99% | 100% |
| SUDHL | 95% | 97% | 98% | 97% | 98% | 99% | 99% | 99% | 100% |
| OCI-Ly3 | 94% | 96% | 96% | 95% | 98% | 98% | 98% | 99% | 99% |
| LNCaP | 93% | 95% | 96% | 97% | 98% | 99% | 99% | 100% | 100% |
| MCF10A | 94% | 95% | 96% | 98% | 98% | 99% | 99% | 99% | 100% |
| 7728 | 9032 | 9837 | 5678 | 6307 | 6832 | 3304 | 3746 | 4207 | |
| 7334 | 8650 | 9540 | 5509 | 6191 | 6755 | 3257 | 3719 | 4190 | |
| 94.9% | 95.8% | 97.0% | 97.0% | 98.2% | 98.9% | 98.6% | 99.3% | 99.6% | |
| 54.0% | 57.5% | 60.4% | 39.7% | 40.2% | 42.0% | 23.1% | 23.9% | 25.8% | |
1GEM2: Incyte GEM2 arrays; Operon: Operon V2 arrays; U133A: Affymetrix Human Genome U 133A GeneChip arrays.
2Comparisons between two platforms were done only when the expressions of both platforms met the p-value and threshold criteria.
3Fraction of common expressions between two platforms to the total number of expressions found at p < 0.05 in both platforms.
Figure 2a-f. Scatter plot analysis to determine correlation coefficients between and within platforms using Jurkat RNA as an example. Correlations for all cell lines are given in Table 4. (a) Operon versus Incyte (b) Affymetrix versus Incyte (c) Affymetrix versus Operon (d) GEM2 versus GEM2 replicate correlation (e) Operon versus Operon (f) HG-U133A versus HG-U133A
Cross platform correlation coefficients. Pearson correlation coefficients are given for each platform pair and cell line, calculated over the genes common to all platforms. Values were obtained from scatter plots as shown in Figure 2a-c.
| Incyte/Operon | 0.727 | 0.707 | 0.708 | 0.724 | 0.710 | 0.708 | 0.709 |
| Incyte/Affy | 0.767 | 0.777 | 0.781 | 0.780 | 0.741 | 0.744 | 0.772 |
| Operon/Affy | 0.813 | 0.784 | 0.783 | 0.790 | 0.796 | 0.782 | 0.787 |
Median correlation coefficients of replicates within same platform. Calculations and values obtained as in Table 4.
| GEM2 | 0.859 | 0.919 | 0.902 | 0.873 | 0.827 | 0.845 |
| Operon V2 | 0.916 | 0.805 | 0.849 | 0.828 | 0.951 | 0.928 |
| HG-U133A | 0.956 | 0.957 | 0.960 | 0.952 | 0.963 | 0.954 |
Figure 3Principal Component Analysis (PCA) of the three microarray platforms and six cell lines using expression of the 3186 genes with signals above background.
Figure 4a-c. Correlation of correlations of platforms for all cell lines. Correlation values R for each pair of platforms are given in the figures. (a) Operon versus Incyte (b) Affymetrix versus Incyte (c) Affymetrix versus Operon.
Figure 5a-b. Clustered image maps showing patterns of expression relationship among genes, platforms, and cell lines. The axes were ordered by hierarchical clustering using an uncentered correlation and the average linkage algorithm for 909 genes expressed at a two-fold or greater level in at least two of the six cell lines. (a) Clustering of all 909 genes (b) A subcluster of 41 genes to show correct clustering and congruence of expression values. As indicated by the cluster trees, all three platforms gave essentially the same relationships among the six cell lines.
Figure 6Quantitative real-time RT-PCR analysis of 12 genes matched for direction of expression relative to the reference RNA for all three platforms. Log2 ratios are given in table below the graph. This example is a comparison between LnCaP and MCF-10A.
Figure 7a-f. Quantitative RT-PCR analysis of 10 mismatched genes in the six cells lines for all three platforms. (a) MCF10A, (b) LnCaP, (c) OCI-Ly3, (d) Jurkat, (e) SUDHL-6 and (f) L428.