| Literature DB >> 17288599 |
Rickard Sandberg1, Ola Larsson.
Abstract
BACKGROUND: Microarrays enable high throughput detection of transcript expression levels. Different investigators have recently introduced updated probe set definitions to more accurately map probes to our current knowledge of genes and transcripts.Entities:
Mesh:
Year: 2007 PMID: 17288599 PMCID: PMC1805763 DOI: 10.1186/1471-2105-8-48
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Improved precision using update probe set definitions
| -0.014 | 0.013 | 0.013 | 0.014 | 0.019 | 0.023 | |
| -0.035 | 0.053 | 0.030 | 0.059 | 0.045 | 0.047 | |
| -0.11 | 0.023 | -0.0063 | 0.040 | 0.031 | 0.028 |
We compared the precision in microarray experiments when using the updated probe set definitions in comparison with the original probe set definition (netAffx). We measured the precision as the Pearson correlation between two log2 ratios of expression levels comparing duplicate sample-vs-control measurements. The measurement indicates how well a fold change can be reproduced. Data from five different data sets generated using identical samples were averaged (mean) The table shows differences in means for updated probe set definitions compared to the original (NetAffx) definitions across the five different labs together with the p-value assessing the difference in means between each updated probe set definition compared to the original probe set definition (NetAffx) calculated using a paired t-test (two-tailed distribution). Three methods to generate expression estimates (Mas5, RMA and GCRMA) were compared.
Characteristics of probe set definitions
| NetAffx (original) | 22283 | 11.1 |
| ensEMBL exon | 35191 | 9.3 |
| ensEMBL gene | 18671 | 14.0 |
| ensEMBL transcript | 36174 | 13.9 |
| Entrez | 12132 | 14.1 |
| RefSeq | 17880 | 14.9 |
| UniGene | 11694 | 15.0 |
The table shows the number of probe sets together whith the average number of probes within each probeset obtained using each probe set definition for the HG-U133A platform.
Figure 1Improved precision using updated probe set definitions. (A) We compared the precision in microarray experiments when using the updated probe set definitions in comparison with the original probe set definition (netAffx). We measured the precision as the Pearson correlation between two log2 ratios of expression levels generated by comparing duplicate sample-vs-control comparisons. The measurement indicates how well a fold change can be reproduced. Data from five different data sets generated using identical samples (data normalized using RMA) were averaged (mean) [see Additional file 1] for similar figures for each laboratory independently). The x-axis shows the number of probes integrated into each probe set. We grouped the number of probes into intervals of three in order to obtain a sufficiently large number of probe sets to get a robust estimate of each data point. The y-axis shows the mean correlation/precision across the five labs for each interval. (B) Visualizing the number of probe sets in each interval between the different probe set definitions.
Improved accuracy using updated probe set definitions
| 0.74 | 0.02 | ||
| ensEMBL gene | 0.83 | 0.00040 | 0.01 |
| ensEMBL transcript | 0.83 | 0.00053 | 0.01 |
| Entrez | 0.78 | 0.00430 | 0.01 |
| RefSeq | 0.78 | 0.00381 | 0.01 |
| UniGene | 0.75 | 0.07085 | 0.01 |
The accuracy between measured log2 ratios using RT-PCR and microarrays were compared for microarray measurements using the original (NetAffx) and updated probe set definitions across 16 genes. The accuracy was obtained by calculating the slope from a linear regression comparing the obtained log2 ratios estimates between the sample groups from the microarray to those estimates obtained using RT-PCR, where the microarray estimates were obtained using the different probe set definitions. A slope of 1 would indicate perfect accuracy.
The table shows the mean of the obtained accuracy across the five labs. We performed a t-test to assess the difference in means of accuracy between the updated probe set definitions and the original probe set definition (NetAffx); p-values were calculated using a paired t-test (two-tailed distribution). The standard deviation for the mean is also shown. The data represents an assessment of RMA normalized data.
Figure 2Improved accuracy using updated probe set definitions. The accuracy between measured log2 ratios using RT-PCR and microarrays were compared for microarray measurements using the original (NetAffx) and updated probe set definitions. The accuracy was obtained by calculating the slope from a linear regression comparing the obtained log2 ratios from the microarray to those estimates obtained using RT-PCR, where the microarray estimates were obtained using the different probe set definitions. A slope of 1 would indicate perfect accuracy. The mean accuracy across five different labs is plotted together with the standard deviations.