| Literature DB >> 18205949 |
Abstract
BACKGROUND: For 2-dye microarray platforms, some missing values may arise from an un-measurably low RNA expression in one channel only. Information of such "one-channel depletion" is so far not included in algorithms for imputation of missing values.Entities:
Mesh:
Year: 2008 PMID: 18205949 PMCID: PMC2267710 DOI: 10.1186/1471-2164-9-25
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Mean deviation and mean squared deviation. For five datasets and three categories of missing values, imputed values using WeNNI are compared to duplicate controls. The first columns show the number of spots (N), the uncorrected mean squared deviation (msd, defined in eq. 3) and the uncorrected mean deviation (md, defined in eq. 4) relative to random expectation, . The sd/rd spots are systematically over/under-estimated, with a deviation much larger than random expectation. For the other missing spots, the deviation varies in sign and is more comparable to random expectation. For sd and rd spots, validation results of a constant correction are also shown, revealing mean deviations well within random expectation, except for the MEC tumours dataset (but there it is noticeably lower than the uncorrected result). The msd is reduced compared to the uncorrected result. The relative reduction is reported in the rightmost column.
| Un-corrected | Corrected (validation result) | |||||
| Dataset | msd | msd | msd decrease | |||
| breast cancer | ||||||
| sd | 11733 | 0.74 | 30 | 0.02 | 0.69 | 6.1% |
| rd | 48523 | 1.02 | -100 | 0.3 | 0.81 | 21% |
| other | 6865 | 0.64 | -8 | |||
| lymphoma | ||||||
| sd | 1707 | 1.52 | 30 | 1 | 0.74 | 51% |
| rd | 6541 | 1.23 | -50 | -0.3 | 0.73 | 40% |
| other | 41499 | 0.54 | -10 | |||
| MEC cell lines | ||||||
| sd | 4284 | 0.98 | 20 | -0.3 | 0.86 | 12% |
| rd | 942 | 1.33 | -10 | 0.2 | 1.14 | 14% |
| other | 15584 | 0.27 | -1 | |||
| MEC tumours | ||||||
| sd | 12899 | 1.24 | 50 | -8 | 1.00 | 19% |
| rd | 3376 | 1.87 | -20 | -1 | 1.60 | 15% |
| other | 21616 | 0.52 | 8 | |||
| melanoma | ||||||
| sd | 10498 | 1.92 | 10 | 0.2 | 1.89 | 1.6% |
| rd | 68503 | 1.99 | -100 | 0.1 | 1.72 | 14% |
| other | 4754 | 2.09 | -4 | |||
Figure 1Pearson scores after different imputation methods. Pearson scores for clinical outcome in the lymphoma dataset. The Pearson scores calculated with and without correction is shown on the x- and y-axis, respectively. The diagonal line is inserted to guide the eye. There is no significant change in the distribution of score magnitudes.
Figure 2Reporter standard deviation after different imputation methods. Reporter standard deviation in the lymphoma dataset. The standard deviation of expression, calculated with and without correction, is shown on the x- and y-axis, respectively. The diagonal line is inserted to guide the eye. The corrected imputation method results in larger variations of expression.