| Literature DB >> 12537562 |
Alena A Antipova1, Pablo Tamayo, Todd R Golub.
Abstract
BACKGROUND: One of the factors limiting the number of genes that can be analyzed on high-density oligonucleotide arrays is that each transcript is probed by multiple oligonucleotide probes. To reduce the number of probes required for each gene, a systematic approach to choosing the most representative probes is needed. A method is presented for reducing the number of probes per gene while maximizing the fidelity to the original array design.Entities:
Mesh:
Substances:
Year: 2002 PMID: 12537562 PMCID: PMC151175 DOI: 10.1186/gb-2002-3-12-research0073
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Schema for selection and evaluation of the single probe pair (Δh) that best approximates the Average Difference value derived from all 20 probe pairs. PM, perfect match; MM, mismatch. Δ = PM - MM.
Figure 2Comparison of Average Difference values with Δh and randomly selected Δs. For each of the datasets shown, the proportion of genes whose Δh value is within twofold of the Average Difference is shown by the black bars. The same comparison is shown for random s (gray bars). Error bars indicate standard deviation. Standard deviation shown reflects variations in the percentage of genes within twofold of the Average Difference between the 176 chips of the training set. Note that the Δhs better approximate the Average Difference compared to randomly selected Δ s.
Classification accuracy using Average Difference, randomly selected Δs, and Δh values
| Error rate | |||||
| Dataset | Classification problem | Δh (%) | Random Δs (%) | Average Difference (%) | |
| Leukemia (set A) | 35 | ALL vs AML | 3 | 2 ± 1 | 3 |
| Leukemia (set B) | 23 | T-ALL vs B-ALL | 0 | 0 ± 1 | 0 |
| Lymphoma | 58 | Cured vs fatal | 26 | 29 ± 5 | 24 |
| Medulloblastoma | 60 | Cured vs fatal | 18 | 26 ± 4 | 24 |