| Literature DB >> 17002788 |
Katrin Hoffmann1, Martin J Firth, Alex H Beesley, Nicholas H de Klerk, Ursula R Kees.
Abstract
BACKGROUND: Recent findings from microarray studies have raised the prospect of a standardized diagnostic gene expression platform to enhance accurate diagnosis and risk stratification in paediatric acute lymphoblastic leukaemia (ALL). However, the robustness as well as the format for such a diagnostic test remains to be determined. As a step towards clinical application of these findings, we have systematically analyzed a published ALL microarray data set using Robust Multi-array Analysis (RMA) and Random Forest (RF).Entities:
Mesh:
Year: 2006 PMID: 17002788 PMCID: PMC1609180 DOI: 10.1186/1471-2407-6-229
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Comparison of prediction accuracies achieved by RMA/RF and ANN (Ross et al., 2003).
| RF | RF | ANN | |
| 104 | 25 | 25 | |
| 86.7 % (15)* | 89 % (4)* | 87.5 % (4)* | |
| 100 % (18) | 100 % (5) | 100 % (5) | |
| 100 % (17) | 99.8 % (4) | 95 % (4) | |
| 100 % (20) | 100 % (5) | 100 % (5) | |
| 100 % (14) | 100 % (2) | 100 % (2) | |
| 100 % (20) | 100 % (5) | 96 % (5) | |
| 98.1 % | 98.2 % | 96.4 % | |
*By cytogenetic analysis both cases that led to apparent inaccuracies are known to show the presence of BCR-ABL and >50 chromosomes.
‡ Average prediction accuracies from 100 independent analyses. For each analysis a new training and test set was chosen and discriminating probe sets were selected using the new training set.
† Average prediction accuracies from 10 independent analyses. For each analysis a new training and test set was chosen and discriminating probe sets were selected using the new training set.
Figure 1ALL subtype distinction based on discriminating genes identified by RMA/RF. Gene expression profiles from 104 paediatric ALL specimens were analyzed using unsupervised Principle Component Analysis (PCA). Shown are three-dimensional scatter plots of all cases using PCA with the top discriminating probe sets identified by RMA/RF. (A) Three-dimensional scatter plot of a PCA using the top 20 subgroup-discriminating probe sets (120 probe sets). (B) Three-dimensional scatter plot of a PCA using the top 5 subgroup-discriminating probe sets (30 probe sets). Arrows mark two BCR-ABL-expressing samples known to contain a BCR-ABL translocation and a hyperdiploid (>50 chromosomes) karyotype.
Figure 2A small set of genes accurately predicts six ALL subgroups. Comparison of average prediction accuracies obtained with the top 20, top 15, top 10, top 5 and top 2 subgroup-discriminating probe sets identified by RMA/RF after cross validation (see Supplemental Document, Table S3). For each cross validation analysis (n = 100) a new training (n = 79) and test set (n = 25) was chosen and discriminating probes sets were selected using the new training set. Shown are prediction accuracies for individual ALL subgroups (main panel) and the overall prediction accuracies (panel on the right).
Top 30 probe sets identified by RMA/RF that are required for accurate classification.
| 201906_s_at | 1 | 17 | Above | 2.5 | ||
| 209365_s_at* | 2 | 3 | Above | 5.0 | ||
| 210830_s_at* | 3 | 2 | Above | 5.7 | ||
| 202123_s_at | 4 | 76 | Above | 2.0 | ||
| 222154_s_at | 5 | 13 | Above | 6.9 | ||
| 212148_at* | 1 | 4 | Above | 29.6 | ||
| 221113_s_at | 2 | 18 | Above | 6.2 | ||
| 205253_at* | 3 | 3 | Above | 36.1 | ||
| 230306_at• | 4 | 11 | Above | 5.9 | ||
| 225483_at• | 5 | 8 | Above | 8.2 | ||
| Hyperdipl.>50 | 200659_s_at | 1 | - | Above | 2.8 | |
| 216071_x_at* | 2 | 1 | Above | 1.7 | ||
| 208598_s_at | 3 | 12 | Above | 1.6 | ||
| 226875_at• | 4 | 35 | Above | 2.3 | ||
| 200057_s_at | 5 | - | Above | 1.5 | ||
| 226939_at*• | 1 | 1 | Above | 5 | ||
| 219463_at* | 2 | 2 | Above | 15.2 | ||
| 204069_at* | 3 | 3 | Above | 10 | ||
| 218847_at | 4 | 24 | Above | 7 | ||
| 201152_s_at | 5 | 23 | Above | 2.2 | ||
| T-ALL | 213539_at | 1 | 17 | Above | 28.3 | |
| 227646_at• | 2 | 23 | Below | 24.6 | ||
| 217147_s_at | 3 | 20 | Above | 10.1 | ||
| 229487_at• | 4 | 24 | Below | 20.4 | ||
| 50221_at | 5 | - | Below | 2.2 | ||
| 206231_at | 1 | 15 | Above | 3.2 | ||
| 241505_at*• | EST | 2 | 4 | Above | 5.8 | |
| 206033_s_at | 3 | 12 | Above | 8.8 | ||
| 214110_s_at | EST | 4 | - | Above | 2.2 | |
| 206032_at | 5 | 11 | Above | 4.8 |
† The ranking refers to the top 100 subgroup-discriminating probe sets identified by Ross et al. using the parallel analysis format.
* Probe sets/genes identified as the top 5 subgroup-discriminators in both analyses, RMA/RF and Ross et al. (parallel format).
• Probe sets that are represented on the HG-U133B array.
Analysis of independent ALL patient cohort (n = 68). Confirmation of discriminators identified by RMA/RF using HG-U133A array data.
| 3 | 3 | 3 | 100% | 100% | |
| 3 | 3 | 3 | 100% | 100% | |
| 17 | 13 | 11 | 76.5% | 64.7% | |
| 7 | 7 | 6 | 100% | 85.7% | |
| 37 | 36 | 37 | 97.3% | 100% | |
| 1 | 1 | 1 | 100% | 100% | |
| 68 | 63 | 61 | 92.6% | 89.7% | |
* Established by conventional cytogenetic and immunophenotype analysis.