| Literature DB >> 22166133 |
Minjun Chen1, Leming Shi, Reagan Kelly, Roger Perkins, Hong Fang, Weida Tong.
Abstract
BACKGROUND: Genomic biomarkers play an increasing role in both preclinical and clinical application. Development of genomic biomarkers with microarrays is an area of intensive investigation. However, despite sustained and continuing effort, developing microarray-based predictive models (i.e., genomics biomarkers) capable of reliable prediction for an observed or measured outcome (i.e., endpoint) of unknown samples in preclinical and clinical practice remains a considerable challenge. No straightforward guidelines exist for selecting a single model that will perform best when presented with unknown samples. In the second phase of the MicroArray Quality Control (MAQC-II) project, 36 analysis teams produced a large number of models for 13 preclinical and clinical endpoints. Before external validation was performed, each team nominated one model per endpoint (referred to here as 'nominated models') from which MAQC-II experts selected 13 'candidate models' to represent the best model for each endpoint. Both the nominated and candidate models from MAQC-II provide benchmarks to assess other methodologies for developing microarray-based predictive models.Entities:
Mesh:
Year: 2011 PMID: 22166133 PMCID: PMC3236846 DOI: 10.1186/1471-2105-12-S10-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The datasets used in MAQC-II project.
| Endpoint code | Endpoint | Endpoint description | Training set | Validation set | ||
|---|---|---|---|---|---|---|
| #Sample | P/N ratio* | #Sample | P/N ratio | |||
| A | Lung tumorigenicity | Lung tumorigen vs. non-tumorigen | 70 | 0.59 | 88 | 0.47 |
| B | Non-genotoxicity | Non-genotoxic hepatocarcinogen vs. non-carcinogen | 216 | 0.51 | 201 | 0.4 |
| C | Liver toxicity | Liver toxicants vs. non-toxicants | 214 | 0.58 | 204 | 0.62 |
| D | Breast cancer | Pathologic complete response, pCR | 130 | 0.34 | 100 | 0.18 |
| E | Breast cancer | Estrogen receptor status (ER +/-) | 130 | 1.6 | 100 | 1.56 |
| F | Multiple myeloma | Overall survival | 340 | 0.18 | 214 | 0.14 |
| G | Multiple myeloma | Event-free survival | 340 | 0.33 | 214 | 0.19 |
| H | Multiple myeloma | Male vs. female (positive control) | 340 | 1.33 | 214 | 1.89 |
| I | Multiple myeloma | Random 2-class label (negative control) | 340 | 1.43 | 214 | 1.33 |
| J | Neuroblastoma | Overall survival | 238 | 0.1 | 177 | 0.28 |
| K | Neuroblastoma | Event-free survival | 239 | 0.26 | 193 | 0.75 |
| L | Neuroblastoma | Male vs. female (positive control) | 246 | 1.44 | 231 | 1.36 |
| M | Neuroblastoma | Random 2-class label (negative control) | 246 | 1.44 | 253 | 1.36 |
* P/N = Positive/Negative ratio. Positive denotes for these samples showing the positive results (e.g. cancer, tumor).
Figure 1Overview of the NCTR model development process.
Figure 2The ensemble models vs. the NCTR nominated models. A pair-wise t-test was applied to the MCCs obtained from the ensemble models and the NCTR nominated models. (P-value = 0.039 if two random endpoints, i.e., I and M, were excluded).
Figure 3The ensemble models and the NCTR nominated models related to all the NCTR developed models. The distribution of the cross-validation MCCs from 8320 NCTR developed models for each endpoint was shown in the box plots; the NCTR nominated models were marked as the green diamonds, and the ensemble models were marked as the red squares.
Figure 4The ensemble models vs. MAQC-II candidate models. A pair-wise t-test was applied to the MCCs obtained from the ensemble models and the MAQC-II candidate models (P-value = 0.43).
Figure 5The comparison of the models from different analysis teams. The average MCC was calculated from 11 non-random endpoints in the external validation sets when I and M were excluded.