| Literature DB >> 27134723 |
Thomas Cokelaer1, Mukesh Bansal2, Christopher Bare3, Erhan Bilal4, Brian M Bot3, Elias Chaibub Neto3, Federica Eduati5, Alberto de la Fuente6, Mehmet Gönen7, Steven M Hill8, Bruce Hoff3, Jonathan R Karr9, Robert Küffner10, Michael P Menden5, Pablo Meyer4, Raquel Norel4, Abhishek Pratap3, Robert J Prill11, Matthew T Weirauch12, James C Costello13, Gustavo Stolovitzky14, Julio Saez-Rodriguez15.
Abstract
UNLABELLED: DREAM challenges are community competitions designed to advance computational methods and address fundamental questions in system biology and translational medicine. Each challenge asks participants to develop and apply computational methods to either predict unobserved outcomes or to identify unknown model parameters given a set of training data. Computational methods are evaluated using an automated scoring metric, scores are posted to a public leaderboard, and methods are published to facilitate community discussions on how to build improved methods. By engaging participants from a wide range of science and engineering backgrounds, DREAM challenges can comparatively evaluate a wide range of statistical, machine learning, and biophysical methods. Here, we describe DREAMTools, a Python package for evaluating DREAM challenge scoring metrics. DREAMTools provides a command line interface that enables researchers to test new methods on past challenges, as well as a framework for scoring new challenges. As of March 2016, DREAMTools includes more than 80% of completed DREAM challenges. DREAMTools complements the data, metadata, and software tools available at the DREAM website http://dreamchallenges.org and on the Synapse platform at https://www.synapse.org. AVAILABILITY: DREAMTools is a Python package. Releases and documentation are available at http://pypi.python.org/pypi/dreamtools. The source code is available at http://github.com/dreamtools/dreamtools.Entities:
Keywords: DREAM; benchmarking; collaborative competition; crowdsourcing; machine learning; method evaluation; systems biology; translational medicine
Year: 2015 PMID: 27134723 PMCID: PMC4837986 DOI: 10.12688/f1000research.7118.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. DREAMTools library framework.
DREAM challenges are described at the DREAM website ( http://dreamchallenges.org) where researchers can get an overview of the past and current challenges. Each challenge has its own project page within the Synapse framework ( http://synapse.org) where details about the challenge are available. The final leaderboard showing benchmarks achieved at the end of the challenge are also shown in the Synapse project. DREAMTools provides a Python library that allows researchers to retrieve a template for each closed challenge and to easily score a prediction/template against the gold standard. In a few lines of code, the score of a prediction can then be compared to the official leaderboard, as illustrated in the example in the green box on the right hand side of the figure.
Availability of the DREAM scoring functions within DREAMTools.
The first column provides the nickname used in DREAMTools to refer to a challenge. The challenge’s title (second column) and its Synapse identifier (fourth column) can be used to retrieve all details about a challenge. The third column gives the challenge status within DREAMTools: most of the challenges’ scoring functions are implemented in DREAMTools (green boxes); open challenges are not yet available (blue boxes); a couple of challenges did not release the gold standard and may not be implemented (red boxes labelled ’No GS’ for no gold standard); some are to be implemented in future releases (orange boxes labelled ’TBD’ for to be done).
| DREAM Nickname | Title | Availability | Synapse ID |
|---|---|---|---|
| D2C1 | BCL6 Transcriptional Target Prediction | Implemented |
|
| D2C2 | Protein-Protein Interaction Network Inference | Implemented |
|
| D2C3 | Synthetic Five-Gene Network Inference | Implemented |
|
| D2C4 |
| Implemented |
|
| D2C5 | Genome-Scale Network Inference | Implemented |
|
| D3C1 | Signaling Cascade Identification | Implemented |
|
| D3C2 | Signaling Response Prediction | Implemented |
|
| D3C3 | Gene Expression Prediction | Implemented |
|
| D3C4 |
| Implemented |
|
| D4C1 | Peptide Recognition Domain Specificity Prediction | Implemented |
|
| D4C2 |
| Implemented |
|
| D4C3 | Predictive Signaling Network Modeling | Implemented |
|
| D5C1 | Epitope-Antibody Recognition Specificity Prediction | Implemented |
|
| D5C2 | Transcription Factor DNA Motif Recognition | Implemented |
|
| D5C3 | Systems Genetics Challenge,B | Implemented |
|
| D5C4 | Network Inference Challenge | Implemented |
|
| D6C1 | Alternative Splicing | TBD |
|
| D6C2 | see D7C1 | Implemented |
|
| D6C3 | Gene Expression Prediction | Implemented |
|
| D6C4 | FlowCAP2 Molecular Classification of Acute Myeloid LeuKaimia | Implemented |
|
| D7C1 | Network Topology and Parameter Inference | Implemented |
|
| D7C2 | Breast Cancer Prognosis | TBD |
|
| D7C3 | The DREAM Phil Bowen ALS Prediction Prize4Life | Implemented |
|
| D7C4 | NCI-DREAM Drug Sensitivity | Implemented |
|
| D8C1 | HPN-DREAM Breast Cancer Network Inference | Implemented |
|
| D8C2 | NIEHS-NCATS-UNC DREAM Toxicogenetics | Implemented |
|
| D8C3 | The Whole-Cell Parameter Estimation | TBD |
|
| D8dot5 | The Rheumatoid Arthritis Responder | Implemented |
|
| D9C1 | The Broad-DREAM Gene Essentiality Prediction | Implemented |
|
| D9C2 | Acute Myeloid Leukemia Outcome Prediction | No GS |
|
| D9C3 | Alzheimer’s Disease Big Data | No GS |
|
| D9C4 | ICGC-TCGA-DREAM Somatic Mutation Calling | TBD |
|
| D9dot5C1 | Olfactory Challenge | Implemented |
|
| D9dot5C2 | Prostate Cancer | TBD |
|
| D10C1 | DREAM ALS Stratification Prize4Life | Open Challenge |
|
| D10C2 | ICGC-TCGA-DREAM Somatic Mutation Calling Tumor Heterogeneity | Open Challenge |
|
| D10C3 | ICGC-TCGA DREAM Somatic Mutation Calling RNA Challenge
| Open Challenge |
|
Figure 2. DREAMTools provides scoring methods to score or rank new predictions.
However, as shown in this figure, other functions may be provided. For instance, the plot() method available in the D5C2 challenge shows 4 sub-figures with the score of a submission (blue square) compared to the official participants (black crosses) for 4 metrics (AUROC, AUPR, Spearman versus Pearson correlation). This example is available as an IPython notebook in the DREAMTools repository.
Contingency table for a binary classifier.
| Condition/Gold standard | |||
|---|---|---|---|
| Condition
| Condition
| ||
| Prediction | Positive |
|
|
| Negative |
|
| |