| Literature DB >> 30935364 |
Wei Gu1, Reha Yildirimman2, Emmanuel Van der Stuyft3, Denny Verbeeck3, Sascha Herzinger1, Venkata Satagopam1, Adriano Barbosa-Silva1, Reinhard Schneider1, Bodo Lange2, Hans Lehrach2,4,5, Yike Guo6, David Henderson7, Anthony Rowe8.
Abstract
BACKGROUND: For large international research consortia, such as those funded by the European Union's Horizon 2020 programme or the Innovative Medicines Initiative, good data coordination practices and tools are essential for the successful collection, organization and analysis of the resulting data. Research consortia are attempting ever more ambitious science to better understand disease, by leveraging technologies such as whole genome sequencing, proteomics, patient-derived biological models and computer-based systems biology simulations.Entities:
Keywords: Data management; Oncology; Precision medicine; Translational medicine
Mesh:
Year: 2019 PMID: 30935364 PMCID: PMC6444691 DOI: 10.1186/s12859-019-2748-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The components of the OncoTrack data coordination operation. The platform comprises three major components: the Electronic Data Capture System (EDC, OpenClinica), the Central Data Repository (OncoTrack DB), and the Data Integration System (tranSMART). The OpenClinica EDC system is used to collect medical history and observational patient data from clinical sites during the studies and feeds the structured data to the Central Data Repository. The Central Data Repository, OncoTrack DB is a sample indexed content management system. Data and results generated in the laboratories (before integration) are deposited and exchanged here. In order to link the different data types and layers, the data collected in the OncoTrack DB are integrated in the Data Integration System, tranSMART. The tranSMART data warehouse provides deep linking and integration between the clinical and laboratory data and a set of tools for the exploratory analysis of the integrated data
Fig. 2The OncoTrack dataset structure. a The complex OncoTrack data hierarchy with OMICS datasets directly generated from patient material and datasets generated from patient derived pre-clinical in vivo, in vitro and in silico models. b Due to constraints in tranSMART (v16.1) unable to represent this hierarchical use of samples, data has been organised as a series of different independent collections. One collection for data derived directly from patient samples and other collections for data derived from the pre-clinical models. c A solution we provided with linkage back to human subject and a tool to automatically map data using this linkage
Fig. 3Integration of OncoTrack data into tranSMART: (1) Left panel: Overall data representation in the TranSMART data tree. Right panel: easy customized cohort building with drag-and-drop. (2) Cascaded querying with cohort linking/selection tool PatientMapper. (3) Generating summary statistics of a miRNA of choice by dragging the miRNA-Seq node to the right panel and providing miRNA ID using the HiDome plugin. (4) Performing miRNA-ome wide heatmap analysis between the two sub-cohorts (here responder vs. non-responder for a selected drug treatment) using SmartR workflows
Fig. 4An overview of OncoTrack data that have been modelled, curated and loaded in the OncoTrack tranSMART Server