BACKGROUND: In molecular profiling studies of cancer patients, experimental and clinical data are combined in order to understand the clinical heterogeneity of the disease: clinical information for each subject needs to be linked to tumour samples, macromolecules extracted, and experimental results. This may involve the integration of clinical data sets from several different sources: these data sets may employ different data definitions and some may be incomplete. METHODS: In this work we employ semantic web techniques developed within the CancerGrid project, in particular the use of metadata elements and logic-based inference to annotate heterogeneous clinical information, integrate and query it. RESULTS: We show how this integration can be achieved automatically, following the declaration of appropriate metadata elements for each clinical data set; we demonstrate the practicality of this approach through application to experimental results and clinical data from five hospitals in the UK and Canada, undertaken as part of the METABRIC project (Molecular Taxonomy of Breast Cancer International Consortium). CONCLUSION: We describe a metadata approach for managing similarities and differences in clinical datasets in a standardized way that uses Common Data Elements (CDEs). We apply and evaluate the approach by integrating the five different clinical datasets of METABRIC.
BACKGROUND: In molecular profiling studies of cancerpatients, experimental and clinical data are combined in order to understand the clinical heterogeneity of the disease: clinical information for each subject needs to be linked to tumour samples, macromolecules extracted, and experimental results. This may involve the integration of clinical data sets from several different sources: these data sets may employ different data definitions and some may be incomplete. METHODS: In this work we employ semantic web techniques developed within the CancerGrid project, in particular the use of metadata elements and logic-based inference to annotate heterogeneous clinical information, integrate and query it. RESULTS: We show how this integration can be achieved automatically, following the declaration of appropriate metadata elements for each clinical data set; we demonstrate the practicality of this approach through application to experimental results and clinical data from five hospitals in the UK and Canada, undertaken as part of the METABRIC project (Molecular Taxonomy of Breast Cancer International Consortium). CONCLUSION: We describe a metadata approach for managing similarities and differences in clinical datasets in a standardized way that uses Common Data Elements (CDEs). We apply and evaluate the approach by integrating the five different clinical datasets of METABRIC.
Authors: Peter A Covitz; Frank Hartel; Carl Schaefer; Sherri De Coronado; Gilberto Fragoso; Himanso Sahni; Scott Gustafson; Kenneth H Buetow Journal: Bioinformatics Date: 2003-12-12 Impact factor: 6.937
Authors: Luis Marenco; Tzuu-Yi Wang; Gordon Shepherd; Perry L Miller; Prakash Nadkarni Journal: J Am Med Inform Assoc Date: 2004-08-06 Impact factor: 4.497
Authors: Frank W Hartel; Sherri de Coronado; Robert Dionne; Gilberto Fragoso; Jennifer Golbeck Journal: J Biomed Inform Date: 2005-04 Impact factor: 6.317
Authors: J L Haybittle; R W Blamey; C W Elston; J Johnson; P J Doyle; F C Campbell; R I Nicholson; K Griffiths Journal: Br J Cancer Date: 1982-03 Impact factor: 7.640
Authors: Faustin Laurentiu Roman; Daniel Abler; Vassiliki Kanellopoulos; Gabriel Amoros; Jim Davies; Manjit Dosanjh; Raj Jena; Norman Kirkby; Ken Peach; Jose Salt Journal: J Radiat Res Date: 2013-07 Impact factor: 2.724
Authors: Sebastian Mate; Felix Köpcke; Dennis Toddenroth; Marcus Martin; Hans-Ulrich Prokosch; Thomas Bürkle; Thomas Ganslandt Journal: PLoS One Date: 2015-01-14 Impact factor: 3.240
Authors: Lorna Morris; Andrew Tsui; Charles Crichton; Steve Harris; Peter H Maccallum; William J Howat; Jim Davies; James D Brenton; Carlos Caldas Journal: BMC Bioinformatics Date: 2013-05-01 Impact factor: 3.169
Authors: Massimiliano Izzo; Francesco Mortola; Gabriele Arnulfo; Marco M Fato; Luigi Varesio Journal: BMC Genomics Date: 2014-05-06 Impact factor: 3.969
Authors: Gerald Li; Peter Bankhead; Philip D Dunne; Paul G O'Reilly; Jacqueline A James; Manuel Salto-Tellez; Peter W Hamilton; Darragh G McArt Journal: Brief Bioinform Date: 2017-07-01 Impact factor: 11.622