| Literature DB >> 28061771 |
Philipp Bruland1, Martin Dugas2.
Abstract
BACKGROUND: Data capture for clinical registries or pilot studies is often performed in spreadsheet-based applications like Microsoft Excel or IBM SPSS. Usually, data is transferred into statistic software, such as SAS, R or IBM SPSS Statistics, for analyses afterwards. Spreadsheet-based solutions suffer from several drawbacks: It is generally not possible to ensure a sufficient right and role management; it is not traced who has changed data when and why. Therefore, such systems are not able to comply with regulatory requirements for electronic data capture in clinical trials. In contrast, Electronic Data Capture (EDC) software enables a reliable, secure and auditable collection of data. In this regard, most EDC vendors support the CDISC ODM standard to define, communicate and archive clinical trial meta- and patient data. Advantages of EDC systems are support for multi-user and multicenter clinical trials as well as auditable data. Migration from spreadsheet based data collection to EDC systems is labor-intensive and time-consuming at present. Hence, the objectives of this research work are to develop a mapping model and implement a converter between the IBM SPSS and CDISC ODM standard and to evaluate this approach regarding syntactic and semantic correctness.Entities:
Keywords: Biomedical research; Clinical trials; Data management; Database; Database management systems; Metadata; Model transformation; Software tools; Statistical data
Mesh:
Year: 2017 PMID: 28061771 PMCID: PMC5219713 DOI: 10.1186/s12911-016-0402-4
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Sections of CDISC ODM with study metadata information left-hand side and structure of clinical values on the right. For metadata there is one hierarchy for elements to reuse them in a study. In contrast, data is hierarchically represented according to the metadata structure
SPSS input files of different projects and a sample file with all available data types
| Project | S1 | S2 | S3 | S4 | S5 | S6 | S7 | S8 | Sample |
|---|---|---|---|---|---|---|---|---|---|
| # Variables | 139 | 382 | 455 | 34 | 67 | 24 | 188 | 645 | 16 |
| # Cases | 2075 | 3452 | 2890 | 2890 | 2890 | 621 | 0 | 0 | 5 |
Fig. 2Mapping model between IBM SPSS and CDISC ODM. The upper part describes the mapping between SPSS variables and ODM metadata definitions which are mainly stored within the ItemDef- and CodeList-element. Clinical cases correspond to the ClinicalData-element. Values are stored in the respective ItemData-Value-attribute which is shown in the lower part
Fig. 3S2O command line application. Input file must be given. All other parameters are optional. It can be chosen whether the data should be converted, which source language is present and which column in SPSS contains the subject identifier
Fig. 4Upper spreadsheet part: Snapshot from SPSS test file is shown in the variable view. Lower XML part: Result of conversion (excerpt) in CDISC ODM. Item definitions and a CodeList are presented
Fig. 5Upper spreadsheet part: List of SPSS cases with respective values. Lower XML part: The resulting ODM ClinicalData-part of the first SPSS case