| Literature DB >> 29370861 |
Martin Bialke1, Henriette Rau2, Oliver C Thamm3, Ronny Schuldt2, Peter Penndorf2, Arne Blumentritt2, Robert Gött2, Jens Piegsa2, Thomas Bahls2, Wolfgang Hoffmann2.
Abstract
BACKGROUND: In most research projects budget, staff and IT infrastructures are limiting resources. Especially for small-scale registries and cohort studies professional IT support and commercial electronic data capture systems are too expensive. Consequently, these projects use simple local approaches (e.g. Excel) for data capture instead of a central data management including web-based data capture and proper research databases. This leads to manual processes to merge, analyze and, if possible, pseudonymize research data of different study sites.Entities:
Keywords: Data dictionary; Data management; ECRF; Epidemiological research; Pseudonymization; Research repository
Mesh:
Year: 2018 PMID: 29370861 PMCID: PMC5785842 DOI: 10.1186/s12967-018-1390-1
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
List of additional requirements concerning data management for small-scale research projects based on [2]
| No. | Requirement |
|---|---|
| a1 | Open-source solution, free or with very low costs |
| a2 | Easy download and installation and/ or web-based data capture |
| a3 | Useful range of functionality, e.g. extensibility with additional modules like interfaces for integrating additional data sources |
| a4 | Adequate community support |
| a5 | Possibility for low-level training |
| a6 | User-friendliness and intuitive usability, e.g. easy design of eCRFs |
| a7 | Sufficient documentation |
| a8 | Management of organizational and technical processes to use and to provide access to the pseudonymized research data |
List of requirements for a comprehensive data management based on [1]
| No. | Requirement |
|---|---|
| 1 | Development of a data dictionary (DD) and electronic Case Report Forms (eCRF) for data capture |
| 2 | Generation and provision of web-based questionnaires in the form of eCRFs for central data collection |
| 3 | Separation of personally identifiable information (PII) [ |
| 4 | Separate storage of metadata and pseudonymized MDAT |
| 5 | ETL-processes: extraction of relevant data from connected data sources, transformation of the data to a uniform (internal) data format, the enrichment of metadata and the loading of the enriched data into the research data repository |
| 6 | Export and transfer of uniform, pseudonymized data in at least one standardized format (e.g. SPSS or SAS) for data analysis and/or quality assurance |
| 7 | Ensuring the possibility of authentication (managing users as well as roles and/ or rights) as well as study process and site management |
Fig. 1The functionalities of the modular Toolbox for Research can easily be extended with additional modules e.g. for quality assurance
Fig. 2The integrated pseudonymization service gPAS provides the necessary pseudonyms
Fig. 3Process chain of the Toolbox for Research
Fig. 4Architecture of the Toolbox for Research from a docker-container perspective
Fig. 5Comparison of manual and automated data management processes within the German Burn Registry in the years 2014 and 2017
Fig. 6Total number of study sites and variables of annual DGV-statistics and the German Burn Registry for the years 2011–2017