| Literature DB >> 35571355 |
Knut Wehrle1, Viola Tozzi2, Stefan Braune1, Fabian Roßnagel1, Heidi Dikow1, Silvia Paddock2, Arnfin Bergmann1, Philip van Hövell2.
Abstract
Objective: To implement a dynamic data management and control framework that meets the multiple demands of high data quality, rigorous information technology security, and flexibility to continuously incorporate new methodology for a large disease registry. Materials andEntities:
Keywords: data accuracy; information technology; reference standards; registries; trust
Year: 2022 PMID: 35571355 PMCID: PMC9097675 DOI: 10.1093/jamiaopen/ooac017
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Overview of the components of the registry and analytic site and the data flow in the field of multiple sclerosis (MS). Results from predictive modeling feedback into the DatabasE-aSsisted Therapy decIsioN support sYstem (DESTINY®) every 3 months to support treatment decision making in the real world. DPS: data processing site; LMU: Ludwig Maximilians Universität; NTD: NeuroTransData; PUID: patient unique identifier; SPMS: secondary progressive multiple sclerosis.
Figure 2.Journey of real-world data from capturing in the NTD registry to analysis at the DPS. Diamonds indicate checkpoints of the data control framework. (1) Already during the data collection process, dropdown menus ensure consistency during the input procedures and multiple automated checks ensure that accurate data are uploaded to the server. (2) The data transfer process from the registry to the data processing site (DPS) contains multiple checkpoints to ensure the integrity of the transfer. (3) After arriving at the DPS, the data are uploaded into a secure and strictly access-controlled structured query language (SQL) database and undergo additional control steps in a process that ends with a dataset ready for analysis. DESTINY®: DatabasE-aSsisted Therapy decIsioN support sYstem; DPS: data processing site; NTD: NeuroTransData.
Figure 3.Analysis process in an example from multiple sclerosis (MS). Diamonds indicate checkpoints of the data control framework. After arriving at the DPS, the files undergo multiple checks based on version-controlled scripts written in the programming language R. Errors discovered in these checks lead to exclusion of records from further processes. The clean data are then passed on to the next step and used to calculate the new model parameters. The processes that update the model also occur in a version-controlled (“GIT”) environment to ensure that a complete audit trail of changes is available. The updated App with new model predictions then undergoes series of tests for feasibility before the final version is uploaded to the public website.
Example of quality assurance queries
| Category | Example messages |
|---|---|
| Error | [Medication name] was prescribed and canceled on the same day ([date]). |
| [Medication name] was canceled on [Date] without giving a discontinuation reason. | |
| Relapse without date. | |
| Anamnesis: The first diagnosis ([Date]) cannot possibly be before a first manifestation ([Date]). | |
| Information | No first diagnosis date found. |
| Warning | Patient did not have any visit in the last quarter. |
| The patient does not have a single visit. Please complete the relevant core data sections or delete the patient. |