INTRODUCTION: Data quality (DQ) is an important prerequisite for secondary use of electronic health record (EHR) data in clinical research, particularly with regards to progressing towards a learning health system, one of the MIRACUM consortium's goals. Following the successful integration of the i2b2 research data repository in MIRACUM, we present a standardized and generic DQ framework. STATE OF THE ART: Already established DQ evaluation methods do not cover all of MIRACUM's requirements. CONCEPT: A data quality analysis plan was developed to assess common data quality dimensions for demographic-, condition-, procedure- and department-related variables of MIRACUM's research data repository. IMPLEMENTATION: A data quality analysis (DQA) tool was developed using R scripts packaged in a Docker image with all the necessary dependencies and R libraries for easy distribution. It integrates with the i2b2 data repository at each MIRACUM site, executes an analysis on the data and generates a DQ report. LESSONS LEARNED: Our DQA tool brings the analysis to the data and thus meets the MIRACUM data protection requirements. It evaluates established DQ dimensions of data repositories in a standardized and easily distributable way. This analysis allowed us to reveal and revise inconsistencies in earlier versions of the ETL jobs. The framework is portable, easy to deploy across different sites and even further adaptable to other database schemes. CONCLUSION: The presented framework provides the first step towards a unified, standardized and harmonized EHR DQ assessment in MIRACUM. DQ issues can now be systematically identified by individual hospitals to subsequently implement site- or consortium-wide feedback loops to increase data quality.
INTRODUCTION: Data quality (DQ) is an important prerequisite for secondary use of electronic health record (EHR) data in clinical research, particularly with regards to progressing towards a learning health system, one of the MIRACUM consortium's goals. Following the successful integration of the i2b2 research data repository in MIRACUM, we present a standardized and generic DQ framework. STATE OF THE ART: Already established DQ evaluation methods do not cover all of MIRACUM's requirements. CONCEPT: A data quality analysis plan was developed to assess common data quality dimensions for demographic-, condition-, procedure- and department-related variables of MIRACUM's research data repository. IMPLEMENTATION: A data quality analysis (DQA) tool was developed using R scripts packaged in a Docker image with all the necessary dependencies and R libraries for easy distribution. It integrates with the i2b2 data repository at each MIRACUM site, executes an analysis on the data and generates a DQ report. LESSONS LEARNED: Our DQA tool brings the analysis to the data and thus meets the MIRACUM data protection requirements. It evaluates established DQ dimensions of data repositories in a standardized and easily distributable way. This analysis allowed us to reveal and revise inconsistencies in earlier versions of the ETL jobs. The framework is portable, easy to deploy across different sites and even further adaptable to other database schemes. CONCLUSION: The presented framework provides the first step towards a unified, standardized and harmonized EHR DQ assessment in MIRACUM. DQ issues can now be systematically identified by individual hospitals to subsequently implement site- or consortium-wide feedback loops to increase data quality.
Keywords:
Data analysis; clinical research; data quality; electronic health record
Authors: Sepideh Hooshafza; Louise Mc Quaid; Gaye Stephens; Rachel Flynn; Laura O'Connor Journal: J Am Med Inform Assoc Date: 2022-04-13 Impact factor: 4.497
Authors: D Brammen; F Greiner; M Kulla; R Otto; W Schirrmeister; S Thun; S E Drösler; J Pollmanns; S C Semler; R Lefering; V S Thiemann; R W Majeed; K U Heitmann; R Röhrig; F Walcher Journal: Med Klin Intensivmed Notfmed Date: 2020-12-21 Impact factor: 0.840
Authors: Lorenz A Kapsner; Marvin O Kampf; Susanne A Seuchter; Julian Gruendner; Christian Gulden; Sebastian Mate; Jonathan M Mang; Christina Schüttler; Noemi Deppenwiese; Linda Krause; Daniela Zöller; Julien Balig; Timo Fuchs; Patrick Fischer; Christian Haverkamp; Martin Holderried; Gerhard Mayer; Holger Stenzhorn; Ana Stolnicu; Michael Storck; Holger Storf; Jochen Zohner; Oliver Kohlbacher; Adam Strzelczyk; Jürgen Schüttler; Till Acker; Martin Boeker; Udo X Kaisers; Hans A Kestler; Hans-Ulrich Prokosch Journal: Front Public Health Date: 2021-01-13
Authors: Carsten Oliver Schmidt; Stephan Struckmann; Cornelia Enzenbach; Achim Reineke; Jürgen Stausberg; Stefan Damerow; Marianne Huebner; Börge Schmidt; Willi Sauerbrei; Adrian Richter Journal: BMC Med Res Methodol Date: 2021-04-02 Impact factor: 4.615
Authors: Jonathan M Mang; Susanne A Seuchter; Christian Gulden; Stefanie Schild; Detlef Kraska; Hans-Ulrich Prokosch; Lorenz A Kapsner Journal: BMC Med Inform Decis Mak Date: 2022-08-11 Impact factor: 3.298