| Literature DB >> 29881753 |
Jay Knowlton1, Tom Belnap2, Bonnie Patelesio3, Elisa L Priest4, Friedrich von Recklinghausen5, Andreas H Taenzer6.
Abstract
INTRODUCTION: Health systems can be supported by collaborative networks focused on data sharing and comparative analytics to identify and rapidly disseminate promising care practices. Standardized data collection, quality assessment, and cleansing is a necessary process to facilitate meaningful analytics for operations, quality improvement, and research. We developed a framework for aligning data from health care delivery systems using the High Value Healthcare Collaborative central registry. FRAMEWORK: The centralized data registry model allows for multiple layers of data quality assessment. Our framework uses an iterative approach, starting with clear specifications, maintaining ongoing dialogue with diverse stakeholders, and regular checkpoints to assess data conformance, completeness, and plausibility. LESSONS LEARNED: We found that an iterative communication process is critical for a central registry to ensure: 1) clarity of data specifications, 2) appropriate data quality, and 3) thorough understanding of data source, purpose, and context. Engaging teams from all participating institutions and incorporating diverse stakeholders of clinicians, information technologists, data analysts, operations managers, and health services researchers in all decision making processes supports development of high quality datasets for comparative analytics across multiple institutions.Entities:
Keywords: Informatics; Learning Health System; Methods
Year: 2017 PMID: 29881753 PMCID: PMC5982973 DOI: 10.5334/egems.195
Source DB: PubMed Journal: EGEMS (Wash DC) ISSN: 2327-9214
Figure 1HVHC Data Submission Quality Management Framework.
Data quality assessment during Phase I and Phase II of HVHC sepsis care improvement work.
| HVHC Sepsis Data Quality Assessment | ||
|---|---|---|
| Phase I | Phase II | |
| All fields are found in the specified order using the specified naming conventions | All fields are found in the specified order using the specified naming conventions | |
| Ad-hoc distribution analyses of variables in question for each analysis | Programmed logic identifying percent missingness of each required field | |
| Ad-hoc identification of improper coding when performing downstream analyses | Programmed logic identifying valid values based on specified codes, appropriate dates (e.g., no future dates, birth date prior to death date), and general plausibility (e.g., ICU length of stay no longer than total inpatient length of stay) | |
Figure 2Phase II sepsis data submissions by HVHC Member and submission due date.
Institutions’ misalingment with LOS specifications.
| LOS Reporting Variant | Number of Institutions | Time to Alignment with Specification |
|---|---|---|
| Reported in hours | 6 | N/A |
| Reported in days rounded to nearest integer | 1 | 3 months (corrected with next submission) |
| Reported in days rounded to nearest integer; any value <1 rounds UP | 1 | 6 months (failed to correct with next submission; corrected with subsequent submission, adjusted by PMO in the interim) |
| Reported in days rounded UP to nearest integer | 1 | 3 months (corrected with next submission) |