| Literature DB >> 33610878 |
Sylvia E K Sudat1, Sarah C Robinson2, Satish Mudiganti2, Aravind Mani3, Alice R Pressman2.
Abstract
Data quality is essential to the success of the most simple and the most complex analysis. In the context of the COVID-19 pandemic, large-scale data sharing across the US and around the world has played an important role in public health responses to the pandemic and has been crucial to understanding and predicting its likely course. In California, hospitals have been required to report a large volume of daily data related to COVID-19. In order to meet this need, electronic health records (EHRs) have played an important role, but the challenges of reporting high-quality data in real-time from EHR data sources have not been explored. We describe some of the challenges of utilizing EHR data for this purpose from the perspective of a large, integrated, mixed-payer health system in northern California, US. We emphasize some of the inadequacies inherent to EHR data using several specific examples, and explore the clinical-analytic gap that forms the basis for some of these inadequacies. We highlight the need for data and analytics to be incorporated into the early stages of clinical crisis planning in order to utilize EHR data to full advantage. We further propose that lessons learned from the COVID-19 pandemic can result in the formation of collaborative teams joining clinical operations, informatics, data analytics, and research, ultimately resulting in improved data quality to support effective crisis response.Entities:
Keywords: COVID-19; Data quality; Electronic health record; Real-world data
Mesh:
Year: 2021 PMID: 33610878 PMCID: PMC7892315 DOI: 10.1016/j.jbi.2021.103715
Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 6.317
COVID data domains and challenges in reporting data from the EHR.
| Data domain | Challenges |
|---|---|
| Hospital capacity | Difficulty reproducing the actual bed count in the hospital from EHR bed records Mismatch between bed classification according to use (e.g. telemetry) and capacity estimates based on location (e.g. department) Changes in bed classifications and counts due to pandemic surge capacity coming online or offline |
| COVID-19 confirmed patients | Difficulty identifying COVID-positive patients who had external (e.g. scanned) results Inconsistency in fixed-field real-time diagnoses such as the hospital problem list |
| COVID-19 suspected patients | Symptom-based definitions that are difficult to reproduce using EHR data |
| COVID deaths | Difficulty identifying COVID-19 patient deaths with confidence without being able to utilize discharge diagnoses |