| Literature DB >> 35036548 |
Hanieh Razzaghi1,2, Jane Greenberg2, L Charles Bailey1,3.
Abstract
INTRODUCTION: Secondary use of electronic health record (EHR) data for research requires that the data are fit for use. Data quality (DQ) frameworks have traditionally focused on structural conformance and completeness of clinical data extracted from source systems. In this paper, we propose a framework for evaluating semantic DQ that will allow researchers to evaluate fitness for use prior to analyses.Entities:
Keywords: EHR data; data quality; fit‐for‐use
Year: 2021 PMID: 35036548 PMCID: PMC8753309 DOI: 10.1002/lrh2.10264
Source DB: PubMed Journal: Learn Health Syst ISSN: 2379-6146
FIGURE 1Conceptual model for semantic DQA, showing the major elements informing the development of semantic DQA checks
Central elements of semantic DQA conceptual model
| Phase | Construct | Definition | Examples |
|---|---|---|---|
| Semantic DQ Design Principles | Clinical Data Factors |
Expresses clinical concept for which data quality (DQ) must be measured Considers the ways in which underlying workflow affects potential variables Connects clinical concepts and data provenance |
Hypertension can be measured through diagnoses, medications (prescriptions or administration of antihypertensives), or blood pressure measurements in EHR data |
| Analytic Uses |
Weighs the impact of the clinical concept undergoing DQ assessment Considers the scope: how widely the DQ check will be implemented |
Main exposure variables or outcomes may be more important than minor covariates | |
| DQ Principles |
Addresses the combination of established DQ theory with current needs Develops roadmap to determine appropriate DQ method Focuses the results of variable testing |
Benchmarking hypertension metrics across institutions for face validity requires a different set of tools than attempting to use external sources to test the plausibility of blood pressure values Common DQ principles include outlier detection, completeness of records, variable concordance, and plausible distribution of facts | |
| Semantic DQ Practice | Representation |
Translates clinical concepts to data‐adapted variable definitions |
More precise clinical definitions should be considered—eg, hypertension defined as use of antihypertensives may be important to measure specificity and hypertension defined as a series of blood pressure measurements allows more flexibility in analytic modeling |
| Assessment Lenses |
Supplies specific assessments to evaluate the validity of variables |
Common lenses to consider in clinical research are epidemiology, diagnoses, clinical care, and health care utilization. | |
| DQ Methods |
Applies statistical or descriptive methods to evaluate DQ principles |
Methods can range from simple (eg, proportions or frequency distributions) to complex (eg, PCA, clustering, or other machine learning) Results can be categorical or can rely on visualization. Thresholds for acceptable DQ can be pre‐determined or part of the applied methodology. |
Note: Green elements address development of clinical content for testing, while blue rows address application of DQA testing methods.
Assessment lens types for health care data
| Assessment lens | Goal | Sample tests |
|---|---|---|
| Epidemiology | Examine incidence and distribution of analytic variables such as diagnoses, drug exposures, procedure events, etc, to check for validity or internal consistency |
Incidence and population characteristics Subtype (eg, sub‐diagnosis) breakdown and clinical setting characteristics Variation of prevalence across participating institutions |
| Diagnostics | Identify and attempt to phenotype patients or medical events based on clinical criteria available in the data |
Major clinical facts available to cross‐validate Impact of variations on cohort definitions Distributions of test results, vital signs, or other medical events used in diagnosis |
| Clinical Care | Examine treatments, evolution over time, or expected clinical pathways for patients to identify potential variations or outliers |
Common clinical co‐occurrences Treatment data available / plausible for indication Sequence of events available / plausible over time |
| Utilization | Determine pattern of healthcare utilization given a set of clinical characteristics |
Visit type alignment with diagnoses/comorbidity severity Facts associated with visit types (eg, ICU with more frequent vital signs) Variation of utilization across participating institutions |
Note: Commonly used lenses for clinical data are described, with examples of specific types of tests each might produce.
FIGURE 2Example semantic DQA specification process, demonstrating the phases by which research hypotheses associated with the example chemotherapy/infection study are translated to requirements for semantic DQA checks
Application of data quality practices
| Data quality principle | Sample DQ checks |
|---|---|
| Numeric outlier detection |
Develop plots of blood pressure and temperature values to detect implausible values |
| Temporal outlier detection |
Determine proportion of hospitalizations where antibiotic treatment is provided without evidence of abnormal vitals (ie, temperature or blood pressure), stratified by site. Create time trend graphs to detect whether there are any abnormal spikes in hospitalizations for infection, in relation to chemotherapy intensity |
| Completeness of records |
Determine proportion of antibiotic drugs appropriately mapped from source systems, stratified by institution and antibiotic name Create metrics for patients for whom complete chemotherapy data are available: eg, |
| Concordance of facts |
Create Venn Diagram of labs, vitals, diagnoses, and medications to understand how different definitions alter the overlap of the data domains to define infection |
| Plausible distribution of facts |
Proportion of patients with sepsis diagnosis and visit to the ICU in cancer cohort vs a cohort of healthy patients |
Note: Selected DQ targets from the example chemotherapy/infection study are shown, annotated by the assessment lenses that produced them.
FIGURE 3DQ check for outliers in vital sign measurement. The results of k‐means clustering over the first two principal components evaluating measurement of four vital signs sensitive to severity of illness are shown. Measurement rates were assessed by site and within site by care setting (ICU or non‐ICU inpatient unit)
FIGURE 4Dotplot of study cohort contribution across PEDSnet institutions. The observed to expected ratio for multiple PEDSnet studies is shown. Participating PEDSnet institutions are lined across the x‐axis and dots represent individual network studies