| Literature DB >> 36149742 |
Brian J Wells1, Stephen M Downs2, Brian Ostasiewski3.
Abstract
Electronic health records (EHRs) were originally developed for clinical care and billing. As such, the data are not collected, organized, and curated in a fashion that is optimized for secondary use to support the Learning Health System. Population health registries provide tools to support quality improvement. These tools are generally integrated with the live EHR, are intended to use a minimum of computing resources, and may not be appropriate for some research projects. Researchers may require different electronic phenotypes and variable definitions from those typically used for population health, and these definitions may vary from study to study. Establishing a formal registry that is mapped to the Observation Medical Outcomes Partnership common data model provides an opportunity to add custom mappings and more easily share these with other institutions. Performing preprocessing tasks such as data cleaning, calculation of risk scores, time-to-event analysis, imputation, and transforming data into a format for statistical analyses will improve efficiency and make the data easier to use for investigators. Research registries that are maintained outside the EHR also have the luxury of using significant computational resources without jeopardizing clinical care data. This paper describes a virtual Diabetes Registry at Atrium Health Wake Forest Baptist and the plan for its continued development. ©Brian J Wells, Stephen M Downs, Brian Ostasiewski. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 23.09.2022.Entities:
Keywords: EHR; Learning Health System; diabetes; electronic health record; registry
Year: 2022 PMID: 36149742 PMCID: PMC9547333 DOI: 10.2196/39746
Source DB: PubMed Journal: JMIR Med Inform
Characteristics of research registries vs population health registries.
| Research registry | Population health registry |
| Intermittent updates | Real-time updates |
| Higher computational resources | Low resource use |
| Complex definitions from a variety of sources and multiple definitions for similar concepts | Simple definitions defined by QI-baseda reimbursement |
| Variety of external data sources | Data limited to EHRb |
| Extensive data processing | Limited data processing |
| Complex temporal relationships | Single point in time |
| Easily accessible and detailed documentation | Documentation or coding sometimes lacking or not easily accessible |
| Does not need to be integrated into workflow | Integration in clinic workflow is crucial |
| Does not require front-end EHR access. | Requires front-end EHR access with PHIc |
| Mapped to open-source common data models | Mapped to vendor-based |
aQI: quality improvement.
bEHR: electronic health record.
cPHI: protected health information.
Figure 1Sets of patients with possible diabetes according to definitions based on diagnoses codes (DX), laboratory values (LAB), or prescriptions (RX).
Figure 2Comparing data structures of electronic health records (EHR), population health registries, and research registries.
Figure 3Venn diagram showing the use of electronic algorithms combined with chart reviews to identify patients with diabetes. DM: Diabetes Mellitus; EHR: electronic health record; HbA1c: hemoglobin A1c; ICD: International Classification of Diseases.
Characteristics of patientsa who showed evidence of possible diabetes based on diagnoses codes, laboratory values, or medications.
| Characteristics | Cohort 1: diagnosis | Cohort 2: labsb | Cohort 3: medications | ||||
| Total unique patients, n (%) | 84,755 (66) | 90,967 (71) | 84,165 (66) | ||||
| Age (years), median (IQR) | 66.02 (19.43) | 65.46 (20.20) | 64.62 (20.98) | ||||
|
| |||||||
|
| Female | 43,510 (51.34) | 44,008 (48.38) | 43,374(51.53) | |||
|
| Male | 41,239 (48.66) | 46,950 (51.61) | 40,783 (48.46) | |||
|
| |||||||
|
| White | 59,547 (70.26) | 65,693 (72.22) | 60,014 (71.30) | |||
|
| Black | 19,120 (22.56) | 19,042 (20.93) | 17,905 (21.27) | |||
|
| Other | 5794 (6.84) | 5938 (6.53) | 6004 (7.13) | |||
|
| Missing | 286 (0.34) | 267 (0.29) | 223 (0.26) | |||
| Ever smoker, n (%) | 43,414 (51.22) | 48,842 (53.69) | 44,133 (52.44) | ||||
| Insulin (1 or more prescriptions in the past year), n (%) | 25,663 (30.28) | 25,943 (28.52) | 26,685 (31.70) | ||||
| Charlson comorbidity index, n (median) | 83,699 (2) | 89,692 (2) | 83,094 (2) | ||||
| Median household income, n (median) | 66,034 (46,283) | 69,253 (45,688) | 64,839 (45,927) | ||||
| Most recent hemoglobin A1c, n (median) | 64,959 (6.9) | 72,833 (7.1) | 69,933 (7.0) | ||||
| Most recent eGFRc, n (median) | 73,037 (70) | 88,633 (66) | 80,424 (70) | ||||
| Most recent LDLd, n (median) | 58,463 (88) | 60,398 (88) | 59,864 (89) | ||||
aPatients may exist in 1, 2, or all 3 of the cohorts.
bRandom blood sugar ≥200 mg/dL or hemoglobin A1c≥6.5%.
ceGFR: estimated glomerular filtration rate calculated using the Chronic Kidney Disease Epidemiology Collaboration (CKD-Epi) equation.
dLDL: low-density lipoprotein.
Figure 4Schematic of the overall architecture of the registry, highlighting some of the guiding principles governing the registry creation. CDM: common data model; EHR: electronic health records; OMOP: Observational Medical Outcomes Partnership Common Data Model; PCOR: patient-centered outcomes research common data model; TDW: Translational Data Warehouse in the Wake Forest Clinical and Translational Science Institute; UMLS: Unified Medical Language System.
Figure 5Example analytic data set extracted from the registry in a pivot table format. F: false; NA: not applicable; T: true.