| Literature DB >> 32025653 |
Earl F Glynn1, Mark A Hoffman1,2,3.
Abstract
OBJECTIVES: Electronic health record (EHR) data aggregated from multiple, non-affiliated, sources provide an important resource for biomedical research, including digital phenotyping. Unlike work with EHR data from a single organization, aggregate EHR data introduces a number of analysis challenges.Entities:
Keywords: data science; data visualization; electronic health record; phenotype
Year: 2019 PMID: 32025653 PMCID: PMC6993994 DOI: 10.1093/jamiaopen/ooz035
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.In this UpSet graph, each row represents a module in Cerner. Each column represents a combination of modules. A filled dot indicates that the module is a member of that particular set.
Figure 2.(A) Each spark graph represents encounters in one-month intervals between 2000 and 2017 for a distinct health system among the 100 health systems contributing to Health Facts. The left of each spark graph is the year 2000 and the right represents 2017. The height of each spark graph represents relative encounter activity. The scale of each health system varies. (B) Spark graphs for two representative health systems are expanded to show corresponding facility spark graphs. Each Spark graph represents monthly encounters. The left of each spark graph is the year 2000 and the right represents 2017. The height of each spark graph represents relative encounter activity.
Figure 3.Transition from ICD-9 to ICD-10. The height of each bar represents the number of facilities providing diagnosis codes per year, the color indicates which coding system was in use, including some facilities that used both ICD-9 and ICD-10 in a given year.
Frequency of documenting death using discharge disposition of “expired”
| Facility bed size (range) | Number of facilities | Facilities reporting deaths | Facilities not reporting deaths | % Reporting deaths |
|---|---|---|---|---|
| 1–5 | 294 | 67 | 227 | 22.8% |
| 6–99 | 154 | 141 | 13 | 91.6% |
| 100–199 | 80 | 72 | 8 | 90.0% |
| 200–299 | 63 | 48 | 15 | 76.2% |
| 300–499 | 43 | 36 | 7 | 83.7% |
| 500+ | 28 | 19 | 9 | 67.9% |
| Unknown | 2 | 1 | 1 | 50.0% |
| All facilities | 664 | 384 | 280 | 57.8% |
Health system and facility usage of representative clinical events
| Category | Group | Event description | Health systems | Facilities | Unique records |
|---|---|---|---|---|---|
| Travel | Travel location | African travel location | 1 | 6 | 27 |
| Travel | Travel location | Recent travel to West Africa | 1 | 6 | 3 215 896 |
| Travel | Travel location | Travel to African country | 5 | 24 | 1 235 487 |
| Travel | Travel location | Travel to West Africa within last month | 1 | 1 | 92 708 |
| Travel | Travel timing | Travel to Africa within the last 21 days | 1 | 4 | 1154 |
| Travel | Travel timing | Travel to Guinea/Liberia/Sierra Leone within past year | 1 | 4 | 722 |
| Vital sign | Height–Length | Height | 72 | 477 | 103 411 085 |
| Vital sign | Height–Length | Height, Body Surface Area | 1 | 2 | 221 |
| Vital sign | Height–Length | Height, Estimated | 45 | 247 | 3 591 945 |
| Vital sign | Height–Length | Height, Feet | 12 | 48 | 4 756 482 |
| Vital sign | Height–Length | Height, Inches | 31 | 174 | 14 434 466 |
| Vital sign | Height–Length | Height, Measured | 8 | 17 | 864 436 |
| Vital sign | Height–Length | Height, Percent | 14 | 41 | 11 365 |
| Vital sign | Height–Length | Height, Percent for age | 16 | 44 | 4918 |
| Vital sign | Height–Length | Length, Birth | 52 | 238 | 907 501 |
| Vital sign | Height–Length | Length, Infant | 5 | 20 | 13 926 |
Figure 4.(A) Use of smoking history prompt. For the 14 health systems with more than 100 encounters that used the smoking history prompt, we express the number of smoking history prompts per year as a percentage of total encounters in the same year. (B) Annual smoking history documentation by health system. Ridges plot showing frequency of usage of smoking history by health system as a function of time. Each ridgeline is a density plot for the events of the indicated health system. Each plot is scaled by the data from that health system.
Recommendations for aggregate EHR data analysis
| Source of variation | Recommendation |
|---|---|
| Variation in ancillary module adoption | Evaluate facility level use of topic specific data tables and fields. |
| Temporal variation of contributors | Evaluate contribution of each organization over time. Structure analyses to exclude organizations when they were not actively contributing data. |
| ICD version | For analyses that span the transition period from ICD-9 to ICD-10, estimate when data contributing organizations shifted from ICD-9 to ICD-10. |
| Outcome measure | For each organization, evaluate the availability of key outcomes measures. Exclude organizations that are not documenting the measure in the expected manner. Do not assume continuous, consistent usage. |
| Variation in documentation | Confirm that each organization in an analysis is using the documentation prompt(s) needed for an analysis, exclude those that do not. Evaluate conceptual overlaps in the descriptions of variations for similar concepts. Evaluate whether usage of prompt(s) is consistent over time. |