| Literature DB >> 33712854 |
Alison Callahan1, Vladimir Polony1, José D Posada1, Juan M Banda2, Saurabh Gombar3, Nigam H Shah1.
Abstract
OBJECTIVE: To propose a paradigm for a scalable time-aware clinical data search, and to describe the design, implementation and use of a search engine realizing this paradigm.Entities:
Keywords: data science; electronic health records; in-memory datastore, query language, search engine
Mesh:
Year: 2021 PMID: 33712854 PMCID: PMC8279796 DOI: 10.1093/jamia/ocab027
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.Summary view showing the number of patients meeting the search criteria, a summary of their demographics (histograms of age, race/ethnicity, and length of record), and their most frequently occurring diagnosis, procedure, medication, and laboratory test records.
Figure 2.Patient timeline view, displaying each patient as a row and showing the time intervals where a given search criterion was satisfied in different colors. For example, glipizide prescription records following type II diabetes diagnosis are shown in green, and subsequent stroke events are shown in pink.
Average query execution times in seconds for 100 randomly generated queries over electronic health records for ∼2.8 million patients from the Stanford Medicine CDW, using ACE or BigQuery; and over health insurance claims records for ∼65 million patients from the Optum Clinformatics Datamart, using ACE
| Command | Average [min-max] query response time (seconds) | ||
|---|---|---|---|
| Stanford Medicine CDW | Claims | ||
| ACE | BigQuery | ACE | |
|
| 0.015 [0.005–0.211] | 198.7 [121.0–642.0] | 1.224 [0.017–1.813] |
|
| 0.018 [0.005–0.306] | 167.7 [73.0–227.0] | 0.205 [0.026–1.618] |
|
| 0.026 [0.005–0.681] | 221.5 [124.0–940.0] | 0.0684 [0.026–1.530] |
|
| 0.024 [0.005–0.314] | 233.6 [152.0–748.0] | 1.018 [0.023–0.545] |
|
| 0.017 [0.005–0.214] | 202.3 [148.0–266.0] | 0.0602 [0.018–0.237] |
Abbreviations: ACE, advanced cohort engine; CDW, clinical data warehouse; OR, odds ratio.