James J Cimino, Elaine J Ayres, Andrea Beri, Robert Freedman, Ellen Oberholtzer, Sachi Rath.
Abstract
The US National Institutes of Health has developed a repository of clinical research data drawn in part from electronic health records. A new de-identified data query tool is under development to support re-use of these data. We used a collection of 30 human-mediated user queries to determine whether the features of the tool will be sufficient to allow users to carry out the queries themselves. The results show that the tool as implemented in February 2013 will carry out a small percentage of user queries, but the planned extensions will be sufficient for carrying out the majority of such queries. Future development of the tool will include extensions that correspond to the features found in human-mediated queries.
Year: 2013 PMID: 23920633 PMCID: PMC4209406
Source DB: PubMed Journal: Stud Health Technol Inform ISSN: 0926-9630
Figure 1. The first version of the de-identified query tool. Three query modules have been dragged from the left-hand menu into the main frame and “ANDed” to each other. Note that the Diagnosis module has a controlled term (Chronic Granulomatous Disease) specified, as well as date and age ranges. The Laboratory Test module also has a controlled term (Erythrocyte Sedimentation Rate) selected and a range specified for the test value. The Medication module shows the term lookup (“prednisone”).
Features Planned for De-Identified Query Tool
| Query Function | Version 1 | Later Versions |
|---|---|---|
| Domains | Demographics, Diagnoses, Procedures, Medications, Laboratory Results | Admission/Discharge/Transfer, Alerts, Allergies, Blood Bank, Clinical Documents (notes written by healthcare personnel caring for the patients), ECG, Echo, Pathology, Pulmonary Functions, Radiology, Vital Signs |
| Query Features | Controlled Terminology, Age Range, Date Ranges, Value Range | Cardinality |
| Relationships | AND, OR | NOT, Before, After |
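The comparison of a query against the planned features can be sketched in code. The following is a minimal illustration, not the tool's implementation: the feature sets are transcribed from the table above, while the `classify` function, the dictionary layout, and the three status labels are assumptions introduced here. The example query mirrors the one shown in Figure 1.

```python
# Feature sets transcribed from the features table. The names VERSION_1,
# LATER, and classify are illustrative assumptions, not part of the tool.
VERSION_1 = {
    "domains": {"Demographics", "Diagnoses", "Procedures", "Medications",
                "Laboratory Results"},
    "features": {"Controlled Terminology", "Age Range", "Date Ranges",
                 "Value Range"},
    "relationships": {"AND", "OR"},
}
LATER = {
    "domains": {"Admission/Discharge/Transfer", "Alerts", "Allergies",
                "Blood Bank", "Clinical Documents", "ECG", "Echo",
                "Pathology", "Pulmonary Functions", "Radiology",
                "Vital Signs"},
    "features": {"Cardinality"},
    "relationships": {"NOT", "Before", "After"},
}


def classify(query):
    """Return 'version 1', 'later versions', or 'not planned'.

    `query` maps 'domains', 'features', and 'relationships' to sets of
    names; a missing key is treated as an empty requirement.
    """
    status = "version 1"
    for key in ("domains", "features", "relationships"):
        needed = set(query.get(key, ()))
        # Anything outside both columns of the table is not planned at all.
        if not needed <= (VERSION_1[key] | LATER[key]):
            return "not planned"
        # Anything outside the Version 1 column needs a later version.
        if not needed <= VERSION_1[key]:
            status = "later versions"
    return status


# The query from Figure 1: three modules ANDed together.
figure_1_query = {
    "domains": {"Diagnoses", "Laboratory Results", "Medications"},
    "features": {"Controlled Terminology", "Age Range", "Date Ranges",
                 "Value Range"},
    "relationships": {"AND"},
}
print(classify(figure_1_query))
```

Under these assumptions a query is only as well supported as its least supported requirement, so a single unplanned domain or relationship marks the whole query as not planned.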
Characteristics of User Queries.
| Query # | Domains | Relationship | Attributes |
|---|---|---|---|
| 1 | |||
| 2 | |||
| 3 | |||
| 4 | |||
| 5 | |||
| 6 | |||
| 7 | AE, | ||
| 8 | |||
| 9 | |||
| 10 | |||
| 11 | |||
| 12 | |||
| 13 | |||
| 14 | |||
| 15 | |||
| 16 | |||
| 17 | |||
| 18 | |||
| 19 | |||
| 20 | |||
| 21 | |||
| 22 | |||
| 23 | |||
| 24 | |||
| 25 | |||
| 26 | |||
| 27 | |||
| 28 | |||
| 29 | |||
| 30 | |||
Bold items are included in Version 1; italic items with underline are planned for future versions. Key to Domains: Admission/Discharge/Transfer, AE=Adverse events, B=Blood Bank, C=Clinical Documents (notes written by members of the healthcare team), D=Demographics, Dx=Diagnosis, E=Echo, L=Laboratory Tests, M=Medications, Mi=Microbiology, P=Pathology, R=Radiology, S=(Research) Study, V=Vital Signs. Key to Attributes: a=age, c=controlled terms, d=date range, e=expiration date, g=gender, l=location, n=normal ranges, t=text search, u=units of measure, v=discrete values. For example, Query #2 involved the domains Demographics (implemented) and Clinical Documents (planned), one planned attribute (date range), and two unplanned attributes (normal ranges and units of measure). Of note, while searching on these two attributes is currently not allowed, they are included along with any reported laboratory results.
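The abbreviation key can be applied mechanically. The sketch below is illustrative only: the code-to-name mappings are copied from the key above, but the `expand` helper and the assumption that table cells hold comma-separated codes are introduced here. Admission/Discharge/Transfer is omitted because its abbreviation does not appear in the key as given.

```python
# Mappings copied from the key to the table. Admission/Discharge/Transfer
# is left out because the key lists no abbreviation for it.
DOMAIN_KEY = {
    "AE": "Adverse events", "B": "Blood Bank", "C": "Clinical Documents",
    "D": "Demographics", "Dx": "Diagnosis", "E": "Echo",
    "L": "Laboratory Tests", "M": "Medications", "Mi": "Microbiology",
    "P": "Pathology", "R": "Radiology", "S": "(Research) Study",
    "V": "Vital Signs",
}
ATTRIBUTE_KEY = {
    "a": "age", "c": "controlled terms", "d": "date range",
    "e": "expiration date", "g": "gender", "l": "location",
    "n": "normal ranges", "t": "text search", "u": "units of measure",
    "v": "discrete values",
}


def expand(codes, key):
    """Expand a comma-separated cell of codes (assumed format) into names.

    Empty entries, such as the one left by a trailing comma, are skipped.
    """
    return [key[code.strip()] for code in codes.split(",") if code.strip()]


# Query #2 as described in the note: domains D and C, attributes d, n, u.
print(expand("D, C", DOMAIN_KEY))
print(expand("d, n, u", ATTRIBUTE_KEY))
```

An unknown code raises a `KeyError` rather than being silently dropped, which seems the safer choice when decoding a table whose cells may be incomplete.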