| Literature DB >> 36076268 |
John Grimes1, Piotr Szul2, Alejandro Metke-Jimenez2, Michael Lawley2, Kylynn Loi2.
Abstract
BACKGROUND: Health data analytics is an area that is facing rapid change due to the acceleration of digitization of the health sector, and the changing landscape of health data and clinical terminology standards. Our research has identified a need for improved tooling to support analytics users in the task of analyzing Fast Healthcare Interoperability Resources (FHIR®) data and associated clinical terminology.Entities:
Keywords: Clinical terminology; Data analytics; FHIR; FHIRPath; Interoperability; SNOMED CT
Mesh:
Year: 2022 PMID: 36076268 PMCID: PMC9455941 DOI: 10.1186/s13326-022-00277-1
Source DB: PubMed Journal: J Biomed Semantics
Fig. 1Operations of a FHIR Analytics API
Fig. 2Import operation
Fig. 3Aggregate operation
Example Patient data set
| id | gender | deceasedBoolean | birthDate |
|---|---|---|---|
| 1 | male | false | 2013-06-10 |
| 2 | female | true | 1999-05-12 |
| 3 | male | true | 1955-07-08 |
| 4 | female | true | 1972-03-14 |
| 5 | other | false | 1981-03-27 |
| 6 | male | false | 1991-05-05 |
Example result from Aggregate Operation
| B | C | A |
|---|---|---|
| female | true | 2 |
| male | true | 1 |
| male | false | 1 |
| other | false | 1 |
Fig. 4Extract operation
Example Practitioner data set
| id | name | identifier |
|---|---|---|
| 1 | Ampelios Ajeet | [ { system: “urn:example:provider-id”, value: “ajeet” } ] |
| 2 | Leon Gautselin | [ { system: “urn:example:provider-id”, value: “gaut” } ] |
| 3 | Felicita Cyra | [ { system: “urn:example:provider-id”, value: “cyra” } ] |
Example MedicationRequest data set
| id | medicationCodeableConcept | subject | requester |
|---|---|---|---|
| 1 | [ { text: “Atorvastatin” } ] | Patient/4 | Practitioner/1 |
| 2 | [ { text: “Levothyroxine” } ] | Patient/4 | Practitioner/1 |
| 3 | [ { text: “Lisinopril” } ] | Patient/1 | Practitioner/3 |
| 4 | [ { text: “Metformin” } ] | Patient/2 | Practitioner/2 |
Example result from Extract operation
| D | E | F | G |
|---|---|---|---|
| 1 | 4 | ajeet | Atorvastatin |
| 2 | 4 | ajeet | Levothyroxine |
| 3 | 1 | cyra | Lisinopril |
| 4 | 2 | gaut | Metformin |
Example data set used with memberOf function
| id | code |
|---|---|
| 1 | 232850000 |Aortoventriculoplasty| |
| 2 | 63377001 |Open core needle biopsy of liver| |
| 3 | 428581004 |Percutaneous transluminal ablation of accessory pathway| |
| 4 | 25321000 |Thoracoscopic pneumonectomy| |
Fig. 5Implementation components
Fig. 6Experimental user interface for exploratory data analysis
Testing data sets generated with Synthea
| Data set # | Number of patients | Total resources | Size on disk (NDJSON) |
|---|---|---|---|
| 1 | 11 | 7,126 | 9.8 MB |
| 2 | 968 | 739,229 | 1.1 GB |
| 3 | 11,516 | 7,633,526 | 11.32 GB |
Performance test results
| Data set # | Mean cold execution time (ms) | Mean warm execution time (ms) |
|---|---|---|
| 1 | 5,164 | 2,394 |
| 2 | 5,150 | 2,565 |
| 3 | 7,317 | 4,725 |
COVID-19 vaccine codes example
| Code | Description |
|---|---|
| 212 | COVID-19 vaccine, vector-nr, rS-Ad26, PF, 0.5 mL |
| 207 | COVID-19, mRNA, LNP-S, PF, 100 mcg/0.5 mL dose |
| 208 | COVID-19, mRNA, LNP-S, PF, 30 mcg/0.3 mL dose |
High risk unvaccinated patients - aggregate query result
| Vaccinated against COVID-19 | High risk | Number of patients |
|---|---|---|
| true | false | 266 |
| true | true | 215 |
| false | false | 81 |
| false | true | 85 |
High risk unvaccinated patients - extract query result
| Family name | Given name | Phone number | Chronic kidney disease | Heart disease | BMI>30 |
|---|---|---|---|---|---|
| Abernathy524 | Kathline630 | 555-746-7353 | true | false | true |
| Bartell116 | Rhett759 | 555-257-6514 | false | false | true |
| Bashirian201 | Aldo414 | 555-300-9051 | false | false | true |
| Beahan375 | Neva514 | 555-809-1747 | false | false | true |
| Bednar518 | Chase54 | 555-812-1196 | false | false | true |
Performance comparison
| Description | Total execution time (s) |
|---|---|
| Pathling server | 90.490 |
| Pathling Python API | 27.769 |
| Python and Pandas | 61.606 |
Fig. 7Number of genomic test patients by encounter type, diagnosis verification status and diagnosis specificity relative to previous diagnoses (counts redacted)