| Literature DB >> 31859675 |
Romain Lelong1,2, Lina F Soualmia1,2,3, Julien Grosjean1,3, Mehdi Taalba1, Stéfan J Darmoni1,3.
Abstract
BACKGROUND: The huge amount of clinical, administrative, and demographic data recorded and maintained by hospitals can be consistently aggregated into health data warehouses with a uniform data model. In 2017, Rouen University Hospital (RUH) initiated the design of a semantic health data warehouse enabling both semantic description and retrieval of health information.Entities:
Keywords: clinical trial; data warehousing; patient selection; search engine; semantics
Year: 2019 PMID: 31859675 PMCID: PMC6942180 DOI: 10.2196/13917
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Functional coverage of the semantic health data warehouse in terms of data according to each domain. SHDW: semantic health data warehouse; CPOE: computerized physician order entry; DCC: French cancer communication file; PMR: personal medical record; DRG: diagnosis-related group; EHR: electronic health record.
Figure 2Functional architecture of the semantic health data warehouse that provides semantic information retrieval (IR) functionalities form clinical data. The 2 data repositories, knowledge data and health data, respectively, maintain the reference knowledge organization systems and the health data pertaining to the semantic health data warehouse. These data are accessed through a not only structured query language (NoSQL) layer by the 3 distinct components: the cross-terminological health terminology and ontology portal (HeTOP), the semantic annotator extracting concepts from multiple terminologies (ECMT), and the semantic search engine (SSE), each operating on a different range of data. CN: clinical narrative; T&O: terminology and ontology.
Figure 3Partial Conceptual Model of the semantic health data warehouse represented as a directed and attributed graph. Entities corresponding to elements from terminologies and ontologies are represented with dashed outlines. DRG: diagnosis-related group; PIN: personal identification number.
Figure 4The interface of the semantic access to health information, ASIS, Web application, and its 4 steps: (1) definition of constraints, (2) composition of a Boolean query from atomic constraint defined in step 1, (3) selection of the desired output entity according to its clinical coherent level, and (4) visualization of the results.
Number, percentage, and 95% confidence interval of the percentage of criteria for each support level and type (inclusion or exclusion).
| Support level | Inclusion criteria | Exclusion criteria | Total | |||
| n (%) | 95% CI | n (%) | 95% CI | n (%) | 95% CI | |
| Full | 6 (16.67) | (4.5-28.8) | 5 (8.47) | (1.4-15.6) | 11 (11.58) | (5.1-18.0) |
| Accurate | 3 (8.33) | (0.0-17.4) | 15 (25.42) | (14.3-36.5) | 18 (18.95) | (11.1-26.8) |
| Broad | 6 (16.67) | (4.5-28.8) | 19 (32.20) | (20.3-44.1) | 25 (26.32) | (17.5-35.2) |
| Inaccurate | 4 (11.11) | (0.8-21.4) | 6 (10.17) | (2.5-17.9) | 10 (10.53) | (4.4-16.7) |
| None | 3 (8.33) | (0.0-17.4) | 7 (11.86) | (3.6-20.1) | 10 (10.53) | (4.4-16.7) |
| Not applicable | 14 (38.89) | (23.0-54.8) | 7 (11.86) | (3.6-20.1) | 21 (22.10) | (13.8-30.4) |
| Total | 36 (100.00) | —a | 59 (100.00) | — | 95 (100.00) | — |
aNot applicable.
Number of criteria of each support level according to the combination of sources necessary to search them. The sources of information are as follows: P: patient data, D: diagnoses-related group data, S: stay data, B: biological data, N: clinical narrative, and I: other information.
| Support level |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Full | 4 | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Accurate | 1 | 0 | 0 | 8 | 0 | 0 | 0 | 1 | 3 | 0 | 5 | 0 | 0 | 0 | 0 |
| Broad | 0 | 2 | 2 | 6 | 7 | 0 | 1 | 0 | 0 | 1 | 4 | 0 | 0 | 2 | 0 |
| Inaccurate | 0 | 0 | 0 | 0 | 5 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 1 | 1 |
| None | 0 | 0 | 0 | 0 | 2 | 3 | 0 | 0 | 0 | 0 | 3 | 1 | 0 | 1 | 0 |
| Not applicable | 0 | 0 | 0 | 0 | 0 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Total | 5 | 2 | 9 | 14 | 14 | 24 | 1 | 1 | 3 | 2 | 13 | 1 | 1 | 4 | 1 |
Figure 5The central gray band gives the percentage of criteria of each support level excluding not applicable criteria. The upper bars show, for each support level, the percentages of involvement of each source of information in the search of criteria. The lower bars show the distribution (in percentage) of the different obstacle categories identified as lowering the effectiveness of the search of criteria. CN: clinical narrative; DRG: diagnosis-related group.