| Literature DB >> 25600078 |
Xiao-Ou Ping1, Yufang Chung, Yi-Ju Tseng, Ja-Der Liang, Pei-Ming Yang, Guan-Tarn Huang, Feipei Lai.
Abstract
BACKGROUND: Because of the increased adoption rate of electronic medical record (EMR) systems, more health care records have been increasingly accumulating in clinical data repositories. Therefore, querying the data stored in these repositories is crucial for retrieving the knowledge from such large volumes of clinical data.Entities:
Keywords: clinical practice guideline; electronic medical records; information retrieval query processing; ontology engineering; query languages
Year: 2013 PMID: 25600078 PMCID: PMC4288233 DOI: 10.2196/medinform.2519
Source DB: PubMed Journal: JMIR Med Inform
Figure 1The overview of the methodology used in the data-querying tool based on ontology-driven methodology and flowchart-based model.
Figure 2The architecture of the FBDQM (flowchart-based data-querying model)-based query execution engine.
Figure 3The flowchart-based data-querying model (FBDQM) containing the structure of query task workflow and the information of each node in the query task.
Figure 4The query criteria selection interface and the GLIF3.5 ontology information viewer.
Figure 5The flexibility in selecting all or partial nodes of the flowchart for participating in the execution of query operation.
Figure 6The query criteria of the selected nodes can be displayed in the query criteria selection interface.
Figure 7The GLIF3.5 ontology information viewer. The selected node, “degree decision,” and its corresponding information in GLIF3.5 format.
The examples of the query criteria in GLIF3.5 and the corresponding translated SQL queries.
| Query criteria format | Query criteria |
| GLIF3.5 | 1. ICD9=155.0 |
| Translated SQL queries | 1. select Personal_ID from Diagnosis where ICD9_Code=“155.0” |
| GLIF3.5 | 2. at least 2 of (Ascites==“Controllable,” ICG is within 15 to 40, Prothrombin_Activity is within 50 to 80, Serum_Albumin is within 3.0 to 3.5, Serum_Bilirubin is within 2.0 to 3.0) |
| Translated SQL queries | 2.1. select Personal_ID from Laboratory where Result_String=“Controllable” and Item_Name=“Ascites” |
Figure 8The clinical data representation interface.
Figure 9The clinical data representation interface showing the retrieved results of “degree of liver damage when applying a mutually exclusive setting” query task.
Figure 10The clinical data representation interface showing the retrieved results of “treatments of liver cancer” query task.
The distribution numbers of patients in four datasets that are randomly generated by the clinical data generator.
| Dataseta | Degree of liver damage | Degree of liver damage when applying a mutually exclusive setting | Treatments for liver cancer |
| #1 | Degree A: 1/10 | Degree A: 4/10 | LTb: 5/10 |
|
| Degree B: 3/10 | Degree B: 7/10 | TACEc: 2/10 |
|
| Degree C: 6/10 | Degree C: 6/10 | RFAd: 0/10 |
|
|
|
| AIe: 3/10 |
|
|
|
| SRf: 0/10 |
| #2 | Degree A: 11/100 | Degree A: 52/100 | LTb: 22/100 |
|
| Degree B: 36/100 | Degree B: 60/100 | TACEc: 23/100 |
|
| Degree C: 53/100 | Degree C: 53/100 | RFAd: 21/100 |
|
|
|
| AIe: 19/100 |
|
|
|
| SRf:15/100 |
| #3 | Degree A: 129/1000 | Degree A: 555/1000 | LTb: 195/1000 |
|
| Degree B: 348/1000 | Degree B: 549/1000 | TACEc: 202/1000 |
|
| Degree C: 523/1000 | Degree C: 523/1000 | RFAd: 191/1000 |
|
|
|
| AIe: 206/1000 |
|
|
|
| SRf: 206/1000 |
| #4 | Degree A: 1258/10,000 | Degree A: 5298/10,000 | LTb: 1984/10,000 |
|
| Degree B: 3409/10,000 | Degree B: 5477/10,000 | TACEc: 1970/10,000 |
|
| Degree C: 5333/10,000 | Degree C: 5333/10,000 | RFAd: 2079/10,000 |
|
|
|
| AIe: 1980/10,000 |
|
|
|
| SRf: 1987/10,000 |
aThe datasets #1, #2, #3, and #4 are regarded as the datasets with different numbers of patients, including 10, 100, 1000, and 10,000 patients.
bLT: Liver transplantation.
cTACE: Transarterial embolization and chemoembolization.
dRFA: Radiofrequency ablation.
eAI: Alcohol injection.
fSR: Surgical resection.
The performance in time of the system in experiment with three query tasks.
| Item (patient number) | Degree of liver damage (seconds) | Degree of liver damage when applying a mutually exclusive setting (seconds) | Treatments for liver cancer (seconds) |
| SQL operations | 93.82% (1.427) | 92.60% (1.377) | 95.76% (0.474) |
| Criteria verification | 0.46% (0.007) | 0.34% (0.005) | 0.61% (0.003) |
| Other tasks | 5.72% (0.087) | 7.06% (0.105) | 3.64% (0.018) |
| Total (10) | 100% (1.521) | 100% (1.487) | 100% (0.495) |
| SQL operations | 93.82% (1.623) | 93.45% (1.542) | 95.99% (0.598) |
| Criteria verification | 0.75% (0.013) | 0.55% (0.009) | 0.96% (0.006) |
| Other tasks | 5.43% (0.094) | 6.00% (0.099) | 3.05% (0.019) |
| Total (100) | 100% (1.730) | 100% (1.650) | 100% (0.623) |
| SQL operations | 85.13% (2.221) | 84.22% (1.985) | 70.90% (0.675) |
| Criteria verification | 11.46% (0.299) | 11.50% (0.271) | 26.79% (0.255) |
| Other tasks | 3.41% (0.089) | 4.29% (0.101) | 2.31% (0.022) |
| Total (1000) | 100% (2.609) | 100% (2.357) | 100% (0.952) |
| SQL operations | 22.16% (8.124) | 24.75% (8.076) | 11.95% (2.750) |
| Criteria verification | 77.60%(28.455) | 74.97% (24.461) | 87.96%(20.248) |
| Other tasks | 0.24% (0.087) | 0.28% (0.092) | 0.10% (0.022) |
| Total (10,000) | 100% (36.666) | 100% (32.630) | 100% (23.020) |
Figure 11The performances of the system based on the three query tasks in four experiments with different number of patients, including a) degree of liver damage, b) degree of liver damage when applying a mutually exclusive setting, and c) treatments for liver cancer.