Stefan Kropf, Alexandr Uciteli, Katrin Schierle, Peter Krücken, Kerstin Denecke, Heinrich Herre.
Abstract
BACKGROUND: Legacy data and new structured data can be stored in a standardized format as XML-based EHRs in XML databases. Querying documents in these databases is crucial for answering research questions. Instead of relying on free-text searches, which lead to false-positive results, precision can be increased by constraining the search to certain parts of documents.
Keywords: EHR query; Electronic health records; Information retrieval; Medical informatics applications; Pathology electronic health records; Query engineering; Search ontology
Year: 2018 PMID: 29751829 PMCID: PMC5946576 DOI: 10.1186/s13326-018-0180-2
Source DB: PubMed Journal: J Biomed Semantics
Fig. 1 Use case overview: search ontology-based XPath generation
NL description of the queries (→ “Search Ontology-based Pathology Questions (OWL)” section)
| Q | Question |
|---|---|
| Q1 | Prostatic carcinomas are found starting from how many grams of flake tissue? |
| Q2 | Prostatic carcinomas are found starting from how many capsules? What influence does the processing method (with/without remainder) have? |
| Q3 | How large are the leiomyomas of the uterus in the entry material? |
| Q4 | How many lymph node metastases occur in colon cancer at stage pT2? |
| Q5 | In how many esophageal biopsies is Barrett mucosa found? Exclude a certain negation expression^b (cave). |

^a Q0 serves only as a proof of concept [5]
^b 'ohne Nachweis einer Barrett-Schleimhaut' (English: without evidence of Barrett mucosa)
Fig. 2 Simplified XML-based pathology EHR snippet, containing a specimen, an overall interpretation and a macroscopic findings part
Fig. 3 One simple XPath example
Fig. 4 SO → SOX
Fig. 5 Overview search ontology
Fig. 6 Search ontology XML extension
Fig. 7 Overall process overview
Fig. 8 XML_Structure tree
DL-based description of the queries
| Q | DL-based description |
|---|---|
| Q1 | … |
| Q2 (without residual) | … |
| | ¬ … |
| Q2 (with residual) | … |
| Q3 | … |
| | … Interpretation … |
| Q4 | ( … |
| | … Overall_staging … |
| Q5 (numerator) | … |
| Q5 (denominator) | … |
^a Q0 serves only as a proof of concept [5]
The "in" relation was introduced in SOX. "X in Y" means that at least one instance of the Search_Term class X (bold) should occur in the section representing class Y.
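The semantics of "X in Y" can be illustrated with a minimal sketch. It is not the XPath generation described in the article: it uses plain substring matching on a hypothetical, heavily simplified record, and the element names (`report`, `clinical_question`, `macroscopy`, `interpretation`) as well as the helper names are assumptions made for illustration only.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified record; all element names are assumptions.
RECORD = """
<report>
  <clinical_question>Suspected carcinoma of the prostate</clinical_question>
  <macroscopy>Flake tissue, 12 g</macroscopy>
  <interpretation>Benign prostatic hyperplasia</interpretation>
</report>
"""


def free_text_match(root, terms):
    """Return True if any term occurs anywhere in the document text."""
    text = "".join(root.itertext()).lower()
    return any(term.lower() in text for term in terms)


def term_in_section(root, terms, section_tag):
    """'X in Y': True if any term of X occurs inside a section tagged Y."""
    for section in root.iter(section_tag):
        text = "".join(section.itertext()).lower()
        if any(term.lower() in text for term in terms):
            return True
    return False


root = ET.fromstring(RECORD)
carcinoma_terms = ["carcinoma", "adenocarcinoma"]

# The free-text search hits the clinical question, where carcinoma is only suspected.
free_hit = free_text_match(root, carcinoma_terms)
# Constraining the search to the interpretation section avoids that hit.
section_hit = term_in_section(root, carcinoma_terms, "interpretation")

print("free text:", free_hit)
print("carcinoma in interpretation:", section_hit)
```

In this toy record the unconstrained search reports a match although the interpretation section does not mention a carcinoma, which is the kind of false positive the section constraint is meant to exclude.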
Fig. 9 Class Quest1_ProstateCancerGramCorrelation
Fig. 10 OWL Class Quest1_ProstateCancerGramCorrelation
Overview of the evaluation results
| Question | Retrieved PEHRs | (Partly) enumerated | ECRI false positives | PQCRI false positives |
|---|---|---|---|---|
| Q1 | 36 | 5 | 1 | 0 |
| Q2 (without residual) | 18 | 6 | 0 | 0 |
| Q2 (with residual) | 9 | 2 | 0 | 0 |
| Q3 | 153 | 67 | 1 | 60 |
| Q4 | 4 | 4 | n/a^b | |
| Q5 (denominator) | 902 | 632 | | |
| Sum^g | 1134 | 725 | 2 | 60 |

^a Q0 serves only as a proof of concept [5]
^b Not structured by an enumeration list; TNM classification codes are used
^c PQCRI cannot occur because no units are used in this query
^d ECRI cannot occur because in this type of PEHR the specimen tissue section was not structured by an enumeration list
^e Not part of the column sum because Q5 (denominator) already contains the Q5 (numerator) records
^f Evaluation can be skipped because Q5 (denominator) already contains the Q5 (numerator) records
^g Without Q5 (numerator)
The second column gives the number of retrieved PEHRs, the third the amount of numbered content, the fourth the number of false positives caused by the ECRI, and the fifth the number of false positives caused by the PQCRI.
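As a rough reading aid, the counts above can be turned into false-positive shares. The sketch below is plain arithmetic on the table values, not a figure reported in the source. It assumes that ECRI and PQCRI false positives do not overlap and that no other false positives exist, and it uses only rows in which all counts are legible. The function name and data layout are hypothetical.

```python
# Rows copied from the evaluation table (only rows with all counts legible).
# Tuple layout: (retrieved PEHRs, ECRI false positives, PQCRI false positives)
ROWS = {
    "Q1": (36, 1, 0),
    "Q2 (without residual)": (18, 0, 0),
    "Q2 (with residual)": (9, 0, 0),
    "Q3": (153, 1, 60),
    "Sum": (1134, 2, 60),
}


def false_positive_share(retrieved, ecri_fp, pqcri_fp):
    """Share of retrieved records counted as false positives.

    Assumes the two causes are disjoint and exhaustive (an assumption,
    not something the table states).
    """
    return (ecri_fp + pqcri_fp) / retrieved


shares = {q: false_positive_share(*counts) for q, counts in ROWS.items()}
for question, share in shares.items():
    print(f"{question}: {share:.1%} false positives, {1 - share:.1%} remaining")
```

Under these assumptions, 61 of the 62 false positives in the Sum row fall on Q3, almost all of them attributed to the PQCRI.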
Answers to the NL questions based on the dataset of 68,583 PEHRs, as interpreted by the ontologist
| Q | Answer |
|---|---|
| Q1 | The least weight was 3 … was 38 … found. The average weight was ≈18.26 … |
| Q2 (without residual) | At least 2, at most 26 … rest. On average 9.28 … |
| Q2 (with residual) | At least 6, at most 10 … On average ≈9.55 … |
| Q3 | ≈2.76 … on average, … |
| Q4 | In four found cases^a 0.5 … cancer in stage pT2 on average. |
| Q5 | In 83.81 … mucosa has been found. |

^a (1/1), (1/1), (0/41), (0/19)
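The Q4 average of 0.5 can be checked against the four pairs in footnote a. The sketch below is a hedged reading: the source does not say what the two numbers in each pair denote, so the code assumes a "first / second" count pair (for example positive versus examined lymph nodes) and simply compares three ways of averaging. The helper name `parse_pairs` is hypothetical.

```python
from fractions import Fraction

# Footnote a of the answers table, copied verbatim.
FOOTNOTE = "(1/1), (1/1), (0/41), (0/19)"


def parse_pairs(text):
    """Parse '(a/b), (c/d), ...' into a list of (a, b) integer pairs."""
    pairs = []
    for chunk in text.split(","):
        first, second = chunk.strip().strip("()").split("/")
        pairs.append((int(first), int(second)))
    return pairs


pairs = parse_pairs(FOOTNOTE)

# Mean of the first numbers over the four cases.
mean_first = Fraction(sum(a for a, _ in pairs), len(pairs))
# Mean of the per-case ratios.
mean_ratio = sum(Fraction(a, b) for a, b in pairs) / len(pairs)
# Pooled ratio over all cases together.
pooled = Fraction(sum(a for a, _ in pairs), sum(b for _, b in pairs))

print("mean of first numbers:", mean_first)
print("mean of per-case ratios:", mean_ratio)
print("pooled ratio:", pooled)
```

Both per-case averages equal 1/2, matching the 0.5 in the table, so the footnote alone does not reveal which of the two was meant. The pooled ratio (2 of 62) is much smaller, which suggests the reported value is an average over cases rather than over all counted items.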