Structuring Legacy Pathology Reports by openEHR Archetypes to Enable Semantic Querying.

Stefan Kropf, Peter Krücken, Wolf Mueller, Kerstin Denecke.

Abstract

BACKGROUND: Clinical information is often stored as free text, e.g. in discharge summaries or pathology reports. These documents are semi-structured by section headers, numbered lists, items and classification strings. However, retrieving relevant documents remains challenging, since keyword searches applied to complete, unstructured documents return many false positives.
OBJECTIVES: We concentrate on the processing of pathology reports as an example of unstructured clinical documents. The objective is to transform reports semi-automatically into an information structure that enables improved access to and retrieval of relevant data. The data are to be stored in a standardized, structured way so that they are accessible to queries applied to specific sections of a document (section-sensitive queries) and available for information reuse.
METHODS: Our processing pipeline comprises information modelling, section boundary detection and section-sensitive queries. To enable focused search in unstructured data, documents are automatically structured and transformed into a patient information model specified through openEHR archetypes. The resulting XML-based pathology electronic health records (PEHRs) are queried with XQuery and rendered as HTML with XSLT.
RESULTS: Pathology reports (PRs) can be reliably structured into sections by a keyword-based approach. Information modelling with openEHR saves time in the modelling process, since many archetypes can be reused. The resulting standardized, structured PEHRs give access to relevant data by retrieving the data that match user queries.
CONCLUSIONS: Mapping unstructured reports into a standardized information model is a practical solution for better access to data. Archetype-based XML enables section-sensitive retrieval and visualisation by well-established XML techniques. Focusing retrieval on particular sections has the potential to save retrieval time and improve retrieval accuracy.
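
ILLUSTRATION: The abstract does not give the keyword list, the archetype XML layout or the XQuery expressions used, so the following is only a minimal sketch of the general idea, under stated assumptions. The header keywords, the `<report>`/`<section>` element names and the two sample reports are hypothetical, and Python's standard-library ElementTree path queries stand in for openEHR-based XML and XQuery. The sketch splits a report at keyword headers and then shows how restricting a search to one section removes a false positive that a full-text search would return.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical header keywords mapped to section names (not from the paper).
SECTION_KEYWORDS = {
    "Clinical information": "clinical_information",
    "Microscopy": "microscopy",
    "Diagnosis": "diagnosis",
}
_LOOKUP = {k.lower(): v for k, v in SECTION_KEYWORDS.items()}
HEADER_RE = re.compile(
    r"^\s*(" + "|".join(re.escape(k) for k in SECTION_KEYWORDS) + r")\s*:\s*(.*)$",
    re.IGNORECASE,
)


def split_into_sections(report_text):
    """Return (section_name, text) pairs, using keyword headers as boundaries."""
    sections, name, lines = [], None, []
    for line in report_text.splitlines():
        match = HEADER_RE.match(line)
        if match:
            if name is not None:  # close the previous section
                sections.append((name, " ".join(lines).strip()))
            name = _LOOKUP[match.group(1).lower()]
            lines = [match.group(2)]
        elif name is not None and line.strip():
            lines.append(line.strip())  # continuation of the current section
    if name is not None:
        sections.append((name, " ".join(lines).strip()))
    return sections


def to_xml(sections):
    """Wrap the sections in a simple, hypothetical XML structure."""
    root = ET.Element("report")
    for name, text in sections:
        ET.SubElement(root, "section", {"name": name}).text = text
    return root


def search_section(reports, section_name, term):
    """Return ids of reports whose named section contains the term."""
    hits = []
    for report_id, root in reports.items():
        for el in root.findall(f"./section[@name='{section_name}']"):
            if term.lower() in (el.text or "").lower():
                hits.append(report_id)
                break
    return hits


# Two invented sample reports for demonstration only.
REPORT_A = (
    "Clinical information: Suspected carcinoma.\n"
    "Microscopy: Regular mucosa.\n"
    "Diagnosis: No evidence of malignancy."
)
REPORT_B = "Clinical information: Follow-up.\nDiagnosis: Invasive ductal carcinoma."
RAW = {"A": REPORT_A, "B": REPORT_B}
REPORTS = {rid: to_xml(split_into_sections(text)) for rid, text in RAW.items()}

full_text_hits = [rid for rid, text in RAW.items() if "carcinoma" in text.lower()]
diagnosis_hits = search_section(REPORTS, "diagnosis", "carcinoma")
print(full_text_hits)   # ['A', 'B']
print(diagnosis_hits)   # ['B']
```

In this toy setting the full-text search returns report A only because the term appears in the clinical question, whereas the query restricted to the diagnosis section returns report B alone.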

Keywords:  Standardized electronic health record; electronic health record system; information retrieval; medical informatics applications; openEHR; section boundary detection

Year:  2017        PMID: 28244546     DOI: 10.3414/ME16-01-0073

Source DB:  PubMed          Journal:  Methods Inf Med        ISSN: 0026-1270            Impact factor:   2.176

