| Literature DB >> 29801178 |
Georg Dietrich, Jonathan Krebs, Georg Fette, Maximilian Ertl, Mathias Kaspar, Stefan Störk, Frank Puppe.
Abstract
BACKGROUND: Clinical Data Warehouses (CDW) reuse Electronic health records (EHR) to make their data retrievable for research purposes or patient recruitment for clinical trials. However, much information are hidden in unstructured data like discharge letters. They can be preprocessed and converted to structured data via information extraction (IE), which is unfortunately a laborious task and therefore usually not available for most of the text data in CDW.Entities:
Mesh:
Year: 2018 PMID: 29801178 PMCID: PMC6193399 DOI: 10.3414/ME17-02-0010
Source DB: PubMed Journal: Methods Inf Med ISSN: 0026-1270 Impact factor: 2.176
Example for pre and post negating triggers.
|
|
|
|
|
Für eine pulmonale Metastasierung ergaben sich
|
Example for a context-sensitive query.
|
|
| |
|---|---|---|
|
|
| (4) |
|
|
| (5) |
Example of the regular expression feature for querying (6), constraining (7) and extracting (8) a numeric concept (Puls = pulse, ZAHL = NUMBER). “$1” is a reference to the extracted concept (the first expression in round parentheses or its equivalent predefined class, i.e. “ZAHL”).
|
|
| |
|---|---|---|
|
|
| (6) |
|
|
| (7) |
|
|
| (8) |
Performance of the negation detection of medical concepts in the two domains.
|
|
| |
|---|---|---|
|
| 100 | 50 |
|
| 619 | 397 |
|
| 608 | 366 |
|
| 1 | 1 |
|
| 11 | 31 |
|
| 0.998 | 0.997 |
|
| 0.982 | 0.922 |
|
| 0.990 | 0.958 |
Error analysis of wrong classified concepts in the negation detection.
|
|
|
| |
|---|---|---|---|
|
| 3 (0.25) | 2 (0.06) | 5 (0.12) |
|
| 1 (0.08) | 0 (0.00) | 1 (0.02) |
|
| 3 (0.25) | 5 (0.16) | 8 (0.19) |
|
| 5 (0.42) | 24 (0.77) | 29 (0.67) |
Performance of the retrieval of the negated scopes and their length in discharge letters.
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
| TP | FP | FN | Precision | Recall | F1 | Exact | Too Narrow | Too Wide |
| 348 | 4 | 6 | 0.989 | 0.983 | 0.986 | 318 (0.91) | 2 (0.01) | 28 (0.08) |
Error analysis of wrongly determined negation scope in discharge letters.
|
| |
|---|---|
|
| 8 (0.28) |
|
| 3 (0.10) |
|
| 5 (0.17) |
|
| 6 (0.21) |
|
| 7 (0.24) |
Performance of Boolean ad hoc information extraction using the context sensitive query feature for medical concepts: (1) mild mitral insufficiency, (2) high mitral insufficiency and (3) mild aortic stenosis.
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|
|
|
| echocardiography | 0 | 7 | 304 | 0.977 | 1 | 0.987 |
|
|
| echocardiography | 0 | 0 | 14 | 1 | 1 | 1 |
|
|
| echocardiography | 0 | 3 | 160 | 0.982 | 1 | 0.991 |
Performance of numeric ad hoc information extraction using the regex query feature for the medical concepts: (1) Cholesterol, (2) Glucose, (3) BMI, (4) LVEF, and (5) age.
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|
|
|
| discharge letter | 0 | 2 | 158 | 0.988 | 1 | 0.994 |
|
|
| discharge letter | 0 | 6 | 336 | 0.982 | 1 | 0.991 |
|
|
| discharge letter | 0 | 0 | 44 | 1 | 1 | 1 |
|
|
| Echocardiography report | 6 | 0 | 452 | 1 | 0.987 | 0.993 |
|
|
| discharge letter | 136 | 4 | 49 | 0,93 | 0,27 | 0,41 |
Error analysis of ad hoc information extraction of the concept age in the entire discharge letter.
|
|
| |
|---|---|---|
|
| 18 | 0.13 |
|
| 15 | 0.11 |
|
| 81 | 0.60 |
|
| 18 | 0.13 |
|
| 4 | 0.03 |
Comparison of F1-scores to other negation detection approaches for German clinical texts.
|
|
|
|
|
|---|---|---|---|
| Discharge letter | 0.91 | Discharge letter | 0.96 |
| Clinical notes | 0.96 | Chest X-ray | 0.99 |
Comparison of F1-scores to other negation scope length determination approaches for German clinical texts.
|
|
| |
|---|---|---|
|
| cardiology report | discharge letters |
|
| 0.54 | 0.91 |
|
| 0.34 | 0.01 |
|
| 0.12 | 0.08 |
Comparison between ad hoc information extraction and standard IE.
|
|
| |
|---|---|---|
|
| specific concept | entire domain |
|
| low | high |
|
| fast | slow |
|
| yes | no |
|
| lower | higher |
|
| low | high |