| Literature DB >> 31774830 |
Maximilian König1,2, André Sander3, Ilja Demuth1,4, Daniel Diekmann3, Elisabeth Steinhagen-Thiessen1.
Abstract
OBJECTIVES: The secondary use of medical data contained in electronic medical records, such as hospital discharge letters, is a valuable resource for the improvement of clinical care (e.g. in terms of medication safety) or for research purposes. However, the automated processing and analysis of medical free text still poses a huge challenge to available natural language processing (NLP) systems. The aim of this study was to implement a knowledge-based best of breed approach, combining a terminology server with integrated ontology, a NLP pipeline and a rules engine.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31774830 PMCID: PMC6881027 DOI: 10.1371/journal.pone.0224916
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Basic structure of the discharge letters in the BASE-II study.
| Age, Year of birth, Sex |
| New diagnoses |
| Previous diagnoses |
| Medication |
| Results of physical examination |
| Results of neurological examination |
| Blood pressure |
| Addiction: smoking, alcohol |
| Geriatric assessment |
| Adjuvants |
| Laboratory values |
| Electrocardiogram (ECG) |
| Pulse wave analysis |
| Dual Energy X-ray Absorptiometry (DXA) |
| Bioelectric impedance analysis (BIA) |
| Spirometry |
| Audiometry |
| Eye refraction test |
| Tonometry |
| Depression screening |
| Discharge summary |
Description of corpus.
| Number of documents: | 1,982 |
| Total lines: | 184,022 |
| Average lines per document: | 93 |
| Total number of tokens: | 2,001,114 |
| Average tokens per document: | 1,010 |
| Number of unique tokens: | 57,745 |
| Average length of token: | 11 |
Fig 1Architecture of the SemDrugS approach.
All components used the terminology server and the included ontology in order to facilitate a semantic interpretation.
Fig 2Hierarchical representation of osteoporosis and PPI in the ontology.
Evaluation of the automated extraction versus gold standard.
| Osteoporosis/osteopenia | PPI | Osteoporosis/osteopenia | |
|---|---|---|---|
| 1298 | 145 | 87 | |
| 80 | 0 | 4 | |
| 34 | 3 | 3 | |
| 570 | 1834 | 1888 | |
| 97.45 (96.45–98.23) | 97.97 (94.19–99.58) | 96.67 (90.57–99.31) | |
| 94.19 (CI 92.42–95.58) | 100.00 (99.50–100.00) | 95.60 (93.92–96.84) | |
| 95.79 | 98.98 | 96.13 |
Notes: Data are given as N, or proportion and 95% confidence interval; N = 1982