| Literature DB >> 33898938 |
Madeleine Kittner1, Mario Lamping2,3, Damian T Rieke2,3,4, Julian Götze5, Bariya Bajwa5, Ivan Jelas3, Gina Rüter3, Hanjo Hautow1, Mario Sänger1, Maryam Habibi1, Marit Zettwitz3, Till de Bortoli3, Leonie Ostermann5, Jurica Ševa1, Johannes Starlinger1, Oliver Kohlbacher6,7,8,9, Nisar P Malek5, Ulrich Keilholz3, Ulf Leser1.
Abstract
OBJECTIVE: We present the Berlin-Tübingen-Oncology corpus (BRONCO), a large and freely available corpus of shuffled sentences from German oncological discharge summaries annotated with diagnosis, treatments, medications, and further attributes including negation and speculation. The aim of BRONCO is to foster reproducible and openly available research on Information Extraction from German medical texts.Entities:
Keywords: German language; corpus annotation; medical information extraction
Year: 2021 PMID: 33898938 PMCID: PMC8054032 DOI: 10.1093/jamiaopen/ooab025
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Exemplary excerpts from original discharge summaries and annotations, shown in BRAT visualization. Attributes in brackets have the following meaning: laterality right (R), negated entity (negative), speculative entity (speculative), and entity possible in the future (possibleFuture). Additionally, codes resulting from entity normalization are given in brackets.
Figure 2.Annotation procedure including deidentification, annotation of section titles, and annotation of medical entities with attributes. Altogether, 1 annotation leader and 9 medical annotators were involved in different parts of the process.
Figure 3.Visualization of mismatches between annotations of 2 annotators, shown in BRAT visualization. (A) One of the annotations misses Laterality R and (B) “Oberbauchsonographie” (sonography of the upper abdomen) is annotated only by 1 annotator and “Ausschluss von Leberfiliae” (exclusion of liver metastasis) is annotated with different text spans and only once with attribute possibleFuture.
Interannotator agreement (IAA) calculated as microaveraged phrase-level F1 for 2 corpus sets annotated by 2 groups of annotators (A, B)
| Group A | Group B | |||||
|---|---|---|---|---|---|---|
| Annotation type | No. of instances | Text span | Code/attribute | No. of instances | Text span | Code/attribute |
| Diagnosis | 734 | 0.88 (0.94) | 0.84 | 2860 | 0.69 (0.79) | 0.54 |
| Treatment | 522 | 0.81 (0.93) | 0.73 | 1730 | 0.66 (0.77) | 0.47 |
| Medication | 300 | 0.94 (0.96) | 0.90 | 927 | 0.87 (0.92) | 0.75 |
| Laterality | 104 | – | 0.75 | 452 | – | 0.53 |
| Negation | 76 | – | 0.81 | 319 | – | 0.50 |
| Speculation | 81 | – | 0.69 | 288 | – | 0.44 |
| Possible Future | 37 | – | 0.68 | 244 | – | 0.37 |
Note: IAA was calculated before conflict resolution. For text spans, IAA is also given as (token level) Cohen’s κ in paratheses. Number of double annotated documents: group A (32) and group B (113).
Frequency of annotated medical entities and attributes in BRONCO and its 2 subsets, together with general statistics
| Annotation type | BRONCO150 | BRONCO50 | BRONCO complete | Unique instances |
|---|---|---|---|---|
| Diagnosis | 4080 | 1165 | 5245 | 2394 |
| Treatment | 3050 | 816 | 3866 | 1101 |
| Medication | 1630 | 383 | 2013 | 532 |
| Total medical entities | 8760 | 2364 | 11 124 | – |
| Laterality | 1033 | 223 | 1256 | |
| Negation | 503 | 127 | 630 | |
| Speculation | 474 | 139 | 613 | |
| Possible future | 479 | 140 | 619 | |
| Total attributes | 2489 | 629 | 3118 | |
| #Documents | 150 | 50 | 200 | |
| #Sentences | 8976 | 2458 | 11 434 | |
| #Tokens | 70 572 | 19 370 | 89 942 |
Note: Unique instances are the number of unique mentions within the complete corpus.
Figure 4.Distribution of documents per cluster after hierarchical clustering of sentences in BRONCO150.
Performance for baseline methods for NEN and NER (CRF and LSTM-CRF) with and without pretrained word embeddings (WE)
| Annotation type | Task | Method | BRONCO150 | BRONCO50 | ||||
|---|---|---|---|---|---|---|---|---|
| P | R | F1 | P | R | F1 | |||
| Diagnosis | NER | CRF |
|
|
|
|
|
|
| CRF+WE | 0.782(0.006) | 0.70(0.02) | 0.74(0.01) | 0.77 |
| 0.71 | ||
| LSTM | 0.75(0.03) | 0.69(0.03) | 0.72(0.01) | 0.78 | 0.65 | 0.71 | ||
| LSTM+WE |
|
|
|
| 0.65 |
| ||
| NEN | Dictionary lookup | 0.58 | 0.54 | 0.56 | 0.54 | 0.50 | 0.52 | |
| Treatment | NER | CRF |
| 0.78(0.01) |
| 0.83 |
|
|
| CRF+WE | 0.85(0.02) | 0.78(0.01) | 0.81(0.01) |
| 0.73 |
| ||
| LSTM | 0.83(0.04) |
| 0.81(0.02) |
| 0.69 | 0.76 | ||
| LSTM+WE | 0.85(0.06) | 0.82(0.07) | 0.84(0.06) | 0.76 | 0.74 | 0.75 | ||
| NEN | Dictionary lookup | 0.18 | 0.13 | 0.15 | 0.15 | 0.12 | 0.13 | |
| Medication | NER | CRF |
| 0.85(0.02) |
| 0.94 |
|
|
| CRF+WE | 0.96(0.004) | 0.84(0.009) | 0.90(0.006) |
| 0.85 | 0.90 | ||
| LSTM | 0.91(0.05) |
| 0.88(0.02) |
| 0.85 | 0.89 | ||
| LSTM+WE | 0.96(0.02) |
|
| 0.91 |
| 0.90 | ||
| NEN | Dictionary lookup | 0.66 | 0.68 | 0.67 | 0.64 | 0.69 | 0.66 | |
Note: Results for BRONCO150 are averaged over 5-fold with standard deviation in brackets. Best (highest) values per entity type, corpus, and w/o WE are bold.
Negation and speculation detection of entities using NegEx with 2 lists of German trigger terms: Chapman et al and Cotik et al
| BRONCO150 | BRONCO50 | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Annotation type | Trigger list | #GSC | P | R | F1 | #GSC | P | R | F1 |
| Negation | Chapman | 503 | 0.57 | 0.35 | 0.44 | 127 | 0.45 | 0.31 | 0.37 |
| Cotik | 503 |
|
|
| 127 |
|
|
| |
| Speculation | Chapman | 474 | 0.13 | 0.01 | 0.02 | 139 | 0.26 | 0.06 | 0.09 |
| Cotik | 474 |
|
|
| 139 |
|
|
| |