| Literature DB >> 29888070 |
Yifan Peng1, Xiaosong Wang2, Le Lu2, Mohammadhadi Bagheri2, Ronald Summers2, Zhiyong Lu1.
Abstract
Negative and uncertain medical findings are frequent in radiology reports, but discriminating them from positive findings remains challenging for information extraction. Here, we propose a new algorithm, NegBio, to detect negative and uncertain findings in radiology reports. Unlike previous rule-based methods, NegBio utilizes patterns on universal dependencies to identify the scope of triggers that are indicative of negation or uncertainty. We evaluated NegBio on four datasets, including two public benchmarking corpora of radiology reports, a new radiology corpus that we annotated for this work, and a public corpus of general clinical texts. Evaluation on these datasets demonstrates that NegBio is highly accurate for detecting negative and uncertain findings and compares favorably to a widely-used state-of-the-art system NegEx (an average of 9.5% improvement in precision and 5.1% in F1-score). AVAILABILITY: https://github.com/ncbi-nlp/NegBio.Entities:
Year: 2018 PMID: 29888070 PMCID: PMC5961822
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1.An overall pipeline of NegBio.
Number of findings in OpenI and ChestX-ray.
| Finding | OpenI | ChestX-ray |
|---|---|---|
| Atelectasis | 315 | 311 |
| Cardiomegaly | 345 | 202 |
| Consolidation | 30 | 79 |
| Edema | 42 | 43 |
| Effusion | 155 | 381 |
| Emphysema | 103 | 54 |
| Fibrosis | 23 | 15 |
| Hernia | 46 | 2 |
| Infiltration | 60 | 383 |
| Mass | 15 | 114 |
| Nodule | 106 | 154 |
| Pleural Thickening | 52 | 52 |
| Pneumonia | 40 | 62 |
| Pneumothorax | 22 | 279 |
| 1,354 | 2,131 |
Figure 2.The dependency graph of (a) “Lungs are clear of acute infiltrates or pleural effusion”, (b) “There is no evidence of tuberculous disease, and (c) “Definite infiltrate is not excluded”.
Descriptions of OpenI, ChestX-ray, BioScope, and PK.
| Dataset | Reports | Positives | Negatives |
|---|---|---|---|
| OpenI | 3,851 | 1,354 | – |
| ChestX-ray | 900 | 2,131 | – |
| BioScope test set | 977 | – | 466 |
| PK | 116 | – | 491 |
Evaluation results on OpenI, ChestX-ray using (1) MetaMap, (2) MetaMap and NegEx, and (3) MetaMap and NegBio. Performance is measured by precision (P), recall (R), and F1–score (F) on positive findings.
| Method | OpenI | ChestX-ray | ||||
|---|---|---|---|---|---|---|
| P | R | F | P | R | F | |
| MetaMap | 13.8 | 23.8 | 72.3 | 82.4 | ||
| MetaMap+NegEx | 77.2 | 84.6 | 80.7 | 82.8 | 95.5 | 88.7 |
| MetaMap+NegBio | 85.0 | 94.4 | ||||
Evaluation results on BioScope and PK using NegEx and NegBio. Performance is measured by precision (P), recall (R), and F1–score (F) on negations.
| Method | BioScope | PK | ||||
|---|---|---|---|---|---|---|
| P | R | F | P | R | F | |
| NegEx | 70.6 | 82.3 | 95.1 | 93.1 | ||
| NegBio | 95.7 | 88.6 | ||||