| Literature DB >> 25343498 |
Yan Xu1, Ji Hua2, Zhaoheng Ni2, Qinlang Chen2, Yubo Fan2, Sophia Ananiadou3, Eric I-Chao Chang4, Junichi Tsujii4.
Abstract
References to anatomical entities in medical records consist not only of explicit references to anatomical locations, but also other diverse types of expressions, such as specific diseases, clinical tests, clinical treatments, which constitute implicit references to anatomical entities. In order to identify these implicit anatomical entities, we propose a hierarchical framework, in which two layers of named entity recognizers (NERs) work in a cooperative manner. Each of the NERs is implemented using the Conditional Random Fields (CRF) model, which use a range of external resources to generate features. We constructed a dictionary of anatomical entity expressions by exploiting four existing resources, i.e., UMLS, MeSH, RadLex and BodyPart3D, and supplemented information from two external knowledge bases, i.e., Wikipedia and WordNet, to improve inference of anatomical entities from implicit expressions. Experiments conducted on 300 discharge summaries showed a micro-averaged performance of 0.8509 Precision, 0.7796 Recall and 0.8137 F1 for explicit anatomical entity recognition, and 0.8695 Precision, 0.6893 Recall and 0.7690 F1 for implicit anatomical entity recognition. The use of the hierarchical framework, which combines the recognition of named entities of various types (diseases, clinical tests, treatments) with information embedded in external knowledge bases, resulted in a 5.08% increment in F1. The resources constructed for this research will be made publicly available.Entities:
Mesh:
Year: 2014 PMID: 25343498 PMCID: PMC4208750 DOI: 10.1371/journal.pone.0108396
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Overview of system workflow.
Figure 2Annotation samples.
Inter-annotator agreement between A1 and A2.
| Explicit | Implicit | |
| True positive | 13011 | 3287 |
| False positive | 951 | 491 |
| False negative | 522 | 312 |
| k | 0.9284 | 0.8812 |
Inter-annotator agreement between each annotator and the gold standard.
| Explicit | Implicit | ||
| A1 | True positive | 13787 | 3577 |
| False positive | 1495 | 465 | |
| False negative | 924 | 302 | |
| k | 0.8901 | 0.8937 | |
| A2 | True positive | 13116 | 3328 |
| False positive | 1679 | 596 | |
| False negative | 972 | 389 | |
| k | 0.8768 | 0.8590 |
List of features in this task.
| Category | Features |
| Baseline features | Original |
| Capital Upper | |
| Upper | |
| Normalized form | |
| Prefix and Suffix | |
| Concept dictionary matching | |
| Concept type | |
| Stanford Parser POS | |
| Enju Parser POS | |
| Ontological features | 4 dictionaries matching (DF1) |
| Position matching(DF2) | |
| Coreference features | Coreference dictionary matching (CF) |
| World knowledge features | Wiki word matching (WF1) |
| Wiki word matching (WF2) | |
| Wiki word matching (WF3) | |
| Hierarchical feature | Hierarchical feature(HF) |
Figure 3Detailed explanation of baseline features.
Numbers of entities in dictionaries.
| Dictionary | Number of explicit tokens matched in dictionary | Coverage of explicit named entities |
|
| 3012 | 18.05% |
|
| 2174 | 13.03% |
|
| 3238 | 19.40% |
|
| 1595 | 9.56% |
|
| 4019 | 24.08% |
Figure 4Building the hierarchical feature for implicit anatomical recognizer.
Performance of individual feature added to baseline using gold standard medical concepts.
| Explicit recognizer | Implicit recognizer | Total | ||||||||
| P | R | F | INC | P | R | F | INC | Micro F | INC | |
| Baseline | 0.9168 | 0.7839 | 0.8452 | / | 0.9206 | 0.6919 | 0.7900 | / | 0.8320 | / |
| Baseline+DF1 | 0.9088 | 0.8296 | 0.8674 | 0.0222 | 0.9224 | 0.7094 | 0.8020 | 0.0120 | 0.8520 | 0.0200 |
| Baseline+DF2 | 0.9239 | 0.7922 | 0.8530 | 0.0078 | 0.9136 | 0.7005 | 0.7930 | 0.0030 | 0.8385 | 0.0065 |
| Baseline+CF | 0.9200 | 0.7883 | 0.8491 | 0.0039 | 0.9198 | 0.7067 | 0.7993 | 0.0093 | 0.8371 | 0.0051 |
| Baseline+WF1 | 0.9297 | 0.8052 | 0.8630 | 0.0178 | 0.9256 | 0.7205 | 0.8103 | 0.0203 | 0.8503 | 0.0183 |
| Baseline+WF2 | 0.9330 | 0.8001 | 0.8615 | 0.0163 | 0.9483 | 0.7403 | 0.8315 | 0.0415 | 0.8542 | 0.0222 |
| Baseline+WF3 | 0.9301 | 0.7987 | 0.8594 | 0.0142 | 0.9377 | 0.7281 | 0.8197 | 0.0297 | 0.8498 | 0.0178 |
| Baseline+HF | / | / | / | / | 0.9466 | 0.7419 | 0.8318 | 0.0418 | / | / |
*Inc is increment compared with baseline.
Performance of different feature combinations using gold standard medical concepts.
| Explicit recognizer | Implicit recognizer | Micro | INC | |||||
| No | P | R | F | P | R | F | F | |
| Baseline | 0.9168 | 0.7839 | 0.8452 | 0.9206 | 0.6919 | 0.7900 | 0.8320 | / |
| DF1 | 0.9088 | 0.8296 | 0.8674 | 0.9224 | 0.7094 | 0.8020 | 0.8520 | 0.0200 |
| DF1+DF2 | 0.9108 | 0.8439 | 0.8761 | 0.9299 | 0.7160 | 0.8091 | 0.8604 | 0.0284 |
| DF1+DF2+CF | 0.9234 | 0.8469 | 0.8835 | 0.9361 | 0.7248 | 0.8170 | 0.8678 | 0.0358 |
| DF1+DF2+CF+WF1 | 0.9396 | 0.8667 | 0.9017 | 0.9463 | 0.7391 | 0.8300 | 0.8848 | 0.0528 |
| DF1+DF2+CF+WF1+WF2 | 0.9451 | 0.8735 | 0.9079 | 0.9484 | 0.7415 | 0.8327 | 0.8901 | 0.0581 |
| DF1+DF2+CF+WF1+WF2+WF3 | 0.9481 | 0.8776 | 0.9114 | 0.9505 | 0.7455 | 0.8356 | 0.8936 | 0.0616 |
| DF1+DF2+CF+WF1+WF2+WF3+HF | 0.9481 | 0.8776 | 0.9114 | 0.9686 | 0.7631 | 0.8537 | 0.8978 | 0.0658 |
*Inc is increment compared with baseline.
Performance with training datasets of various sizes using gold standard medical concepts.
| Data Size | Explicit recognizer | Implicit recognizer | Micro | INC | ||||
| P | R | F | P | R | F | F | ||
| 50 | 0.9316 | 0.8245 | 0.8748 | 0.9527 | 0.6364 | 0.7631 | 0.8491 | / |
| 100 | 0.9384 | 0.8483 | 0.8911 | 0.9593 | 0.6966 | 0.8071 | 0.8717 | 0.0226 |
| 150 | 0.9423 | 0.8621 | 0.9004 | 0.9654 | 0.7147 | 0.8213 | 0.8821 | 0.0330 |
| 200 | 0.9479 | 0.8713 | 0.9080 | 0.9664 | 0.7418 | 0.8393 | 0.8922 | 0.0431 |
| 250 | 0.9488 | 0.8743 | 0.9102 | 0.9678 | 0.7595 | 0.8511 | 0.8964 | 0.0473 |
| 300 | 0.9481 | 0.8776 | 0.9114 | 0.9686 | 0.7631 | 0.8537 | 0.8978 | 0.0487 |
*Inc is increment compared with baseline.
Performance of individual feature added to baseline using automatically predicted medical concepts.
| Explicit recognizer | Implicit recognizer | Total | ||||||||
| P | R | F | INC | P | R | F | INC | Micro F | INC | |
| Baseline | 0.8123 | 0.7048 | 0.7545 | / | 0.8217 | 0.6282 | 0.7120 | / | 0.7443 | / |
| Baseline+DF1 | 0.8113 | 0.7461 | 0.7773 | 0.0228 | 0.8236 | 0.6303 | 0.7141 | 0.0021 | 0.7625 | 0.0182 |
| Baseline+DF2 | 0.8268 | 0.7112 | 0.7647 | 0.0102 | 0.8171 | 0.6304 | 0.7117 | −0.0003 | 0.7519 | 0.0076 |
| Baseline+CF | 0.8252 | 0.7106 | 0.7636 | 0.0091 | 0.8167 | 0.6298 | 0.7112 | −0.0008 | 0.7510 | 0.0067 |
| Baseline+WF1 | 0.8318 | 0.7221 | 0.7731 | 0.0186 | 0.8293 | 0.6502 | 0.7289 | 0.0169 | 0.7624 | 0.0181 |
| Baseline+WF2 | 0.8333 | 0.7215 | 0.7734 | 0.0189 | 0.8459 | 0.6692 | 0.7472 | 0.0352 | 0.7670 | 0.0227 |
| Baseline+WF3 | 0.8314 | 0.7186 | 0.7709 | 0.0164 | 0.8376 | 0.6590 | 0.7376 | 0.0256 | 0.7628 | 0.0185 |
| Baseline+HF | / | / | / | / | 0.8447 | 0.6713 | 0.7481 | 0.0361 | / | / |
*Inc is increment compared with baseline.
Performance of different feature combinations using automatically predicted medical concepts.
| Explicit recognizer | Implicit recognizer | Micro | INC | |||||
| No | P | R | F | P | R | F | F | |
| Baseline | 0.8123 | 0.7043 | 0.7545 | 0.8217 | 0.6282 | 0.7120 | 0.7443 | / |
| DF1 | 0.8105 | 0.7452 | 0.7765 | 0.8233 | 0.6427 | 0.7219 | 0.7636 | 0.0193 |
| DF1+DF2 | 0.8116 | 0.7582 | 0.7840 | 0.8304 | 0.6491 | 0.7286 | 0.7710 | 0.0267 |
| DF1+DF2+CF | 0.8256 | 0.7599 | 0.7914 | 0.8362 | 0.6578 | 0.7363 | 0.7784 | 0.0341 |
| DF1+DF2+CF+WF1 | 0.8495 | 0.7781 | 0.8122 | 0.8452 | 0.6687 | 0.7467 | 0.7967 | 0.0524 |
| DF1+DF2+CF+WF1+WF2 | 0.8487 | 0.7826 | 0.8143 | 0.8501 | 0.6742 | 0.7200 | 0.7992 | 0.0549 |
| DF1+DF2+CF+WF1+WF2+WF3 | 0.8494 | 0.7854 | 0.8160 | 0.8517 | 0.6749 | 0.7531 | 0.8012 | 0.0569 |
| DF1+DF2+CF+WF1+WF2+WF3+HF | 0.8509 | 0.7796 | 0.8137 | 0.8695 | 0.6893 | 0.7690 | 0.8031 | 0.0588 |
*Inc is increment compared with baseline.
Performance with training datasets of various sizes using automatically predicted medical concepts.
| Explicit recognizer | Implicit recognizer | Micro | INC | |||||
| Data Size | P | R | F | P | R | F | F | |
| 50 | 0.8314 | 0.7308 | 0.7779 | 0.8403 | 0.5817 | 0.6875 | 0.7568 | / |
| 100 | 0.8456 | 0.7527 | 0.7965 | 0.8515 | 0.6401 | 0.7308 | 0.7810 | 0.0242 |
| 150 | 0.8473 | 0.7628 | 0.8028 | 0.8619 | 0.6531 | 0.7431 | 0.7888 | 0.0320 |
| 200 | 0.8501 | 0.7736 | 0.8100 | 0.8643 | 0.6764 | 0.7589 | 0.7981 | 0.0413 |
| 250 | 0.8518 | 0.7774 | 0.8129 | 0.8678 | 0.6852 | 0.7658 | 0.8019 | 0.0451 |
| 300 | 0.8509 | 0.7796 | 0.8137 | 0.8695 | 0.6893 | 0.7690 | 0.8031 | 0.0463 |
*Inc is increment compared with baseline.