| Literature DB >> 25147248 |
Sameer Pradhan1, Noémie Elhadad2, Brett R South3, David Martinez4, Lee Christensen3, Amy Vogel2, Hanna Suominen5, Wendy W Chapman3, Guergana Savova1.
Abstract
OBJECTIVE: The ShARe/CLEF eHealth 2013 Evaluation Lab Task 1 was organized to evaluate the state of the art on the clinical text in (i) disorder mention identification/recognition based on Unified Medical Language System (UMLS) definition (Task 1a) and (ii) disorder mention normalization to an ontology (Task 1b). Such a community evaluation has not been previously executed. Task 1a included a total of 22 system submissions, and Task 1b included 17. Most of the systems employed a combination of rules and machine learners.Entities:
Keywords: Clinical Notes; Disorder Identifciation; Information Extraction; Named Entity Recognition; Natural Language Processing; Word Sense Disambiguation
Mesh:
Year: 2014 PMID: 25147248 PMCID: PMC4433360 DOI: 10.1136/amiajnl-2013-002544
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1:Unique Identifiers in the Unified Medical Language System (UMLS) Metathesaurus.
Inter-annotator (A1 and A2) and gold standard (GS) agreement as F1 score for the disorder mentions and their normalization to the Unified Medical Language System Concept Unique Identifier (UMLS CUI)
| Disorder | CUI | |||
|---|---|---|---|---|
| Relaxed | Strict | Relaxed | Strict | |
| F1 | F1 | Accuracy | Accuracy | |
| A1–A2 | 0.909 | 0.769 | 0.776 | 0.846 |
| A1–GS | 0.968 | 0.932 | 0.954 | 0.973 |
| A2–GS | 0.937 | 0.826 | 0.806 | 0.863 |
Distribution of disorder mentions across the training and test set according to the two criteria—whether they map to a Concept Unique Identifier (CUI) and whether they are contiguous
| Training | Test | |||
|---|---|---|---|---|
| Total disorder mentions | 5816 | 5351 | ||
| CUI-less mentions | 1639 | (28.2%) | 1750 | (32.7%) |
| CUI-ed mentions | 4177 | (71.8%) | 3601 | (67.3%) |
| Contiguous mentions | 5165 | (88.8%) | 4912 | (91.8%) |
| Discontiguous mentions | 651 | (11.2%) | 439 | (8.2%) |
Evaluation for Task 1a
| System ({ID}.{run}) | Strict | Relaxed | ||||
|---|---|---|---|---|---|---|
| P | R | F1 | P | R | F1 | |
| No additional annotations | ||||||
| (UTHealthCCB.A).2 | 0.800 | 0.706 | 0.750* | 0.925 | 0.827 | 0.873 |
| (UTHealthCCB.A).1 | 0.831 | 0.663 | 0.737* | 0.954 | 0.774 | 0.854 |
| NCBI.1 | 0.768 | 0.654 | 0.707* | 0.910 | 0.796 | 0.849 |
| NCBI.2 | 0.757 | 0.658 | 0.704* | 0.904 | 0.805 | 0.852 |
| CLEAR.2 | 0.764 | 0.624 | 0.687* | 0.929 | 0.759 | 0.836 |
| (Mayo.A).1 | 0.800 | 0.573 | 0.668* | 0.936 | 0.680 | 0.787 |
| (UCDCSI.A).1 | 0.745 | 0.587 | 0.656 | 0.922 | 0.758 | 0.832 |
| CLEAR.1 | 0.755 | 0.573 | 0.651* | 0.937 | 0.705 | 0.804 |
| (Mayo.B).1 | 0.697 | 0.574 | 0.629* | 0.939 | 0.766 | 0.844 |
| CORAL.2 | 0.796 | 0.487 | 0.604 | 0.909 | 0.554 | 0.688 |
| HealthLanguageLABS.1 | 0.686 | 0.539 | 0.604* | 0.912 | 0.701 | 0.793 |
| LIMSI.2 | 0.814 | 0.473 | 0.598* | 0.964 | 0.563 | 0.711 |
| LIMSI.1 | 0.805 | 0.466 | 0.590 | 0.962 | 0.560 | 0.708 |
| (AEHRC.A).2 | 0.613 | 0.566 | 0.589* | 0.886 | 0.785 | 0.833 |
| (WVU.DG + VJ).1 | 0.614 | 0.505 | 0.554* | 0.885 | 0.731 | 0.801 |
| (WVU.SS + VJ).1 | 0.575 | 0.496 | 0.533 | 0.848 | 0.741 | 0.791 |
| CORAL.1 | 0.584 | 0.446 | 0.505 | 0.942 | 0.601 | 0.734 |
| NIL-UCM.2 | 0.617 | 0.426 | 0.504 | 0.809 | 0.558 | 0.660 |
| KPSCMI.2 | 0.494 | 0.512 | 0.503* | 0.680 | 0.687 | 0.684 |
| NIL-UCM.1 | 0.621 | 0.416 | 0.498 | 0.812 | 0.543 | 0.651 |
| KPSCMI.1 | 0.462 | 0.523 | 0.491* | 0.651 | 0.712 | 0.680 |
| (AEHRC.A).1 | 0.699 | 0.212 | 0.325* | 0.903 | 0.275 | 0.422 |
| (WVU.AJ + VJ).1 | 0.230 | 0.318 | 0.267* | 0.788 | 0.814 | 0.801 |
| UCDCSI.2 | 0.268 | 0.175 | 0.212* | 0.512 | 0.339 | 0.408 |
| SNUBME.2 | 0.191 | 0.137 | 0.160* | 0.381 | 0.271 | 0.317 |
| SNUBME.1 | 0.302 | 0.026 | 0.047 | 0.504 | 0.043 | 0.079 |
| (WVU.FP + VJ).1 | 0.024 | 0.446 | 0.046 | 0.088 | 0.997 | 0.161 |
| Additional annotations | ||||||
| (UCSC.CW + RA).2 | 0.732 | 0.621 | 0.672 | 0.883 | 0.742 | 0.806 |
| (UCSC.CW + RA).1 | 0.730 | 0.615 | 0.668* | 0.887 | 0.739 | 0.806 |
| RelAgent.2 | 0.651 | 0.494 | 0.562* | 0.901 | 0.686 | 0.779 |
| RelAgent.1 | 0.649 | 0.450 | 0.532 | 0.913 | 0.636 | 0.750 |
| (WVU.AL + VJ).1 | 0.492 | 0.558 | 0.523* | 0.740 | 0.840 | 0.787 |
| (THCIB.A).1 | 0.445 | 0.551 | 0.492* | 0.720 | 0.713 | 0.716 |
| (WVU.RK + VJ.1 | 0.397 | 0.465 | 0.428 | 0.717 | 0.814 | 0.762 |
In the Strict F1 score column, * indicates the F1 of the system was significantly better than the one immediately below (random shuffling, p<0.01). The .1 and .2 suffixes represent run number 1 and 2, respectively.
P, precision; R, recall.
Evaluation for Task 1b
| System ({ID}.{run}) | Strict | Relaxed |
|---|---|---|
| Accuracy | Accuracy | |
| No additional annotations | ||
| NCBI.2 | 0.589* | 0.895 |
| NCBI.1 | 0.587* | 0.897 |
| (Mayo.A).2 | 0.546* | 0.860 |
| (UTHealthCCB.A).1 | 0.514* | 0.728 |
| (UTHealthCCB.A).2 | 0.506 | 0.717 |
| (Mayo.A).1 | 0.502* | 0.870 |
| KPSCMI.1 | 0.443* | 0.865 |
| CLEAR.2 | 0.440* | 0.704 |
| CORAL.2 | 0.439* | 0.902 |
| CORAL.1 | 0.410* | 0.921 |
| CLEAR.1 | 0.409* | 0.713 |
| NIL-UCM.2 | 0.362 | 0.850 |
| NIL-UCM.1 | 0.362* | 0.871 |
| (AEHRC.A).2 | 0.313* | 0.552 |
| (WVU.SS + VJ).1 | 0.309 | 0.622 |
| (UCDCSI.B).1 | 0.299* | 0.509 |
| (WVU.DG + VJ).1 | 0.241 | 0.477 |
| (AEHRC.A).1 | 0.199* | 0.939 |
| (WVU.AJ + VJ).1 | 0.142 | 0.448 |
| (WVU.FP + VJ).1 | 0.112* | 0.252 |
| (UCDCSI.B).2 | 0.006 | 0.035 |
| Additional annotations | ||
| (UCSC.CW + RA).2 | 0.545* | 0.878 |
| (UCSC.CW + RA).1 | 0.540* | 0.879 |
| (THCIB.A).1 | 0.470* | 0.853 |
| (WVU.AL + VJ).1 | 0.349* | 0.625 |
| (WVU.RK + VJ).1 | 0.247 | 0.531 |
In the Strict Accuracy column, * indicates the accuracy of the system was significantly better than the one immediately below (random shuffling, p<0.01). The .1 and .2 suffixes represent run number 1 and 2, respectively.
Instantiations of the four Inside-Outside-Begin encoding variations for three sentences
| The | O | O | O | O | O |
| and | O | O | O | O | O |
| O | O | O | |||
| O | O | O | |||
| are | O | O | O | O | |
| moderately | O | O | O | O | |
| . | O | O | O | O | O |
| The | O | O | O | O | O |
| is | O | O | O | O | |
| moderately | O | O | O | O | |
| . | O | O | O | O | O |
| No | O | O | O | O | O |
| O | O | O | O | O |
The words that are part of the disorder mention are in bold along with the respective encodings.