| Literature DB >> 28699566 |
Zengjian Liu1, Ming Yang2, Xiaolong Wang1, Qingcai Chen1, Buzhou Tang3,4, Zhe Wang5, Hua Xu6.
Abstract
BACKGROUND: Entity recognition is one of the most primary steps for text analysis and has long attracted considerable attention from researchers. In the clinical domain, various types of entities, such as clinical entities and protected health information (PHI), widely exist in clinical texts. Recognizing these entities has become a hot topic in clinical natural language processing (NLP), and a large number of traditional machine learning methods, such as support vector machine and conditional random field, have been deployed to recognize entities from clinical texts in the past few years. In recent years, recurrent neural network (RNN), one of deep learning methods that has shown great potential on many problems including named entity recognition, also has been gradually used for entity recognition from clinical texts.Entities:
Keywords: Clinical notes; Deep learning; Entity recognition; Recurrent neural network; Sequence labeling
Mesh:
Year: 2017 PMID: 28699566 PMCID: PMC5506598 DOI: 10.1186/s12911-017-0468-7
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Overview architecture of our LSTM
Fig. 2Structure of an LSTM unit
Fig. 3Character-level representation generation models. a Bidirectional LSTM. b CNN
Statistics of entity recognition datasets used in our study
| Challenge | 2010 | 2012 | 2014 | |
|---|---|---|---|---|
| Training | #Note | 349 | 190 | 790 |
| #Entity | 27837 | 16468 | 17405 | |
| Test | #Note | 477 | 120 | 514 |
| #Entity | 45009 | 13594 | 11462 | |
Evaluation criteria for the three entity recognition tasks
| Challenge | Criterion | Remarks |
|---|---|---|
| 2010 | Exact* | Entities have the same boundary and same type. |
| Inexact | Entities overlap and have the same type. | |
| 2012 | Span* | Entities overlap |
| Type | Entities overlap and have the same type. | |
| 2014 | Exact* | Entities have the same boundary and same type. |
| Token | “Exact” criterion at token-level. |
*represents the primary evaluation criterion for each task
Hyperparameters chosen for all our experiments
| Hyperparameter | 2010/2012/2014 |
|---|---|
| Dimension of token-level word representation | 50 |
| Dimension of character representation | 25 |
| Character-level LSTM size | 25 |
| Character-level CNN filter size | 3 |
| Character-level CNN filter number | 25 |
| Token-level LSTM size | 100 |
| Dropout probability | 0.5 |
| Learning rate | 0.005 |
| Gradient clipping | 5.0 |
| Training epochs | 50/30/55 |
Performances of LSTM and CRF-based models for the three tasks (F1-score %)
| Model | 2010 i2b2 challenge (Concept Extraction) | 2012 i2b2 challenge (Event Detection) | 2014 i2b2 challenge (De-Identification) | |||
|---|---|---|---|---|---|---|
| Exact | Inexact | Span | Type | Exact | Token | |
| CRF | 83.69 | 91.39 | 90.82 | 83.72 | 92.58 | 95.37 |
| LSTM-BASELINE | 85.36 | 92.58 | 92.20 | 87.74 | 93.30 | 96.05 |
| LSTM + char-LSTM | 85.81 | 92.91 | 92.29 | 86.94 | 94.29 | 96.54 |
| LSTM + char-CNN | 85.65 | 92.77 | 92.25 | 87.66 | 94.37 | 96.67 |
| LSTM + char-LSTM + CNN | 85.78 | 92.76 | 92.28 | 87.80 | 94.16 | 96.44 |
Comparison of the performances of various systems on the three tasks (%)
| System | Method | Exact F1-score | Inexact F1-score | |
| 2010 | LSTM + char-LSTM | RNN | 85.81 | 92.91 |
| Tang et al (2013) [ | SSVM | 85.82 | 92.40 | |
| Bruijin et al (2011)* [ | Semi-Markov | 85.23 | 92.44 | |
| Kim et al (2015) [ | CRFs | 84.30 | - | |
| Jiang et al (2011) [ | CRFs | 83.91 | 91.30 | |
| System | Method | Span F1-score | Type Accuracy | |
| 2012 | LSTM + char-LSTM | RNN | 92.29 | 86.94 |
| Xu et al. (2013)* [ | CRFs | 91.66 | 85.74 | |
| Tang et al. (2013) [ | CRFs + SVM | 90.13 | 83.60 | |
| Sohn et al. (2013) [ | CRFs | 87.00 | 76.77 | |
| Aleksandar et al. (2013) [ | CRFs | 87.29 | 82.00 | |
| System | Method | Exact F1-score | Token F1-score | |
| 2014 | LSTM + char-LSTM | RNN | 94.29 | 96.54 |
| Yang et al. (2015) [ | CRFs | 93.60 | 96.11 | |
| He et al. (2015) [ | CRFs | 92.32 | 95.14 | |
| Liu et al. (2015) [ | CRFs + rule | 91.24 | 94.64 | |
| Dehghan et al. (2015) [ | CRFs + rule | 91.13 | 95.31 |