| Literature DB >> 31881938 |
Hyejin Cho1, Hyunju Lee2.
Abstract
BACKGROUND: In biomedical text mining, named entity recognition (NER) is an important task used to extract information from biomedical articles. Previously proposed methods for NER are dictionary- or rule-based methods and machine learning approaches. However, these traditional approaches are heavily reliant on large-scale dictionaries, target-specific rules, or well-constructed corpora. These methods to NER have been superseded by the deep learning-based approach that is independent of hand-crafted features. However, although such methods of NER employ additional conditional random fields (CRF) to capture important correlations between neighboring labels, they often do not incorporate all the contextual information from text into the deep learning layers.Entities:
Keywords: Contextual information; Long short-term memory; Named entity recognition; Neural networks; Text mining
Mesh:
Year: 2019 PMID: 31881938 PMCID: PMC6935215 DOI: 10.1186/s12859-019-3321-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Statistics of the NCBI, GM, and CDR corpora
| Corpus | Entity | Unit | Training | Develop | Test | Total (Unit) |
|---|---|---|---|---|---|---|
| NCBI | Disease | Abstracts | 592 | 100 | 100 | 792 (abstracts) |
| GM | Gene | Sentences | 15000 | - | 5000 | 20000 (sentences) |
| CDR | Disease, Chemicals | Abstracts | 500 | 500 | 500 | 1500 (abstracts) |
Comparison of performance for comparative methods on the NCBI, GM, and CDR corpora using strict and partial matching and IOB tag matching
| Model | p | r | f | p | r | f | p | r | f | |
|---|---|---|---|---|---|---|---|---|---|---|
| NCBI | GM | CDR | ||||||||
| BiLSTM | 78.91 | 82.60 | 80.71 | 72.22 | 72.44 | 72.33 | 83.56 | 80.26 | 81.88 | |
| BiLSTM-CRF | 82.19 | 84.58 | 83.37 | 80.79 | 79.81 | 80.30 | 87.52 | 83.58 | 85.50 | |
| GRAM-CNN | 84.45 | 83.92 | 84.18 | 80.23 | 78.83 | 79.53 | 86.08 | 85.49 | 85.79 | |
| BERT | 81.07 | 80.73 | 80.90 | 81.72 | 81.59 | 86.21 | 85.23 | 85.72 | ||
| CLSTM | word level | 85.94 | 84.69 | 81.00 | 80.77 | 80.89 | 87.23 | 85.51 | ||
| character level | 85.40 | 84.06 | 81.09 | 80.38 | 80.73 | 87.19 | 84.69 | |||
| word+char levels | 84.73 | 86.67 | 81.75 | 81.14 | 81.44 | 87.25 | 85.66 | |||
| NCBI | GM | CDR | ||||||||
| BiLSTM | 86.67 | 90.73 | 88.65 | 87.98 | 88.25 | 88.11 | 91.14 | 87.54 | 89.30 | |
| BiLSTM-CRF | 91.19 | 93.85 | 92.51 | 93.18 | 92.04 | 92.61 | 94.27 | 90.00 | 92.08 | |
| GRAM-CNN | 94.36 | 93.78 | 93.09 | 91.47 | 92.27 | 92.47 | 91.83 | 92.15 | ||
| BERT | 88.39 | 88.02 | 88.20 | 92.65 | 92.51 | 92.58 | 91.82 | 90.77 | 91.29 | |
| CLSTM | word level | 93.66 | 92.29 | 92.97 | 92.81 | 92.54 | 93.60 | 91.74 | ||
| character level | 93.76 | 92.29 | 93.02 | 93.05 | 92.25 | 93.42 | 91.59 | |||
| word+char levels | 93.71 | 93.13 | 93.42 | 93.35 | 92.65 | 93.48 | 91.77 | |||
| NCBI | GM | CDR | ||||||||
| BiLSTM | 84.56 | 88.03 | 86.26 | 84.23 | 81.48 | 82.83 | 89.81 | 78.68 | 83.87 | |
| BiLSTM-CRF | 84.13 | 88.32 | 86.18 | 88.34 | 84.47 | 86.36 | 90.54 | 81.34 | 85.69 | |
| GRAM-CNN | 88.73 | 86.59 | 87.65 | 87.75 | 84.09 | 85.89 | 89.72 | 83.03 | 86.24 | |
| BERT | 88.42 | 83.15 | 85.70 | 89.50 | 86.26 | 88.69 | 85.01 | 86.81 | ||
| CLSTM | word level | 89.18 | 89.01 | 88.99 | 84.89 | 86.89 | 89.99 | 83.19 | 86.45 | |
| character level | 88.21 | 88.47 | 87.72 | 85.21 | 86.45 | 89.95 | 83.19 | 86.43 | ||
| word+char levels | 89.98 | 87.74 | 87.13 | 86.88 | 87.00 | 90.56 | 83.41 | |||
Comparison between a series of CLSTM (contextual long short-term memory networks [LSTMs] with conditional random fields [CRF]) experiments and the comparative methods on the NCBI, GM, and CDR corpora using strict matching
| Strict matching | NCBI | GM | CDR | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Model | Trial # | p | r | f | p | r | f | p | r | f |
| BiLSTM | - | 78.91 | 82.60 | 80.71 | 72.22 | 72.44 | 72.33 | 83.56 | 80.26 | 81.88 |
| BiLSTM-CRF | - | 82.19 | 84.58 | 83.37 | 80.79 | 79.81 | 80.30 | 87.52 | 83.58 | 85.50 |
| GRAM-CNN | - | 84.45 | 83.92 | 84.18 | 80.23 | 78.83 | 79.53 | 86.08 | 85.49 | 85.79 |
| BERT | - | 81.07 | 80.73 | 80.90 | 81.72 | 81.59 | 86.21 | 85.23 | 85.72 | |
| CLSTM (word+char levels) | 1 | 84.73 | 86.67 | 81.75 | 81.14 | 81.44 | 87.25 | 85.66 | ||
| 2 | 84.43 | 85.83 | 81.26 | 80.67 | 80.96 | 87.16 | 85.40 | |||
| 3 | 86.18 | 84.48 | 82.07 | 80.24 | 81.14 | 87.93 | 84.56 | |||
| 4 | 85.56 | 85.21 | 82.97 | 79.66 | 81.28 | 87.71 | 85.17 | |||
| 5 | 84.62 | 85.42 | 81.02 | 80.70 | 80.86 | 88.27 | 84.36 | |||
| CLSTM average | 85.10 | 85.52 | 81.81 | 80.48 | 81.14 | 87.66 | 85.03 | |||
Comparison of the performance of cross-corpus evaluation for comparative methods using strict matching
| Strict matching | train CDR → test NCBIa | train NCBI → test CDRb | |||||
|---|---|---|---|---|---|---|---|
| Model | p | r | f | p | r | f | |
| BiLSTM | 57.32 | 37.92 | 45.64 | 55.19 | 30.79 | 39.52 | |
| BiLSTM-CRF | 68.34 | 36.88 | 47.90 | 58.30 | 38.74 | 46.55 | |
| GRAM-CNN | 59.74 | 42.81 | 49.88 | 58.48 | 33.21 | 42.36 | |
| BERT | 68.92 | 53.13 | 54.17 | 61.44 | |||
| CLSTM | word level | 62.42 | 48.96 | 54.87 | 60.92 | 38.09 | 46.87 |
| character level (3)c | 68.12 | 44.06 | 53.51 | 32.66 | 42.96 | ||
| character level (7)c | 65.08 | 45.63 | 53.64 | 60.69 | 21.75 | 32.02 | |
| word+char levels (3, 3)d | 66.77 | 43.75 | 52.86 | 54.00 | 44.08 | 48.54 | |
| word+char levels (5, 5)d | 42.92 | 53.02 | 57.63 | 39.51 | 46.88 | ||
aTest the disease entities in the NCBI corpus using the model trained on the CDR corpus
bTest the disease entities in the CDR corpus using the model trained on the NCBI corpus
cThe number in parentheses represents the window size at the character level.
dThe numbers in parentheses represent the window sizes at the word and character level, respectively
Comparison of training time between CLSTM (contextual long short-term memory networks [LSTMs] with conditional random fields [CRF])and comparative methods for the NCBI, GM, and CDR corpora
| Training time (Hours) | NCBI | GM | CDR | |
|---|---|---|---|---|
| BiLSTM | 4.08 | 10.77 | 4.39 | |
| BiLSTM-CRF | 4.70 | 12.56 | 5.23 | |
| GRAM-CNN | 11.27 | 34.08 | 12.64 | |
| BERT | 1.01 | 10.04 | 3.72 | |
| CLSTM | word level | 5.54 | 14.57 | 5.98 |
| character level | 4.84 | 13.30 | 5.64 | |
| word+char levels | 5.84 | 14.73 | 5.91 | |
| Average | 5.41 | 14.20 | 5.84 | |
Fig. 1Pipeline of the CLSTM (contextual long short-term memory networks [LSTMs] with conditional random fields [CRF])model