| Literature DB >> 31331263 |
Jinghang Gu1,2, Fuqing Sun3, Longhua Qian4, Guodong Zhou1.
Abstract
BACKGROUND: Automatically understanding chemical-disease relations (CDRs) is crucial in various areas of biomedical research and health care. Supervised machine learning provides a feasible solution to automatically extract relations between biomedical entities from scientific literature, its success, however, heavily depends on large-scale biomedical corpora manually annotated with intensive labor and tremendous investment.Entities:
Keywords: Attention; Biomedical relation extraction; Deep learning; Distant supervision
Mesh:
Year: 2019 PMID: 31331263 PMCID: PMC6647285 DOI: 10.1186/s12859-019-2884-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The title and abstract of the sample document (PMID: 2375138)
Fig. 2The system workflow diagram
The CID relation statistics on the corpus
| Task Datasets | # of Articles | # of CID Relations |
|---|---|---|
| Training | 500 | 1038 |
| Development | 500 | 1012 |
| Test | 500 | 1066 |
Statistics on the generated training set
| Types | Count |
|---|---|
| PMIDs | 30,884 |
| Chemical Entities | 9113 |
| Chemical Mentions | 358,395 |
| Disease Entities | 3525 |
| Disease Mentions | 267,196 |
| Relations | 54,729 |
Fig. 3The architecture of the Instance Representation module
Fig. 4The architecture of the instance-level attention module
Feature names and their locations
| Name | Location |
|---|---|
| T | At the title. |
| A_Fst | At the first sentence of the abstract. |
| A_Lst | At the last sentence of the abstract. |
| A_Mdl | In the middle of the abstract. |
Fig. 5The stacked auto-encoder neural network
Hyper-parameters for two models
| Method | Hyper-parameter | Value |
|---|---|---|
| Attention-based Model | Learning rate | 0.004 |
| LSTM hidden state dimension | 200 | |
| Mini-batch size | 500 | |
| Word embedding dimension | 300 | |
| Position embedding dimension | 50 | |
| Identifier embedding dimension | 100 | |
| Hyponym embedding dimension | 50 | |
| Location embedding dimension | 50 | |
| Hidden layer nodes | 250 | |
| Dropout rate | 0.3 | |
| Stacked Auto-encoder Model | Learning rate | 0.008 |
| Mini-batch size | 400 | |
| Word embedding dimension | 300 | |
| Identifier embedding dimension | 100 | |
| Hyponym embedding dimension | 50 | |
| Encoder layer nodes | 250 | |
| Decoder layer nodes | 50 | |
| Dropout rate | 0.3 |
The performance of the Attention-based model on the test dataset at intra-sentence level
| Methods | P(%) | R(%) | F(%) |
|---|---|---|---|
| Intra_HRNN (Baseline) | 62.0 | 55.2 | 58.4 |
| Intra-Attention | 62.2 | 59.5 | 60.8 |
| - Descriptor Embedding | 61.1 | 54.2 | 57.5 |
| - Hyponym Embedding | 61.7 | 56.6 | 59.0 |
| - Location Embedding | 61.9 | 56.7 | 59.2 |
| - Entity Difference Embedding | 62.1 | 56.9 | 59.4 |
The performance of the Stacked Auto-Encoder model on the test dataset at inter-sentence level
| Methods | P(%) | R(%) | F(%) |
|---|---|---|---|
| Inter_HRNN (Baseline) | 27.0 | 19.8 | 22.8 |
| Stacked_Autoencoder | 55.7 | 14.2 | 22.6 |
The overall performance on the test dataset
| Methods | P(%) | R(%) | F(%) |
|---|---|---|---|
| ① Intra_HRNN + Inter_HRNN | 46.5 | 75.0 | 57.4 |
| ② Intra_HRNN + Stacked_Autoencoder | 58.2 | 71.6 | 64.2 |
| ③ Intra_Attention + Inter_HRNN | 46.9 | 79.3 | 59.0 |
| ④ Intra_Attention + Stacked_Autoencoder | 60.3 | 73.8 | 66.4 |
Fig. 6The precision-recall curve of different combinations
Comparisons with the related works
| Methods | Systems | Description | P(%) | R(%) | F1(%) |
|---|---|---|---|---|---|
| Distant Supervision | Ours | Intra_Attention | 62.2 | 59.5 | 60.8 |
| Intra_Attention + Stacked_Autoencoder | 60.3 | 73.8 | 66.4 | ||
| ML without KB | Gu et al. 2016 [ | Intra_ME | 60.4 | 50.3 | 54.9 |
| Intra_ME + Inter_ME | 62.0 | 55.1 | 58.3 | ||
| Gu et al. 2017 [ | CNN | 59.7 | 55.0 | 57.2 | |
| CNN + Inter_ME + PP | 55.7 | 68.1 | 61.3 | ||
| Zhou et al. 2016 [ | LSTM + SVM | 64.9 | 49.3 | 56.0 | |
| LSTM + SVM + PP | 55.6 | 68.4 | 61.3 | ||
| ML with KB | Ours | Intra_Attention + Stacked_Autoencoder + KBs | 67.9 | 77.0 | 72.1 |
| Xu et al. 2016 [ | SVM + KBs | 65.8 | 68.6 | 67.2 | |
| Pons et al. 2016 [ | SVM + KBs | 73.1 | 67.6 | 70.2 | |
| Peng et al. 2016 [ | Extra training data + SVM + KBs | 71.1 | 72.6 | 71.8 | |
| Rule-based | Lowe et al. 2016 [ | Heuristic rules | 59.3 | 62.3 | 60.8 |