| Literature DB >> 30020437 |
Yifan Peng1, Anthony Rios1,2, Ramakanth Kavuluru2,3, Zhiyong Lu1.
Abstract
Mining relations between chemicals and proteins from the biomedical literature is an increasingly important task. The CHEMPROT track at BioCreative VI aims to promote the development and evaluation of systems that can automatically detect the chemical-protein relations in running text (PubMed abstracts). This work describes our CHEMPROT track entry, which is an ensemble of three systems, including a support vector machine, a convolutional neural network, and a recurrent neural network. Their output is combined using majority voting or stacking for final predictions. Our CHEMPROT system obtained 0.7266 in precision and 0.5735 in recall for an F-score of 0.6410 during the challenge, demonstrating the effectiveness of machine learning-based approaches for automatic relation extraction from biomedical literature and achieving the highest performance in the task during the 2017 challenge.Database URL: http://www.biocreative.org/tasks/biocreative-vi/track-5/.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30020437 PMCID: PMC6051439 DOI: 10.1093/database/bay073
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Chemical–protein annotation example.
Statistics of the CHEMPROT dataset
| Training | Development | Test | |
|---|---|---|---|
| Document | 1020 | 612 | 800 |
| Chemical | 13017 | 8004 | 10810 |
| Protein | 12752 | 7567 | 10019 |
| Positive relation | 4157 | 2416 | 3458 |
| CPR: 3 | 768 | 550 | 665 |
| CPR: 4 | 2254 | 1094 | 1661 |
| CPR: 5 | 173 | 116 | 195 |
| CPR: 6 | 235 | 199 | 293 |
| CPR: 9 | 727 | 457 | 644 |
| Positive relation in one sentence | 4122 | 2412 | 3444 |
Figure 2.Architecture of the systems for the CHEMPROT task.
Figure 3.Overview of the CNN model.
Figure 4.Overview of the RNN model.
Figure 5.Data partition of 5-fold cross-validation and final submission.
Results for our individual and ensemble systems with training and development sets
| System | P | R | F |
|---|---|---|---|
| SVM | 0.6291 | 0.4779 | 0.5430 |
| CNN | 0.6406 | 0.5713 | 0.6023 |
| RNN | 0.6080 | 0.6139 | 0.6094 |
| Majority voting | 0.7408 | 0.5517 | 0.6319 |
| Stacking | 0.7554 | 0.5524 | 0.6378 |
Macro-average results are reported using 5-fold cross validation.
Results for our ensemble systems on test set
| System | Fold | TP | FP | FN | P | R | F |
|---|---|---|---|---|---|---|---|
| SVM | 1 | 1646 | 875 | 1812 | 0.6529 | 0.4760 | 0.5506 |
| 2 | 1654 | 1001 | 1804 | 0.6230 | 0.4783 | 0.5411 | |
| 3 | 1668 | 932 | 1790 | 0.6415 | 0.4824 | 0.5507 | |
| 4 | 1679 | 911 | 1779 | 0.6483 | 0.4855 | 0.5552 | |
| 5 | 1646 | 963 | 1812 | 0.6309 | 0.4760 | 0.5426 | |
| CNN | 1 | 2040 | 1280 | 1418 | 0.6145 | 0.5899 | 0.6019 |
| 2 | 1996 | 1075 | 1462 | 0.6500 | 0.5772 | 0.6114 | |
| 3 | 2031 | 1254 | 1427 | 0.6183 | 0.5873 | 0.6024 | |
| 4 | 1961 | 1073 | 1497 | 0.6463 | 0.5671 | 0.6041 | |
| 5 | 1970 | 1107 | 1488 | 0.6402 | 0.5697 | 0.6029 | |
| RNN | 1 | 2085 | 1333 | 1373 | 0.6100 | 0.6029 | 0.6065 |
| 2 | 2255 | 1601 | 1203 | 0.5848 | 0.6521 | 0.6166 | |
| 3 | 2112 | 1390 | 1346 | 0.6031 | 0.6108 | 0.6069 | |
| 4 | 2097 | 1316 | 1361 | 0.6144 | 0.6064 | 0.6104 | |
| 5 | 2322 | 1845 | 1136 | 0.5572 | 0.6715 | 0.6090 | |
| Majority voting | |||||||
| 3 | 1962 | 715 | 1496 | 0.7329 | 0.5674 | 0.6396 | |
| 4 | 1934 | 697 | 1524 | 0.7351 | 0.5593 | 0.6352 | |
| 5 | 2020 | 784 | 1438 | 0.7204 | 0.5842 | 0.6452 | |
| Stacking | 1 | 1890 | 641 | 1568 | 0.7467 | 0.5466 | 0.6312 |
| 2 | 1756 | 530 | 1702 | 0.7682 | 0.5078 | 0.6114 | |
Submitted runs are reported in bold text.
Figure 6.The distribution of pairs according to the number of approaches that can correctly classify a given pair.
The distribution of pairs for each model according to the number of approaches that can correctly classify a given pair
| No. of approaches | SVM | CNN | RNN | Majority voting | Stacking |
|---|---|---|---|---|---|
| 0 | 0 | 0 | 0 | 0 | 3 |
| 1 | 30 | 120 | 331 | 0 | 91 |
| 2 | 187 | 565 | 634 | 602 | 534 |
| 3 | 1792 | 1792 | 1792 | 1790 | 1770 |
Figure 7.Average sentence length and entity distance in words by the number of approaches that can correctly classify a given pair.
Figure 8.Average sentence length in words by the model that can correctly classify a given pair.
Figure 9.Average entity distance in words by the model that can correctly classify a given pair.