| Literature DB >> 27374122 |
Miji Choi1, Haibin Liu2, William Baumgartner3, Justin Zobel4, Karin Verspoor5.
Abstract
We describe a system that automatically extracts biological events from biomedical journal articles, and translates those events into Biological Expression Language (BEL) statements. The system incorporates existing text mining components for coreference resolution, biological event extraction and a previously formally untested strategy for BEL statement generation. Although addressing the BEL track (Track 4) at BioCreative V (2015), we also investigate how incorporating coreference resolution might impact event extraction in the biomedical domain. In this paper, we report that our system achieved the best performance of 20.2 and 35.2 in F-score for the full BEL statement level on both stage 1, and stage 2 using provided gold standard entities, respectively. We also report that our results evaluated on the training dataset show benefit from integrating coreference resolution with event extraction.Entities:
Mesh:
Year: 2016 PMID: 27374122 PMCID: PMC4930833 DOI: 10.1093/database/baw076
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.(a) Sample sentence from the BEL Track training corpus. (b) BEL statements corresponding to the sample sentence. (c) Representation of BEL statement derived from the sample sentence (a). The BEL statement describes that the abundance of chemical compound designated by ‘glucocorticoid’ in the CHEBI namespace increases the abundance of protein designated by ‘Resp18’ in the MGI namespace.
BEL abundance functions (http://wiki.ope nbel.org/display/BIOC/BEL+Documentation#BELDocumentation-Function sassociated toNamespaces) selected in the BEL track at BioCreative V
| Name space | Entity concept | Function Long Form | Function Short Form | Example | Count (Train) | Count (Test) |
|---|---|---|---|---|---|---|
| HGNC | Human protein coding genes | proteinAbundance(), | p(), | p(HGNC:MAPK14) | 7, (33%) | 127 (43%) |
| geneAbundance(), | g(), | |||||
| rnaAbundance(), | r(), | |||||
| microRNAAbundance() | m() | |||||
| MGI | Mouse genes | proteinAbundance(), | p(), | p(MGI:Mapk14) | 12 231 (53%) | 111 (38%) |
| geneAbundance(), | g(), | |||||
| rnaAbundance(), | r(), | |||||
| microRNAAbundance() | m() | |||||
| EGID | Genes in a wide range of species | proteinAbundance(), | p(), | p(EGID:1432) | 140 (0.6%) | 0 |
| geneAbundance(), | g(), | |||||
| rnaAbundance() | r() | |||||
| GOBP | Biological processes | biologicalProcess() | bp() | bp(GOBP:"cell proliferation") | 1927 (8%) | 23 (8%) |
| MESHD | Diseases | pathology() | path() | path(MESHD:Hyperoxia) | 244 (1%) | 11 (4%) |
| CHEBI | Chemicals | abundance() | a() | a(CHEBI: lipopoly-saccharide) | 875 (3.8%) | 23 (8%) |
Other BEL functions (http://wiki.openbel.org/display/BIOC/BEL+Documentation#BELDocumentation-OtherFunctions) selected in the BEL track at BioCreative V
| Function | Type | Example | Count (Train) |
|---|---|---|---|
| complex() | complex abundance | (complex(p(MGI:Itga8),p(MGI:Itgb1))) -> bp(GOBP:"cell adhesion") | 758 |
| pmod() | protein modification | p(MGI:Cav1,pmod(P)) -> a(CHEBI:"nitric oxide") | 1,351 |
| deg() | degradation | p(MGI:Lyve1) -> deg(a(CHEBI:"hyaluronic acid")) | 205 |
| tloc() | translocation | a(CHEBI:"brefeldin A") -> tloc(p(MGI:Stk16)) | 101 |
| act() | molecular activity | complex(p(MGI:Cckbr),p(MGI:Gast)) -> act(p(MGI:Prkd1)) | 124 |
Figure 2.Workflow of our system for producing BEL statements from input text with examples.
Our coreference resolution system performance comparing with the best performing system (33) in the BioNLP-ST’11 Coreference task (20) and state-of-the-art coreference resolution systems (italicised)
| Precision | Recall | F-score | |
|---|---|---|---|
| UUtah ( | 73.3 | 22.2 | 34.1 |
| Our system ( | 46.3 | 50.0 | 48.0 |
| 62.7 | 50.4 | 55.9 | |
| 67.2 | 55.6 | 60.9 | |
| 67.5 | 69.8 | 68.1 |
Results are based on the Test data of the BioNLP’11—Protein Coreference task.
Mapping the BioNLP event types into BEL functions
| BioNLP | BEL function | BEL function type | Mapping example |
|---|---|---|---|
| Binding | p() | complex abundance | ‘… |
| Gene expression | r() | rna abundance | ‘… |
| Localization | tloc() | translocation | ‘…co-Smad (Smad4) and are |
| Phosphorylation | pmod(P) | phosphorylation | ‘…the |
| protein catabolism | deg() | degradation | ‘…p53 and targets it for |
| Transcription | r() | rna abundance | ‘…High BMP-6 |
| molecular activity | ‘…IFN7 in the |
Figure 3.Example of an evaluation taken from the web interface. BEL statements in gold standard and system prediction are shown for the example sentence. The evaluation scores are provided for all levels.
Official results on test data for BEL task 1 in Stage 1
| TP | FP | FN | P | R | F | ||
|---|---|---|---|---|---|---|---|
| Run 1 (without coref.) | Term | 64 | 12 | 236 | 84.2 | 21.3 | 34.0 |
| Function Second. | 3 | 1 | 53 | 75.0 | 5.4 | 10.0 | |
| Function | 3 | 1 | 63 | 75.0 | 4.6 | 8.6 | |
| Relation-Second. | 54 | 5 | 148 | 91.5 | 26.8 | 41.4 | |
| Relation | 32 | 21 | 170 | 60.4 | 15.8 | 25.1 | |
| Statement | 25 | 21 | 177 | 54.4 | 12.4 | 20.2 | |
| Run 2 (with coreference) | Term | 64 | 15 | 236 | 81.0 | 21.3 | 33.8 |
| Function Second. | 4 | 1 | 52 | 80.0 | 7.1 | 13.1 | |
| Function | 3 | 2 | 63 | 60.0 | 4.6 | 8.5 | |
| Relation-Second. | 54 | 8 | 148 | 87.1 | 26.7 | 40.9 | |
| Relation | 32 | 24 | 170 | 57.1 | 15.8 | 24.8 | |
| Statement | 25 | 24 | 177 | 51.0 | 12.4 | 19.9 | |
| Run 3 (with coreference and extended BEL function) | Term | 64 | 15 | 236 | 81.0 | 21.3 | 33.8 |
| Function Second. | 5 | 1 | 51 | 83.3 | 8.9 | 16.1 | |
| Function | 3 | 4 | 63 | 42.9 | 4.6 | 8.2 | |
| Relation-Second. | 54 | 8 | 148 | 87.1 | 26.7 | 40.9 | |
| Relation | 32 | 26 | 170 | 55.2 | 15.8 | 24.6 | |
| Statement | 25 | 26 | 177 | 49.0 | 12.4 | 19.8 |
Run 1, an approach without coreference resolution; Run 2, an approach with coreference resolution; Run 3, a coreference approach with extended BEL function.
Results on test data for BEL task 1 in the Stage 2
| TP | FP | FN | P | R | F | ||
|---|---|---|---|---|---|---|---|
| *NonCoreference | Term | 101 | 5 | 199 | 95.3 | 33.7 | 49.8 |
| Function Second. | 8 | 2 | 48 | 80.0 | 14.3 | 24.2 | |
| Function | 7 | 1 | 59 | 87.5 | 10.6 | 18.9 | |
| Relation-Second. | 84 | 3 | 118 | 96.6 | 41.6 | 58.1 | |
| Relation | 57 | 16 | 145 | 78.1 | 28.2 | 41.5 | |
| Statement | 44 | 18 | 158 | 71.0 | 21.8 | 33.3 | |
| Coreference | Term | 113 | 3 | 187 | 97.4 | 37.7 | 54.3 |
| Function Second. | 9 | 4 | 47 | 69.2 | 16.1 | 26.1 | |
| Function | 8 | 3 | 58 | 72.7 | 12.1 | 20.8 | |
| Relation-Second. | 91 | 3 | 111 | 96.8 | 45.1 | 61.5 | |
| Relation | 62 | 20 | 140 | 75.6 | 30.7 | 43.7 | |
| Statement | 48 | 23 | 154 | 67.6 | 23.8 | 35.2 |
Coreference, a coreference approach with extended BEL function using the given gold standard entities, NonCoreference, an approach without coreference resolution with extended BEL function using the given gold standard entities.
Statistics of anaphor types in the gold standard dataset at the BioCreative V shared task Track 4 (BEL track)
| Anaphor type | Training dataset | Test dataset | ||
|---|---|---|---|---|
| Numbers | Sentence prop. | Numbers | Sentence prop. | |
| Relative pronoun | 1313 | 21% | 14 | 13% |
| Personal pronoun | 257 | 4% | 6 | 6% |
| Possessive pronoun | 411 | 6% | 5 | 5% |
| Definite noun phrase | 507 | 8% | 0 | – |
| Total | 2488 | 25 | ||
Numbers are counts of occurrence of each anaphoric type, and Sentence prop. is the percentage of all sentences that include at least one anaphor of relevant type.
Comparison of performance between an approach with coreference resolution and an approach without it on anaphoric sentences in the training dataset, in terms of anaphor types
| Without Coreference | With Coreference | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TP | FP | FN | P | R | F | TP | FP | FN | P | R | F | ||
| Pers. pronoun | T | 34 | 45 | 89 | 43.0 | 27.6 | 33.7 | 55 | 43 | 68 | 56.1 | 44.7 | |
| Fs | 2 | 4 | 44 | 33.3 | 4.4 | 7.7 | 6 | 8 | 40 | 42.9 | 13.0 | ||
| F | 2 | 4 | 58 | 33.3 | 3.3 | 6.1 | 6 | 10 | 54 | 37.5 | 10.0 | ||
| Rs | 25 | 22 | 54 | 53.2 | 31.7 | 39.7 | 44 | 24 | 35 | 64.7 | 55.7 | ||
| R | 5 | 46 | 74 | 9.8 | 6.3 | 7.7 | 16 | 46 | 63 | 25.8 | 20.3 | ||
| S | 2 | 48 | 77 | 4.0 | 2.5 | 3.1 | 5 | 55 | 74 | 8.3 | 6.3 | ||
| Poss. pronoun | T | 82 | 74 | 125 | 52.6 | 39.6 | 45.2 | 100 | 74 | 107 | 57.5 | 48.3 | |
| Fs | 20 | 12 | 75 | 62.5 | 21.1 | 31.5 | 23 | 9 | 72 | 71.9 | 24.2 | ||
| F | 13 | 24 | 116 | 35.1 | 10.1 | 15.7 | 17 | 18 | 112 | 48.6 | 13.2 | ||
| Rs | 76 | 33 | 74 | 69.7 | 50.7 | 58.7 | 89 | 31 | 61 | 74.2 | 59.3 | ||
| R | 27 | 81 | 123 | 25.0 | 18.0 | 20.9 | 34 | 79 | 116 | 30.1 | 22.7 | ||
| S | 13 | 85 | 137 | 13.3 | 8.7 | 12 | 87 | 138 | 12.1 | 8.0 | 9.6 | ||
| Def. NP | T | 27 | 22 | 45 | 55.1 | 37.5 | 44.6 | 36 | 26 | 36 | 58.1 | 50.0 | |
| Fs | 9 | 3 | 22 | 75.0 | 29.0 | 41.9 | 11 | 3 | 20 | 78.6 | 35.5 | ||
| F | 4 | 10 | 38 | 28.6 | 9.5 | 14.3 | 10 | 5 | 32 | 66.7 | 23.8 | ||
| Rs | 26 | 5 | 23 | 83.9 | 53.1 | 65.0 | 30 | 10 | 19 | 75.0 | 61.2 | ||
| R | 10 | 20 | 39 | 33.3 | 20.4 | 25.3 | 16 | 26 | 33 | 38.1 | 32.7 | ||
| S | 3 | 23 | 46 | 11.5 | 6.1 | 8.0 | 7 | 29 | 42 | 19.4 | 14.3 | ||
| ALL | T | 141 | 140 | 255 | 50.2 | 35.6 | 41.7 | 188 | 143 | 208 | 56.8 | 47.5 | |
| Fs | 30 | 18 | 139 | 62.5 | 17.8 | 27.7 | 39 | 19 | 130 | 67.2 | 23.1 | ||
| F | 18 | 37 | 209 | 32.7 | 7.9 | 12.8 | 32 | 32 | 195 | 50.0 | 14.1 | ||
| Rs | 126 | 60 | 147 | 67.7 | 46.2 | 54.9 | 162 | 65 | 111 | 71.4 | 59.3 | ||
| R | 42 | 146 | 231 | 22.3 | 15.4 | 18.2 | 65 | 151 | 208 | 30.1 | 23.8 | ||
| S | 18 | 155 | 255 | 10.4 | 6.6 | 8.1 | 23 | 171 | 250 | 11.9 | 8.4 | ||
The higher F-score (with vs. without coreference) is indicated in bold.
Evaluation results of participating systems for Task 1
| Term | Function | Relation | Full statement | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| System | P | R | F | P | R | F | P | R | F | P | R | F |
| S1 | 38.0 | 28.3 | 32.4 | 26.3 | 7.6 | 11.8 | 1.2 | 1.5 | 1.3 | 0.8 | 1.0 | 0.9 |
| S2 | 52.6 | 60.3 | 56.2 | 11.2 | 18.2 | 13.9 | 9.7 | 8.4 | 9.0 | 7.6 | 6.4 | 7.0 |
| S3 (ours) | 84.2 | 21.3 | 34.0 | 75.0 | 4.6 | 8.6 | 60.4 | 15.8 | 25.1 | 54.4 | 12.4 | 20.2 |
| S4 ( | 64.2 | 61.0 | 62.6 | 12.5 | 1.5 | 2.7 | 39.6 | 19.8 | 26.4 | 31.2 | 14.4 | 19.7 |
| S5 ( | 82.0 | 59.3 | 68.9 | 30.7 | 34.9 | 32.6 | 69.4 | 38.1 | 49.2 | 26.4 | 13.9 | 18.2 |
The best F-score among their submissions is described for each system; adapted from Fluck et al. (25).
Results of paired t-test between an approach with coreference resolution and an approach without it on the training dataset for each level
| Term | Function_S. | Function | Relation_S. | Relation | Statement | ||
|---|---|---|---|---|---|---|---|
| With coreference without coreference | t | 6.82 | 4.77 | 5.20 | 5.51 | 5.79 | 5.34 |
At the 95% confidence interval (df = 29), a score of ± 1.699 indicates a significance difference; all reported differences are significant.