| Literature DB >> 17134476 |
Fabio Rinaldi1, Gerold Schneider, Kaarel Kaljurand, Michael Hess, Martin Romacker.
Abstract
BACKGROUND: The biomedical domain is witnessing a rapid growth of the amount of published scientific results, which makes it increasingly difficult to filter the core information. There is a real need for support tools that 'digest' the published results and extract the most important information.Entities:
Mesh:
Year: 2006 PMID: 17134476 PMCID: PMC1764447 DOI: 10.1186/1471-2105-7-S3-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Example of Dependency Tree. Tree of dependencies for a GENIA sentence, along with other linguistic annotations. Notice the additional deep-linguistic "control" subject dependency between token 7 and 4.
Evaluation on Carroll's test suite on subj, obj, PP-attachment and subordinate clause relations.
| GREVAL | Subject | Object | noun-PP | verb-PP | subord. clause |
| Precision | 92.4% | 89.1% | 74.4% | 72.4% | 68.2 |
| Recall | 81.0% | 83.9% | 65.5% | 84.8% | n/a |
| GENIA100 | Subject | Object | noun-PP | verb-PP | subord. clause |
| Precision | 90.0% | 94.1% | 83.3% | 81.7% | 71.1% |
| Recall | 86.2% | 94.9% | 81.9% | 84.2% | 75.0% |
Figure 2Sample Output. Sample output for the 'activate' relation.
Analysis of precision for selected relations over GENIA
| agent | target | |||||||
| Y | A | P | N | Y | A | P | N | |
| activate | 72 | 64 | 5 | 8 | 77 | 54 | 8 | 10 |
| bind | 36 | 18 | 1 | 8 | 39 | 18 | 1 | 5 |
| block | 3 | 0 | 0 | 0 | 1 | 1 | 0 | 1 |
| TOTAL | 111 | 82 | 6 | 16 | 117 | 73 | 9 | 16 |
| 52% | 38% | 3% | 7% | 55% | 34% | 4% | 7% | |
| correct 90% | incorrect 10% | correct 89% | incorrect 11% | |||||
Estimate of recall. Extrapolated percentages are in boldface
| Corpus | Relation | Recall | Coverage (at least 1 dep) | Coverage (2 dep) |
| ATCR (observed) | control | 60% | 106 out of 12 9 | 59 out of 129 |
| regulate | 60% | 116 out of 161 | 58 out of 161 | |
| GENIA (estimated) | control | 304 out of 541 | 155 out of 541 | |
| regulate | 887 out of 1125 | 339 out of 1125 |