| Literature DB >> 26201262 |
İlknur Karadeniz, Arzucan Özgür.
Abstract
BACKGROUND: Information regarding bacteria biotopes is important for several research areas including health sciences, microbiology, and food processing and preservation. One of the challenges for scientists in these domains is the huge amount of information buried in the text of electronic resources. Developing methods to automatically extract bacteria habitat relations from the text of these electronic resources is crucial for facilitating research in these areas.Entities:
Mesh:
Year: 2015 PMID: 26201262 PMCID: PMC4511461 DOI: 10.1186/1471-2105-16-S10-S5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Sample text. A sample input file containing bacteria and habitat entities.
Figure 2Workflow of the Sub-task 1 System.
Figure 3Sample output of the preprocessing, and the noun phrase extractor and simplifier.
Figure 4Discontinuous entity handling for the sample phrase .
Figure 5Continuous entity handling for the sample phrase .
Figure 6Ontology mapping example.
Figure 7Workflow of the Sentence-based Sub-task 2 System.
Detailed results on the test set for Sub-task 1 (Entity Boundary Detection & Ontology Categorization)
| Evaluation Metrics | Boun | Boun 2 |
|---|---|---|
| 112.70 | 115.24 | |
| 141 | 158 | |
| 89 | 74 | |
| 305.30 | 317.75 | |
| 0.68 | 0.68 | |
| 0.60 | 0.63 | |
| 0.59 | 0.57 | |
| 0.59 | 0.60 |
Comparison with the other systems that participated in the BB Sub-task 1 (Entity Boundary Detection & Ontology Categorization).
| System | SER | Recall | Precision | F-score |
|---|---|---|---|---|
| 0.66 | 0.61 | 0.61 | 0.61 | |
| 0.68 | 0.60 | 0.59 | 0.59 | |
| 0.68 | 0.35 | 0.62 | 0.44 | |
| 0.68 | 0.63 | 0.57 | 0.60 | |
| 0.93 | 0.72 | 0.48 | 0.57 |
The results obtained on the test set are reported.
Effect of discontinuous entity handling (DEH).
| Boun 2 | Boun 2 - DEH | |
|---|---|---|
| 0.66 | 0.67 | |
| 0.67 | 0.68 | |
| 0.68 | 0.68 |
The results are reported on the training, development, and test sets.
Effect of entity modifier handling.
| SER Train | SER Dev | SER Test | |
|---|---|---|---|
| 0.66 | 0.67 | 0.68 | |
| 0.68 | 0.67 | 0.70 | |
| 0.72 | 0.72 | 0.72 | |
| 0.67 | 0.67 | 0.68 |
The results are reported on the training, development, and test sets.
Results of BB Sub-task 2 (Localization and PartOf Event Extraction).
| System | Type | Recall | Precision | F-score |
|---|---|---|---|---|
| Localization | 0.61 | 0.54 | 0.57 | |
| PartOf | 0.20 | 0.32 | 0.25 | |
| Localization | 0.23 | 0.38 | 0.29 | |
| PartOf | 0.15 | 0.40 | 0.22 | |
The results obtained on the test set are reported
Comparison with the other systems that participated in the BB Sub-task 2 (Localization and PartOf Event Extraction).
| System | Recall | Precision | F-score |
|---|---|---|---|
| 0.53 | 0.52 | 0.53 | |
| 0.28 | 0.82 | 0.42 | |
| 0.36 | 0.46 | 0.40 | |
| 0.21 | 0.38 | 0.27 | |
| 0.04 | 0.19 | 0.06 |
The results obtained on the test set are reported.
Effects of Anaphora Resolution Module and Syntax Rules (Localization and PartOf Event Extraction).
| System | Recall | Precision | F-score |
|---|---|---|---|
| 0.46 | 0.42 | 0.44 | |
| 0.36 | 0.45 | 0.40 | |
| 0.45 | 0.42 | 0.43 | |
| 0.46 | 0.42 | 0.44 |
The results obtained on the training set are reported.
Effects of Anaphora Resolution Module and Syntax Rules (Localization and PartOf Event Extraction).
| System | Recall | Precision | F-score |
|---|---|---|---|
| 0.55 | 0.40 | 0.46 | |
| 0.50 | 0.44 | 0.47 | |
| 0.54 | 0.40 | 0.46 | |
| 0.53 | 0.42 | 0.47 |
The results obtained on the development set are reported.
Effects of Anaphora Resolution Module and Syntax Rules (Localization and PartOf Event Extraction).
| System | Recall | Precision | F-score |
|---|---|---|---|
| 0.53 | 0.52 | 0.53 | |
| 0.46 | 0.56 | 0.50 | |
| 0.52 | 0.52 | 0.52 | |
| 0.50 | 0.55 | 0.52 |
The results obtained on the test set are reported.