| Literature DB >> 15960838 |
Shuhei Kinoshita1, K Bretonnel Cohen, Philip V Ogren, Lawrence Hunter.
Abstract
BACKGROUND: Our approach to Task 1A was inspired by Tanabe and Wilbur's ABGene system. Like Tanabe and Wilbur, we approached the problem as one of part-of-speech tagging, adding a GENE tag to the standard tag set. Where their system uses the Brill tagger, we used TnT, the Trigrams 'n' Tags HMM-based part-of-speech tagger. Based on careful error analysis, we implemented a set of post-processing rules to correct both false positives and false negatives. We participated in both the open and the closed divisions; for the open division, we made use of data from NCBI.Entities:
Mesh:
Year: 2005 PMID: 15960838 PMCID: PMC1869018 DOI: 10.1186/1471-2105-6-S1-S4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
BioCreAtIvE Data Sets
| 7500 | 8876 | 46.1% | 25.7% | 14.9% | 6.6% | 6.6% | |
| 2500 | 2975 | 46.6% | 23.9% | 15.1% | 6.7% | 7.7% | |
| 5000 | 5949 | 46.1% | 26.7% | 14.3% | 6.2% | 6.7% |
This table shows the BioCreAtIvE data including the ratio for the word length, which shows same tendency among sets.
Figure 1Precision and Recall. Figure 1A shows the precision and recall for the cross validation data. Figure 1B shows the precision and recall for the official test data. The expression "w/o post-p" is used as "without post-processing".
The term-level score comparison between the cross-validation and official test
| No post-processing | 68.0% | 76.5% | 72.0% | |
| With post-processing | 82.0% | 81.1% | 81.6% | |
| No post-processing | 68.0% | 77.2% | 72.3% | |
| With post-processing | 80.3% | 80.5% | 80.4% | |
| With post-processing, dictionary | 80.1% | 81.8% | 80.9% | |
This table shows the term-level scores about the cross-validation data and official test.
Performance on one-word false positives
| 818 | 717 | 101 | 12.3% | |
| 162 | 95 | 67 | 41.4% | |
| 440 | 65 | 375 | 85.2% | |
| 1420 | 877 | 543 | 38.2% | |
85% of one-word false positives that correspond to a word, which was seen twice or more times, were corrected with post-processing procedures.
Figure 2Effect of term length on performance. Figure 2A shows the effect of term length for the cross validation data. Figure 2B shows the effect of term length for the official test data.
The effect of the post-processing procedures on overall system performance.
| 82.0% | 68.0% | 74.2% | 81.0% | 73.1% | |
| 81.1% | 76.6% | 80.1% | 79.8% | 79.8% | |
| 81.6% | 72.0% | 77.0% | 80.4% | 76.3% | |
| base | -9.5% | -4.5% | -1.2% | -5.2% | |
This table shows the effects of each post-processing procedures in comparison with the all post-processing results. For example, No rule column shows the results without rule-based post-processing, that shows 4.5% lower score than All Post-processing in F-measure.
The results of choosing GENE tag set
| 67.9% | 67.0% | 66.4% | 68.9% | |
| 77.2% | 78.2% | 77.2% | 77.3% | |
| 72.2% | 72.1% | 71.4% | 72.9% | |
| -0.62% | -0.72% | -1.43% | base |
This table shows the results of difference from the Tag Set 4, which is our choice.
Symbol count in LocusLink database
| 2,438 | 68,805 | 55,730 | 69,654 | 43,805 | 9,221 | 5,120 | 24,234 | 279,007 |
This table shows the count of symbol per species.