| Literature DB >> 16504116 |
Richard Tzong-Han Tsai1, Shih-Hung Wu, Wen-Chi Chou, Yu-Chun Lin, Ding He, Jieh Hsiang, Ting-Yi Sung, Wen-Lian Hsu.
Abstract
BACKGROUND: Text mining in the biomedical domain is receiving increasing attention. A key component of this process is named entity recognition (NER). Generally speaking, two annotated corpora, GENIA and GENETAG, are most frequently used for training and testing biomedical named entity recognition (Bio-NER) systems. JNLPBA and BioCreAtIvE are two major Bio-NER tasks using these corpora. Both tasks take different approaches to corpus annotation and use different matching criteria to evaluate system performance. This paper details these differences and describes alternative criteria. We then examine the impact of different criteria and annotation schemes on system performance by retesting systems participated in the above two tasks.Entities:
Mesh:
Year: 2006 PMID: 16504116 PMCID: PMC1402329 DOI: 10.1186/1471-2105-7-92
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Spectrum of matching criteria.
Performance on the JNLPBA dataset with protein, DNA, and RNA merged into one category
| Zho [21] | Fin [17] | Set [22] | Son [23] | |
| Precision (%) | 70.92 | 70.93 | 70.71 | 67.26 |
| Recall (%) | 80.56 | 77.96 | 76.36 | 74.09 |
| F-score (%) | 75.43 | 74.28 | 73.43 | 70.51 |
Performance on the BioCreAtIvE dataset
| Zho [24] | Fin [25] | Mcd [26] | Son [27] | |
| Precision (%) | 82.00 | 79.20 | 86.40 | 80.00 |
| Recall (%) | 83.17 | 85.40 | 78.70 | 68.50 |
| F-score (%) | 82.58 | 82.20 | 82.40 | 73.80 |
Hypothesis testing on the equivalence of each matching criterion to BioCreAtIvE's multiple-tagging scheme
| Accept | |||||
| J-Exact | 74.20 | 1.92 | M = 80.25% | -14.07 | No |
| J-Left/Right | 84.19 | 1.17 | M = 80.25% | 15.01 | No |
| J-Approximate | 85.76 | 1.20 | M = 80.25% | 20.59 | No |
| J-Partial | 85.92 | 1.16 | M = 80.25% | 21.94 | No |
| J-Left | 79.72 | 1.20 | M = 80.25% | -1.95 | Yes |
| J-Right | 80.87 | 1.60 | M = 80.25% | 1.75 | Yes |
| J-Fragment | 83.83 | 1.82 | M = 80.25% | 8.81 | No |
*the condition for accepting H0 is t(0.025,≤ t0 ≤ t(0.975,, where t(0.025,= -2.093 and t(0.975,= 2.093.
Correlation coefficient of each matching criterion with BioCreAtIvE
| Zho | Fin | Set/Mcd | Son | Correlation coefficient | |
| BioCreAtIvE | 82.58% | 82.20% | 82.40% | 73.80% | - |
| J-Exact | 75.43% | 74.28% | 73.43% | 70.51% | 0.9286 |
| J-Left/Right | 83.75% | 84.88% | 84.46% | 82.73% | 0.8491 |
| J-Approximate | 84.88% | 86.68% | 86.34% | 85.24% | 0.3892 |
| J-Partial | 85.01% | 86.74% | 86.53% | 85.46% | 0.3476 |
| J-Left | 80.01% | 79.97% | 79.42% | 77.69% | 0.9688 |
| J-Right | 80.89% | 81.53% | 81.10% | 78.05% | 0.9788 |
| J-Fragment | 85.47% | 84.41% | 83.51% | 81.44% | 0.8926 |
Comparison of the best results using exact and relaxed evaluation
| Evaluation criterion | NE categories | Matching criterion | # of NE classes | Best system | Best performance(%) | ||
| Precision | Recall | F-score | |||||
| Exact | protein, DNA, RNA, cell line, cell type | exact match | 5 | Zho [22] | 69.4 | 76.0 | 72.6 |
| Relaxed | macromolecule (Protein + DNA + RNA), Cell (cell line + cell type) | right match | 2 | Fin [15] | 77.9 | 85.6 | 81.5 |
Results of the best JNLPBA participant system under different matching criteria
| Exact | Left/Right | Approximate | Partial | Uncategorized Partial | |
| Precision (%) | 69.4 | 77.0 | 77.3 | 77.4 | 83.9 |
| Recall (%) | 76.0 | 83.2 | 84.8 | 85.3 | 91.7 |
| F-score (%) | 72.6 | 80.0 | 80.8 | 81.2 | 87.7 |
Basic statistics of the JNLPBA dataset
| # abstracts | # sentences | # words | |
| Training Set | 2,000 | 18,546 | 472,006 (236.00/abs) (22.97/sen) |
| Test Set | 404 | 3,856 | 96,780 (239.55/abs) (22.72/sen) |
Absolute (and relative) frequencies of all NE classes in each part of the JNLPBA dataset
| Protein | DNA | RNA | Cell Type | Cell Line | All | |
| Training Set | 30,269 (59.0) | 9,533 (18.6) | 951 (1.9) | 6,718 (13.1) | 3,830 (7.5) | 51,301 (100) |
| Test Set | 5,067 (58.5) | 1,056 (12.2) | 118 (1.4) | 1,921 (22.2) | 500 (5.8) | 8,662 (100) |