| Literature DB >> 18834493 |
Larry Smith1, Lorraine K Tanabe, Rie Johnson nee Ando, Cheng-Ju Kuo, I-Fang Chung, Chun-Nan Hsu, Yu-Shi Lin, Roman Klinger, Christoph M Friedrich, Kuzman Ganchev, Manabu Torii, Hongfang Liu, Barry Haddow, Craig A Struble, Richard J Povinelli, Andreas Vlachos, William A Baumgartner, Lawrence Hunter, Bob Carpenter, Richard Tzong-Han Tsai, Hong-Jie Dai, Feng Liu, Yifei Chen, Chengjie Sun, Sophia Katrenko, Pieter Adriaans, Christian Blaschke, Rafael Torres, Mariana Neves, Preslav Nakov, Anna Divoli, Manuel Maña-López, Jacinto Mata, W John Wilbur.
Abstract
Nineteen teams presented results for the Gene Mention Task at the BioCreative II Workshop. In this task participants designed systems to identify substrings in sentences corresponding to gene name mentions. A variety of different methods were used and the results varied with a highest achieved F1 score of 0.8721. Here we present brief descriptions of all the methods used and a statistical analysis of the results. We also demonstrate that, by combining the results from all submissions, an F score of 0.9066 is feasible, and furthermore that the best result makes use of the lowest scoring submissions.Entities:
Mesh:
Year: 2008 PMID: 18834493 PMCID: PMC2559986 DOI: 10.1186/gb-2008-9-s2-s2
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Performance measures
| Rank | F | signif | % alt | ||
| 1 | 0.8848 | 0.8597 | 0.8721 | 4-19 | 32.48 |
| 2 | 0.8930 | 0.8449 | 0.8683 | 6-19 | 14.02 |
| 3 | 0.8493 | 0.8828 | 0.8657 | 6-19 | 14.08 |
| 4 | 0.8727 | 0.8541 | 0.8633 | 7-19 | 31.77 |
| 5 | 0.8577 | 0.8680 | 0.8628 | 7-19 | 16.67 |
| 6 | 0.8271 | 0.8932 | 0.8589 | 7-19 | 16.02 |
| 7 | 0.8697 | 0.8255 | 0.8470 | 8-19 | 14.83 |
| 8 | 0.8435 | 0.8139 | 0.8285 | 10-19 | 14.57 |
| 9 | 0.8628 | 0.7966 | 0.8284 | 10-19 | 14.55 |
| 10 | 0.8554 | 0.7683 | 0.8095 | 11-19 | 19.76 |
| 11 | 0.7295 | 0.8849 | 0.7997 | 13-19 | 16.82 |
| 12 | 0.9267 | 0.6891 | 0.7905 | 14-19 | 19.73 |
| 13 | 0.8883 | 0.6970 | 0.7811 | 15-19 | 37.05 |
| 14 | 0.8046 | 0.7361 | 0.7688 | 16-19 | 20.43 |
| 15 | 0.8228 | 0.7108 | 0.7627 | 17-19 | 16.80 |
| 16 | 0.8432 | 0.6857 | 0.7563 | 17-19 | 34.02 |
| 17 | 0.7168 | 0.6233 | 0.6668 | 18-19 | 28.23 |
| 18 | 0.6056 | 0.6411 | 0.6229 | 19 | 31.71 |
| 19 | 0.5009 | 0.4612 | 0.4802 | - | 28.46 |
The precision, recall, and F score for the best submitted run from each of 19 workshop participants, sorted by F score. Each team has an F score that has a statistically significant comparison (P < 0.05) with the teams indicated in the signif column. The column labeled % alt is the percentage of true positives in the submission that matched an ALTGENE annotation.
Features for combined performance
| Team | |
| Team | |
| Team | |
| Team | |
| Team | |
| Some team nominated a gene mention with | |
| Some team nominated a gene mention with | |
| Word | |
| Word | |
| Word | |
| Word |
The features generated for each candidate gene mention, based on the submitted runs.
Combined performance results
| Exp | Method | |||||
| A | CRF noalt, nom and word | 0.9255 | 0.8885 | 0.9066 | 1-19, C-F | 13.62 |
| B | BDT nom and word | 0.9221 | 0.8885 | 0.9050 | 1-19, C-F | 25.67 |
| C | BDT nom and word, top 10 teams | 0.9118 | 0.8768 | 0.8940 | 1-19, E, F | 23.37 |
| D | BDT nom only | 0.9092 | 0.8773 | 0.8929 | 1-19, E, F | 25.42 |
| E | BDT noalt, nom and word | 0.9242 | 0.8165 | 0.8670 | 7-19, F | 9.58 |
| F | BDT word only | 0.7165 | 0.6187 | 0.6640 | 18-19 | 37.07 |
The precision, recall, and F score of machine learning experiments to learn gene mentions using the data extracted from all submitted runs as features. Method column: BDT, boosted decision trees; CRF, conditional random fields; nom, all nomination features; word, words of candidate; noalt, alternate gene data not used. The column signif indicates the ranks of runs for which there was a significant difference, and the letters indicate the machine learning experiments for which there was a significant difference. The column % alt gives the percentage of alternate gene mentions among the resulting true positives.