| Literature DB >> 22151901 |
Zhiyong Lu1, Hung-Yu Kao, Chih-Hsuan Wei, Minlie Huang, Jingchen Liu, Cheng-Ju Kuo, Chun-Nan Hsu, Richard Tzong-Han Tsai, Hong-Jie Dai, Naoaki Okazaki, Han-Cheol Cho, Martin Gerner, Illes Solt, Shashank Agarwal, Feifan Liu, Dina Vishnyakova, Patrick Ruch, Martin Romacker, Fabio Rinaldi, Sanmitra Bhattacharya, Padmini Srinivasan, Hongfang Liu, Manabu Torii, Sergio Matos, David Campos, Karin Verspoor, Kevin M Livingston, W John Wilbur.
Abstract
BACKGROUND: We report the Gene Normalization (GN) challenge in BioCreative III where participating teams were asked to return a ranked list of identifiers of the genes detected in full-text articles. For training, 32 fully and 500 partially annotated articles were prepared. A total of 507 articles were selected as the test set. Due to the high annotation cost, it was not feasible to obtain gold-standard human annotations for all test articles. Instead, we developed an Expectation Maximization (EM) algorithm approach for choosing a small number of test articles for manual annotation that were most capable of differentiating team performance. Moreover, the same algorithm was subsequently used for inferring ground truth based solely on team submissions. We report team performance on both gold standard and inferred ground truth using a newly proposed metric called Threshold Average Precision (TAP-k).Entities:
Mesh:
Year: 2011 PMID: 22151901 PMCID: PMC3269937 DOI: 10.1186/1471-2105-12-S8-S2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Statistics of annotated gene ids in the different data sets.
| Set | Description | Min | Max | Mean | Median | St.dev. |
|---|---|---|---|---|---|---|
| 1 | Training Set (32 articles) | 4 | 147 | 19 | 14 | 24 |
| 2 | Test Set (50 articles – gold standard) | 0 | 375 | 33 | 19 | 63 |
| 3 | Test Set (507 articles – silver standard) | 0 | 375 | 18 | 12 | 27 |
Statistics of species distribution in the different data sets.
| # | Training Set (32 articles) | Test Set (50 articles) | Test Set (507 articles) |
|---|---|---|---|
| 1 | S. cereviaiae (27%) | Enterobacter sp. 638 (23%) | H. Sapiens (42%) |
| 2 | H. sapiens (20%) | M. musculus (14%) | M. musulus (24%) |
| 3 | M. musculus (12%) | H. Sapiens (11%) | D. melanogaster (6%) |
| 4 | D. melanogaster (10%) | S. pneumoniae TIGR4 (9%) | S. cerevisiae S228c (6%) |
| 5 | D. rerio (7%) | S. scrofa (5%) | Enterobacter sp. 638 (4%) |
| 6 | A. thaliana (5%) | M. oryzae 70-15 (4%) | R. norvegicus (4%) |
| 7 | C. elegans (3%) | D. melanogaster (4%) | A. thaliana (2%) |
| 8 | X. laevis (3%) | R. norvegicus (3%) | C. elegans (2%) |
| 9 | R. norvegicus (2%) | S. cerevisiae S228c(2%) | S. pneumoniae TIGR4 (2%) |
| 10 | G. gallus (2%) | E. histolytica HM-1 (2%) | S. scrofa (1%) |
| 11+ | Other 18 species (9%) | Other 65 species (23%) | Other 91 species (7%) |
Figure 1Percentage of articles annotated with different numbers of species in various data sets. Training (32) refers to the human annotations on the 32 articles in the training set. Test (50) and Test (507) refer to the gold standard and silver standard annotations on the 50 and 507 articles in the test set, respectively.
Team evaluation results on the gold standard annotations of 50 documents. Results are sorted by team numbers. All gold standard annotations were provided directly by humans.
| Team_Run | TAP (K=5) | TAP (K=10) | TAP (K=20) |
|---|---|---|---|
| T63_R1 | 0.0340 | 0.0488 | 0.0725 |
| T63_R2 | 0.0296 | 0.0458 | 0.0643 |
| T65_R1 | 0.0714 | 0.0986 | 0.1048 |
| T65_R2 | 0.0915 | 0.1097 | 0.1183 |
| T68_R1 | 0.1621 | 0.1876 | 0.2049 |
| T68_R2 | 0.1285 | 0.1460 | 0.1782 |
| T70_R1 | 0.0566 | 0.0566 | 0.0566 |
| T70_R2 | 0.0622 | 0.0622 | 0.0622 |
| T70_R3 | 0.0718 | 0.0718 | 0.0718 |
| T74_R1 | 0.2137 | 0.2509 | 0.2509 |
| T74_R2 | 0.2083 | 0.2480 | 0.2480 |
| T74_R3 | 0.2099 | 0.2495 | 0.2495 |
| T78_R1 | 0.0584 | 0.0741 | 0.1129 |
| T78_R2 | 0.0847 | 0.1202 | 0.1706 |
| T78_R3 | 0.0847 | 0.1128 | 0.1426 |
| T80_R1 | 0.1084 | 0.1581 | 0.1646 |
| T80_R2 | 0.0382 | 0.0516 | 0.0588 |
| T80_R3 | 0.0329 | 0.0437 | 0.0521 |
| 0.3254 | |||
| T83_R2 | 0.3216 | 0.3435 | 0.3435 |
| 0.3514 | 0.3514 | ||
| T89_R1 | 0.1205 | 0.1205 | 0.1363 |
| T89_R2 | 0.1553 | 0.1553 | 0.1652 |
| T89_R3 | 0.1295 | 0.1548 | 0.1548 |
| T93_R1 | 0.1651 | 0.1902 | 0.2075 |
| T93_R2 | 0.1560 | 0.1858 | 0.2062 |
| T93_R3 | 0.1662 | 0.1916 | 0.2096 |
| T97_R1 | 0.0727 | 0.0939 | 0.1026 |
| T97_R2 | 0.0649 | 0.0872 | 0.0974 |
| T97_R3 | 0.0727 | 0.0939 | 0.1026 |
| T98_R1 | 0.2835 | 0.3012 | 0.3103 |
| T98_R2 | 0.2909 | 0.3079 | 0.3087 |
| T98_R3 | 0.3013 | 0.3183 | 0.3303 |
| T101_R1 | 0.1896 | 0.2288 | 0.2385 |
| T101_R2 | 0.1672 | 0.2150 | 0.2418 |
| T101_R3 | 0.1812 | 0.2141 | 0.2425 |
Team evaluation results on the 50 and 507 articles using the silver standard annotations. Results are sorted by team numbers. All silver standard annotations were derived by the EM algorithm applied to team submissions over the full set of 507 test articles. The silver-standard annotations of the 50 selected articles (columns 2-4) are taken from the silver-standard annotations obtained on the 507 articles.
| Team_Runs | Using silver standard (50 selected articles) | Using silver standard (All 507 articles) | ||||
|---|---|---|---|---|---|---|
| TAP (K=5) | TAP K=10 | TAP (K=20) | TAP (K = 5) | TAP (K = 10) | TAP (K = 20) | |
| T63_R1 | 0.0504 | 0.1059 | 0.1438 | 0.1584 | 0.1961 | 0.1980 |
| T63_R2 | 0.0393 | 0.0998 | 0.1355 | 0.1415 | 0.1890 | 0.1982 |
| T65_R1 | 0.1039 | 0.1302 | 0.1532 | 0.1549 | 0.1818 | 0.2030 |
| T65_R2 | 0.1133 | 0.1360 | 0.1581 | 0.1573 | 0.1868 | 0.2097 |
| T68_R1 | 0.2282 | 0.2768 | 0.3221 | 0.3614 | 0.3787 | 0.3753 |
| T68_R2 | 0.2136 | 0.2978 | 0.2978 | 0.3468 | 0.3641 | 0.3608 |
| T70_R1 | 0.0130 | 0.0130 | 0.0130 | 0.1227 | 0.1227 | 0.1227 |
| T70_R2 | 0.0166 | 0.0166 | 0.0166 | 0.1323 | 0.1323 | 0.1323 |
| T70_R3 | 0.0560 | 0.0560 | 0.0560 | 0.1579 | 0.1579 | 0.1579 |
| T74_R1 | 0.3820 | 0.3820 | 0.3820 | 0.4873 | 0.4873 | 0.4873 |
| T74_R2 | 0.3855 | 0.3855 | 0.3855 | 0.4871 | 0.4871 | 0.4871 |
| 0.3890 | 0.3890 | |||||
| T78_R1 | 0.0552 | 0.0786 | 0.1152 | 0.1237 | 0.1529 | 0.1900 |
| T78_R2 | 0.1058 | 0.1592 | 0.2166 | 0.2561 | 0.2751 | 0.2751 |
| T78_R3 | 0.0979 | 0.1440 | 0.1997 | 0.2273 | 0.2765 | 0.2872 |
| T80_R1 | 0.2579 | 0.2840 | 0.2840 | 0.4056 | 0.4056 | 0.4056 |
| T80_R2 | 0.0716 | 0.1150 | 0.1220 | 0.2281 | 0.2281 | 0.2281 |
| T80_R3 | 0.0792 | 0.1269 | 0.1329 | 0.2332 | 0.2397 | 0.2397 |
| T83_R1 | 0.3567 | 0.3600 | 0.3600 | 0.4591 | 0.4591 | 0.4591 |
| T83_R2 | 0.3291 | 0.3291 | 0.3291 | 0.4323 | 0.4323 | 0.4323 |
| T83_R3 | 0.3382 | 0.3382 | 0.3382 | 0.4327 | 0.4327 | 0.4327 |
| T89_R1 | 0.1767 | 0.2251 | 0.2251 | 0.2783 | 0.3111 | 0.3111 |
| T89_R2 | 0.2161 | 0.2617 | 0.2992 | 0.2721 | 0.3057 | 0.3057 |
| T89_R3 | 0.2091 | 0.2091 | 0.2091 | 0.2977 | 0.2977 | 0.2977 |
| T93_R1 | 0.2614 | 0.3093 | 0.3093 | 0.4039 | 0.4039 | 0.4039 |
| T93_R2 | 0.2101 | 0.2625 | 0.2966 | 0.3709 | 0.3820 | 0.3820 |
| T93_R3 | 0.2553 | 0.3048 | 0.3048 | 0.4061 | 0.4061 | 0.4061 |
| T97_R1 | 0.1094 | 0.1317 | 0.1566 | 0.1396 | 0.1676 | 0.1918 |
| T97_R2 | 0.0858 | 0.1133 | 0.1352 | 0.1344 | 0.1601 | 0.1829 |
| T97_R3 | 0.1094 | 0.1317 | 0.1566 | 0.1396 | 0.1676 | 0.1918 |
| T98_R1 | 0.3343 | 0.3535 | 0.3629 | 0.3818 | 0.3899 | 0.3875 |
| T98_R2 | 0.3354 | 0.3543 | 0.3634 | 0.3790 | 0.3878 | 0.3868 |
| 0.3710 | 0.4086 | 0.4511 | 0.4648 | |||
| T101_R1 | 0.3590 | 0.3859 | 0.3859 | 0.4289 | 0.4289 | 0.4289 |
| T101_R2 | 0.3239 | 0.3945 | 0.4132 | 0.4294 | 0.4408 | 0.4408 |
| T101_R3 | 0.3258 | 0.4109 | 0.4109 | 0.4536 | 0.4536 | 0.4536 |
TAP scores of machine learning experiments in combining team submissions in the composite system.
| Systems | TAP-5 | TAP-10 | TAP-20 |
|---|---|---|---|
| Best team result | 0.3297 | 0.3538 | 0.3535 |
| Composite system with 14 features (N=0) | 0.3527 | 0.4241 | 0.4435 |
| Composite system with 84 features (N=5) | 0.3594 | ||
| Composite system with 154 features (N=10) | 0.4318 | 0.4454 |
Public software and resources used in the GN task.
| Step | Software | Other Public Resources |
|---|---|---|
| 1) | ABNER [ | Entrez Gene [ |
| 2) | Linnaeus [ | NCBI Taxonomy |
| 3) & 4) | Entrez Gene [ | BioThesaurus [ |