| Literature DB >> 16722581 |
William R Hersh1, Ravi Teja Bhupatiraju, Laura Ross, Phoebe Roberts, Aaron M Cohen, Dale F Kraemer.
Abstract
BACKGROUND: The goal of the TREC Genomics Track is to improve information retrieval in the area of genomics by creating test collections that will allow researchers to improve and better understand failures of their systems. The 2004 track included an ad hoc retrieval task, simulating use of a search engine to obtain documents about biomedical topics. This paper describes the Genomics Track of the Text Retrieval Conference (TREC) 2004, a forum for evaluation of IR research systems, where retrieval in the genomics domain has recently begun to be assessed.Entities:
Year: 2006 PMID: 16722581 PMCID: PMC1440302 DOI: 10.1186/1747-5333-1-3
Source DB: PubMed Journal: J Biomed Discov Collab ISSN: 1747-5333
Figure 1Steps in deriving knowledge from the biomedical literature and related application areas. This figure depicts the "funnel" of literature that occurs when a user seeks information and knowledge. The related information applications are shown to the right. The step of going from the entire literature to possibly relevant references is usually performed by an information retrieval system, whereas the step of identifying definitely relevant references and knowledge within them is the task of an information extraction system (or a person, since the state of information extraction is less developed than information retrieval).
Topics and associated data. Ad hoc retrieval topics, number of relevant documents, and average results for all runs.
| Topic | Pool | Definitely Relevant | Possibly Relevant | Not Relevant | Definitely & Possibly Relevant | MAP average | P@10 average | P@100 average |
| 1 | 879 | 38 | 41 | 800 | 79 | 0.3073 | 0.7383 | 0.2891 |
| 2 | 1264 | 40 | 61 | 1163 | 101 | 0.0579 | 0.2787 | 0.1166 |
| 3 | 1189 | 149 | 32 | 1008 | 181 | 0.0950 | 0.3298 | 0.2040 |
| 4 | 1170 | 12 | 18 | 1140 | 30 | 0.0298 | 0.0894 | 0.0360 |
| 5 | 1171 | 5 | 19 | 1147 | 24 | 0.0564 | 0.1340 | 0.0349 |
| 6 | 787 | 41 | 53 | 693 | 94 | 0.3993 | 0.8468 | 0.3938 |
| 7 | 730 | 56 | 59 | 615 | 115 | 0.2006 | 0.4936 | 0.2704 |
| 8 | 938 | 76 | 85 | 777 | 161 | 0.0975 | 0.3872 | 0.2094 |
| 9 | 593 | 103 | 12 | 478 | 115 | 0.6114 | 0.7957 | 0.6196 |
| 10 | 1126 | 3 | 1 | 1122 | 4 | 0.5811 | 0.2532 | 0.0277 |
| 11 | 742 | 87 | 24 | 631 | 111 | 0.3269 | 0.5894 | 0.3843 |
| 12 | 810 | 166 | 90 | 554 | 256 | 0.4225 | 0.7234 | 0.5866 |
| 13 | 1118 | 5 | 19 | 1094 | 24 | 0.0288 | 0.1021 | 0.0274 |
| 14 | 948 | 13 | 8 | 927 | 21 | 0.0479 | 0.0894 | 0.0270 |
| 15 | 1111 | 50 | 40 | 1021 | 90 | 0.1388 | 0.2915 | 0.1800 |
| 16 | 1078 | 94 | 53 | 931 | 147 | 0.1926 | 0.4489 | 0.2883 |
| 17 | 1150 | 2 | 1 | 1147 | 3 | 0.0885 | 0.0511 | 0.0115 |
| 18 | 1392 | 0 | 1 | 1391 | 1 | 0.6254 | 0.0660 | 0.0072 |
| 19 | 1135 | 0 | 1 | 1134 | 1 | 0.1594 | 0.0362 | 0.0062 |
| 20 | 814 | 55 | 61 | 698 | 116 | 0.1466 | 0.3957 | 0.2238 |
| 21 | 676 | 26 | 54 | 596 | 80 | 0.2671 | 0.4702 | 0.2796 |
| 22 | 1085 | 125 | 85 | 875 | 210 | 0.1354 | 0.4234 | 0.2709 |
| 23 | 915 | 137 | 21 | 757 | 158 | 0.1835 | 0.3745 | 0.2747 |
| 24 | 952 | 7 | 19 | 926 | 26 | 0.5970 | 0.7468 | 0.1685 |
| 25 | 1142 | 6 | 26 | 1110 | 32 | 0.0331 | 0.1000 | 0.0330 |
| 26 | 792 | 35 | 12 | 745 | 47 | 0.4401 | 0.7298 | 0.2411 |
| 27 | 755 | 19 | 10 | 726 | 29 | 0.2640 | 0.4319 | 0.1355 |
| 28 | 836 | 6 | 7 | 823 | 13 | 0.2031 | 0.2532 | 0.0643 |
| 29 | 756 | 33 | 10 | 713 | 43 | 0.1352 | 0.1809 | 0.1515 |
| 30 | 1082 | 101 | 64 | 917 | 165 | 0.2116 | 0.4872 | 0.3113 |
| 31 | 877 | 0 | 138 | 739 | 138 | 0.0956 | 0.2489 | 0.2072 |
| 32 | 1107 | 441 | 55 | 611 | 496 | 0.1804 | 0.6085 | 0.4787 |
| 33 | 812 | 30 | 34 | 748 | 64 | 0.1396 | 0.2234 | 0.1647 |
| 34 | 778 | 1 | 30 | 747 | 31 | 0.0644 | 0.0830 | 0.0668 |
| 35 | 717 | 253 | 18 | 446 | 271 | 0.3481 | 0.8213 | 0.6528 |
| 36 | 676 | 164 | 90 | 422 | 254 | 0.4887 | 0.7638 | 0.6700 |
| 37 | 476 | 138 | 11 | 327 | 149 | 0.5345 | 0.7426 | 0.6564 |
| 38 | 1165 | 334 | 89 | 742 | 423 | 0.1400 | 0.5915 | 0.4043 |
| 39 | 1350 | 146 | 171 | 1033 | 317 | 0.0984 | 0.3936 | 0.2689 |
| 40 | 1168 | 134 | 143 | 891 | 277 | 0.1080 | 0.3936 | 0.2796 |
| 41 | 880 | 333 | 249 | 298 | 582 | 0.3356 | 0.6766 | 0.6521 |
| 42 | 1005 | 191 | 506 | 308 | 697 | 0.1587 | 0.6596 | 0.5702 |
| 43 | 739 | 25 | 170 | 544 | 195 | 0.1185 | 0.6915 | 0.2553 |
| 44 | 1224 | 485 | 164 | 575 | 649 | 0.1323 | 0.6149 | 0.4632 |
| 45 | 1139 | 108 | 48 | 983 | 156 | 0.0286 | 0.1574 | 0.0711 |
| 46 | 742 | 111 | 86 | 545 | 197 | 0.2630 | 0.7362 | 0.4981 |
| 47 | 1450 | 81 | 284 | 1085 | 365 | 0.0673 | 0.3149 | 0.2355 |
| 48 | 1121 | 53 | 102 | 966 | 155 | 0.1712 | 0.4021 | 0.2557 |
| 49 | 1100 | 32 | 41 | 1027 | 73 | 0.2279 | 0.5404 | 0.2049 |
| 50 | 1091 | 79 | 223 | 789 | 302 | 0.0731 | 0.3447 | 0.2534 |
| Mean | 975.1 | 92.6 | 72.8 | 809.7 | 165.4 | 0.2171 | 0.4269 | 0.2637 |
| Median | 978.5 | 54 | 44.5 | 783 | 115.5 | 0.1590 | 0.3989 | 0.2472 |
| Min | 476 | 0 | 1 | 298 | 1 | 0.0286 | 0.0362 | 0.0062 |
| Max | 1450 | 485 | 506 | 1391 | 697 | 0.6254 | 0.8468 | 0.6700 |
Kappa results. Kappa results for inter-judge agreement in relevant judgments for ad hoc retrieval task.
| Judge 2 | Definitely relevant | Possibly relevant | Not relevant | Total |
| Judge 1 | ||||
| Definitely relevant | 62 | 35 | 8 | 105 |
| Possibly relevant | 11 | 11 | 5 | 27 |
| Not relevant | 14 | 57 | 456 | 527 |
| Total | 87 | 103 | 469 | 659 |
Ad hoc retrieval results. All runs, sorted by mean average precision.
| pllsgen4a2 | patolis.fujita [20] | A | 0.4075 | 6.04 | 41.96 |
| uwmtDg04tn | u.waterloo.clarke [21] | A | 0.3867 | 6.24 | 42.1 |
| pllsgen4a1 | patolis.fujita [20] | A | 0.3689 | 5.7 | 39.36 |
| THUIRgen01 | tsinghua.ma [22] | M | 0.3435 | 5.82 | 39.24 |
| THUIRgen02 | tsinghua.ma [22] | A | 0.3434 | 5.94 | 39.44 |
| utaauto | u.tampere [24] | A | 0.3324 | 5.02 | 32.26 |
| uwmtDg04n | u.waterloo.clarke [21] | A | 0.3318 | 5.68 | 36.84 |
| PSE | german.u.cairo [35] | A | 0.3308 | 5.86 | 36.66 |
| tnog3 | tno.kraaij [36] | A | 0.3247 | 5.6 | 36.56 |
| tnog2 | tno.kraaij [36] | A | 0.3196 | 5.62 | 36.04 |
| utamanu | u.tampere [24] | M | 0.3128 | 6.52 | 38.88 |
| aliasiBase | alias-i [23] | A | 0.3094 | 5.38 | 34.58 |
| ConversManu | converspeech [37] | M | 0.2931 | 5.82 | 37.18 |
| RMITa | rmit.scholer [38] | A | 0.2796 | 5.12 | 31.4 |
| aliasiTerms | alias-i [23] | A | 0.2656 | 4.8 | 30.3 |
| akoike | u.tokyo (none) | M | 0.2427 | 4.48 | 31.3 |
| OHSUNeeds | ohsu.hersh [25] | A | 0.2343 | 3.84 | 26.46 |
| tgnSplit | tarragon [39] | A | 0.2319 | 4.86 | 29.26 |
| UIowaGN1 | u.iowa [40] | A | 0.2316 | 4.76 | 28.5 |
| tq0 | nlm.umd.ul [28] | A | 0.2277 | 5.12 | 30.1 |
| OHSUAll | ohsu.hersh [25] | A | 0.2272 | 4.32 | 27.76 |
| LHCUMDSE | nlm.umd.ul [28] | A | 0.2191 | 3.9 | 24.18 |
| akoyama | u.tokyo (none) | M | 0.2155 | 4.52 | 25.62 |
| PDTNsmp4 | u.padova [41] | A | 0.2074 | 4.56 | 23.18 |
| PD50501 | u.padova [41] | A | 0.2059 | 4.42 | 25.18 |
| RMITb | rmit.scholer [38] | A | 0.2059 | 4.56 | 27.26 |
| UBgtNormJM1 | suny.buffalo [42] | A | 0.2043 | 4.34 | 25.38 |
| ConversAuto | converspeech [37] | A | 0.2013 | 3.88 | 22.8 |
| york04g2 | york.u [43] | M | 0.2011 | 5.5 | 25.8 |
| tgnNecaux | tarragon [39] | A | 0.1951 | 4.08 | 23.58 |
| lga1 | indiana.u.seki [26] | A | 0.1833 | 3.08 | 22.86 |
| york04g1 | york.u [43] | A | 0.1794 | 4.14 | 26.96 |
| lga2 | indiana.u.seki [26] | A | 0.1754 | 3.1 | 20.22 |
| rutgersGAH1 | rutgers.dayanik [44] | A | 0.1702 | 4.66 | 26.76 |
| wdvqlxa1 | indiana.u.yang [45] | A | 0.1582 | 4.2 | 24.78 |
| wdvqlx1 | indiana.u.yang [45] | A | 0.1569 | 4.26 | 24.26 |
| DCUmatn1 | dubblincity.u [46] | M | 0.1388 | 3.28 | 17.84 |
| BioTextAdHoc | u.cberkeley.hearst [27] | A | 0.1384 | 3.76 | 23.76 |
| shefauto2 | u.sheffield.gaizauskas [47] | A | 0.1304 | 3.66 | 18.5 |
| rutgersGAH2 | rutgers.dayanik [44] | A | 0.1303 | 3.42 | 19.48 |
| shefauto1 | u.sheffield.gaizauskas [47] | A | 0.1294 | 3.54 | 18.92 |
| run1 | utwente (none) | M | 0.1176 | 1.5 | 10.5 |
| MeijiHilG | meiji.u [48] | A | 0.0924 | 2.1 | 15.24 |
| DCUma | dubblincity.u [46] | M | 0.0895 | 2.4 | 15.46 |
| csusm | u.sanmarcos [49] | M | 0.0123 | 0.44 | 1.6 |
| edinauto2 | u.edinburgh.sinclair [50] | A | 0.0017 | 0.46 | 1.6 |
| edinauto5 | u.edinburgh.sinclair [50] | A | 0.0012 | 0.36 | 1.3 |
| Mean | 0.2074 | 4.48 | 26.46 |
Figure 2Ad hoc retrieval runs sorted by MAP. This figure shows all of the runs (x-axis) sorted by MAP (y-axis). The highest run to obtain statistical significance (RMITa) from the top run (pllsgen4a2) is denoted, along with the "out of the box" TF*IDF run (OHSUNeeds) are annotated. (Only every fifth run identifier is shown to make the x-axis more readable.)
Figure 3MAP by topic for the ad hoc task.
Figure 4MAP by topic for the ad hoc task sorted by MAP.
Figure 5The maximum MAP plotted vs. average MAP for the ad hoc retrieval task runs.
Figure 6The number of relevant per topic plotted vs. MAP for the ad hoc retrieval task