Literature DB >> 18693910

A comparative analysis of retrieval features used in the TREC 2006 Genomics Track passage retrieval task.

Hari Krishna Rekapalli1, Aaron M Cohen, William R Hersh.   

Abstract

OBJECTIVE: Identify the set of features that best explained the variation in the performance measure of TREC 2006 Genomics information extraction task, Mean Average Passage Precision (MAPP).
METHODS: A multivariate regression model was built using a backward-elimination approach as a function of certain generalized features that were common to all the algorithms used by TREC 2006 Genomics track participants.
RESULTS: Our regression analysis found that the following four factors were collectively associated with variation in MAPP: (1) Normalization of keywords in the query (2) Use of Entrez gene thesaurus for synonymous terms look-up (3) Unit of text retrieved using respective IR algorithms and (4) The way a passage was defined.
CONCLUSION: These reasonably likely hypotheses, generated by an exploratory data analysis, are informative in understanding results of the TREC 2006 Genomics passage extraction task. This approach has general value for analyzing the results of similar common challenge tasks.

Entities:  

Mesh:

Year:  2007        PMID: 18693910      PMCID: PMC2655837     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  1 in total

1.  Entrez Gene: gene-centered information at NCBI.

Authors:  Donna Maglott; Jim Ostell; Kim D Pruitt; Tatiana Tatusova
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

  1 in total
  3 in total

1.  Automated identification of molecular effects of drugs (AIMED).

Authors:  Safa Fathiamini; Amber M Johnson; Jia Zeng; Alejandro Araya; Vijaykumar Holla; Ann M Bailey; Beate C Litzenburger; Nora S Sanchez; Yekaterina Khotskaya; Hua Xu; Funda Meric-Bernstam; Elmer V Bernstam; Trevor Cohen
Journal:  J Am Med Inform Assoc       Date:  2016-04-23       Impact factor: 4.497

2.  The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text.

Authors:  Martin Krallinger; Miguel Vazquez; Florian Leitner; David Salgado; Andrew Chatr-Aryamontri; Andrew Winter; Livia Perfetto; Leonardo Briganti; Luana Licata; Marta Iannuccelli; Luisa Castagnoli; Gianni Cesareni; Mike Tyers; Gerold Schneider; Fabio Rinaldi; Robert Leaman; Graciela Gonzalez; Sergio Matos; Sun Kim; W John Wilbur; Luis Rocha; Hagit Shatkay; Ashish V Tendulkar; Shashank Agarwal; Feifan Liu; Xinglong Wang; Rafal Rak; Keith Noto; Charles Elkan; Zhiyong Lu; Rezarta Islamaj Dogan; Jean-Fred Fontaine; Miguel A Andrade-Navarro; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2011-10-03       Impact factor: 3.169

3.  A comparative analysis of system features used in the TREC-COVID information retrieval challenge.

Authors:  Jimmy S Chen; William R Hersh
Journal:  J Biomed Inform       Date:  2021-04-06       Impact factor: 8.000

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.