Improving accuracy for identifying related PubMed queries by an integrated approach.

Zhiyong Lu, W John Wilbur.

Abstract

PubMed is the most widely used tool for searching biomedical literature online. As with many other online search tools, a user often types a series of related queries before retrieving satisfactory results for a single information need. It is also common for a user to type queries on unrelated topics in a single session. To study PubMed users' search strategies, it is therefore necessary to automatically separate unrelated queries and group related ones together. Here, we report a novel approach that combines lexical and contextual analyses to segment PubMed query sessions and identify related queries, and we compare its performance with the previous approach based solely on concept mapping. We evaluated our integrated approach on sample data consisting of 1539 pairs of consecutive user queries in 351 user sessions. The predictions for 1396 pairs agreed with the gold-standard annotations, an overall accuracy of 90.7%. This demonstrates that our approach is significantly better than the previously published method. By applying this approach to a one-day PubMed query log, we found that a significant proportion of information needs involved more than one PubMed query, and that most consecutive queries for the same information need are lexically related. Finally, the proposed PubMed distance is shown to be an accurate and meaningful measure of the contextual similarity between biological terms. As our experiments demonstrate, the integrated approach can play a critical role in handling real-world PubMed query log data.
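
To make the segmentation task concrete, here is a minimal sketch of the lexical half of the idea only. The abstract does not state the actual lexical criterion, so the shared-token test below is an assumption, and the function names (`tokens`, `lexically_related`, `segment_session`, `accuracy`) and the example queries are hypothetical. The contextual analysis based on the PubMed distance is not described in enough detail here to sketch, so it is left out. The last line only reproduces the reported accuracy arithmetic (1396 agreeing pairs out of 1539).

```python
def tokens(query: str) -> set[str]:
    """Lowercase a query and split it on whitespace."""
    return set(query.lower().split())


def lexically_related(q1: str, q2: str) -> bool:
    """Assumed criterion (not from the paper): the queries share a token."""
    return bool(tokens(q1) & tokens(q2))


def segment_session(queries: list[str]) -> list[list[str]]:
    """Group consecutive queries; start a new group at each unrelated pair."""
    segments: list[list[str]] = []
    for query in queries:
        if segments and lexically_related(segments[-1][-1], query):
            segments[-1].append(query)
        else:
            segments.append([query])
    return segments


def accuracy(agreeing_pairs: int, total_pairs: int) -> float:
    """Fraction of query pairs whose prediction matches the gold standard."""
    return agreeing_pairs / total_pairs


# Hypothetical session: two related queries followed by an unrelated one.
session = ["copii coat", "copii sec31", "breast cancer therapy"]
print(segment_session(session))
# Arithmetic behind the reported figure: 1396 of 1539 pairs agreed.
print(f"{accuracy(1396, 1539):.1%}")  # prints 90.7%
```

A purely lexical test like this would miss related queries that share no words, which is the gap the contextual analysis in the integrated approach is meant to cover.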

Year:  2008        PMID: 19162232      PMCID: PMC2764279          DOI: 10.1016/j.jbi.2008.12.006

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317

