Literature DB >> 21917645

Predicting biomedical document access as a function of past use.

J Caleb Goodwin1, Todd R Johnson, Trevor Cohen, Jorge R Herskovic, Elmer V Bernstam.   

Abstract

OBJECTIVE: To determine whether past access to biomedical documents can predict future document access.
MATERIALS AND METHODS: The authors used 394 days of query log (August 1, 2009 to August 29, 2010) from PubMed users in the Texas Medical Center, which is the largest medical center in the world. The authors evaluated two document access models based on the work of Anderson and Schooler. The first is based on how frequently a document was accessed. The second is based on both frequency and recency.
RESULTS: The model based only on frequency of past access was highly correlated with the empirical data (R²=0.932), whereas the model based on frequency and recency had a much lower correlation (R²=0.668). DISCUSSION: The frequency-only model accurately predicted whether a document will be accessed based on past use. Modeling accesses as a function of frequency requires storing only the number of accesses and the creation date for the document. This model requires low storage overheads and is computationally efficient, making it scalable to large corpora such as MEDLINE.
CONCLUSION: It is feasible to accurately model the probability of a document being accessed in the future based on past accesses.

Mesh:

Year:  2011        PMID: 21917645      PMCID: PMC3341785          DOI: 10.1136/amiajnl-2011-000325

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  8 in total

1.  Information overload hampers biology reforms.

Authors:  E Stokstad
Journal:  Science       Date:  2001-08-31       Impact factor: 47.728

2.  Information overload.

Authors:  Carina Dennis
Journal:  Nature       Date:  2002-05-02       Impact factor: 49.962

3.  The US National Library of Medicine in the 21st century: expanding collections, nontraditional formats, new audiences.

Authors:  Eve-Marie Lacroix; Robert Mehnert
Journal:  Health Info Libr J       Date:  2002-09

4.  On the impossibility of being expert.

Authors:  Alan G Fraser; Frank D Dunstan
Journal:  BMJ       Date:  2010-12-14

5.  Using citation data to improve retrieval from MEDLINE.

Authors:  Elmer V Bernstam; Jorge R Herskovic; Yindalon Aphinyanaphongs; Constantin F Aliferis; Madurai G Sriram; William R Hersh
Journal:  J Am Med Inform Assoc       Date:  2005-10-12       Impact factor: 4.497

6.  A day in the life of PubMed: analysis of a typical day's query log.

Authors:  Jorge R Herskovic; Len Y Tanaka; William Hersh; Elmer V Bernstam
Journal:  J Am Med Inform Assoc       Date:  2007-01-09       Impact factor: 4.497

7.  What clinical information do doctors need?

Authors:  R Smith
Journal:  BMJ       Date:  1996-10-26

8.  Understanding PubMed user search behavior through log analysis.

Authors:  Rezarta Islamaj Dogan; G Craig Murray; Aurélie Névéol; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2009-11-27       Impact factor: 3.451

  8 in total
  1 in total

1.  Predicting clicks of PubMed articles.

Authors:  Yuqing Mao; Zhiyong Lu
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.