
Automatic generation of investigator bibliographies for institutional research networking systems.

Stephen B Johnson, Michael E Bales, Daniel Dine, Suzanne Bakken, Paul J Albert, Chunhua Weng.

Abstract

OBJECTIVE: Publications are a key data source for investigator profiles and research networking systems. We developed ReCiter, an algorithm that automatically extracts bibliographies from PubMed using institutional information about the target investigators.
METHODS: ReCiter executes a broad query against PubMed, groups the results into clusters that appear to constitute distinct author identities and selects the cluster that best matches the target investigator. Using information about investigators from one of our institutions, we compared ReCiter results to queries based on author name and institution and to citations extracted manually from the Scopus database. Five judges created a gold standard using citations of a random sample of 200 investigators.
RESULTS: About half of the 10,471 potential investigators had no matching citations in PubMed, and about 45% had fewer than 70 citations. Interrater agreement (Fleiss' kappa) for the gold standard was 0.81. Scopus achieved the best recall (sensitivity) of 0.81, while name-based queries had 0.78 and ReCiter had 0.69. ReCiter attained the best precision (positive predictive value) of 0.93, while Scopus had 0.85 and name-based queries had 0.31.
DISCUSSION: ReCiter accesses the most current citation data, uses limited computational resources and minimizes manual entry by investigators. Generation of bibliographies using name-based queries will not yield high accuracy. Proprietary databases can perform well but require manual effort. Automated generation with higher recall is possible but requires additional knowledge about investigators.
Copyright © 2014 Elsevier Inc. All rights reserved.
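
The cluster-then-select approach described in METHODS, and the precision and recall figures reported in RESULTS, can be illustrated with a minimal Python sketch. This is not ReCiter's implementation. The abstract does not say which features are used for clustering or selection, so grouping by shared coauthors and scoring by affiliation or known coauthors are assumptions made for illustration. The `Citation` type, the function names and the toy records are all hypothetical, and no PubMed query is performed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Citation:
    """Hypothetical, simplified citation record (not a PubMed schema)."""
    pmid: str
    coauthors: frozenset
    affiliation: str = ""


def cluster_citations(citations):
    """Group citations that share a coauthor (single-link, via union-find).

    Assumption: shared coauthors stand in for whatever similarity
    signal a real system would use.
    """
    parent = list(range(len(citations)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(citations)):
        for j in range(i + 1, len(citations)):
            if citations[i].coauthors & citations[j].coauthors:
                parent[find(i)] = find(j)

    groups = {}
    for i, citation in enumerate(citations):
        groups.setdefault(find(i), []).append(citation)
    return list(groups.values())


def score_cluster(cluster, institution, known_coauthors):
    """Count citations that match the (assumed) institutional hints."""
    return sum(
        1
        for c in cluster
        if institution.lower() in c.affiliation.lower()
        or c.coauthors & known_coauthors
    )


def select_cluster(clusters, institution, known_coauthors):
    """Return the best-scoring cluster, or an empty list if there are none."""
    if not clusters:
        return []
    return max(
        clusters,
        key=lambda cl: score_cluster(cl, institution, known_coauthors),
    )


def precision_recall(predicted, gold):
    """Precision and recall of a predicted PMID set against a gold standard."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Toy candidates, as if returned by one broad name query (invented data).
candidates = [
    Citation("1", frozenset({"Lee A", "Kim B"}), "Example University"),
    Citation("2", frozenset({"Kim B", "Cho D"})),
    Citation("3", frozenset({"Nguyen T"}), "Other Institute"),
    Citation("4", frozenset({"Nguyen T", "Patel R"}), "Other Institute"),
]

clusters = cluster_citations(candidates)
chosen = select_cluster(clusters, "Example University", frozenset({"Cho D"}))
chosen_pmids = {c.pmid for c in chosen}
precision, recall = precision_recall(chosen_pmids, gold={"1", "2", "5"})
```

In this toy run the selected cluster contains only correct citations but misses one gold-standard citation that the query never grouped with it, which is the same direction of trade-off the abstract reports: high precision with lower recall.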

Keywords:  Authorship; Automated; Bibliography as topic; MEDLINE; Natural language processing; Pattern recognition

Year:  2014        PMID: 24694772      PMCID: PMC4180817          DOI: 10.1016/j.jbi.2014.03.013

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  10 in total

1.  Use of a MeSH-based index of faculty research interests to identify faculty publications: an IAIMSian study of precision, recall, and data reusability.

Authors:  K Ann McKibbon; Patricia W Friedman; Charles P Friedman
Journal:  Proc AMIA Symp       Date:  2002

2.  A probabilistic similarity metric for Medline records: a model for author name disambiguation.

Authors:  Vetle I Torvik; Marc Weeber; Don R Swanson; Neil R Smalheiser
Journal:  AMIA Annu Symp Proc       Date:  2003

3.  Credit where credit is due.

Authors: 
Journal:  Nature       Date:  2009-12-17       Impact factor: 49.962

4.  Good news on the horizon: the Open Researcher and Contributor ID (ORCID).

Authors:  Errol C Friedberg
Journal:  DNA Repair (Amst)       Date:  2010-01-18

5.  An empiric modification to the probabilistic record linkage algorithm using frequency-based weight scaling.

Authors:  Vivienne J Zhu; Marc J Overhage; James Egg; Stephen M Downs; Shaun J Grannis
Journal:  J Am Med Inform Assoc       Date:  2009-06-30       Impact factor: 4.497

6.  Matching identifiers in electronic health records: implications for duplicate records and patient safety.

Authors:  Allison B McCoy; Adam Wright; Michael G Kahn; Jason S Shapiro; Elmer Victor Bernstam; Dean F Sittig
Journal:  BMJ Qual Saf       Date:  2013-01-29       Impact factor: 7.035

7.  Evolution of coauthorship in public health services and systems research.

Authors:  Michael E Bales; Stephen B Johnson; Jonathan W Keeling; Kathleen M Carley; Frank Kunkel; Jacqueline A Merrill
Journal:  Am J Prev Med       Date:  2011-07       Impact factor: 5.043

8.  Associating co-authorship patterns with publications in high-impact journals.

Authors:  Michael E Bales; Daniel C Dine; Jacqueline A Merrill; Stephen B Johnson; Suzanne Bakken; Chunhua Weng
Journal:  J Biomed Inform       Date:  2014-07-19       Impact factor: 6.317

9.  Using global unique identifiers to link autism collections.

Authors:  Stephen B Johnson; Glen Whitney; Matthew McAuliffe; Hailong Wang; Evan McCreedy; Leon Rozenblit; Clark C Evans
Journal:  J Am Med Inform Assoc       Date:  2010 Nov-Dec       Impact factor: 4.497

10.  Author Name Disambiguation in MEDLINE.

Authors:  Vetle I Torvik; Neil R Smalheiser
Journal:  ACM Trans Knowl Discov Data       Date:  2009-07-01       Impact factor: 2.713

  5 in total

1.  Associating co-authorship patterns with publications in high-impact journals.

Authors:  Michael E Bales; Daniel C Dine; Jacqueline A Merrill; Stephen B Johnson; Suzanne Bakken; Chunhua Weng
Journal:  J Biomed Inform       Date:  2014-07-19       Impact factor: 6.317

2.  Dynamically generating T32 training documents using structured data.

Authors:  Paul James Albert; Ayesha Joshi
Journal:  J Med Libr Assoc       Date:  2019-07-01

3.  ReCiter: An open source, identity-driven, authorship prediction algorithm optimized for academic institutions.

Authors:  Paul J Albert; Sarbajit Dutta; Jie Lin; Zimeng Zhu; Michael Bales; Stephen B Johnson; Mohammad Mansour; Drew Wright; Terrie R Wheeler; Curtis L Cole
Journal:  PLoS One       Date:  2021-04-01       Impact factor: 3.240

4.  Research evaluation support services in biomedical libraries.

Authors:  Karen Elizabeth Gutzman; Michael E Bales; Christopher W Belter; Thane Chambers; Liza Chan; Kristi L Holmes; Ya-Ling Lu; Lisa A Palmer; Rebecca C Reznik-Zellen; Cathy C Sarli; Amy M Suiter; Terrie R Wheeler
Journal:  J Med Libr Assoc       Date:  2018-01-02

5.  Researcher and Author Profiles: Opportunities, Advantages, and Limitations. (Review)

Authors:  Armen Yuri Gasparyan; Bekaidar Nurmashev; Marlen Yessirkepov; Dmitry A Endovitskiy; Alexander A Voronov; George D Kitas
Journal:  J Korean Med Sci       Date:  2017-11       Impact factor: 2.153
