Stephen B Johnson1, Michael E Bales2, Daniel Dine3, Suzanne Bakken3, Paul J Albert4, Chunhua Weng3. 1. Department of Public Health, Weill Cornell Medical College, New York, United States. Electronic address: johnsos@med.cornell.edu. 2. Department of Biomedical Informatics, Columbia University, New York, United States. 3. Department of Biomedical Informatics, Columbia University, New York, United States; The Irving Institute for Clinical and Translational Research, Columbia University, New York, United States. 4. Samuel J. Wood Library, Weill Cornell Medical College, New York, United States.
Abstract
OBJECTIVE: Publications are a key data source for investigator profiles and research networking systems. We developed ReCiter, an algorithm that automatically extracts bibliographies from PubMed using institutional information about the target investigators.
METHODS: ReCiter executes a broad query against PubMed, groups the results into clusters that appear to constitute distinct author identities, and selects the cluster that best matches the target investigator. Using information about investigators from one of our institutions, we compared ReCiter results to queries based on author name and institution and to citations extracted manually from the Scopus database. Five judges created a gold standard using citations of a random sample of 200 investigators.
RESULTS: About half of the 10,471 potential investigators had no matching citations in PubMed, and about 45% had fewer than 70 citations. Interrater agreement (Fleiss' kappa) for the gold standard was 0.81. Scopus achieved the best recall (sensitivity) of 0.81, while name-based queries had 0.78 and ReCiter had 0.69. ReCiter attained the best precision (positive predictive value) of 0.93, while Scopus had 0.85 and name-based queries had 0.31.
DISCUSSION: ReCiter accesses the most current citation data, uses limited computational resources, and minimizes manual entry by investigators. Generation of bibliographies using name-based queries will not yield high accuracy. Proprietary databases can perform well but require manual effort. Automated generation with higher recall is possible but requires additional knowledge about investigators.
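The cluster-and-select strategy described in the METHODS section can be illustrated with a minimal sketch. This is not ReCiter's actual implementation; the `Citation` structure, the affiliation-overlap clustering rule, and the scoring function are simplified assumptions used only to show the two-stage workflow: group candidate citations into apparent author identities, then pick the cluster that best matches the target investigator's known institutional information.

```python
from dataclasses import dataclass

@dataclass
class Citation:
    """A candidate PubMed citation reduced to the fields used here (hypothetical)."""
    authors: list
    affiliations: list

def cluster_by_affiliation(citations):
    """Group citations into clusters that share at least one affiliation
    token -- a crude stand-in for identity clustering."""
    clusters = []
    for cit in citations:
        for cluster in clusters:
            if any(set(cit.affiliations) & set(c.affiliations) for c in cluster):
                cluster.append(cit)
                break
        else:
            clusters.append([cit])
    return clusters

def select_cluster(clusters, target_affiliations):
    """Pick the cluster whose pooled affiliations best overlap the
    target investigator's known institutions."""
    def score(cluster):
        tokens = set().union(*(set(c.affiliations) for c in cluster))
        return len(tokens & set(target_affiliations))
    return max(clusters, key=score)

# Toy example: two citations from Cornell, one from elsewhere.
candidates = [
    Citation(["Smith J"], ["Weill Cornell"]),
    Citation(["Smith J"], ["Weill Cornell", "New York"]),
    Citation(["Smith J"], ["Stanford"]),
]
clusters = cluster_by_affiliation(candidates)
best = select_cluster(clusters, ["Weill Cornell"])
```

In this toy run, the first two citations fall into one cluster and the Stanford citation into another; the Cornell cluster wins the selection step. A real system would cluster on richer evidence (coauthors, MeSH terms, journals) rather than affiliation strings alone.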
Authors: Vivienne J Zhu; Marc J Overhage; James Egg; Stephen M Downs; Shaun J Grannis Journal: J Am Med Inform Assoc Date: 2009-06-30 Impact factor: 4.497
Authors: Allison B McCoy; Adam Wright; Michael G Kahn; Jason S Shapiro; Elmer Victor Bernstam; Dean F Sittig Journal: BMJ Qual Saf Date: 2013-01-29 Impact factor: 7.035
Authors: Michael E Bales; Stephen B Johnson; Jonathan W Keeling; Kathleen M Carley; Frank Kunkel; Jacqueline A Merrill Journal: Am J Prev Med Date: 2011-07 Impact factor: 5.043
Authors: Michael E Bales; Daniel C Dine; Jacqueline A Merrill; Stephen B Johnson; Suzanne Bakken; Chunhua Weng Journal: J Biomed Inform Date: 2014-07-19 Impact factor: 6.317
Authors: Stephen B Johnson; Glen Whitney; Matthew McAuliffe; Hailong Wang; Evan McCreedy; Leon Rozenblit; Clark C Evans Journal: J Am Med Inform Assoc Date: 2010 Nov-Dec Impact factor: 4.497
Authors: Paul J Albert; Sarbajit Dutta; Jie Lin; Zimeng Zhu; Michael Bales; Stephen B Johnson; Mohammad Mansour; Drew Wright; Terrie R Wheeler; Curtis L Cole Journal: PLoS One Date: 2021-04-01 Impact factor: 3.240
Authors: Karen Elizabeth Gutzman; Michael E Bales; Christopher W Belter; Thane Chambers; Liza Chan; Kristi L Holmes; Ya-Ling Lu; Lisa A Palmer; Rebecca C Reznik-Zellen; Cathy C Sarli; Amy M Suiter; Terrie R Wheeler Journal: J Med Libr Assoc Date: 2018-01-02
Authors: Armen Yuri Gasparyan; Bekaidar Nurmashev; Marlen Yessirkepov; Dmitry A Endovitskiy; Alexander A Voronov; George D Kitas Journal: J Korean Med Sci Date: 2017-11 Impact factor: 2.153