
A probabilistic similarity metric for Medline records: a model for author name disambiguation.

Vetle I Torvik, Marc Weeber, Don R Swanson, Neil R Smalheiser.

Abstract

We present a model for automatically generating training sets and estimating the probability that a pair of Medline records sharing a last name and first-name initial were authored by the same individual. The estimate is based on shared title words, journal name, co-authors, medical subject headings, language, and affiliation, as well as distinctive features of the name itself (i.e., presence of a middle initial, a suffix, and the name's prevalence in Medline).
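As a rough illustration of the pairwise comparison the abstract describes, the sketch below builds a feature profile for two records. It is only a sketch under stated assumptions: the `Record` class, its field names, and `comparison_profile` are hypothetical and do not come from the paper. Name prevalence and the step that turns a profile into a probability are omitted because the abstract does not specify them.

```python
from dataclasses import dataclass, field


@dataclass
class Record:
    """Hypothetical, simplified stand-in for a Medline record."""
    title_words: set = field(default_factory=set)
    journal: str = ""
    coauthors: set = field(default_factory=set)
    mesh: set = field(default_factory=set)
    language: str = ""
    affiliation_words: set = field(default_factory=set)
    middle_initial: str = ""  # empty string when absent
    suffix: str = ""          # e.g. "Jr"; empty string when absent


def comparison_profile(a: Record, b: Record) -> dict:
    """Compare two records on the features listed in the abstract.

    Counts are used for set-valued fields; booleans for single-valued
    fields, which only match when both records actually have a value.
    """
    return {
        "shared_title_words": len(a.title_words & b.title_words),
        "same_journal": a.journal != "" and a.journal == b.journal,
        "shared_coauthors": len(a.coauthors & b.coauthors),
        "shared_mesh": len(a.mesh & b.mesh),
        "same_language": a.language != "" and a.language == b.language,
        "shared_affiliation_words": len(a.affiliation_words & b.affiliation_words),
        "middle_initial_match": a.middle_initial != ""
        and a.middle_initial == b.middle_initial,
        "suffix_match": a.suffix != "" and a.suffix == b.suffix,
    }
```

A profile like this would be the input to whatever probability estimate is fitted on the automatically generated training sets; how that estimate is computed is not stated here.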

Year:  2003        PMID: 14728536      PMCID: PMC1480109     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  26 in total

1.  E-mail decay rates among corresponding authors in MEDLINE. The ability to communicate with and request materials from authors is being eroded by the expiration of e-mail addresses.

Authors:  Jonathan D Wren; Joe E Grissom; Tyrrell Conway
Journal:  EMBO Rep       Date:  2006-02       Impact factor: 8.807

2.  [To be or not to be in the picture: the individual assessment of scientific performance].

Authors:  Raúl Isaac Méndez Vásquez
Journal:  Aten Primaria       Date:  2009-02-03       Impact factor: 1.137

3.  MeSH term explosion and author rank improve expert recommendations.

Authors:  Danielle H Lee; Titus Schleyer
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

4.  Design of a generic, open platform for machine learning-assisted indexing and clustering of articles in PubMed, a biomedical bibliographic database.

Authors:  Neil R Smalheiser; Aaron M Cohen
Journal:  Data Inf Manag       Date:  2018-05-22

5.  Choosing experiments to accelerate collective discovery.

Authors:  Andrey Rzhetsky; Jacob G Foster; Ian T Foster; James A Evans
Journal:  Proc Natl Acad Sci U S A       Date:  2015-11-09       Impact factor: 11.205

6.  Unsupervised low-dimensional vector representations for words, phrases and text that are transparent, scalable, and produce similarity metrics that are not redundant with neural embeddings.

Authors:  Neil R Smalheiser; Aaron M Cohen; Gary Bonifield
Journal:  J Biomed Inform       Date:  2019-01-14       Impact factor: 6.317

7.  The effects of diversity and network ties on innovations: The emergence of a new scientific field.

Authors:  Alina Lungeanu; Noshir S Contractor
Journal:  Am Behav Sci       Date:  2014-11-14

8.  Arrowsmith two-node search interface: a tutorial on finding meaningful links between two disparate sets of articles in MEDLINE.

Authors:  Neil R Smalheiser; Vetle I Torvik; Wei Zhou
Journal:  Comput Methods Programs Biomed       Date:  2009-01-30       Impact factor: 5.428

9.  Author Name Disambiguation in MEDLINE.

Authors:  Vetle I Torvik; Neil R Smalheiser
Journal:  ACM Trans Knowl Discov Data       Date:  2009-07-01       Impact factor: 2.713

10.  New linked data on research investments: scientific workforce, productivity, and public value.

Authors:  Julia Lane; Jason Owen-Smith; Rebecca Rosen; Bruce Weinberg
Journal:  Res Policy       Date:  2015-02-16
