
A comparison of citation metrics to machine learning filters for the identification of high quality MEDLINE documents.

Yindalon Aphinyanaphongs, Alexander Statnikov, Constantin F Aliferis.

Abstract

OBJECTIVE: The present study explores the discriminatory performance of existing and novel gold-standard-specific machine learning (GSS-ML) focused filter models (i.e., models built specifically for a retrieval task and the gold standard against which they are evaluated) and compares it to that of citation counts, impact factors, and non-specific machine learning (NS-ML) models (i.e., models built for a different task and/or a different gold standard).
DESIGN: Three gold standard corpora were constructed using the SSOAB bibliography, the ACPJ-cited treatment articles, and the ACPJ-cited etiology articles. Citation counts and impact factors were obtained for each article. Support vector machine models were used to classify the articles using combinations of content, impact factors, and citation counts as predictors.
MEASUREMENTS: Discriminatory performance was estimated using the area under the receiver operating characteristic curve and n-fold cross-validation.
RESULTS: For all three gold standards and tasks, GSS-ML filters outperformed citation count, impact factors, and NS-ML filters. Combinations of content with impact factor or citation count produced no or negligible improvements to the GSS machine learning filters.
CONCLUSIONS: These experiments provide evidence that when building information retrieval filters focused on a retrieval task and corresponding gold standard, the filter models have to be built specifically for this task and gold standard. Under those conditions, machine learning filters outperform standard citation metrics. Furthermore, citation counts and impact factors add marginal value to discriminatory performance. Previous research that claimed better performance of citation metrics than machine learning in one of the corpora examined here is attributed to using machine learning filters built for a different gold standard and task.
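
The area under the receiver operating characteristic curve named under MEASUREMENTS can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `auc`, the labels, the citation counts, and the filter scores are all hypothetical toy values chosen for illustration, and the n-fold cross-validation step is left out. The sketch uses the rank-sum identity, under which the AUC is the probability that a randomly chosen gold-standard article is scored above a randomly chosen non-gold-standard one.

```python
def auc(scores, labels):
    """Area under the ROC curve via the rank-sum (Mann-Whitney) identity.

    Returns the fraction of (positive, negative) pairs in which the positive
    item has the higher score, counting tied pairs as one half.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical toy data: 1 = article is in the gold standard, 0 = it is not.
labels = [1, 1, 1, 0, 0, 0]
citation_counts = [40, 3, 12, 25, 1, 0]          # made-up citation counts
filter_scores = [0.9, 0.7, 0.8, 0.4, 0.2, 0.1]   # made-up classifier outputs

# Any ranking signal can be scored the same way, which is what makes
# citation metrics and machine learning filters directly comparable.
auc_citations = auc(citation_counts, labels)
auc_filter = auc(filter_scores, labels)
print(f"citation count AUC: {auc_citations:.3f}")
print(f"filter AUC:         {auc_filter:.3f}")
```

Because the AUC depends only on how the articles are ordered, raw citation counts and classifier outputs can be compared without rescaling either one.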

Year:  2006        PMID: 16622165      PMCID: PMC1513679          DOI: 10.1197/jamia.M2031

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


