Literature DB >> 32308868

A High Recall Classifier for Selecting Articles for MEDLINE Indexing.

Alastair R Rae1, Max E Savery1, James G Mork1, Dina Demner-Fushman1.   

Abstract

MEDLINE is the National Library of Medicine's premier bibliographic database for biomedical literature. A highly valuable feature of the database is that each record is manually indexed with a controlled vocabulary called MeSH. Most MEDLINE journals are indexed cover-to-cover, but there are about 200 selectively indexed journals for which only articles related to biomedicine and life sciences are indexed. In recent years, the selection process has become an increasing burden for indexing staff, and this paper presents a machine learning based system that offers very significant time savings by semi-automating the task. At the core of the system is a high recall classifier for the identification of journal articles that are in-scope for MEDLINE. The system is shown to reduce the number of articles requiring manual review by 54%, equivalent to approximately 40,000 articles per year. ©2019 AMIA - All rights reserved.

Entities:  

Year:  2020        PMID: 32308868      PMCID: PMC7153058     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  8 in total

1.  Journal descriptor indexing tool for categorizing text according to discipline or semantic type.

Authors:  Susanne M Humphrey; Chris J Lu; Willie J Rogers; Allen C Browne
Journal:  AMIA Annu Symp Proc       Date:  2006

2.  Towards automatic recognition of scientifically rigorous clinical research evidence.

Authors:  Halil Kilicoglu; Dina Demner-Fushman; Thomas C Rindflesch; Nancy L Wilczynski; R Brian Haynes
Journal:  J Am Med Inform Assoc       Date:  2008-10-24       Impact factor: 4.497

3.  Automatic identification of recent high impact clinical articles in PubMed to support clinical decision making using time-agnostic features.

Authors:  Jiantao Bian; Samir Abdelrahman; Jianlin Shi; Guilherme Del Fiol
Journal:  J Biomed Inform       Date:  2018-11-22       Impact factor: 6.317

4.  The TREC 2004 genomics track categorization task: classifying full text biomedical documents.

Authors:  Aaron M Cohen; William R Hersh
Journal:  J Biomed Discov Collab       Date:  2006-03-14

5.  DeepMeSH: deep semantic representation for improving large-scale MeSH indexing.

Authors:  Shengwen Peng; Ronghui You; Hongning Wang; Chengxiang Zhai; Hiroshi Mamitsuka; Shanfeng Zhu
Journal:  Bioinformatics       Date:  2016-06-15       Impact factor: 6.937

6.  12 years on - Is the NLM medical text indexer still useful and relevant?

Authors:  James Mork; Alan Aronson; Dina Demner-Fushman
Journal:  J Biomed Semantics       Date:  2017-02-23

7.  A Deep Learning Method to Automatically Identify Reports of Scientifically Rigorous Clinical Research from the Biomedical Literature: Comparative Analytic Study.

Authors:  Guilherme Del Fiol; Matthew Michelson; Alfonso Iorio; Chris Cotoi; R Brian Haynes
Journal:  J Med Internet Res       Date:  2018-06-25       Impact factor: 5.428

8.  Collaborative biocuration--text-mining development task for document prioritization for curation.

Authors:  Thomas C Wiegers; Allan Peter Davis; Carolyn J Mattingly
Journal:  Database (Oxford)       Date:  2012-11-22       Impact factor: 3.451

  8 in total
  1 in total

1.  Automatic MeSH Indexing: Revisiting the Subheading Attachment Problem.

Authors:  Alastair R Rae; David O Pritchard; James G Mork; Dina Demner-Fushman
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.