Literature DB >> 26958205

Automatic Assignment of Non-Leaf MeSH Terms to Biomedical Articles.

Ramakanth Kavuluru1, Anthony Rios2.   

Abstract

Assigning labels from a hierarchical vocabulary is a well known special case of multi-label classification, often modeled to maximize micro F1-score. However, building accurate binary classifiers for poorly performing labels in the hierarchy can improve both micro and macro F1-scores. In this paper, we propose and evaluate classification strategies involving descendant node instances to build better binary classifiers for non-leaf labels with the use-case of assigning Medical Subject Headings (MeSH) to biomedical articles. Librarians at the National Library of Medicine tag each biomedical article to be indexed by their PubMed information system with terms from the MeSH terminology, a biomedical conceptual hierarchy with over 27,000 terms. Human indexers look at each article's full text to assign a set of most suitable MeSH terms for indexing it. Several recent automated attempts focused on using the article title and abstract text to identify MeSH terms for the corresponding article. Despite these attempts, it is observed that assigning MeSH terms corresponding to certain non-leaf nodes of the MeSH hierarchy is particularly challenging. Non-leaf nodes are very important as they constitute one third of the total number of MeSH terms. Here, we demonstrate the effectiveness of exploiting training examples of descendant terms of non-leaf nodes in improving the performance of conventional classifiers for the corresponding non-leaf MeSH terms. Specifically, we focus on reducing the false positives (FPs) caused due to descendant instances in traditional classifiers. Our methods are able to achieve a relative improvement of 7.5% in macro-F1 score while also increasing the micro-F1 score by 1.6% for a set of 500 non-leaf terms in the MeSH hierarchy. These results strongly indicate the critical role of incorporating hierarchical information in MeSH term prediction. To our knowledge, our effort is the first to demonstrate the role of hierarchical information in improving binary classifiers for non-leaf MeSH terms.

Entities:  

Mesh:

Year:  2015        PMID: 26958205      PMCID: PMC4765689     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  10 in total

1.  The NLM Indexing Initiative.

Authors:  A R Aronson; O Bodenreider; H F Chang; S M Humphrey; J G Mork; S J Nelson; T C Rindflesch; W J Wilbur
Journal:  Proc AMIA Symp       Date:  2000

2.  Adjusting the outputs of a classifier to new a priori probabilities: a simple procedure.

Authors:  Marco Saerens; Patrice Latinne; Christine Decaestecker
Journal:  Neural Comput       Date:  2002-01       Impact factor: 2.026

3.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

4.  Reflective random indexing for semi-automatic indexing of the biomedical literature.

Authors:  Vidya Vasuki; Trevor Cohen
Journal:  J Biomed Inform       Date:  2010-04-09       Impact factor: 6.317

5.  Semi-automatic indexing of full text biomedical articles.

Authors:  Clifford W Gay; Mehmet Kayaalp; Alan R Aronson
Journal:  AMIA Annu Symp Proc       Date:  2005

6.  The effect of feature representation on MEDLINE document classification.

Authors:  Meliha Yetisgen-Yildiz; Wanda Pratt
Journal:  AMIA Annu Symp Proc       Date:  2005

7.  Optimal training sets for Bayesian prediction of MeSH assignment.

Authors:  Sunghwan Sohn; Won Kim; Donald C Comeau; W John Wilbur
Journal:  J Am Med Inform Assoc       Date:  2008-04-24       Impact factor: 4.497

8.  Stochastic Gradient Descent and the Prediction of MeSH for PubMed Records.

Authors:  W John Wilbur; Won Kim
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

9.  Indexing consistency in MEDLINE.

Authors:  M E Funk; C A Reid
Journal:  Bull Med Libr Assoc       Date:  1983-04

10.  Recommending MeSH terms for annotating biomedical articles.

Authors:  Minlie Huang; Aurélie Névéol; Zhiyong Lu
Journal:  J Am Med Inform Assoc       Date:  2011-05-25       Impact factor: 4.497

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.