
Convolutional Neural Networks for Biomedical Text Classification: Application in Indexing Biomedical Articles.

Anthony Rios, Ramakanth Kavuluru.

Abstract

Building high-accuracy text classifiers is an important task in biomedicine, given the wealth of information hidden in unstructured narratives such as research articles and clinical documents. Because of large feature spaces, text classification has traditionally relied on discriminative approaches such as logistic regression and support vector machines with n-gram and semantic features (e.g., named entities), with additional performance gains typically coming from feature selection and ensemble approaches. In this paper, we demonstrate that a more direct approach using convolutional neural networks (CNNs) outperforms several traditional approaches in biomedical text classification, with the specific use case of assigning Medical Subject Headings (MeSH terms) to biomedical articles. Trained annotators at the National Library of Medicine (NLM) assign on average 13 codes to each biomedical article, thus semantically indexing scientific literature to support NLM's PubMed search system. Recent evidence suggests that effective automated efforts for MeSH term assignment start with binary classifiers for each term. We use CNNs to build binary text classifiers and achieve an absolute improvement of over 3% in macro F-score on a set of selected hard-to-classify MeSH terms when compared with the best prior results on a public dataset. Additional experiments on 50 high-frequency terms in the dataset also show improvements with CNNs. Our results indicate the strong potential of CNNs in biomedical text classification tasks.
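
The abstract reports results as a macro F-score over per-term binary classifiers. As a rough illustration of that metric only (not the authors' evaluation code), the sketch below computes F1 separately for each MeSH term and then takes the unweighted mean. The function names, the toy term names, and the 0/1 label lists are all hypothetical, and scoring a term with no true positives as 0.0 is an assumed convention.

```python
from typing import Dict, List, Tuple


def binary_f1(gold: List[int], pred: List[int]) -> float:
    """F1 for one binary classifier; labels are 0 (absent) or 1 (present)."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        # Assumed convention: no true positives means F1 is 0.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(per_term: Dict[str, Tuple[List[int], List[int]]]) -> float:
    """Unweighted mean of per-term F1, so rare terms count as much as common ones."""
    scores = [binary_f1(gold, pred) for gold, pred in per_term.values()]
    return sum(scores) / len(scores)


# Hypothetical toy data: (gold labels, predicted labels) for four articles.
toy = {
    "Humans": ([1, 1, 0, 1], [1, 1, 0, 0]),
    "Mice": ([0, 1, 0, 0], [0, 1, 1, 0]),
}

if __name__ == "__main__":
    print(round(macro_f1(toy), 4))
```

Because every term contributes equally to the mean, gains on hard-to-classify terms move the macro score as much as gains on frequent ones.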

Keywords:  convolutional neural networks; medical subject headings; text classification

Year:  2015        PMID: 28736769      PMCID: PMC5521984          DOI: 10.1145/2808719.2808746

Source DB:  PubMed          Journal:  ACM BCB

