
Distant supervision for treatment relation extraction by leveraging MeSH subheadings.

Tung Tran, Ramakanth Kavuluru.

Abstract

The growing body of knowledge in biomedicine is too vast for human consumption. Hence, there is a need for automated systems able to navigate and distill the emerging wealth of information. One fundamental task to that end is relation extraction, whereby linguistic expressions of semantic relationships between biomedical entities are recognized and extracted. In this study, we propose a novel distant supervision approach for extracting binary treatment relationships, in which high-quality positive and negative training examples are generated from PubMed abstracts by leveraging their associated MeSH subheadings. The quality of the generated examples is assessed based on the quality of the supervised models they induce; that is, the mean performance of trained models (derived via bootstrapped ensembling) on a gold standard test set is used as a proxy for data quality. We show that our approach is preferable to traditional distant supervision for treatment relations and is closer to human crowd annotations in terms of annotation quality. For treatment relations, models trained on our generated data score 81.38% on the model-wide PR-AUC metric, compared to 64.33% for traditional distant supervision and 90.57% for crowd-sourced annotations. We also demonstrate that examples generated using our method can be used to augment crowd-sourced datasets. Augmented models improve over non-augmented models by more than two absolute points on the more established F1 metric. Finally, we demonstrate that performance can be further improved by implementing a classification loss that is resistant to label noise.
Copyright © 2019 Elsevier B.V. All rights reserved.
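
The evaluation protocol described in the abstract (mean test-set performance of models trained on bootstrapped samples, used as a proxy for data quality) can be illustrated with a minimal sketch. The abstract does not specify the exact procedure, so everything here is an assumption: bootstrapping is taken to mean resampling the training examples with replacement, PR-AUC is estimated as average precision, and the names `average_precision`, `bootstrap_sample`, `mean_pr_auc`, and `train_fn` are hypothetical rather than taken from the authors' work.

```python
import random


def average_precision(labels, scores):
    """Estimate PR-AUC as average precision over a ranked list.

    labels: list of 0/1 gold labels; scores: model confidence per example.
    """
    total_pos = sum(labels)
    if total_pos == 0:
        raise ValueError("need at least one positive label")
    # Rank examples from most to least confident.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    ap = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            ap += hits / rank  # precision at this recall point
    return ap / total_pos


def bootstrap_sample(examples, rng):
    """Resample the training set with replacement (assumed bootstrap scheme)."""
    return [rng.choice(examples) for _ in examples]


def mean_pr_auc(train_examples, test_examples, train_fn, n_models=10, seed=0):
    """Average PR-AUC of n_models models, each trained on a bootstrap sample.

    train_fn is a hypothetical callable: it takes a list of (input, label)
    pairs and returns a scoring function mapping an input to a confidence.
    """
    rng = random.Random(seed)
    test_labels = [label for _, label in test_examples]
    results = []
    for _ in range(n_models):
        model = train_fn(bootstrap_sample(train_examples, rng))
        scores = [model(x) for x, _ in test_examples]
        results.append(average_precision(test_labels, scores))
    return sum(results) / len(results)
```

Under these assumptions, two training sets (for example, one generated by distant supervision and one crowd-sourced) would be compared by calling `mean_pr_auc` on each with the same gold standard test set and the same `train_fn`.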


Keywords:  Distant supervision; MeSH subheadings; Medical treatment relation; Relation extraction

Year:  2019        PMID: 31521249      PMCID: PMC6748648          DOI: 10.1016/j.artmed.2019.06.002

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


