Literature DB >> 22744958

Automatic discourse connective detection in biomedical text.

Balaji Polepalli Ramesh1, Rashmi Prasad, Tim Miller, Brian Harrington, Hong Yu.   

Abstract

OBJECTIVE: Relation extraction in biomedical text mining systems has largely focused on identifying clause-level relations, but increasing sophistication demands the recognition of relations at discourse level. A first step in identifying discourse relations involves the detection of discourse connectives: words or phrases used in text to express discourse relations. In this study supervised machine-learning approaches were developed and evaluated for automatically identifying discourse connectives in biomedical text.
MATERIALS AND METHODS: Two supervised machine-learning models (support vector machines and conditional random fields) were explored for identifying discourse connectives in biomedical literature. In-domain supervised machine-learning classifiers were trained on the Biomedical Discourse Relation Bank, an annotated corpus of discourse relations over 24 full-text biomedical articles (~112,000 word tokens), a subset of the GENIA corpus. Novel domain adaptation techniques were also explored to leverage the larger open-domain Penn Discourse Treebank (~1 million word tokens). The models were evaluated using the standard evaluation metrics of precision, recall and F1 scores. RESULTS AND
CONCLUSION: Supervised machine-learning approaches can automatically identify discourse connectives in biomedical text, and the novel domain adaptation techniques yielded the best performance: 0.761 F1 score. A demonstration version of the fully implemented classifier BioConn is available at: http://bioconn.askhermes.org.

Mesh:

Year:  2012        PMID: 22744958      PMCID: PMC3422833          DOI: 10.1136/amiajnl-2011-000775

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  15 in total

1.  GENIA corpus--semantically annotated corpus for bio-textmining.

Authors:  J-D Kim; T Ohta; Y Tateisi; J Tsujii
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

2.  Biomedical negation scope detection with conditional random fields.

Authors:  Shashank Agarwal; Hong Yu
Journal:  J Am Med Inform Assoc       Date:  2010 Nov-Dec       Impact factor: 4.497

3.  ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text.

Authors:  Burr Settles
Journal:  Bioinformatics       Date:  2005-04-28       Impact factor: 6.937

4.  BANNER: an executable survey of advances in biomedical named entity recognition.

Authors:  Robert Leaman; Graciela Gonzalez
Journal:  Pac Symp Biocomput       Date:  2008

5.  Building a semantically annotated corpus of clinical texts.

Authors:  Angus Roberts; Robert Gaizauskas; Mark Hepple; George Demetriou; Yikun Guo; Ian Roberts; Andrea Setzer
Journal:  J Biomed Inform       Date:  2009-01-23       Impact factor: 6.317

6.  Domain adaptation for semantic role labeling in the biomedical domain.

Authors:  Daniel Dahlmeier; Hwee Tou Ng
Journal:  Bioinformatics       Date:  2010-02-23       Impact factor: 6.937

7.  LINNAEUS: a species name identification system for biomedical literature.

Authors:  Martin Gerner; Goran Nenadic; Casey M Bergman
Journal:  BMC Bioinformatics       Date:  2010-02-11       Impact factor: 3.169

Review 8.  Coreference resolution: a review of general methodologies and applications in the clinical domain.

Authors:  Jiaping Zheng; Wendy W Chapman; Rebecca S Crowley; Guergana K Savova
Journal:  J Biomed Inform       Date:  2011-08-12       Impact factor: 6.317

9.  Automatically extracting cancer disease characteristics from pathology reports into a Disease Knowledge Representation Model.

Authors:  Anni Coden; Guergana Savova; Igor Sominsky; Michael Tanenblatt; James Masanz; Karin Schuler; James Cooper; Wei Guan; Piet C de Groen
Journal:  J Biomed Inform       Date:  2008-12-27       Impact factor: 6.317

10.  The textual characteristics of traditional and Open Access scientific journals are similar.

Authors:  Karin Verspoor; K Bretonnel Cohen; Lawrence Hunter
Journal:  BMC Bioinformatics       Date:  2009-06-15       Impact factor: 3.169

View more
  6 in total

1.  Electronic health records-driven phenotyping: challenges, recent advances, and perspectives.

Authors:  Jyotishman Pathak; Abel N Kho; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-12       Impact factor: 4.497

2.  Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation.

Authors:  Hee-Jin Lee; Yaoyun Zhang; Kirk Roberts; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

3.  Biomedical text mining for research rigor and integrity: tasks, challenges, directions.

Authors:  Halil Kilicoglu
Journal:  Brief Bioinform       Date:  2018-11-27       Impact factor: 11.622

4.  Semi-supervised learning of causal relations in biomedical scientific discourse.

Authors:  Claudiu Mihăilă; Sophia Ananiadou
Journal:  Biomed Eng Online       Date:  2014-12-11       Impact factor: 2.819

5.  Automatically Recognizing Medication and Adverse Event Information From Food and Drug Administration's Adverse Event Reporting System Narratives.

Authors:  Balaji Polepalli Ramesh; Steven M Belknap; Zuofeng Li; Nadya Frid; Dennis P West; Hong Yu
Journal:  JMIR Med Inform       Date:  2014-06-27

6.  Adapting Word Embeddings from Multiple Domains to Symptom Recognition from Psychiatric Notes.

Authors:  Yaoyun Zhang; Hee-Jin Li; Jingqi Wang; Trevor Cohen; Kirk Roberts; Hua Xu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2018-05-18
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.