Literature DB >> 21347060

Identifying discourse connectives in biomedical text.

Balaji Polepalli Ramesh1, Hong Yu.   

Abstract

Discourse connectives are words or phrases that connect or relate two coherent sentences or phrases and indicate the presence of discourse relations. Automatic recognition of discourse connectives may benefit many natural language processing applications. In this pilot study, we report the development of the supervised machine-learning classifiers with conditional random fields (CRFs) for automatically identifying discourse connectives in full-text biomedical articles. Our first classifier was trained on the open-domain 1 million token Penn Discourse Tree Bank (PDTB). We performed cross validation on biomedical articles (approximately 100K word tokens) that we annotated. The results show that the classifier trained on PDTB data attained a 0.55 F1-score for identifying discourse connectives in biomedical text, while the cross-validation results in the biomedical text attained a 0.69 F1-score, a much better performance despite a much smaller training size. Our preliminary analysis suggests the existence of domain-specific features, and we speculate that domain-adaption approaches may further improve performance.

Mesh:

Year:  2010        PMID: 21347060      PMCID: PMC3041460     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  8 in total

1.  GENIA corpus--semantically annotated corpus for bio-textmining.

Authors:  J-D Kim; T Ohta; Y Tateisi; J Tsujii
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

2.  Biomedical negation scope detection with conditional random fields.

Authors:  Shashank Agarwal; Hong Yu
Journal:  J Am Med Inform Assoc       Date:  2010 Nov-Dec       Impact factor: 4.497

3.  ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text.

Authors:  Burr Settles
Journal:  Bioinformatics       Date:  2005-04-28       Impact factor: 6.937

4.  Detecting hedge cues and their scope in biomedical text with conditional random fields.

Authors:  Shashank Agarwal; Hong Yu
Journal:  J Biomed Inform       Date:  2010-08-13       Impact factor: 6.317

5.  Second i2b2 workshop on natural language processing challenges for clinical records.

Authors:  Ozlem Uzuner
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

6.  The biomedical discourse relation bank.

Authors:  Rashmi Prasad; Susan McRoy; Nadya Frid; Aravind Joshi; Hong Yu
Journal:  BMC Bioinformatics       Date:  2011-05-23       Impact factor: 3.169

7.  New directions in biomedical text annotation: definitions, guidelines and corpus construction.

Authors:  W John Wilbur; Andrey Rzhetsky; Hagit Shatkay
Journal:  BMC Bioinformatics       Date:  2006-07-25       Impact factor: 3.169

8.  Multi-dimensional classification of biomedical text: toward automated, practical provision of high-utility text to diverse users.

Authors:  Hagit Shatkay; Fengxia Pan; Andrey Rzhetsky; W John Wilbur
Journal:  Bioinformatics       Date:  2008-08-20       Impact factor: 6.937

  8 in total
  3 in total

1.  Automatic discourse connective detection in biomedical text.

Authors:  Balaji Polepalli Ramesh; Rashmi Prasad; Tim Miller; Brian Harrington; Hong Yu
Journal:  J Am Med Inform Assoc       Date:  2012-06-28       Impact factor: 4.497

2.  Detecting causality from online psychiatric texts using inter-sentential language patterns.

Authors:  Jheng-Long Wu; Liang-Chih Yu; Pei-Chann Chang
Journal:  BMC Med Inform Decis Mak       Date:  2012-07-18       Impact factor: 2.796

3.  The biomedical discourse relation bank.

Authors:  Rashmi Prasad; Susan McRoy; Nadya Frid; Aravind Joshi; Hong Yu
Journal:  BMC Bioinformatics       Date:  2011-05-23       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.