Literature DB >> 26201352

The contribution of co-reference resolution to supervised relation detection between bacteria and biotopes entities.

Thomas Lavergne, Cyril Grouin, Pierre Zweigenbaum.   

Abstract

BACKGROUND: The acquisition of knowledge about relations between bacteria and their locations (habitats and geographical locations) in short texts about bacteria, as defined in the BioNLP-ST 2013 Bacteria Biotope task, depends on the detection of co-reference links between mentions of entities of each of these three types. To our knowledge, no participant in this task has investigated this aspect of the situation. The present work specifically addresses issues raised by this situation: (i) how to detect these co-reference links and associated co-reference chains; (ii) how to use them to prepare positive and negative examples to train a supervised system for the detection of relations between entity mentions; (iii) what context around which entity mentions contributes to relation detection when co-reference chains are provided.
RESULTS: We present experiments and results obtained both with gold entity mentions (task 2 of BioNLP-ST 2013) and with automatically detected entity mentions (end-to-end system, in task 3 of BioNLP-ST 2013). Our supervised mention detection system uses a linear chain Conditional Random Fields classifier, and our relation detection system relies on a Logistic Regression (aka Maximum Entropy) classifier. They use a set of morphological, morphosyntactic and semantic features. To minimize false inferences, co-reference resolution applies a set of heuristic rules designed to optimize precision. They take into account the types of the detected entity mentions, and take advantage of the didactic nature of the texts of the corpus, where a large proportion of bacteria naming is fairly explicit (although natural referring expressions such as "the bacteria" are common). The resulting system achieved a 0.495 F-measure on the official test set when taking as input the gold entity mentions, and a 0.351 F-measure when taking as input entity mentions predicted by our CRF system, both of which are above the best BioNLP-ST 2013 participant system.
CONCLUSIONS: We show that co-reference resolution substantially improves over a baseline system which does not use co-reference information: about 3.5 F-measure points on the test corpus for the end-to-end system (5.5 points on the development corpus) and 7 F-measure points on both development and test corpora when gold mentions are used. While this outperforms the best published system on the BioNLP-ST 2013 Bacteria Biotope dataset, we consider that it provides mostly a stronger baseline from which more work can be started. We also emphasize the importance and difficulty of designing a comprehensive gold standard co-reference annotation, which we explain is a key point to further progress on the task.

Entities:  

Mesh:

Year:  2015        PMID: 26201352      PMCID: PMC4511182          DOI: 10.1186/1471-2105-16-S10-S6

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  18 in total

1.  It's about this and that: a description of anaphoric expressions in clinical text.

Authors:  Yan Wang; Genevieve B Melton; Serguei Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

Review 2.  Evaluating the state of the art in coreference resolution for electronic medical records.

Authors:  Ozlem Uzuner; Andreea Bodnari; Shuying Shen; Tyler Forbush; John Pestian; Brett R South
Journal:  J Am Med Inform Assoc       Date:  2012-02-24       Impact factor: 4.497

3.  Integration of gene normalization stages and co-reference resolution using a Markov logic network.

Authors:  Hong-Jie Dai; Yen-Ching Chang; Richard Tzong-Han Tsai; Wen-Lian Hsu
Journal:  Bioinformatics       Date:  2011-06-17       Impact factor: 6.937

4.  Biological network extraction from scientific literature: state of the art and challenges.

Authors:  Chen Li; Maria Liakata; Dietrich Rebholz-Schuhmann
Journal:  Brief Bioinform       Date:  2013-02-22       Impact factor: 11.622

5.  Coreference based event-argument relation extraction on biomedical text.

Authors:  Katsumasa Yoshikawa; Sebastian Riedel; Tsutomu Hirao; Masayuki Asahara; Yuji Matsumoto
Journal:  J Biomed Semantics       Date:  2011-10-06

6.  The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011.

Authors:  Jin-Dong Kim; Ngan Nguyen; Yue Wang; Jun'ichi Tsujii; Toshihisa Takagi; Akinori Yonezawa
Journal:  BMC Bioinformatics       Date:  2012-06-26       Impact factor: 3.169

7.  Event extraction of bacteria biotopes: a knowledge-intensive NLP-based approach.

Authors:  Zorana Ratkovic; Wiktoria Golik; Pierre Warnier
Journal:  BMC Bioinformatics       Date:  2012-06-26       Impact factor: 3.169

8.  BioNLP Shared Task--The Bacteria Track.

Authors:  Robert Bossy; Julien Jourde; Alain-Pierre Manine; Philippe Veber; Erick Alphonse; Maarten van de Guchte; Philippe Bessières; Claire Nédellec
Journal:  BMC Bioinformatics       Date:  2012-06-26       Impact factor: 3.169

9.  A rule based solution to co-reference resolution in clinical text.

Authors:  Ping Chen; David Hinote; Guoqing Chen
Journal:  J Am Med Inform Assoc       Date:  2012-10-11       Impact factor: 4.497

10.  Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.

Authors:  Buzhou Tang; Hongxin Cao; Yonghui Wu; Min Jiang; Hua Xu
Journal:  BMC Med Inform Decis Mak       Date:  2013-04-05       Impact factor: 2.796

View more
  10 in total

1.  Bridging semantics and syntax with graph algorithms-state-of-the-art of extracting biomedical relations.

Authors:  Yuan Luo; Özlem Uzuner; Peter Szolovits
Journal:  Brief Bioinform       Date:  2016-02-05       Impact factor: 11.622

2.  Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning.

Authors:  Long Chen; Yu Gu; Xin Ji; Zhiyong Sun; Haodan Li; Yuan Gao; Yang Huang
Journal:  J Am Med Inform Assoc       Date:  2020-01-01       Impact factor: 4.497

3.  Relation Extraction from Clinical Narratives Using Pre-trained Language Models.

Authors:  Qiang Wei; Zongcheng Ji; Yuqi Si; Jingcheng Du; Jingqi Wang; Firat Tiryaki; Stephen Wu; Cui Tao; Kirk Roberts; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

4.  Sortal anaphora resolution to enhance relation extraction from biomedical literature.

Authors:  Halil Kilicoglu; Graciela Rosemblat; Marcelo Fiszman; Thomas C Rindflesch
Journal:  BMC Bioinformatics       Date:  2016-04-14       Impact factor: 3.169

5.  Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles.

Authors:  K Bretonnel Cohen; Arrick Lanfranchi; Miji Joo-Young Choi; Michael Bada; William A Baumgartner; Natalya Panteleyeva; Karin Verspoor; Martha Palmer; Lawrence E Hunter
Journal:  BMC Bioinformatics       Date:  2017-08-17       Impact factor: 3.169

6.  A neural joint model for entity and relation extraction from biomedical text.

Authors:  Fei Li; Meishan Zhang; Guohong Fu; Donghong Ji
Journal:  BMC Bioinformatics       Date:  2017-03-31       Impact factor: 3.169

7.  Extraction of Information Related to Adverse Drug Events from Electronic Health Record Notes: Design of an End-to-End Model Based on Deep Learning.

Authors:  Fei Li; Weisong Liu; Hong Yu
Journal:  JMIR Med Inform       Date:  2018-11-26

8.  COPIOUS: A gold standard corpus of named entities towards extracting species occurrence from biodiversity literature.

Authors:  Nhung T H Nguyen; Roselyn S Gabud; Sophia Ananiadou
Journal:  Biodivers Data J       Date:  2019-01-22

9.  Unsupervised inference of implicit biomedical events using context triggers.

Authors:  Jin-Woo Chung; Wonsuk Yang; Jong C Park
Journal:  BMC Bioinformatics       Date:  2020-01-28       Impact factor: 3.169

10.  Bio-SCoRes: A Smorgasbord Architecture for Coreference Resolution in Biomedical Text.

Authors:  Halil Kilicoglu; Dina Demner-Fushman
Journal:  PLoS One       Date:  2016-03-02       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.