HIGH-PRECISION BIOLOGICAL EVENT EXTRACTION: EFFECTS OF SYSTEM AND OF DATA.

K Bretonnel Cohen, Karin Verspoor, Helen L Johnson, Chris Roeder, Philip V Ogren, William A Baumgartner, Elizabeth White, Hannah Tipney, Lawrence Hunter.

Abstract

We approached the problems of event detection, argument identification, and negation and speculation detection in the BioNLP'09 information extraction challenge through concept recognition and analysis. Our methodology used the OpenDMAP semantic parser with manually written rules. The original OpenDMAP system was updated for this challenge with a broad ontology defined for the events of interest, new linguistic patterns for those events, and specialized coordination handling. We achieved state-of-the-art precision for two of the three tasks, scoring the highest of 24 teams with a precision of 71.81 on Task 1 and the highest of 6 teams with a precision of 70.97 on Task 2. We provide a detailed analysis of the training data and show that a number of trigger words were ambiguous as to event type, even when their arguments were constrained by semantic class. The data is also shown to have a number of missing annotations. Analysis of a sampling of the comparatively small number of false positives returned by our system shows that major causes of this type of error were failing to recognize second themes in two-theme events, failing to recognize events when they were the arguments to other events, failing to recognize nontheme arguments, and sentence segmentation errors. We show that specifically handling coordination had a small but important impact on the overall performance of the system. The OpenDMAP system and the rule set are available at http://bionlp.sourceforge.net.
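To make the two central ideas of the abstract concrete (coordination handling and precision as the evaluation measure), here is a minimal sketch in Python. It is not the OpenDMAP implementation or its rule syntax: the function names `expand_coordination` and `precision`, the tuple layout, and the example sentence fragment are all assumptions made for illustration. The sketch assumes a rule has already matched a trigger word followed by a coordinated list of protein names, and it emits one single-theme event per conjunct.

```python
import re


def expand_coordination(trigger, event_type, themes_text):
    """Split a coordinated theme phrase and emit one event per conjunct.

    Hypothetical helper: handles only simple lists such as
    "A, B and C" or "A, B, and C".
    """
    parts = re.split(r",\s*(?:and\s+)?|\s+and\s+", themes_text)
    conjuncts = [p.strip() for p in parts if p.strip()]
    return [(event_type, trigger, theme) for theme in conjuncts]


def precision(predicted, gold):
    """Fraction of predicted events that also appear in the gold standard."""
    predicted, gold = set(predicted), set(gold)
    if not predicted:
        return 0.0
    return len(predicted & gold) / len(predicted)


# Illustrative input: "expression of IL-2, IL-4 and IFN-gamma"
events = expand_coordination("expression", "Gene_expression",
                             "IL-2, IL-4 and IFN-gamma")

# Made-up gold annotations, for demonstration only.
gold = {
    ("Gene_expression", "expression", "IL-2"),
    ("Gene_expression", "expression", "IL-4"),
    ("Gene_expression", "expression", "IL-10"),
}

print(events)
print(round(precision(events, gold), 4))
```

Without the expansion step, a pattern that binds only the first conjunct would report a single event and miss the others, which is one plausible way coordination handling can affect overall scores.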

Keywords:  BioNLP; conceptual analysis; event recognition; natural language processing; text mining

Year: 2011; PMID: 25937701; PMCID: PMC4414063; DOI: 10.1111/j.1467-8640.2011.00405.x

Source DB: PubMed; Journal: Comput Intell; ISSN: 0824-7935; Impact factor: 2.330


