Literature DB >> 21685052

Integration of gene normalization stages and co-reference resolution using a Markov logic network.

Hong-Jie Dai1, Yen-Ching Chang, Richard Tzong-Han Tsai, Wen-Lian Hsu.   

Abstract

MOTIVATION: Gene normalization (GN) is the task of normalizing a textual gene mention to a unique gene database ID. Traditional top performing GN systems usually need to consider several constraints to make decisions in the normalization process, including filtering out false positives, or disambiguating an ambiguous gene mention, to improve system performance. However, these constraints are usually executed in several separate stages and cannot use each other's input/output interactively. In this article, we propose a novel approach that employs a Markov logic network (MLN) to model the constraints used in the GN task. Firstly, we show how various constraints can be formulated and combined in an MLN. Secondly, we are the first to apply the two main concepts of co-reference resolution-discourse salience in centering theory and transitivity-to GN models. Furthermore, to make our results more relevant to developers of information extraction applications, we adopt the instance-based precision/recall/F-measure (PRF) in addition to the article-wide PRF to assess system performance.
RESULTS: Experimental results show that our system outperforms baseline and state-of-the-art systems under two evaluation schemes. Through further analysis, we have found several unexplored challenges in the GN task. CONTACT: hongjie@iis.sinica.edu.tw SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2011        PMID: 21685052     DOI: 10.1093/bioinformatics/btr358

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  9 in total

1.  A literature search tool for intelligent extraction of disease-associated genes.

Authors:  Jae-Yoon Jung; Todd F DeLuca; Tristan H Nelson; Dennis P Wall
Journal:  J Am Med Inform Assoc       Date:  2013-09-02       Impact factor: 4.497

2.  The contribution of co-reference resolution to supervised relation detection between bacteria and biotopes entities.

Authors:  Thomas Lavergne; Cyril Grouin; Pierre Zweigenbaum
Journal:  BMC Bioinformatics       Date:  2015-07-13       Impact factor: 3.169

3.  T-HOD: a literature-based candidate gene database for hypertension, obesity and diabetes.

Authors:  Hong-Jie Dai; Johnny Chi-Yang Wu; Richard Tzong-Han Tsai; Wen-Harn Pan; Wen-Lian Hsu
Journal:  Database (Oxford)       Date:  2013-02-12       Impact factor: 3.451

4.  A resource-saving collective approach to biomedical semantic role labeling.

Authors:  Richard Tzong-Han Tsai; Po-Ting Lai
Journal:  BMC Bioinformatics       Date:  2014-05-27       Impact factor: 3.169

5.  Collective instance-level gene normalization on the IGN corpus.

Authors:  Hong-Jie Dai; Johnny Chi-Yang Wu; Richard Tzong-Han Tsai
Journal:  PLoS One       Date:  2013-11-25       Impact factor: 3.240

6.  MET network in PubMed: a text-mined network visualization and curation system.

Authors:  Hong-Jie Dai; Chu-Hsien Su; Po-Ting Lai; Ming-Siang Huang; Jitendra Jonnagaddala; Toni Rose Jue; Shruti Rao; Hui-Jou Chou; Marija Milacic; Onkar Singh; Shabbir Syed-Abdul; Wen-Lian Hsu
Journal:  Database (Oxford)       Date:  2016-05-30       Impact factor: 3.451

7.  SPRENO: a BioC module for identifying organism terms in figure captions.

Authors:  Hong-Jie Dai; Onkar Singh
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

8.  An overview of the BioCreative 2012 Workshop Track III: interactive text mining task.

Authors:  Cecilia N Arighi; Ben Carterette; K Bretonnel Cohen; Martin Krallinger; W John Wilbur; Petra Fey; Robert Dodson; Laurel Cooper; Ceri E Van Slyke; Wasila Dahdul; Paula Mabee; Donghui Li; Bethany Harris; Marc Gillespie; Silvia Jimenez; Phoebe Roberts; Lisa Matthews; Kevin Becker; Harold Drabkin; Susan Bello; Luana Licata; Andrew Chatr-aryamontri; Mary L Schaeffer; Julie Park; Melissa Haendel; Kimberly Van Auken; Yuling Li; Juancarlos Chan; Hans-Michael Muller; Hong Cui; James P Balhoff; Johnny Chi-Yang Wu; Zhiyong Lu; Chih-Hsuan Wei; Catalina O Tudor; Kalpana Raja; Suresh Subramani; Jeyakumar Natarajan; Juan Miguel Cejuela; Pratibha Dubey; Cathy Wu
Journal:  Database (Oxford)       Date:  2013-01-17       Impact factor: 3.451

9.  NTTMUNSW BioC modules for recognizing and normalizing species and gene/protein mentions.

Authors:  Hong-Jie Dai; Onkar Singh; Jitendra Jonnagaddala; Emily Chia-Yu Su
Journal:  Database (Oxford)       Date:  2016-07-27       Impact factor: 3.451

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.