Literature DB >> 22151999

Cross-species gene normalization by species inference.

Chih-Hsuan Wei1, Hung-Yu Kao.   

Abstract

BACKGROUND: To access and utilize the rich information contained in the biomedical literature, the ability to recognize and normalize gene mentions referenced in the literature is crucial. In this paper, we focus on improvements to the accuracy of gene normalization in cases where species information is not provided. Gene names are often ambiguous, in that they can refer to the genes of many species. Therefore, gene normalization is a difficult challenge.
METHODS: We define "gene normalization" as a series of tasks involving several issues, including gene name recognition, species assignation and species-specific gene normalization. We propose an integrated method, GenNorm, consisting of three modules to handle the issues of this task. Every issue can affect overall performance, though the most important is species assignation. Clearly, correct identification of the species can decrease the ambiguity of orthologous genes.
RESULTS: In experiments, the proposed model attained the top-1 threshold average precision (TAP-k) scores of 0.3297 (k=5), 0.3538 (k=10), and 0.3535 (k=20) when tested against 50 articles that had been selected for their difficulty and the most divergent results from pooled team submissions. In the silver-standard-507 evaluation, our TAP-k scores are 0.4591 for k=5, 10, and 20 and were ranked 2nd, 2nd, and 3rd respectively. AVAILABILITY: A web service and input, output formats of GenNorm are available at http://ikmbio.csie.ncku.edu.tw/GN/.

Entities:  

Mesh:

Year:  2011        PMID: 22151999      PMCID: PMC3269940          DOI: 10.1186/1471-2105-12-S8-S5

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  22 in total

1.  Extracting protein interactions from text with the unified AkaneRE event extraction system.

Authors:  Rune Saetre; Kazuhiro Yoshida; Makoto Miwa; Takuya Matsuzaki; Yoshinobu Kano; Jun'ichi Tsujii
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2010 Jul-Sep       Impact factor: 3.710

2.  BioLMiner System: interaction normalization task and interaction pair task in the BioCreative II.5 challenge.

Authors:  Yifei Chen; Feng Liu; Bernard Manderick
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2010 Jul-Sep       Impact factor: 3.710

3.  An Overview of BioCreative II.5.

Authors:  Florian Leitner; Scott A Mardis; Martin Krallinger; Gianni Cesareni; Lynette A Hirschman; Alfonso Valencia
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2010 Jul-Sep       Impact factor: 3.710

4.  Multistage gene normalization and SVM-based ranking for protein interactor extraction in full-text articles.

Authors:  Hong-Jie Dai; Po-Ting Lai; Richard Tzong-Han Tsai
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2010 Jul-Sep       Impact factor: 3.710

5.  Efficient extraction of protein-protein interactions from full-text articles.

Authors:  Jörg Hakenberg; Robert Leaman; Nguyen Ha Vo; Siddhartha Jonnalagadda; Ryan Sullivan; Christopher Miller; Luis Tari; Chitta Baral; Graciela Gonzalez
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2010 Jul-Sep       Impact factor: 3.710

6.  High-performance gene name normalization with GeNo.

Authors:  Joachim Wermter; Katrin Tomanek; Udo Hahn
Journal:  Bioinformatics       Date:  2009-02-02       Impact factor: 6.937

7.  Detection of IUPAC and IUPAC-like chemical names.

Authors:  Roman Klinger; Corinna Kolárik; Juliane Fluck; Martin Hofmann-Apitius; Christoph M Friedrich
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

8.  Overview of BioCreAtIvE task 1B: normalized gene lists.

Authors:  Lynette Hirschman; Marc Colosimo; Alexander Morgan; Alexander Yeh
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

9.  Entrez Gene: gene-centered information at NCBI.

Authors:  Donna Maglott; Jim Ostell; Kim D Pruitt; Tatiana Tatusova
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  Overview of BioCreative II gene normalization.

Authors:  Alexander A Morgan; Zhiyong Lu; Xinglong Wang; Aaron M Cohen; Juliane Fluck; Patrick Ruch; Anna Divoli; Katrin Fundel; Robert Leaman; Jörg Hakenberg; Chengjie Sun; Heng-hui Liu; Rafael Torres; Michael Krauthammer; William W Lau; Hongfang Liu; Chun-Nan Hsu; Martijn Schuemie; K Bretonnel Cohen; Lynette Hirschman
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

View more
  36 in total

1.  iPTMnet: Integrative Bioinformatics for Studying PTM Networks.

Authors:  Karen E Ross; Hongzhan Huang; Jia Ren; Cecilia N Arighi; Gang Li; Catalina O Tudor; Mengxi Lv; Jung-Youn Lee; Sheng-Chih Chen; K Vijay-Shanker; Cathy H Wu
Journal:  Methods Mol Biol       Date:  2017

2.  SimConcept: A Hybrid Approach for Simplifying Composite Named Entities in Biomedicine.

Authors:  Chih-Hsuan Wei; Robert Leaman; Zhiyong Lu
Journal:  ACM BCB       Date:  2014

3.  Scalable Text Mining Assisted Curation of Post-Translationally Modified Proteoforms in the Protein Ontology.

Authors:  Karen E Ross; Darren A Natale; Cecilia Arighi; Sheng-Chih Chen; Hongzhan Huang; Gang Li; Jia Ren; Michael Wang; K Vijay-Shanker; Cathy H Wu
Journal:  CEUR Workshop Proc       Date:  2016-11-29

4.  SimConcept: a hybrid approach for simplifying composite named entities in biomedical text.

Authors:  Chih-Hsuan Wei; Robert Leaman; Zhiyong Lu
Journal:  IEEE J Biomed Health Inform       Date:  2015-04-13       Impact factor: 5.772

5.  PubTator central: automated concept annotation for biomedical full text articles.

Authors:  Chih-Hsuan Wei; Alexis Allot; Robert Leaman; Zhiyong Lu
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

6.  DES-Mutation: System for Exploring Links of Mutations and Diseases.

Authors:  Vasiliki Kordopati; Adil Salhi; Rozaimi Razali; Aleksandar Radovanovic; Faroug Tifratene; Mahmut Uludag; Yu Li; Ameerah Bokhari; Ahdab AlSaieedi; Arwa Bin Raies; Christophe Van Neste; Magbubah Essack; Vladimir B Bajic
Journal:  Sci Rep       Date:  2018-09-06       Impact factor: 4.379

7.  Accelerating literature curation with text-mining tools: a case study of using PubTator to curate genes in PubMed abstracts.

Authors:  Chih-Hsuan Wei; Bethany R Harris; Donghui Li; Tanya Z Berardini; Eva Huala; Hung-Yu Kao; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2012-11-17       Impact factor: 3.451

8.  Large-scale event extraction from literature with multi-level gene normalization.

Authors:  Sofie Van Landeghem; Jari Björne; Chih-Hsuan Wei; Kai Hakala; Sampo Pyysalo; Sophia Ananiadou; Hung-Yu Kao; Zhiyong Lu; Tapio Salakoski; Yves Van de Peer; Filip Ginter
Journal:  PLoS One       Date:  2013-04-17       Impact factor: 3.240

9.  SR4GN: a species recognition software tool for gene normalization.

Authors:  Chih-Hsuan Wei; Hung-Yu Kao; Zhiyong Lu
Journal:  PLoS One       Date:  2012-06-05       Impact factor: 3.240

10.  PubTator: a web-based text mining tool for assisting biocuration.

Authors:  Chih-Hsuan Wei; Hung-Yu Kao; Zhiyong Lu
Journal:  Nucleic Acids Res       Date:  2013-05-22       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.