Literature DB >> 15890744

Discovering patterns to extract protein-protein interactions from the literature: Part II.

Yu Hao1, Xiaoyan Zhu, Minlie Huang, Ming Li.   

Abstract

MOTIVATION: An enormous number of protein-protein interaction relationships are buried in millions of research articles published over the years, and the number is growing. Rediscovering them automatically is a challenging bioinformatics task. Solutions to this problem also reach far beyond bioinformatics.
RESULTS: We study a new approach that involves automatically discovering English expression patterns, optimizing them and using them to extract protein-protein interactions. In a sister paper, we described how to generate English expression patterns related to protein-protein interactions, and this approach alone has already achieved precision and recall rates significantly higher than those of other automatic systems. This paper continues to present our theory, focusing on how to improve the patterns. A minimum description length (MDL)-based pattern-optimization algorithm is designed to reduce and merge patterns. This has significantly increased generalization power, and hence the recall and precision rates, as confirmed by our experiments. AVAILABILITY: http://spies.cs.tsinghua.edu.cn.

Entities:  

Mesh:

Year:  2005        PMID: 15890744     DOI: 10.1093/bioinformatics/bti493

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  A comprehensive benchmark of kernel methods to extract protein-protein interactions from literature.

Authors:  Domonkos Tikk; Philippe Thomas; Peter Palaga; Jörg Hakenberg; Ulf Leser
Journal:  PLoS Comput Biol       Date:  2010-07-01       Impact factor: 4.475

2.  Simple tricks for improving pattern-based information extraction from the biomedical literature.

Authors:  Quang Long Nguyen; Domonkos Tikk; Ulf Leser
Journal:  J Biomed Semantics       Date:  2010-09-24

3.  The Text-mining based PubChem Bioassay neighboring analysis.

Authors:  Lianyi Han; Tugba O Suzek; Yanli Wang; Steve H Bryant
Journal:  BMC Bioinformatics       Date:  2010-11-08       Impact factor: 3.169

4.  PPI finder: a mining tool for human protein-protein interactions.

Authors:  Min He; Yi Wang; Wei Li
Journal:  PLoS One       Date:  2009-02-23       Impact factor: 3.240

5.  Evaluation of linguistic features useful in extraction of interactions from PubMed; application to annotating known, high-throughput and predicted interactions in I2D.

Authors:  Yun Niu; David Otasek; Igor Jurisica
Journal:  Bioinformatics       Date:  2009-10-22       Impact factor: 6.937

6.  PCorral--interactive mining of protein interactions from MEDLINE.

Authors:  Chen Li; Antonio Jimeno-Yepes; Miguel Arregui; Harald Kirsch; Dietrich Rebholz-Schuhmann
Journal:  Database (Oxford)       Date:  2013-05-02       Impact factor: 3.451

7.  PPInterFinder--a mining tool for extracting causal relations on human proteins from literature.

Authors:  Kalpana Raja; Suresh Subramani; Jeyakumar Natarajan
Journal:  Database (Oxford)       Date:  2013-01-15       Impact factor: 3.451

8.  Gene mention normalization and interaction extraction with context models and sentence motifs.

Authors:  Jörg Hakenberg; Conrad Plake; Loic Royer; Hendrik Strobelt; Ulf Leser; Michael Schroeder
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

9.  Exploiting likely-positive and unlabeled data to improve the identification of protein-protein interaction articles.

Authors:  Richard Tzong-Han Tsai; Hsi-Chuan Hung; Hong-Jie Dai; Yi-Wen Lin; Wen-Lian Hsu
Journal:  BMC Bioinformatics       Date:  2008       Impact factor: 3.169

10.  PolySearch: a web-based text mining system for extracting relationships between human diseases, genes, mutations, drugs and metabolites.

Authors:  Dean Cheng; Craig Knox; Nelson Young; Paul Stothard; Sambasivarao Damaraju; David S Wishart
Journal:  Nucleic Acids Res       Date:  2008-05-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.