The potential use of SUISEKI as a protein interaction discovery tool.

C Blaschke, A Valencia.

Abstract

Relevant information about protein interactions is stored in textual sources. These sources are commonly used not only as archives of what is already known but also as information for generating new knowledge, particularly to pose hypotheses about new possible interactions that can be inferred from the existing ones. This task is the more creative part of scientific work in experimental systems. We present a large-scale analysis for the prediction of new interactions, based on the network of interactions that are already known and detected automatically in the literature. During the last few years it has become clear that part of the information about protein interactions could be extracted with automatic tools, even if these tools are still far from perfect and key problems such as detection of protein names are not completely solved. We have developed an integrated automatic approach, called SUISEKI (System for Information Extraction on Interactions), able to extract protein interactions from collections of Medline abstracts. Previous experiments with the system have shown that it is able to extract almost 70% of the interactions present in a relatively large text corpus, with an accuracy of approximately 80% (for the best defined interactions). This makes the system usable in real scenarios, both at the level of extracting protein names and at the level of extracting interactions between them. With the analysis of the interaction map of Saccharomyces cerevisiae we show that interactions published in the years 2000/2001 frequently correspond to proteins or genes that were already very close in the interaction network deduced from the literature published before these years, and that they are often connected to the same proteins. That is, discoveries are commonly made among highly connected entities.
Some biologically relevant examples illustrate how interactions described in the year 2000 could have been proposed as reasonable working hypotheses with the information previously available in the automatically extracted network of interactions.
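
The observation that newly published interactions tend to link proteins that were already close in the network, and that often share interaction partners, can be illustrated with a small sketch. This is not the SUISEKI implementation: the scoring rule (number of shared partners first, shortest-path distance second) is an assumption made for illustration, and the protein names `P1` to `P5` and the edge list are hypothetical toy data.

```python
from collections import deque
from itertools import combinations


def build_graph(pairs):
    """Build an undirected interaction graph as a dict of neighbour sets."""
    graph = {}
    for a, b in pairs:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def shortest_path_length(graph, start, goal):
    """Breadth-first search; returns the number of edges, or None if unreachable."""
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        for neighbour in graph[node]:
            if neighbour == goal:
                return dist + 1
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, dist + 1))
    return None


def rank_candidates(graph):
    """Rank protein pairs that are not yet linked.

    Assumed heuristic: more shared partners ranks higher, then shorter distance.
    Each result is (protein_a, protein_b, shared_partner_count, distance).
    """
    candidates = []
    for a, b in combinations(sorted(graph), 2):
        if b in graph[a]:
            continue  # interaction already known
        dist = shortest_path_length(graph, a, b)
        if dist is None:
            continue  # different network components, no basis for a hypothesis
        shared = graph[a] & graph[b]
        candidates.append((a, b, len(shared), dist))
    candidates.sort(key=lambda c: (-c[2], c[3], c[0], c[1]))
    return candidates


# Hypothetical interactions, standing in for those extracted from earlier literature.
known = [("P1", "P2"), ("P1", "P3"), ("P2", "P4"), ("P3", "P4"), ("P4", "P5")]
ranked = rank_candidates(build_graph(known))

for a, b, shared, dist in ranked:
    print(f"{a}-{b}: shared partners={shared}, distance={dist}")
```

In this toy network the pairs with two shared partners (`P1`-`P4` and `P2`-`P3`) rank first, and the pair with no shared partner and the longest distance (`P1`-`P5`) ranks last. A real analysis would compare such a ranking against interactions published after the cutoff date.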

Year:  2001        PMID: 11791231

Source DB:  PubMed          Journal:  Genome Inform        ISSN: 0919-9454


  18 in total

1.  Dragon TF Association Miner: a system for exploring transcription factor associations through text-mining.

Authors:  Hong Pan; Li Zuo; Vidhu Choudhary; Zhuo Zhang; Shoi Houi Leow; Fui Teen Chong; Yingliang Huang; Victor Wui Siong Ong; Bijayalaxmi Mohanty; Sin Lam Tan; S P T Krishnan; Vladimir B Bajic
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  KID--an algorithm for fast and efficient text mining used to automatically generate a database containing kinetic information of enzymes.

Authors:  Stephanie Heinen; Bernhard Thielen; Dietmar Schomburg
Journal:  BMC Bioinformatics       Date:  2010-07-13       Impact factor: 3.169

3.  Kinase pathway database: an integrated protein-kinase and NLP-based protein-interaction resource.

Authors:  Asako Koike; Yoshiyuki Kobayashi; Toshihisa Takagi
Journal:  Genome Res       Date:  2003-06       Impact factor: 9.043

4.  HIGH-PRECISION BIOLOGICAL EVENT EXTRACTION: EFFECTS OF SYSTEM AND OF DATA.

Authors:  K Bretonnel Cohen; Karin Verspoor; Helen L Johnson; Chris Roeder; Philip V Ogren; William A Baumgartner; Elizabeth White; Hannah Tipney; Lawrence Hunter
Journal:  Comput Intell       Date:  2011-11       Impact factor: 2.330

5.  PreBIND and Textomy--mining the biomedical literature for protein-protein interactions using a support vector machine.

Authors:  Ian Donaldson; Joel Martin; Berry de Bruijn; Cheryl Wolting; Vicki Lay; Brigitte Tuekam; Shudong Zhang; Berivan Baskin; Gary D Bader; Katerina Michalickova; Tony Pawson; Christopher W V Hogue
Journal:  BMC Bioinformatics       Date:  2003-03-27       Impact factor: 3.169

6.  Dragon Plant Biology Explorer. A text-mining tool for integrating associations between genetic and biochemical entities with genome annotation and biochemical terms lists.

Authors:  Vladimir B Bajic; Merlin Veronika; Pardha Sarathi Veladandi; Archana Meka; Mok-Wei Heng; Kanagasabai Rajaraman; Hong Pan; Sanjay Swarup
Journal:  Plant Physiol       Date:  2005-08       Impact factor: 8.340

7.  Signalling network construction for modelling plant defence response.

Authors:  Dragana Miljkovic; Tjaša Stare; Igor Mozetič; Vid Podpečan; Marko Petek; Kamil Witek; Marina Dermastia; Nada Lavrač; Kristina Gruden
Journal:  PLoS One       Date:  2012-12-18       Impact factor: 3.240

8.  LAITOR--Literature Assistant for Identification of Terms co-Occurrences and Relationships.

Authors:  Adriano Barbosa-Silva; Theodoros G Soldatos; Ivan L F Magalhães; Georgios A Pavlopoulos; Jean-Fred Fontaine; Miguel A Andrade-Navarro; Reinhard Schneider; J Miguel Ortega
Journal:  BMC Bioinformatics       Date:  2010-02-01       Impact factor: 3.169

9.  ISDB: Interaction Sentence Database.

Authors:  Michael A Bauer; Robert E Belford; Jing Ding; Daniel Berleant
Journal:  BMC Res Notes       Date:  2010-05-03

10.  Biomedical text mining and its applications.

Authors:  Raul Rodriguez-Esteban
Journal:  PLoS Comput Biol       Date:  2009-12-24       Impact factor: 4.475
