Literature DB >> 11301305

Mining literature for protein-protein interactions.

E M Marcotte1, I Xenarios, D Eisenberg.   

Abstract

MOTIVATION: A central problem in bioinformatics is how to capture information from the vast current scientific literature in a form suitable for analysis by computer. We address the special case of information on protein-protein interactions, and show that the frequencies of words in Medline abstracts can be used to determine whether or not a given paper discusses protein-protein interactions. For those papers determined to discuss this topic, the relevant information can be captured for the Database of Interacting PROTEINS: Furthermore, suitable gene annotations can also be captured.
RESULTS: Our Bayesian approach scores Medline abstracts for probability of discussing the topic of interest according to the frequencies of discriminating words found in the abstract. More than 80 discriminating words (e.g. complex, interaction, two-hybrid) were determined from a training set of 260 Medline abstracts corresponding to previously validated entries in the Database of Interacting Proteins. Using these words and a log likelihood scoring function, approximately 2000 Medline abstracts were identified as describing interactions between yeast proteins. This approach now forms the basis for the rapid expansion of the Database of Interacting Proteins.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11301305     DOI: 10.1093/bioinformatics/17.4.359

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  57 in total

1.  DIP: The Database of Interacting Proteins: 2001 update.

Authors:  I Xenarios; E Fernandez; L Salwinski; X J Duan; M J Thompson; E M Marcotte; D Eisenberg
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Predictome: a database of putative functional links between proteins.

Authors:  Joseph C Mellor; Itai Yanai; Karl H Clodfelter; Julian Mintseris; Charles DeLisi
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  Computational approaches to protein-protein interaction.

Authors:  Giacomo Franzot; Oliviero Carugo
Journal:  J Struct Funct Genomics       Date:  2003

4.  Whole-genome annotation by using evidence integration in functional-linkage networks.

Authors:  Ulas Karaoz; T M Murali; Stan Letovsky; Yu Zheng; Chunming Ding; Charles R Cantor; Simon Kasif
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-23       Impact factor: 11.205

Review 5.  Toward a complete in silico, multi-layered embryonic stem cell regulatory network.

Authors:  Huilei Xu; Christoph Schaniel; Ihor R Lemischka; Avi Ma'ayan
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2010 Nov-Dec

Review 6.  Toward predictive models of mammalian cells.

Authors:  Avi Ma'ayan; Robert D Blitzer; Ravi Iyengar
Journal:  Annu Rev Biophys Biomol Struct       Date:  2005

Review 7.  From components to regulatory motifs in signalling networks.

Authors:  Avi Ma'ayan; Ravi Iyengar
Journal:  Brief Funct Genomic Proteomic       Date:  2006-02-20

Review 8.  The cognitive phenotype of Down syndrome: insights from intracellular network analysis.

Authors:  Avi Ma'ayan; Katheleen Gardiner; Ravi Iyengar
Journal:  NeuroRx       Date:  2006-07

9.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions.

Authors:  Ioannis Xenarios; Lukasz Salwínski; Xiaoqun Joyce Duan; Patrick Higney; Sul-Min Kim; David Eisenberg
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

10.  Textpresso: an ontology-based information retrieval and extraction system for biological literature.

Authors:  Hans-Michael Müller; Eimear E Kenny; Paul W Sternberg
Journal:  PLoS Biol       Date:  2004-09-21       Impact factor: 8.029

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.