Greedily building protein networks with confidence.

Joel S Bader

Abstract

MOTIVATION: With genome sequences complete for human and model organisms, it is essential to understand how individual genes and proteins are organized into biological networks. Much of the organization is revealed by proteomics experiments that now generate torrents of data. Extracting relevant complexes and pathways from high-throughput proteomics data sets has posed a challenge, however, and new methods to identify and extract networks are essential. We focus on the problem of building pathways starting from known proteins of interest.
RESULTS: We have developed an efficient, greedy algorithm, SEEDY, that extracts biologically relevant networks from protein-protein interaction data, building out from selected seed proteins. The algorithm relies on our previous study establishing statistical confidence levels for interactions generated by two-hybrid screens and inferred from mass spectrometric identification of protein complexes. We demonstrate the ability to extract known yeast complexes from high-throughput protein interaction data with a tunable parameter that governs the trade-off between sensitivity and selectivity. DNA damage repair pathways are presented as a detailed example. We highlight the ability to join heterogeneous data sets, in this case protein-protein interactions and genetic interactions, and the appearance of cross-talk between pathways caused by re-use of shared components.
SIGNIFICANCE AND COMPARISON: The SEEDY algorithm is significant in three respects: it is fast, with running time O[(E + V) log V] for V proteins and E interactions; a single adjustable parameter controls the size of the pathways that are generated; and an associated P-value indicates the statistical confidence that the pathways are enriched for proteins with a coherent function. Previous approaches have focused on extracting sub-networks by identifying motifs enriched in known biological networks. SEEDY provides the complementary ability to perform a directed search based on proteins of interest.
AVAILABILITY: SEEDY software (Perl source), data tables and confidence score models (R source) are freely available from the author.
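
The abstract does not spell out SEEDY's internals, so the following is only a minimal sketch of one way a greedy, seed-based expansion over confidence-weighted interactions could look. It assumes (this is not stated in the abstract) that each interaction carries a confidence between 0 and 1, that the confidence of a path is the product of its edge confidences, and that the single tunable parameter is a cutoff on that product. The function name `greedy_expand`, the parameter `min_confidence`, and the protein labels are hypothetical; the original software is written in Perl, and Python is used here purely for illustration.

```python
import heapq


def greedy_expand(edges, seeds, min_confidence):
    """Grow a network outward from seed proteins, best-first.

    edges: iterable of (protein_a, protein_b, confidence), 0 < confidence <= 1.
    seeds: proteins of interest that start the search.
    min_confidence: assumed cutoff; a protein is kept only if its best path
        from a seed has a confidence product at or above this value.
    Returns a dict mapping each reached protein to its best path confidence.
    """
    # Build an undirected adjacency list.
    graph = {}
    for a, b, conf in edges:
        graph.setdefault(a, []).append((b, conf))
        graph.setdefault(b, []).append((a, conf))

    best = {}
    # heapq is a min-heap, so scores are negated to pop the highest first.
    heap = [(-1.0, seed) for seed in seeds]
    heapq.heapify(heap)

    while heap:
        neg_score, protein = heapq.heappop(heap)
        if protein in best:
            continue  # already finalized with an equal or better score
        score = -neg_score
        best[protein] = score
        for neighbor, conf in graph.get(protein, []):
            candidate = score * conf
            if neighbor not in best and candidate >= min_confidence:
                heapq.heappush(heap, (-candidate, neighbor))
    return best


# Toy data with made-up labels and confidences, for illustration only.
toy_edges = [
    ("P1", "P2", 0.9),
    ("P2", "P3", 0.8),
    ("P3", "P4", 0.3),
    ("P1", "P5", 0.6),
]
network = greedy_expand(toy_edges, seeds=["P1"], min_confidence=0.5)
print(sorted(network))
```

Raising `min_confidence` shrinks the extracted network and lowering it admits more distant proteins, which is one way to realize a sensitivity versus selectivity trade-off with a single parameter. With a binary heap, this best-first search runs in time of the same order as the O[(E + V) log V] bound quoted above.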

Year: 2003    PMID: 14555618    DOI: 10.1093/bioinformatics/btg358

Source DB: PubMed    Journal: Bioinformatics    ISSN: 1367-4803    Impact factor: 6.937


  17 in total

Review 1.  Algorithmic and analytical methods in network biology.

Authors:  Mehmet Koyutürk
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2010 May-Jun

Review 2.  Differential network analysis in human cancer research.

Authors:  Ryan Gill; Somnath Datta; Susmita Datta
Journal:  Curr Pharm Des       Date:  2014       Impact factor: 3.116

3.  Molecular triangulation: bridging linkage and molecular-network information for identifying candidate genes in Alzheimer's disease.

Authors:  Michael Krauthammer; Charles A Kaufmann; T Conrad Gilliam; Andrey Rzhetsky
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-07       Impact factor: 11.205

Review 4.  Gene module level analysis: identification to networks and dynamics.

Authors:  Xuewei Wang; Ertugrul Dalkic; Ming Wu; Christina Chan
Journal:  Curr Opin Biotechnol       Date:  2008-09-03       Impact factor: 9.740

5.  Chapter 5: Network biology approach to complex diseases.

Authors:  Dong-Yeon Cho; Yoo-Ah Kim; Teresa M Przytycka
Journal:  PLoS Comput Biol       Date:  2012-12-27       Impact factor: 4.475

6.  Investigating meta-approaches for reconstructing gene networks in a mammalian cellular context.

Authors:  Azree Nazri; Pietro Lio
Journal:  PLoS One       Date:  2012-01-09       Impact factor: 3.240

7.  Commensurate distances and similar motifs in genetic congruence and protein interaction networks in yeast.

Authors:  Ping Ye; Brian D Peyser; Forrest A Spencer; Joel S Bader
Journal:  BMC Bioinformatics       Date:  2005-11-09       Impact factor: 3.169

8.  Using a seed-network to query multiple large-scale gene expression datasets from the developing retina in order to identify and prioritize experimental targets.

Authors:  Laura A Hecker; Timothy C Alcon; Vasant G Honavar; M Heather West Greenlee
Journal:  Bioinform Biol Insights       Date:  2008-02-01

9.  RRW: repeated random walks on genome-scale protein networks for local cluster discovery.

Authors:  Kathy Macropol; Tolga Can; Ambuj K Singh
Journal:  BMC Bioinformatics       Date:  2009-09-09       Impact factor: 3.169

10.  An integrative -omics approach to identify functional sub-networks in human colorectal cancer.

Authors:  Rod K Nibbe; Mehmet Koyutürk; Mark R Chance
Journal:  PLoS Comput Biol       Date:  2010-01-15       Impact factor: 4.475
