Literature DB >> 18556670

Ensemble non-negative matrix factorization methods for clustering protein-protein interactions.

Derek Greene1, Gerard Cagney, Nevan Krogan, Pádraig Cunningham.   

Abstract

MOTIVATION: When working with large-scale protein interaction data, an important analysis task is the assignment of pairs of proteins to groups that correspond to higher order assemblies. Previously a common approach to this problem has been to apply standard hierarchical clustering methods to identify such a groups. Here we propose a new algorithm for aggregating a diverse collection of matrix factorizations to produce a more informative clustering, which takes the form of a 'soft' hierarchy of clusters.
RESULTS: We apply the proposed Ensemble non-negative matrix factorization (NMF) algorithm to a high-quality assembly of binary protein interactions derived from two proteome-wide studies in yeast. Our experimental evaluation demonstrates that the algorithm lends itself to discovering small localized structures in this data, which correspond to known functional groupings of complexes. In addition, we show that the algorithm also supports the assignment of putative functions for previously uncharacterized proteins, for instance the protein YNR024W, which may be an uncharacterized component of the exosome.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18556670      PMCID: PMC3493126          DOI: 10.1093/bioinformatics/btn286

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

1.  Networking proteins in yeast.

Authors:  T R Hazbun; S Fields
Journal:  Proc Natl Acad Sci U S A       Date:  2001-04-10       Impact factor: 11.205

2.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

3.  The Database of Interacting Proteins: 2004 update.

Authors:  Lukasz Salwinski; Christopher S Miller; Adam J Smith; Frank K Pettit; James U Bowie; David Eisenberg
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  The nuclear actin-related proteins Arp7 and Arp9: a dimeric module that cooperates with architectural proteins for chromatin remodeling.

Authors:  Heather Szerlong; Anjanabha Saha; Bradley R Cairns
Journal:  EMBO J       Date:  2003-06-16       Impact factor: 11.598

5.  Metagenes and molecular pattern discovery using matrix factorization.

Authors:  Jean-Philippe Brunet; Pablo Tamayo; Todd R Golub; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-11       Impact factor: 11.205

6.  Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis.

Authors:  Hyunsoo Kim; Haesun Park
Journal:  Bioinformatics       Date:  2007-05-05       Impact factor: 6.937

7.  SGD: Saccharomyces Genome Database.

Authors:  J M Cherry; C Adler; C Ball; S A Chervitz; S S Dwight; E T Hester; Y Jia; G Juvik; T Roe; M Schroeder; S Weng; D Botstein
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

8.  The exosome: a conserved eukaryotic RNA processing complex containing multiple 3'-->5' exoribonucleases.

Authors:  P Mitchell; E Petfalski; A Shevchenko; M Mann; D Tollervey
Journal:  Cell       Date:  1997-11-14       Impact factor: 41.582

9.  MIPS: analysis and annotation of proteins from whole genomes.

Authors:  H W Mewes; C Amid; R Arnold; D Frishman; U Güldener; G Mannhaupt; M Münsterkötter; P Pagel; N Strack; V Stümpflen; J Warfsmann; A Ruepp
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

10.  Interactions between three common subunits of yeast RNA polymerases I and III.

Authors:  D Lalo; C Carles; A Sentenac; P Thuriaux
Journal:  Proc Natl Acad Sci U S A       Date:  1993-06-15       Impact factor: 11.205

View more
  16 in total

1.  Large-Scale Validation of Hypothesis Generation Systems via Candidate Ranking.

Authors:  Justin Sybrandt; Michael Shtutman; Ilya Safro
Journal:  Proc IEEE Int Conf Big Data       Date:  2019-01-24

2.  Recent advances in clustering methods for protein interaction networks.

Authors:  Jianxin Wang; Min Li; Youping Deng; Yi Pan
Journal:  BMC Genomics       Date:  2010-12-01       Impact factor: 3.969

3.  Co-clustering phenome-genome for phenotype classification and disease gene discovery.

Authors:  TaeHyun Hwang; Gowtham Atluri; MaoQiang Xie; Sanjoy Dey; Changjin Hong; Vipin Kumar; Rui Kuang
Journal:  Nucleic Acids Res       Date:  2012-06-26       Impact factor: 16.971

4.  Protein complex detection via weighted ensemble clustering based on Bayesian nonnegative matrix factorization.

Authors:  Le Ou-Yang; Dao-Qing Dai; Xiao-Fei Zhang
Journal:  PLoS One       Date:  2013-05-02       Impact factor: 3.240

5.  Defining the plasticity of transcription factor binding sites by Deconstructing DNA consensus sequences: the PhoP-binding sites among gamma/enterobacteria.

Authors:  Oscar Harari; Sun-Yang Park; Henry Huang; Eduardo A Groisman; Igor Zwir
Journal:  PLoS Comput Biol       Date:  2010-07-22       Impact factor: 4.475

6.  Microbial community pattern detection in human body habitats via ensemble clustering framework.

Authors:  Peng Yang; Xiaoquan Su; Le Ou-Yang; Hon-Nian Chua; Xiao-Li Li; Kang Ning
Journal:  BMC Syst Biol       Date:  2014-12-08

7.  A two-layer integration framework for protein complex detection.

Authors:  Le Ou-Yang; Min Wu; Xiao-Fei Zhang; Dao-Qing Dai; Xiao-Li Li; Hong Yan
Journal:  BMC Bioinformatics       Date:  2016-02-24       Impact factor: 3.169

8.  A novel method for discovering local spatial clusters of genomic regions with functional relationships from DNA contact maps.

Authors:  Xihao Hu; Christina Huan Shi; Kevin Y Yip
Journal:  Bioinformatics       Date:  2016-06-15       Impact factor: 6.937

9.  Paradigm of tunable clustering using Binarization of Consensus Partition Matrices (Bi-CoPaM) for gene discovery.

Authors:  Basel Abu-Jamous; Rui Fa; David J Roberts; Asoke K Nandi
Journal:  PLoS One       Date:  2013-02-11       Impact factor: 3.240

10.  An overview of the statistical methods used for inferring gene regulatory networks and protein-protein interaction networks.

Authors:  Amina Noor; Erchin Serpedin; Mohamed Nounou; Hazem Nounou; Nady Mohamed; Lotfi Chouchane
Journal:  Adv Bioinformatics       Date:  2013-02-21
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.