Literature DB >> 12836664

Integration of genomic datasets to predict protein complexes in yeast.

Ronald Jansen1, Ning Lan, Jiang Qian, Mark Gerstein.   

Abstract

The ultimate goal of functional genomics is to define the function of all the genes in the genome of an organism. A large body of information of the biological roles of genes has been accumulated and aggregated in the past decades of research, both from traditional experiments detailing the role of individual genes and proteins, and from newer experimental strategies that aim to characterize gene function on a genomic scale. It is clear that the goal of functional genomics can only be achieved by integrating information and data sources from the variety of these different experiments. Integration of different data is thus an important challenge for bioinformatics. The integration of different data sources often helps to uncover non-obvious relationships between genes, but there are also two further benefits. First, it is likely that whenever information from multiple independent sources agrees, it should be more valid and reliable. Secondly, by looking at the union of multiple sources, one can cover larger parts of the genome. This is obvious for integrating results from multiple single gene or protein experiments, but also necessary for many of the results from genome-wide experiments since they are often confined to certain (although sizable) subsets of the genome. In this paper, we explore an example of such a data integration procedure. We focus on the prediction of membership in protein complexes for individual genes. For this, we recruit six different data sources that include expression profiles, interaction data, essentiality and localization information. Each of these data sources individually contains some weakly predictive information with respect to protein complexes, but we show how this prediction can be improved by combining all of them. Supplementary information is available at http:// bioinfo.mbb.yale.edu/integrate/interactions/.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 12836664     DOI: 10.1023/a:1020495201615

Source DB:  PubMed          Journal:  J Struct Funct Genomics        ISSN: 1345-711X


  35 in total

1.  Clustering gene expression patterns.

Authors:  A Ben-Dor; R Shamir; Z Yakhini
Journal:  J Comput Biol       Date:  1999 Fall-Winter       Impact factor: 1.479

Review 2.  Exploring expression data: identification and analysis of coexpressed genes.

Authors:  L J Heyer; S Kruglyak; S Yooseph
Journal:  Genome Res       Date:  1999-11       Impact factor: 9.043

3.  Analysis of the yeast transcriptome with structural and functional categories: characterizing highly expressed proteins.

Authors:  R Jansen; M Gerstein
Journal:  Nucleic Acids Res       Date:  2000-03-15       Impact factor: 16.971

4.  Regulatory element detection using correlation with expression.

Authors:  H J Bussemaker; H Li; E D Siggia
Journal:  Nat Genet       Date:  2001-02       Impact factor: 38.330

5.  Functional discovery via a compendium of expression profiles.

Authors:  T R Hughes; M J Marton; A R Jones; C J Roberts; R Stoughton; C D Armour; H A Bennett; E Coffey; H Dai; Y D He; M J Kidd; A M King; M R Meyer; D Slade; P Y Lum; S B Stepaniants; D D Shoemaker; D Gachotte; K Chakraburtty; J Simon; M Bard; S H Friend
Journal:  Cell       Date:  2000-07-07       Impact factor: 41.582

6.  A Bayesian system integrating expression data with sequence patterns for localizing proteins: comprehensive application to the yeast genome.

Authors:  A Drawid; M Gerstein
Journal:  J Mol Biol       Date:  2000-08-25       Impact factor: 5.469

7.  A DNA microarray system for analyzing complex DNA samples using two-color fluorescent probe hybridization.

Authors:  D Shalon; S J Smith; P O Brown
Journal:  Genome Res       Date:  1996-07       Impact factor: 9.043

8.  Global analysis of protein activities using proteome chips.

Authors:  H Zhu; M Bilgin; R Bangham; D Hall; A Casamayor; P Bertone; N Lan; R Jansen; S Bidlingmaier; T Houfek; T Mitchell; P Miller; R A Dean; M Gerstein; M Snyder
Journal:  Science       Date:  2001-07-26       Impact factor: 47.728

9.  A comprehensive two-hybrid analysis to explore the yeast protein interactome.

Authors:  T Ito; T Chiba; R Ozawa; M Yoshida; M Hattori; Y Sakaki
Journal:  Proc Natl Acad Sci U S A       Date:  2001-03-13       Impact factor: 11.205

10.  The Yeast Proteome Database (YPD): a model for the organization and presentation of genome-wide functional data.

Authors:  P E Hodges; A H McKee; B P Davis; W E Payne; J I Garrels
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

View more
  23 in total

1.  TopNet: a tool for comparing biological sub-networks, correlating protein properties with topological statistics.

Authors:  Haiyuan Yu; Xiaowei Zhu; Dov Greenbaum; John Karro; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2004-01-14       Impact factor: 16.971

2.  Predicting protein complex membership using probabilistic network reliability.

Authors:  Saurabh Asthana; Oliver D King; Francis D Gibbons; Frederick P Roth
Journal:  Genome Res       Date:  2004-05-12       Impact factor: 9.043

3.  Assessing the limits of genomic data integration for predicting protein networks.

Authors:  Long J Lu; Yu Xia; Alberto Paccanaro; Haiyuan Yu; Mark Gerstein
Journal:  Genome Res       Date:  2005-07       Impact factor: 9.043

4.  Genomic analysis of the hierarchical structure of regulatory networks.

Authors:  Haiyuan Yu; Mark Gerstein
Journal:  Proc Natl Acad Sci U S A       Date:  2006-09-26       Impact factor: 11.205

5.  Computational approaches for predicting protein-protein interactions: a survey.

Authors:  Jingkai Yu; Farshad Fotouhi
Journal:  J Med Syst       Date:  2006-02       Impact factor: 4.460

6.  Incorporating Ontology-Driven Similarity Knowledge into Functional Genomics: An Exploratory Study.

Authors:  Francisco Azuaje; Olivier Bodenreider
Journal:  BIBE 2004       Date:  2004-05

7.  Bayesian data integration and variable selection for pan-cancer survival prediction using protein expression data.

Authors:  Arnab Kumar Maity; Anirban Bhattacharya; Bani K Mallick; Veerabhadran Baladandayuthapani
Journal:  Biometrics       Date:  2019-10-03       Impact factor: 2.571

8.  Data integration in genetics and genomics: methods and challenges.

Authors:  Jemila S Hamid; Pingzhao Hu; Nicole M Roslin; Vicki Ling; Celia M T Greenwood; Joseph Beyene
Journal:  Hum Genomics Proteomics       Date:  2009-01-12

9.  Detection of mammalian virulence determinants in highly pathogenic avian influenza H5N1 viruses: multivariate analysis of published data.

Authors:  S J Lycett; M J Ward; F I Lewis; A F Y Poon; S L Kosakovsky Pond; A J Leigh Brown
Journal:  J Virol       Date:  2009-07-22       Impact factor: 5.103

10.  Spectral affinity in protein networks.

Authors:  Konstantin Voevodski; Shang-Hua Teng; Yu Xia
Journal:  BMC Syst Biol       Date:  2009-11-29
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.