Literature DB >> 12826619

A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae).

Olga G Troyanskaya1, Kara Dolinski, Art B Owen, Russ B Altman, David Botstein.   

Abstract

Genomic sequencing is no longer a novelty, but gene function annotation remains a key challenge in modern biology. A variety of functional genomics experimental techniques are available, from classic methods such as affinity precipitation to advanced high-throughput techniques such as gene expression microarrays. In the future, more disparate methods will be developed, further increasing the need for integrated computational analysis of data generated by these studies. We address this problem with MAGIC (Multisource Association of Genes by Integration of Clusters), a general framework that uses formal Bayesian reasoning to integrate heterogeneous types of high-throughput biological data (such as large-scale two-hybrid screens and multiple microarray analyses) for accurate gene function prediction. The system formally incorporates expert knowledge about relative accuracies of data sources to combine them within a normative framework. MAGIC provides a belief level with its output that allows the user to vary the stringency of predictions. We applied MAGIC to Saccharomyces cerevisiae genetic and physical interactions, microarray, and transcription factor binding sites data and assessed the biological relevance of gene groupings using Gene Ontology annotations produced by the Saccharomyces Genome Database. We found that by creating functional groupings based on heterogeneous data types, MAGIC improved accuracy of the groupings compared with microarray analysis alone. We describe several of the biological gene groupings identified.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12826619      PMCID: PMC166232          DOI: 10.1073/pnas.0832373100

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   12.779


  26 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  Detecting protein function and protein-protein interactions from genome sequences.

Authors:  E M Marcotte; M Pellegrini; H L Ng; D W Rice; T O Yeates; D Eisenberg
Journal:  Science       Date:  1999-07-30       Impact factor: 47.728

3.  Affinity precipitation of enzymes.

Authors:  P O Larsson; K Mosbach
Journal:  FEBS Lett       Date:  1979-02-15       Impact factor: 4.124

Review 4.  Protein functions in pre-mRNA splicing.

Authors:  C L Will; R Lührmann
Journal:  Curr Opin Cell Biol       Date:  1997-06       Impact factor: 8.382

5.  Cef1p is a component of the Prp19p-associated complex and essential for pre-mRNA splicing.

Authors:  W Y Tsai; Y T Chow; H R Chen; K T Huang; R I Hong; S P Jan; N Y Kuo; T Y Tsao; C H Chen; S C Cheng
Journal:  J Biol Chem       Date:  1999-04-02       Impact factor: 5.157

6.  A combined algorithm for genome-wide prediction of protein function.

Authors:  E M Marcotte; M Pellegrini; M J Thompson; T O Yeates; D Eisenberg
Journal:  Nature       Date:  1999-11-04       Impact factor: 49.962

7.  Quantitative monitoring of gene expression patterns with a complementary DNA microarray.

Authors:  M Schena; D Shalon; R W Davis; P O Brown
Journal:  Science       Date:  1995-10-20       Impact factor: 47.728

8.  SCPD: a promoter database of the yeast Saccharomyces cerevisiae.

Authors:  J Zhu; M Q Zhang
Journal:  Bioinformatics       Date:  1999 Jul-Aug       Impact factor: 6.937

9.  Use of a screen for synthetic lethal and multicopy suppressee mutants to identify two new genes involved in morphogenesis in Saccharomyces cerevisiae.

Authors:  A Bender; J R Pringle
Journal:  Mol Cell Biol       Date:  1991-03       Impact factor: 4.272

10.  Suppressors of yeast actin mutations.

Authors:  P Novick; B C Osmond; D Botstein
Journal:  Genetics       Date:  1989-04       Impact factor: 4.562

View more
  187 in total

1.  Real-time ligand binding pocket database search using local surface descriptors.

Authors:  Rayan Chikhi; Lee Sael; Daisuke Kihara
Journal:  Proteins       Date:  2010-07

2.  Predicting genetic modifier loci using functional gene networks.

Authors:  Insuk Lee; Ben Lehner; Tanya Vavouri; Junha Shin; Andrew G Fraser; Edward M Marcotte
Journal:  Genome Res       Date:  2010-06-09       Impact factor: 9.043

3.  GENEVESTIGATOR. Arabidopsis microarray database and analysis toolbox.

Authors:  Philip Zimmermann; Matthias Hirsch-Hoffmann; Lars Hennig; Wilhelm Gruissem
Journal:  Plant Physiol       Date:  2004-09       Impact factor: 8.340

4.  Whole-genome annotation by using evidence integration in functional-linkage networks.

Authors:  Ulas Karaoz; T M Murali; Stan Letovsky; Yu Zheng; Chunming Ding; Charles R Cantor; Simon Kasif
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-23       Impact factor: 11.205

5.  Revealing modularity and organization in the yeast molecular network by integrated analysis of highly heterogeneous genomewide data.

Authors:  Amos Tanay; Roded Sharan; Martin Kupiec; Ron Shamir
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-18       Impact factor: 11.205

Review 6.  Computational tools for prioritizing candidate genes: boosting disease gene discovery.

Authors:  Yves Moreau; Léon-Charles Tranchevent
Journal:  Nat Rev Genet       Date:  2012-07-03       Impact factor: 53.242

Review 7.  Methods for biological data integration: perspectives and challenges.

Authors:  Vladimir Gligorijević; Nataša Pržulj
Journal:  J R Soc Interface       Date:  2015-11-06       Impact factor: 4.118

8.  A proteogenomic approach to understand splice isoform functions through sequence and expression-based computational modeling.

Authors:  Hong-Dong Li; Gilbert S Omenn; Yuanfang Guan
Journal:  Brief Bioinform       Date:  2016-01-06       Impact factor: 11.622

9.  MULTI-WAY BLOCKMODELS FOR ANALYZING COORDINATED HIGH-DIMENSIONAL RESPONSES.

Authors:  Edoardo M Airoldi; Xiaopei Wang; Xiaodong Lin
Journal:  Ann Appl Stat       Date:  2013-12-23       Impact factor: 2.083

Review 10.  Sequencing and beyond: integrating molecular 'omics' for microbial community profiling.

Authors:  Eric A Franzosa; Tiffany Hsu; Alexandra Sirota-Madi; Afrah Shafquat; Galeb Abu-Ali; Xochitl C Morgan; Curtis Huttenhower
Journal:  Nat Rev Microbiol       Date:  2015-04-27       Impact factor: 60.633

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.