Literature DB >> 11473015

Rich probabilistic models for gene expression.

E Segal1, B Taskar, A Gasch, N Friedman, D Koller.   

Abstract

Clustering is commonly used for analyzing gene expression data. Despite their successes, clustering methods suffer from a number of limitations. First, these methods reveal similarities that exist over all of the measurements, while obscuring relationships that exist over only a subset of the data. Second, clustering methods cannot readily incorporate additional types of information, such as clinical data or known attributes of genes. To circumvent these shortcomings, we propose the use of a single coherent probabilistic model, that encompasses much of the rich structure in the genomic expression data, while incorporating additional information such as experiment type, putative binding sites, or functional information. We show how this model can be learned from the data, allowing us to discover patterns in the data and dependencies between the gene expression patterns and additional attributes. The learned model reveals context-specific relationships, that exist only over a subset of the experiments in the dataset. We demonstrate the power of our approach on synthetic data and on two real-world gene expression data sets for yeast. For example, we demonstrate a novel functionality that falls naturally out of our framework: predicting the "cluster" of the array resulting from a gene mutation based only on the gene's expression pattern in the context of other mutations.

Entities:  

Mesh:

Year:  2001        PMID: 11473015     DOI: 10.1093/bioinformatics/17.suppl_1.s243

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  Analysis of DNA microarrays using algorithms that employ rule-based expert knowledge.

Authors:  Kuang-Hung Pan; Chih-Jian Lih; Stanley N Cohen
Journal:  Proc Natl Acad Sci U S A       Date:  2002-02-19       Impact factor: 11.205

2.  Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays.

Authors:  Kuang-Hung Pan; Chih-Jian Lih; Stanley N Cohen
Journal:  Proc Natl Acad Sci U S A       Date:  2005-06-10       Impact factor: 11.205

3.  An integrated analysis of molecular aberrations in NCI-60 cell lines.

Authors:  Chen-Hsiang Yeang
Journal:  BMC Bioinformatics       Date:  2010-10-06       Impact factor: 3.169

4.  Data mining in genomics.

Authors:  Jae K Lee; Paul D Williams; Sooyoung Cheon
Journal:  Clin Lab Med       Date:  2008-03       Impact factor: 1.935

5.  Integrated analysis of yeast regulatory sequences for biologically linked clusters of genes.

Authors:  Albin Sandelin; Annette Höglund; Boris Lenhard; Wyeth W Wasserman
Journal:  Funct Integr Genomics       Date:  2003-06-25       Impact factor: 3.410

6.  Generalized Co-Clustering Analysis via Regularized Alternating Least Squares.

Authors:  Gen Li
Journal:  Comput Stat Data Anal       Date:  2020-05-04       Impact factor: 1.681

7.  Genome-wide matching of genes to cellular roles using guilt-by-association models derived from single sample analysis.

Authors:  Jeff A Klomp; Kyle A Furge
Journal:  BMC Res Notes       Date:  2012-07-23

8.  Boolean implication networks derived from large scale, whole genome microarray datasets.

Authors:  Debashis Sahoo; David L Dill; Andrew J Gentles; Robert Tibshirani; Sylvia K Plevritis
Journal:  Genome Biol       Date:  2008-10-30       Impact factor: 13.583

9.  Bayesian inference based modelling for gene transcriptional dynamics by integrating multiple source of knowledge.

Authors:  Shu-Qiang Wang; Han-Xiong Li
Journal:  BMC Syst Biol       Date:  2012-07-16

10.  A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae).

Authors:  Olga G Troyanskaya; Kara Dolinski; Art B Owen; Russ B Altman; David Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2003-06-25       Impact factor: 12.779

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.