Literature DB >> 16646829

Incorporating biological information as a prior in an empirical bayes approach to analyzing microarray data.

Wei Pan1.   

Abstract

Currently the practice of using existing biological knowledge in analyzing high throughput genomic and proteomic data is mainly for the purpose of validations. Here we take a different approach of incorporating biological knowledge into statistical analysis to improve statistical power and efficiency. Specifically, we consider how to fuse biological information into a mixture model to analyze microarray data. In contrast to a standard mixture model where it is assumed that all the genes come from the same (marginal) distribution, including an equal prior probability of having an event, such as having differential expression or being bound by a transcription factor (TF), our proposed mixture model allows the genes in different groups to have different distributions while the grouping of the genes reflects biological information. Using a list of about 800 putative cell cycle-regulated genes as prior biological knowledge, we analyze a genome-wide location data to detect binding sites of TF Fkh1. We find that our proposal improves over the standard approach, resulting in reduced false discovery rates (FDR), and hence it is a useful alternative to the current practice.

Entities:  

Year:  2005        PMID: 16646829     DOI: 10.2202/1544-6115.1124

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  7 in total

1.  Global analysis of exon creation versus loss and the role of alternative splicing in 17 vertebrate genomes.

Authors:  Alexander V Alekseyenko; Namshin Kim; Christopher J Lee
Journal:  RNA       Date:  2007-03-16       Impact factor: 4.942

Review 2.  Statistical methods for integrating multiple types of high-throughput data.

Authors:  Yang Xie; Chul Ahn
Journal:  Methods Mol Biol       Date:  2010

3.  Methodological Issues in Multistage Genome-wide Association Studies.

Authors:  Duncan C Thomas; Graham Casey; David V Conti; Robert W Haile; Juan Pablo Lewinger; Daniel O Stram
Journal:  Stat Sci       Date:  2009-11-01       Impact factor: 2.901

Review 4.  Use of pathway information in molecular epidemiology.

Authors:  Duncan C Thomas; David V Conti; James Baurley; Frederik Nijhout; Michael Reed; Cornelia M Ulrich
Journal:  Hum Genomics       Date:  2009-10       Impact factor: 4.639

5.  Batch correction of microarray data substantially improves the identification of genes differentially expressed in rheumatoid arthritis and osteoarthritis.

Authors:  Peter Kupfer; Reinhard Guthke; Dirk Pohlers; Rene Huber; Dirk Koczan; Raimund W Kinne
Journal:  BMC Med Genomics       Date:  2012-06-08       Impact factor: 3.063

6.  Novel application of multi-stimuli network inference to synovial fibroblasts of rheumatoid arthritis patients.

Authors:  Peter Kupfer; René Huber; Michael Weber; Sebastian Vlaic; Thomas Häupl; Dirk Koczan; Reinhard Guthke; Raimund W Kinne
Journal:  BMC Med Genomics       Date:  2014-07-03       Impact factor: 3.063

7.  A novel method incorporating gene ontology information for unsupervised clustering and feature selection.

Authors:  Shireesh Srivastava; Linxia Zhang; Rong Jin; Christina Chan
Journal:  PLoS One       Date:  2008-12-04       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.