Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Genrate: a generative model that finds and scores new genes and exons in genomic microarray data.

Literature DB >> 15759654

Genrate: a generative model that finds and scores new genes and exons in genomic microarray data.

Brendan J Frey¹, Quaid D Morris, Wen Zhang, Naveed Mohammad, Timothy R Hughes.

Abstract

Recently, researchers have made some progress in using microarrays to validate predicted exons in genome sequence and find new gene structures. However, current methods rely on separately making threshold-based decisions on intensity of expression, similarity of expression profiles, and arrangements of exons in the genome. We have taken a Bayesian approach and developed GenRate, a generative model that accounts for both genome-wide expression data taken from multiple conditions (e.g. tissues) and co-location and density of probes in DNA sequence data. GenRate balances probabilistic evidence derived from different sources and outputs scores (log-likelihoods) for each gene model, enabling the estimation of false-positive and false-negative rates. The model has a number of local minima that is exponential in the length of the DNA sequence data, so direct application of the EM learning algorithm produces poor results. We describe a novel way of parameterizing the model using examples from the data set, so that good solutions are found using an efficient algorithm. We apply GenRate to a subset of mouse genome-wide expression data that we have created, and discuss the statistical significance of the genes found by GenRate. Three of the highest-ranking gene structures found by GenRate, each containing thousands of bases from the genome, are confirmed using RT-PCR experiments.

Entities: Species

Mesh：

Year: 2005 PMID： 15759654

Source DB: PubMed Journal: Pac Symp Biocomput ISSN： 2335-6928

Keyword Cloud
Cited

1 in total

1. Detecting transcriptionally active regions using genomic tiling arrays.

Authors: Gabor Halasz; Marinus F van Batenburg; Joelle Perusse; Sujun Hua; Xiang-Jun Lu; Kevin P White; Harmen J Bussemaker
Journal: Genome Biol Date: 2006 Impact factor: 13.583

1 in total