MOTIVATION: Phylogenetic shadowing is a comparative genomics principle that allows for the discovery of conserved regions in sequences from multiple closely related organisms. We develop a formal probabilistic framework for combining phylogenetic shadowing with feature-based functional annotation methods. The resulting model, a generalized hidden Markov phylogeny (GHMP), applies to a variety of situations where functional regions are to be inferred from evolutionary constraints. RESULTS: We show how GHMPs can be used to predict complete shared gene structures in multiple primate sequences. We also describe shadower, our implementation of such a prediction system. We find that shadower outperforms previously reported ab initio gene finders, including comparative human-mouse approaches, on a small sample of diverse exonic regions. Finally, we report on an empirical analysis of shadower's performance which reveals that as few as five well-chosen species may suffice to attain maximal sensitivity and specificity in exon demarcation. AVAILABILITY: A Web server is available at http://bonaire.lbl.gov/shadower
MOTIVATION: Phylogenetic shadowing is a comparative genomics principle that allows for the discovery of conserved regions in sequences from multiple closely related organisms. We develop a formal probabilistic framework for combining phylogenetic shadowing with feature-based functional annotation methods. The resulting model, a generalized hidden Markov phylogeny (GHMP), applies to a variety of situations where functional regions are to be inferred from evolutionary constraints. RESULTS: We show how GHMPs can be used to predict complete shared gene structures in multiple primate sequences. We also describe shadower, our implementation of such a prediction system. We find that shadower outperforms previously reported ab initio gene finders, including comparative human-mouse approaches, on a small sample of diverse exonic regions. Finally, we report on an empirical analysis of shadower's performance which reveals that as few as five well-chosen species may suffice to attain maximal sensitivity and specificity in exon demarcation. AVAILABILITY: A Web server is available at http://bonaire.lbl.gov/shadower
Authors: Michael C Wu; Peter Kraft; Michael P Epstein; Deanne M Taylor; Stephen J Chanock; David J Hunter; Xihong Lin Journal: Am J Hum Genet Date: 2010-06-11 Impact factor: 11.025
Authors: David DeCaprio; Jade P Vinson; Matthew D Pearson; Philip Montgomery; Matthew Doherty; James E Galagan Journal: Genome Res Date: 2007-08-09 Impact factor: 9.043
Authors: David C King; James Taylor; Laura Elnitski; Francesca Chiaromonte; Webb Miller; Ross C Hardison Journal: Genome Res Date: 2005-07-15 Impact factor: 9.043
Authors: Dario Boffelli; Claire V Weer; Li Weng; Keith D Lewis; Malak I Shoukry; Lior Pachter; David N Keys; Edward M Rubin Journal: Genome Res Date: 2004-11-15 Impact factor: 9.043
Authors: Kemal Sonmez; Naunihal T Zaveri; Ilan A Kerman; Sharon Burke; Charles R Neal; Xinmin Xie; Stanley J Watson; Lawrence Toll Journal: PLoS Comput Biol Date: 2009-01-09 Impact factor: 4.475