Literature DB >> 14988105

Multiple-sequence functional annotation and the generalized hidden Markov phylogeny.

Jon D McAuliffe1, Lior Pachter, Michael I Jordan.   

Abstract

MOTIVATION: Phylogenetic shadowing is a comparative genomics principle that allows for the discovery of conserved regions in sequences from multiple closely related organisms. We develop a formal probabilistic framework for combining phylogenetic shadowing with feature-based functional annotation methods. The resulting model, a generalized hidden Markov phylogeny (GHMP), applies to a variety of situations where functional regions are to be inferred from evolutionary constraints.
RESULTS: We show how GHMPs can be used to predict complete shared gene structures in multiple primate sequences. We also describe shadower, our implementation of such a prediction system. We find that shadower outperforms previously reported ab initio gene finders, including comparative human-mouse approaches, on a small sample of diverse exonic regions. Finally, we report on an empirical analysis of shadower's performance which reveals that as few as five well-chosen species may suffice to attain maximal sensitivity and specificity in exon demarcation. AVAILABILITY: A Web server is available at http://bonaire.lbl.gov/shadower

Entities:  

Mesh:

Year:  2004        PMID: 14988105     DOI: 10.1093/bioinformatics/bth153

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  Powerful SNP-set analysis for case-control genome-wide association studies.

Authors:  Michael C Wu; Peter Kraft; Michael P Epstein; Deanne M Taylor; Stephen J Chanock; David J Hunter; Xihong Lin
Journal:  Am J Hum Genet       Date:  2010-06-11       Impact factor: 11.025

2.  Begin at the beginning: predicting genes with 5' UTRs.

Authors:  Randall H Brown; Samuel S Gross; Michael R Brent
Journal:  Genome Res       Date:  2005-05       Impact factor: 9.043

3.  Subtree power analysis and species selection for comparative genomics.

Authors:  Jon D McAuliffe; Michael I Jordan; Lior Pachter
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-23       Impact factor: 11.205

4.  Conrad: gene prediction using conditional random fields.

Authors:  David DeCaprio; Jade P Vinson; Matthew D Pearson; Philip Montgomery; Matthew Doherty; James E Galagan
Journal:  Genome Res       Date:  2007-08-09       Impact factor: 9.043

5.  Complexity reduction in context-dependent DNA substitution models.

Authors:  William H Majoros; Uwe Ohler
Journal:  Bioinformatics       Date:  2008-11-18       Impact factor: 6.937

6.  Reference based annotation with GeneMapper.

Authors:  Sourav Chatterji; Lior Pachter
Journal:  Genome Biol       Date:  2006-04-05       Impact factor: 13.583

7.  Evaluation of regulatory potential and conservation scores for detecting cis-regulatory modules in aligned mammalian genome sequences.

Authors:  David C King; James Taylor; Laura Elnitski; Francesca Chiaromonte; Webb Miller; Ross C Hardison
Journal:  Genome Res       Date:  2005-07-15       Impact factor: 9.043

8.  Intraspecies sequence comparisons for annotating genomes.

Authors:  Dario Boffelli; Claire V Weer; Li Weng; Keith D Lewis; Malak I Shoukry; Lior Pachter; David N Keys; Edward M Rubin
Journal:  Genome Res       Date:  2004-11-15       Impact factor: 9.043

9.  Reranking candidate gene models with cross-species comparison for improved gene prediction.

Authors:  Qian Liu; Koby Crammer; Fernando C N Pereira; David S Roos
Journal:  BMC Bioinformatics       Date:  2008-10-14       Impact factor: 3.169

10.  Evolutionary sequence modeling for discovery of peptide hormones.

Authors:  Kemal Sonmez; Naunihal T Zaveri; Ilan A Kerman; Sharon Burke; Charles R Neal; Xinmin Xie; Stanley J Watson; Lawrence Toll
Journal:  PLoS Comput Biol       Date:  2009-01-09       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.