Literature DB >> 12538242

Gene finding with a hidden Markov model of genome structure and evolution.

Jakob Skou Pedersen1, Jotun Hein.   

Abstract

MOTIVATION: A growing number of genomes are sequenced. The differences in evolutionary pattern between functional regions can thus be observed genome-wide in a whole set of organisms. The diverse evolutionary pattern of different functional regions can be exploited in the process of genomic annotation. The modelling of evolution by the existing comparative gene finders leaves room for improvement.
RESULTS: A probabilistic model of both genome structure and evolution is designed. This type of model is called an Evolutionary Hidden Markov Model (EHMM), being composed of an HMM and a set of region-specific evolutionary models based on a phylogenetic tree. All parameters can be estimated by maximum likelihood, including the phylogenetic tree. It can handle any number of aligned genomes, using their phylogenetic tree to model the evolutionary correlations. The time complexity of all algorithms used for handling the model are linear in alignment length and genome number. The model is applied to the problem of gene finding. The benefit of modelling sequence evolution is demonstrated both in a range of simulations and on a set of orthologous human/mouse gene pairs. AVAILABILITY: Free availability over the Internet on www server: http://www.birc.dk/Software/evogene.

Entities:  

Mesh:

Year:  2003        PMID: 12538242     DOI: 10.1093/bioinformatics/19.2.219

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  25 in total

1.  GeneWise and Genomewise.

Authors:  Ewan Birney; Michele Clamp; Richard Durbin
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

2.  A comparative method for finding and folding RNA secondary structures within protein-coding regions.

Authors:  Jakob Skou Pedersen; Irmtraud Margret Meyer; Roald Forsberg; Peter Simmonds; Jotun Hein
Journal:  Nucleic Acids Res       Date:  2004-09-24       Impact factor: 16.971

3.  Conrad: gene prediction using conditional random fields.

Authors:  David DeCaprio; Jade P Vinson; Matthew D Pearson; Philip Montgomery; Matthew Doherty; James E Galagan
Journal:  Genome Res       Date:  2007-08-09       Impact factor: 9.043

4.  Complexity reduction in context-dependent DNA substitution models.

Authors:  William H Majoros; Uwe Ohler
Journal:  Bioinformatics       Date:  2008-11-18       Impact factor: 6.937

5.  Evaluation of regulatory potential and conservation scores for detecting cis-regulatory modules in aligned mammalian genome sequences.

Authors:  David C King; James Taylor; Laura Elnitski; Francesca Chiaromonte; Webb Miller; Ross C Hardison
Journal:  Genome Res       Date:  2005-07-15       Impact factor: 9.043

6.  Evolutionary modeling and prediction of non-coding RNAs in Drosophila.

Authors:  Robert K Bradley; Andrew V Uzilov; Mitchell E Skinner; Yuri R Bendaña; Lars Barquist; Ian Holmes
Journal:  PLoS One       Date:  2009-08-11       Impact factor: 3.240

7.  Identification of novel peptide hormones in the human proteome by hidden Markov model screening.

Authors:  Olivier Mirabeau; Emerald Perlas; Cinzia Severini; Enrica Audero; Olivier Gascuel; Roberta Possenti; Ewan Birney; Nadia Rosenthal; Cornelius Gross
Journal:  Genome Res       Date:  2007-02-06       Impact factor: 9.043

Review 8.  Whole-Genome Alignment and Comparative Annotation.

Authors:  Joel Armstrong; Ian T Fiddes; Mark Diekhans; Benedict Paten
Journal:  Annu Rev Anim Biosci       Date:  2018-10-31       Impact factor: 8.923

9.  Tools for simulating evolution of aligned genomic regions with integrated parameter estimation.

Authors:  Avinash Varadarajan; Robert K Bradley; Ian H Holmes
Journal:  Genome Biol       Date:  2008-10-08       Impact factor: 13.583

10.  Evolutionary sequence modeling for discovery of peptide hormones.

Authors:  Kemal Sonmez; Naunihal T Zaveri; Ilan A Kerman; Sharon Burke; Charles R Neal; Xinmin Xie; Stanley J Watson; Lawrence Toll
Journal:  PLoS Comput Biol       Date:  2009-01-09       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.