Literature DB >> 16597247

Using multiple alignments to improve gene prediction.

Samuel S Gross1, Michael R Brent.   

Abstract

The multiple species de novo gene prediction problem can be stated as follows: given an alignment of genomic sequences from two or more organisms, predict the location and structure of all protein-coding genes in one or more of the sequences. Here, we present a new system, N-SCAN (a.k.a. TWINSCAN 3.0), for addressing this problem. N-SCAN can model the phylogenetic relationships between the aligned genome sequences, context dependent substitution rates, and insertions and deletions. An implementation of N-SCAN was created and used to generate predictions for the entire human genome and the genome of the fruit fly Drosophila melanogaster. Analyses of the predictions reveal that N-SCAN's accuracy in both human and fly exceeds that of all previously published whole-genome de novo gene predictors.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16597247     DOI: 10.1089/cmb.2006.13.379

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  60 in total

1.  DNA-energetics-based analyses suggest additional genes in prokaryotes.

Authors:  Garima Khandelwal; Jalaj Gupta; B Jayaram
Journal:  J Biosci       Date:  2012-07       Impact factor: 1.826

2.  Comparative assessment of methods for aligning multiple genome sequences.

Authors:  Xiaoyu Chen; Martin Tompa
Journal:  Nat Biotechnol       Date:  2010-05-23       Impact factor: 54.908

3.  Extraordinary sequence divergence at Tsga8, an X-linked gene involved in mouse spermiogenesis.

Authors:  Jeffrey M Good; Dan Vanderpool; Kimberly L Smith; Michael W Nachman
Journal:  Mol Biol Evol       Date:  2010-12-24       Impact factor: 16.240

4.  Iterative gene prediction and pseudogene removal improves genome annotation.

Authors:  Marijke J van Baren; Michael R Brent
Journal:  Genome Res       Date:  2006-05       Impact factor: 9.043

5.  ESPERR: learning strong and weak signals in genomic sequence alignments to identify functional elements.

Authors:  James Taylor; Svitlana Tyekucheva; David C King; Ross C Hardison; Webb Miller; Francesca Chiaromonte
Journal:  Genome Res       Date:  2006-10-19       Impact factor: 9.043

6.  Generic eukaryotic core promoter prediction using structural features of DNA.

Authors:  Thomas Abeel; Yvan Saeys; Eric Bonnet; Pierre Rouzé; Yves Van de Peer
Journal:  Genome Res       Date:  2007-12-20       Impact factor: 9.043

7.  Conrad: gene prediction using conditional random fields.

Authors:  David DeCaprio; Jade P Vinson; Matthew D Pearson; Philip Montgomery; Matthew Doherty; James E Galagan
Journal:  Genome Res       Date:  2007-08-09       Impact factor: 9.043

8.  Realistic artificial DNA sequences as negative controls for computational genomics.

Authors:  Juan Caballero; Arian F A Smit; Leroy Hood; Gustavo Glusman
Journal:  Nucleic Acids Res       Date:  2014-05-06       Impact factor: 16.971

9.  Targeted discovery of novel human exons by comparative genomics.

Authors:  Adam Siepel; Mark Diekhans; Brona Brejová; Laura Langton; Michael Stevens; Charles L G Comstock; Colleen Davis; Brent Ewing; Shelly Oommen; Christopher Lau; Hung-Chun Yu; Jianfeng Li; Bruce A Roe; Phil Green; Daniela S Gerhard; Gary Temple; David Haussler; Michael R Brent
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

10.  28-way vertebrate alignment and conservation track in the UCSC Genome Browser.

Authors:  Webb Miller; Kate Rosenbloom; Ross C Hardison; Minmei Hou; James Taylor; Brian Raney; Richard Burhans; David C King; Robert Baertsch; Daniel Blankenberg; Sergei L Kosakovsky Pond; Anton Nekrutenko; Belinda Giardine; Robert S Harris; Svitlana Tyekucheva; Mark Diekhans; Thomas H Pringle; William J Murphy; Arthur Lesk; George M Weinstock; Kerstin Lindblad-Toh; Richard A Gibbs; Eric S Lander; Adam Siepel; David Haussler; W James Kent
Journal:  Genome Res       Date:  2007-11-05       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.