Literature DB >> 9278063

Automated gene identification in large-scale genomic sequences.

Y Xu1, E C Uberbacher.   

Abstract

Computational methods for gene identification in genomic sequences typically have two phases: coding region recognition and gene parsing. While there are a number of effective methods for recognizing coding regions (exons), parsing the recognized exons into proper gene structures, to a large extent, remains an unsolved problem. We have developed a computer program which can automatically parse the recognized exons into gene models that are most consistent with the available Expressed Sequence Tags (ESTs) and a set of biological heuristics, derived empirically. The gene modeling algorithm used in this program provides a general framework for applying EST information so the modeling accuracy improves as the amount of available EST information increases. Based on preliminary tests on a number of large DNA sequences, using the dbEST database, we have observed that the algorithm can (1) accurately model complicated multiple gene structures, including embedded genes, (2) identify falsely-recognized exons and locate missed exons by the initial exon recognition phase, and (3) make more accurate exon boundary predictions, if the necessary EST information is available. We have extended this EST-based gene modeling algorithm to model genes on unfinished DNA contigs at the end of the shotgun sequencing. This extended version can automatically determine the orientations and the relative order of the DNA contigs (with gaps between them) using the available ESTs as reference models, before the gene modeling phase.

Mesh:

Year:  1997        PMID: 9278063     DOI: 10.1089/cmb.1997.4.325

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  24 in total

1.  Ab initio gene identification: prokaryote genome annotation with GeneScan and GLIMMER.

Authors:  Gautam Aggarwal; Ramakrishna Ramaswamy
Journal:  J Biosci       Date:  2002-02       Impact factor: 1.826

2.  Computational inference of homologous gene structures in the human genome.

Authors:  R F Yeh; L P Lim; C B Burge
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

3.  The transcriptional activity of human Chromosome 22.

Authors:  John L Rinn; Ghia Euskirchen; Paul Bertone; Rebecca Martone; Nicholas M Luscombe; Stephen Hartman; Paul M Harrison; F Kenneth Nelson; Perry Miller; Mark Gerstein; Sherman Weissman; Michael Snyder
Journal:  Genes Dev       Date:  2003-02-15       Impact factor: 11.361

4.  Selecting open reading frames from DNA.

Authors:  Paola Zacchi; Daniele Sblattero; Fiorella Florian; Roberto Marzari; Andrew R M Bradbury
Journal:  Genome Res       Date:  2003-05       Impact factor: 9.043

Review 5.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

6.  Functional characterization of the human translocator protein (18kDa) gene promoter in human breast cancer cell lines.

Authors:  Amani Batarseh; Keith D Barlow; Daniel B Martinez-Arguelles; Vassilios Papadopoulos
Journal:  Biochim Biophys Acta       Date:  2011-09-18

7.  Analysis of a marsupial MHC region containing two recently duplicated class I loci.

Authors:  Katarzyna B Miska; April M Wright; Rachel Lundgren; Robert Sasaki-McClees; Amy Osterman; James M Gale; Robert D Miller
Journal:  Mamm Genome       Date:  2004-10       Impact factor: 2.957

8.  Evaluation of five ab initio gene prediction programs for the discovery of maize genes.

Authors:  Hong Yao; Ling Guo; Yan Fu; Lisa A Borsuk; Tsui-Jung Wen; David S Skibbe; Xiangqin Cui; Brian E Scheffler; Jun Cao; Scott J Emrich; Daniel A Ashlock; Patrick S Schnable
Journal:  Plant Mol Biol       Date:  2005-02       Impact factor: 4.076

9.  A computer program for aligning a cDNA sequence with a genomic DNA sequence.

Authors:  L Florea; G Hartzell; Z Zhang; G M Rubin; W Miller
Journal:  Genome Res       Date:  1998-09       Impact factor: 9.043

10.  Improved genome assembly and evidence-based global gene model set for the chordate Ciona intestinalis: new insight into intron and operon populations.

Authors:  Yutaka Satou; Katsuhiko Mineta; Michio Ogasawara; Yasunori Sasakura; Eiichi Shoguchi; Keisuke Ueno; Lixy Yamada; Jun Matsumoto; Jessica Wasserscheid; Ken Dewar; Graham B Wiley; Simone L Macmil; Bruce A Roe; Robert W Zeller; Kenneth E M Hastings; Patrick Lemaire; Erika Lindquist; Toshinori Endo; Kohji Hotta; Kazuo Inaba
Journal:  Genome Biol       Date:  2008-10-14       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.