A tool for analyzing and annotating genomic sequences.

X Huang, M D Adams, H Zhou, A R Kerlavage.

Abstract

We describe a tool for analyzing and annotating large genomic sequences containing introns. The analysis and annotation tool (AAT) includes two sets of programs, one for comparing the query sequence with a protein database and the other for comparing the query with a cDNA database. Each set contains a fast database search program and a rigorous alignment program. The database search program quickly identifies regions of the query sequence that are similar to a database sequence. Then the alignment program constructs an optimal alignment for each region and the database sequence. The alignment program also reports the coordinates of exons in the query sequence. Pairwise alignments of the query sequence with protein and cDNA database sequences are combined into multiple sequence alignments, which provide a view of all protein and cDNA sequences matching a query region. On a data set of 570 DNA sequences, AAT identified 94% of coding nucleotides correctly and 74% of exons exactly. Results of analyzing a human BAC sequence with the AAT tool are also presented. The AAT tool reduces the labor-intensive work of locating the exons of the query sequence and improves the process of defining intron-exon boundaries by using the wealth of available protein and cDNA data.
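The two accuracy figures in the abstract (coding nucleotides identified correctly, exons identified exactly) can be illustrated with a small sketch. The abstract does not give the exact metric definitions, so this example assumes nucleotide-level sensitivity (the fraction of annotated coding positions covered by predicted exons) and an exon-level fraction that requires both boundaries to match. The function names and the 1-based inclusive coordinate convention are hypothetical and are not taken from AAT.

```python
from typing import List, Set, Tuple

# Assumption: an exon is a 1-based, inclusive (start, end) pair on the query.
Exon = Tuple[int, int]


def coding_positions(exons: List[Exon]) -> Set[int]:
    """Return the set of query positions covered by the given exons."""
    positions: Set[int] = set()
    for start, end in exons:
        positions.update(range(start, end + 1))
    return positions


def nucleotide_sensitivity(true_exons: List[Exon], predicted_exons: List[Exon]) -> float:
    """Fraction of annotated coding positions that are also predicted as coding."""
    true_pos = coding_positions(true_exons)
    if not true_pos:
        return 0.0
    pred_pos = coding_positions(predicted_exons)
    return len(true_pos & pred_pos) / len(true_pos)


def exact_exon_fraction(true_exons: List[Exon], predicted_exons: List[Exon]) -> float:
    """Fraction of annotated exons predicted with both boundaries exactly right."""
    if not true_exons:
        return 0.0
    predicted = set(predicted_exons)
    return sum(1 for exon in true_exons if exon in predicted) / len(true_exons)


# Toy data, invented for illustration only.
annotated = [(100, 200), (300, 400)]
predicted = [(100, 200), (310, 400)]
print(nucleotide_sensitivity(annotated, predicted))
print(exact_exon_fraction(annotated, predicted))
```

In this toy case the second exon is mostly covered but its start is off by ten bases, so it contributes to the nucleotide-level score and not to the exon-level one. That gap is one plausible reason a nucleotide-level figure can sit well above an exact-exon figure.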

Year:  1997        PMID: 9403056     DOI: 10.1006/geno.1997.4984

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  81 in total

1.  Colinearity and its exceptions in orthologous adh regions of maize and sorghum.

Authors:  A P Tikhonov; P J SanMiguel; Y Nakajima; N M Gorenstein; J L Bennetzen; Z Avramova
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-22       Impact factor: 11.205

2.  An assessment of gene prediction accuracy in large DNA sequences.

Authors:  R Guigó; P Agarwal; J F Abril; M Burset; J W Fickett
Journal:  Genome Res       Date:  2000-10       Impact factor: 9.043

3.  Exploration of novel motifs derived from mouse cDNA sequences.

Authors:  Hideya Kawaji; Christian Schönbach; Yo Matsuo; Jun Kawai; Yasushi Okazaki; Yoshihide Hayashizaki; Hideo Matsuda
Journal:  Genome Res       Date:  2002-03       Impact factor: 9.043

4.  Evaluation of gene-finding programs on mammalian sequences.

Authors:  S Rogic; A K Mackworth; F B Ouellette
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

5.  An optimized protocol for analysis of EST sequences.

Authors:  F Liang; I Holt; G Pertea; S Karamycheva; S L Salzberg; J Quackenbush
Journal:  Nucleic Acids Res       Date:  2000-09-15       Impact factor: 16.971

6.  Cloning and sequencing of cDNAs for hypothetical genes from chromosome 2 of Arabidopsis.

Authors:  Yong-Li Xiao; Mukesh Malik; Catherine A Whitelaw; Christopher D Town
Journal:  Plant Physiol       Date:  2002-12       Impact factor: 8.340

7.  Comparative genomics of Brassica oleracea and Arabidopsis thaliana reveal gene loss, fragmentation, and dispersal after polyploidy.

Authors:  Christopher D Town; Foo Cheung; Rama Maiti; Jonathan Crabtree; Brian J Haas; Jennifer R Wortman; Erin E Hine; Ryan Althoff; Tamara S Arbogast; Luke J Tallon; Marielle Vigouroux; Martin Trick; Ian Bancroft
Journal:  Plant Cell       Date:  2006-04-21       Impact factor: 11.277

8.  A unique set of 11,008 onion expressed sequence tags reveals expressed sequence and genomic differences between the monocot orders Asparagales and Poales.

Authors:  Joseph C Kuhl; Foo Cheung; Qiaoping Yuan; William Martin; Yayeh Zewdie; John McCallum; Andrew Catanach; Paul Rutherford; Kenneth C Sink; Maria Jenderek; James P Prince; Christopher D Town; Michael J Havey
Journal:  Plant Cell       Date:  2003-12-11       Impact factor: 11.277

9.  Computational gene prediction using multiple sources of evidence.

Authors:  Jonathan E Allen; Mihaela Pertea; Steven L Salzberg
Journal:  Genome Res       Date:  2004-01       Impact factor: 9.043

10.  A complexity reduction algorithm for analysis and annotation of large genomic sequences.

Authors:  Trees-Juen Chuang; Wen-Chang Lin; Hurng-Chun Lee; Chi-Wei Wang; Keh-Lin Hsiao; Zi-Hao Wang; Danny Shieh; Simon C Lin; Lan-Yang Ch'ang
Journal:  Genome Res       Date:  2003-02       Impact factor: 9.043
