Literature DB >> 9548972

Analysis of EST-driven gene annotation in human genomic sequence.

L C Bailey1, D B Searls, G C Overton.   

Abstract

We have performed a systematic analysis of gene identification in genomic sequence by similarity search against expressed sequence tags (ESTs) to assess the suitability of this method for automated annotation of the human genome. A BLAST-based strategy was constructed to examine the potential of this approach, and was applied to test sets containing all human genomic sequences longer than 5 kb in public databases, plus 300 kb of exhaustively characterized benchmark sequence. At high stringency, 70%-90% of all annotated genes are detected by near-identity to EST sequence; >95% of ESTs aligning with well-annotated sequences overlap a gene. These ESTs provide immediate access to the corresponding cDNA clones for follow-up laboratory verification and subsequent biologic analysis. At lower stringency, up to 97% of annotated genes were identified by similarity to ESTs. The apparent false-positive rate rose to 55% of ESTs among all sequences and 20% among benchmark sequences at the lowest stringency, indicating that many genes in public database entries are unannotated. Approximately half of the alignments span multiple exons, and thus aid in the construction of gene predictions and elucidation of alternative splicing. In addition, ESTs from multiple cDNA libraries frequently cluster over genes, providing a starting point for crude expression profiles. Clone IDs may be used to form EST pairs, and particularly to extend models by associating alignments of lower stringency with high-quality alignments. These results demonstrate that EST similarity search is a practical general-purpose annotation technique that complements pattern recognition methods as a tool for gene characterization.

Entities:  

Mesh:

Substances:

Year:  1998        PMID: 9548972     DOI: 10.1101/gr.8.4.362

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  14 in total

1.  Shotgun sequencing of the human transcriptome with ORF expressed sequence tags.

Authors:  E Dias Neto; R G Correa; S Verjovski-Almeida; M R Briones; M A Nagai; W da Silva; M A Zago; S Bordin; F F Costa; G H Goldman; A F Carvalho; A Matsukuma; G S Baia; D H Simpson; A Brunstein; P S de Oliveira; P Bucher; C V Jongeneel; M J O'Hare; F Soares; R R Brentani; L F Reis; S J de Souza; A J Simpson
Journal:  Proc Natl Acad Sci U S A       Date:  2000-03-28       Impact factor: 11.205

2.  PipMaker--a web server for aligning two genomic DNA sequences.

Authors:  S Schwartz; Z Zhang; K A Frazer; A Smit; C Riemer; J Bouck; R Gibbs; R Hardison; W Miller
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

Review 3.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

4.  GANESH: software for customized annotation of genome regions.

Authors:  Derek Huntley; Holger Hummerich; Damian Smedley; Sasivimol Kittivoravitkul; Mark McCarthy; Peter Little; Marek Sergot
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

5.  Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies.

Authors:  Brian J Haas; Arthur L Delcher; Stephen M Mount; Jennifer R Wortman; Roger K Smith; Linda I Hannick; Rama Maiti; Catherine M Ronning; Douglas B Rusch; Christopher D Town; Steven L Salzberg; Owen White
Journal:  Nucleic Acids Res       Date:  2003-10-01       Impact factor: 16.971

6.  A transcript finishing initiative for closing gaps in the human transcriptome.

Authors:  Mari Cleide Sogayar; Anamaria A Camargo; Fabiana Bettoni; Dirce Maria Carraro; Lilian C Pires; Raphael B Parmigiani; Elisa N Ferreira; Eloísa de Sá Moreira; Maria do Rosário D de O Latorre; Andrew J G Simpson; Luciana Oliveira Cruz; Theri Leica Degaki; Fernanda Festa; Katlin B Massirer; Mari C Sogayar; Fernando Camargo Filho; Luiz Paulo Camargo; Marco A V Cunha; Sandro J De Souza; Milton Faria; Silvana Giuliatti; Leonardo Kopp; Paulo S L de Oliveira; Paulo B Paiva; Anderson A Pereira; Daniel G Pinheiro; Renato D Puga; Jorge Estefano S de Souza; Dulcineia M Albuquerque; Luís E C Andrade; Gilson S Baia; Marcelo R S Briones; Ana M S Cavaleiro-Luna; Janete M Cerutti; Fernando F Costa; Eugenia Costanzi-Strauss; Enilza M Espreafico; Adriana C Ferrasi; Emer S Ferro; Maria A H Z Fortes; Joelma R F Furchi; Daniel Giannella-Neto; Gustavo H Goldman; Maria H S Goldman; Arthur Gruber; Gustavo S Guimarães; Christine Hackel; Flavio Henrique-Silva; Edna T Kimura; Suzana G Leoni; Cláudia Macedo; Bettina Malnic; Carina V Manzini B; Suely K N Marie; Nilce M Martinez-Rossi; Marcelo Menossi; Elisabete C Miracca; Maria A Nagai; Francisco G Nobrega; Marina P Nobrega; Sueli M Oba-Shinjo; Márika K Oliveira; Guilherme M Orabona; Audrey Y Otsuka; Maria L Paço-Larson; Beatriz M C Paixão; Jose R C Pandolfi; Maria I M C Pardini; Maria R Passos Bueno; Geraldo A S Passos; Joao B Pesquero; Juliana G Pessoa; Paula Rahal; Cláudia A Rainho; Caroline P Reis; Tatiana I Ricca; Vanderlei Rodrigues; Silvia R Rogatto; Camila M Romano; Janaína G Romeiro; Antonio Rossi; Renata G Sá; Magaly M Sales; Simone C Sant'Anna; Patrícia L Santarosa; Fernando Segato; Wilson A Silva; Ismael D C G Silva; Neusa P Silva; Andrea Soares-Costa; Maria F Sonati; Bryan E Strauss; Eloiza H Tajara; Sandro R Valentini; Fabiola E Villanova; Laura S Ward; Dalila L Zanette
Journal:  Genome Res       Date:  2004-06-14       Impact factor: 9.043

7.  Fugu ESTs: new resources for transcription analysis and genome annotation.

Authors:  Melody S Clark; Yvonne J K Edwards; Dan Peterson; Sandra W Clifton; Amanda J Thompson; Masahide Sasaki; Yutaka Suzuki; Kiyoshi Kikuchi; Shugo Watabe; Koichi Kawakami; Sumio Sugano; Greg Elgar; Stephen L Johnson
Journal:  Genome Res       Date:  2003-11-12       Impact factor: 9.043

8.  A computer program for aligning a cDNA sequence with a genomic DNA sequence.

Authors:  L Florea; G Hartzell; Z Zhang; G M Rubin; W Miller
Journal:  Genome Res       Date:  1998-09       Impact factor: 9.043

9.  Identification of novel human genes evolutionarily conserved in Caenorhabditis elegans by comparative proteomics.

Authors:  C H Lai; C Y Chou; L Y Ch'ang; C S Liu; W Lin
Journal:  Genome Res       Date:  2000-05       Impact factor: 9.043

10.  Expressed sequence tags of the peanut pod nematode Ditylenchus africanus: the first transcriptome analysis of an Anguinid nematode.

Authors:  Annelies Haegeman; Joachim Jacob; Bartel Vanholme; Tina Kyndt; Makedonka Mitreva; Godelieve Gheysen
Journal:  Mol Biochem Parasitol       Date:  2009-04-19       Impact factor: 1.759

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.