EbEST: an automated tool using expressed sequence tags to delineate gene structure.

J Jiang, H J Jacob.

Abstract

Large numbers of expressed sequence tags (ESTs) continue to fill public and private databases with partial cDNA sequences. However, using this large number of ESTs to facilitate gene finding in genomic sequence poses a challenge, especially to wet-lab scientists, who often have limited computing resources. In an effort to consolidate the information hidden in the vast number of ESTs into a readable and manageable format, we have developed EbEST, a program that automates the process of using ESTs to help delineate gene structure in long stretches of genomic sequence. The EbEST program consists of three functional modules: the first module separates homologous ESTs into clusters and identifies the most informative ESTs within each cluster; the second module uses the informative ESTs to perform gapped alignment and to predict exon-intron boundaries; and the third module generates text-file and graphical outputs that illustrate the orientation, exonic structure, and untranslated regions (UTRs) of putative genes in the genomic sequence being analyzed. Evaluation of EbEST with 176 human genes from the ALLSEQ set indicated that it performed comparably to several existing gene-finding programs, but was more tolerant of sequencing errors. Furthermore, when EbEST was challenged with query sequences that harbor more than one gene, it suffered only a slight drop in performance, whereas the performance of the other programs evaluated decreased more. EbEST may be used as a stand-alone tool to annotate human genomic sequences with EST-derived gene elements, or in conjunction with computational gene-recognition programs to increase the accuracy of gene prediction. [EbEST is available at http://EbEST.ifrc.mcw.edu]
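The first two modules described in the abstract can be illustrated with a minimal Python sketch. It is not EbEST's actual implementation: the abstract does not give the clustering rule, the criterion for "most informative", or the boundary-prediction method. The sketch assumes ESTs have already been matched to the genomic sequence, clusters them by overlapping genomic coordinates, picks the hit with the longest genomic span as the representative, and treats sufficiently long gaps between aligned blocks as candidate introns. The names `EstHit`, `cluster_hits`, `most_informative`, `introns_from_blocks`, and the `min_intron` threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class EstHit:
    """One EST matched to the genomic sequence (hypothetical record layout)."""
    est_id: str
    start: int  # 0-based start on the genomic sequence, inclusive
    end: int    # end on the genomic sequence, exclusive


def cluster_hits(hits: List[EstHit]) -> List[List[EstHit]]:
    """Group hits whose genomic spans overlap (assumed clustering rule)."""
    clusters: List[List[EstHit]] = []
    cluster_end = -1
    for hit in sorted(hits, key=lambda h: (h.start, h.end)):
        if clusters and hit.start < cluster_end:
            # Overlaps the current cluster: extend it.
            clusters[-1].append(hit)
            cluster_end = max(cluster_end, hit.end)
        else:
            # No overlap: start a new cluster.
            clusters.append([hit])
            cluster_end = hit.end
    return clusters


def most_informative(cluster: List[EstHit]) -> EstHit:
    """Assumed criterion: the hit covering the longest genomic span."""
    return max(cluster, key=lambda h: h.end - h.start)


def introns_from_blocks(
    blocks: List[Tuple[int, int]], min_intron: int = 20
) -> List[Tuple[int, int]]:
    """Report gaps between consecutive aligned blocks as candidate introns.

    Gaps shorter than min_intron are treated as alignment noise (an
    arbitrary threshold chosen for illustration).
    """
    ordered = sorted(blocks)
    introns = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start - prev_end >= min_intron:
            introns.append((prev_end, next_start))
    return introns


if __name__ == "__main__":
    hits = [EstHit("a", 100, 400), EstHit("b", 350, 900), EstHit("c", 5000, 5300)]
    for cluster in cluster_hits(hits):
        print([h.est_id for h in cluster], "->", most_informative(cluster).est_id)
    print(introns_from_blocks([(100, 200), (1200, 1350), (1352, 1400)]))
```

In this sketch the two-base gap between the last two blocks is ignored, which loosely mirrors the abstract's point about tolerance to sequencing errors; a real tool would also check splice-site signals at the predicted boundaries.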

Year:  1998        PMID: 9521930      PMCID: PMC310694          DOI: 10.1101/gr.8.3.268

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


