
Using database matches with HMMGene for automated gene detection in Drosophila.

A Krogh

Abstract

The application of the gene finder HMMGene to the Adh region of Drosophila melanogaster is described, and the prediction results are analyzed. HMMGene is based on a probabilistic model called a hidden Markov model, and this probabilistic framework facilitates the inclusion of database matches of varying degrees of certainty. Database matches are shown to clearly improve the performance of the gene finder. For instance, on a high-quality test set, the sensitivity for coding exons predicted with both ends correct rises from 62% to 70% when matches to proteins, cDNAs, repeats, and transposons are included. When ESTs are used, however, the specificity drops more than the sensitivity increases. This is due to the high noise level in EST matches; the reasons for this noise and how the use of ESTs might be improved are discussed in more detail.
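The exon-level figures quoted in the abstract can be illustrated with a minimal sketch. It assumes the definitions conventionally used in gene-finding evaluations: sensitivity is the fraction of annotated exons that are predicted with both ends correct, and specificity is the fraction of predicted exons that match an annotated exon at both ends. The function name and the exon coordinates are hypothetical and are not taken from the paper or from HMMGene.

```python
def exon_level_accuracy(annotated, predicted):
    """Return (sensitivity, specificity) at the exon level.

    Exons are (start, end) tuples. A predicted exon counts as correct
    only if both its start and its end match an annotated exon exactly.
    Assumed definitions (not taken from the paper):
      sensitivity = correct exons / annotated exons
      specificity = correct exons / predicted exons
    """
    annotated_set = set(annotated)
    predicted_set = set(predicted)
    correct = annotated_set & predicted_set
    sensitivity = len(correct) / len(annotated_set) if annotated_set else 0.0
    specificity = len(correct) / len(predicted_set) if predicted_set else 0.0
    return sensitivity, specificity


# Hypothetical coordinates, for illustration only.
annotated = [(100, 250), (400, 520), (700, 910), (1200, 1350)]
predicted = [(100, 250), (400, 530), (700, 910), (1200, 1350), (1500, 1600)]

sn, sp = exon_level_accuracy(annotated, predicted)
print(f"sensitivity={sn:.2f} specificity={sp:.2f}")
# prints: sensitivity=0.75 specificity=0.60
```

In this toy case the prediction (400, 530) misses one exon end and (1500, 1600) matches nothing, so both lower the specificity. Noisy evidence such as spurious EST matches could have a similar effect by introducing extra predicted exons.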

Year:  2000        PMID: 10779492      PMCID: PMC310864          DOI: 10.1101/gr.10.4.523

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  22 in total

1.  Computational inference of homologous gene structures in the human genome.

Authors:  R F Yeh; L P Lim; C B Burge
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

2.  A complexity reduction algorithm for analysis and annotation of large genomic sequences.

Authors:  Trees-Juen Chuang; Wen-Chang Lin; Hurng-Chun Lee; Chi-Wei Wang; Keh-Lin Hsiao; Zi-Hao Wang; Danny Shieh; Simon C Lin; Lan-Yang Ch'ang
Journal:  Genome Res       Date:  2003-02       Impact factor: 9.043

3.  GAZE: a generic framework for the integration of gene-prediction data by dynamic programming.

Authors:  Kevin L Howe; Tom Chothia; Richard Durbin
Journal:  Genome Res       Date:  2002-09       Impact factor: 9.043

4.  GeneWise and Genomewise.

Authors:  Ewan Birney; Michele Clamp; Richard Durbin
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

5.  Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models.

Authors:  Stephan Waack; Oliver Keller; Roman Asper; Thomas Brodag; Carsten Damm; Wolfgang Florian Fricke; Katharina Surovcik; Peter Meinicke; Rainer Merkl
Journal:  BMC Bioinformatics       Date:  2006-03-16       Impact factor: 3.169

6.  Gene discovery using computational and microarray analysis of transcription in the Drosophila melanogaster testis.

Authors:  J Andrews; G G Bouffard; C Cheadle; J Lü; K G Becker; B Oliver
Journal:  Genome Res       Date:  2000-12       Impact factor: 9.043

7.  Molecular basis for mycophenolic acid biosynthesis in Penicillium brevicompactum.

Authors:  Torsten Bak Regueira; Kanchana Rueksomtawin Kildegaard; Bjarne Gram Hansen; Uffe H Mortensen; Christian Hertweck; Jens Nielsen
Journal:  Appl Environ Microbiol       Date:  2011-03-11       Impact factor: 4.792

Review 8.  Revamp a model-status and prospects of the Dictyostelium genome project.

Authors:  Ludwig Eichinger
Journal:  Curr Genet       Date:  2003-07-11       Impact factor: 3.886

9.  The Crohn's disease susceptibility gene DLG5 as a member of the CARD interaction network.

Authors:  Frauke Friedrichs; Liesbet Henckaerts; Severine Vermeire; Torsten Kucharzik; Tanja Seehafer; Maren Möller-Krull; Erich Bornberg-Bauer; Monika Stoll; January Weiner
Journal:  J Mol Med (Berl)       Date:  2008-03-12       Impact factor: 4.599

10.  Computational identification of protein coding potential of conserved sequence tags through cross-species evolutionary analysis.

Authors:  Flavio Mignone; Giorgio Grillo; Sabino Liuni; Graziano Pesole
Journal:  Nucleic Acids Res       Date:  2003-08-01       Impact factor: 16.971
