Literature DB >> 9721203

Performance-guarantee gene predictions via spliced alignment.

A A Mironov1, M A Roytberg, P A Pevzner, M S Gelfand.   

Abstract

An important and still unsolved problem in gene prediction is designing an algorithm that not only predicts genes but estimates the quality of individual predictions as well. Since experimental biologists are interested mainly in the reliability of individual predictions (rather than in the average reliability of an algorithm) we attempted to develop a gene recognition algorithm that guarantees a certain quality of predictions. We demonstrate here that the similarity level with a related protein is a reliable quality estimator for the spliced alignment approach to gene recognition. We also study the average performance of the spliced alignment algorithm for different targets on a complete set of human genomic sequences with known relatives and demonstrate that the average performance of the method remains high even for very distant targets. Using plant, fungal, and prokaryotic target proteins for recognition of human genes leads to accurate predictions with 95, 93, and 91% correlation coefficient, respectively. For target proteins with similarity score above 60%, not only the average correlation coefficient is very high (97% and up) but also the quality of individual predictions is guaranteed to be at least 82%. It indicates that for this level of similarity the worst case performance of the spliced alignment algorithm is better than the average case performance of many statistical gene recognition methods. Copyright 1998 Academic Press.

Entities:  

Mesh:

Substances:

Year:  1998        PMID: 9721203     DOI: 10.1006/geno.1998.5251

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  6 in total

1.  A complexity reduction algorithm for analysis and annotation of large genomic sequences.

Authors:  Trees-Juen Chuang; Wen-Chang Lin; Hurng-Chun Lee; Chi-Wei Wang; Keh-Lin Hsiao; Zi-Hao Wang; Danny Shieh; Simon C Lin; Lan-Yang Ch'ang
Journal:  Genome Res       Date:  2003-02       Impact factor: 9.043

2.  Identification of programmed translational -1 frameshifting sites in the genome of Saccharomyces cerevisiae.

Authors:  Michaël Bekaert; Hugues Richard; Bernard Prum; Jean-Pierre Rousset
Journal:  Genome Res       Date:  2005-10       Impact factor: 9.043

3.  Frequent alternative splicing of human genes.

Authors:  A A Mironov; J W Fickett; M S Gelfand
Journal:  Genome Res       Date:  1999-12       Impact factor: 9.043

4.  Fourfold faster rate of genome rearrangement in nematodes than in Drosophila.

Authors:  Avril Coghlan; Kenneth H Wolfe
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

5.  Gene identification in novel eukaryotic genomes by self-training algorithm.

Authors:  Alexandre Lomsadze; Vardges Ter-Hovhannisyan; Yury O Chernoff; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2005-11-28       Impact factor: 16.971

6.  A third approach to gene prediction suggests thousands of additional human transcribed regions.

Authors:  Gustavo Glusman; Shizhen Qin; M Raafat El-Gewely; Andrew F Siegel; Jared C Roach; Leroy Hood; Arian F A Smit
Journal:  PLoS Comput Biol       Date:  2006-03-17       Impact factor: 4.475

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.