Literature DB >> 9278061

Las Vegas algorithms for gene recognition: suboptimal and error-tolerant spliced alignment.

S H Sze1, P A Pevzner.   

Abstract

Recently, Gelfand, Mironov and Pevzner (1996) proposed a spliced alignment approach to gene recognition that provides 99% accurate recognition of human genes if a related mammalian protein is available. However, even 99% accurate gene predictions are insufficient for automated sequence annotation in large-scale sequencing projects and therefore have to be complemented by experimental gene verification. One hundred percent accurate gene predictions would lead to a substantial reduction of experimental work on gene identification. Our goal is to develop an algorithm that either predicts an exon assembly with accuracy sufficient for sequence annotation or warns a biologist that the accuracy of a prediction is insufficient and further experimental work is required. We study suboptimal and error-tolerant spliced alignment problems as the first steps towards such an algorithm, and report an algorithm which provides 100% accurate recognition of human genes in 37% of cases (if a related mammalian protein is available). In 52% of genes, the algorithm predicts at least one exon with 100% accuracy.

Entities:  

Mesh:

Year:  1997        PMID: 9278061     DOI: 10.1089/cmb.1997.4.297

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

1.  A complexity reduction algorithm for analysis and annotation of large genomic sequences.

Authors:  Trees-Juen Chuang; Wen-Chang Lin; Hurng-Chun Lee; Chi-Wei Wang; Keh-Lin Hsiao; Zi-Hao Wang; Danny Shieh; Simon C Lin; Lan-Yang Ch'ang
Journal:  Genome Res       Date:  2003-02       Impact factor: 9.043

2.  Frequent alternative splicing of human genes.

Authors:  A A Mironov; J W Fickett; M S Gelfand
Journal:  Genome Res       Date:  1999-12       Impact factor: 9.043

3.  Integrated entropy-based approach for analyzing exons and introns in DNA sequences.

Authors:  Junyi Li; Li Zhang; Huinian Li; Yuan Ping; Qingzhe Xu; Rongjie Wang; Renjie Tan; Zhen Wang; Bo Liu; Yadong Wang
Journal:  BMC Bioinformatics       Date:  2019-06-10       Impact factor: 3.169

4.  An optimized approach for annotation of large eukaryotic genomic sequences using genetic algorithm.

Authors:  Biswanath Chowdhury; Arnav Garai; Gautam Garai
Journal:  BMC Bioinformatics       Date:  2017-10-24       Impact factor: 3.169

5.  MassGenie: A Transformer-Based Deep Learning Method for Identifying Small Molecules from Their Mass Spectra.

Authors:  Aditya Divyakant Shrivastava; Neil Swainston; Soumitra Samanta; Ivayla Roberts; Marina Wright Muelas; Douglas B Kell
Journal:  Biomolecules       Date:  2021-11-30
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.