Literature DB >> 20556861

Genetack: frameshift identification in protein-coding sequences by the Viterbi algorithm.

Ivan Antonov1, Mark Borodovsky.   

Abstract

We describe a new program for ab initio frameshift detection in protein-coding nucleotide sequences. The task is to distinguish the same strand overlapping ORFs that occur in the sequence due to a presence of a frameshifted gene from the same strand overlapping ORFs that encompass true overlapping or adjacent genes. The GeneTack program uses a hidden Markov model (HMM) of genomic sequence with possibly frameshifted protein-coding regions. The Viterbi algorithm finds the maximum likelihood path that discriminates between true adjacent genes and those adjacent protein-coding regions that just appear to be separate entities due to frameshifts. Therefore, the program can identify spurious predictions made by a conventional gene-finding program misled by a frameshift. We tested GeneTack as well as two earlier developed programs FrameD and FSFind on 17 prokaryotic genomes with frameshifts introduced randomly into known genes. We observed that the average frameshift prediction accuracy of GeneTack, in terms of (Sn + Sp)/2 values, was higher by a significant margin than the accuracy of two other programs. In addition, we observed that the average accuracy of GeneTack is favorably compared with the accuracy of the FSFind-BLAST program that uses protein database search to verify predicted frameshifts, even though GeneTack does not use external evidence. GeneTack is freely available at http://topaz.gatech.edu/GeneTack/.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20556861     DOI: 10.1142/s0219720010004847

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  19 in total

1.  A pilot study of bacterial genes with disrupted ORFs reveals a surprising profusion of protein sequence recoding mediated by ribosomal frameshifting and transcriptional realignment.

Authors:  Virag Sharma; Andrew E Firth; Ivan Antonov; Olivier Fayet; John F Atkins; Mark Borodovsky; Pavel V Baranov
Journal:  Mol Biol Evol       Date:  2011-06-14       Impact factor: 16.240

2.  Ab initio gene identification in metagenomic sequences.

Authors:  Wenhan Zhu; Alexandre Lomsadze; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2010-04-19       Impact factor: 16.971

3.  Algorithms for hidden markov models restricted to occurrences of regular expressions.

Authors:  Paula Tataru; Andreas Sand; Asger Hobolth; Thomas Mailund; Christian N S Pedersen
Journal:  Biology (Basel)       Date:  2013-11-08

4.  GeneTack database: genes with frameshifts in prokaryotic genomes and eukaryotic mRNA sequences.

Authors:  Ivan Antonov; Pavel Baranov; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2012-11-17       Impact factor: 16.971

5.  Gene discovery in EST sequences from the wheat leaf rust fungus Puccinia triticina sexual spores, asexual spores and haustoria, compared to other rust and corn smut fungi.

Authors:  Junhuan Xu; Rob Linning; John Fellers; Matthew Dickinson; Wenhan Zhu; Ivan Antonov; David L Joly; Michael E Donaldson; Tamar Eilam; Yehoshua Anikster; Travis Banks; Sarah Munro; Michael Mayo; Brian Wynhoven; Johar Ali; Richard Moore; Brent McCallum; Mark Borodovsky; Barry Saville; Guus Bakkeren
Journal:  BMC Genomics       Date:  2011-03-24       Impact factor: 3.969

Review 6.  Programmed Deviations of Ribosomes From Standard Decoding in Archaea.

Authors:  Federica De Lise; Andrea Strazzulli; Roberta Iacono; Nicola Curci; Mauro Di Fenza; Luisa Maurelli; Marco Moracci; Beatrice Cobucci-Ponzano
Journal:  Front Microbiol       Date:  2021-06-04       Impact factor: 5.640

7.  Gene prediction in metagenomic fragments based on the SVM algorithm.

Authors:  Yongchu Liu; Jiangtao Guo; Gangqing Hu; Huaiqiu Zhu
Journal:  BMC Bioinformatics       Date:  2013-04-10       Impact factor: 3.169

8.  Short-read reading-frame predictors are not created equal: sequence error causes loss of signal.

Authors:  William L Trimble; Kevin P Keegan; Mark D'Souza; Andreas Wilke; Jared Wilkening; Jack Gilbert; Folker Meyer
Journal:  BMC Bioinformatics       Date:  2012-07-28       Impact factor: 3.169

Review 9.  Current opportunities and challenges in microbial metagenome analysis--a bioinformatic perspective.

Authors:  Hanno Teeling; Frank Oliver Glöckner
Journal:  Brief Bioinform       Date:  2012-09-09       Impact factor: 11.622

10.  MetaGeneTack: ab initio detection of frameshifts in metagenomic sequences.

Authors:  Shiyuyun Tang; Ivan Antonov; Mark Borodovsky
Journal:  Bioinformatics       Date:  2012-11-04       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.