Literature DB >> 11242592

Amino acid translation program for full-length cDNA sequences with frameshift errors.

Y Fukunishi1, Y Hayashizaki.   

Abstract

Here we present an amino acid translation program designed to suggest the position of experimental frameshift errors and predict amino acid sequences for full-length cDNA sequences having phred scores. Our program generates artificial insertions into artificial deletions from low-accuracy positions of the original sequence, thereby generating many candidate sequences. The validity of the most probable sequence (the likelihood that it represents the actual protein) is evaluated by using a score (V(a)) that is calculated in light of the Kozak consensus, preferred codon usage, and position of the initiation codon. To evaluate the software, we have used a database in which, out of 612 cDNA sequences, 524 (86%) carried 773 frameshift errors in the coding sequence. Our software detected and corrected 48% of the total frameshift errors in 62% of the total cDNA sequences with frameshift errors. The false positive rate of frameshift correction was 9%, and 91% of the suggested frameshifts were true.

Mesh:

Substances:

Year:  2001        PMID: 11242592     DOI: 10.1152/physiolgenomics.2001.5.2.81

Source DB:  PubMed          Journal:  Physiol Genomics        ISSN: 1094-8341            Impact factor:   3.107


  19 in total

1.  Exploration of novel motifs derived from mouse cDNA sequences.

Authors:  Hideya Kawaji; Christian Schönbach; Yo Matsuo; Jun Kawai; Yasushi Okazaki; Yoshihide Hayashizaki; Hideo Matsuda
Journal:  Genome Res       Date:  2002-03       Impact factor: 9.043

2.  CDS annotation in full-length cDNA sequence.

Authors:  Masaaki Furuno; Takeya Kasukawa; Rintaro Saito; Jun Adachi; Harukazu Suzuki; Richard Baldarelli; Yoshihide Hayashizaki; Yasushi Okazaki
Journal:  Genome Res       Date:  2003-06       Impact factor: 9.043

3.  Analysis of 13000 unique Citrus clusters associated with fruit quality, production and salinity tolerance.

Authors:  Javier Terol; Ana Conesa; Jose M Colmenero; Manuel Cercos; Francisco Tadeo; Javier Agustí; Enriqueta Alós; Fernando Andres; Guillermo Soler; Javier Brumos; Domingo J Iglesias; Stefan Götz; Francisco Legaz; Xavier Argout; Brigitte Courtois; Patrick Ollitrault; Carole Dossat; Patrick Wincker; Raphael Morillon; Manuel Talon
Journal:  BMC Genomics       Date:  2007-01-25       Impact factor: 3.969

4.  Inferring alternative splicing patterns in mouse from a full-length cDNA library and microarray data.

Authors:  Hiromi Kochiwa; Ryosuke Suzuki; Takanori Washio; Rintaro Saito; Hidemasa Bono; Piero Carninci; Yasushi Okazaki; Rika Miki; Yoshihide Hayashizaki; Masaru Tomita
Journal:  Genome Res       Date:  2002-08       Impact factor: 9.043

5.  TriFLDB: a database of clustered full-length coding sequences from Triticeae with applications to comparative grass genomics.

Authors:  Keiichi Mochida; Takuhiro Yoshida; Tetsuya Sakurai; Yasunari Ogihara; Kazuo Shinozaki
Journal:  Plant Physiol       Date:  2009-05-15       Impact factor: 8.340

6.  Constitutive expression of a grapevine polygalacturonase-inhibiting protein affects gene expression and cell wall properties in uninfected tobacco.

Authors:  Erik Alexandersson; John Vw Becker; Dan Jacobson; Eric Nguema-Ona; Cobus Steyn; Katherine J Denby; Melané A Vivier
Journal:  BMC Res Notes       Date:  2011-11-13

7.  Addressing statistical biases in nucleotide-derived protein databases for proteogenomic search strategies.

Authors:  Paul Blakeley; Ian M Overton; Simon J Hubbard
Journal:  J Proteome Res       Date:  2012-10-15       Impact factor: 4.466

Review 8.  Major prospects for exploring canine vector borne diseases and novel intervention methods using 'omic technologies.

Authors:  Robin B Gasser; Cinzia Cantacessi; Bronwyn E Campbell; Andreas Hofmann; Domenico Otranto
Journal:  Parasit Vectors       Date:  2011-04-13       Impact factor: 3.876

9.  OrthoSelect: a protocol for selecting orthologous groups in phylogenomics.

Authors:  Fabian Schreiber; Kerstin Pick; Dirk Erpenbeck; Gert Wörheide; Burkhard Morgenstern
Journal:  BMC Bioinformatics       Date:  2009-07-16       Impact factor: 3.169

10.  A Statistical Method without Training Step for the Classification of Coding Frame in Transcriptome Sequences.

Authors:  Nicolas Carels; Diego Frías
Journal:  Bioinform Biol Insights       Date:  2013-01-23
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.