Literature DB >> 7903399

Detecting frame shifts by amino acid sequence comparison.

J M Claverie1.   

Abstract

Various amino acid substitution scoring matrices are used in conjunction with local alignments programs to detect regions of similarity and infer potential common ancestry between proteins. The usual scoring schemes derive from the implicit hypothesis that related proteins evolve from a common ancestor by the accumulation of point mutations and that amino acids tend to be progressively substituted by others with similar properties. However, other frequent single mutation events, like nucleotide insertion or deletion and gene inversion, change the translation reading frame and cause previously encoded amino acid sequences to become unrecognizable at once. Here, I derive five new types of scoring matrix, each capable of detecting a specific frame shift (deletion, insertion and inversion in 3 frames) and use them with a regular local alignments program to detect amino acid sequences that may have derived from alternative reading frames of the same nucleotide sequence. Frame shifts are inferred from the sole comparison of the protein sequences. The five scoring matrices were used with the BLASTP program to compare all the protein sequences in the Swissprot database. Surprisingly, the searches revealed hundreds of highly significant frame shift matches, of which many are likely to represent sequencing errors. Others provide some evidence that frame shift mutations might be used in protein evolution as a way to create new amino acid sequences from pre-existing coding regions.

Mesh:

Substances:

Year:  1993        PMID: 7903399     DOI: 10.1006/jmbi.1993.1666

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  10 in total

1.  Detecting and analyzing DNA sequencing errors: toward a higher quality of the Bacillus subtilis genome sequence.

Authors:  C Médigue; M Rose; A Viari; A Danchin
Journal:  Genome Res       Date:  1999-11       Impact factor: 9.043

2.  Comparative proteogenomics: combining mass spectrometry and comparative genomics to analyze multiple genomes.

Authors:  Nitin Gupta; Jamal Benhamida; Vipul Bhargava; Daniel Goodman; Elisabeth Kain; Ian Kerman; Ngan Nguyen; Noah Ollikainen; Jesse Rodriguez; Jian Wang; Mary S Lipton; Margaret Romine; Vineet Bafna; Richard D Smith; Pavel A Pevzner
Journal:  Genome Res       Date:  2008-04-21       Impact factor: 9.043

Review 3.  Computational methods for exon detection.

Authors:  J M Claverie
Journal:  Mol Biotechnol       Date:  1998-08       Impact factor: 2.695

4.  PairWise and SearchWise: finding the optimal alignment in a simultaneous comparison of a protein profile against all DNA translation frames.

Authors:  E Birney; J D Thompson; T J Gibson
Journal:  Nucleic Acids Res       Date:  1996-07-15       Impact factor: 16.971

5.  A frameshift error detection algorithm for DNA sequencing projects.

Authors:  G A Fichant; Y Quentin
Journal:  Nucleic Acids Res       Date:  1995-08-11       Impact factor: 16.971

6.  Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance.

Authors:  Xiaolong Wang; Quanjiang Dong; Gang Chen; Jianye Zhang; Yongqiang Liu; Yujia Cai
Journal:  BMC Genomics       Date:  2022-06-02       Impact factor: 4.547

Review 7.  Genetic map of Salmonella typhimurium, edition VIII.

Authors:  K E Sanderson; A Hessel; K E Rudd
Journal:  Microbiol Rev       Date:  1995-06

8.  ICDS database: interrupted CoDing sequences in prokaryotic genomes.

Authors:  Emmanuel Perrodou; Caroline Deshayes; Jean Muller; Christine Schaeffer; Alain Van Dorsselaer; Raymond Ripp; Olivier Poch; Jean-Marc Reyrat; Odile Lecompte
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

9.  Frameshifting preserves key physicochemical properties of proteins.

Authors:  Lukas Bartonek; Daniel Braun; Bojan Zagrovic
Journal:  Proc Natl Acad Sci U S A       Date:  2020-03-03       Impact factor: 11.205

10.  Back-translation for discovering distant protein homologies in the presence of frameshift mutations.

Authors:  Marta Girdea; Laurent Noe; Gregory Kucherov
Journal:  Algorithms Mol Biol       Date:  2010-01-04       Impact factor: 1.405

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.