Literature DB >> 10382671

A fast algorithm for genome-wide analysis of proteins with repeated sequences.

M Pellegrini1, E M Marcotte, T O Yeates.   

Abstract

We present a fast algorithm to search for repeating fragments within protein sequences. The technique is based on an extension of the Smith-Waterman algorithm that allows the calculation of sub-optimal alignments of a sequence against itself. We are able to estimate the statistical significance of all sub-optimal alignment scores. We also rapidly determine the length of the repeating fragment and the number of times it is found in a sequence. The technique is applied to sequences in the Swissprot database, and to 16 complete genomes. We find that eukaryotic proteins contain more internal repeats than those of prokaryotic and archael organisms. The finding that 18% of yeast sequences and 28% of the known human sequences contain detectable repeats emphasizes the importance of internal duplication in protein evolution.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10382671

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  27 in total

1.  BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations.

Authors:  A Bahr; J D Thompson; J C Thierry; O Poch
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Amino acid repeat patterns in protein sequences: their diversity and structural-functional implications.

Authors:  M V Katti; R Sami-Subbu; P K Ranjekar; V S Gupta
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

3.  Uncovering the rules for protein-protein interactions from yeast genomic data.

Authors:  Jin Wang; Chunhe Li; Erkang Wang; Xidi Wang
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-23       Impact factor: 11.205

4.  Two different domains of the luciferase gene in the heterotrophic dinoflagellate Noctiluca scintillans occur as two separate genes in photosynthetic species.

Authors:  Liyun Liu; J Woodland Hastings
Journal:  Proc Natl Acad Sci U S A       Date:  2006-11-27       Impact factor: 11.205

5.  Protein tandem repeats - the more perfect, the less structured.

Authors:  Julien Jorda; Bin Xue; Vladimir N Uversky; Andrey V Kajava
Journal:  FEBS J       Date:  2010-06       Impact factor: 5.542

6.  Unraveling the Mechanics of a Repeat-Protein Nanospring: From Folding of Individual Repeats to Fluctuations of the Superhelix.

Authors:  Marie Synakewicz; Rohan S Eapen; Albert Perez-Riba; Pamela J E Rowling; Daniela Bauer; Andreas Weißl; Gerhard Fischer; Marko Hyvönen; Matthias Rief; Laura S Itzhaki; Johannes Stigler
Journal:  ACS Nano       Date:  2022-03-08       Impact factor: 15.881

7.  Lineage-specific protein repeat expansions and contractions reveal malleable regions of immune genes.

Authors:  Lokdeep Teekas; Sandhya Sharma; Nagarjun Vijay
Journal:  Genes Immun       Date:  2022-10-06       Impact factor: 4.248

8.  Proteome-wide identification of mycobacterial pupylation targets.

Authors:  Christian Poulsen; Yusuf Akhter; Amy Hye-Won Jeon; Gerold Schmitt-Ulms; Helmut E Meyer; Anja Stefanski; Kai Stühler; Matthias Wilmanns; Young-Hwa Song
Journal:  Mol Syst Biol       Date:  2010-07-13       Impact factor: 11.429

9.  Ab initio detection of fuzzy amino acid tandem repeats in protein sequences.

Authors:  Marco Pellegrini; Maria Elena Renda; Alessio Vecchio
Journal:  BMC Bioinformatics       Date:  2012-03-21       Impact factor: 3.169

10.  Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins.

Authors:  Michelle Simon; John M Hancock
Journal:  Genome Biol       Date:  2009-06-01       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.