Literature DB >> 10772867

Homology-based method for identification of protein repeats using statistical significance estimates.

M A Andrade1, C P Ponting, T J Gibson, P Bork.   

Abstract

Short protein repeats, frequently with a length between 20 and 40 residues, represent a significant fraction of known proteins. Many repeats appear to possess high amino acid substitution rates and thus recognition of repeat homologues is highly problematic. Even if the presence of a certain repeat family is known, the exact locations and the number of repetitive units often cannot be determined using current methods. We have devised an iterative algorithm based on optimal and sub-optimal score distributions from profile analysis that estimates the significance of all repeats that are detected in a single sequence. This procedure allows the identification of homologues at alignment scores lower than the highest optimal alignment score for non-homologous sequences. The method has been used to investigate the occurrence of eleven families of repeats in Saccharomyces cerevisiae, Caenorhabditis elegans and Homo sapiens accounting for 1055, 2205 and 2320 repeats, respectively. For these examples, the method is both more sensitive and more selective than conventional homology search procedures. The method allowed the detection in the SwissProt database of more than 2000 previously unrecognised repeats belonging to the 11 families. In addition, the method was used to merge several repeat families that previously were supposed to be distinct, indicating common phylogenetic origins for these families. Copyright 2000 Academic Press.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10772867     DOI: 10.1006/jmbi.2000.3684

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  73 in total

1.  BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations.

Authors:  A Bahr; J D Thompson; J C Thierry; O Poch
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Proteins of the endoplasmic-reticulum-associated degradation pathway: domain detection and function prediction.

Authors:  C P Ponting
Journal:  Biochem J       Date:  2000-10-15       Impact factor: 3.857

3.  A combination of the F-box motif and kelch repeats defines a large Arabidopsis family of F-box proteins.

Authors:  M A Andrade; M González-Guzmán; R Serrano; P L Rodríguez
Journal:  Plant Mol Biol       Date:  2001-07       Impact factor: 4.076

Review 4.  Assembly and regulation of the yeast vacuolar H+-ATPase.

Authors:  Patricia M Kane; Anne M Smardon
Journal:  J Bioenerg Biomembr       Date:  2003-08       Impact factor: 2.945

5.  The Mitochondrial Protein Import Machinery of Plants (MPIMP) database.

Authors:  Ryan Lister; Monika W Murcha; James Whelan
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

6.  A novel gene IBF1 is required for the inhibition of brown pigment deposition in rice hull furrows.

Authors:  Tian Shao; Qian Qian; Ding Tang; Jun Chen; Ming Li; Zhukuan Cheng; Qiong Luo
Journal:  Theor Appl Genet       Date:  2012-03-15       Impact factor: 5.699

7.  Novel WDR72 mutation and cytoplasmic localization.

Authors:  S-K Lee; F Seymen; K-E Lee; H-Y Kang; M Yildirim; E Bahar Tuna; K Gencay; Y-H Hwang; K H Nam; R J De La Garza; J C-C Hu; J P Simmer; J-W Kim
Journal:  J Dent Res       Date:  2010-10-11       Impact factor: 6.116

8.  A structural model for the HAT domain of Utp6 incorporating bioinformatics and genetics.

Authors:  Erica A Champion; Lenka Kundrat; Lynne Regan; Susan J Baserga
Journal:  Protein Eng Des Sel       Date:  2009-06-10       Impact factor: 1.650

9.  The yeast N(alpha)-acetyltransferase NatA is quantitatively anchored to the ribosome and interacts with nascent polypeptides.

Authors:  Matthias Gautschi; Sören Just; Andrej Mun; Suzanne Ross; Peter Rücknagel; Yves Dubaquié; Ann Ehrenhofer-Murray; Sabine Rospert
Journal:  Mol Cell Biol       Date:  2003-10       Impact factor: 4.272

10.  M148R and M149R are two virulence factors for myxoma virus pathogenesis in the European rabbit.

Authors:  Sophie Blanié; Jérémy Mortier; Maxence Delverdier; Stéphane Bertagnoli; Christelle Camus-Bouclainville
Journal:  Vet Res       Date:  2008-11-19       Impact factor: 3.683

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.