Literature DB >> 8108381

A method to recognize distant repeats in protein sequences.

J Heringa1, P Argos.   

Abstract

An automated algorithm is presented that delineates protein sequence fragments which display similarity. The method incorporates a selection of a number of local nonoverlapping sequence alignments with the highest similarity scores and a graph-theoretical approach to elucidate the consistent start and end points of the fragments comprising one or more ensembles of related subsequences. The procedure allows the simultaneous identification of different types of repeats within one sequence. A multiple alignment of the resulting fragments is performed and a consensus sequence derived from the ensemble(s). Finally, a profile is constructed from the multiple alignment to detect possible and more distant members within the sequence. The method tolerates mutations in the repeats as well as insertions and deletions. The sequence spans between the various repeats or repeat clusters may be of different lengths. The technique has been applied to a number of proteins where the repeating fragments have been derived from information additional to the protein sequences.

Mesh:

Substances:

Year:  1993        PMID: 8108381     DOI: 10.1002/prot.340170407

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  11 in total

1.  BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations.

Authors:  A Bahr; J D Thompson; J C Thierry; O Poch
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Amino acid repeat patterns in protein sequences: their diversity and structural-functional implications.

Authors:  M V Katti; R Sami-Subbu; P K Ranjekar; V S Gupta
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

3.  Protein structure prediction and analysis using the Robetta server.

Authors:  David E Kim; Dylan Chivian; David Baker
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

4.  Ab initio detection of fuzzy amino acid tandem repeats in protein sequences.

Authors:  Marco Pellegrini; Maria Elena Renda; Alessio Vecchio
Journal:  BMC Bioinformatics       Date:  2012-03-21       Impact factor: 3.169

5.  Algorithm to find distant repeats in a single protein sequence.

Authors:  Nirjhar Banerjee; Rangarajan Sarani; Chellamuthu Vasuki Ranjani; Govindaraj Sowmiya; Daliah Michael; Narayanasamy Balakrishnan; Kanagaraj Sekar
Journal:  Bioinformation       Date:  2008-09-19

6.  HHrep: de novo protein repeat detection and the origin of TIM barrels.

Authors:  Johannes Söding; Michael Remmert; Andreas Biegert
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

7.  Multiple alignment of protein sequences with repeats and rearrangements.

Authors:  Tu Minh Phuong; Chuong B Do; Robert C Edgar; Serafim Batzoglou
Journal:  Nucleic Acids Res       Date:  2006-10-26       Impact factor: 16.971

8.  Understanding and identifying amino acid repeats.

Authors:  Hong Luo; Harm Nijveen
Journal:  Brief Bioinform       Date:  2014-07       Impact factor: 11.622

9.  Identification and Analysis of Long Repeats of Proteins at the Domain Level.

Authors:  David Mary Rajathei; Subbiah Parthasarathy; Samuel Selvaraj
Journal:  Front Bioeng Biotechnol       Date:  2019-10-08

Review 10.  Tandem Repeats in Proteins: Prediction Algorithms and Biological Role.

Authors:  Marco Pellegrini
Journal:  Front Bioeng Biotechnol       Date:  2015-09-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.