Literature DB >> 12112704

Swaps in protein sequences.

Amit Fliess1, Benny Motro, Ron Unger.   

Abstract

An important question in protein evolution is to what extent proteins may have undergone swaps (switches of domain or fragment order) during evolution. Such events might have occurred in several forms: Swaps of short fragments, swaps of structural and functional motifs, or recombination of domains in multidomain proteins. This question is important for the theoretical understanding of the evolution of proteins, and has practical implications for using swaps as a design tool in protein engineering. In order to analyze the question systematically, we conducted a large scale survey of possible swaps and permutations among all pairs of protein from the Swissport database. A swap is defined as a specific kind of sequence mutation between two proteins in which two fragments that appear in both sequences have different relative order in the two sequences. For example, aXbYc and dYeXf are defined as a swap, where X and Y represent sequence fragments that switched their order. Identifying such swaps is difficult using standard sequence comparison packages. One of the main problems in the analysis stems from the fact that many sequences contain repeats, which may be identified as false-positive swaps. We have used two different approaches to detect pairs of proteins with swaps. The first approach is based on the predefined list of domains in Pfam. We identified all the proteins that share at least two domains and analyzed their relative order, looking for pairs in which the order of these domains was switched. We designed an algorithm to distinguish between real swaps and duplications. In the second approach, we used Blast to detect pairs of proteins that share several fragments. Then, we used an automatic procedure to select pairs that are likely to contain swaps. Those pairs were analyzed visually, using a graphical tool, to eliminate duplications. Combining these approaches, about 140 different cases of swaps in the Swissprot database were found (after eliminating multiple pairs within the same family). Some of the cases have been described in the literature, but many are novel examples. Although each new example identified may be interesting to analyze, our main conclusion is that cases of swaps are rare in protein evolution. This observation is at odds with the common view that proteins are very modular to the point that modules (e.g., domains) can be shuffled between proteins with minimal constraints. Our study suggests that sequential constraints, i.e., the relative order between domains, are highly conserved. Copyright 2002 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2002        PMID: 12112704     DOI: 10.1002/prot.10156

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  8 in total

1.  Global extent of horizontal gene transfer.

Authors:  In-Geol Choi; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-07       Impact factor: 11.205

2.  Mapping sequences by parts.

Authors:  Gilles Didier; Carito Guziolowski
Journal:  Algorithms Mol Biol       Date:  2007-09-19       Impact factor: 1.405

3.  Predict impact of single amino acid change upon protein structure.

Authors:  Christian Schaefer; Burkhard Rost
Journal:  BMC Genomics       Date:  2012-06-18       Impact factor: 3.969

4.  New tricks for "old" domains: how novel architectures and promiscuous hubs contributed to the organization and evolution of the ECM.

Authors:  Graham Cromar; Ka-Chun Wong; Noeleen Loughran; Tuan On; Hongyan Song; Xuejian Xiong; Zhaolei Zhang; John Parkinson
Journal:  Genome Biol Evol       Date:  2014-10-15       Impact factor: 3.416

5.  PhyloPro2.0: a database for the dynamic exploration of phylogenetically conserved proteins and their domain architectures across the Eukarya.

Authors:  Graham L Cromar; Anthony Zhao; Xuejian Xiong; Lakshmipuram S Swapna; Noeleen Loughran; Hongyan Song; John Parkinson
Journal:  Database (Oxford)       Date:  2016-03-15       Impact factor: 3.451

6.  Structural and functional characterization of the LldR from Corynebacterium glutamicum: a transcriptional repressor involved in L-lactate and sugar utilization.

Authors:  Yong-Gui Gao; Hiroaki Suzuki; Hiroshi Itou; Yong Zhou; Yoshikazu Tanaka; Masaaki Wachi; Nobuhisa Watanabe; Isao Tanaka; Min Yao
Journal:  Nucleic Acids Res       Date:  2008-11-06       Impact factor: 16.971

7.  A comprehensive analysis of non-sequential alignments between all protein structures.

Authors:  Alexej Abyzov; Valentin A Ilyin
Journal:  BMC Struct Biol       Date:  2007-11-16

8.  cpRAS: a novel circularly permuted RAS-like GTPase domain with a highly scattered phylogenetic distribution.

Authors:  Marek Elias; Marian Novotny
Journal:  Biol Direct       Date:  2008-05-29       Impact factor: 4.540

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.