Literature DB >> 8919874

Search for ancient patterns in protein sequences.

G Thode1, J A García-Ranea, J Jimenez.   

Abstract

Proteins of related functions are often similar in sequence, reflecting a common phylogenetic origin. Proteins with no known homology are probably diversified proteins, too distantly related to known sequences in databases to retain significant similarity. All proteins, however, probably share common ancestries if one moves far enough back in evolution; therefore, given the huge accumulation of protein sequences in current databases, it could be expected that some proteins with no obvious sequence resemblance to any other share some residues that could represent footprints of ancient common ancestries. To identify such putative footprints, we have searched for short stretches of amino acids present in a given protein sequence that are also found in a significant number of nonrelated proteins in the database. The significantly high frequency of occurrence of these "patterns" in the database would support a common evolutionary source, and a diversity of non-related proteins that contain the pattern would express their ancient origin. Using this strategy, significant patterns were found in actual exons, but not in randomized amino acid sequences, nor in "translated" sequences of noncoding DNA, suggesting that this strategy actually leads to the identification of patterns with a biological significance. These significant patterns are not randomly positioned along the sequences analyzed, but they tend to accumulate within specific regions, producing a profile of discrete "domains." In some well-known proteins analyzed in this study, some of these domains are coincident with known motifs. Thus, the procedure described in this paper could be useful for identifying ancient patterns and domains in protein sequences, some of which could also have a functional or structural significance.

Mesh:

Substances:

Year:  1996        PMID: 8919874     DOI: 10.1007/bf02198848

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  24 in total

1.  Protein sequences--homologies and motifs.

Authors:  M J Sternberg; S A Islam
Journal:  Trends Biotechnol       Date:  1991-09       Impact factor: 19.536

2.  Database of homology-derived protein structures and the structural meaning of sequence alignment.

Authors:  C Sander; R Schneider
Journal:  Proteins       Date:  1991

Review 3.  Protein modules.

Authors:  M Baron; D G Norman; I D Campbell
Journal:  Trends Biochem Sci       Date:  1991-01       Impact factor: 13.807

4.  A database of protein structure families with common folding motifs.

Authors:  L Holm; C Ouzounis; C Sander; G Tuparev; G Vriend
Journal:  Protein Sci       Date:  1992-12       Impact factor: 6.725

5.  Library of common protein motifs.

Authors:  M J Sternberg
Journal:  Nature       Date:  1991-01-10       Impact factor: 49.962

6.  Motifs of cadherin- and fibronectin type III-related sequences and evolution of the receptor-type-protein tyrosine kinases: sequence similarity between proto-oncogene ret and cadherin family.

Authors:  K Kuma; N Iwabe; T Miyata
Journal:  Mol Biol Evol       Date:  1993-05       Impact factor: 16.240

Review 7.  Convergent evolution: the need to be explicit.

Authors:  R F Doolittle
Journal:  Trends Biochem Sci       Date:  1994-01       Impact factor: 13.807

8.  Evidence for a protein domain superfamily shared by the cyclins, TFIIB and RB/p107.

Authors:  T J Gibson; J D Thompson; A Blocker; T Kouzarides
Journal:  Nucleic Acids Res       Date:  1994-03-25       Impact factor: 16.971

9.  1H-NMR assignment and secondary structure of human insulin-like growth factor-I (IGF-I) in solution.

Authors:  A Sato; S Nishimura; T Ohkubo; Y Kyogoku; S Koyama; M Kobayashi; T Yasuda; Y Kobayashi
Journal:  J Biochem       Date:  1992-04       Impact factor: 3.387

10.  Complementation of fission yeast cdc2ts and cdc25ts mutants identifies two cell cycle genes from Drosophila: a cdc2 homologue and string.

Authors:  J Jimenez; L Alphey; P Nurse; D M Glover
Journal:  EMBO J       Date:  1990-11       Impact factor: 11.598

View more
  4 in total

1.  The identification of Wos2, a p23 homologue that interacts with Wee1 and Cdc2 in the mitotic control of fission yeasts.

Authors:  M J Muñoz; E R Bejarano; R R Daga; J Jimenez
Journal:  Genetics       Date:  1999-12       Impact factor: 4.562

2.  A computational strategy for protein function assignment which addresses the multidomain problem.

Authors:  A J Pérez; A Rodríguez; O Trelles; G Thode
Journal:  Comp Funct Genomics       Date:  2002

3.  Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.

Authors:  Derek Gatherer
Journal:  Bioinform Biol Insights       Date:  2009-11-24

4.  AnABlast: a new in silico strategy for the genome-wide search of novel genes and fossil regions.

Authors:  Juan Jimenez; Caia D S Duncan; María Gallardo; Juan Mata; Antonio J Perez-Pulido
Journal:  DNA Res       Date:  2015-10-21       Impact factor: 4.458

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.