Compressing proteomes: the relevance of medium range correlations.

Dario Benedetto, Emanuele Caglioti, Claudia Chica

Abstract

We study the nonrandomness of proteome sequences by analysing the correlations that arise between amino acids at short and medium range, more specifically, between amino acids located 10 or 100 residues apart, respectively. We show that statistical models that consider these two types of correlation are more likely to capture the information contained in protein sequences and thus achieve good compression rates. Finally, we propose that the cause of this redundancy is related to the evolutionary origin of proteomes and protein sequences.
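
As a rough illustration of what a correlation between amino acids a fixed number of residues apart can mean, the sketch below estimates the mutual information between residue pairs separated by a given lag (for example 10 or 100). This is an assumption made for illustration only: the abstract does not say which statistic or model the authors used, and the function name `mutual_information_at_lag` is hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information_at_lag(seq: str, lag: int) -> float:
    """Plug-in estimate, in bits, of the mutual information between
    seq[i] and seq[i + lag]. Illustrative only; not the paper's method."""
    if lag <= 0:
        raise ValueError("lag must be a positive integer")

    # All residue pairs separated by exactly `lag` positions.
    pairs = list(zip(seq, seq[lag:]))
    if not pairs:
        return 0.0

    n = len(pairs)
    joint = Counter(pairs)                 # counts of (a, b) pairs
    left = Counter(a for a, _ in pairs)    # counts of the first residue
    right = Counter(b for _, b in pairs)   # counts of the second residue

    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        # p_ab / (p_a * p_b) simplifies to count * n / (left[a] * right[b])
        mi += p_ab * log2(count * n / (left[a] * right[b]))
    return mi


if __name__ == "__main__":
    toy = "AC" * 50  # toy periodic sequence, not real proteome data
    print(mutual_information_at_lag(toy, 2))
```

A value near zero means that knowing one residue says little about the residue `lag` positions away, while a larger value indicates redundancy that a compressor could in principle exploit. The plug-in estimate is biased upward on short sequences, so it is best read comparatively, for instance against a shuffled copy of the same sequence.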

Year:  2007        PMID: 18256727      PMCID: PMC3171345          DOI: 10.1155/2007/60723

Source DB:  PubMed          Journal:  EURASIP J Bioinform Syst Biol        ISSN: 1687-4145

