Literature DB >> 8007006

The evolution of proteins from random amino acid sequences: II. Evidence from the statistical distributions of the lengths of modern protein sequences.

S H White1.   

Abstract

This paper continues an examination of the hypothesis that modern proteins evolved from random heteropeptide sequences. In support of the hypothesis, White and Jacobs (1993, J Mol Evol 36:79-95) have shown that any sequence chosen randomly from a large collection of nonhomologous proteins has a 90% or better chance of having a lengthwise distribution of amino acids that is indistinguishable from the random expectation regardless of amino acid type. The goal of the present study was to investigate the possibility that the random-origin hypothesis could explain the lengths of modern protein sequences without invoking specific mechanisms such as gene duplication or exon splicing. The sets of sequences examined were taken from the 1989 PIR database and consisted of 1,792 "super-family" proteins selected to have little sequence identity, 623 E. coli sequences, and 398 human sequences. The length distributions of the proteins could be described with high significance by either of two closely related probability density functions: The gamma distribution with parameter 2 or the distribution for the sum of two exponential random independent variables. A simple theory for the distributions was developed which assumes that (1) protoprotein sequences had exponentially distributed random independent lengths, (2) the length dependence of protein stability determined which of these protoproteins could fold into compact primitive proteins and thereby attain the potential for biochemical activity, (3) the useful protein sequences were preserved by the primitive genome, and (4) the resulting distribution of sequence lengths is reflected by modern proteins. The theory successfully predicts the two observed distributions which can be distinguished by the functional form of the dependence of protein stability on length. The theory leads to three interesting conclusions. First, it predicts that a tetra-nucleotide was the signal for primitive translation termination. This prediction is entirely consistent with the observations of Brown et al. (1990a,b, Nucleic Acids Res 18:2079-2086 and 18: 6339-6345) which show that tetra-nucleotides (stop codon plus following nucleotide) are the actual signals for termination of translation in both prokaryotes and eukaryotes. Second, the strong dependence of statistical length distributions on sequence-termination signaling codes implies that the evolution of stop codons and translation-termination processes was as important as gene splicing in early evolution. Third, because the theory is based upon a simple no-exon stochastic model, it provides a plausible alternative to a limited universe of exons from which all proteins evolved by gene duplication and exon splicing (Dorit et al. 1990, Science 250:1377-1382).

Entities:  

Mesh:

Substances:

Year:  1994        PMID: 8007006     DOI: 10.1007/BF00163155

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  35 in total

1.  Sequence analysis suggests that tetra-nucleotides signal the termination of protein synthesis in eukaryotes.

Authors:  C M Brown; P A Stockwell; C N Trotman; W P Tate
Journal:  Nucleic Acids Res       Date:  1990-11-11       Impact factor: 16.971

2.  Evolution of the structure of ferredoxin based on living relics of primitive amino Acid sequences.

Authors:  R V Eck; M O Dayhoff
Journal:  Science       Date:  1966-04-15       Impact factor: 47.728

Review 3.  On the origin of RNA splicing and introns.

Authors:  P A Sharp
Journal:  Cell       Date:  1985-09       Impact factor: 41.582

4.  Implications of thermodynamics of protein folding for evolution of primary sequences.

Authors:  E I Shakhnovich; A M Gutin
Journal:  Nature       Date:  1990-08-23       Impact factor: 49.962

5.  Do exons code for structural or functional units in proteins?

Authors:  T W Traut
Journal:  Proc Natl Acad Sci U S A       Date:  1988-05       Impact factor: 11.205

Review 6.  Evolution of the genetic code as affected by anticodon content.

Authors:  S Osawa; T H Jukes
Journal:  Trends Genet       Date:  1988-07       Impact factor: 11.639

7.  Evolution of anticodons: variations in the genetic code.

Authors:  T H Jukes; S Osawa; A Muto; N Lehman
Journal:  Cold Spring Harb Symp Quant Biol       Date:  1987

8.  Repeating sequences and gene duplication in proteins.

Authors:  A D McLachlan
Journal:  J Mol Biol       Date:  1972-03-14       Impact factor: 5.469

9.  The evolution of proteins from random amino acid sequences. I. Evidence from the lengthwise distribution of amino acids in modern protein sequences.

Authors:  S H White; R E Jacobs
Journal:  J Mol Evol       Date:  1993-01       Impact factor: 2.395

10.  Origins of structure in globular proteins.

Authors:  H S Chan; K A Dill
Journal:  Proc Natl Acad Sci U S A       Date:  1990-08       Impact factor: 11.205

View more
  9 in total

Review 1.  Reverse transcriptase: mediator of genomic plasticity.

Authors:  J Brosius; H Tiedge
Journal:  Virus Genes       Date:  1995       Impact factor: 2.332

2.  tRNA acceptor stem and anticodon bases form independent codes related to protein folding.

Authors:  Charles W Carter; Richard Wolfenden
Journal:  Proc Natl Acad Sci U S A       Date:  2015-06-01       Impact factor: 11.205

3.  Segmented structure of protein sequences and early evolution of genome by combinatorial fusion of DNA elements.

Authors:  E N Trifonov
Journal:  J Mol Evol       Date:  1995-03       Impact factor: 2.395

4.  A variant upstream of IFNL3 (IL28B) creating a new interferon gene IFNL4 is associated with impaired clearance of hepatitis C virus.

Authors:  Ludmila Prokunina-Olsson; Brian Muchmore; Wei Tang; Ruth M Pfeiffer; Heiyoung Park; Harold Dickensheets; Dianna Hergott; Patricia Porter-Gill; Adam Mumy; Indu Kohaar; Sabrina Chen; Nathan Brand; McAnthony Tarway; Luyang Liu; Faruk Sheikh; Jacquie Astemborski; Herbert L Bonkovsky; Brian R Edlin; Charles D Howell; Timothy R Morgan; David L Thomas; Barbara Rehermann; Raymond P Donnelly; Thomas R O'Brien
Journal:  Nat Genet       Date:  2013-01-06       Impact factor: 38.330

5.  Modern and prebiotic amino acids support distinct structural profiles in proteins.

Authors:  Vyacheslav Tretyachenko; Jiří Vymětal; Tereza Neuwirthová; Jiří Vondrášek; Kosuke Fujishima; Klára Hlouchová
Journal:  Open Biol       Date:  2022-06-22       Impact factor: 7.124

6.  Mathematical modeling and comparison of protein size distribution in different plant, animal, fungal and microbial species reveals a negative correlation between protein size and protein number, thus providing insight into the evolution of proteomes.

Authors:  Axel Tiessen; Paulino Pérez-Rodríguez; Luis José Delaye-Arredondo
Journal:  BMC Res Notes       Date:  2012-02-01

Review 7.  Peptides before and during the nucleotide world: an origins story emphasizing cooperation between proteins and nucleic acids.

Authors:  Stephen D Fried; Kosuke Fujishima; Mikhail Makarov; Ivan Cherepashuk; Klara Hlouchova
Journal:  J R Soc Interface       Date:  2022-02-09       Impact factor: 4.118

8.  Virtual 2D map of cyanobacterial proteomes.

Authors:  Tapan Kumar Mohanta; Yugal Kishore Mohanta; Satya Kumar Avula; Amilia Nongbet; Ahmed Al-Harrasi
Journal:  PLoS One       Date:  2022-10-03       Impact factor: 3.752

9.  Randomness in Sequence Evolution Increases over Time.

Authors:  Guangyu Wang; Shixiang Sun; Zhang Zhang
Journal:  PLoS One       Date:  2016-05-25       Impact factor: 3.240

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.