Literature DB >> 1287653

Quantile distributions of amino acid usage in protein classes.

S Karlin1, B E Blaisdell, P Bucher.   

Abstract

A comparative study of the compositional properties of various protein sets from both cellular and viral organisms is presented. Invariants and contrasts of amino acid usages have been discerned for different protein function classes and for different species using robust statistical methods based on quantile distributions and stochastic ordering relationships. In addition, a quantitative criterion to assess amino acid compositional extremes relative to a reference protein set is proposed and applied. Invariants of amino acid usage relate mainly to the central range of quantile distributions, whereas contrasts occur mainly in the tails of the distributions, especially contrasts between eukaryote and prokaryote species. Influences from genomic constraint are evident, for example, in the arginine:lysine ratios and the usage frequencies of residues encoded by G + C-rich versus A + T-rich codon types. The structurally similar amino acids, glutamate versus aspartate and phenylalanine versus tyrosine, show stochastic dominance relationships for most species protein sets favoring glutamate and phenylalanine respectively. The quantile distribution of hydrophobic amino acid usages in prokaryote data dominates the corresponding quantile distribution in human data. In contrast, glutamate, cysteine, proline and serine usages in human proteins dominate the corresponding quantile distributions in Escherichia coli. E. coli dominates human in the use of basic residues, but no dominance ordering applies to acidic residues. The discussion centers on commonalities and anomalies of the amino acid compositional spectrum in relation to species, function, cellular localization, biochemical and steric attributes, complexity of the amino acid biosynthetic pathway, amino acid relative abundances and founder effects.

Entities:  

Mesh:

Substances:

Year:  1992        PMID: 1287653     DOI: 10.1093/protein/5.8.729

Source DB:  PubMed          Journal:  Protein Eng        ISSN: 0269-2139


  16 in total

1.  Correlation analysis of amino acid usage in protein classes.

Authors:  S Karlin; P Bucher
Journal:  Proc Natl Acad Sci U S A       Date:  1992-12-15       Impact factor: 11.205

2.  Environmental signatures in proteome properties.

Authors:  Luciano Brocchieri
Journal:  Proc Natl Acad Sci U S A       Date:  2004-05-24       Impact factor: 11.205

3.  Assessments of DNA inhomogeneities in yeast chromosome III.

Authors:  S Karlin; B E Blaisdell; R J Sapolsky; L Cardon; C Burge
Journal:  Nucleic Acids Res       Date:  1993-02-11       Impact factor: 16.971

4.  Underlying order in protein sequence organization.

Authors:  A L Berman; E Kolker; E N Trifonov
Journal:  Proc Natl Acad Sci U S A       Date:  1994-04-26       Impact factor: 11.205

5.  Unusual charge configurations in transcription factors of the basic RNA polymerase II initiation complex.

Authors:  S Karlin
Journal:  Proc Natl Acad Sci U S A       Date:  1993-06-15       Impact factor: 11.205

6.  Side-chain entropy and packing in proteins.

Authors:  S Bromberg; K A Dill
Journal:  Protein Sci       Date:  1994-07       Impact factor: 6.725

7.  Improving mass defect filters for human proteins.

Authors:  Melinda L Toumi; Heather Desaire
Journal:  J Proteome Res       Date:  2010-10-01       Impact factor: 4.466

8.  Uniquely localized intra-molecular amino acid concentrations at the glycolytic enzyme catalytic/active centers of Archaea, Bacteria and Eukaryota are associated with their proposed temporal appearances on earth.

Authors:  J Dennis Pollack; David Gerard; Dennis K Pearl
Journal:  Orig Life Evol Biosph       Date:  2013-05-29       Impact factor: 1.950

9.  Why is CpG suppressed in the genomes of virtually all small eukaryotic viruses but not in those of large eukaryotic viruses?

Authors:  S Karlin; W Doerfler; L R Cardon
Journal:  J Virol       Date:  1994-05       Impact factor: 5.103

10.  Isolation and characterization of the gene encoding 2,3-oxidosqualene-lanosterol cyclase from Saccharomyces cerevisiae.

Authors:  Z Shi; C J Buntel; J H Griffin
Journal:  Proc Natl Acad Sci U S A       Date:  1994-07-19       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.