Literature DB >> 9298646

Comparing the predicted and observed properties of proteins encoded in the genome of Escherichia coli K-12.

A J Link1, K Robison, G M Church.   

Abstract

Mining the emerging abundance of microbial genome sequences for hypotheses is an exciting prospect of "functional genomics". At the forefront of this effort, we compared the predictions of the complete Escherichia coli genomic sequence with the observed gene products by assessing 381 proteins for their mature N-termini, in vivo abundances, isoelectric points, molecular masses, and cellular locations. Two-dimensional gel electrophoresis (2-DE) and Edman sequencing were combined to sequence Coomassie-stained 2-DE spots representing the abundant proteins of wild-type E. coli K-12 strains. Greater than 90% of the abundant proteins in the E. coli proteome lie in a small isoelectric point and molecular mass window of 4-7 and 10-100 kDa, respectively. We identified several highly abundant proteins, YjbJ, YjbP, YggX, HdeA, and AhpC, which would not have been predicted from the genomic sequence alone. Of the 223 uniquely identified loci, 60% of the encoded proteins are proteolytically processed. As previously reported, the initiator methionine was efficiently cleaved when the penultimate amino acid was serine or alanine. In contrast, when the penultimate amino acid was threonine, glycine, or proline, cleavage was variable, and valine did not signal cleavage. Although signal peptide cleavage sites tended to follow predicted rules, the length of the putative signal sequence was occassionally greater than the consensus. For proteins predicted to be in the cytoplasm or inner membrane, the N-terminal amino acids were highly constrained compared to proteins localized to the periplasm or outer membrane. Although cytoplasmic proteins follow the N-end rule for protein stability, proteins in the periplasm or outer membrane do not follow this rule; several have N-terminal amino acids predicted to destabilize the proteins. Surprisingly, 18% of the identified 2-DE spots represent isoforms in which protein products of the same gene have different observed pI and M(r), suggesting they are post-translationally processed. Although most of the predicted and observed values for isoelectric point and molecular mass show reasonable concordance, for several proteins the observed values significantly deviate from the expected values. Such discrepancies may represent either highly processed proteins or misinterpretations of the genomic sequence. Our data suggest that AhpC, CspC, and HdeA exist as covalent homomultimers, and that IcdA exists as at least three isoforms even under conditions in which covalent modification is not predicted. We enriched for proteins based on subcellular location and found several proteins in unexpected subcellular locations.

Entities:  

Mesh:

Substances:

Year:  1997        PMID: 9298646     DOI: 10.1002/elps.1150180807

Source DB:  PubMed          Journal:  Electrophoresis        ISSN: 0173-0835            Impact factor:   3.535


  104 in total

1.  Analysis of protein synthesis rates after initiation of chromosome replication in Escherichia coli.

Authors:  D Bechtloff; B Grünenfelder; T Akerlund; K Nordström
Journal:  J Bacteriol       Date:  1999-10       Impact factor: 3.490

2.  The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities.

Authors:  J S Edwards; B O Palsson
Journal:  Proc Natl Acad Sci U S A       Date:  2000-05-09       Impact factor: 11.205

3.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors:  J Besemer; A Lomsadze; M Borodovsky
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

4.  ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes.

Authors:  Feng-Biao Guo; Hong-Yu Ou; Chun-Ting Zhang
Journal:  Nucleic Acids Res       Date:  2003-03-15       Impact factor: 16.971

5.  A novel alkyl hydroperoxidase (AhpD) of Anabaena PCC7120 confers abiotic stress tolerance in Escherichia coli.

Authors:  Alok Kumar Shrivastava; Shilpi Singh; Prashant Kumar Singh; Sarita Pandey; L C Rai
Journal:  Funct Integr Genomics       Date:  2014-11-13       Impact factor: 3.410

6.  Hierarchy of sequence-dependent features associated with prokaryotic translation.

Authors:  Gila Lithwick; Hanah Margalit
Journal:  Genome Res       Date:  2003-12       Impact factor: 9.043

Review 7.  Contribution of structural genomics to understanding the biology of Escherichia coli.

Authors:  Allan Matte; J Sivaraman; Irena Ekiel; Kalle Gehring; Zongchao Jia; Miroslaw Cygler
Journal:  J Bacteriol       Date:  2003-07       Impact factor: 3.490

8.  Proteomic survey of metabolic pathways in rice.

Authors:  Antonius Koller; Michael P Washburn; B Markus Lange; Nancy L Andon; Cosmin Deciu; Paul A Haynes; Lara Hays; David Schieltz; Ryan Ulaszek; Jing Wei; Dirk Wolters; John R Yates
Journal:  Proc Natl Acad Sci U S A       Date:  2002-08-05       Impact factor: 11.205

9.  Distinct characteristics of two 2-Cys peroxiredoxins of Vibrio vulnificus suggesting differential roles in detoxifying oxidative stress.

Authors:  Ye-Ji Bang; Man Hwan Oh; Sang Ho Choi
Journal:  J Biol Chem       Date:  2012-10-24       Impact factor: 5.157

10.  Controlling and quantifying protein concentration in Escherichia coli.

Authors:  Shannon L Speer; Alex J Guseman; Jon B Patteson; Brandie M Ehrmann; Gary J Pielak
Journal:  Protein Sci       Date:  2019-05-22       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.