Literature DB >> 18792769

Using genome-wide protein sequence data to predict amino acid conservation.

Peter Palenchar1, Mathew Mount, Douglas Cusato, Jeffery Dougherty.   

Abstract

For most proteins, multiple sequence alignments are a viable method to identify functionally and structurally important amino acids, but for most organisms, there is a subset of proteins that are unique or found in a few closely related organisms. For these proteins, it is not possible to produce sequence alignments that are useful in identifying functionally or structurally important amino acids. We have investigated the relationship between amino acid conservation and five factors (the amino acid's identity, N-terminal neighbor, C-terminal neighbor, the local hydropathy of surrounding amino acids, and the local expected net charge of the surrounding amino acids based on the primary sequence) in Escherichia coli proteins. For four of the factors examined (all but the amino acid's identity), there is a significant relationship with conservation for some of the standard 20 amino acids. Using the combination of all five factors, we show that it is possible to calculate a score based on the primary sequences of a subset of E. coli proteins that has statistically significant predictive value with respect to predicting conserved amino acids in other E. coli proteins and Saccharomyces cerevisiae proteins. As these five variables show significant relationships with conservation, we have termed them conservation factors.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18792769     DOI: 10.1007/s10930-008-9150-3

Source DB:  PubMed          Journal:  Protein J        ISSN: 1572-3887            Impact factor:   2.371


  10 in total

Review 1.  Bioinformatics in protein analysis.

Authors:  B Persson
Journal:  EXS       Date:  2000

2.  Predicting protein--protein interactions from primary structure.

Authors:  J R Bock; D A Gough
Journal:  Bioinformatics       Date:  2001-05       Impact factor: 6.937

3.  Nucleotide sequence of the gene for a fibronectin-binding protein from Staphylococcus aureus: use of this peptide sequence in the synthesis of biologically active peptides.

Authors:  C Signäs; G Raucci; K Jönsson; P E Lindgren; G M Anantharamaiah; M Höök; M Lindberg
Journal:  Proc Natl Acad Sci U S A       Date:  1989-01       Impact factor: 11.205

4.  Probabilistic prediction of protein-protein interactions from the protein sequences.

Authors:  Arunkumar Chinnasamy; Ankush Mittal; Wing-Kin Sung
Journal:  Comput Biol Med       Date:  2005-10-25       Impact factor: 4.589

5.  An assessment of protein secondary structure prediction methods based on amino acid sequence.

Authors:  P Argos; J Schwarz; J Schwarz
Journal:  Biochim Biophys Acta       Date:  1976-08-09

Review 6.  Prediction of the secondary structure of proteins from their amino acid sequence.

Authors:  P Y Chou; G D Fasman
Journal:  Adv Enzymol Relat Areas Mol Biol       Date:  1978

7.  A simple method for displaying the hydropathic character of a protein.

Authors:  J Kyte; R F Doolittle
Journal:  J Mol Biol       Date:  1982-05-05       Impact factor: 5.469

8.  Amino acid biases in the N- and C-termini of proteins are evolutionarily conserved and are conserved between functionally related proteins.

Authors:  Peter M Palenchar
Journal:  Protein J       Date:  2008-08       Impact factor: 2.371

9.  Identification of homologs in insignificant blast hits by exploiting extrinsic gene properties.

Authors:  Jos Boekhorst; Berend Snel
Journal:  BMC Bioinformatics       Date:  2007-09-21       Impact factor: 3.169

10.  The COG database: an updated version includes eukaryotes.

Authors:  Roman L Tatusov; Natalie D Fedorova; John D Jackson; Aviva R Jacobs; Boris Kiryutin; Eugene V Koonin; Dmitri M Krylov; Raja Mazumder; Sergei L Mekhedov; Anastasia N Nikolskaya; B Sridhar Rao; Sergei Smirnov; Alexander V Sverdlov; Sona Vasudevan; Yuri I Wolf; Jodie J Yin; Darren A Natale
Journal:  BMC Bioinformatics       Date:  2003-09-11       Impact factor: 3.169

  10 in total
  2 in total

1.  Sequence conservation in the prediction of catalytic sites.

Authors:  Yongchao Dou; Xingbo Geng; Hongyun Gao; Jialiang Yang; Xiaoqi Zheng; Jun Wang
Journal:  Protein J       Date:  2011-04       Impact factor: 2.371

2.  Random acceleration and steered molecular dynamics simulations reveal the (un)binding tunnels in adenosine deaminase and critical residues in tunnels.

Authors:  Yue Pan; Renrui Qi; Minghao Li; Bingda Wang; Honglan Huang; Weiwei Han
Journal:  RSC Adv       Date:  2020-12-11       Impact factor: 4.036

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.