Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Using genome-wide protein sequence data to predict amino acid conservation.

Literature DB >> 18792769

Using genome-wide protein sequence data to predict amino acid conservation.

Peter Palenchar¹, Mathew Mount, Douglas Cusato, Jeffery Dougherty.

Abstract

For most proteins, multiple sequence alignments are a viable method to identify functionally and structurally important amino acids, but for most organisms, there is a subset of proteins that are unique or found in a few closely related organisms. For these proteins, it is not possible to produce sequence alignments that are useful in identifying functionally or structurally important amino acids. We have investigated the relationship between amino acid conservation and five factors (the amino acid's identity, N-terminal neighbor, C-terminal neighbor, the local hydropathy of surrounding amino acids, and the local expected net charge of the surrounding amino acids based on the primary sequence) in Escherichia coli proteins. For four of the factors examined (all but the amino acid's identity), there is a significant relationship with conservation for some of the standard 20 amino acids. Using the combination of all five factors, we show that it is possible to calculate a score based on the primary sequences of a subset of E. coli proteins that has statistically significant predictive value with respect to predicting conserved amino acids in other E. coli proteins and Saccharomyces cerevisiae proteins. As these five variables show significant relationships with conservation, we have termed them conservation factors.

Entities: Species

Mesh：

Substances：

Year: 2008 PMID： 18792769 DOI： 10.1007/s10930-008-9150-3

Source DB: PubMed Journal: Protein J ISSN： 1572-3887 Impact factor: 2.371

10 in total

Review 1. Bioinformatics in protein analysis.

Authors: B Persson
Journal: EXS Date: 2000

2. Predicting protein--protein interactions from primary structure.

Authors: J R Bock; D A Gough
Journal: Bioinformatics Date: 2001-05 Impact factor: 6.937

3. Nucleotide sequence of the gene for a fibronectin-binding protein from Staphylococcus aureus: use of this peptide sequence in the synthesis of biologically active peptides.

Authors: C Signäs; G Raucci; K Jönsson; P E Lindgren; G M Anantharamaiah; M Höök; M Lindberg
Journal: Proc Natl Acad Sci U S A Date: 1989-01 Impact factor: 11.205

4. Probabilistic prediction of protein-protein interactions from the protein sequences.

Authors: Arunkumar Chinnasamy; Ankush Mittal; Wing-Kin Sung
Journal: Comput Biol Med Date: 2005-10-25 Impact factor: 4.589

5. An assessment of protein secondary structure prediction methods based on amino acid sequence.

Authors: P Argos; J Schwarz; J Schwarz
Journal: Biochim Biophys Acta Date: 1976-08-09

Review 6. Prediction of the secondary structure of proteins from their amino acid sequence.

Authors: P Y Chou; G D Fasman
Journal: Adv Enzymol Relat Areas Mol Biol Date: 1978

7. A simple method for displaying the hydropathic character of a protein.

Authors: J Kyte; R F Doolittle
Journal: J Mol Biol Date: 1982-05-05 Impact factor: 5.469

8. Amino acid biases in the N- and C-termini of proteins are evolutionarily conserved and are conserved between functionally related proteins.

Authors: Peter M Palenchar
Journal: Protein J Date: 2008-08 Impact factor: 2.371

9. Identification of homologs in insignificant blast hits by exploiting extrinsic gene properties.

Authors: Jos Boekhorst; Berend Snel
Journal: BMC Bioinformatics Date: 2007-09-21 Impact factor: 3.169

10. The COG database: an updated version includes eukaryotes.

Authors: Roman L Tatusov; Natalie D Fedorova; John D Jackson; Aviva R Jacobs; Boris Kiryutin; Eugene V Koonin; Dmitri M Krylov; Raja Mazumder; Sergei L Mekhedov; Anastasia N Nikolskaya; B Sridhar Rao; Sergei Smirnov; Alexander V Sverdlov; Sona Vasudevan; Yuri I Wolf; Jodie J Yin; Darren A Natale
Journal: BMC Bioinformatics Date: 2003-09-11 Impact factor: 3.169

10 in total

2 in total

1. Sequence conservation in the prediction of catalytic sites.

Authors: Yongchao Dou; Xingbo Geng; Hongyun Gao; Jialiang Yang; Xiaoqi Zheng; Jun Wang
Journal: Protein J Date: 2011-04 Impact factor: 2.371

2. Random acceleration and steered molecular dynamics simulations reveal the (un)binding tunnels in adenosine deaminase and critical residues in tunnels.

Authors: Yue Pan; Renrui Qi; Minghao Li; Bingda Wang; Honglan Huang; Weiwei Han
Journal: RSC Adv Date: 2020-12-11 Impact factor: 4.036

2 in total