| Literature DB >> 16689689 |
Abstract
The three-dimensional (3D) structure prediction of proteins is an important task in bioinformatics. Finding energy functions that can better represent residue-residue and residue-solvent interactions is a crucial way to improve the prediction accuracy. The widely used contact energy functions mostly only consider the contact frequency between different types of residues; however, we find that the contact frequency also relates to the residue hydrophobic environment. Accordingly, we present an improved contact energy function to integrate the two factors, which can reflect the influence of hydrophobic interaction on the stabilization of protein 3D structure more effectively. Furthermore, a fold recognition (threading) approach based on this energy function is developed. The testing results obtained with 20 randomly selected proteins demonstrate that, compared with common contact energy functions, the proposed energy function can improve the accuracy of the fold template prediction from 20% to 50%, and can also improve the accuracy of the sequence-template alignment from 35% to 65%.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16689689 PMCID: PMC5172539 DOI: 10.1016/s1672-0229(05)03030-5
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Fig. 1Examples of the relationship between the residue contact energy and the residue hydrophobic environment. The x-axis represents twenty types of amino acids, and the y-axis represents the residue contact energy; different curves represent different hydrophobic environment. The letters H, N, and P denote hydrophobic, neutral (uncertain), and polar (hydrophilic), respectively. A. Residue contact energy related to asparagine (ASP). It can be seen that the contact energy of residue pairs containing asparagine tends to be high when the residue pairs are in the HH state (that is, both residues are in the hydrophobic position; the only exception is the pair with cysteine), but tends to be low in the HP or pH state. B. Residue contact energy related to cysteine (CYS). For some residue pairs consisting of cysteine and another residue such as cysteine, leucine, tryosin, or tyrosine, the contact energy is high when the residue pairs are in the PP state, which is distinct from that related to asparagine.
Features of the Twenty Testing Proteins
| PDB ID | Secondary structure | No. of S–S bonds | Group | |
|---|---|---|---|---|
| No. of | No. of | |||
| 3 | 0 | 0 | Better | |
| 3 | 0 | 0 | Better | |
| 3 | 0 | 0 | Better | |
| 0 | 3 | 3 | Better | |
| 2 | 0 | 0 | Better | |
| 1 | 2 | 0 | Better | |
| 2 | 0 | 0 | Better | |
| 3 | 0 | 0 | Better | |
| 2 | 2 | 2 | Better | |
| 2 | 2 | 0 | Better | |
| 0 | 3 | 0 | Better | |
| 1 | 3 | 0 | Better | |
| 5 | 0 | 3 | Better | |
| 5 | 0 | 0 | Better | |
| 0 | 3 | 1 | Equal | |
| 2 | 0 | 2 | Worse | |
| 1 | 3 | 3 | Worse | |
| 0 | 3 | 3 | Worse | |
| 0 | 2 | 2 | Worse | |
| 1 | 3 | 0 | Worse | |
Percentages of the Testing Proteins with Different z-score Ranks
| Energy function | Top 10% | Top 25% | Top 50% |
|---|---|---|---|
| Common energy function | 20% | 55% | 85% |
| Improved energy function | 50% | 80% | 95% |
Fig. 2The flow chart of the threading process in this study.