| Literature DB >> 17570865 |
Milana Frenkel-Morgenstern1, Rachel Magid, Eran Eyal, Shmuel Pietrokovski.
Abstract
BACKGROUND: Accurate prediction of intra-protein residue contacts from sequence information will allow the prediction of protein structures. Basic predictions of such specific contacts can be further refined by jointly analyzing predicted contacts, and by adding information on the relative positions of contacts in the protein primary sequence.Entities:
Mesh:
Year: 2007 PMID: 17570865 PMCID: PMC1892094 DOI: 10.1186/1471-2105-8-S5-S6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The GARP refinement procedure. Schematic example for the calculation of a mutual clustering coefficient (C) and window averaging. Residues (nodes) are shown as circles, predicted interactions (edges) are shown as black lines, and part of the protein backbone sequence is shown as thick grey lines. The analyzed edge (v, w) is shown in bold with some other edges of the graph. Edges, which connect nodes v and w through a third node, are shown as regular lines. The mutual clustering coefficient compares the number of these edges, or their integrated weights, with the number, or weights, expected from all edges, in which v or w are present. The five top and five bottom nodes form the sequence window for nodes w or v, respectively. Edge (v, w) is given the mean Cvalue of all edges between the two windows.
L/10 best scores accuracy improvements of GARP contact prediction for P2PconPred on the training set using parameters: 5% top scores, W = 2, Geometric unweighted mutual clustering coefficient.
| 0.12 ± 0.018 | 0.22 ± 0.024 | 0.14 ± 0.020 | 0.24 ± 0.029 | |
L/10 best scores accuracy improvements of GARP contact prediction for P2PconPred on the test set using parameters: 5% top scores a, W = 2, Geometric unweighted mutual coefficient.
| 0.12 ± 0.018 | 0.24 ± 0.027 | 0.18 ± 0.024 | 0.26 ± 0.030 | |
| 0.23 ± 0.027 | 0.16 ± 0.021 | - | - | |
a In cases where less then L/10 predictions where reported by GARP, the calculation was repeated starting with the top 10% of scores. This was necessary for no more then five families in predicting contacts within the core.