| Literature DB >> 17538689 |
Luke Hakes1, David L Robertson, Stephen G Oliver, Simon C Lovell.
Abstract
By combining crystallographic information with protein-interaction data obtained through traditional experimental means, this paper determines the most appropriate method for generating protein-interaction networks that incorporate data derived from protein complexes. We propose that a combined method should be considered; in which complexes composed of five chains or less are decomposed using the matrix model, whereas the spoke model is used to derive pairwise interactions for those with six chains or more. The results presented here should improve the accuracy and relevance of studies investigating the topology of protein-interaction networks.Entities:
Year: 2006 PMID: 17538689 PMCID: PMC1838958 DOI: 10.1155/2007/49356
Source DB: PubMed Journal: Comp Funct Genomics ISSN: 1531-6912
Figure 1Possible modeling methodologies for experimentally determined protein complexes: (A) actual topology of protein complex; (B) spoke model, interactions are assigned between bait (blue) and each captured prey; (C) matrix model, all possible interacting pairs are assumed. Balls represent polypeptide chains within a protein complex; lines between balls represent a physical interaction between those chains.
Figure 2Performance of the matrix model on 133 structures with different numbers of unique polypeptide chains. The matrix model performs well for structures ≤ 5 chains, illustrated by the large number of complexes in this region of the graph that are either fully connected (illustrated by the red line) or have large numbers of connections between member chains. Inset: density plot showing complex size distribution, ≈ 50% of all complexes have ≤ 5 unique chains. Tighter contours represent increasing numbers of protein chains.
Scores for each of the PQS structures that passed the filtering criterion. STP is spoke-model true positives, SFP is spoke-model true negatives, SFN is spoke-model false negatives, MTP is matrix-model true positives, MFP is matrix-model false positives, MFN is matrix-model false negatives. Score spoke is overall score for spoke model, score matrix is overall score for matrix model. Bold scores indicate best performing model, underlining is used when both models perform equally well.
| Structure | Description | Chains | STP | SFP | SFN | MTP | MFP | MFN | Score spoke | Score matrix |
|
| ||||||||||
| 1iru | 20S proteasome | 12 | 26 | 40 | 0 | 26 | 40 | 0 |
|
|
| 1k8a | Large ribosomal subunit | 12 | 0 | 0 | 3 | 5 | 61 | 0 | −3 | −56 |
| 1y1v | RNA polymerase II-TFIIs | 12 | 15 | 32 | 3 | 18 | 48 | 0 | −20 | −30 |
| 1n32 | Small ribosomal subunit | 9 | 1 | 14 | 1 | 3 | 33 | 0 | −14 | −30 |
| 1sxj | RFC bound to PCNA | 5 | 6 | 4 | 0 | 6 | 4 | 0 |
|
|
| 1u2v | ARP2/3 | 5 | 4 | 2 | 1 | 6 | 4 | 0 | 1 |
|
| 1id3 | Nucleasome | 4 | 2 | 0 | 3 | 5 | 1 | 0 | −1 |
|
| 1gw5 | AP2 | 3 | 3 | 0 | 0 | 3 | 0 | 0 |
|
|
| 1kyo | Cytochrome BC1 | 3 | 1 | 0 | 1 | 2 | 1 | 0 | 0 |
|
| 1ntk | Cytochrome BC1 | 3 | 0 | 0 | 1 | 1 | 1 | 0 | −1 | −1 |
| 1qo1 | ATP synthase motor | 3 | 1 | 0 | 2 | 3 | 0 | 0 | −1 |
|