| Literature DB >> 17640363 |
Benjamin Schuster-Böckler1, Alex Bateman.
Abstract
BACKGROUND: Protein interactions are thought to be largely mediated by interactions between structural domains. Databases such as iPfam relate interactions in protein structures to known domain families. Here, we investigate how the domain interactions from the iPfam database are distributed in protein interactions taken from the HPRD, MPact, BioGRID, DIP and IntAct databases.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17640363 PMCID: PMC1940023 DOI: 10.1186/1471-2105-8-259
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Comparison of coverage of iPfam domain pairs on protein interactions. For each species, the height of the column reflects the number of known protein–protein interactions in the data set. The columns are split according to the proportion of interactions that contain an iPfam domain pair (blue), that contain any other Pfam domains on both proteins (red), and those that contain no Pfam domain pair (yellow).
iPfam domain pair coverage on protein interactions
| 4314 | 26.96% | 1281 | 211 | 178 | 7.12 | |
| 5780 | 92.72% | 45707 | 2045 | 528 | 57.49 | |
| 22437 | 13.47% | 5310 | 221 | 76 | 9.90 | |
| 16251 | 43.22% | 21921 | 641 | 195 | 21.79 | |
| 38213 | 21.40% | 24065 | 4577 | 1373 | 116.86 |
For each species, we list the size of the proteome as defined in Integr8 and the fraction of this proteome that is represented in the protein interaction sets, followed by the total number of binary protein interactions and the fraction of those that contain an iPfam domain pair. The last two columns show the results of the network shuffling experiments.
Figure 2Frequencies of iPfam domain pairs in E. coli, S. cerevisiae and H. sapiens protein interactions. Each point in this graph represents a set of protein interactions. The abscissa reflects the number of interactions in each set that contain the same iPfam domain pair. The ordinate shows the number of distinct such sets, each defined by a different iPfam domain pair. In both H. sapiens (blue) and S. cerevisiae (green) a small number of iPfam domain pairs covers a large fraction of the interactome, whereas in E. coli, no iPfam domain occurs in more than 4 experimental interactions at a time. Dotted lines denote fitted monomial functions, showing that the distributions follow a power law.
Figure 3Average frequency of iPfam domain pairs relative to degree of node. Each point represents a protein in the interaction networks of H. sapiens (blue) and S. cerevisiae (green). For each protein, we calculate the degree, defined as the number of interactions the protein is involved in. On the y-axis, we show the average number of iPfam domain pairs in edges adjacent to proteins of degree x. We calculated a Spearman correlation of 0.68 and 0.71, for H. sapiens and S. cerevisiae. The correlation is outlined by dotted lines.
Matrix of mutual shared iPfam domain pairs
| 158 | 35 | 30 | 135 | 347 | ||
| 129 | 164 | 524 | 835 | |||
| 102 | 172 | 197 | ||||
| 241 | 266 | |||||
| 1221 |
The Table shows the number of co-occurences of iPfam domain pairs between two species. The right-most column lists the total number of unique iPfam pairs found in each species' experimental interactions.
Figure 4Venn diagramm showing the fractions of iPfam domain pairs found in the E. coli, S. cerevisiae and H. sapiens binary protein interaction sets. The three circles represent the iPfam domain pairs observed in the respective species. The overlaps denote co-observed iPfam domain pairs. The grey set in the background represents iPfam domain pairs not found in the three species.
Prediction of total number of iPfam domain pairs
| Species | Θ | Ψ | Ξ | ||||||
| 1281 | 211 | 4314 | 1163 | 347 | 8957 | 2070 | |||
| 45707 | 2045 | 5780 | 5359 | 835 | 8957 | 2119 | |||
| 5310 | 221 | 22437 | 3022 | 197 | 8957 | 2612 | |||
| 21921 | 641 | 16251 | 7023 | 266 | 8957 | 2777 | |||
| 24065 | 4577 | 38213 | 8179 | 1221 | 8957 | 3476 |
ΘThe total number of observed interactions in species S
θThe number of observed interactions in species S that contain an iPfam domain pair
ΨThe proteome size for species S
ψThe number of proteins from species S that are seen in an interaction screen
χThe number of iPfam domain pairs observed in species S
The predicted total number of iPfam domain pairs in species S
Ξ The total number of known Pfam domains
ζThe number of Pfam domains observed in all protein of species S
The estimated total number of iPfam domains in all species
Prediction results are shown in bold font.
Figure 5Species distribution of iPfam domain pairs. This pie chart shows how many iPfam domain pairs were found in PDB structures from each species. The total number is larger than the 4030 unique iPfam pairs in the database because an iPfam pair can be found in structures from several species.