| Literature DB >> 19108706 |
Philip Wong1, Sonja Althammer, Andrea Hildebrand, Andreas Kirschner, Philipp Pagel, Bernd Geissler, Pawel Smialowski, Florian Blöchl, Matthias Oesterheld, Thorsten Schmidt, Normann Strack, Fabian J Theis, Andreas Ruepp, Dmitrij Frishman.
Abstract
BACKGROUND: We have recently released a comprehensive, manually curated database of mammalian protein complexes called CORUM. Combining CORUM with other resources, we assembled a dataset of over 2700 mammalian complexes. The availability of a rich information resource allows us to search for organizational properties concerning these complexes.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19108706 PMCID: PMC2645396 DOI: 10.1186/1471-2164-9-629
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Composition of mammalian complexes. A) The number of complexes (y-axis) with a particular number of unique proteins (x-axis) is plotted. B) The number of proteins (y-axis) participating in a particular number of complexes (x-axis) is plotted.
Figure 2Protein length in human complexes. A) Length distribution of proteins in human complexes generated randomly from the entire known human proteome. Mean lengths of proteins versus the number of unique subunits in: B) model 1 random complexes generated from the entire proteome C) annotated human complexes D) model 1 random human complexes generated from data we collected.
Figure 3Complexes and selection. dN/dS distribution of genes associated with A) all human proteins and B) proteins in the human complex data. D) The mean dN/dS ratio for human-mouse orthologs is plotted against the number of unique proteins in the complex. Complexes with more unique proteins have a significantly smaller mean dN/dS ratios than those with less unique proteins (t-test: P < 3.1 × 10-7). F) The dN/dS ratio for human-mouse orthologs is plotted against the number of complexes they participate in. Proteins participating in more complexes have significantly lower dN/dS ratios than those with less complex participation (t-test: P < 7.4 × 10-9). C, E) Both trends are rarely observed for model 1 random complexes.
Homogeneity of protein properties in mammalian complexes of 3 or more proteins.
| Protein Complex Data | Property | Mean Stdev. Annotated | Mean Stdev. Random |
| All mammalian | pI | 1.5 ± 0.7 (1.5) | 1.8 ± 0.7 (1.8) |
| All mammalian | % Helix | 14.4 ± 7.8 (13.5) | 18.6 ± 7.5 (18.6) |
| All mammalian | % Sheet | 9.5 ± 5.9 (8.7) | 10.9 ± 5.4 (10.3) |
| All mammalian | % Coil | 10.3 ± 5.0 (9.8) | 13.3 ± 5.6 (13) |
| Human (against mouse) | 0.07 ± 0.05 (0.06) | 0.09 ± 0.06 (0.08) |
The standard deviations of protein property values for each annotated complex are significantly smaller than ones for model 1 random complexes (MC-test using 1000 complexes generated randomly with replacement: P < 0.001). The means ± standard deviation of these standard deviations for each property are shown in the last two columns along with median values (in parentheses).
Figure 4Coverage by binary domain and protein interactions. Coverage is indicated by values in the Venn diagrams. Example: A) Protein content coverage. 71% of the proteins in our complexes with 3 or more unique proteins can be explained by either domain-domain or binary protein interactions. The full protein content of 29% of these complexes can be explained by either interaction-type. 14% of binary ppi can be explained by predicted domain-domain interactions. B) Interaction coverage. 5% of IntAct binary interactions could be explained from predicted domain-domain interactions. 77% of solved protein-protein contacts in our complexes of known structure with 3 or more unique proteins can be explained by either domain-domain or binary protein interactions. All solved contacts in 52% of these complexes can likewise be explained by these interactions.
Figure 5Large proteins tend to interact with relatively smaller partners. For each protein pair of different proteins, the length of the larger protein (protein1) is plotted on the x-axis. A,B,C) The difference in length with its partner (protein2) is plotted on y-axis. Larger proteins tend to have a larger length difference with their partners compared to proteins of smaller size. We see this trend amongst A) IntAct binary protein-protein interactions and B) Random binary interactions generated from the IntAct data C) binary interactions mapped onto all collected mammalian complexes. For all plots A-C, the trends are significant (t-test: P < 0.001).