| Literature DB >> 15983135 |
Iaroslav Ispolatov1, Anton Yuryev, Ilya Mazo, Sergei Maslov.
Abstract
We demonstrate that protein-protein interaction networks in several eukaryotic organisms contain significantly more self-interacting proteins than expected if such homodimers randomly appeared in the course of the evolution. We also show that on average homodimers have twice as many interaction partners than non-self-interacting proteins. More specifically, the likelihood of a protein to physically interact with itself was found to be proportional to the total number of its binding partners. These properties of dimers are in agreement with a phenomenological model, in which individual proteins differ from each other by the degree of their 'stickiness' or general propensity toward interaction with other proteins including oneself. A duplication of self-interacting proteins creates a pair of paralogous proteins interacting with each other. We show that such pairs occur more frequently than could be explained by pure chance alone. Similar to homodimers, proteins involved in heterodimers with their paralogs on average have twice as many interacting partners than the rest of the network. The likelihood of a pair of paralogous proteins to interact with each other was also shown to decrease with their sequence similarity. This points to the conclusion that most of interactions between paralogs are inherited from ancestral homodimeric proteins, rather than established de novo after duplication. We finally discuss possible implications of our empirical observations from functional and evolutionary standpoints.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15983135 PMCID: PMC1160523 DOI: 10.1093/nar/gki678
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Estimated total number of proteins Ntotal, number of proteins involved in the PPI networks NPP1, the number of dimers or self-interacting proteins Ndimer, the average network degree (the number of neighbors) 〈k〉 over all NPP1 and the average degree 〈k〉dimer of self-interacting proteins
| Species | 〈 | 〈 | |||
|---|---|---|---|---|---|
| Yeast | 6713 | 4876 | 179 | 6.6 ± 0.2 | 12.4 ± 1.2 |
| Worm | 22 268 | 3137 | 89 | 3.3 ± 0.1 | 13.1 ± 2.2 |
| Fly | 26 148 | 6962 | 160 | 5.9 ± 0.1 | 14.2 ± 1.2 |
| Human | 25 000–50 000 | 5331 | 1045 | 5.7 ± 0.1 | 14.0 ± 0.6 |
To avoid an obvious bias for homodimers, the degree k does not count the protein itself among its binding partners.
Figure 1The likelihood Pdimer(k) of a fly protein of a given degree k to self-interact plotted versus its degree k in the PPI network. The dashed line is the linear fit Pdimer(k) = 0.0035 k. To improve the statistics, the degree k in this and subsequent figures is logarithmically binned.
Figure 2The fraction of homodimers Pdimer(k) among human proteins as a function of their degree k. Dashed and dotted–dashed lines are fits with the Equation 1 and pself = 0.035 and pself = 0.055 correspondingly. The second value provides the best fit overall, while the first value better fits the low k region. The inset demonstrates the linear scaling of log[1 − Pdimer(k)] with k in the large k region as expected from the Equation 1. The solid line corresponds to pself = 0.051.
Figure 3The fraction of homodimers among yeast (open squares) and worm (filled circles) proteins plotted versus their total number of binding partners. The solid line corresponds to the linear fit with to the worm data.
The number of linked pairs of paralogous proteins nlinked paralogs, the number of linked pairs nlinked paralogs expected by pure chance alone, the average degree 〈k〉linked paralogs of proteins known to interact with some of their paralogs and the average degree 〈k〉dimer of self-interacting (dimer) proteins
| Species | 〈 | 〈 | |||
|---|---|---|---|---|---|
| Yeast | 3409 | 251 | 4 ± 2 | 14.3 ± 1.9 | 12.4 ± 1.2 |
| Fly | 12 991 | 142 | 11 ± 3 | 11.1 ± 1.0 | 14.2 ± 1.2 |
| Worm | 3480 | 105 | 3 ± 2 | 5.8 ± 0.9 | 13.1 ± 2.2 |
| Human | 21 562 | 1280 | 24 ± 5 | 10.2 ± 0.6 | 14.0 ± 0.6 |
Figure 4The probability for two paralogous proteins to bind to each other Plinked paralogs versus their sequence similarity s for (top to the bottom) human, yeast, worm and fly. Even the most distant paralogs are more likely to interact with each other than a randomly selected pair of proteins. Such randomly expected probability is equal to 1.1 × 10−3 in the human, 1.3 × 10−3 in the yeast, 1.1 × 10−3 in the worm and 0.8 × 10−3 in the fly dataset.
Numbers of certain types of proteins for yeast, worm, fly and human
| Species | ||||||
|---|---|---|---|---|---|---|
| Yeast | 4876 | 1682 | 321 | 179 | 67 | 36 |
| Worm | 3137 | 1578 | 143 | 89 | 47 | 13 |
| Fly | 6962 | 2951 | 169 | 160 | 59 | 17 |
| Human | 5331 | 3840 | 1548 | 1045 | 789 | 460 |
NPP1, proteins present in the network; NPP1−p, network proteins with at least one paralog present in the network; N1−p, proteins linked to at least one of their paralogs; Ndimer, homodimers; Nd−p, homodimers that have at least one paralog among network proteins; Nd−1−p, homodimers linked to at least one of their paralogs.