| Literature DB >> 16938899 |
Abstract
Simple eukaryotic cells such as yeast could contain around 800 protein complexes, as two new comprehensive studies show. But slightly different approaches resulted in surprising differences between the two datasets, showing that more work is required to get a complete picture of the yeast interactome.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16938899 PMCID: PMC1779532 DOI: 10.1186/gb-2006-7-6-223
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Comparison of two projects aimed at determining the number of protein complexes in S. cerevisiae
| Gavin | Krogan | |
| Number of proteins TAP-targeted | 6,466 | ? |
| TAP fusion expression/purification attempts | 3,206 | 4,562 |
| Successful purifications | ? | 2,357* |
| Proteins with more than zero partners | 1,993 | ? |
| Distinct proteins identified | 2,760 | 4,087 (2,708 in core set) |
| 'Distinct' complexes identified† | 491 | 547 |
| Average number of proteins per complex | 3.1 (core proteins) | 4.9 |
*At least one protein was identified in each of these purifications; 1,613 baits were successfully analyzed using SDS-PAGE and matrix-assisted laser desorption ionization mass spectrometry (MALDI) and 2,001 baits by liquid chromatography followed by tandem mass spectrometry (LC-MS/MS); 1,257 purifications were successful using both, 356 using only MALDI and 744 using only LC-MS/MS. †Note that these are not necessarily distinct physical complexes but computationally derived complexes. A question mark indicates that a number for this was not given in the article.
Figure 1The difference between complex purifications and protein complexes. (a,b) When two proteins belonging to the same complex are tagged in independent yeast strains and the other components of the complex identified, the two purifications rarely return precisely the same list of components. (c,d) Although proteins in a complex are associated, it is usually unclear which proteins interact directly with each other. To predict direct interactions, either the matrix (c) or spoke model (d) is applied to lists of co-purified proteins. To evaluate such interactions Gavin et al. [1] invented the socio-affinity index (SAI). In brief, the SAI quantifies the tendency for a protein pair (for example, Aro1 and Gfa1) to identify each other when one of them is tagged (as in b) and to co-purify when other proteins are tagged (as in a) relative to what would be expected from their frequency in the dataset: that is, how many times this protein was found as prey. High-affinity SAI values result when both proteins co-purify when either one is tagged (without co-purifying many other proteins) and when both are always seen together in purifications made with other baits. (e) For this particular complex one core, two modules and two attachments have been identified. Note that modules cannot be computed from only two purifications; the assemblies of Figure 1e are derived from the eight purifications shown in Table 2 and additional purifications not shown.
Purifications leading to the definition of complex 314 in the study by Gavin et al. [1]
| Preys | |||||||||
| Baits | Aco1 | Aha1 | Aro1 | Cct8 | Gfa1 | Pfk1 | Pfk2 | Yhb1 | Additional |
| Aco1 | - | - | - | - | - | - | - | - | - |
| Aha1 | - | x | x | - | x | x | - | - | +26 |
| Aro1 | - | - | x | x | x | x | - | x | +24 |
| Cct8 | - | - | - | - | - | - | - | - | - |
| Gfa1 | - | - | - | - | - | - | - | - | - |
| Pfk1 | - | - | - | - | - | - | - | - | - |
| Pfk2 | - | - | - | - | - | x | x | - | +3 |
| Yhb1 | x | - | - | - | x | - | - | x | +18 |
| Prey count | 10 | 7 | 17 | 10 | 106 | 30 | 21 | 13 | |
Purifications that lead to the definition of complex 314 as described by Gavin et al. [1] and in Figure 1e. Each line represents a single purification with the bait indicated. For each of the bait proteins, the columns indicate the prey proteins (x) associated with the bait or not found with the bait (-). Complex 314 consists of the core proteins Aro1 and Cct8, which were found in two purifications of which only one (bait: Aro1) is shown here. Aco1 was found in 10 purifications, that is, in 9 purifications in addition to the one using Yhb1 as bait. Only baits that were included in the 'final' complex by the SAI algorithm are shown in this table. In addition, the modules 103 (Pfk1-Pfk2, found in five purifications) and 114 (Aco1-Yhb1, found in three purifications) associated with the core, as indicated by the co-purification of Pfk2 with Pfk1 and of Aco1 with Yhb1. Finally, Aha1 and Gfa1 were classified as attachments because they were not found consistently associated with any of the other components and thus could not be classified as core or module. Note that Aha1, Aro1 and Yhb1 had many more proteins co-purified when they were used as baits than when they were prey. For example, when Aha1 was used as a bait, the four proteins Aha1, Aro1, Gfa1 and Pfk1 were identified as binding to it, plus another 26 proteins not shown here; these are indicated in the Additional column. Purifications with Aco1, Cct8, Gfa1 and Pfk1 as baits appear to have been unsuccessful. Note that the information in this table is not sufficient to derive the complex shown in Figure 1e but also requires information from additional purifications only indicated in the row 'Prey count' and in the column 'Additional'.
Figure 2Protein complexes, purification data and two-hybrid interactions. Protein complex 285 from Krogan et al. [2] (composed of Spc72, Spc97 and Tub4 - the proteins in the area shaded yellow) was compared with complex 219 from Gavin et al. [1] (composed of all the proteins in the yellow and pink areas). Note that the two 'complexes' have been derived from many purifications by computational means and do not necessarily represent physical entities. The purifications from [1] using Spc97 and Tub4 as baits did not produce completely overlapping prey sets (there was no purification reported with Spc72 as bait); for example, only Tub4 but not Spc97 co-purified with Tef1 and Pfk1, whereas Spc97 but not Tub4 pulled down Spc72. Independently, two-hybrid screens have found a number of interactions between the members of these complexes and with other proteins, as indicated by arrows (pointing from the bait to the prey). The two-hybrid data are from [11,18-20].
Figure 3The two TAP studies yield dramatically different protein complexes. (a) Comparison of the composition of most similar complexes from Gavin et al. [1] and Krogan et al. [2]. Each circle represents a comparison of a complex from [1] to its most similar counterpart (that is, the complex that shares most of its proteins) in the dataset of [2]. The size of the complex is plotted along the x-axis and the percentage overlap in composition with the complex from [2] is plotted along the y-axis. The diamonds represent the same exercise carried out for complexes from [2] compared with their most similar counterparts from [1]. As an example, complex 219 (solid black circle) is 60% identical to complex 285 (solid black circle), whereas complex 285 has a 100% overlap with 219 (Figure 2). As each symbol refers to a one-way comparison, symbols may be superimposed but usually refer to different complexes/comparisons. (b) The pie-chart on the left shows the overlap between the 491 complexes identified in Gavin et al. [1], including cores, modules and attachments, and the 547 complexes from Krogan et al. [2], whereas the pie-chart on the right presents the converse analysis. For example, from the right-hand chart, there are 77 complexes reported in [2] that each have 50-90% of their proteins contained in a complex reported in [1]. In the left-hand chart, complex 219 from [1], with an overlap of 60% with complex 285 from [2] would be one of the 46 complexes in the slice showing the overlap range 50-90%. Conversely, complex 285 is one of the 48 complexes from [2] that overlap >90% with one of the complexes from [1], in this case complex 219. More details, such as which complexes are related to which, are available at [21].