| Literature DB >> 19563678 |
Stefanie De Bodt1, Sebastian Proost, Klaas Vandepoele, Pierre Rouzé, Yves Van de Peer.
Abstract
BACKGROUND: Large-scale identification of the interrelationships between different components of the cell, such as the interactions between proteins, has recently gained great interest. However, unraveling large-scale protein-protein interaction maps is laborious and expensive. Moreover, assessing the reliability of the interactions can be cumbersome.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19563678 PMCID: PMC2719670 DOI: 10.1186/1471-2164-10-288
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Interolog detection and filtering. Proteins in similar color belong to the same orthologous group as identified by OrthoMCL. Connecting lines indicate predicted interactions between proteins. First, all possible combinations between proteins in the two orthologous groups are predicted. Second, some connections between proteins do not hold true when considering the genomic features e.g. GO cellular component. The result is the filtered interactome (solid lines), which extended to the predicted interactome using the interactions that did not pass the filters (dashed lines).
Figure 2Assessment of genomic features. Frequency distributions of A) GO Cellular Component similarity scores, B) GO Biological Process similarity scores and C) Pearson Correlation Coefficients for real and random protein-protein interactions.
Experimentally identified proteins and protein-protein interactions
| Number of proteins | 1722 |
| Number of proteins with OG | 1626 |
| Number of proteins with a BP GO annotation (depth = 5) | 1026 |
| Number of proteins with a CC GO annotation (depth = 5) | 688 |
| Number of proteins with gene expression data | 1821 |
| Number of interactions | 3035 |
| Number of interactions with OG | 3032 |
| Number of interactions between proteins with sufficient information on different genomic features | 1457 |
| Number of interactions with a BP GO annotation (depth = 5, depth = 8) | 1457,157 |
| Number of interactions with BP score > = 5, > = 8 | 913, 123 |
| Number of interactions with a CC GO annotation (depth = 5) | 757 |
| Number of interactions with CC score > = 5 | 565 |
| Number of interactions with gene expression data | 2482 |
| Number of interactions with PCC> = 0.3 | 837 |
| Number of interactions with a BP GO annotation (depth = 5) and expression data | 1276 |
| Number of interactions with BP score > = 5 and PCC > = 0.3) | 317 |
| Total number of filtered interactions with CC score > = 5 or BP score > = 5 and PCC > = 0.3 or BP score > = 8) | 767 |
Number of proteins and interactions with genomic feature information for Arabidopsis protein-protein interactions from public databases (OG = orthologous group, BP = biological process, CC = cellular component, GO = gene ontology, PCC = Pearson Correlation Coefficient)
Predicted protein-protein interactions
| Yeast | 4662 | 175322 |
| Yeast CC5 | 1182 | 9384 |
| Yeast BP5+PCC 0.3 | 1249 | 8986 |
| Yeast BP8 | 290 | 1336 |
| Yeast total filtered | 1919 | 16145 |
| Human | 3346 | 28397 |
| Human CC5 | 694 | 1821 |
| Human BP5+PCC 0.3 | 632 | 1082 |
| Human BP8 | 185 | 421 |
| Human total filtered | 1089 | 2795 |
| C. elegans | 1549 | 4174 |
| C. elegans CC5 | 149 | 199 |
| C. elegans BP5+PCC 0.3 | 234 | 256 |
| C. elegans BP8 | 70 | 100 |
| C. elegans total filtered | 330 | 432 |
| Drosophila | 3355 | 19315 |
| Drosophila CC5 | 430 | 924 |
| Drosophila BP5+PCC0.3 | 368 | 438 |
| Drosophila BP8 | 87 | 140 |
| Drosophila total filtered | 646 | 1282 |
| All species – Interologs with CC annotation (CC > = 5) | 2123 (1431) | 43698 (11891) |
| All species – Interologs with BP annotation (BP > = 5, BP > = 8) | 3087 (2111, 416) | 79475 (25037, 1890) |
| All species – Interologs with gene expression data (PCC > = 0.3) | 5369 (4688) | 161928 (62178) |
| All species – Filtered | 2233 | 18674 |
| All species – Predicted | 3014 | 51885 |
| All species – interologs (without filtering) | 6206 | 216972 |
Number of Arabidopsis proteins and protein-protein interactions detected using different source organisms and different genomic feature filters (BP = biological process, CC = cellular component, PCC = Pearson Correlation Coefficient)
Figure 3Overlap between datasets of protein-protein interactions. Overlap between experimentally identified protein-protein interactions available in public databases (see methods) and predicted sets from this study and the Geisler-Lee et al. study [45]. Filtered interactions from this study are compared to interactions with a confidence value of two or higher from the Geisler-Lee et al. [45] study.
Figure 4Protein-protein interactions involved in 'DNA replication'. Green edges represent PCC values > 0.3 (see Figure 5 for more details). For grey edges, no co-expression information is available.
Figure 5Protein-protein interactions involved in plant-specific processes. A. Protein-protein interactions involved in imprinting genes. B. ARP2/3 complex. Edges in black depict previous experimentally identified interactions. For grey edges, no co-expression information is available.