| Literature DB >> 18053208 |
Sara Zanivan1, Ilaria Cascone, Chiara Peyron, Ivan Molineris, Serena Marchio, Michele Caselle, Federico Bussolino.
Abstract
We propose a new approach to identify interacting proteins based on gene expression data. By using hypergeometric distribution and extensive Monte-Carlo simulations, we demonstrate that looking at synchronous expression peaks in a single time interval is a high sensitivity approach to detect co-regulation among interacting proteins. Combining gene expression and Gene Ontology similarity analyses enabled the extraction of novel interactions from microarray datasets. Applying this approach to p21-activated kinase 1, we validated alpha-tubulin and early endosome antigen 1 as its novel interactors.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18053208 PMCID: PMC2246258 DOI: 10.1186/gb-2007-8-12-r256
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Set of known human multi-protein complexes analyzed
| Number of genes | ||||
| Protein complex | Complex type | Thy-Thy2 | Thy-Thy3 | Thy-Noc |
| ATP_F0 | Permanent | 7 | 10 | 10 |
| ATP_F1 | Permanent | 4 | 3 | 3 |
| COX | Permanent | 7 | 6 | 8 |
| SRS | Permanent | 16 | 20 | 20 |
| LRS | Permanent | 15 | 18 | 18 |
| MLRS | Permanent | 22 | 36 | 37 |
| MSRS | Permanent | 20 | 30 | 30 |
| Proteasome | Permanent | 21 | 21 | 23 |
| PD | Permanent | 6 | 7 | 7 |
| RNA Pol II | Permanent | 10 | 10 | 10 |
| RNA Pol III | Permanent | 4 | 6 | 5 |
| AP2 | Transient | 2 | 4 | 4 |
| APC | Transient | 5 | 8 | 8 |
| Arp2-3 | Transient | 6 | 3 | 5 |
| ARC | Transient | 4 | 5 | 5 |
| Centrosome | Transient | 42 | 50 | 51 |
| Dynactin | Transient | 7 | 9 | 7 |
| Exocyst | Transient | 7 | 7 | 7 |
| Exosome | Transient | 3 | 5 | 5 |
| FA | Transient | 37 | 46 | 47 |
| GTC | Transient | 5 | 6 | 6 |
| Nucleopore | Transient | 27 | 29 | 30 |
| Nucleosome | Transient | 17 | 24 | 24 |
| ORC | Transient | 4 | 5 | 6 |
| RFC | Transient | 3 | 4 | 4 |
| SRP | Transient | 3 | 5 | 4 |
| SCF | Transient | 3 | 3 | 3 |
| SNARE complex | Transient | 7 | 7 | 7 |
| SWI-SNF | Transient | 12 | 10 | 12 |
| TAFIID | Transient | 8 | 13 | 13 |
| TRAPP | Transient | 2 | 5 | 6 |
| VHL | Transient | 4 | 4 | 4 |
The number of genes representing each protein complex is reported for the three analyzed HeLa cell-cycle datasets. AP2, adaptor-related protein complex 2; APC, anaphase promoting complex; ARC, axin related complex; ATP_F0, ATP synthase, H+ transporting, mitochondrial F0 complex; ATP_F1, ATP synthase, H+ transporting, mitochondrial F1 complex; COX, cytochrome c oxidase; FA, focal adhesion; GTC, golgi transport complex; MSRS, mitochondrial small ribosomal subunit; ORC, origin recognition complex; PD, pyruvate dehydrogenase; RNA Pol II, RNA polymerase II; SRP, signal recognition particle; TRAPP, trafficking protein particle complex; VHL, von Hippel-Lindau complex.
P values for Thy-Noc dataset
| Peaks of expression ( | Pearson ( | |||||||
| Protein complex | 2 h-0 h | 4 h-2 h | 6 h-4 h | 8 h-6 h | 10 h-8 h | 12 h-10 h | 14 h-12 h | Cell-cycle |
| AP2 | 3.67E-01 | 7.47E-01 | 7.05E-01 | 1.00E+00 | 4.78E-01 | 6.26E-01 | 3.73E-01 | 1.00E+00 |
| ARC | 3.49E-02 | 1.00E+00 | 1.00E+00 | 6.95E-03 | 1.00E+00 | 7.28E-02 | 1.00E+00 | 1.00E+00 |
| Arp2-3 | 1.00E+00 | 1.51E-01 | 3.95E-01 | 1.00E+00 | 5.56E-01 | 1.00E+00 | 5.00E-01 | 1.00E+00 |
| ATP_F0 | 1.00E+00 | 9.68E-01 | 9.53E-01 | 6.30E-01 | 8.03E-01 | 1.00E+00 | 2.21E-03 | 3.29E-03 |
| ATP_F1 | 1.00E+00 | 6.43E-01 | 1.00E+00 | 4.92E-01 | 1.00E+00 | 1.00E+00 | 6.77E-01 | 1.00E+00 |
| APC | 2.14E-01 | 7.26E-01 | 6.65E-01 | 2.07E-01 | 7.28E-01 | 8.60E-01 | 6.95E-02 | 7.16E-01 |
| COX | 1.00E+00 | 7.26E-01 | 3.54E-01 | 1.00E+00 | 3.43E-01 | 1.00E+00 | 2.20E-01 | 1.96E-06 |
| Centrosome | 7.58E-01 | 9.13E-01 | 9.90E-01 | 5.96E-01 | 7.11E-02 | 7.27E-02 | 1.46E-01 | 1.79E-03 |
| Dynactin | 9.26E-01 | 1.00E+00 | 8.82E-01 | 7.94E-01 | 6.80E-01 | 8.21E-01 | 3.50E-02 | 2.35E-01 |
| Exocyst | 6.93E-01 | 3.32E-01 | 5.87E-01 | 3.44E-02 | 6.80E-01 | 1.81E-01 | 3.85E-01 | 2.35E-01 |
| Exosome | 8.44E-01 | 8.20E-01 | 3.95E-01 | 1.00E+00 | 1.00E+00 | 1.00E+00 | 1.82E-01 | 3.62E-01 |
| FA | 3.81E-01 | 6.38E-01 | 1.49E-01 | 7.64E-02 | 9.37E-01 | 1.27E-01 | 1.21E-01 | 1.53E-01 |
| GTC | 6.02E-01 | 8.73E-01 | 4.97E-01 | 7.41E-01 | 2.24E-01 | 3.89E-01 | 6.10E-01 | 4.90E-01 |
| LRS | 9.99E-01 | 8.14E-01 | 9.96E-01 | 9.83E-01 | 9.46E-01 | 1.00E+00 | 4.42E-04 | 2.79E-07 |
| MLRS | 9.26E-01 | 6.68E-01 | 6.69E-01 | 4.78E-01 | 4.17E-02 | 9.74E-01 | 1.11E-04 | 9.41E-01 |
| MSRS | 9.40E-01 | 2.33E-01 | 2.47E-01 | 5.82E-01 | 9.52E-01 | 9.73E-01 | 3.11E-04 | 6.32E-03 |
| Nucleopore | 3.14E-01 | 3.68E-01 | 1.40E-01 | 8.82E-01 | 9.92E-01 | 3.24E-01 | 1.13E-03 | 3.60E-01 |
| Nucleosome | 9.91E-01 | 8.62E-03 | 9.71E-01 | 3.53E-01 | 8.94E-01 | 9.79E-01 | 3.28E-01 | 4.46E-33 |
| ORC | 2.75E-01 | 8.73E-01 | 8.40E-01 | 1.01E-01 | 6.23E-01 | 3.89E-01 | 1.00E+00 | 1.00E+00 |
| PD | 6.93E-01 | 9.10E-01 | 8.33E-02 | 4.28E-01 | 6.80E-01 | 4.73E-01 | 7.00E-01 | 6.11E-01 |
| Proteasome | 6.03E-01 | 9.96E-01 | 4.02E-01 | 9.62E-01 | 6.92E-01 | 1.00E+00 | 5.18E-08 | 1.99E-03 |
| RFC | 3.67E-01 | 1.00E+00 | 2.84E-01 | 1.84E-01 | 1.10E-01 | 6.26E-01 | 1.00E+00 | 1.00E+00 |
| RNA Pol II | 6.45E-01 | 5.92E-01 | 4.79E-03 | 1.00E+00 | 1.00E+00 | 1.00E+00 | 3.88E-01 | 8.68E-01 |
| RNA Pol III | 8.44E-01 | 1.00E+00 | 3.95E-01 | 2.66E-01 | 1.00E+00 | 7.28E-02 | 8.48E-01 | 1.00E+00 |
| SNARE | 9.26E-01 | 1.14E-01 | 5.87E-01 | 7.94E-01 | 2.83E-01 | 4.73E-01 | 9.29E-01 | 6.11E-01 |
| SWI-SNF | 5.40E-01 | 9.04E-01 | 3.92E-01 | 9.33E-01 | 1.00E+00 | 7.73E-01 | 3.14E-01 | 7.92E-01 |
| SRP | 9.19E-02 | 7.47E-01 | 7.05E-01 | 1.00E+00 | 1.00E+00 | 6.26E-01 | 9.47E-02 | 1.00E+00 |
| SCF | 6.72E-01 | 6.43E-01 | 6.00E-01 | 1.00E+00 | 3.86E-01 | 1.00E+00 | 3.10E-02 | 1.00E+00 |
| SRS | 9.94E-01 | 2.00E-01 | 2.57E-01 | 9.33E-01 | 5.95E-01 | 9.93E-01 | 7.69E-03 | 1.67E-11 |
| TAFIID | 8.20E-01 | 3.18E-01 | 9.81E-01 | 5.05E-01 | 3.08E-01 | 1.00E+00 | 1.96E-01 | 4.49E-01 |
| TRAPP | 6.02E-01 | 8.73E-01 | 1.00E+00 | 7.41E-01 | 6.23E-01 | 1.00E+00 | 2.82E-01 | 1.00E+00 |
| VHL | 1.00E+00 | 1.00E+00 | 2.84E-01 | 1.00E+00 | 1.00E+00 | 1.00E+00 | 3.73E-01 | 1.00E+00 |
P values obtained with the expression peaks method in each time interval of the cell-cycle or with Pearson correlation coefficient throughout the cell-cycle. AP2, adaptor-related protein complex 2; APC, anaphase promoting complex; ARC, axin related complex; ATP_F0, ATP synthase, H+ transporting, mitochondrial F0 complex; ATP_F1, ATP synthase, H+ transporting, mitochondrial F1 complex; COX, cytochrome c oxidase; FA, focal adhesion; GTC, golgi transport complex; MSRS, mitochondrial small ribosomal subunit; ORC, origin recognition complex; PD, pyruvate dehydrogenase; RNA Pol II, RNA polymerase II; SRP, signal recognition particle; TRAPP, trafficking protein particle complex; VHL, von Hippel-Lindau complex.
Percentage of synchronously peaking genes in the Thy-Noc dataset
| Peaks of expression (% of peaking genes per complex) | |||||||
| Protein complex | 2 h-0 h | 4 h-2 h | 6 h-4 h | 8 h-6 h | 10 h-8 h | 12 h-10 h | 14 h-12 h |
| AP2 | 50 | 25 | 25 | 0 | 25 | 25 | 50 |
| ARC | 80 | 0 | 0 | 80 | 0 | 60 | 0 |
| Arp2-3 | 0 | 60 | 40 | 0 | 20 | 0 | 40 |
| ATP_F0 | 0 | 10 | 10 | 20 | 10 | 0 | 80 |
| ATP_F1 | 0 | 33 | 0 | 33 | 0 | 0 | 33 |
| APC | 50 | 25 | 25 | 38 | 13 | 13 | 63 |
| COX | 0 | 25 | 38 | 0 | 25 | 0 | 50 |
| Centrosome | 27 | 22 | 14 | 20 | 24 | 31 | 39 |
| Dynactin | 14 | 0 | 14 | 14 | 14 | 14 | 71 |
| Exocyst | 29 | 43 | 29 | 57 | 14 | 43 | 43 |
| Exosome | 20 | 20 | 40 | 0 | 0 | 0 | 60 |
| FA | 34 | 28 | 34 | 30 | 9 | 30 | 40 |
| GTC | 33 | 17 | 33 | 17 | 33 | 33 | 33 |
| LRS | 6 | 22 | 6 | 6 | 6 | 0 | 72 |
| MLRS | 22 | 27 | 24 | 22 | 27 | 11 | 62 |
| MSRS | 20 | 37 | 33 | 20 | 7 | 10 | 63 |
| Nucleopore | 37 | 33 | 37 | 13 | 3 | 27 | 60 |
| Nucleosome | 13 | 54 | 13 | 25 | 8 | 8 | 38 |
| ORC | 50 | 17 | 17 | 50 | 17 | 33 | 0 |
| PD | 29 | 14 | 57 | 29 | 14 | 29 | 29 |
| Proteasome | 30 | 9 | 30 | 9 | 13 | 0 | 87 |
| RFC | 50 | 0 | 50 | 50 | 50 | 25 | 0 |
| RNA Pol II | 30 | 30 | 70 | 0 | 0 | 0 | 40 |
| RNA Pol III | 20 | 0 | 40 | 40 | 0 | 60 | 20 |
| SNARE | 14 | 57 | 29 | 14 | 29 | 29 | 14 |
| SWI-SNF | 33 | 17 | 33 | 8 | 0 | 17 | 42 |
| SRP | 75 | 25 | 25 | 0 | 0 | 25 | 75 |
| SCF | 33 | 33 | 33 | 0 | 33 | 0 | 100 |
| SRS | 10 | 40 | 35 | 10 | 15 | 5 | 60 |
| TAFIID | 23 | 38 | 8 | 23 | 23 | 0 | 46 |
| TRAPP | 33 | 17 | 0 | 17 | 17 | 0 | 50 |
| VHL | 0 | 0 | 50 | 0 | 0 | 0 | 50 |
For each protein complex the percentage of its synchronously peaking genes in each time interval is reported. AP2, adaptor-related protein complex 2; APC, anaphase promoting complex; ARC, axin related complex; ATP_F0, ATP synthase, H+ transporting, mitochondrial F0 complex; ATP_F1, ATP synthase, H+ transporting, mitochondrial F1 complex; COX, cytochrome c oxidase; FA, focal adhesion; GTC, golgi transport complex; MSRS, mitochondrial small ribosomal subunit; ORC, origin recognition complex; PD, pyruvate dehydrogenase; RNA Pol II, RNA polymerase II; SRP, signal recognition particle; TRAPP, trafficking protein particle complex; VHL, von Hippel-Lindau complex.
Figure 1Comparison of the FDRs for the Pearson correlation coefficient (Prs) and expression peaks (Pks) methods as a function of p value. For the Thy-Noc HeLa cell-cycle dataset, estimated FDRs (y-axis) are reported as a function of the Bonferroni corrected p value (x-axis).
Figure 2Comparison of sensitivity for the Pearson correlation coefficient (Prs) and expression peaks (Pks) methods. The number of complexes with best p value equal to or lower than the corresponding one on the x-axis is plotted for each HeLa cell-cycle dataset at a fixed FDR: (a) Thy-Thy2; (b) Thy-Thy3; (c) Thy-Noc.
Figure 3Non-synchronized HeLa cells. The number of complexes with a best p value equal to or lower than the corresponding one on the x-axis is plotted for three non-synchronized and stressed HeLa datasets at a fixed FDR: (a) dithiothreitol (DTT); (b) heat shock; (c) tunicamycin.
Figure 4PAK1 physically interacts with α-tubulin. HeLa cell lysate was immunoprecipitated with anti-PAK1 antibody and blotted with anti-α-tubulin antibody. The figure is representative of three experiments obtained with similar results.
Figure 5Experimental evidence for the interaction of PAK1 with EEA1. Confocal analysis of the cross section (a) and the vertical section (c) of PDGF-induced MEF cell reveals that endogenous EEA1 colocalized (yellow) with PAK1-GFP. (b) Quantification of the colocalization where the x-axis represents the white line in the inset (rotated -90° compared to (a)) and the y-axis represents the fluorescence intensity. The first peak of intensity in both channels indicates that PAK1 (green) and EEA1 (red) were enriched at the same site. (d) Sequence matching (computed with the multiple sequence alignment program ClustalW) obtained for the phage-display selected peptide QLRSEGPF and the aminoacidic sequence of EEA1. (e) Binding of the selected peptide (QLRSEGPF) to GST-CRIB and the negative control performed with GST alone. Binding of the insertless phage was tested with either GST or GST-CRIB, which showed no differences in affinity. The y-axis represents the absorbance (OD 450 nm). Results are the mean of triplicate experiments.