| Literature DB >> 20513650 |
Rohit Singh1, Daniel Park, Jinbo Xu, Raghavendra Hosur, Bonnie Berger.
Abstract
Struct2Net is a web server for predicting interactions between arbitrary protein pairs using a structure-based approach. Prediction of protein-protein interactions (PPIs) is a central area of interest and successful prediction would provide leads for experiments and drug design; however, the experimental coverage of the PPI interactome remains inadequate. We believe that Struct2Net is the first community-wide resource to provide structure-based PPI predictions that go beyond homology modeling. Also, most web-resources for predicting PPIs currently rely on functional genomic data (e.g. GO annotation, gene expression, cellular localization, etc.). Our structure-based approach is independent of such methods and only requires the sequence information of the proteins being queried. The web service allows multiple querying options, aimed at maximizing flexibility. For the most commonly studied organisms (fly, human and yeast), predictions have been pre-computed and can be retrieved almost instantaneously. For proteins from other species, users have the option of getting a quick-but-approximate result (using orthology over pre-computed results) or having a full-blown computation performed. The web service is freely available at http://struct2net.csail.mit.edu.Entities:
Mesh:
Year: 2010 PMID: 20513650 PMCID: PMC2896152 DOI: 10.1093/nar/gkq481
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Availability of experimental PPI data for major eukaryotic organisms
| Organism | Number of interactions | Percentage of proteins with at least one interaction |
|---|---|---|
| Mouse | 1486 | 6.0 |
| Human | 26 640 | 41.8 |
| Worm | 4559 | 14.5 |
| Fly | 22 740 | 52.7 |
| Yeast | 48 901 | 93.5 |
The statistics here and in Figure 1 have been computed using data from BioGRID (v 2.0.53) (35). Data based on phenotypic suppression/enhancement and synthetic interaction was excluded as these experiments do not provide evidence of a direct physical interaction between proteins.
Figure 1.Rate of discovery of new eukaryotic PPI data has slowed.
Figure 2.Sensitivity versus specificity. The prediction algorithm can achieve 60% sensitivity while maintaining 75% specificity as measured on the test set. Here, sensitivity = (true positives)/(true positives + false negatives) and specificity = (true negatives)/(true negatives + false positives). We constructed a training set and test set of positive and negative examples from yeast and fly, using criteria we have developed to identify high-confidence positive and negative examples of PPIs (see the website FAQ for details). After training the logistic regression model on the training set, its performance was measured on the test set.
Figure 3.Web interface and output of Struct2Net. (A and B) Web server entry page. (C) A query option for either a quick-but-approximate approach (using orthology over pre-computed predictions from yeast, fly and human) or a full-blown computation using the Struct2Net algorithm. (D) Example of an output page when choosing to thread pairs of sequences onto all templates. Confidence scores for a potential interaction are displayed along with associated template–sequence alignments and threading details.
Struct2Net results for set of interacting and non-interacting protein pairs
| Struct2Net | HOMCOS | InterPretS | |||||
|---|---|---|---|---|---|---|---|
| Job ID | Test pairs | Uniprot IDs | Templates | Confidence | Zseqcon | Zcon | Best |
| Interacting protein pairs | |||||||
| 1OLYN2PM | 1b34AB | P62314, P62316 | 1d3bAB | 0.620 | −40.9 | −3.37 | 1.62 |
| N9LJTIPG | 1g65JK | P22141, P30656 | 1iruKL | 0.590 | −62.7 | −1.34 | No Hits |
| Q44OTFMD | 1gl2BC | O70439, O88384 | 1gl2BC | 0.958 | −37.9 | −4.38 | 3.42 |
| HZ0N1HR9 | 1sxjBC | P40339, P38629 | 1sxjBC | 0.251 | −81.3 | −3.77 | 2.87 |
| NQARC82J | 1finAB | P24941, P20248 | 1e9hAB | 0.428 | −70.7 | −2.96 | 3.04 |
| 4LJQHZA | 1ukvGY | P39958, P01123 | 1ukvGY | 0.662 | −67.3 | −6.23 | 3.90 |
| 4LFMIDJ | 1bi7AB | Q00534, P42771 | 1bi7AB | 0.385 | −51.1 | −2.37 | 0.84 |
| 9N2PHLBI | 1id3AF | P61830, P02309 | 1aoiAB | 0.989 | −45.6 | −5.39 | 4.59 |
| SNTT8NHN | 1s1hJN | P38701, P41058 | 1s1gJN | 0.990 | −23.7 | −0.27 | 1.28 |
| NBTGSU4P | 1ow3AB | Q07960, P61586 | 1ow3AB | 0.425 | −62.3 | −3.98 | 2.58 |
| −54.35 | −3.57 | 2.87 | |||||
| Standard Deviation | 0.25 | 14.9 | 1.72 | 1.14 | |||
| Non-interacting protein pairs | |||||||
| JTP3Q280 | 1g3nAB | Q00534, P42773 | 1g3nAB | 0.347 | −57.1 | −2.88 | 1.61 |
| JCEFCQGQ | 1oiuBC | P24941, P20248 | 1e9hBA | 0.428 | −70.5 | −2.87 | 3.04 |
| YD4L76VD | 1gotAB | P04695, P62871 | 1gg2AB | 0.249 | −83.5 | −3.39 | 2.98 |
| YRJQ0JZI | 1ow3AB | Q07960, P61586 | 1ow3AB | 0.425 | −62.3 | −3.98 | 2.62 |
| JQ260ZEC | 1f3mAC | Q13153, Q13153 | 1f3mAC | 0.718 | −43.5 | −7.05 | 3.49 |
| VJ8BPGQ2 | 1a9nAB | P09661, P08579 | 1a9nAB | 0.334 | −45.1 | −2.82 | 2.69 |
| 0OLMGNWZ | 1k5dAC | P62826, P41391 | none | 0.169 | −66.7 | −4.15 | 2.29 |
| 8WEA7WWS | 1fq1AB | Q16667, P24941 | 1fq1AB | 0.425 | −60.9 | −1.90 | 1.83 |
| EWVFV6TL | 1fbvAC | P22681, P68036 | 1fbvAC | 0.717 | −53.2 | −0.87 | -0.31 |
| WVW4S9TW | 1qbkBC | Q92973, P62826 | none | 0.180 | −61.4 | −1.35 | 2.48 |
| −60.42 | −3.13 | 2.27 | |||||
| Standard Deviation | 0.25 | 14.9 | 1.72 | 1.14 | |||
We chose sets of interacting and non-interacting protein pairs; these pairs are taken from an analysis by HOMCOS authors (22). Low confidence templates are indicated as ‘none’. For comparison, the scores from HOMCOS and InterPretS for these pairs are also shown. Struct2Net provides a confidence score between 0 and 1 (0 indicates minimum confidence while 1 indicates maximum confidence). HOMCOS provides a Zcon measure, while InterPretS provides Z-scores. The average positive and negative scores are separated by a larger magnitude in Struct2Net: the separation is about 0.96 SD in Struct2Net; the corresponding separation in HOMCOS is 0.26 SD, and in InterPretS is 0.53 SD. Clearly, the Struct2Net score better distinguishes between interacting and non-interacting pairs.