| Literature DB >> 26861823 |
Abstract
MOTIVATION: Many intracellular signaling processes are mediated by interactions involving peptide recognition modules such as SH3 domains. These domains bind to small, linear protein sequence motifs which can be identified using high-throughput experimental screens such as phage display. Binding motif patterns can then be used to computationally predict protein interactions mediated by these domains. While many protein-protein interaction prediction methods exist, most do not work with peptide recognition module mediated interactions or do not consider many of the known constraints governing physiologically relevant interactions between two proteins.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26861823 PMCID: PMC4908317 DOI: 10.1093/bioinformatics/btw045
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Work flow of PRM mediated PPI prediction pipeline. (A) Proteome is scanned using a PWM built with experimentally derived binding peptides (e.g. from phage display) of a given SH3 domain for potential interactors. (B) Separate Bayesian classifiers for peptide and protein features. (C) Integration of classifiers for predicting interacting and non-interacting protein pairs (Color version of this figure is available at Bioinformatics online.)
Fig. 2.Prediction efficacy of individual peptide features: disordered region (DR), surface accessibility (SA), peptide conservation (PC), structural contact (SC); and protein features: cellular component (CC), biological process (BP), molecular function (MF), gene expression (EX), sequence signature (SS) (Color version of this figure is available at Bioinformatics online.)
Evaluation of peptide and protein classifiers
| Test | Classifier | MCC | ACC | AUROC | |
|---|---|---|---|---|---|
| Filtered | Peptide | 0.74 | 0.87 | 0.87 | 0.92 |
| Protein | 0.68 | 0.83 | 0.83 | 0.94 | |
| Unfiltered | Peptide | 0.72 | 0.86 | 0.86 | 0.92 |
| Protein | 0.63 | 0.80 | 0.80 | 0.92 |
Matthews correlation coefficient (MCC), accuracy (ACC), F1-score and area under ROC curve (AUROC) of protein and peptide classifiers for blind tests are shown. MCC, ACC and F1-score are reported at threshold score . The filtered set has no missing values for any of the features, whereas unfiltered includes all feature data (as would be the case in a real world prediction scenario).
Fig. 3.Performance of peptide, protein and combined classifiers on the curated SH3 domain mediated PPI set (Color version of this figure is available at Bioinformatics online.)