| Literature DB >> 25735772 |
Fábio Madeira1, Michele Tinti2, Gavuthami Murugesan1, Emily Berrett1, Margaret Stafford3, Rachel Toth4, Christian Cole1, Carol MacKintosh2, Geoffrey J Barton5.
Abstract
MOTIVATION: The 14-3-3 family of phosphoprotein-binding proteins regulates many cellular processes by docking onto pairs of phosphorylated Ser and Thr residues in a constellation of intracellular targets. Therefore, there is a pressing need to develop new prediction methods that use an updated set of 14-3-3-binding motifs for the identification of new 14-3-3 targets and to prioritize the downstream analysis of >2000 potential interactors identified in high-throughput experiments.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25735772 PMCID: PMC4495292 DOI: 10.1093/bioinformatics/btv133
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Performance of the classifiers tested by Jackknife. ROC curves and the AUC scores for (A) model training (bold line) and model testing (dotted line); (B) final models and (C) comparison to Netphorest Scansite on the BLIND dataset. ANN, PSSM and SVM models were trained at the redundancy level of at least one residue difference and for a [−6:4] motif window
Comparison of the predictors developed in this study with Scansite and ELM, for an external BLIND dataset comprising 38 literature-curated 14-3-3-binding sites and 32 non-binding sites
| Predictor | TP | FP | TN | FN | SN (%) | SP (%) | PPV (%) | ACC (%) | MCC |
|---|---|---|---|---|---|---|---|---|---|
| Consensus | 28 | 4 | 28 | 10 | 73.7 | 87.5 | 87.5 | ||
| ANN | 29 | 5 | 27 | 9 | 76.3 | 84.4 | 85.3 | 80.0 | 0.60 |
| PSSM | 26 | 3 | 29 | 12 | 68.4 | 90.6 | 89.7 | 78.6 | 0.60 |
| SVM | 27 | 4 | 28 | 11 | 71.1 | 87.5 | 87.1 | 78.6 | |
| Netphorest Scansite | 28 | 7 | 25 | 10 | 73.7 | 78.1 | 80.0 | 75.7 | |
| Scansite2 | 28 | 7 | 25 | 10 | 73.7 | 78.1 | 80.0 | 75.7 | |
| ELM | 24 | 4 | 28 | 14 | 63.2 | 87.5 | 85.7 | 74.3 | |
| Scansite3 | 27 | 7 | 25 | 11 | 71.1 | 78.1 | 79.4 | 74.3 | |
| Scansite3 | 32 | 13 | 19 | 6 | 84.2 | 59.4 | 71.1 | 72.9 | |
| Scansite2 | 12 | 0 | 32 | 26 | 31.6 | 100.0 | 100.0 | 62.9 | |
| Scansite2 | 17 | 4 | 28 | 21 | 44.7 | 87.5 | 81.0 | 64.3 | |
| Scansite3 | 9 | 0 | 32 | 29 | 23.7 | 100.0 | 100.0 | 58.6 | |
| Scansite3 | 17 | 4 | 28 | 21 | 44.7 | 87.5 | 81.0 | 64.3 |
SN, sensitivity; SP, specificity; NPV, negative predictive value; ACC, accuracy. The table is sorted by MCC score.
aThe results shown were calculated based on optimal thresholds derived from accuracy/cut-off plots for the final models. The cut-offs are 0.55, 0.80, 0.25 and 0.50, for ANN, PSSM, SVN and Consensus, respectively.
bThe consensus predictor averages the scores obtained by the three methods: ANN, PSSM and SVM.
cScansite PSSM prediction scores were obtained by querying Netphorest. An optimal cut-off of 0.15 resulted in the balanced performance observed for Scansite2 low.
dBased on categorical classification of the queried phosphoproteins.
*The significance level of each method’s MCC score was assessed against the MCC score of the consensus predictor by computing a distribution of MCC scores for 100 bootstrap replicates with replacement, randomly selecting examples from the BLIND dataset. Underlined MCC scores indicate that the method is significantly worse than the consensus predictor (P < 0.05), whereas double underline indicates high significance (P < 0.001).
Top 20 high scoring predictions and their respective scores
| Rank | Protein | Description | Site | Motif | Consensus |
|---|---|---|---|---|---|
| 1 | PPP1R3G | Protein phosphatase 1 regulatory subunit 3G | 86 | 1.97 | |
| 2 | 37 | 1.88 | |||
| 3 | SSFA2 | Sperm-specific antigen 2 | 739 | 1.87 | |
| 4 | ALOX12 | Arachidonate 12-lipoxygenase, 12S-type | 246 | 1.83 | |
| 5 | SORBS2 | Sorbin and SH3 domain-containing protein 2 | 259 | 1.77 | |
| 6 | NELFE | Negative elongation factor E | 251 | 1.75 | |
| 7 | ANKRD63 | Ankyrin repeat domain-containing protein 63 | 332 | 1.74 | |
| 8 | SECISBP2L | Selenocysteine insertion sequence-binding protein 2-like | 251 | 1.72 | |
| 9 | FAM13A | Family with sequence similarity 13A | 741 | 1.72 | |
| 10 | FAM189A2 | Family with sequence similarity 189A2 | 275 | 1.71 | |
| 11 | 25 | 1.67 | |||
| 12 | TRAK2 | Trafficking kinesin-binding protein 2 | 420 | 1.67 | |
| 13 | CEP57 | Centrosomal protein of 57 kDa | 55 | 1.66 | |
| 14 | GOLGA5 | Golgin subfamily A member 5 | 116 | 1.66 | |
| 15 | CISD2 | CDGSH iron-sulphur domain-containing protein 2 | 106 | 1.66 | |
| 16 | TBC1D22A | TBC1 domain family member 22A | 167 | 1.65 | |
| 17 | HUWE1 | E3 ubiquitin-protein ligase HUWE1 | 649 | 1.65 | |
| 18 | UBR4 | E3 ubiquitin-protein ligase UBR4 | 2715 | 1.62 | |
| 19 | CEP170 | Centrosomal protein of 170 kDa | 644 | 1.60 | |
| 20 | TMEM40 | Transmembrane protein 40 | 137 | 1.59 |
Proteins that have shown affinity to 14-3-3 in high-throughput experiments and 2R-ohnologue members were identified by querying ANIA. FAM122A and FAM122B were experimentally verified to bind 14-3-3 in this study.
aThe consensus predictor averages the scores obtained by the three methods: ANN, PSSM and SVM.
bProteins that have shown affinity to 14-3-3 in high-throughput experiments.
cProtein members of 2R-ohnologue families.
Fig. 2.Regulated binding of 14-3-3 proteins to FAM122-GFP proteins. (A) HEK293 cells growing in media containing 10% (v/v) serum were transfected to express FAM122A-GFP, FAM122B-GFP and FAM122C-GFP proteins (the latter as 152 and 196 residue isoforms, respectively, excluding the GFP). GFP-tagged proteins isolated from cell lysates (120 mg) with GFP-Trap® were tested for their ability to bind directly to 14-3-3s in Far-Western assays (overlay) and by coimmunoprecipitation of endogenous 14-3-3s (14-3-3s) using the K19 pan-14-3-3 antibody. Anti-GFP signals show levels of the tagged proteins in the immunoprecipitates. (B) FAM122A-GFP bound to GFPTrap® was dephosphorylated with lambda phosphatase or not when the phosphatase was inhibited with EDTA. The immunoprecipitates were washed and FAM122A-GFP analyzed for its ability to bind directly to 14-3-3s (overlay) and for retention of co-purified endogenous 14-3-3 proteins (14-3-3s). Cells had been grown in standard medium (serum), with 100 nM calyculin A (a protein phosphatase inhibitor) added for approximately 5 min before lysis, as indicated. (C) Wild-type FAM122A-GFP and the indicated serine/threonine-to-alanine mutant proteins were isolated from transfected cells and tested for direct binding to 14-3-3s and for co-immunoprecipitating endogenous 14-3-3s. (D) HEK293 cells were transfected to express Ser37Ala/Ser62Ala- FAM122A-GFP and Ser37Ala/Thr64Ala-FAM122A-GFP, as indicated. Cells were serum starved for 10 h, then stimulated with serum (10% (v/v) for 30 min) or forskolin (20 µM for 30 min) with or without H89 pre-treatment (30 µM for 30 min), as indicated. Proteins immunoprecipitated from lysates were tested for 14-3-3 binding (overlay) and coimmunoprecipitation of endogenous 14-3-3 (14-3-3)