| Literature DB >> 25948583 |
Evangelia Petsalaki1, Andreas O Helbig2, Anjali Gopal3, Adrian Pasculescu3, Frederick P Roth4, Tony Pawson3.
Abstract
While phospho-proteomics studies have shed light on the dynamics of cellular signaling, they mainly describe global effects and rarely explore mechanistic details, such as kinase/substrate relationships. Tools and databases, such as NetworKIN and PhosphoSitePlus, provide valuable regulatory details on signaling networks but rely on prior knowledge. They therefore provide limited information on less studied kinases and fewer unexpected relationships given that better studied signaling events can mask condition- or cell-specific 'network wiring'. SELPHI is a web-based tool providing in-depth analysis of phospho-proteomics data that is intuitive and accessible to non-bioinformatics experts. It uses correlation analysis of phospho-sites to extract kinase/phosphatase and phospho-peptide associations, and highlights the potential flow of signaling in the system under study. We illustrate SELPHI via analysis of phospho-proteomics data acquired in the presence of erlotinib-a tyrosine kinase inhibitor (TKI)-in cancer cells expressing TKI-resistant and -sensitive variants of the Epidermal Growth Factor Receptor. In this data set, SELPHI revealed information overlooked by the reporting study, including the known role of MET and EPHA2 kinases in conferring resistance to erlotinib in TKI sensitive strains. SELPHI can significantly enhance the analysis of phospho-proteomics data contributing to improved understanding of sample-specific signaling networks. SELPHI is freely available via http://llama.mshri.on.ca/SELPHI.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25948583 PMCID: PMC4489257 DOI: 10.1093/nar/gkv459
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Main analysis customization input parameters and their default values (Full table of input options is presented in Supplementary Table S1)
| Parameter | Default | Description |
|---|---|---|
| Clustering method | NONE | Cluster phospho-peptides using either PCA & k-means ( |
| Ratio Cutoff | 3 | Threshold for phospho-peptide intensity ratio |
| Correlation Method | Spearman | Choose correlation index among Spearman, Pearson and Kendall tau. |
| Correlation Cutoff | 0.8–0.9 | Correlation coefficient threshold. Default 0.9 for Pearson, 0.8 for Spearman or Kendall tau |
| Correlation | 0.05 | Threshold for significance of correlation of phospho-peptide profiles |
Figure 1.Workflow of SELPHI. SELPHI first identifies the UniprotKB IDs and sequence location of the input phosphosites. The data are then filtered according to the input cutoff and clustering, KEGG pathway/GO term enrichment analysis and correlation analysis is applied on the input data. The result is an exploratory representation of the global effects relating to cell pathways and functions, a network view of potentially relevant kinase/phosphatase/substrate associations with the likely flow of signaling, as well as motifs enriched in the data set.
Figure 2.(A) Signaling Pathway and (B) Disease pathway enrichment in the modulated peptides of the case study data set comprising samples collected from adenocarcinoma cells expressing tyrosine kinase inhibitor (TKI) resistant and sensitive EGFR in the presence and absence of the TKI erlotinib (33). The size of the boxes represents the odds ratio of pathway enrichment and the color represents the sum of log(ratios) of the phospho-peptide in each condition shown in the x axis. Clustering was done according to the latter measure. (C) (i) Phospho-profile changes for the peptides in each cluster (ii) GO terms enrichment (5) for each cluster. (D) Heatmaps of correlations between tyrosine kinases and phosphatases and their associated phospho-peptides. (E) Logo (31) of significantly (P-value < 0.05) over-represented residues in the phospho-peptides associated with MAPK1 (Supplementary Table S6) in the case study against a background of 10000 random S/T/Y centered peptides from the non-redundant (at 40% sequence identity) human proteome.
Databases used to integrate information for annotation and filtering
| Database/Tool | Description |
|---|---|
| GeneMania ( | Database of functional association networks |
| PhosphoSitePlus ( | Database of phospho-sites, including kinase-substrate information and role of phosphorylation on function |
| PhosphoELM ( | Database of phospho-sites and kinase-substrate relationships |
| KEGG Pathways ( | Database of curated pathways |
| Animal TFDB ( | Database of animal transcription factors and binding sites |
| nTFdb ( | Database of worm transcription factors |
| YEASTRACT ( | Database of yeast transcription factors and binding sites |
| ELM ( | Database of Eukaryotic Linear Motifs |
| NetworKIN ( | Tool to predict kinase/substrate relationships |
| STRING ( | Database of protein associations and interactions |
| ProteoConnections ( | Data integration tool for proteomics data (SELPHI uses the motifs list) |