Literature DB >> 25332398

The human DEPhOsphorylation database DEPOD: a 2015 update.

Guangyou Duan1, Xun Li1, Maja Köhn2.   

Abstract

Phosphatases are crucial enzymes in health and disease, but the knowledge of their biological roles is still limited. Identifying substrates continues to be a great challenge. To support the research on phosphatase-kinase-substrate networks we present here an update on the human DEPhOsphorylation Database: DEPOD (http://www.depod.org or http://www.koehn.embl.de/depod). DEPOD is a manually curated open access database providing human phosphatases, their protein and non-protein substrates, dephosphorylation sites, pathway involvements and external links to kinases and small molecule modulators. All internal data are fully searchable including a BLAST application. Since the first release, more human phosphatases and substrates, their associated signaling pathways (also from new sources), and interacting proteins for all phosphatases and protein substrates have been added into DEPOD. The user interface has been further optimized; for example, the interactive human phosphatase-substrate network contains now a 'highlight node' function for phosphatases, which includes the visualization of neighbors in the network.
© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25332398      PMCID: PMC4383878          DOI: 10.1093/nar/gku1009

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Reversible phosphorylation plays important roles across a wide range of cellular processes. Therein phosphatases are one important protein class that antagonizes the roles of kinases to maintain the phosphorylation homeostasis by hydrolyzing the phosphoryl group from protein or non-protein compounds. Compared to the available data resources for kinases (1), there are still very few public data resources about phosphatases, especially regarding the phosphatase–substrate relations. So far several data resources for phosphatases have been developed, and most of them focus on specific aspects. For example, PTP-central provides a comprehensive resource of one of the 18 phosphatase families (2), that is the protein tyrosine phosphatase family (PTPs), in eukaryotic species based on a genomic scale PTP prediction tool termed Y-Phosphatomer (3). Another comprehensive phosphatase resource is the Eukaryotic protein Kinase and protein Phosphatase Database (EKPD) which is a hierarchical database of eukaryotic protein kinases and protein phosphatases (4). Both, the EKPD and PTP-central do not provide the phosphatase–substrate relationship information. The human phosphatase portal (HuPho) includes phosphatase–substrate relations by integrating diverse annotation information from several public web resources or scientific literature using text-mining techniques (5). This database is however only searchable for proteins, and does not contain an interactive network. Several other older phosphatase data resources like the PTP Database and PhosphoregDB are either out of service or not continually updated anymore. While all these phosphatase-containing databases are very useful, we developed DEPOD to provide a comprehensive, high quality, manually curated, fully searchable and interactive human phosphatase resource to the signaling research community (2). DEPOD is continuously updated and extended. In the following, the previously only briefly described construction process and functionalities of DEPOD will be explained, including new features introduced after the first release (2).

THE DEPOD RESOURCE

Construction of DEPOD

The construction of DEPOD is shown in Figure 1. Human phosphatases with enzymatic activities were retrieved from the Homo sapiens gene data set in the Ensembl database (release 75) (2,6). In addition to the active phosphatases, we have now also included inactive ones, that is the ones without measured enzymatic activities, to provide a comprehensive human phosphatase database. The Gene Ontology (GO) molecular function term (7) and phosphatase-related Pfam protein family entries (8) were used to screen candidate phosphatase-encoding genes as shown in Figure 1. The human phosphatase substrate information is obtained by the combination of the ‘dephosphorylation’ posttranslational modification data in the Human Protein Reference Database (9), ‘dephosphorylation’ interaction data retrieved from the PSICQUIC service (10), substrate information from UniProt annotation (11) and the scientific literature using PubMed and Google. After manual curation of literature reported substrates, the reliability score of the desphosphorylation interaction is assigned according to the dephosphorylation bioassay type and number of different laboratories conducting the corresponding bioassay (that is the number of unrelated corresponding authors of different publications, see also (2) and Figure 1). The corresponding literature to the phosphatase-substrate interaction is referenced via links to PubMed (www.ncbi.nlm.nih.gov/pubmed) and Europe PubMed Central (europepmc.org). The visualization of phosphatase–substrate relations is supported using Cytoscape Web (12). Furthermore, links to phosphorylation databases like Phospho.ELM (13), PhosphoSitePlus (14) and NetworKIN (15) provide the kinase information related to the substrates. Basic information and links to other databases integrated for phosphatases or substrates include but are not limited to HGNC (16), Entrez Gene (17), UniProt (11), KEGG COMPOUND (18), chEBI (19), chEMBL (20), PDB (21) and Pfam (8). Phosphatase or protein substrate related protein interaction motifs from ELM (22) are also added as a new feature for this release in the basic information page. The pathway information for phosphatases or substrates is extracted from different sources: KEGG Pathway (18), NCI-Nature PID Pathway (23) and, with this update, Reactome Pathway (24). Beyond the pathway information also as a new feature, the protein interaction information for phosphatase or substrate extracted from IntAct (25), MINT (26) and STRING (27) is provided as an integrated resource. The brief methodology of DEPOD can also be found by clicking on the ‘About DEPOD’ option on each page of DEPOD or on the bottom of the home page.
Figure 1.

Flow chart of construction process of the DEPOD (version 1.1). The DEPOD core data are shown in red. EC, Enzyme Commission.

Flow chart of construction process of the DEPOD (version 1.1). The DEPOD core data are shown in red. EC, Enzyme Commission.

Statistics of DEPOD

Currently DEPOD contains 228 human active phosphatases of which 194 have 387 substrates including 298 protein substrates and 89 non-protein substrates, forming 1096 dephosphorylation interactions in total. We now added 11 inactive human phosphatases, thus in total there are 239 human phosphatases in the database. Furthermore, there are 213 pathways from the KEGG database (18), 206 pathways from the NCI-Nature Pathway Interaction Database (PID) (23) and 560 Reactome pathways (24) mapped onto phosphatases and substrates. As a new feature, there are 13984 protein interactions from IntAct, 6884 protein interactions from MINT and 6782 protein interactions from STRING including phosphatases or substrates.

Usage of DEPOD

Detailed documentation for the usage of DEPOD is provided by clicking on the ‘User Manual’ option on the left-hand navigation for each page of DEPOD or on the bottom of the home page. A brief usage introduction of DEPOD is shown below.

Search of DEPOD

Users of DEPOD can use the left-hand navigation on each page of DEPOD to browse the phosphatases (‘Human Phosphatases’), protein substrates (‘Protein Substrates’), non-protein substrates (‘Non-Protein Substrates’) and pathways (‘Pathway Mapping’) which provides an overview of all the related data in DEPOD. Another straightforward way is to use the keyword-based searches on phosphatases, substrates, pathways and, previously not available (2), dephosphorylation sites as shown in Figure 2A. Users can also search DEPOD using the quick search option on the top right corner of each page (‘Quick Search’), which gives the results in the categories ‘phosphatases’, ‘substrates’ and ‘pathways’ as tabs (Figure 2B). A sequence homology search functionality is provided that users can use to search homologous phosphatases or protein substrates in DEPOD using their own interesting sequences from any species.
Figure 2.

Usage screenshots of DEPOD. (A) Search of phosphatases/substrates/dephosphorylation site/pathways and sequence (BLAST); (B) Quick search of DEPOD; (C) Human phosphatase–substrate network.

Usage screenshots of DEPOD. (A) Search of phosphatases/substrates/dephosphorylation site/pathways and sequence (BLAST); (B) Quick search of DEPOD; (C) Human phosphatase–substrate network.

Visualization of the dephosphorylation network

As an important data component of DEPOD, manually curated phosphatase–substrate relations can be explored using the interactive visualization functionality based on Cytoscape Web (10) (with the flash installed in the web browser). Phosphatases and their substrates are connected via signal arrow links, for which the widths indicate the reliability scores of the corresponding interactions. The details of each node will be shown on the top right area by clicking on the corresponding node and its first neighbor nodes can be selected by right click on the node. Users can search their phosphatase or substrate of interest using the search box on the bottom right corner (‘Highlight Node’) and select any neighboring nodes with different path lengths as shown in Figure 2C.

Download of DEPOD data

On the download page, human phosphatases and related information included in DEPOD are downloadable in xls format for interested users. The human phosphatase-substrate interaction and dephosphosphorylation sites are now downloadable in TXT and PSI-MI Tab 2.5 formats (28). Beyond this, pathways from KEGG, NCI Nature PID and Reactome pathways mapped on phosphatases and their substrates are available for download.

Feedback and contact

As a human phosphatases data portal, users can also deposit substrate candidates of human active phosphatases to DEPOD using ‘Deposit Data’ on the left-hand navigation (see also the ‘About DEPOD’ page in the database). More general suggestions or comments for DEPOD can be sent to us by the ‘Feedback’ page that can be found on the left-hand navigation, or by sending us an email by clicking on the ‘Contact’ option on the left-hand navigation on each page. The ‘Feedback’ or ‘Contact’ option can also be accessed on the bottom of the home page.

Update on the structure-sequence-based phosphatase classification

Since we have now included also the inactive human phosphatases, we performed another re-classification of human phosphatases (for methodological details see (2)) that includes the inactive phosphatases as well as few more (potentially) active phosphatases that were previously not included. These were found by manual comparison of DEPOD data with Ensembl (STS-1, NT5DC1, NT5DC2, NT5DC4), HuPho (CDC14C) and EKPD (STS-1) data. Supplementary Figure S1 shows the phosphatase families where the new entries have been added. Of these newly added phosphatases, STS-1 represents a very interesting entry due to its novel classification as family 8 member. It was previously not classified into a historical phosphatase class (2), and thus this reveals a new family relationship. It is the only phosphatase in family 8 with tyrosine phosphatase activity.

CONCLUSION AND FUTURE PERSPECTIVES

DEPOD provides a web interface to access manually curated data on human active phosphatases and their substrates. These data can be used to develop tools for predicting dephosphorylation sites for phosphatases or to aid in substrate-based drug design (29). Pathway mapping provides clues to the potential involvement of phosphatases and substrates in novel pathways. External links to kinases conducting phosphorylation makes it possible to connect phosphatases and kinases through their common substrates, and could help to understand which pathways and phosphatases could be effected by kinase inhibition in drug discovery. The online interactive visualization tool enables a clear overview of known phosphatase interaction, and can be used to connect the phosphatase or substrate of interest with their nearest or further neighbors. DEPOD has become a valuable resource for the phosphorylation signaling community (30) and will continue to be an essential resource for future phospho-signaling studies. DEPOD will be continuously updated according to the research progress of the phosphorylation signaling community by adding new human phosphatases, substrates of phosphatases, dephosphorylation sites, further protein–protein interaction data (31) and so on. With the advancement in our knowledge of the substrate specificity for phosphatases, more related information is planned to be added into DEPOD such as physicochemical properties (32) and protein interaction motifs (22).

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.
  32 in total

Review 1.  A summary of computational resources for protein phosphorylation.

Authors:  Yu Xue; Xinjiao Gao; Jun Cao; Zexian Liu; Changjiang Jin; Longping Wen; Xuebiao Yao; Jian Ren
Journal:  Curr Protein Pept Sci       Date:  2010-09       Impact factor: 3.272

2.  PSICQUIC and PSISCORE: accessing and scoring molecular interactions.

Authors:  Bruno Aranda; Hagen Blankenburg; Samuel Kerrien; Fiona S L Brinkman; Arnaud Ceol; Emilie Chautard; Jose M Dana; Javier De Las Rivas; Marine Dumousseau; Eugenia Galeota; Anna Gaulton; Johannes Goll; Robert E W Hancock; Ruth Isserlin; Rafael C Jimenez; Jules Kerssemakers; Jyoti Khadake; David J Lynn; Magali Michaut; Gavin O'Kelly; Keiichiro Ono; Sandra Orchard; Carlos Prieto; Sabry Razick; Olga Rigina; Lukasz Salwinski; Milan Simonovic; Sameer Velankar; Andrew Winter; Guanming Wu; Gary D Bader; Gianni Cesareni; Ian M Donaldson; David Eisenberg; Gerard J Kleywegt; John Overington; Sylvie Ricard-Blum; Mike Tyers; Mario Albrecht; Henning Hermjakob
Journal:  Nat Methods       Date:  2011-06-29       Impact factor: 28.547

3.  The calcineurin signaling network evolves via conserved kinase-phosphatase modules that transcend substrate identity.

Authors:  Aaron Goldman; Jagoree Roy; Bernd Bodenmiller; Stefanie Wanka; Christian R Landry; Ruedi Aebersold; Martha S Cyert
Journal:  Mol Cell       Date:  2014-06-12       Impact factor: 17.970

4.  Cytoscape Web: an interactive web-based network browser.

Authors:  Christian T Lopes; Max Franz; Farzana Kazi; Sylva L Donaldson; Quaid Morris; Gary D Bader
Journal:  Bioinformatics       Date:  2010-07-23       Impact factor: 6.937

5.  Phospho.ELM: a database of phosphorylation sites--update 2011.

Authors:  Holger Dinkel; Claudia Chica; Allegra Via; Cathryn M Gould; Lars J Jensen; Toby J Gibson; Francesca Diella
Journal:  Nucleic Acids Res       Date:  2010-11-09       Impact factor: 16.971

6.  Entrez Gene: gene-centered information at NCBI.

Authors:  Donna Maglott; Jim Ostell; Kim D Pruitt; Tatiana Tatusova
Journal:  Nucleic Acids Res       Date:  2010-11-28       Impact factor: 16.971

7.  ChEMBL: a large-scale bioactivity database for drug discovery.

Authors:  Anna Gaulton; Louisa J Bellis; A Patricia Bento; Jon Chambers; Mark Davies; Anne Hersey; Yvonne Light; Shaun McGlinchey; David Michalovich; Bissan Al-Lazikani; John P Overington
Journal:  Nucleic Acids Res       Date:  2011-09-23       Impact factor: 16.971

8.  MINT, the molecular interaction database: 2012 update.

Authors:  Luana Licata; Leonardo Briganti; Daniele Peluso; Livia Perfetto; Marta Iannuccelli; Eugenia Galeota; Francesca Sacco; Anita Palma; Aurelio Pio Nardozza; Elena Santonico; Luisa Castagnoli; Gianni Cesareni
Journal:  Nucleic Acids Res       Date:  2011-11-16       Impact factor: 16.971

9.  NetworKIN: a resource for exploring cellular phosphorylation networks.

Authors:  Rune Linding; Lars Juhl Jensen; Adrian Pasculescu; Marina Olhovsky; Karen Colwill; Peer Bork; Michael B Yaffe; Tony Pawson
Journal:  Nucleic Acids Res       Date:  2007-11-02       Impact factor: 16.971

10.  Human Protein Reference Database--2009 update.

Authors:  T S Keshava Prasad; Renu Goel; Kumaran Kandasamy; Shivakumar Keerthikumar; Sameer Kumar; Suresh Mathivanan; Deepthi Telikicherla; Rajesh Raju; Beema Shafreen; Abhilash Venugopal; Lavanya Balakrishnan; Arivusudar Marimuthu; Sutopa Banerjee; Devi S Somanathan; Aimy Sebastian; Sandhya Rani; Somak Ray; C J Harrys Kishore; Sashi Kanth; Mukhtar Ahmed; Manoj K Kashyap; Riaz Mohmood; Y L Ramachandra; V Krishna; B Abdul Rahiman; Sujatha Mohan; Prathibha Ranganathan; Subhashri Ramabadran; Raghothama Chaerkady; Akhilesh Pandey
Journal:  Nucleic Acids Res       Date:  2008-11-06       Impact factor: 16.971

View more
  32 in total

Review 1.  Protein Bioinformatics Databases and Resources.

Authors:  Chuming Chen; Hongzhan Huang; Cathy H Wu
Journal:  Methods Mol Biol       Date:  2017

2.  The homophilic receptor PTPRK selectively dephosphorylates multiple junctional regulators to promote cell-cell adhesion.

Authors:  Gareth W Fearnley; Katherine A Young; James R Edgar; Robin Antrobus; Iain M Hay; Wei-Ching Liang; Nadia Martinez-Martin; WeiYu Lin; Janet E Deane; Hayley J Sharpe
Journal:  Elife       Date:  2019-03-29       Impact factor: 8.140

3.  Tissue-Specific Signaling Networks Rewired by Major Somatic Mutations in Human Cancer Revealed by Proteome-Wide Discovery.

Authors:  Junfei Zhao; Feixiong Cheng; Zhongming Zhao
Journal:  Cancer Res       Date:  2017-03-31       Impact factor: 12.701

4.  Proteogenomic Characterization of Endometrial Carcinoma.

Authors:  Yongchao Dou; Emily A Kawaler; Daniel Cui Zhou; Marina A Gritsenko; Chen Huang; Lili Blumenberg; Alla Karpova; Vladislav A Petyuk; Sara R Savage; Shankha Satpathy; Wenke Liu; Yige Wu; Chia-Feng Tsai; Bo Wen; Zhi Li; Song Cao; Jamie Moon; Zhiao Shi; MacIntosh Cornwell; Matthew A Wyczalkowski; Rosalie K Chu; Suhas Vasaikar; Hua Zhou; Qingsong Gao; Ronald J Moore; Kai Li; Sunantha Sethuraman; Matthew E Monroe; Rui Zhao; David Heiman; Karsten Krug; Karl Clauser; Ramani Kothadia; Yosef Maruvka; Alexander R Pico; Amanda E Oliphant; Emily L Hoskins; Samuel L Pugh; Sean J I Beecroft; David W Adams; Jonathan C Jarman; Andy Kong; Hui-Yin Chang; Boris Reva; Yuxing Liao; Dmitry Rykunov; Antonio Colaprico; Xi Steven Chen; Andrzej Czekański; Marcin Jędryka; Rafał Matkowski; Maciej Wiznerowicz; Tara Hiltke; Emily Boja; Christopher R Kinsinger; Mehdi Mesri; Ana I Robles; Henry Rodriguez; David Mutch; Katherine Fuh; Matthew J Ellis; Deborah DeLair; Mathangi Thiagarajan; D R Mani; Gad Getz; Michael Noble; Alexey I Nesvizhskii; Pei Wang; Matthew L Anderson; Douglas A Levine; Richard D Smith; Samuel H Payne; Kelly V Ruggles; Karin D Rodland; Li Ding; Bing Zhang; Tao Liu; David Fenyö
Journal:  Cell       Date:  2020-02-13       Impact factor: 41.582

5.  Phosphoproteome Analysis of Cells Infected with Adapted and Nonadapted Influenza A Virus Reveals Novel Pro- and Antiviral Signaling Networks.

Authors:  Axel Weber; Sharmistha Dam; Vera V Saul; Irina Kuznetsova; Christin Müller; Karin Fritz-Wolf; Katja Becker; Uwe Linne; Hongbo Gu; Matthew P Stokes; Stephan Pleschka; Michael Kracht; M Lienhard Schmitz
Journal:  J Virol       Date:  2019-06-14       Impact factor: 5.103

6.  Unbiased identification of substrates of protein tyrosine phosphatase ptp-3 in C. elegans.

Authors:  Christopher J Mitchell; Min-Sik Kim; Jun Zhong; Raja Sekhar Nirujogi; Anjun K Bose; Akhilesh Pandey
Journal:  Mol Oncol       Date:  2016-03-25       Impact factor: 6.603

7.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update.

Authors:  Maxim V Kuleshov; Matthew R Jones; Andrew D Rouillard; Nicolas F Fernandez; Qiaonan Duan; Zichen Wang; Simon Koplev; Sherry L Jenkins; Kathleen M Jagodnik; Alexander Lachmann; Michael G McDermott; Caroline D Monteiro; Gregory W Gundersen; Avi Ma'ayan
Journal:  Nucleic Acids Res       Date:  2016-05-03       Impact factor: 16.971

8.  Functional Genomics Approach Identifies Novel Signaling Regulators of TGFα Ectodomain Shedding.

Authors:  Jennifer L Wilson; Eirini Kefaloyianni; Lauren Stopfer; Christina Harrison; Venkata S Sabbisetti; Ernest Fraenkel; Douglas A Lauffenburger; Andreas Herrlich
Journal:  Mol Cancer Res       Date:  2017-10-10       Impact factor: 5.852

9.  NicheNet: modeling intercellular communication by linking ligands to target genes.

Authors:  Robin Browaeys; Wouter Saelens; Yvan Saeys
Journal:  Nat Methods       Date:  2019-12-09       Impact factor: 28.547

10.  Computational Phosphorylation Network Reconstruction: An Update on Methods and Resources.

Authors:  Min Zhang; Guangyou Duan
Journal:  Methods Mol Biol       Date:  2021
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.