| Literature DB >> 23193284 |
Pablo Minguez1, Ivica Letunic, Luca Parca, Peer Bork.
Abstract
Post-translational modifications (PTMs) are involved in the regulation and structural stabilization of eukaryotic proteins. The combination of individual PTM states is a key to modulate cellular functions as became evident in a few well-studied proteins. This combinatorial setting, dubbed the PTM code, has been proposed to be extended to whole proteomes in eukaryotes. Although we are still far from deciphering such a complex language, thousands of protein PTM sites are being mapped by high-throughput technologies, thus providing sufficient data for comparative analysis. PTMcode (http://ptmcode.embl.de) aims to compile known and predicted PTM associations to provide a framework that would enable hypothesis-driven experimental or computational analysis of various scales. In its first release, PTMcode provides PTM functional associations of 13 different PTM types within proteins in 8 eukaryotes. They are based on five evidence channels: a literature survey, residue co-evolution, structural proximity, PTMs at the same residue and location within PTM highly enriched protein regions (hotspots). PTMcode is presented as a protein-based searchable database with an interactive web interface providing the context of the co-regulation of nearly 75 000 residues in >10 000 proteins.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23193284 PMCID: PMC3531129 DOI: 10.1093/nar/gks1230
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.PTMcode integrates five types of evidence channels to collect known and predicted functional associated PTMs within the same protein, named in here as: (A) ‘Co-evolution’ where two modified residues are found to be significantly co-evolving across eukaryotes. The multiple sequence alignment in the right corresponds to the orthologous group of the protein modified by the two acetylated lysines shown. (B) ‘Structural distance’ where two modified residues are found closer than a threshold (based on the distance of known PTM interactions). (C) ‘Same residue’ where the two modifications target the exact same residue, for instance, serine 2124 of the mouse protein Bsn can be modified either by phosphorylation or O-linked glycosylation. (D) ‘Manual annotation’, derived from a literature survey in order to cover known associated PTMs. (E) ‘PTM hotspots’ that represent protein regions enriched in modifications, the figure shows a region of 60 amino acids where 18 of them can be modified.
Number of functional associations that are predicted by each of the evidence channels and those that at least have two different evidences, only considering the four types of evidences that assign concrete pairwise association (co-evolution, structural distance, same residue and manual annotation)
| Co-evolution | Structural distance | Same residue | Manual annotation | |
|---|---|---|---|---|
| Co-evolution | 399 707 | |||
| Structural distance | 1101 | 2466 | ||
| Same residue | 0 | 0 | 576 | |
| Manual annotation | 13 | 6 | 0 | 57 |
Figure 2.The PTMcode database uses in its first release 13 different types of PTMs that are abbreviated in a two letter code as: Ph (phosphorylation), NG (N-linked glycosylation), Ac (acetylation), OG (O-linked glycosylation), Ub (ubiquitination), Me (methylation), SM (SUMOylation), Hy (hydroxylation), Ca (carboxylation), Pa (palmitoylation), Su (sulfation), Ni (nitrosylation) and CG (C-linked glycosylation). The known and predicted functional associations between two of these types of modifications are accessible by selecting any two PTM types in an interactive network that offers all possible connections (A). These connections are constrained by the fact that both modifications at least should happen in the same protein. The PTM types are represented here by symbols, the size of which corresponds to their relative abundance in the database. The link widths represent the number of proteins modified by the two respective PTM types normalized by the total number of proteins harbouring the less abundant PTM type. The width thus indicates relative coverage of the particular pair-wise functional associations. For instance, based on the link widths, we see that a large proportion of proteins that are nitrosylated are also phosphorylated, while the proportion of proteins that are both hydroxylated and phosphorylated is smaller. Upon selection in the WEB interface, two types of modifications are activated, and all the pairs of known and predicted functional association in all proteins are shown in a table (B) where each of the entries can be further explored. These tables are available for download as text files.
Figure 3.PTMcode offers the exploration of post-translational regulation within thousands of proteins. (A) Interactive graphical display of functional associations between PTMs within the human EGF receptor (EGFR). The protein is represented by the grey line at the top with globular domains and unstructured regions taken from the database SMART. Below this, PTM hotspots are shown as red lines (in EGFR, three hotspots were identified). At the bottom, the different PTMs are mapped and by clicking at one, the functional associations with any other PTM are shown by arches coloured according to the five evidence channels. All functional associations for a particular PTM can be further explored from a table (B) that is interactively displayed. The conservation score for the PTMs that are predicted to be associated is listed in the table together with all the evidences that support the prediction. Clicking on each of the evidences will show a pop-up with more detailed information, for example, alignments or mapping of PTM pairs onto three dimensional structures.