| Literature DB >> 16381945 |
Tzong-Yi Lee1, Hsien-Da Huang, Jui-Hung Hung, Hsi-Yuan Huang, Yuh-Shyong Yang, Tzu-Hao Wang.
Abstract
dbPTM is a database that compiles information on protein post-translational modifications (PTMs), such as the catalytic sites, solvent accessibility of amino acid residues, protein secondary and tertiary structures, protein domains and protein variations. The database includes all of the experimentally validated PTM sites from Swiss-Prot, PhosphoELM and O-GLYCBASE. Only a small fraction of Swiss-Prot proteins are annotated with experimentally verified PTM. Although the Swiss-Prot provides rich information about the PTM, other structural properties and functional information of proteins are also essential for elucidating protein mechanisms. The dbPTM systematically identifies three major types of protein PTM (phosphorylation, glycosylation and sulfation) sites against Swiss-Prot proteins by refining our previously developed prediction tool, KinasePhos (http://kinasephos.mbc.nctu.edu.tw/). Solvent accessibility and secondary structure of residues are also computationally predicted and are mapped to the PTM sites. The resource is now freely available at http://dbPTM.mbc.nctu.edu.tw/.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16381945 PMCID: PMC1347446 DOI: 10.1093/nar/gkj083
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1The data generation flow of the dbPTM database.
The list of the integrated external data sources
| Databases | Description | Statistics |
|---|---|---|
| Swiss-Prot ( | Experimental PTMs | 11 025 PTM sites within 4921 proteins |
| Putative PTMs | 72 308 PTM sites within 31 026 proteins | |
| Proteins | 176 469 Proteins | |
| Protein variants | 32 101 Variants corresponding to 6115 proteins | |
| PhosphoELM ( | Experimental phosphorylation sites | 1703 PTM sites within 556 proteins |
| O-GLYCBASE ( | Experimental glycosylation sites | 2353 PTM sites within 185 glycoproteins |
| RESID ( | PTMs | 373 PTM types |
| InterPro ( | Protein domain | 1 113 928 Entries corresponding to 161 988 Swiss-Prot entries |
| Ensembl ( | Human variations | 23 378 Non-synonymous SNPs within 7230 Swiss-Prot human proteins |
| PDB ( | Protein structures | 31 721 Entries corresponding to 6806 Swiss-Prot proteins |
The list of the integrated annotated tools
| Tools | Description |
|---|---|
| KinasePhos ( | Identifying kinase-specific phosphorylation sites |
| KinasePhos-like sulfation | Identifying sulfation sites |
| KinasePhos-like N-linked glycosylation | Identifying N-linked glycosylation sites |
| KinasePhos-like C-linked glycosylation | Identifying C-linked glycosylation sites |
| DSSP ( | Calculating the secondary structure and solvent accessibility of residues |
| RVP-net ( | Predicting the solvent accessibility of residues |
| PSIPRED ( | Predicting the protein secondary structures |
| Weblogo ( | Generating sequence logo for PTM substrates |
The data statistics of the dbPTM database
| PTM types | Substrates | No. of known PTMs | No. of putative PTMs | Total |
|---|---|---|---|---|
| Phosphorylation | Serine, threonine, tyrosine, aspartate, histidine or cysteine | 3367 | 5852 | |
| Serine, threonine and tyrosine (predicted in this resource, ASA > 0%) | 1 346 067 | 661 975 (ASA > 25%) | ||
| Serine, threonine and tyrosine (predicted in this resource, ASA > 25%) | 652 756 | |||
| Glycosylation | N-linked, O-linked and C-linked glycosylation | 4586 | 55 059 | |
| N-linked asparagines and C-linked tryptophane (predicted in this resource, ASA > 0%) | 43 894 | 94 132 (ASA > 25%) | ||
| N-linked asparagines and C-linked tryptophane (predicted in this resource, ASA > 25%) | 33 887 | |||
| Sulfation | Serine, threonine and tyrosine | 144 | 413 | |
| Tyrosine (predicted in this resource, ASA > 0%) | 189 457 | 13 872 (ASA > 25%) | ||
| Tyrosine (predicted in this resource, ASA > 25%) | 13 315 | |||
| Lipidation | GPI-anchor, N-terminal myristoylation and palmitoylation | 520 | 4688 | 5208 |
| Acetylation | N-terminal of some residues and side chain of lysine or cysteine | 1019 | 1580 | 2599 |
| Amidation | Generally at the C-terminal of a mature active peptide after oxidative cleavage of last glycine | 1554 | 523 | 2077 |
| Methylation | Generally of N-terminal phenylalanine, side chain of lysine, arginine, histidine, ralinenes or glutamate and C-terminal cysteine | 455 | 1105 | 1560 |
| Hydroxylation | Generally of ralinenes, aspartate, raline or lysine | 816 | 515 | 1331 |
| Pyrrolidone carboxylic acid | N-terminal glutamine which has formed an internal cyclic lactam | 567 | 408 | 975 |
| Gamma-carboxyglutamic acid | 4-Carboxyglutamate | 343 | 263 | 606 |
| Trimethylation | N6-methylated lysine, N6,N6,N6-trimethyllysine, N,N,N-trimethylalanine | 158 | 294 | 452 |
| Blocked | Unidentified N- or C-terminal blocking group | 108 | 10 | 118 |
| FAD | O-8alpha-FAD tyrosine, Pros-8alpha-FAD histidine, S-8alpha-FAD cysteine and Tele-8alpha-FAD histidine | 12 | 77 | 89 |
| S-nitrosylation | S-nitrosocysteine | 5 | 59 | 64 |
| Formylation | Of the N-terminal methionine | 35 | 27 | 62 |
| Deamidation | Deamidated asparagin and deamidated glutamine (needs to be followed by a G) | 33 | 18 | 51 |
| Citrullination | Citrulline | 7 | 41 | 48 |
| Others | 328 | 1274 | 1134 | |
| Total | 14 057 | 772 154 | 786 211 |
ASA, accessible surface area.
Figure 2The graphical interface reveals the PTMs, the solvent accessibility of the residues, protein variations, protein secondary structures and protein functional domains.