Tianshun Gao, Zexian Liu, Yongbo Wang, Han Cheng, Qing Yang, Anyuan Guo, Jian Ren, Yu Xue.
Abstract
In this work, we developed UUCD (http://uucd.biocuckoo.org), a family-based database for ubiquitin and ubiquitin-like conjugation, one of the most important post-translational modifications, which regulates a variety of cellular processes through a similar E1 (ubiquitin-activating enzyme)-E2 (ubiquitin-conjugating enzyme)-E3 (ubiquitin-protein ligase) enzyme thioester cascade. Although extensive experimental efforts have been made, an integrative data resource is still not available. From the scientific literature, 26 E1s, 105 E2s, 1003 E3s and 148 deubiquitination enzymes (DUBs) were collected and classified into 1, 3, 19 and 7 families, respectively. To computationally characterize potential enzymes in eukaryotes, we constructed 1, 1, 15 and 6 hidden Markov model (HMM) profiles for E1s, E2s, E3s and DUBs, respectively, at the family level. Moreover, ortholog searches were conducted for E3 and DUB families without HMM profiles. The resulting UUCD database contains 738 E1s, 2937 E2s, 46 631 E3s and 6647 DUBs from 70 eukaryotic species, together with detailed annotations and classifications. The online service of UUCD was implemented in PHP + MySQL + JavaScript + Perl.
Year: 2012 PMID: 23172288 PMCID: PMC3531133 DOI: 10.1093/nar/gks1103
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The data statistics of known E1, E2, E3 and DUB proteins
| Organism | E1 | E2 | E3 | DUB | Total |
|---|---|---|---|---|---|
|  | 8 | 39 | 475 | 91 | 613 |
|  | 2 | 6 | 71 | 14 | 93 |
|  | 0 | 5 | 46 | 4 | 55 |
|  | 5 | 6 | 57 | 2 | 70 |
|  | 3 | 16 | 78 | 19 | 116 |
|  | 2 | 4 | 39 | 6 | 51 |
|  | 5 | 27 | 182 | 10 | 224 |
| Others | 1 | 2 | 55 | 2 | 60 |
| Total | 26 | 105 | 1003 | 148 | 1282 |
From the scientific literature, we manually collected experimentally identified E1s, E2s, E3s and DUBs.
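The totals in the table can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration, not part of UUCD: it copies the per-row counts from the table (column order E1, E2, E3, DUB) and recomputes the row, column and grand totals. The variable names are chosen here for the example.

```python
# Known-enzyme counts per table row, in the column order E1, E2, E3, DUB.
# Organism labels are left out because the check only needs the numbers.
rows = [
    (8, 39, 475, 91),
    (2, 6, 71, 14),
    (0, 5, 46, 4),
    (5, 6, 57, 2),
    (3, 16, 78, 19),
    (2, 4, 39, 6),
    (5, 27, 182, 10),
    (1, 2, 55, 2),  # the "Others" row
]

# Sum across each row (per-organism total) and down each column (per-class total).
row_totals = [sum(r) for r in rows]
column_totals = [sum(col) for col in zip(*rows)]
grand_total = sum(row_totals)

print(row_totals)     # [613, 93, 55, 70, 116, 51, 224, 60]
print(column_totals)  # [26, 105, 1003, 148]
print(grand_total)    # 1282
```

The recomputed column totals match the 26 E1s, 105 E2s, 1003 E3s and 148 DUBs reported in the abstract.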
Figure 1. The heat map of the classifications and protein numbers of E1s, E2s, DUBs and several major groups for E3 ligases.
Figure 2. The browse option of UUCD. We provided two strategies for browsing the database: (A) by species and (B) by classifications. (C) For each family, a brief description and the associated family members are presented. (D) The detailed information of human β-TrCP.
Figure 3. The search and advance options. (A) The database can be queried with one or multiple keywords. (B) Advance search allows users to input up to three terms for a more precise search. (C) The BLAST search option searches the database with a single protein sequence in FASTA format. (D) The HMM search option scans the input protein sequence against the pre-constructed HMM profiles.
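Because the sequence-based searches take a single protein sequence in FASTA format, it can help to check a query before submitting it. The following is a minimal sketch under stated assumptions, not UUCD's own validation logic: the function name `parse_single_fasta`, the accepted residue alphabet and the example peptide are all hypothetical choices made for illustration.

```python
# Assumed alphabet: the 20 standard amino acids plus common ambiguity codes,
# selenocysteine (U), pyrrolysine (O) and a stop marker (*).
VALID_RESIDUES = set("ACDEFGHIKLMNPQRSTVWYBXZUO*")


def parse_single_fasta(text):
    """Return (header, sequence) for input holding exactly one FASTA record."""
    # Drop blank lines and surrounding whitespace.
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("input must start with a '>' header line")
    # A second header line means more than one record was pasted.
    if any(ln.startswith(">") for ln in lines[1:]):
        raise ValueError("expected exactly one sequence")
    # Join wrapped sequence lines into one upper-case string.
    sequence = "".join(lines[1:]).upper()
    if not sequence:
        raise ValueError("sequence is empty")
    bad = set(sequence) - VALID_RESIDUES
    if bad:
        raise ValueError(f"unexpected characters: {sorted(bad)}")
    return lines[0][1:], sequence


# Made-up peptide, wrapped over two lines as FASTA files often are.
header, sequence = parse_single_fasta(">example\nMDPAE\nllk\n")
print(header, sequence)  # example MDPAELLK
```

Rejecting multi-record input up front mirrors the one-sequence-per-query design described in the caption; a stricter or looser residue alphabet may be appropriate depending on the search tool behind the form.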