| Literature DB >> 17142225 |
Yoshinobu Igarashi1, Alexey Eroshkin, Svetlana Gramatikova, Kosi Gramatikoff, Ying Zhang, Jeffrey W Smith, Andrei L Osterman, Adam Godzik.
Abstract
Beyond the well-known role of proteolytic machinery in protein degradation and turnover, many specialized proteases play a key role in various regulatory processes. Thousands of highly specific proteolytic events are associated with normal and pathological conditions, including bacterial and viral infections. However, the information about individual proteolytic events is dispersed over multiple publications and is not easily available for large-scale analysis. CutDB is one of the first systematic efforts to build an easily accessible collection of documented proteolytic events for natural proteins in vivo or in vitro. A CutDB entry is defined by a unique combination of these three attributes: protease, protein substrate and cleavage site. Currently, CutDB integrates 3070 proteolytic events for 470 different proteases captured from public archives (such as MEROPS and HPRD) and publications. CutDB supports various types of data searches and displays, including clickable network diagrams. Most importantly, CutDB is a community annotation resource based on a Wikipedia approach, providing a convenient user interface to input new data online. A recent contribution of 568 proteolytic events by several experts in the field of matrix metallopeptidases suggests that this approach will significantly accelerate the development of CutDB content. CutDB is publicly available at http://cutdb.burnham.org.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17142225 PMCID: PMC1669773 DOI: 10.1093/nar/gkl813
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1(a) Front page of CutDB. (b) Search result using ‘M10.001 (matrix metallopeptidase-1)’ of MEROPS peptidase code. (c) Record describing ‘M10.001’ cleaves matrix metallopeptidase-1 preproprotein. (d) Network diagram centered around ‘M10.001’. (e) Structures of matrix metallopeptidase-1 in PDB with highlighted cut sites.
Objects and their attributes in the CutDB record
| Objects | Attributes |
|---|---|
| Protease | Name, organism, IDs |
| Substrate | Name, organism, IDs, sequence, cut site, size of products |
| Biological context | Consequences (e.g. activation/inactivation), pathway, tissue, cell line, cellular localization, disease |
| Metadata | Confirmed in experiments or predicted, methods of identification, original citation, curator, curated date, comments |
Current number of proteolytic event records in CutDB
| Protease family | Number of records |
|---|---|
| Aspartic | 339 |
| Cysteine | 592 |
| Glutamic | 45 |
| Metallo | 1218 |
| Serine | 880 |
| Threonine | 6 |
Number of proteolytic events in major metallopeptidases
| Protease name | Peptidase code by MEROPS | Number of records |
|---|---|---|
| MMP-1 | M10.001 | 41 |
| MMP-8 | M10.002 | 16 |
| MMP-2 | M10.003 | 54 |
| MMP-9 | M10.004 | 177 |
| MMP-3 | M10.005 | 118 |
| MMP-7 | M10.008 | 75 |
| MMP-13 | M10.013 | 59 |
| MT1-MMP | M10.014 | 90 |
| MMP-20 | M10.019 | 18 |
| MMP-26 | M10.029 | 12 |
Figure 2(a) When the user inputs ‘cadhe’ into substrate definition, every human protein definition in RefSeq that contains ‘cadhe’ in the definition is immediately displayed in the pull-down menu. (b) When the user clicks one of the protein names, the server starts to search protein names and returns the chosen protein's organism name to the next item on the Web browser. (c) When the user selects ‘Homo sapiens’ from the organism names, the NCBI GI number that contains both the protein and organism names is displayed in the next pull-down menu. (d) The user selects one of the NCBI GI numbers; then, its amino acid sequence and gene symbol appear.