| Literature DB >> 26527717 |
Neil D Rawlings1, Alan J Barrett2, Robert Finn3.
Abstract
The MEROPS database (http://merops.sanger.ac.uk) is an integrated source of information about peptidases, their substrates and inhibitors, which are of great relevance to biology, medicine and biotechnology. The hierarchical classification of the database is as follows: homologous sets of sequences are grouped into a protein species; protein species are grouped into a family; families are grouped into clans. There is a type example for each protein species (known as a 'holotype'), family and clan, and each protein species, family and clan has its own unique identifier. Pages to show the involvement of peptidases and peptidase inhibitors in biological pathways have been created. Each page shows the peptidases and peptidase inhibitors involved in the pathway, along with the known substrate cleavages and peptidase-inhibitor interactions, and a link to the KEGG database of biological pathways. Links have also been established with the IUPHAR Guide to Pharmacology. A new service has been set up to allow the submission of identified substrate cleavages so that conservation of the cleavage site can be assessed. This should help establish whether or not a cleavage site is physiologically relevant on the basis that such a cleavage site is likely to be conserved.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26527717 PMCID: PMC4702814 DOI: 10.1093/nar/gkv1118
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Counts of protein species, families and clans for proteolytic enzymes and protein inhibitors in the MEROPS database
| MEROPS 9.9 | MEROPS 9.13 | |||
|---|---|---|---|---|
| Peptidases | Inhibitors | Peptidases | Inhibitors | |
| Sequences | 413 834 | 28 502 | 523 871 | 74 658 |
| Identifiers (Total) | 4384 | 712 | 4622 | 712 |
| experimentally characterized and sequenced | 2638 | 593 | 2794 | 597 |
| hypothetical from model organisms | 1352 | 0 | 1628 | 0 |
| not active as peptidase or inhibitor | 324 | 115 | 333 | 115 |
| experimentally characterized but unsequenced | 203 | 0 | 203 | 0 |
| pseudogenes | 70 | 0 | 70 | 0 |
| Compound and complex proteins | 16 | 49 | 16 | 56 |
| Families | 244 | 76 | 253 | 79 |
| Clans | 55 | 39 | 61 | 39 |
The numbers in Release 9.13 of MEROPS (July 2015) are compared to those in Release 9.9 of MEROPS (August 2013). A peptidase is referred to as ‘unsequenced’ when no sequence is known, or the known sequence fragments are insufficient to be able to assign the peptidase to a family.
Information in the MEROPS database
| MEROPS 9.9 | MEROPS 9.13 | |
|---|---|---|
| Substrate cleavages: total | 61 357 | 64 470 |
| Substrate cleavages: physiological | 16 580 | 17 131 |
| Substrate cleavages: non-physiological | 35 364 | 36 981 |
| Substrate cleavages: pathological | 1235 | 1426 |
| Substrate cleavages: synthetic substrates | 5235 | 5965 |
| Peptidase-inhibitor interactions: total | 4475 | 6202 |
| Peptidase-inhibitor interactions: proteins | 1373 | 1428 |
| Peptidase-inhibitor interactions: SMI | 2790 | 4419 |
| References | 52 600 | 59 155 |
Substrate cleavage totals do not include cleavages derived only from the SwissProt database (mainly removal of initiating methionines and signal peptides). A naturally occurring cleavage is described as ‘physiological’ when the peptidase and substrate are from the same organism, and ‘pathological’ if the organisms differ and are pathogen and host.
Figure 1.An example of a biological pathway page. The page displaying peptidases and peptidase inhibitors involved in the blood coagulation and complement activation pathway is shown. Only part of each table is shown.
Examples of input and output from the Analyse Substrates service. a) A table showing data for submission to the Analyse Substrates service. The user is asked to supply a plain text file with elements separated by tabs. There should be one cleavage per line, and each cleavage should consist of the MEROPS identifier of the peptidase performing the cleavage, the UniProt accession of the protein substrate, and the residue number after which cleavage occurs (the P1 residue). Residue numbering should be taken from the relevant UniProt entry. Column headings (shown here for guidance only) are not required
| UniProt accession | cleaved at | |
|---|---|---|
| C01.034 | O15144 | 68 |
| M10.003 | O15144 | 208 |
| S08.071 | P97857 | 251 |
| M04.009 | P08697 | 398 |
| S09.007 | P08697 | 30 |
A table of results from a submission to the substrate analysis service. The table shows the MEROPS identifier for the peptidase (submitted by the user); the total number of cleavages in all substrates for this peptidase; the UniProt accession of the protein substrate (submitted by the user); the P1 position in the cleavage (submitted by the user); the number of mismatches from an alignment of homologues from the same UniRef50 entry as the submitted substrate for positions P4 to P4’; and a URL linking to the same alignment on the MEROPS website.
| MEROPS ID | total cleavages | UniProt Acc | homologues in UniRef50 | cleavage | P4 | P3 | P2 | P1 | P1’ | P2’ | P3’ | P4’ | URL |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C01.034 | 756 | O15144 | 32 | 68 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | URL |
| M10.003 | 3417 | O15144 | 32 | 208 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | URL |
| S08.071 | 208 | P97857 | 83 | 251 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | URL |
| M04.009 | 24 | P08697 | 128 | 398 | 50 | 1 | 31 | 31 | 49 | 0 | 31 | 2 | URL |
| S09.007 | 6 | P08697 | 128 | 30 | 46 | 57 | 53 | 53 | 60 | 29 | 64 | 13 | URL |
Figure 2.An example of results returned by the substrate analysis service. Alignment of homologues of alpha-2-antiplasmin showing cleavage at residue 30 by fibroblast activation protein alpha subunit (S09.007). This is the fifth result from the list shown above. The known cleavage occurs in the human protein (UniProt: P08697; 27) and is highlighted in green. Four residues either side of the cleavage site are highlighted. A conserved residue is highlighted in pink. A residue that differs from that in the human sequence is highlighted in orange if the same amino acid is known to occupy the same binding site from another substrate of this peptidase; and in black if the amino acid is not known to occur in the same binding site from any substrate for this peptidase. The high number of residues highlighted in black indicates that this cleavage is unlikely to be physiologically relevant, but note that the number of cleavages known for this peptidase is low.