| Literature DB >> 29145643 |
Neil D Rawlings1, Alan J Barrett1, Paul D Thomas2, Xiaosong Huang2, Alex Bateman1, Robert D Finn1.
Abstract
The MEROPS database (http://www.ebi.ac.uk/merops/) is an integrated source of information about peptidases, their substrates and inhibitors. The hierarchical classification is: protein-species, family, clan, with an identifier at each level. The MEROPS website moved to the EMBL-EBI in 2017, requiring refactoring of the code-base and services provided. The interface to sequence searching has changed and the MEROPS protein sequence libraries can be searched at the EMBL-EBI with HMMER, FastA and BLASTP. Cross-references have been established between MEROPS and the PANTHER database at both the family and protein-species level, which will help to improve curation and coverage between the resources. Because of the increasing size of the MEROPS sequence collection, in future only sequences of characterized proteins, and from completely sequenced genomes of organisms of evolutionary, medical or commercial significance will be added. As an example, peptidase homologues in four proteomes from the Asgard superphylum of Archaea have been identified and compared to other archaean, bacterial and eukaryote proteomes. This has given insights into the origins and evolution of peptidase families, including an expansion in the number of proteasome components in Asgard archaeotes and as organisms increase in complexity. Novel structures for proteasome complexes in archaea are postulated.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29145643 PMCID: PMC5753285 DOI: 10.1093/nar/gkx1134
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Counts of protein-species, families and clans for proteolytic enzymes and protein inhibitors in the MEROPS database
| MEROPS 12.0 (September 2017) | MEROPS 9.13 (July 2015) | |||
|---|---|---|---|---|
| Peptidases (change) | Inhibitors (change) | Peptidases | Inhibitors | |
| Sequences | 908 326 (384 4550 | 134 011 (59 353) | 523 871 | 74 658 |
| Identifiers (Total) | 5267 (645) | 868 (156) | 4622 | 712 |
|
| 3181 (387) | 725 (128) | 2794 | 597 |
|
| 1407 (–221) | 0 | 1628 | 0 |
|
| 357 (24) | 115 (0) | 333 | 115 |
|
| 215 (12) | 0 | 203 | 0 |
|
| 72 (2) | 0 | 70 | 0 |
|
| 17 (1) | 53 (-3) | 16 | 56 |
| Families | 268 (15) | 82 (3) | 253 | 79 |
| Clans | 62 (1) | 39 (0) | 61 | 39 |
The numbers in Release 12.0 of MEROPS (September 2017) are compared to those in Release 9.13 of MEROPS (July 2015). A peptidase is referred to as ‘unsequenced’ when no sequence is known, or the known sequence fragments are insufficient to be able to assign the peptidase to a family. The number of identifiers for hypothetical peptidases from model organisms has decreased because many have now been experimentally characterized.
Information in the MEROPS database
| MEROPS 12.0 (change) | MEROPS 9.13 | |
|---|---|---|
| Substrate cleavages: total | 92 216 (27 746) | 64 470 |
| Substrate cleavages: physiological | 22 100 (4969) | 17 131 |
| Substrate cleavages: non-physiological | 59 485 (22 504) | 36 981 |
| Substrate cleavages: pathological | 1413 (–13) | 1426 |
| Substrate cleavages: synthetic substrates | 6016 (51) | 5965 |
| Peptidase-inhibitor interactions: total | 6455 (253) | 6202 |
| Peptidase-inhibitor interactions: proteins | 1486 | 1428 |
| Peptidase-inhibitor interactions: SMI | 4970 | 4419 |
| References | 64 647 | 59 155 |
Substrate cleavage totals do not include cleavages derived only from the SwissProt database (mainly removal of initiating methionines and signal peptides). A naturally occurring cleavage is described as ‘physiological’ when the peptidase and substrate are from the same organism, and ‘pathological’ if the organisms differ and are pathogen and host.
Comparison of the MEROPS and PANTHER databases
|
| PANTHER | Overlap | % | Unique to | % | |
|---|---|---|---|---|---|---|
| Families | 428 | 14710 | 203 | 47.43 | 225 | 52.57 |
| Identifiers/subfamilies | 4989 | 76032 | 2920 | 58.56 | 2069 | 41.47 |
Counts of the different entry types from the databases are shown. Percentages are calculated only for the MEROPS database.
Commonest source organisms for MEROPS holotypes not in the PANTHER database
| Holotypes | Species (common name) | MEROPS identifiers |
|---|---|---|
| 23 |
| A01.020, C01.024, C01.041, C01.168, I03.004, I04.032, I04.065, I06.002, I12.009, I12.010, I12.950, I12.951, I13.003, I13.005, I25.051, I25.052, I25.053, I25.054, I25.055, I25.056, I25.057, I25.059, I25.060 |
| 21 |
| A01.087, A01.098, C01.071, C01.149, C01.165, I01.024, I01.025, I01.026, I01.027, I01.041, I81.001, I82.001, M16.021, M16.022, S08.141, S08.154, S54.019, S54.020, S54.021, S54.022, S54.023 |
| 15 |
| I03.002, I03.020, I13.002, I13.006, I20.001, I20.950, I25.015, I25.034, I25.035, I25.036, I25.037, I25.038, I25.039, I25.040, I37.001 |
| 14 |
| I01.039, I01.963, I01.964, I01.965, I19.002, I19.003, I19.004, I19.005, I19.006, I19.007, I19.008, I19.009, I19.010, S01.413 |
| 14 |
| C02.022, C10.002, C10.003, C25.001, C25.002, C25.003, C25.004, M13.009, M14.023, S01.527, S09.017, S09.075, S46.001, U32.001 |
| 13 |
| I25.026, M12.138, M12.139, M12.140, M12.170, M12.304, M12.305, M12.320, S01.179, S01.180, S01.353, S01.354, S01.433 |
| 12 |
| I02.034, I02.957, I02.958, I02.960, I02.961, I02.962, I02.963, I02.964, I02.965, I08.010, M12.310, M13.011 |
| 12 |
| A01.026, A01.077, I78.001, M03.009, M19.013, M28.022, M36.001, M77.001, M77.002, M77.003, M77.004, S09.012 |
| 12 |
| M12.022, M12.134, M12.178, M12.315, M12.326, S01.253, S01.331, S01.332, S01.333, S01.338, S01.350, S01.497 |
| 12 |
| M12.014, M12.015, M12.213, M12.238, S01.022, S01.024, S01.048, S01.049, S01.050, S01.126, S01.240, S01.245 |
| 11 |
| C01.086, C39.007, C40.006, M01.002, M03.007, M13.004, S08.019, S08.059, S08.116, S15.001, U27.001 |
| 11 |
| I04.031, I04.066, I04.083, M01.013, M01.030, S01.018, S01.040, S01.427, S01.444, S01.467, S01.484 |
| 10 |
| C10.001, C39.003, C51.002, C60.003, C60.006, C66.001, I69.001, I69.002, S08.020, S08.027 |
| 10 |
| M01.009, M07.001, S08.091, S12.001, S26.025, S33.002, S33.006, S33.007, S33.992, S37.001 |
The number of holotypes, source organism and MEROPS identifiers are shown.
Figure 1.An example peptidase summary page showing cross-references to external databases. The page displaying the summary for cathepsin B is shown and includes cross-references to the Enzyme Nomenclature, TreeFam and PANTHER databases.
Figure 2.Examples of proteasome complexes. Schematic representations of proteasome complexes from selected organisms are shown. The predicted relationship between the organisms is shown as a simplified phylogenetic tree at the left of the figure. The symmetry of each ring, ring composition and structure of the complexes are shown; these are hypothetical for proteasomes from Pyrococcus, Sulfolobus and the Lokiarchaeote. Homologues of peptidase family T1 are shown as filled spheres: alpha subunits are shown in shades of blue, and beta subunits in shades of green (proteolytically active subunits) and grey (proteolytically inactive). The unrelated HslU subunits are shown as orange spheres.