| Literature DB >> 23203881 |
Ida Schomburg1, Antje Chang, Sandra Placzek, Carola Söhngen, Michael Rother, Maren Lang, Cornelia Munaretto, Susanne Ulas, Michael Stelzer, Andreas Grote, Maurice Scheer, Dietmar Schomburg.
Abstract
The BRENDA (BRaunschweig ENzyme DAtabase) enzyme portal (http://www.brenda-enzymes.org) is the main information system of functional biochemical and molecular enzyme data and provides access to seven interconnected databases. BRENDA contains 2.7 million manually annotated data on enzyme occurrence, function, kinetics and molecular properties. Each entry is connected to a reference and the source organism. Enzyme ligands are stored with their structures and can be accessed via their names, synonyms or via a structure search. FRENDA (Full Reference ENzyme DAta) and AMENDA (Automatic Mining of ENzyme DAta) are based on text mining methods and represent a complete survey of PubMed abstracts with information on enzymes in different organisms, tissues or organelles. The supplemental database DRENDA provides more than 910 000 new EC number-disease relations in more than 510 000 references from automatic search and a classification of enzyme-disease-related information. KENDA (Kinetic ENzyme DAta), a new amendment extracts and displays kinetic values from PubMed abstracts. The integration of the EnzymeDetector offers an automatic comparison, evaluation and prediction of enzyme function annotations for prokaryotic genomes. The biochemical reaction database BKM-react contains non-redundant enzyme-catalysed and spontaneous reactions and was developed to facilitate and accelerate the construction of biochemical models.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23203881 PMCID: PMC3531171 DOI: 10.1093/nar/gks1049
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Preliminary BRENDA-supplied entries of aspartic proteinases
| EC-class | Name | Reaction and specificity |
|---|---|---|
| 3.4.23.B6 | Mason-Pfizer monkey virus proteinase | The enzyme cleaves 17 amino acids of the C-terminal 38-amino-acid cytoplasmic tail of the transmembrane protein TM of the released immature virus. |
| 3.4.23.B10 | Rous sarcoma virus retropepsin | The cleavage sequence in the natural substrate NC-PR is PPAVS-/-LAMTMRR. The activity can be improved by substitution by Trp, Tyr, Phe, Leu, Arg, Glu, His or Ala in P1, Tyr in P3′, and Arg, Phe, Asn or His in P3. |
| 3.4.23.B11 | Spumapepsin | Good cleavage at the peptide bonds: Asn-Thr, Asn-Gln, Asn-Cys and Asn-Ala. |
| 3.4.23.B13 | Proteinase P15 | Efficient cleavage of Ala-Thr-His-Glu-Val-Tyr-Phe(NO2)-Val-Arg-Lys-Ala, no cleavage with Ser, Arg or Glu at P1, Gly or Phe at P2, and Pro at P3. Specifically liberates the five major structural proteins from the common gag precursor, as well as reverse transcriptase and integrase from the gag-pol precursor. |
| 3.4.23.B14 | Plasmepsin IV | Cleavage of hemoglobin. In the S3 and S2 subsites, the plasmepsin 4 orthologs all prefer hydrophobic amino acid residues, Phe or Ile, but reject charged residues such as Lys or Asp. In S2′ and S3′ subsites these plasmepsins tolerate both hydrophobic and hydrophilic residues. |
| 3.4.23.B2 | Simian immunodeficiency virus proteinase | The enzyme may have a wide substrate specificity. Good cleavage of the peptide bonds Met-Met and Tyr-Pro. Cleavage is also observed at Phe-Pro, Phe-Leu, Leu-Phe, Leu-Ala, Glu-Ala and Tyr-Ala. |
| 3.4.23.B3 | Equine infectious anemia virus proteinase | Processing at the authentic HIV-1 PR recognition site and release of the mature p17 matrix and the p24 capsid protein, as a result of the cleavage of the -SQNY-/-PIVQ- cleavage site. |
| 3.4.23.B4 | Feline immunodeficiency virus protease | The enzyme seems to have a preference for Val in P1′ and Phe in P1. In contrast to the HIV-1 protease the feline immunodeficiency virus protease does not cleave the peptide KSGVFVQNGLVK at the Phe-Val bond. Gln in P2′ may be inhibitory. In contrast to HIV-1 protease the feline immunodeficiency virus protease does not cleave peptide KSGNFVVNGLVK at the Phe-Val bond. Asn in P2 may be inhibitory. |
| 3.4.23.B5 | Murine leukemia virus protease | Processing of viral polyprotein. The retroviral protease is essential for virus replication, by processing of viral Gag and Gag-Pol polyproteins. |
| 3.4.23.B8 | Human T-cell leukemia virus type 1 protease | Processing at the authentic HIV-1 PR recognition site and release of the mature p17 matrix and the p24 capsid protein, as a result of the cleavage of the -SQNY-/-PIVQ- cleavage site. |
| 3.4.23.B9 | Bovine leukemia virus protease | The best substrate YDPPAILPII is bearing the natural cleavage site between the matrix and the capsid proteins of BLV Gag precursor. polyprotein. Good cleavage of the peptide bonds: Leu-Pro, Leu-Val, Gly-Val and Leu-Pro. |
| 3.4.23.B1 | Napsin | proteolytic cleavage of polypeptides to large and stable peptides. |
| 3.4.23.B17 | Walleye dermal sarcoma virus proteinase | Processing of viral polyprotein. Preference order for P1 position is Phe > Tyr > Leu, Met > Ala. Gly is preferred at position P3. Ala and Pro are preferred at position P4. Asn, Cys or Leu are preferred at position P2. |
| 3.4.23.B18 | Mouse mammary tumor virus retropepsin | Processing of viral polyprotein. Selective for large aromatic residues (Tyr and Phe) at position P1. Phe and Leu are preferred at position P3. No hydrolysis of substrates with Gly or Ala at position P3. Medium-sized or large hydrophobic residues as Ile, Leu and Phe are preferred at position P4. |
| 3.4.23.B19 | Plasmepsin V | Cleavage of hemoglobin. In contrast to the food vacuole plasmepsins, detergent-solubilized PMV does not bind the aspartic protease inhibitor pepstatin. |
| 3.4.23.B20 | HycD peptidase | This enzyme specifically removes a 15-amino acid peptide from the C-terminus of the precursor of the large subunit of hydrogenase 2 [UniProt: P0ACE0] in |
Increase of the manually annotated data content in BRENDA from 2008 to 2012
| 2008 | 2012 | Increase from 2008 to 2012 (%) | |
|---|---|---|---|
| EC classes | 4824 | 5372 | 11.4 |
| Enzyme names | 59 341 | 85 399 | 43.9 |
| Organisms | 8930 | 10 511 | 17.7 |
| Source/tissue | 53 547 | 87 632 | 63.7 |
| Localization | 21 857 | 30 894 | 41.3 |
| KM-values | 89 012 | 121 298 | 36.3 |
| Ki-values | 19 018 | 33 819 | 77.8 |
| kcat-values | 29 149 | 49 224 | 68.9 |
| Mutant properties | 30 437 | 59 841 | 96.6 |
| Ligand names | 119 315 | 177 850 | 49.1 |
| Ligand structures | 48 651 | 103 222 | 112.2 |
| Substrates/products | 244 236 | 324 591 | 32.9 |
| Inhibitors | 127 146 | 203 727 | 60.2 |
| Stability information | 36 683 | 44 716 | 21.9 |
| References | 84 607 | 126 405 | 49.4 |
The numbers refer to the combination of enzyme-organism-(protein)-value. The numbers of the Ligand Names and Ligand Structures, the Organism and the References specify the unique entries in BRENDA.
Figure 1.Relative distribution of the kinetic categories in the results of the KENDA data.
Figure 2.Abstract from Pubmed (ID 19090111) for α-glucosidase showing the kinetic values highlighted by the KENDA procedure.
Enzyme–disease-related data in the DRENDA database
| Disease | PubMed IDs | EC no. |
|---|---|---|
| Neoplasm | 110 485 | 1272 |
| Infection | 25 937 | 1067 |
| Carcinoma | 22 141 | 829 |
| Breast neoplasm | 18 053 | 667 |
Results of the DRENDA validation process for the classification of the enzyme-disease relationships
| Category | Precision | Recall | F1 Score | Accuracy | Specificity | Entries |
|---|---|---|---|---|---|---|
| therapeutic application 4 | 0.972 | 0.530 | 0.686 | 0.750 | 0.984 | 114 477 |
| therapeutic application 3 | 0.909 | 0.606 | 0.727 | 0.766 | 0.936 | 173 134 |
| therapeutic application 2 | 0.900 | 0.818 | 0.857 | 0.859 | 0.903 | 330 335 |
| therapeutic application 1 | 0.868 | 0.894 | 0.881 | 0.875 | 0.855 | 415 923 |
| ongoing research 4 | 0.800 | 0.229 | 0.356 | 0.571 | 0.939 | 139 238 |
| ongoing research 3 | 0.765 | 0.371 | 0.500 | 0.616 | 0.878 | 252 136 |
| ongoing research 2 | 0.750 | 0.543 | 0.630 | 0.670 | 0.806 | 341 592 |
| ongoing research 1 | 0.720 | 0.686 | 0.702 | 0.700 | 0.714 | 440 043 |
| diagnostic usage 4 | 0.892 | 0.388 | 0.541 | 0.680 | 0.956 | 175 150 |
| diagnostic usage 3 | 0.848 | 0.588 | 0.694 | 0.749 | 0.900 | 279 794 |
| diagnostic usage 2 | 0.784 | 0.682 | 0.730 | 0.754 | 0.822 | 374 820 |
| diagnostic usage 1 | 0.674 | 0.753 | 0.711 | 0.703 | 0.656 | 491 214 |
| causal interaction 4 | 0.923 | 0.249 | 0.392 | 0.508 | 0.964 | 259 620 |
| causal interaction 3 | 0.897 | 0.324 | 0.476 | 0.545 | 0.934 | 335 838 |
| causal interaction 2 | 0.868 | 0.490 | 0.626 | 0.627 | 0.869 | 456 842 |
| causal interaction 1 | 0.848 | 0.627 | 0.721 | 0.691 | 0.803 | 555 127 |
Figure 3.Workflow for matching reactions and compounds in BKM-react.
Figure 4.Statistical view of the search results for E. coli K-12 DH10B in EnzymeDetector.