| Literature DB >> 25324316 |
Nozomi Nagano1, Naoko Nakayama2, Kazuyoshi Ikeda3, Masaru Fukuie3, Kiyonobu Yokota3, Takuo Doi3, Tsuyoshi Kato4, Kentaro Tomii2.
Abstract
The EzCatDB database (http://ezcatdb.cbrc.jp/EzCatDB/) has emphasized manual classification of enzyme reactions from the viewpoints of enzyme active-site structures and their catalytic mechanisms based on literature information, amino acid sequences of enzymes (UniProtKB) and the corresponding tertiary structures from the Protein Data Bank (PDB). Reaction types such as hydrolysis, transfer, addition, elimination, isomerization, hydride transfer and electron transfer have been included in the reaction classification, RLCP. This database includes information related to ligand molecules on the enzyme structures in the PDB data, classified in terms of cofactors, substrates, products and intermediates, which are also necessary to elucidate the catalytic mechanisms. Recently, the database system was updated. The 3D structures of active sites for each PDB entry can be viewed using Jmol or Rasmol software. Moreover, sequence search systems of two types were developed for the EzCatDB database: EzCat-BLAST and EzCat-FORTE. EzCat-BLAST is suitable for quick searches, adopting the BLAST algorithm, whereas EzCat-FORTE is more suitable for detecting remote homologues, adopting the algorithm for FORTE protein structure prediction software. Another system, EzMetAct, is also available to searching for major active-site structures in EzCatDB, for which PDB-formatted queries can be searched.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25324316 PMCID: PMC4384017 DOI: 10.1093/nar/gku946
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Output for an NRPS sequence (UniProtKB; B8NXQ7) by EzCat-BLAST. (A) Top page. In the ‘RESULT SUMMARY’, sequence matches between query and hit sequences are shown as colored bars along with active-site matches. For this query sequence, four adenylation domains of NRPS, which are homologous to the representative sequences from EzCatDB entries, M00170, M00347 and M00009, were detected. In the ‘RESULT LIST’, the corresponding EzCatDB and UniProtKB IDs for the top hit sequences are shown with the position of the query sequence, bit score, E-value and active-site conservation for each hit sequence. (B) Alignment display for the query and hit sequences. Active-site residues are colored according to the type of active-site residues. Sidechain catalytic residues are presented in magenta, whereas mainchain catalytic residues are shown in green. The cofactor binding residue is shown in orange. The link to EzCatDB entry is also shown with the alignment data.
Figure 2.Output for a methyltransferase sequence (UniProtKB; Q2U5R7) by EzCat-FORTE. In the ‘RESULT LIST’ of top page, sequence matches between query and hit sequences are shown by colored bars. The corresponding EzCatDB IDs for the hit sequences are presented with Z-score, positions of query and hit sequences and E.C. numbers for each hit. For this query sequence, a domain for methyltransferase, which is homologous to the representative sequences from EzCatDB entries, S00637, D00823 and D00075, can be detected. In the alignment display, alignment of the query and top hit sequence (EzCatDB ID; S00637) are shown. Active-site residues for the hit sequence are colored. If the matched residue of the query sequence is the same as the active-site residues of the hit sequence, then it can also be colored. The link to EzCatDB entry is also shown in the table of the alignment display.
Current contents of EzCatDB
| Related data | Descriptiona | Number of corresponding dataa,b |
|---|---|---|
| DB codes for EzCatDB | Entry ID in this EzCatDB database | 871 EzCatDB entries |
| RLCP classification | Hierarchic classification of reaction mechanisms | 516 EzCatDB entries for 439 RLCP classes |
| Compound data | Compound data for substrates/products/cofactors | 1106 compound data |
| 1020 KEGG compounds ( | ||
| Intermediate data | Compound data for intermediates | 151 intermediate data |
| 3 KEGG compounds ( | ||
| E.C. numbers | Enzyme commission number ( | 667 E.C. numbers |
| Catalytic domains | Catalytic domains of enzyme structures that have active-site residues, based on CATH classification ( | 335 CATH domains ( |
| UniProtKB | Protein sequence data from UniProtKB ( | 1612 sequences (1413 representative sequences) |
| PDB | Tertiary structure data from Protein Data Bank ( | 6704 PDB entries ( |
aReference numbers are shown in parentheses.
bThe number of data might change through update.