| Literature DB >> 19909539 |
Demet Sirim1, Florian Wagner, Andrey Lisitsa, Jürgen Pleiss.
Abstract
BACKGROUND: Cytochrome P450 monooxygenases (CYPs) form a vast and diverse enzyme class of particular interest in drug development and a high biotechnological potential. Although very diverse in sequence, they share a common structural fold. For the comprehensive and systematic comparison of protein sequences and structures the Cytochrome P450 Engineering Database (CYPED) was established. It was built up based on an extensible data model that enables its functions readily enhanced. DESCRIPTION: The new version of the CYPED contains information on sequences and structures of 8613 and 47 proteins, respectively, which strictly follow Nelson's classification rules for homologous families and superfamilies. To gain biochemical information on substrates and inhibitors, the CYPED was linked to the Cytochrome P450 Knowledgebase (CPK). To overcome differences in the data model and inconsistencies in the content of CYPED and CPK, a metric was established based on sequence similarity to link protein sequences as primary keys. In addition, the annotation of structurally and functionally relevant residues was extended by a reliable prediction of conserved secondary structure elements and by information on the effect of single nucleotide polymorphisms.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19909539 PMCID: PMC2779185 DOI: 10.1186/1471-2091-10-27
Source DB: PubMed Journal: BMC Biochem ISSN: 1471-2091 Impact factor: 4.059
Figure 1CYPED - CPK integration pipeline. Identification and assignment algorithm of the CYPED proteins and the corresponding CPK entries. The steps of the algorithm involve a BLAST search of each of the CYPED entries against the CPK, a ranking of the hits by E-value and a final pairwise alignment of the original CYPED entry with the corresponding CPK-hits to obtain the percentage identity.