| Literature DB >> 24412978 |
Syed Asad Rahman1, Sergio Martinez Cuesta1, Nicholas Furnham2, Gemma L Holliday2, Janet M Thornton1.
Abstract
We present EC-BLAST (http://www.ebi.ac.uk/thornton-srv/software/rbl/), an algorithm and Web tool for quantitative similarity searches between enzyme reactions at three levels: bond change, reaction center and reaction structure similarity. It uses bond changes and reaction patterns for all known biochemical reactions derived from atom-atom mapping across each reaction. EC-BLAST has the potential to improve enzyme classification, identify previously uncharacterized or new biochemical transformations, improve the assignment of enzyme function to sequences, and assist in enzyme engineering.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24412978 PMCID: PMC4122987 DOI: 10.1038/nmeth.2803
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547
Figure 1Description of EC-BLAST model highlighting the overall process diagram.
Figure 2All-by-all comparison across ~6000 mapped representative enzyme reactions in the EC-BLAST database.
a) Typical output from a reaction query search as ranked list of reactions. The searches are based on the reaction similarity metrics for i) bond changes, ii) reaction centres and iii) structure similarity. The colours in the arrows illustrate similarity between reactions; green being highly similar through to red as the most dissimilar reaction. b) The distribution of Jaccard similarity scores for the three different metrics is shown as density plots for (T) above 0. The yellow violin shapes indicate the kernel density estimation of the data at different scores, the thick black line indicates the middle 2 quadrants in the distribution of each score and the white circle gives the median for each metric. c) The accuracy plot for the prediction of IUBMB EC sub-subclass derived for the three metrics for a given cut-off. d) ROC plot highlights the ability of the similarity metrics to retrieve the IUBMB EC sub-subclass matches.
Figure 3Characterising the universe of Enzyme Reactions using EC-BLAST.
a) Distribution of overall top 20 bond changes in the 6 primary IUBMB EC classes calculated from ~ 6000 reactions; b) Hierarchical clustering of IUBMB EC classes based on bond changes using Euclidean distance and Ward method. The ‘C(R/S)’ sign denotes stereo changes associated with carbon chiral inversion. The ‘%’ sign denotes bond changes in a ring system and the ‘*’ sign stands for a two-headed arrow denoting a change of bond order. The top 5 Bond changes in the 6 IUBMB EC primary classes are shown. Clustering of 5073 representative reactions, using a combination of bond and reaction centre similarity scores. Each circle represents one reaction, coloured by primary IUBMB EC class. c) All reaction similarity clusters with p < 0.01 and cluster size >= 3 reactions are shown arranged in a network according to reaction similarity. Circles indicate two clusters (case i & ii) with mixed EC classes. d) All reaction similarity clusters with p < 0.01 and cluster size > 10 reactions are shown.