Literature DB >> 24359119

Substrate sequences tell similar stories as binding cavities: commentary.

Julian E Fuchs1, Klaus R Liedl.   

Abstract

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24359119      PMCID: PMC3871284          DOI: 10.1021/ci4005783

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


× No keyword cloud information.
Similarities in binding cavities attract attention for the prediction and doptimization of ligand selectivity. Glinca and Klebe propose a clustering based on physicochemical properties of the binding site analyzed with Cavbase and conclude that their novel cavity-based method tells more than sequences.[5] We agree that protein structures are key in understanding of ligand recognition. Still, we think that sequences can tell a lot, if the focus is shifted away from protein sequences toward substrate sequences. We show that an analysis of protease substrates, inherently containing valuable information about binding site characteristics, can be directly utilized to predict potential off-targets. Selectivity is a central issue in drug design, as drugs frequently hit more than a single target.[1] Therefore, molecular modeling aims at the prediction of polypharmacology with different approaches followed. Applied methods include ligand-based and structure-based methods as well as network analyses.[2−4] Glinca and Klebe demonstrated recently that similarities in physicochemical characteristics of the binding cavity directly relate to overlapping substrate readout.[5] By application to protease test sets they show that their cavity-based approach yields similar results as analysis of ligand data from ChEMBL,[6] thereby outperforming a similarity analysis of protease sequences. Hence, they conclude, that “cavities tell more than sequences”. We definitely agree that structural information on the binding site is crucial in the rationalization of substrate recognition. Still, we think that sequence information can contribute significantly to an understanding of substrate specificity, when the focus is shifted from protease sequences toward substrate sequences. A plethora of protease substrate sequences has been deposited in the MEROPS database in recent years.[7] They are frequently depicted as sequence logos[8] to visualize substrate preferences of proteases. Recently, we showed, how these sequence logos can be utilized to yield a quantitative metric for protease specificity.[9] Thereby, we also showed that information on protein sequences only is insufficient to predict protease specificity. Furthermore, similarities in protease substrate recognition can be directly deduced via analysis of sequence logos.[10] We expect this approach to complement structure-based comparisons, as substrate sequences inherently contain information on binding site characteristics. Substrate peptides probe protease cavities via similar features as Cavbase[11] by binding of hydrophobic and hydrophilic, positively and negatively charged, and aromatic amino acids. We performed a substrate sequence-based similarity analysis of the serine protease test set of Glinca and Klebe. Substrate data was downloaded from MEROPS, normalized to the respective natural abundance of amino acids,[12] and converted to vectors containing 20 amino acid probabilities at 8 substrate position P4 to P4′. After normalization, scalar products of these substrate vectors yield pairwise protease similarites ranging from 0 to 1.[10] Heatmap obtained for clustering of proteases based on similarities in peptide substrates. Deep blue color depicts maximum similarity, whereas red regions show dissimilarity in substrate recognition. Six resulting protease clusters are separated with horizontal lines. A comparison of all eleven serine proteases in the set yields a heat map depicting similarities in protease substrate recognition (see Figure 1). Furthermore, a hierarchical clustering based on complete-linkage yielding six clusters was performed as suggested by Glinca and Klebe.
Figure 1

Heatmap obtained for clustering of proteases based on similarities in peptide substrates. Deep blue color depicts maximum similarity, whereas red regions show dissimilarity in substrate recognition. Six resulting protease clusters are separated with horizontal lines.

The resulting protease similarity map and clustering shows pronounced overlap with the cavity-based analysis of Glinca and Klebe. Thus, substrate sequence analysis shows similar discriminative power as an analysis of binding pockets. Urokinase-type (uPA) and tissue-type plasminogen activator (tPA) form a consistent cluster as in the study of Glinca and Klebe. Furhermore, our clustering nicely groups trypsin, thrombin, and factor Xa (FXa), known to show pronounced overlap in substrate recognition of small molecules.[13] In conclusion we show that sequences can tell a lot on substrate recognition of proteases, if substrate sequences are considered. We are sure that peptide substrates comprise valuable information on protease recognition and propose their usage for the prediction of off-target effects, thereby complementing structure-based approaches.
  13 in total

1.  A new method to detect related function among proteins independent of sequence and fold homology.

Authors:  Stefan Schmitt; Daniel Kuhn; Gerhard Klebe
Journal:  J Mol Biol       Date:  2002-10-18       Impact factor: 5.469

2.  Quantifying the relationships among drug classes.

Authors:  Jérôme Hert; Michael J Keiser; John J Irwin; Tudor I Oprea; Brian K Shoichet
Journal:  J Chem Inf Model       Date:  2008-03-13       Impact factor: 4.956

3.  Drug-target network.

Authors:  Muhammed A Yildirim; Kwang-Il Goh; Michael E Cusick; Albert-László Barabási; Marc Vidal
Journal:  Nat Biotechnol       Date:  2007-10       Impact factor: 54.908

4.  Sequence logos: a new way to display consensus sequences.

Authors:  T D Schneider; R M Stephens
Journal:  Nucleic Acids Res       Date:  1990-10-25       Impact factor: 16.971

5.  Oligopeptide biases in protein sequences and their use in predicting protein coding regions in nucleotide sequences.

Authors:  P McCaldon; P Argos
Journal:  Proteins       Date:  1988

6.  Structural basis for inhibition promiscuity of dual specific thrombin and factor Xa blood coagulation inhibitors.

Authors:  H Nar; M Bauer; A Schmid; J M Stassen; W Wienen; H W Priepke; I K Kauffmann; U J Ries; N H Hauel
Journal:  Structure       Date:  2001-01-10       Impact factor: 5.006

7.  Relating protein pharmacology by ligand chemistry.

Authors:  Michael J Keiser; Bryan L Roth; Blaine N Armbruster; Paul Ernsberger; John J Irwin; Brian K Shoichet
Journal:  Nat Biotechnol       Date:  2007-02       Impact factor: 54.908

8.  ChEMBL: a large-scale bioactivity database for drug discovery.

Authors:  Anna Gaulton; Louisa J Bellis; A Patricia Bento; Jon Chambers; Mark Davies; Anne Hersey; Yvonne Light; Shaun McGlinchey; David Michalovich; Bissan Al-Lazikani; John P Overington
Journal:  Nucleic Acids Res       Date:  2011-09-23       Impact factor: 16.971

9.  MEROPS: the database of proteolytic enzymes, their substrates and inhibitors.

Authors:  Neil D Rawlings; Alan J Barrett; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2011-11-15       Impact factor: 16.971

10.  Large-scale prediction and testing of drug activity on side-effect targets.

Authors:  Eugen Lounkine; Michael J Keiser; Steven Whitebread; Dmitri Mikhailov; Jacques Hamon; Jeremy L Jenkins; Paul Lavan; Eckhard Weber; Allison K Doak; Serge Côté; Brian K Shoichet; Laszlo Urban
Journal:  Nature       Date:  2012-06-10       Impact factor: 49.962

View more
  2 in total

1.  Dynamics Govern Specificity of a Protein-Protein Interface: Substrate Recognition by Thrombin.

Authors:  Julian E Fuchs; Roland G Huber; Birgit J Waldner; Ursula Kahler; Susanne von Grafenstein; Christian Kramer; Klaus R Liedl
Journal:  PLoS One       Date:  2015-10-23       Impact factor: 3.240

Review 2.  Determinants of Macromolecular Specificity from Proteomics-Derived Peptide Substrate Data.

Authors:  Julian E Fuchs; Oliver Schilling; Klaus R Liedl
Journal:  Curr Protein Pept Sci       Date:  2017       Impact factor: 3.272

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.