Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Substrate sequences tell similar stories as binding cavities: commentary.

Literature DB >> 24359119

Substrate sequences tell similar stories as binding cavities: commentary.

Julian E Fuchs¹, Klaus R Liedl.

Abstract

Entities: Chemical Disease Gene Species

Mesh：

Substances：
Peptide Hydrolases

Year: 2013 PMID： 24359119 PMCID： PMC3871284 DOI： 10.1021/ci4005783

Source DB: PubMed Journal: J Chem Inf Model ISSN： 1549-9596 Impact factor: 4.956

× No keyword cloud information.

Similarities in binding cavities attract attention for the prediction and doptimization of ligand selectivity. Glinca and Klebe propose a clustering based on physicochemical properties of the binding site analyzed with Cavbase and conclude that their novel cavity-based method tells more than sequences.[5] We agree that protein structures are key in understanding of ligand recognition. Still, we think that sequences can tell a lot, if the focus is shifted away from protein sequences toward substrate sequences. We show that an analysis of protease substrates, inherently containing valuable information about binding site characteristics, can be directly utilized to predict potential off-targets. Selectivity is a central issue in drug design, as drugs frequently hit more than a single target.[1] Therefore, molecular modeling aims at the prediction of polypharmacology with different approaches followed. Applied methods include ligand-based and structure-based methods as well as network analyses.[2−4] Glinca and Klebe demonstrated recently that similarities in physicochemical characteristics of the binding cavity directly relate to overlapping substrate readout.[5] By application to protease test sets they show that their cavity-based approach yields similar results as analysis of ligand data from ChEMBL,[6] thereby outperforming a similarity analysis of protease sequences. Hence, they conclude, that “cavities tell more than sequences”. We definitely agree that structural information on the binding site is crucial in the rationalization of substrate recognition. Still, we think that sequence information can contribute significantly to an understanding of substrate specificity, when the focus is shifted from protease sequences toward substrate sequences. A plethora of protease substrate sequences has been deposited in the MEROPS database in recent years.[7] They are frequently depicted as sequence logos[8] to visualize substrate preferences of proteases. Recently, we showed, how these sequence logos can be utilized to yield a quantitative metric for protease specificity.[9] Thereby, we also showed that information on protein sequences only is insufficient to predict protease specificity. Furthermore, similarities in protease substrate recognition can be directly deduced via analysis of sequence logos.[10] We expect this approach to complement structure-based comparisons, as substrate sequences inherently contain information on binding site characteristics. Substrate peptides probe protease cavities via similar features as Cavbase[11] by binding of hydrophobic and hydrophilic, positively and negatively charged, and aromatic amino acids. We performed a substrate sequence-based similarity analysis of the serine protease test set of Glinca and Klebe. Substrate data was downloaded from MEROPS, normalized to the respective natural abundance of amino acids,[12] and converted to vectors containing 20 amino acid probabilities at 8 substrate position P4 to P4′. After normalization, scalar products of these substrate vectors yield pairwise protease similarites ranging from 0 to 1.[10] Heatmap obtained for clustering of proteases based on similarities in peptide substrates. Deep blue color depicts maximum similarity, whereas red regions show dissimilarity in substrate recognition. Six resulting protease clusters are separated with horizontal lines. A comparison of all eleven serine proteases in the set yields a heat map depicting similarities in protease substrate recognition (see Figure 1). Furthermore, a hierarchical clustering based on complete-linkage yielding six clusters was performed as suggested by Glinca and Klebe.

Figure 1

Heatmap obtained for clustering of proteases based on similarities in peptide substrates. Deep blue color depicts maximum similarity, whereas red regions show dissimilarity in substrate recognition. Six resulting protease clusters are separated with horizontal lines.

The resulting protease similarity map and clustering shows pronounced overlap with the cavity-based analysis of Glinca and Klebe. Thus, substrate sequence analysis shows similar discriminative power as an analysis of binding pockets. Urokinase-type (uPA) and tissue-type plasminogen activator (tPA) form a consistent cluster as in the study of Glinca and Klebe. Furhermore, our clustering nicely groups trypsin, thrombin, and factor Xa (FXa), known to show pronounced overlap in substrate recognition of small molecules.[13] In conclusion we show that sequences can tell a lot on substrate recognition of proteases, if substrate sequences are considered. We are sure that peptide substrates comprise valuable information on protease recognition and propose their usage for the prediction of off-target effects, thereby complementing structure-based approaches.

13 in total

1. A new method to detect related function among proteins independent of sequence and fold homology.

Authors: Stefan Schmitt; Daniel Kuhn; Gerhard Klebe
Journal: J Mol Biol Date: 2002-10-18 Impact factor: 5.469

2. Quantifying the relationships among drug classes.

Authors: Jérôme Hert; Michael J Keiser; John J Irwin; Tudor I Oprea; Brian K Shoichet
Journal: J Chem Inf Model Date: 2008-03-13 Impact factor: 4.956

3. Drug-target network.

Authors: Muhammed A Yildirim; Kwang-Il Goh; Michael E Cusick; Albert-László Barabási; Marc Vidal
Journal: Nat Biotechnol Date: 2007-10 Impact factor: 54.908

4. Sequence logos: a new way to display consensus sequences.

Authors: T D Schneider; R M Stephens
Journal: Nucleic Acids Res Date: 1990-10-25 Impact factor: 16.971

5. Oligopeptide biases in protein sequences and their use in predicting protein coding regions in nucleotide sequences.

Authors: P McCaldon; P Argos
Journal: Proteins Date: 1988

6. Structural basis for inhibition promiscuity of dual specific thrombin and factor Xa blood coagulation inhibitors.

Authors: H Nar; M Bauer; A Schmid; J M Stassen; W Wienen; H W Priepke; I K Kauffmann; U J Ries; N H Hauel
Journal: Structure Date: 2001-01-10 Impact factor: 5.006

7. Relating protein pharmacology by ligand chemistry.

Authors: Michael J Keiser; Bryan L Roth; Blaine N Armbruster; Paul Ernsberger; John J Irwin; Brian K Shoichet
Journal: Nat Biotechnol Date: 2007-02 Impact factor: 54.908

8. ChEMBL: a large-scale bioactivity database for drug discovery.

Authors: Anna Gaulton; Louisa J Bellis; A Patricia Bento; Jon Chambers; Mark Davies; Anne Hersey; Yvonne Light; Shaun McGlinchey; David Michalovich; Bissan Al-Lazikani; John P Overington
Journal: Nucleic Acids Res Date: 2011-09-23 Impact factor: 16.971

9. MEROPS: the database of proteolytic enzymes, their substrates and inhibitors.

Authors: Neil D Rawlings; Alan J Barrett; Alex Bateman
Journal: Nucleic Acids Res Date: 2011-11-15 Impact factor: 16.971

10. Large-scale prediction and testing of drug activity on side-effect targets.

Authors: Eugen Lounkine; Michael J Keiser; Steven Whitebread; Dmitri Mikhailov; Jacques Hamon; Jeremy L Jenkins; Paul Lavan; Eckhard Weber; Allison K Doak; Serge Côté; Brian K Shoichet; Laszlo Urban
Journal: Nature Date: 2012-06-10 Impact factor: 49.962

2 in total

1. Dynamics Govern Specificity of a Protein-Protein Interface: Substrate Recognition by Thrombin.

Authors: Julian E Fuchs; Roland G Huber; Birgit J Waldner; Ursula Kahler; Susanne von Grafenstein; Christian Kramer; Klaus R Liedl
Journal: PLoS One Date: 2015-10-23 Impact factor: 3.240

Review 2. Determinants of Macromolecular Specificity from Proteomics-Derived Peptide Substrate Data.

Authors: Julian E Fuchs; Oliver Schilling; Klaus R Liedl
Journal: Curr Protein Pept Sci Date: 2017 Impact factor: 3.272

2 in total