
Discovering rules for protein-ligand specificity using support vector inductive logic programming.

Lawrence A Kelley, Paul J Shrimpton, Stephen H Muggleton, Michael J E Sternberg.

Abstract

Structural genomics initiatives are rapidly generating vast numbers of protein structures. Comparative modelling is also capable of producing accurate structural models for many protein sequences. However, for many of the known structures, functions are not yet determined, and in many modelling tasks, an accurate structural model does not necessarily tell us about function. Thus, there is a pressing need for high-throughput methods for determining function from structure. The spatial arrangement of key amino acids in a folded protein, on the surface or buried in clefts, is often the determinant of its biological function. A central aim of molecular biology is to understand the relationship between such substructures or surfaces and biological function, leading both to function prediction and to function design. We present a new general method for discovering the features of binding pockets that confer specificity for particular ligands. The method uses a recently developed machine-learning technique that couples the rule-discovery approach of inductive logic programming with the statistical learning power of support vector machines. On a large benchmark set, it discriminates with high precision (90%) and recall (86%) between pockets that bind FAD and those that bind NAD, given only the geometry and composition of the backbone of the binding pocket and without the use of docking. In addition, we learn rules governing this specificity, which can feed into protein functional design protocols. An analysis of the rules found suggests that key features of the binding pocket may be tied to conformational freedom in the ligand. The representation is sufficiently general to be applicable to any discriminatory binding problem. All programs and data sets are freely available to non-commercial users at http://www.sbg.bio.ic.ac.uk/svilp_ligand/.

Year: 2009   PMID: 19574295   PMCID: PMC3913550   DOI: 10.1093/protein/gzp035

Source DB: PubMed   Journal: Protein Eng Des Sel   ISSN: 1741-0126   Impact factor: 1.650


