Literature DB >> 17444516

Protein docking using surface matching and supervised machine learning.

Andrew J Bordner1, Andrey A Gorin.   

Abstract

Computational prediction of protein complex structures through docking offers a means to gain a mechanistic understanding of protein interactions that mediate biological processes. This is particularly important as the number of experimentally determined structures of isolated proteins exceeds the number of structures of complexes. A comprehensive docking procedure is described in which efficient sampling of conformations is achieved by matching surface normal vectors, fast filtering for shape complementarity, clustering by RMSD, and scoring the docked conformations using a supervised machine learning approach. Contacting residue pair frequencies, residue propensities, evolutionary conservation, and shape complementarity score for each docking conformation are used as input data to a Random Forest classifier. The performance of the Random Forest approach for selecting correctly docked conformations was assessed by cross-validation using a nonredundant benchmark set of X-ray structures for 93 heterodimer and 733 homodimer complexes. The single highest rank docking solution was the correct (near-native) structure for slightly more than one third of the complexes. Furthermore, the fraction of highly ranked correct structures was significantly higher than the overall fraction of correct structures, for almost all complexes. A detailed analysis of the difficult to predict complexes revealed that the majority of the homodimer cases were explained by incorrect oligomeric state annotation. Evolutionary conservation and shape complementarity score as well as both underrepresented and overrepresented residue types and residue pairs were found to make the largest contributions to the overall prediction accuracy. Finally, the method was also applied to docking unbound subunit structures from a previously published benchmark set. (c) 2007 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2007        PMID: 17444516     DOI: 10.1002/prot.21406

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  13 in total

1.  A knowledge-based scoring function to assess quaternary associations of proteins.

Authors:  Abhilesh S Dhawanjewar; Ankit A Roy; Mallur S Madhusudhan
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

2.  MDockPP: A hierarchical approach for protein-protein docking and its application to CAPRI rounds 15-19.

Authors:  Sheng-You Huang; Xiaoqin Zou
Journal:  Proteins       Date:  2010-11-15

3.  Human and server docking prediction for CAPRI round 30-35 using LZerD with combined scoring functions.

Authors:  Lenna X Peterson; Hyungrae Kim; Juan Esquivel-Rodriguez; Amitava Roy; Xusi Han; Woong-Hee Shin; Jian Zhang; Genki Terashi; Matt Lee; Daisuke Kihara
Journal:  Proteins       Date:  2016-10-14

4.  DockRank: ranking docked conformations using partner-specific sequence homology-based protein interface prediction.

Authors:  Li C Xue; Rafael A Jordan; Yasser El-Manzalawy; Drena Dobbs; Vasant Honavar
Journal:  Proteins       Date:  2013-10-17

5.  Protein-protein docking benchmark version 3.0.

Authors:  Howook Hwang; Brian Pierce; Julian Mintseris; Joël Janin; Zhiping Weng
Journal:  Proteins       Date:  2008-11-15

6.  PAIRpred: partner-specific prediction of interacting residues from sequence and structure.

Authors:  Fayyaz ul Amir Afsar Minhas; Brian J Geiss; Asa Ben-Hur
Journal:  Proteins       Date:  2013-12-06

7.  The scoring of poses in protein-protein docking: current capabilities and future directions.

Authors:  Iain H Moal; Mieczyslaw Torchala; Paul A Bates; Juan Fernández-Recio
Journal:  BMC Bioinformatics       Date:  2013-10-01       Impact factor: 3.169

8.  Protein-RNA complexes and efficient automatic docking: expanding RosettaDock possibilities.

Authors:  Adrien Guilhot-Gaudeffroy; Christine Froidevaux; Jérôme Azé; Julie Bernauer
Journal:  PLoS One       Date:  2014-09-30       Impact factor: 3.240

9.  Protein-protein docking using region-based 3D Zernike descriptors.

Authors:  Vishwesh Venkatraman; Yifeng D Yang; Lee Sael; Daisuke Kihara
Journal:  BMC Bioinformatics       Date:  2009-12-09       Impact factor: 3.169

10.  Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces.

Authors:  Andrew J Bordner; Andrey A Gorin
Journal:  BMC Bioinformatics       Date:  2008-05-12       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.