Jianyi Yang¹, Ambrish Roy, Yang Zhang.
¹ Department of Computational Medicine and Bioinformatics and Department of Biological Chemistry, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218, USA.
Abstract
MOTIVATION: Identification of protein-ligand binding sites is critical to protein function annotation and drug discovery. However, no single method generates optimal binding site predictions for all protein types. Combining complementary predictions is probably the most reliable solution to the problem.
RESULTS: We develop two new methods for complementary binding site prediction, one based on binding-specific substructure comparison (TM-SITE) and the other on sequence profile alignment (S-SITE). The methods are tested on a set of 500 non-redundant proteins harboring 814 natural, drug-like and metal ion molecules. Starting from low-resolution protein structure predictions, the methods successfully recognize >51% of binding residues, with an average Matthews correlation coefficient (MCC) significantly higher (P-value < 10^-9 in Student's t-test) than other state-of-the-art methods, including COFACTOR, FINDSITE and ConCavity. When TM-SITE and S-SITE are combined with other structure-based programs, a consensus approach (COACH) can increase the MCC by 15% over the best individual predictions. COACH was examined in the recent community-wide CAMEO experiment and consistently ranked as the best method in the last 22 individual datasets, with an Area Under the Curve score 22.5% higher than that of the second-best method. These data demonstrate a new robust approach to protein-ligand binding site recognition, which is ready for genome-wide structure-based function annotation.
AVAILABILITY: http://zhanglab.ccmb.med.umich.edu/COACH/
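For readers unfamiliar with the MCC used as the accuracy measure above, the following is a minimal sketch of how it can be computed for a residue-level binding site prediction. It uses the standard MCC definition; the function name `binding_site_mcc`, its arguments and the example residue numbers are hypothetical and are not taken from the TM-SITE, S-SITE or COACH code.

```python
import math


def binding_site_mcc(predicted, actual, n_residues):
    """Matthews correlation coefficient for a binding-residue prediction.

    predicted, actual: iterables of residue indices labelled as binding
    (hypothetical input format, chosen for illustration only).
    n_residues: total number of residues in the protein chain.
    """
    predicted, actual = set(predicted), set(actual)
    tp = len(predicted & actual)      # binding residues correctly predicted
    fp = len(predicted - actual)      # predicted as binding, but not binding
    fn = len(actual - predicted)      # binding residues that were missed
    tn = n_residues - tp - fp - fn    # non-binding residues correctly left out
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when the denominator vanishes
    # (e.g. nothing was predicted as binding).
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy example: 20-residue chain, 3 of 4 true binding residues recovered.
print(binding_site_mcc({1, 2, 3, 4}, {2, 3, 4, 5}, 20))
```

An MCC of 1 corresponds to a perfect prediction and 0 to a prediction no better than random, which makes the measure suitable when binding residues are a small fraction of the chain.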