Jinchao Yu1, Geraldine Picord2, Pierre Tuffery2, Raphael Guerois1. 1. Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, University Paris-Saclay, CEA-Saclay, 91191 Gif-sur-Yvette. 2. INSERM U973, MTi, F-75205 Paris, Université Paris Diderot, Sorbonne Paris Cité F-75205 Paris and Ressource Parisienne en Bioinformatique Structurale, F-75205 Paris, France.
Abstract
MOTIVATION: The HHsearch algorithm, implementing a hidden Markov model (HMM)-HMM alignment method, has shown excellent alignment performance in the so-called twilight zone (target-template sequence identity with ∼20%). However, an optimal alignment by HHsearch may contain small to large errors, leading to poor structure prediction if these errors are located in important structural elements. RESULTS: HHalign-Kbest server runs a full pipeline, from the generation of suboptimal HMM-HMM alignments to the evaluation of the best structural models. In the HHsearch framework, it implements a novel algorithm capable of generating k-best HMM-HMM suboptimal alignments rather than only the optimal one. For large proteins, a directed acyclic graph-based implementation reduces drastically the memory usage. Improved alignments were systematically generated among the top k suboptimal alignments. To recognize them, corresponding structural models were systematically generated and evaluated with Qmean score. The method was benchmarked over 420 targets from the SCOP30 database. In the range of HHsearch probability of 20-99%, average quality of the models (TM-score) raised by 4.1-16.3% and 8.0-21.0% considering the top 1 and top 10 best models, respectively. AVAILABILITY AND IMPLEMENTATION: http://bioserv.rpbs.univ-paris-diderot.fr/services/HHalign-Kbest/ (source code and server). CONTACT: guerois@cea.fr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: The HHsearch algorithm, implementing a hidden Markov model (HMM)-HMM alignment method, has shown excellent alignment performance in the so-called twilight zone (target-template sequence identity with ∼20%). However, an optimal alignment by HHsearch may contain small to large errors, leading to poor structure prediction if these errors are located in important structural elements. RESULTS: HHalign-Kbest server runs a full pipeline, from the generation of suboptimal HMM-HMM alignments to the evaluation of the best structural models. In the HHsearch framework, it implements a novel algorithm capable of generating k-best HMM-HMM suboptimal alignments rather than only the optimal one. For large proteins, a directed acyclic graph-based implementation reduces drastically the memory usage. Improved alignments were systematically generated among the top k suboptimal alignments. To recognize them, corresponding structural models were systematically generated and evaluated with Qmean score. The method was benchmarked over 420 targets from the SCOP30 database. In the range of HHsearch probability of 20-99%, average quality of the models (TM-score) raised by 4.1-16.3% and 8.0-21.0% considering the top 1 and top 10 best models, respectively. AVAILABILITY AND IMPLEMENTATION: http://bioserv.rpbs.univ-paris-diderot.fr/services/HHalign-Kbest/ (source code and server). CONTACT: guerois@cea.fr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Carlyle Ribeiro Lima; Nicolas Carels; Ana Carolina Ramos Guimaraes; Pierre Tufféry; Philippe Derreumaux Journal: J Mol Model Date: 2016-09-24 Impact factor: 1.810
Authors: Georg Mohr; Sukrit Silas; Jennifer L Stamos; Kira S Makarova; Laura M Markham; Jun Yao; Patricia Lucas-Elío; Antonio Sanchez-Amat; Andrew Z Fire; Eugene V Koonin; Alan M Lambowitz Journal: Mol Cell Date: 2018-10-18 Impact factor: 17.970
Authors: Oleg Klykov; Carmen van der Zwaan; Albert J R Heck; Alexander B Meijer; Richard A Scheltema Journal: Proc Natl Acad Sci U S A Date: 2020-01-10 Impact factor: 11.205
Authors: Winston X Yan; Shaorong Chong; Huaibin Zhang; Kira S Makarova; Eugene V Koonin; David R Cheng; David A Scott Journal: Mol Cell Date: 2018-03-15 Impact factor: 17.970