Literature DB >> 17876813

Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre.

Riccardo M Bennett-Lovsey1, Alex D Herbert, Michael J E Sternberg, Lawrence A Kelley.   

Abstract

Structural and functional annotation of the large and growing database of genomic sequences is a major problem in modern biology. Protein structure prediction by detecting remote homology to known structures is a well-established and successful annotation technique. However, the broad spectrum of evolutionary change that accompanies the divergence of close homologues to become remote homologues cannot easily be captured with a single algorithm. Recent advances to tackle this problem have involved the use of multiple predictive algorithms available on the Internet. Here we demonstrate how such ensembles of predictors can be designed in-house under controlled conditions and permit significant improvements in recognition by using a concept taken from protein loop energetics and applying it to the general problem of 3D clustering. We have developed a stringent test that simulates the situation where a protein sequence of interest is submitted to multiple different algorithms and not one of these algorithms can make a confident (95%) correct assignment. A method of meta-server prediction (Phyre) that exploits the benefits of a controlled environment for the component methods was implemented. At 95% precision or higher, Phyre identified 64.0% of all correct homologous query-template relationships, and 84.0% of the individual test query proteins could be accurately annotated. In comparison to the improvement that the single best fold recognition algorithm (according to training) has over PSI-Blast, this represents a 29.6% increase in the number of correct homologous query-template relationships, and a 46.2% increase in the number of accurately annotated queries. It has been well recognised in fold prediction, other bioinformatics applications, and in many other areas, that ensemble predictions generally are superior in accuracy to any of the component individual methods. However there is a paucity of information as to why the ensemble methods are superior and indeed this has never been systematically addressed in fold recognition. Here we show that the source of ensemble power stems from noise reduction in filtering out false positive matches. The results indicate greater coverage of sequence space and improved model quality, which can consequently lead to a reduction in the experimental workload of structural genomics initiatives. (c) 2007 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2008        PMID: 17876813     DOI: 10.1002/prot.21688

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  237 in total

1.  Molecular modeling studies of Fatty acyl-CoA synthetase (FadD13) from Mycobacterium tuberculosis--a potential target for the development of antitubercular drugs.

Authors:  Nidhi Jatana; Sarvesh Jangid; Garima Khare; Anil K Tyagi; Narayanan Latha
Journal:  J Mol Model       Date:  2010-05-08       Impact factor: 1.810

2.  The major facilitator superfamily-type protein LbtC promotes the utilization of the legiobactin siderophore by Legionella pneumophila.

Authors:  Christa H Chatfield; Brendan J Mulhern; V K Viswanathan; Nicholas P Cianciotto
Journal:  Microbiology (Reading)       Date:  2011-12-08       Impact factor: 2.777

3.  Legionella pneumophila LbtU acts as a novel, TonB-independent receptor for the legiobactin siderophore.

Authors:  Christa H Chatfield; Brendan J Mulhern; Denise M Burnside; Nicholas P Cianciotto
Journal:  J Bacteriol       Date:  2011-01-28       Impact factor: 3.490

4.  Cloning and characterization of high mobility group box protein 1 (HMGB1) of Wuchereria bancrofti and Brugia malayi.

Authors:  Sivasakthivel Thirugnanam; Gnanasekar Munirathinam; Anandharaman Veerapathran; Gajalakshmi Dakshinamoorthy; Maryada V Reddy; Kalyanasundaram Ramaswamy
Journal:  Parasitol Res       Date:  2012-03-09       Impact factor: 2.289

5.  Aicardi-Goutieres syndrome gene and HIV-1 restriction factor SAMHD1 is a dGTP-regulated deoxynucleotide triphosphohydrolase.

Authors:  Rebecca D Powell; Paul J Holland; Thomas Hollis; Fred W Perrino
Journal:  J Biol Chem       Date:  2011-11-07       Impact factor: 5.157

6.  Comparative genomics and evolution of the alpha-defensin multigene family in primates.

Authors:  Sabyasachi Das; Nikolas Nikolaidis; Hiroki Goto; Chelsea McCallister; Jianxu Li; Masayuki Hirano; Max D Cooper
Journal:  Mol Biol Evol       Date:  2010-05-09       Impact factor: 16.240

7.  Insight into the Sialome of the Bed Bug, Cimex lectularius.

Authors:  Ivo M B Francischetti; Eric Calvo; John F Andersen; Van M Pham; Amanda J Favreau; Kent D Barbian; Alvaro Romero; Jesus G Valenzuela; José M C Ribeiro
Journal:  J Proteome Res       Date:  2010-08-06       Impact factor: 4.466

8.  The annotation of full zinc proteomes.

Authors:  Ivano Bertini; Leonardo Decaria; Antonio Rosato
Journal:  J Biol Inorg Chem       Date:  2010-05-05       Impact factor: 3.358

9.  Peptidoglycan crosslinking relaxation promotes Helicobacter pylori's helical shape and stomach colonization.

Authors:  Laura K Sycuro; Zachary Pincus; Kimberley D Gutierrez; Jacob Biboy; Chelsea A Stern; Waldemar Vollmer; Nina R Salama
Journal:  Cell       Date:  2010-05-28       Impact factor: 41.582

10.  Drosophila hold'em is required for a subset of meiotic crossovers and interacts with the dna repair endonuclease complex subunits MEI-9 and ERCC1.

Authors:  Eric F Joyce; S Nikhila Tanneti; Kim S McKim
Journal:  Genetics       Date:  2008-10-28       Impact factor: 4.562

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.