Literature DB >> 15822099

Clustering algorithms for identifying core atom sets and for assessing the precision of protein structure ensembles.

David A Snyder1, Gaetano T Montelione.   

Abstract

An important open question in the field of NMR-based biomolecular structure determination is how best to characterize the precision of the resulting ensemble of structures. Typically, the RMSD, as minimized in superimposing the ensemble of structures, is the preferred measure of precision. However, the presence of poorly determined atomic coordinates and multiple "RMSD-stable domains"--locally well-defined regions that are not aligned in global superimpositions--complicate RMSD calculations. In this paper, we present a method, based on a novel, structurally defined order parameter, for identifying a set of core atoms to use in determining superimpositions for RMSD calculations. In addition we present a method for deciding whether to partition that core atom set into "RMSD-stable domains" and, if so, how to determine partitioning of the core atom set. We demonstrate our algorithm and its application in calculating statistically sound RMSD values by applying it to a set of NMR-derived structural ensembles, superimposing each RMSD-stable domain (or the entire core atom set, where appropriate) found in each protein structure under consideration. A parameter calculated by our algorithm using a novel, kurtosis-based criterion, the epsilon-value, is a measure of precision of the superimposition that complements the RMSD. In addition, we compare our algorithm with previously described algorithms for determining core atom sets. The methods presented in this paper for biomolecular structure superimposition are quite general, and have application in many areas of structural bioinformatics and structural biology.

Mesh:

Substances:

Year:  2005        PMID: 15822099     DOI: 10.1002/prot.20402

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  22 in total

Review 1.  A community resource of experimental data for NMR / X-ray crystal structure pairs.

Authors:  John K Everett; Roberto Tejero; Sarath B K Murthy; Thomas B Acton; James M Aramini; Michael C Baran; Jordi Benach; John R Cort; Alexander Eletsky; Farhad Forouhar; Rongjin Guan; Alexandre P Kuzin; Hsiau-Wei Lee; Gaohua Liu; Rajeswari Mani; Binchen Mao; Jeffrey L Mills; Alexander F Montelione; Kari Pederson; Robert Powers; Theresa Ramelot; Paolo Rossi; Jayaraman Seetharaman; David Snyder; G V T Swapna; Sergey M Vorobiev; Yibing Wu; Rong Xiao; Yunhuang Yang; Cheryl H Arrowsmith; John F Hunt; Michael A Kennedy; James H Prestegard; Thomas Szyperski; Liang Tong; Gaetano T Montelione
Journal:  Protein Sci       Date:  2015-09-22       Impact factor: 6.725

2.  Ensemble-based convergence analysis of biomolecular trajectories.

Authors:  Edward Lyman; Daniel M Zuckerman
Journal:  Biophys J       Date:  2006-04-14       Impact factor: 4.033

3.  A robust method for quantitative identification of ordered cores in an ensemble of biomolecular structures by non-linear multi-dimensional scaling using inter-atomic distance variance matrix.

Authors:  Naohiro Kobayashi
Journal:  J Biomol NMR       Date:  2014-01-03       Impact factor: 2.835

4.  Definition and classification of evaluation units for CASP10.

Authors:  Todd J Taylor; Chin-Hsien Tai; Yuanpeng J Huang; Jeremy Block; Hongjun Bai; Andriy Kryshtafovych; Gaetano T Montelione; Byungkook Lee
Journal:  Proteins       Date:  2013-11-22

5.  Recommendations of the wwPDB NMR Validation Task Force.

Authors:  Gaetano T Montelione; Michael Nilges; Ad Bax; Peter Güntert; Torsten Herrmann; Jane S Richardson; Charles D Schwieters; Wim F Vranken; Geerten W Vuister; David S Wishart; Helen M Berman; Gerard J Kleywegt; John L Markley
Journal:  Structure       Date:  2013-09-03       Impact factor: 5.006

6.  Improved technologies now routinely provide protein NMR structures useful for molecular replacement.

Authors:  Binchen Mao; Rongjin Guan; Gaetano T Montelione
Journal:  Structure       Date:  2011-06-08       Impact factor: 5.006

7.  The expanded FindCore method for identification of a core atom set for assessment of protein structure prediction.

Authors:  David A Snyder; Jennifer Grullon; Yuanpeng J Huang; Roberto Tejero; Gaetano T Montelione
Journal:  Proteins       Date:  2014-02

8.  X-ray vs. NMR structures as templates for computational protein design.

Authors:  Michael Schneider; Xiaoran Fu; Amy E Keating
Journal:  Proteins       Date:  2009-10

9.  Distance matrix-based approach to protein structure prediction.

Authors:  Andrzej Kloczkowski; Robert L Jernigan; Zhijun Wu; Guang Song; Lei Yang; Andrzej Kolinski; Piotr Pokarowski
Journal:  J Struct Funct Genomics       Date:  2009-02-18

10.  Accurate automated protein NMR structure determination using unassigned NOESY data.

Authors:  Srivatsan Raman; Yuanpeng J Huang; Binchen Mao; Paolo Rossi; James M Aramini; Gaohua Liu; Gaetano T Montelione; David Baker
Journal:  J Am Chem Soc       Date:  2010-01-13       Impact factor: 15.419

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.