Literature DB >> 9829692

Extraction of geometrically similar substructures: least-squares and Chebyshev fitting and the difference distance matrix.

A M Lesk1.   

Abstract

In analysis, comparison and classification of conformations of proteins, a common computational task involves extractions of similar substructures. Structural comparisons are usually based on either of two measures of similarity: the root-mean-square (r.m.s.) deviation upon optimal superposition, or the maximal element of the difference distance matrix. The analysis presented here clarifies the relationships between different measures of structural similarity, and can provide a basis for developing algorithms and software to extract all maximal common well-fitting substructures from proteins. Given atomic coordinates of two proteins, many methods have been described for extracting some substantial (if not provably maximal) common substructure with low r.m.s. deviation. This is a relatively easy task compared with the problem addressed here, i.e., that of finding all common substructures with r.m.s. deviation less than a prespecified threshold. The combinatorial problems associated with similar subset extraction are more tractable if expressed in terms of the maximal element of the difference distance matrix than in terms of the r.m.s. deviation. However, it has been difficult to correlate these alternative measures of structural similarity. The purpose of this article is to make this connection. We first introduce a third measure of structural similarity: the maximum distance between corresponding pairs of points after superposition to minimize this value. This corresponds to fitting in the Chebyshev norm. Properties of Chebyshev superposition are derived. We describe relationships between the r.m.s. and minimax (Chebyshev) deviations upon optimal superposition, and between the Chebyshev deviation and the maximal element of the difference distance matrix. Combining these produces a relationship between the r.m.s. deviation upon optimal superposition and the maximal element of the difference distance matrix. Based on these results, we can apply algorithms and software for finding subsets of the difference distance matrix for which all elements are less than a specified bound, either to select only subsets for which the r.m.s.deviation is less than or equal to a specified threshold, or to select subsets that include all subsets for which the r.m.s. deviation is less than or equal to a threshold.

Mesh:

Year:  1998        PMID: 9829692

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  4 in total

1.  Automatic classification of protein structure by using Gauss integrals.

Authors:  Peter Rogen; Boris Fain
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-27       Impact factor: 11.205

2.  Prediction of plasma protein binding of drugs using Kier-Hall valence connectivity indices and 4D-fingerprint molecular similarity analyses.

Authors:  Jianzhong Liu; Liu Yang; Yi Li; Dahua Pan; Anton J Hopfinger
Journal:  J Comput Aided Mol Des       Date:  2005-11-03       Impact factor: 3.686

3.  Robust probabilistic superposition and comparison of protein structures.

Authors:  Martin Mechelke; Michael Habeck
Journal:  BMC Bioinformatics       Date:  2010-07-01       Impact factor: 3.169

4.  Homology Models and Molecular Dynamics Simulations of Main Proteinase from Coronavirus Associated with Severe Acute Respiratory Syndrome (SARS).

Authors:  Hsuan-Liang Liu; Jin-Chung Lin; Yih Ho; Wei-Chan Hsieh; Chin-Wen Chen; Yuan-Chen Su
Journal:  J Chin Chem Soc       Date:  2013-09-25       Impact factor: 1.967

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.