
Distance matrix-based approach to protein structure prediction.

Andrzej Kloczkowski, Robert L Jernigan, Zhijun Wu, Guang Song, Lei Yang, Andrzej Kolinski, Piotr Pokarowski.

Abstract

Much structural information is encoded in the internal distances; a distance matrix-based approach can be used to predict protein structure and dynamics, and for structural refinement. Our approach is based on the square distance matrix $D = [r_{ij}^2]$ containing all square distances between residues in proteins. This distance matrix contains more information than the contact matrix $C$, whose elements are either 0 or 1 depending on whether the distance $r_{ij}$ is greater or less than a cutoff value $r_{\mathrm{cutoff}}$. We have performed spectral decomposition of the distance matrices, $D = \sum_k \lambda_k v_k v_k^T$, in terms of eigenvalues $\lambda_k$ and the corresponding eigenvectors $v_k$, and found that it contains at most five nonzero terms. A dominant eigenvector is proportional to $r^2$, the square distance of points from the center of mass, with the next three being the principal components of the system of points. By predicting $r^2$ from the sequence we can approximate a distance matrix of a protein with an expected RMSD value of about 7.3 Å, and by combining it with the prediction of the first principal component we can improve this approximation to 4.0 Å. We can also explain the role of hydrophobic interactions in protein structure, because $r$ is highly correlated with the hydrophobic profile of the sequence. Moreover, $r$ is highly correlated with several sequence profiles that are useful in protein structure prediction, such as contact number, the residue-wise contact order (RWCO) or mean square fluctuations (i.e. crystallographic temperature factors). We have also shown that the next three components are related to the spatial directionality of the secondary structure elements, and that they may also be predicted from the sequence, improving overall structure prediction.
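The rank statement above can be checked numerically. The following is a minimal sketch, not the authors' code: it assumes NumPy, uses random points as a hypothetical stand-in for residue positions, and picks an arbitrary cutoff of 7.0 for the contact matrix. The function names are illustrative only.

```python
import numpy as np

def squared_distance_matrix(coords):
    """Return D with D[i, j] = |x_i - x_j|^2 for an (N, 3) coordinate array."""
    diff = coords[:, None, :] - coords[None, :, :]
    return np.sum(diff ** 2, axis=-1)

def contact_matrix(D, r_cutoff):
    """Return C with C[i, j] = 1 when r_ij < r_cutoff, else 0 (diagonal set to 0)."""
    C = (D < r_cutoff ** 2).astype(int)
    np.fill_diagonal(C, 0)
    return C

def count_nonzero_eigenvalues(D, rel_tol=1e-9):
    """Count eigenvalues of the symmetric matrix D that are nonzero within a relative tolerance."""
    eigvals = np.linalg.eigvalsh(D)
    return int(np.sum(np.abs(eigvals) > rel_tol * np.abs(eigvals).max()))

# Hypothetical data: 50 random points standing in for residue coordinates.
rng = np.random.default_rng(0)
coords = rng.normal(scale=10.0, size=(50, 3))

D = squared_distance_matrix(coords)
C = contact_matrix(D, r_cutoff=7.0)   # assumed cutoff, chosen only for illustration
n_terms = count_nonzero_eigenvalues(D)
print("nonzero spectral terms:", n_terms)
```

The bound follows from writing $D = r^2 \mathbf{1}^T + \mathbf{1} (r^2)^T - 2XX^T$ for centered coordinates $X$: two rank-one terms plus a term of rank at most three, so the rank of $D$ cannot exceed five for points in three dimensions.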
We have also shown that the large number of available HIV-1 protease structures provides a remarkable sampling of conformations, which can be viewed as direct structural information about the dynamics. After structure matching, we apply principal component analysis (PCA) to obtain the important apparent motions for both bound and unbound structures. There are significant similarities between the first few key motions and the first few low-frequency normal modes calculated from a static representative structure with an elastic network model (ENM) that is based on the contact matrix $C$ (related to $D$). This strongly suggests that the variations among the observed structures and the corresponding conformational changes are facilitated by the low-frequency, global motions intrinsic to the structure. Similarities are also found when the approach is applied to an NMR ensemble, as well as to atomic molecular dynamics (MD) trajectories. Thus, a sufficiently large number of experimental structures can directly provide important information about protein dynamics, but ENM can also provide a similar sampling of conformations. Finally, we use distance constraints from databases of known protein structures for structure refinement. We use the distributions of distances of various types in known protein structures to obtain the most probable ranges or the mean-force potentials for the distances. We then impose these constraints on structures to be refined, or include the mean-force potentials directly in the energy minimization, so that more plausible structural models can be built. We used this approach successfully in 2006 in the CASPR structure refinement (http://predictioncenter.org/caspR).
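The PCA step on a structure ensemble can be illustrated with a small sketch. This is an assumption-laden toy, not the authors' pipeline: it uses NumPy, a synthetic ensemble that is already superimposed, and one planted collective motion (`motion`, hypothetical) so that the recovered first principal component can be compared with a known direction through an overlap (absolute dot product of unit vectors), the same kind of measure commonly used to compare principal components with normal modes.

```python
import numpy as np

def ensemble_pca(ensemble):
    """PCA of an already-superimposed ensemble of shape (M structures, N residues, 3).

    Returns the variances along each principal component and the components
    themselves as rows of a (k, 3N) array, sorted by decreasing variance.
    """
    n_struct = ensemble.shape[0]
    X = ensemble.reshape(n_struct, -1)      # flatten each structure to a 3N-vector
    X = X - X.mean(axis=0)                  # remove the mean structure
    # SVD of the centered data: rows of Vt are the principal components.
    _, s, Vt = np.linalg.svd(X, full_matrices=False)
    variances = s ** 2 / (n_struct - 1)
    return variances, Vt

# Synthetic ensemble (hypothetical data): one dominant collective motion plus small noise.
rng = np.random.default_rng(1)
n_res, n_struct = 30, 200
mean_structure = rng.normal(scale=10.0, size=(n_res, 3))
motion = rng.normal(size=(n_res, 3))
motion /= np.linalg.norm(motion)            # unit-length displacement pattern
amplitudes = rng.normal(scale=3.0, size=n_struct)
noise = rng.normal(scale=0.05, size=(n_struct, n_res, 3))
ensemble = mean_structure + amplitudes[:, None, None] * motion + noise

variances, components = ensemble_pca(ensemble)
overlap = abs(components[0] @ motion.ravel())   # 1.0 means identical directions
fraction = variances[0] / variances.sum()       # share of variance in the first PC
print(f"overlap with planted motion: {overlap:.3f}, variance fraction: {fraction:.3f}")
```

With real data, the ensemble would first need to be superimposed (the "structure matching" step), and the planted motion would be replaced by a normal mode computed from an elastic network model.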

Year:  2009        PMID: 19224393      PMCID: PMC3018873          DOI: 10.1007/s10969-009-9062-2

Source DB:  PubMed          Journal:  J Struct Funct Genomics        ISSN: 1345-711X


