Literature DB >> 15781410

Protein linear indices of the 'macromolecular pseudograph alpha-carbon atom adjacency matrix' in bioinformatics. Part 1: prediction of protein stability effects of a complete set of alanine substitutions in Arc repressor.

Yovani Marrero-Ponce1, Ricardo Medina-Marrero, Juan A Castillo-Garit, Vicente Romero-Zaldivar, Francisco Torrens, Eduardo A Castro.   

Abstract

A novel approach to bio-macromolecular design from a linear algebra point of view is introduced. A protein's total (whole protein) and local (one or more amino acid) linear indices are a new set of bio-macromolecular descriptors of relevance to protein QSAR/QSPR studies. These amino-acid level biochemical descriptors are based on the calculation of linear maps on Rn[f k(xmi):Rn-->Rn] in canonical basis. These bio-macromolecular indices are calculated from the kth power of the macromolecular pseudograph alpha-carbon atom adjacency matrix. Total linear indices are linear functional on Rn. That is, the kth total linear indices are linear maps from Rn to the scalar R[f k(xm):Rn-->R]. Thus, the kth total linear indices are calculated by summing the amino-acid linear indices of all amino acids in the protein molecule. A study of the protein stability effects for a complete set of alanine substitutions in the Arc repressor illustrates this approach. A quantitative model that discriminates near wild-type stability alanine mutants from the reduced-stability ones in a training series was obtained. This model permitted the correct classification of 97.56% (40/41) and 91.67% (11/12) of proteins in the training and test set, respectively. It shows a high Matthews correlation coefficient (MCC=0.952) for the training set and an MCC=0.837 for the external prediction set. Additionally, canonical regression analysis corroborated the statistical quality of the classification model (Rcanc=0.824). This analysis was also used to compute biological stability canonical scores for each Arc alanine mutant. On the other hand, the linear piecewise regression model compared favorably with respect to the linear regression one on predicting the melting temperature (tm) of the Arc alanine mutants. The linear model explains almost 81% of the variance of the experimental tm (R=0.90 and s=4.29) and the LOO press statistics evidenced its predictive ability (q2=0.72 and scv=4.79). Moreover, the TOMOCOMD-CAMPS method produced a linear piecewise regression (R=0.97) between protein backbone descriptors and tm values for alanine mutants of the Arc repressor. A break-point value of 51.87 degrees C characterized two mutant clusters and coincided perfectly with the experimental scale. For this reason, we can use the linear discriminant analysis and piecewise models in combination to classify and predict the stability of the mutant Arc homodimers. These models also permitted the interpretation of the driving forces of such folding process, indicating that topologic/topographic protein backbone interactions control the stability profile of wild-type Arc and its alanine mutants.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15781410     DOI: 10.1016/j.bmc.2005.01.062

Source DB:  PubMed          Journal:  Bioorg Med Chem        ISSN: 0968-0896            Impact factor:   3.641


  9 in total

1.  3D-chiral atom, atom-type, and total non-stochastic and stochastic molecular linear indices and their applications to central chirality codification.

Authors:  Yovani Marrero-Ponce; Juan A Castillo-Garit
Journal:  J Comput Aided Mol Des       Date:  2005-06       Impact factor: 3.686

2.  Multi-output model with Box-Jenkins operators of linear indices to predict multi-target inhibitors of ubiquitin-proteasome pathway.

Authors:  Gerardo M Casañola-Martin; Huong Le-Thi-Thu; Facundo Pérez-Giménez; Yovani Marrero-Ponce; Matilde Merino-Sanjuán; Concepción Abad; Humberto González-Díaz
Journal:  Mol Divers       Date:  2015-03-10       Impact factor: 2.943

3.  Quantitative relationship between mutated amino-acid sequence of human copper-transporting ATPases and their related diseases.

Authors:  Shaomin Yan; Guang Wu
Journal:  Mol Divers       Date:  2008-08-08       Impact factor: 2.943

4.  Bond-based linear indices in QSAR: computational discovery of novel anti-trichomonal compounds.

Authors:  Yovani Marrero-Ponce; Alfredo Meneses-Marcel; Oscar M Rivera-Borroto; Ramón García-Domenech; Jesus Vicente De Julián-Ortiz; Alina Montero; José Antonio Escario; Alicia Gómez Barrio; David Montero Pereira; Juan José Nogal; Ricardo Grau; Francisco Torrens; Christian Vogel; Vicente J Arán
Journal:  J Comput Aided Mol Des       Date:  2008-05-16       Impact factor: 3.686

5.  Bond-based linear indices of the non-stochastic and stochastic edge-adjacency matrix. 1. Theory and modeling of ChemPhys properties of organic molecules.

Authors:  Yovani Marrero-Ponce; Eugenio R Martínez-Albelo; Gerardo M Casañola-Martín; Juan A Castillo-Garit; Yunaimy Echevería-Díaz; Vicente Romero Zaldivar; Jan Tygat; José E Rodriguez Borges; Ramón García-Domenech; Francisco Torrens; Facundo Pérez-Giménez
Journal:  Mol Divers       Date:  2010-01-10       Impact factor: 2.943

6.  Graph Theory-Based Sequence Descriptors as Remote Homology Predictors.

Authors:  Guillermin Agüero-Chapin; Deborah Galpert; Reinaldo Molina-Ruiz; Evys Ancede-Gallardo; Gisselle Pérez-Machado; Gustavo A de la Riva; Agostinho Antunes
Journal:  Biomolecules       Date:  2019-12-23

7.  Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods.

Authors:  Julio E Terán; Yovani Marrero-Ponce; Ernesto Contreras-Torres; César R García-Jacas; Ricardo Vivas-Reyes; Enrique Terán; F Javier Torres
Journal:  Sci Rep       Date:  2019-08-06       Impact factor: 4.379

8.  Alignment-free prediction of mycobacterial DNA promoters based on pseudo-folding lattice network or star-graph topological indices.

Authors:  Alcides Perez-Bello; Cristian Robert Munteanu; Florencio M Ubeira; Alexandre Lopes De Magalhães; Eugenio Uriarte; Humberto González-Díaz
Journal:  J Theor Biol       Date:  2008-10-17       Impact factor: 2.691

9.  QSAR for RNases and theoretic-experimental study of molecular diversity on peptide mass fingerprints of a new Leishmania infantum protein.

Authors:  Humberto González-Díaz; María A Dea-Ayuela; Lázaro G Pérez-Montoto; Francisco J Prado-Prado; Guillermín Agüero-Chapín; Francisco Bolas-Fernández; Roberto I Vazquez-Padrón; Florencio M Ubeira
Journal:  Mol Divers       Date:  2009-07-04       Impact factor: 2.943

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.