Literature DB >> 16733478

A tabular approach to the sequence-to-structure relation in proteins (tetrapeptide representation) for de novo protein design.

Jan Meus1, Michał Brylinski, Monika Piwowar, Piotr Piwowar, Zdzisław Wiśniowski, Justyna Stefaniak, Leszek Konieczny, Grzegorz Surówka, Irena Roterman.   

Abstract

BACKGROUND: Experimental observations classify the protein-folding process as a multi-step event. The backbone conformation has been experimentally recognized as responsible for the early-stage structural forms of a polypeptide. The sequence-to-structure and structure-to-sequence relation is critical for predicting protein structure. A contingency table representing this relation for tetrapeptides in their early-stage is presented. Their correlation seems to be essential in protein-folding simulation. MATERIAL/
METHODS: The polypeptide chains of all the proteins in the Protein Data Bank were transformed into their early-stage structural forms. The tetrapeptide was selected as the structural unit. Tetrapetide sequences and structures were expressed by letter codes. The transformation of a contingency table of any size (here: 160,000x2401) to a 2x2 table performed for each non-zero cell of the original table allowed calculation of the rho-coefficient measuring the strength of the relation.
RESULTS: High values of the rho-coefficient extracted sequences of strong structural determinability and structures of high sequence selectivity. The web-site program to calculate the rho-coefficient ranking list was constructed to enable applying this method to any problem of contingency table analysis.
CONCLUSIONS: The results revealed sequence-to-structure (and vice versa) correlation in early-stage folding. Surprisingly, the irregular structural forms of loops and bends appeared to be highly determined. Comparison of these results with another method based on information entropy revealed high accordance. The method oriented on interpretation of a large contingency table seems very useful especially for large-scale microarray analysis, a very popular technique in the post-genomic era.

Mesh:

Substances:

Year:  2006        PMID: 16733478

Source DB:  PubMed          Journal:  Med Sci Monit        ISSN: 1234-1010


  2 in total

1.  Localization of ligand binding site in proteins identified in silico.

Authors:  Michal Brylinski; Marek Kochanczyk; Elzbieta Broniatowska; Irena Roterman
Journal:  J Mol Model       Date:  2007-03-30       Impact factor: 1.810

2.  Directional Association Measurement in Contingency Tables: Genomic Case.

Authors:  Monika Piwowar; Tomasz KuŁaga
Journal:  J Comput Biol       Date:  2018-12-18       Impact factor: 1.479

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.