Literature DB >> 17387739

Property-based sequence representations do not adequately encode local protein folding information.

A D Solis1, S Rackovsky.   

Abstract

We examine the informatic characteristics of amino acid representations based on physical properties. We demonstrate that sequences rewritten using contracted alphabets based on physical properties do not encode local folding information well. The best four-character alphabet can only encode approximately 57% of the maximum possible amount of structural information. This result suggests that property-based representations that operate on a local length scale are not likely to be useful in homology searches and fold-recognition exercises. 2007 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2007        PMID: 17387739     DOI: 10.1002/prot.21434

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  4 in total

1.  Global characteristics of protein sequences and their implications.

Authors:  S Rackovsky
Journal:  Proc Natl Acad Sci U S A       Date:  2010-04-26       Impact factor: 11.205

2.  Nonlinearities in protein space limit the utility of informatics in protein biophysics.

Authors:  S Rackovsky
Journal:  Proteins       Date:  2015-09-10

3.  Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S--a novel motif scan algorithm with optional secondary structure constraints.

Authors:  Masha Y Niv; Lucy Skrabanek; Richard J Roberts; Harold A Scheraga; Harel Weinstein
Journal:  Proteins       Date:  2008-05-01

4.  Fold-specific sequence scoring improves protein sequence matching.

Authors:  Sumudu P Leelananda; Andrzej Kloczkowski; Robert L Jernigan
Journal:  BMC Bioinformatics       Date:  2016-08-30       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.