| Literature DB >> 9640559 |
S Basu1, A Pan, C Dutta, J Das.
Abstract
The present report proposes a new method for the chaos game representation (CGR) of different families of proteins. Using concatenated amino acid sequences of proteins belonging to a particular family and a 12-sided regular polygon, each vertex of which represents a group of amino acid residues leading to conservative substitutions, the method can generate the CGR of the family and allows pictorial representation of the pattern characterizing the family. An estimation of the percentages of points plotted in different segments of the CGR (grid points) allows quantification of the nonrandomness of the CGR patterns generated. The CGRs of different protein families exhibited distinct visually identifiable patterns. This implies that different functional classes of proteins follow specific statistical biases in the distribution of different mono-, di-, tri-, or higher order peptides along their primary sequences. The potential of grid counts as the discriminative and diagnostic signature of a family of proteins is discussed.Mesh:
Substances:
Year: 1997 PMID: 9640559 DOI: 10.1016/s1093-3263(97)00106-x
Source DB: PubMed Journal: J Mol Graph Model ISSN: 1093-3263 Impact factor: 2.518