| Literature DB >> 20975898 |
Abstract
Several non-redundant ensembles of protein three-dimensional structures were analyzed in order to estimate their natural clustering tendency by means of the Cox-Lewis coefficient. It was observed that, despite proteins tend to aggregate into different and well separated groups, some overlap between different clusters occurs. This suggests that classifications bases only on structural data cannot allow a systematic classification of proteins. Additional information are in particular needed in order to monitor completely the complex evolutionary relationships between proteins.Entities:
Keywords: Cluster analysis; clustering tendency; protein fold; protein structural domains; protein structure classification
Year: 2010 PMID: 20975898 PMCID: PMC2951670 DOI: 10.6026/97320630004347
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Example of Cox-Lewis statistics. Filled circles represent subjects in a bi-dimensional space. Open circles represent geometrical points. u is the minimal distance between a geometrical point (a) and a subject (b). w is the minimal distance between the same subject (b) and another subject (c). The geometrical point (a) is randomly selected. The geometrical point (a) is randomly selected. The Cox-Lewis coefficient can be computed with equation (2) on the basis of a set of randomly selected points (a).
Figure 2Scheme of the computations performed to obtain the Cox-Lewis coefficient of clustering tendency
Figure 3Examples of various levels of clustering tendency in a bi-dimensional space. Four types of objects are plotted in each graph (white diamonds, black diamonds, plus symbols, and black triangles). The Cox-Lewis coefficient decreases from 6.927 to 1.037 if the separation between the four clusters decreases. Both axes are in arbitrary units (a.u.).