Literature DB >> 25517067

A geometric clustering algorithm with applications to structural data.

Shutan Xu1, Shuxue Zou, Lincong Wang.   

Abstract

An important feature of structural data, especially those from structural determination and protein-ligand docking programs, is that their distribution could be mostly uniform. Traditional clustering algorithms developed specifically for nonuniformly distributed data may not be adequate for their classification. Here we present a geometric partitional algorithm that could be applied to both uniformly and nonuniformly distributed data. The algorithm is a top-down approach that recursively selects the outliers as the seeds to form new clusters until all the structures within a cluster satisfy a classification criterion. The algorithm has been evaluated on a diverse set of real structural data and six sets of test data. The results show that it is superior to the previous algorithms for the clustering of structural data and is similar to or better than them for the classification of the test data. The algorithm should be especially useful for the identification of the best but minor clusters and for speeding up an iterative process widely used in NMR structure determination.

Keywords:  algorithms; distance geometry; drug design; protein structure

Mesh:

Substances:

Year:  2014        PMID: 25517067      PMCID: PMC4425229          DOI: 10.1089/cmb.2014.0162

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  14 in total

1.  Electrostatics of nanosystems: application to microtubules and the ribosome.

Authors:  N A Baker; D Sept; S Joseph; M J Holst; J A McCammon
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-21       Impact factor: 11.205

2.  Automated clustering of ensembles of alternative models in protein structure databases.

Authors:  Francisco S Domingues; Jörg Rahnenführer; Thomas Lengauer
Journal:  Protein Eng Des Sel       Date:  2004-08-19       Impact factor: 1.650

3.  Clustering Molecular Dynamics Trajectories: 1. Characterizing the Performance of Different Clustering Algorithms.

Authors:  Jianyin Shao; Stephen W Tanner; Nephi Thompson; Thomas E Cheatham
Journal:  J Chem Theory Comput       Date:  2007-11       Impact factor: 6.006

4.  A critical assessment of docking programs and scoring functions.

Authors:  Gregory L Warren; C Webster Andrews; Anna-Maria Capelli; Brian Clarke; Judith LaLonde; Millard H Lambert; Mika Lindvall; Neysa Nevins; Simon F Semus; Stefan Senger; Giovanna Tedesco; Ian D Wall; James M Woolven; Catherine E Peishoff; Martha S Head
Journal:  J Med Chem       Date:  2006-10-05       Impact factor: 7.446

5.  Comparing geometric and kinetic cluster algorithms for molecular simulation data.

Authors:  Bettina Keller; Xavier Daura; Wilfred F van Gunsteren
Journal:  J Chem Phys       Date:  2010-02-21       Impact factor: 3.488

6.  CATH--a hierarchic classification of protein domain structures.

Authors:  C A Orengo; A D Michie; S Jones; D T Jones; M B Swindells; J M Thornton
Journal:  Structure       Date:  1997-08-15       Impact factor: 5.006

7.  SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Authors:  A G Murzin; S E Brenner; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1995-04-07       Impact factor: 5.469

8.  Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation.

Authors:  G Jones; P Willett; R C Glen
Journal:  J Mol Biol       Date:  1995-01-06       Impact factor: 5.469

9.  Representing an ensemble of NMR-derived protein structures by a single structure.

Authors:  M J Sutcliffe
Journal:  Protein Sci       Date:  1993-06       Impact factor: 6.725

10.  Structure-function relationships of cellular retinoic acid-binding proteins. Quantitative analysis of the ligand binding properties of the wild-type proteins and site-directed mutants.

Authors:  L Wang; Y Li; H Yan
Journal:  J Biol Chem       Date:  1997-01-17       Impact factor: 5.157

View more
  2 in total

1.  Efficiency of Stratification for Ensemble Docking Using Reduced Ensembles.

Authors:  Bing Xie; John D Clark; David D L Minh
Journal:  J Chem Inf Model       Date:  2018-08-29       Impact factor: 4.956

2.  A New Secondary Structure Assignment Algorithm Using Cα Backbone Fragments.

Authors:  Chen Cao; Guishen Wang; An Liu; Shutan Xu; Lincong Wang; Shuxue Zou
Journal:  Int J Mol Sci       Date:  2016-03-11       Impact factor: 5.923

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.