Literature DB >> 15319469

Automated clustering of ensembles of alternative models in protein structure databases.

Francisco S Domingues1, Jörg Rahnenführer, Thomas Lengauer.   

Abstract

Experimentally determined protein structures have been classified in different public databases according to their structural and evolutionary relationships. Frequently, alternative structural models, determined using X-ray crystallography or NMR spectroscopy, are available for a protein. These models can present significant structural dissimilarity. Currently there is no classification available for these alternative structures. In order to classify them, we developed STRuster, an automated method for clustering ensembles of structural models according to their backbone structure. The method is based on the calculation of carbon alpha (Calpha) distance matrices. Two filters are applied in the calculation of the dissimilarity measure in order to identify both large and small (but significant) backbone conformational changes. The resulting dissimilarity value is used for hierarchical clustering and partitioning around medoids (PAM). Hierarchical clustering reflects the hierarchy of similarities between all pairs of models, while PAM groups the models into the 'optimal' number of clusters. The method has been applied to cluster the structures in each SCOP species level and can be easily applied to any other sets of conformers. The results are available at: http://bioinf.mpi-sb.mpg.de/projects/struster/.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15319469     DOI: 10.1093/protein/gzh063

Source DB:  PubMed          Journal:  Protein Eng Des Sel        ISSN: 1741-0126            Impact factor:   1.650


  6 in total

1.  Counting clusters using R-NN curves.

Authors:  Rajarshi Guha; Debojyoti Dutta; David J Wild; Ting Chen
Journal:  J Chem Inf Model       Date:  2007-06-30       Impact factor: 4.956

2.  A geometric clustering algorithm with applications to structural data.

Authors:  Shutan Xu; Shuxue Zou; Lincong Wang
Journal:  J Comput Biol       Date:  2014-12-17       Impact factor: 1.479

3.  StralSV: assessment of sequence variability within similar 3D structures and application to polio RNA-dependent RNA polymerase.

Authors:  Adam T Zemla; Dorothy M Lang; Tanya Kostova; Raul Andino; Carol L Ecale Zhou
Journal:  BMC Bioinformatics       Date:  2011-06-02       Impact factor: 3.169

4.  Comparative analysis of protein structure alignments.

Authors:  Gabriele Mayr; Francisco S Domingues; Peter Lackner
Journal:  BMC Struct Biol       Date:  2007-07-26

5.  STRALCP--structure alignment-based clustering of proteins.

Authors:  Adam Zemla; Brian Geisbrecht; Jason Smith; Marisa Lam; Bonnie Kirkpatrick; Mark Wagner; Tom Slezak; Carol Ecale Zhou
Journal:  Nucleic Acids Res       Date:  2007-11-26       Impact factor: 16.971

Review 6.  Computational analysis of protein interaction networks for infectious diseases.

Authors:  Archana Pan; Chandrajit Lahiri; Anjana Rajendiran; Buvaneswari Shanmugham
Journal:  Brief Bioinform       Date:  2015-08-10       Impact factor: 11.622

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.