Literature DB >> 21310747

Entropy-accelerated exact clustering of protein decoys.

Francois Berenger1, Yong Zhou, Rojan Shrestha, Kam Y J Zhang.   

Abstract

MOTIVATION: Clustering is commonly used to identify the best decoy among many generated in protein structure prediction when using energy alone is insufficient. Calculation of the pairwise distance matrix for a large decoy set is computationally expensive. Typically, only a reduced set of decoys using energy filtering is subjected to clustering analysis. A fast clustering method for a large decoy set would be beneficial to protein structure prediction and this still poses a challenge.
RESULTS: We propose a method using propagation of geometric constraints to accelerate exact clustering, without compromising the distance measure. Our method can be used with any metric distance. Metrics that are expensive to compute and have known cheap lower and upper bounds will benefit most from the method. We compared our method's accuracy against published results from the SPICKER clustering software on 40 large decoy sets from the I-TASSER protein folding engine. We also performed some additional speed comparisons on six targets from the 'semfold' decoy set. In our tests, our method chose a better decoy than the energy criterion in 25 out of 40 cases versus 20 for SPICKER. Our method also was shown to be consistently faster than another fast software performing exact clustering named Calibur. In some cases, our approach can even outperform the speed of an approximate method. AVAILABILITY: Our C++ software is released under the GNU General Public License. It can be downloaded from http://www.riken.jp/zhangiru/software/durandal_released.tgz.

Mesh:

Substances:

Year:  2011        PMID: 21310747     DOI: 10.1093/bioinformatics/btr072

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  9 in total

1.  Improving accuracy of protein contact prediction using balanced network deconvolution.

Authors:  Hai-Ping Sun; Yan Huang; Xiao-Fan Wang; Yang Zhang; Hong-Bin Shen
Journal:  Proteins       Date:  2015-01-24

2.  Fast algorithm for population-based protein structural model analysis.

Authors:  Jingfen Zhang; Dong Xu
Journal:  Proteomics       Date:  2013-01-03       Impact factor: 3.984

3.  A probabilistic fragment-based protein structure prediction algorithm.

Authors:  David Simoncini; Francois Berenger; Rojan Shrestha; Kam Y J Zhang
Journal:  PLoS One       Date:  2012-07-19       Impact factor: 3.240

4.  Efficient sampling in fragment-based protein structure prediction using an estimation of distribution algorithm.

Authors:  David Simoncini; Kam Y J Zhang
Journal:  PLoS One       Date:  2013-07-25       Impact factor: 3.240

5.  From Extraction of Local Structures of Protein Energy Landscapes to Improved Decoy Selection in Template-Free Protein Structure Prediction.

Authors:  Nasrin Akhter; Amarda Shehu
Journal:  Molecules       Date:  2018-01-19       Impact factor: 4.411

6.  Unsupervised and Supervised Learning over theEnergy Landscape for Protein Decoy Selection.

Authors:  Nasrin Akhter; Gopinath Chennupati; Kazi Lutful Kabir; Hristo Djidjev; Amarda Shehu
Journal:  Biomolecules       Date:  2019-10-14

7.  Ranking near-native candidate protein structures via random forest classification.

Authors:  Hongjie Wu; Hongmei Huang; Weizhong Lu; Qiming Fu; Yijie Ding; Jing Qiu; Haiou Li
Journal:  BMC Bioinformatics       Date:  2019-12-24       Impact factor: 3.169

8.  An improved method to detect correct protein folds using partial clustering.

Authors:  Jianjun Zhou; David S Wishart
Journal:  BMC Bioinformatics       Date:  2013-01-16       Impact factor: 3.169

9.  Cryo-EM structure of the human ELMO1-DOCK5-Rac1 complex.

Authors:  Mutsuko Kukimoto-Niino; Kazushige Katsura; Rahul Kaushik; Haruhiko Ehara; Takeshi Yokoyama; Tomomi Uchikubo-Kamo; Reiko Nakagawa; Chiemi Mishima-Tsumagari; Mayumi Yonemochi; Mariko Ikeda; Kazuharu Hanada; Kam Y J Zhang; Mikako Shirouzu
Journal:  Sci Adv       Date:  2021-07-21       Impact factor: 14.136

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.