Clustering files of chemical structures using the Székely-Rizzo generalization of Ward's method.

Thibault Varin, Ronan Bureau, Christoph Mueller, Peter Willett.

Abstract

Ward's method is extensively used for clustering chemical structures represented by 2D fingerprints. This paper compares Ward clusterings of 14 datasets (containing between 278 and 4332 molecules) with those obtained using the Székely-Rizzo clustering method, a generalization of Ward's method. The clusters resulting from these two methods were evaluated by the extent to which the various classifications were able to group active molecules together, using a novel criterion of clustering effectiveness. Analysis of a total of 1400 classifications (Ward and Székely-Rizzo clustering methods, 14 different datasets, 5 different fingerprints and 10 different distance coefficients) demonstrated the general superiority of the Székely-Rizzo method. The distance coefficient first described by Soergel performed extremely well in these experiments, and this was also the case when it was used in simulated virtual screening experiments.
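As a rough illustration of the quantities named in the abstract, the sketch below computes the Soergel distance between two fingerprints and a Székely-Rizzo style between-within cluster distance with an exponent `alpha`. The function names, the toy fingerprints, and the choice to plug the Soergel coefficient into the cluster distance are assumptions made for illustration; the abstract does not describe the authors' implementation. With Euclidean distances and `alpha = 2`, this cluster distance corresponds to Ward's criterion up to a constant factor.

```python
from itertools import product


def soergel(x, y):
    """Soergel distance: sum(|x_i - y_i|) / sum(max(x_i, y_i)).

    For binary fingerprints this equals 1 - Tanimoto similarity.
    Two all-zero vectors are treated as identical (distance 0.0).
    """
    num = sum(abs(a - b) for a, b in zip(x, y))
    den = sum(max(a, b) for a, b in zip(x, y))
    return num / den if den else 0.0


def mean_power_dist(A, B, dist, alpha):
    """Mean of dist(a, b) ** alpha over all pairs (a in A, b in B)."""
    total = sum(dist(a, b) ** alpha for a, b in product(A, B))
    return total / (len(A) * len(B))


def e_distance(A, B, dist=soergel, alpha=1.0):
    """Between-within distance between clusters A and B.

    e(A, B) = n1*n2/(n1+n2) * (2*mean_between - mean_within_A - mean_within_B)
    Using `soergel` as the default `dist` is an illustrative assumption.
    """
    n1, n2 = len(A), len(B)
    between = mean_power_dist(A, B, dist, alpha)
    within_a = mean_power_dist(A, A, dist, alpha)
    within_b = mean_power_dist(B, B, dist, alpha)
    return (n1 * n2 / (n1 + n2)) * (2 * between - within_a - within_b)


if __name__ == "__main__":
    # Hypothetical 4-bit fingerprints, not data from the paper.
    fp1 = [1, 1, 0, 0]
    fp2 = [1, 0, 1, 0]
    print(soergel(fp1, fp2))
    print(e_distance([fp1], [fp2]))
```

In an agglomerative procedure, the pair of clusters with the smallest such distance would be merged at each step. A practical implementation would update the distances incrementally rather than recomputing every pairwise sum as this sketch does.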

Year:  2009        PMID: 19640752     DOI: 10.1016/j.jmgm.2009.06.006

Source DB:  PubMed          Journal:  J Mol Graph Model        ISSN: 1093-3263            Impact factor:   2.518


  8 in total

1.  Weighted voting-based consensus clustering for chemical structure databases.

Authors:  Faisal Saeed; Ali Ahmed; Mohd Shahir Shamsir; Naomie Salim
Journal:  J Comput Aided Mol Des       Date:  2014-05-15       Impact factor: 3.686

2.  Genomic cfDNA Analysis of Aqueous Humor in Retinoblastoma Predicts Eye Salvage: The Surrogate Tumor Biopsy for Retinoblastoma.

Authors:  Jesse L Berry; Liya Xu; Irsan Kooi; A Linn Murphree; Rishvanth K Prabakar; Mark Reid; Kevin Stachelek; Bao Han A Le; Lisa Welter; Bibiana J Reiser; Patricia Chévez-Barrios; Rima Jubran; Thomas C Lee; Jonathan W Kim; Peter Kuhn; David Cobrinik; James Hicks
Journal:  Mol Cancer Res       Date:  2018-07-30       Impact factor: 5.852

3.  Estimating Linear and Nonlinear Gene Coexpression Networks by Semiparametric Neighborhood Selection.

Authors:  Juho A J Kontio; Marko J Rinta-Aho; Mikko J Sillanpää
Journal:  Genetics       Date:  2020-05-15       Impact factor: 4.562

4.  Pharmacological affinity fingerprints derived from bioactivity data for the identification of designer drugs.

Authors:  Kedan He
Journal:  J Cheminform       Date:  2022-06-07       Impact factor: 8.489

5.  Voting-based consensus clustering for combining multiple clusterings of chemical structures.

Authors:  Faisal Saeed; Naomie Salim; Ammar Abdo
Journal:  J Cheminform       Date:  2012-12-17       Impact factor: 5.514

6.  The comparison of automated clustering algorithms for resampling representative conformer ensembles with RMSD matrix.

Authors:  Hyoungrae Kim; Cheongyun Jang; Dharmendra K Yadav; Mi-Hyun Kim
Journal:  J Cheminform       Date:  2017-03-23       Impact factor: 5.514

7.  An integrated method for optimized identification of effective natural inhibitors against SARS-CoV-2 3CLpro.

Authors:  Qi Liao; Ziyu Chen; Yanlin Tao; Beibei Zhang; Xiaojun Wu; Li Yang; Qingzhong Wang; Zhengtao Wang
Journal:  Sci Rep       Date:  2021-11-23       Impact factor: 4.379

8.  Open-source platform to benchmark fingerprints for ligand-based virtual screening.

Authors:  Sereina Riniker; Gregory A Landrum
Journal:  J Cheminform       Date:  2013-05-30       Impact factor: 5.514
