Literature DB >> 25379955

Investigation of the use of spectral clustering for the analysis of molecular data.

Sonny Gan1, David A Cosgrove, Eleanor J Gardiner, Valerie J Gillet.   

Abstract

Spectral clustering involves placing objects into clusters based on the eigenvectors and eigenvalues of an associated matrix. The technique was first applied to molecular data by Brewer [J. Chem. Inf. Model. 2007, 47, 1727-1733] who demonstrated its use on a very small dataset of 125 COX-2 inhibitors. We have determined suitable parameters for spectral clustering using a wide variety of molecular descriptors and several datasets of a few thousand compounds and compared the results of clustering using a nonoverlapping version of Brewer's use of Sarker and Boyer's algorithm with that of Ward's and k-means clustering. We then replaced the exact eigendecomposition method with two different approximate methods and concluded that Singular Value Decomposition is the most appropriate method for clustering larger compound collections of up to 100,000 compounds. We have also used spectral clustering with the Tversky coefficient to generate two sets of clusters linked by a common set of eigenvalues and have used this novel approach to cluster sets of fragments such as those used in fragment-based drug design.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25379955     DOI: 10.1021/ci500480b

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  3 in total

1.  Clustering molecular dynamics trajectories for optimizing docking experiments.

Authors:  Renata De Paris; Christian V Quevedo; Duncan D Ruiz; Osmar Norberto de Souza; Rodrigo C Barros
Journal:  Comput Intell Neurosci       Date:  2015-03-22

2.  MetMaxStruct: A Tversky-Similarity-Based Strategy for Analysing the (Sub)Structural Similarities of Drugs and Endogenous Metabolites.

Authors:  Steve O'Hagan; Douglas B Kell
Journal:  Front Pharmacol       Date:  2016-08-22       Impact factor: 5.810

3.  Analysis of drug-endogenous human metabolite similarities in terms of their maximum common substructures.

Authors:  Steve O'Hagan; Douglas B Kell
Journal:  J Cheminform       Date:  2017-03-09       Impact factor: 5.514

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.