Literature DB >> 16896541

V-cluster algorithm: a new algorithm for clustering molecules based upon numeric data.

Jun Xu1, Qiang Zhang, Chen-Kon Shih.   

Abstract

Clustering molecules based on numeric data such as, gene-expression data, physiochemical properties, or theoretical data is very important in drug discovery and other life sciences. Most approaches use hierarchical clustering algorithms, non-hierarchical algorithms (for examples, K-mean and K-nearest neighbor), and other similar methods (for examples, the Self-Organization Mapping (SOM) and the Support Vector Machine (SVM)). These approaches are non-robust (results are not consistent) and, computationally expensive. This paper will report a new, non-hierarchical algorithm called the V-Cluster (V stands for vector) Algorithm. This algorithm produces rational, robust results while reducing computing complexity. Similarity measurement and data normalization rules are also discussed along with case studies. When molecules are represented in a set of numeric vectors, the V-Cluster Algorithm clusters the molecules in three steps: (1) ranking the vectors based upon their overall intensity levels, (2) computing cluster centers based upon neighboring density, and (3) assigning molecules to their nearest cluster center. The program is written in C/C++ language, and runs on Window95/NT and UNIX platforms. With the V-Cluster program, the user can quickly complete the clustering process and, easily examine the results by use of thumbnail graphs, superimposed intensity curves of vectors, and spreadsheets. Multi-functional query tools have also been implemented.

Mesh:

Year:  2006        PMID: 16896541     DOI: 10.1007/s11030-006-9023-7

Source DB:  PubMed          Journal:  Mol Divers        ISSN: 1381-1991            Impact factor:   2.943


  10 in total

1.  Superparamagnetic clustering of data.

Authors: 
Journal:  Phys Rev Lett       Date:  1996-04-29       Impact factor: 9.161

2.  Statistical mechanics and phase transitions in clustering.

Authors: 
Journal:  Phys Rev Lett       Date:  1990-08-20       Impact factor: 9.161

3.  PATTERN CLUSTERING BY MULTIVARIATE MIXTURE ANALYSIS.

Authors:  J H Wolfe
Journal:  Multivariate Behav Res       Date:  1970-04-01       Impact factor: 5.923

4.  A Review Of Monte Carlo Tests Of Cluster Analysis.

Authors:  G W Milligan
Journal:  Multivariate Behav Res       Date:  1981-07-01       Impact factor: 5.923

5.  Comparative Cluster Analysis Of Patterns Of Vocational Interest.

Authors:  J H Wolfe
Journal:  Multivariate Behav Res       Date:  1978-01-01       Impact factor: 5.923

6.  The Literature On Cluster Analysis.

Authors:  R K Blashfield; M S Aldenderfer
Journal:  Multivariate Behav Res       Date:  1978-07-01       Impact factor: 5.923

7.  A Computer Program for Classifying Plants.

Authors:  D J Rogers; T T Tanimoto
Journal:  Science       Date:  1960-10-21       Impact factor: 47.728

8.  A nonparametric algorithm for detecting clusters using hierarchical structure.

Authors:  R Mizoguchi; M Shimura
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  1980-04       Impact factor: 6.226

9.  Practical problems in a method of cluster analysis.

Authors:  F H Marriott
Journal:  Biometrics       Date:  1971-09       Impact factor: 2.571

10.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

  10 in total
  1 in total

1.  Predicting dual-targeting anti-influenza agents using multi-models.

Authors:  Yu Wang; Hu Ge; Yali Li; Yufang Xie; Yingyan He; Mengyan Xu; Qiong Gu; Jun Xu
Journal:  Mol Divers       Date:  2014-10-02       Impact factor: 2.943

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.