| Literature DB >> 15629056 |
Haiyan Pan1, Jun Zhu, Danfu Han.
Abstract
A hybrid GA (genetic algorithm)-based clustering (HGACLUS) schema, combining merits of the Simulated Annealing, was described for finding an optimal or near-optimal set of medoids. This schema maximized the clustering success by achieving internal cluster cohesion and external cluster isolation. The performance of HGACLUS and other methods was compared by using simulated data and open microarray gene-expression datasets. HGACLUS was generally found to be more accurate and robust than other methods discussed in this paper by the exact validation strategy and the explicit cluster number.Entities:
Mesh:
Year: 2003 PMID: 15629056 PMCID: PMC5172428 DOI: 10.1016/s1672-0229(03)01033-7
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Fig. 1The 2-dimensional graphs of Models 1 and 2 of the first two gene components extracted by PCA.
Fig. 2AThe average VRC and Silhouette Width values of five clustering methods for Models 1 and 2.
Fig. 3AThe VRC and Silhouette Width values of five clustering methods for real gene expression datasets.
Fig. 2BThe VRC and Silhouette Width values of three GA-based clustering methods for Model 2 with 50 runs.
Fig. 3BThe VRC and Silhouette Width values of three GA-based clustering methods for Embryonal CNS (2) and NCI60 with 50 runs.