
Univariate Screening Measures for Cluster Analysis.

J R Donoghue.   

Abstract

Inclusion of irrelevant variables in a cluster analysis adversely affects subgroup recovery. This article examines using moment-based statistics to screen variables; only variables which pass the screening are then used in clustering. Normal mixtures are analytically shown often to possess negative kurtosis. Two related measures, m and coefficient of bimodality b, are also examined. A Monte Carlo study compared the screening measures to no selection, De Soete's (1988) ultrametric weights, and Fowlkes, Gnanadesikan, and Kettenring's (1988) forward selection procedure. Screening based on kurtosis degraded recovery and is not recommended. In contrast, screening on m or on b improved recovery over both no selection and forward selection, and screening performed as well as ultrametric weights. Combining screening with ultrametric weights performed extremely well. All methods were found to be somewhat sensitive to other types of error. Screening variables appears a viable alternative to both ultrametric weights and forward selection. The potential advantages and disadvantages of screening are considered.
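
The moment-based statistics mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas, so everything here is an assumption: excess kurtosis is taken as m4 / m2^2 - 3, and the bimodality coefficient as (skewness^2 + 1) / (excess kurtosis + 3), a commonly used large-sample form that may differ from the article's definition of b. The measure m is not defined in the abstract and is left out. The function names and the simulated data are hypothetical.

```python
import numpy as np


def skewness(x):
    """Moment-based skewness, m3 / m2**1.5 (assumed definition)."""
    d = np.asarray(x, dtype=float)
    d = d - d.mean()
    return np.mean(d**3) / np.mean(d**2) ** 1.5


def excess_kurtosis(x):
    """Moment-based excess kurtosis, m4 / m2**2 - 3; zero for a normal."""
    d = np.asarray(x, dtype=float)
    d = d - d.mean()
    return np.mean(d**4) / np.mean(d**2) ** 2 - 3.0


def bimodality_coefficient(x):
    """(skewness**2 + 1) / (excess kurtosis + 3); an assumed common form."""
    return (skewness(x) ** 2 + 1.0) / (excess_kurtosis(x) + 3.0)


# Hypothetical data: one variable carrying two-cluster structure
# (equal mixture of N(-3, 1) and N(3, 1)) and one irrelevant normal variable.
rng = np.random.default_rng(0)
n = 2000
labels = rng.integers(0, 2, size=n)
clustered = rng.normal(loc=np.where(labels == 0, -3.0, 3.0), scale=1.0)
noise = rng.normal(size=n)

for name, col in [("clustered", clustered), ("noise", noise)]:
    print(name, excess_kurtosis(col), bimodality_coefficient(col))
```

For an equal mixture of N(-d, 1) and N(d, 1), the population excess kurtosis under this definition is -2d^4 / (1 + d^2)^2, which is negative for any d other than 0 (about -1.62 at d = 3). That is consistent with the abstract's remark that normal mixtures often have negative kurtosis. The irrelevant normal variable should give an excess kurtosis near 0 and a coefficient near 1/3.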

Year: 1995   PMID: 26789941   DOI: 10.1207/s15327906mbr3003_5

Source DB: PubMed   Journal: Multivariate Behav Res   ISSN: 0027-3171   Impact factor: 5.923


  1 in total

1.  HDSI: High dimensional selection with interactions algorithm on feature selection and testing.

Authors:  Rahi Jain; Wei Xu
Journal:  PLoS One       Date:  2021-02-16       Impact factor: 3.240

