Literature DB >> 17369873

Linear Transformations and the k-Means Clustering Algorithm: Applications to Clustering Curves.

Thaddeus Tarpey1.   

Abstract

Functional data can be clustered by plugging estimated regression coefficients from individual curves into the k-means algorithm. Clustering results can differ depending on how the curves are fit to the data. Estimating curves using different sets of basis functions corresponds to different linear transformations of the data. k-means clustering is not invariant to linear transformations of the data. The optimal linear transformation for clustering will stretch the distribution so that the primary direction of variability aligns with actual differences in the clusters. It is shown that clustering the raw data will often give results similar to clustering regression coefficients obtained using an orthogonal design matrix. Clustering functional data using an L(2) metric on function space can be achieved by clustering a suitable linear transformation of the regression coefficients. An example where depressed individuals are treated with an antidepressant is used for illustration.

Entities:  

Year:  2007        PMID: 17369873      PMCID: PMC1828125          DOI: 10.1198/000313007X171016

Source DB:  PubMed          Journal:  Am Stat        ISSN: 0003-1305            Impact factor:   8.710


  3 in total

1.  Finite mixture modeling with mixture outcomes using the EM algorithm.

Authors:  B Muthén; K Shedden
Journal:  Biometrics       Date:  1999-06       Impact factor: 2.571

2.  Allometric extension.

Authors:  S Bartoletti; B D Flury; D G Nel
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

3.  Predictors of relapse during fluoxetine continuation or maintenance treatment of major depression.

Authors:  P J McGrath; J W Stewart; E Petkova; F M Quitkin; J D Amsterdam; J Fawcett; F W Reimherr; J F Rosenbaum; C M Beasley
Journal:  J Clin Psychiatry       Date:  2000-07       Impact factor: 4.384

  3 in total
  7 in total

1.  Partitioning of Functional Data for Understanding Heterogeneity in Psychiatric Conditions.

Authors:  Eva Petkova; Thaddeus Tarpey
Journal:  Stat Interface       Date:  2009-01-01       Impact factor: 0.582

2.  Optimal Partitioning for Linear Mixed Effects Models: Applications to Identifying Placebo Responders.

Authors:  Thaddeus Tarpey; Eva Petkova; Yimeng Lu; Usha Govindarajulu
Journal:  J Am Stat Assoc       Date:  2010-01-01       Impact factor: 5.033

3.  Principal Point Classification: Applications to Differentiating Drug and Placebo Responses in Longitudinal Studies.

Authors:  Thaddeus Tarpey; Eva Petkova
Journal:  J Stat Plan Inference       Date:  2010-02-01       Impact factor: 1.111

4.  Stratified Psychiatry via Convexity-Based Clustering with Applications Towards Moderator Analysis.

Authors:  Thaddeus Tarpey; Eva Petkova; Liangyu Zhu
Journal:  Stat Interface       Date:  2016-07-01       Impact factor: 0.582

5.  Reducing uncertainty in the american community survey through data-driven regionalization.

Authors:  Seth E Spielman; David C Folch
Journal:  PLoS One       Date:  2015-02-27       Impact factor: 3.240

6.  Identifying patterns in treatment response profiles in acute bipolar mania: a cluster analysis approach.

Authors:  Ilya A Lipkovich; John P Houston; Jonna Ahl
Journal:  BMC Psychiatry       Date:  2008-07-29       Impact factor: 3.630

7.  Partitioning of functional gene expression data using principal points.

Authors:  Jaehee Kim; Haseong Kim
Journal:  BMC Bioinformatics       Date:  2017-10-12       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.