Literature DB >> 31424089

Biclustering via sparse clustering.

Erika S Helgeson1, Qian Liu2, Guanhua Chen3, Michael R Kosorok2, Eric Bair4.   

Abstract

In identifying subgroups of a heterogeneous disease or condition, it is often desirable to identify both the observations and the features which differ between subgroups. For instance, it may be that there is a subgroup of individuals with a certain disease who differ from the rest of the population based on the expression profile for only a subset of genes. Identifying the subgroup of patients and subset of genes could lead to better-targeted therapy. We can represent the subgroup of individuals and genes as a bicluster, a submatrix, U , of a larger data matrix, X , such that the features and observations in U differ from those not contained in U . We present a novel two-step method, SC-Biclust, for identifying U . In the first step, the observations in the bicluster are identified to maximize the sum of the weighted between-cluster feature differences. In the second step, features in the bicluster are identified based on their contribution to the clustering of the observations. This versatile method can be used to identify biclusters that differ on the basis of feature means, feature variances, or more general differences. The bicluster identification accuracy of SC-Biclust is illustrated through several simulated studies. Application of SC-Biclust to pain research illustrates its ability to identify biologically meaningful subgroups.
© 2019 The International Biometric Society.

Entities:  

Keywords:  biclustering; hierarchical clustering; high-dimensional data; k-means clustering; sparse clustering

Year:  2019        PMID: 31424089      PMCID: PMC7028479          DOI: 10.1111/biom.13136

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  16 in total

1.  Sparse Biclustering of Transposable Data.

Authors:  Kean Ming Tan; Daniela M Witten
Journal:  J Comput Graph Stat       Date:  2014       Impact factor: 2.302

2.  Potential psychosocial risk factors for chronic TMD: descriptive data and empirically identified domains from the OPPERA case-control study.

Authors:  Roger B Fillingim; Richard Ohrbach; Joel D Greenspan; Charles Knott; Ronald Dubner; Eric Bair; Cristina Baraian; Gary D Slade; William Maixner
Journal:  J Pain       Date:  2011-11       Impact factor: 5.820

3.  Study methods, recruitment, sociodemographic findings, and demographic representativeness in the OPPERA study.

Authors:  Gary D Slade; Eric Bair; Kunthel By; Flora Mulkey; Cristina Baraian; Rebecca Rothwell; Maria Reynolds; Vanessa Miller; Yoly Gonzalez; Sharon Gordon; Margarete Ribeiro-Dasilva; Pei Feng Lim; Joel D Greenspan; Ron Dubner; Roger B Fillingim; Luda Diatchenko; William Maixner; Dawn Dampier; Charles Knott; Richard Ohrbach
Journal:  J Pain       Date:  2011-11       Impact factor: 5.820

4.  Pain sensitivity risk factors for chronic TMD: descriptive data and empirically identified domains from the OPPERA case control study.

Authors:  Joel D Greenspan; Gary D Slade; Eric Bair; Ronald Dubner; Roger B Fillingim; Richard Ohrbach; Charlie Knott; Flora Mulkey; Rebecca Rothwell; William Maixner
Journal:  J Pain       Date:  2011-11       Impact factor: 5.820

5.  Biclustering via sparse singular value decomposition.

Authors:  Mihee Lee; Haipeng Shen; Jianhua Z Huang; J S Marron
Journal:  Biometrics       Date:  2010-12       Impact factor: 2.571

6.  Biclustering with heterogeneous variance.

Authors:  Guanhua Chen; Patrick F Sullivan; Michael R Kosorok
Journal:  Proc Natl Acad Sci U S A       Date:  2013-07-08       Impact factor: 11.205

7.  Robust biclustering by sparse singular value decomposition incorporating stability selection.

Authors:  Martin Sill; Sebastian Kaiser; Axel Benner; Annette Kopp-Schneider
Journal:  Bioinformatics       Date:  2011-06-02       Impact factor: 6.937

8.  A framework for feature selection in clustering.

Authors:  Daniela M Witten; Robert Tibshirani
Journal:  J Am Stat Assoc       Date:  2010-06-01       Impact factor: 5.033

9.  Molecular portraits of human breast tumours.

Authors:  C M Perou; T Sørlie; M B Eisen; M van de Rijn; S S Jeffrey; C A Rees; J R Pollack; D T Ross; H Johnsen; L A Akslen; O Fluge; A Pergamenschikov; C Williams; S X Zhu; P E Lønning; A L Børresen-Dale; P O Brown; D Botstein
Journal:  Nature       Date:  2000-08-17       Impact factor: 49.962

10.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.

Authors:  A Bhattacharjee; W G Richards; J Staunton; C Li; S Monti; P Vasa; C Ladd; J Beheshti; R Bueno; M Gillette; M Loda; G Weber; E J Mark; E S Lander; W Wong; B E Johnson; T R Golub; D J Sugarbaker; M Meyerson
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-13       Impact factor: 11.205

View more
  2 in total

1.  Moderator effect of sex in the clustering of treatment-seeking patients with gambling problems.

Authors:  Susana Jiménez-Murcia; Roser Granero; Mònica Giménez; Amparo Del Pino-Gutiérrez; Gemma Mestre-Bach; Teresa Mena-Moreno; Laura Moragas; Marta Baño; Jéssica Sánchez-González; Marta de Gracia; Isabel Baenas-Soto; S Fabrizio Contaldo; Eduardo Valenciano-Mendoza; Bernat Mora-Maltas; Hibai López-González; José M Menchón; Fernando Fernández-Aranda
Journal:  Neuropsychiatr       Date:  2020-03-03

2.  Multidimensional molecular measurements-environment interaction analysis for disease outcomes.

Authors:  Yaqing Xu; Mengyun Wu; Shuangge Ma
Journal:  Biometrics       Date:  2021-07-02       Impact factor: 1.701

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.