Literature DB >> 29785064

Identification of relevant subtypes via preweighted sparse clustering.

Sheila Gaynor1, Eric Bair2.   

Abstract

Cluster analysis methods are used to identify homogeneous subgroups in a data set. In biomedical applications, one frequently applies cluster analysis in order to identify biologically interesting subgroups. In particular, one may wish to identify subgroups that are associated with a particular outcome of interest. Conventional clustering methods generally do not identify such subgroups, particularly when there are a large number of high-variance features in the data set. Conventional methods may identify clusters associated with these high-variance features when one wishes to obtain secondary clusters that are more interesting biologically or more strongly associated with a particular outcome of interest. A modification of sparse clustering can be used to identify such secondary clusters or clusters associated with an outcome of interest. This method correctly identifies such clusters of interest in several simulation scenarios. The method is also applied to a large prospective cohort study of temporomandibular disorders and a leukemia microarray data set.

Entities:  

Keywords:  Cancer; Cluster analysis; High-dimensional data; K-means clustering; Temporomandibular disorders

Year:  2017        PMID: 29785064      PMCID: PMC5959300          DOI: 10.1016/j.csda.2017.06.003

Source DB:  PubMed          Journal:  Comput Stat Data Anal        ISSN: 0167-9473            Impact factor:   1.681


  30 in total

1.  Mixture modelling of gene expression data from microarray experiments.

Authors:  Debashis Ghosh; Arul M Chinnaiyan
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

2.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

3.  Semi-supervised recursively partitioned mixture models for identifying cancer subtypes.

Authors:  Devin C Koestler; Carmen J Marsit; Brock C Christensen; Margaret R Karagas; Raphael Bueno; David J Sugarbaker; Karl T Kelsey; E Andres Houseman
Journal:  Bioinformatics       Date:  2010-08-16       Impact factor: 6.937

4.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

5.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.

Authors:  A Bhattacharjee; W G Richards; J Staunton; C Li; S Monti; P Vasa; C Ladd; J Beheshti; R Bueno; M Gillette; M Loda; G Weber; E J Mark; E S Lander; W Wong; B E Johnson; T R Golub; D J Sugarbaker; M Meyerson
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-13       Impact factor: 11.205

6.  Gene expression profiling identifies clinically relevant subtypes of prostate cancer.

Authors:  Jacques Lapointe; Chunde Li; John P Higgins; Matt van de Rijn; Eric Bair; Kelli Montgomery; Michelle Ferrari; Lars Egevad; Walter Rayford; Ulf Bergerheim; Peter Ekman; Angelo M DeMarzo; Robert Tibshirani; David Botstein; Patrick O Brown; James D Brooks; Jonathan R Pollack
Journal:  Proc Natl Acad Sci U S A       Date:  2004-01-07       Impact factor: 11.205

7.  Study protocol, sample characteristics, and loss to follow-up: the OPPERA prospective cohort study.

Authors:  Eric Bair; Naomi C Brownstein; Richard Ohrbach; Joel D Greenspan; Ronald Dubner; Roger B Fillingim; William Maixner; Shad B Smith; Luda Diatchenko; Yoly Gonzalez; Sharon M Gordon; Pei-Feng Lim; Margarete Ribeiro-Dasilva; Dawn Dampier; Charles Knott; Gary D Slade
Journal:  J Pain       Date:  2013-12       Impact factor: 5.820

8.  Psychological factors associated with development of TMD: the OPPERA prospective cohort study.

Authors:  Roger B Fillingim; Richard Ohrbach; Joel D Greenspan; Charles Knott; Luda Diatchenko; Ronald Dubner; Eric Bair; Cristina Baraian; Nicole Mack; Gary D Slade; William Maixner
Journal:  J Pain       Date:  2013-12       Impact factor: 5.820

9.  Identification of clusters of individuals relevant to temporomandibular disorders and other chronic pain conditions: the OPPERA study.

Authors:  Eric Bair; Sheila Gaynor; Gary D Slade; Richard Ohrbach; Roger B Fillingim; Joel D Greenspan; Ronald Dubner; Shad B Smith; Luda Diatchenko; William Maixner
Journal:  Pain       Date:  2016-06       Impact factor: 7.926

10.  Semi-supervised methods to predict patient survival from gene expression data.

Authors:  Eric Bair; Robert Tibshirani
Journal:  PLoS Biol       Date:  2004-04-13       Impact factor: 8.029

View more
  3 in total

1.  Neural network-based clustering model of ischemic stroke patients with a maximally distinct distribution of 1-year vascular outcomes.

Authors:  Joon-Tae Kim; Nu Ri Kim; Su Hoon Choi; Seungwon Oh; Man-Seok Park; Seung-Han Lee; Byeong C Kim; Jonghyun Choi; Min Soo Kim
Journal:  Sci Rep       Date:  2022-06-08       Impact factor: 4.996

2.  Neural Survival Clustering: Non-parametric mixture of neural networks for survival clustering.

Authors:  Vincent Jeanselme; Brian Tom; Jessica Barrett
Journal:  Proc Mach Learn Res       Date:  2022

3.  LncRNA-Encoded Short Peptides Identification Using Feature Subset Recombination and Ensemble Learning.

Authors:  Siyuan Zhao; Jun Meng; Yushi Luan
Journal:  Interdiscip Sci       Date:  2021-07-25       Impact factor: 2.233

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.