Literature DB >> 22495652

Supervised Bayesian latent class models for high-dimensional data.

Stacia M Desantis1, E Andrés Houseman, Brent A Coull, Catherine L Nutt, Rebecca A Betensky.   

Abstract

High-grade gliomas are the most common primary brain tumors in adults and are typically diagnosed using histopathology. However, these diagnostic categories are highly heterogeneous and do not always correlate well with survival. In an attempt to refine these diagnoses, we make several immunohistochemical measurements of YKL-40, a gene previously shown to be differentially expressed between diagnostic groups. We propose two latent class models for classification and variable selection in the presence of high-dimensional binary data, fit by using Bayesian Markov chain Monte Carlo techniques. Penalization and model selection are incorporated in this setting via prior distributions on the unknown parameters. The methods provide valid parameter estimates under conditions in which standard supervised latent class models do not, and outperform two-stage approaches to variable selection and parameter estimation in a variety of settings. We study the properties of these methods in simulations, and apply these methodologies to the glioma study for which identifiable three-class parameter estimates cannot be obtained without penalization. With penalization, the resulting latent classes correlate well with clinical tumor grade and offer additional information on survival prognosis that is not captured by clinical diagnosis alone. The inclusion of YKL-40 features also increases the precision of survival estimates. Fitting models with and without YKL-40 highlights a subgroup of patients who have glioblastoma (GBM) diagnosis but appear to have better prognosis than the typical GBM patient.
Copyright © 2012 John Wiley & Sons, Ltd.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22495652      PMCID: PMC3701307          DOI: 10.1002/sim.4448

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  20 in total

1.  A mixture model-based approach to the clustering of microarray expression data.

Authors:  G J McLachlan; R W Bean; D Peel
Journal:  Bioinformatics       Date:  2002-03       Impact factor: 6.937

2.  Variable selection for model-based high-dimensional clustering and its application to microarray data.

Authors:  Sijian Wang; Ji Zhu
Journal:  Biometrics       Date:  2007-10-26       Impact factor: 2.571

3.  A penalized latent class model for ordinal data.

Authors:  Stacia M Desantis; E Andrés Houseman; Brent A Coull; Anat Stemmer-Rachamimov; Rebecca A Betensky
Journal:  Biostatistics       Date:  2007-07-11       Impact factor: 5.899

4.  Fitting semiparametric random effects models to large data sets.

Authors:  Michael L Pennell; David B Dunson
Journal:  Biostatistics       Date:  2007-04-11       Impact factor: 5.899

5.  Latent variable modeling of diagnostic accuracy.

Authors:  I Yang; M P Becker
Journal:  Biometrics       Date:  1997-09       Impact factor: 2.571

6.  Expression of oligodendroglial and astrocytic lineage markers in diffuse gliomas: use of YKL-40, ApoE, ASCL1, and NKX2-2.

Authors:  Audrey Rousseau; Catherine L Nutt; Rebecca A Betensky; A John Iafrate; Moonjoo Han; Keith L Ligon; David H Rowitch; David N Louis
Journal:  J Neuropathol Exp Neurol       Date:  2006-12       Impact factor: 3.685

7.  Quasi-symmetric latent class models, with application to rater agreement.

Authors:  A Agresti; J B Lang
Journal:  Biometrics       Date:  1993-03       Impact factor: 2.571

8.  Joint analysis of time-to-event and multiple binary indicators of latent classes.

Authors:  Klaus Larsen
Journal:  Biometrics       Date:  2004-03       Impact factor: 2.571

9.  Semi-supervised methods to predict patient survival from gene expression data.

Authors:  Eric Bair; Robert Tibshirani
Journal:  PLoS Biol       Date:  2004-04-13       Impact factor: 8.029

10.  Feature selection for predicting tumor metastases in microarray experiments using paired design.

Authors:  Qihua Tan; Mads Thomassen; Torben A Kruse
Journal:  Cancer Inform       Date:  2007-03-20
View more
  1 in total

1.  Biclustering Models for Two-Mode Ordinal Data.

Authors:  Eleni Matechou; Ivy Liu; Daniel Fernández; Miguel Farias; Bergljot Gjelsvik
Journal:  Psychometrika       Date:  2016-06-21       Impact factor: 2.500

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.