Literature DB >> 23908561

Simplex Factor Models for Multivariate Unordered Categorical Data.

Anirban Bhattacharya1, David B Dunson.   

Abstract

Gaussian latent factor models are routinely used for modeling of dependence in continuous, binary, and ordered categorical data. For unordered categorical variables, Gaussian latent factor models lead to challenging computation and complex modeling structures. As an alternative, we propose a novel class of simplex factor models. In the single-factor case, the model treats the different categorical outcomes as independent with unknown marginals. The model can characterize flexible dependence structures parsimoniously with few factors, and as factors are added, any multivariate categorical data distribution can be accurately approximated. Using a Bayesian approach for computation and inferences, a Markov chain Monte Carlo (MCMC) algorithm is proposed that scales well with increasing dimension, with the number of factors treated as unknown. We develop an efficient proposal for updating the base probability vector in hierarchical Dirichlet models. Theoretical properties are described, and we evaluate the approach through simulation examples. Applications are described for modeling dependence in nucleotide sequences and prediction from high-dimensional categorical features.

Entities:  

Keywords:  Classification; Contingency table; Factor analysis; Latent variable; Mutual information; Nonnegative tensor factorization; Nonparametric Bayes; Polytomous regression

Year:  2012        PMID: 23908561      PMCID: PMC3728016          DOI: 10.1080/01621459.2011.646934

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  13 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  A global map of p53 transcription-factor binding sites in the human genome.

Authors:  Chia-Lin Wei; Qiang Wu; Vinsensius B Vega; Kuo Ping Chiu; Patrick Ng; Tao Zhang; Atif Shahab; How Choong Yong; YuTao Fu; Zhiping Weng; JianJun Liu; Xiao Dong Zhao; Joon-Lin Chew; Yen Ling Lee; Vladimir A Kuznetsov; Wing-Kin Sung; Lance D Miller; Bing Lim; Edison T Liu; Qiang Yu; Huck-Hui Ng; Yijun Ruan
Journal:  Cell       Date:  2006-01-13       Impact factor: 41.582

3.  Kernel stick-breaking processes.

Authors:  David B Dunson; Ju-Hyun Park
Journal:  Biometrika       Date:  2008       Impact factor: 2.445

4.  Posterior consistency in conditional distribution estimation.

Authors:  Debdeep Pati; David B Dunson; Surya T Tokdar
Journal:  J Multivar Anal       Date:  2013-04-01       Impact factor: 1.473

5.  DESCRIBING DISABILITY THROUGH INDIVIDUAL-LEVEL MIXTURE MODELS FOR MULTIVARIATE BINARY DATA.

Authors:  Elena A Erosheva; Stephen E Fienberg; Cyrille Joutard
Journal:  Ann Appl Stat       Date:  2007       Impact factor: 2.083

6.  Nonparametric Bayesian models through probit stick-breaking processes.

Authors:  Abel Rodríguez; David B Dunson
Journal:  Bayesian Anal       Date:  2011-03-01       Impact factor: 3.728

7.  Multi-variate probit analysis.

Authors:  J R Ashford; R R Sowden
Journal:  Biometrics       Date:  1970-09       Impact factor: 2.571

8.  Some mathematical notes on three-mode factor analysis.

Authors:  L R Tucker
Journal:  Psychometrika       Date:  1966-09       Impact factor: 2.500

9.  Nonparametric Bayes local partition models for random effects.

Authors:  David B Dunson
Journal:  Biometrika       Date:  2009       Impact factor: 2.445

10.  Nonparametric Bayes Modeling of Multivariate Categorical Data.

Authors:  David B Dunson; Chuanhua Xing
Journal:  J Am Stat Assoc       Date:  2012-01-01       Impact factor: 5.033

View more
  11 in total

1.  Bayesian modeling of temporal dependence in large sparse contingency tables.

Authors:  Tsuyoshi Kunihama; David B Dunson
Journal:  J Am Stat Assoc       Date:  2013-01-01       Impact factor: 5.033

2.  COMPOSITE MIXTURE OF LOG-LINEAR MODELS WITH APPLICATION TO PSYCHIATRIC STUDIES.

Authors:  Emanuele Aliverti; David B Dunson
Journal:  Ann Appl Stat       Date:  2022-06-13       Impact factor: 1.959

3.  MULTIVARIATE MIXED MEMBERSHIP MODELING: INFERRING DOMAIN-SPECIFIC RISK PROFILES.

Authors:  Massimiliano Russo; Burton H Singer; David B Dunson
Journal:  Ann Appl Stat       Date:  2022-03-28       Impact factor: 1.959

4.  Fast Moment Estimation for Generalized Latent Dirichlet Models.

Authors:  Shiwen Zhao; Barbara E Engelhardt; Sayan Mukherjee; David B Dunson
Journal:  J Am Stat Assoc       Date:  2018-11-13       Impact factor: 4.369

5.  BAYESIAN SEMIPARAMETRIC ANALYSIS FOR TWO-PHASE STUDIES OF GENE-ENVIRONMENT INTERACTION.

Authors:  Jaeil Ahn; Bhramar Mukherjee; Stephen B Gruber; Malay Ghosh
Journal:  Ann Appl Stat       Date:  2013-03       Impact factor: 2.083

6.  Longitudinal Mixed Membership Trajectory Models for Disability Survey Data.

Authors:  Daniel Manrique-Vallier
Journal:  Ann Appl Stat       Date:  2014-12       Impact factor: 2.083

7.  Clustering and variable selection in the presence of mixed variable types and missing data.

Authors:  C B Storlie; S M Myers; S K Katusic; A L Weaver; R G Voigt; P E Croarkin; R E Stoeckel; J D Port
Journal:  Stat Med       Date:  2018-05-17       Impact factor: 2.373

8.  TENSOR DECOMPOSITIONS AND SPARSE LOG-LINEAR MODELS.

Authors:  James E Johndrow; Anirban Bhattacharya; David B Dunson
Journal:  Ann Stat       Date:  2017-02-21       Impact factor: 4.028

9.  Exploring dependence between categorical variables: Benefits and limitations of using variable selection within Bayesian clustering in relation to log-linear modelling with interaction terms.

Authors:  Michail Papathomas; Sylvia Richardson
Journal:  J Stat Plan Inference       Date:  2016-06       Impact factor: 1.111

10.  Cross-Cultural Revision and Psychometric Properties of the Chinese Version of the Autism Spectrum Rating Scale (2-5 Years).

Authors:  Hao Zhou; Chunpei Li; Xuerong Luo; Lijie Wu; Yi Huang; Lan Zhang; Xiaobing Zou; Xiu Xu; Yong-Hui Jiang; Weili Yan; Yi Wang
Journal:  Front Neurol       Date:  2018-06-27       Impact factor: 4.003

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.