Literature DB >> 35875263

Fast Moment Estimation for Generalized Latent Dirichlet Models.

Shiwen Zhao1, Barbara E Engelhardt2, Sayan Mukherjee1, David B Dunson1.   

Abstract

We develop a generalized method of moments (GMM) approach for fast parameter estimation in a new class of Dirichlet latent variable models with mixed data types. Parameter estimation via GMM has computational and statistical advantages over alternative methods, such as expectation maximization, variational inference, and Markov chain Monte Carlo. A key computational advantage of our method, Moment Estimation for latent Dirichlet models (MELD), is that parameter estimation does not require instantiation of the latent variables. Moreover, performance is agnostic to distributional assumptions of the observations. We derive population moment conditions after marginalizing out the sample-specific Dirichlet latent variables. The moment conditions only depend on component mean parameters. We illustrate the utility of our approach on simulated data, comparing results from MELD to alternative methods, and we show the promise of our approach through the application to several datasets. Supplementary materials for this article are available online.

Entities:  

Keywords:  Generalized method of moments; Latent Dirichlet allocation; Latent variables; Mixed membership model; Mixed scale data; Tensor factorization

Year:  2018        PMID: 35875263      PMCID: PMC9302535          DOI: 10.1080/01621459.2017.1341839

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   4.369


  9 in total

1.  Association mapping in structured populations.

Authors:  J K Pritchard; M Stephens; N A Rosenberg; P Donnelly
Journal:  Am J Hum Genet       Date:  2000-05-26       Impact factor: 11.025

2.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

3.  Bayesian Gaussian Copula Factor Models for Mixed Data.

Authors:  Jared S Murray; David B Dunson; Lawrence Carin; Joseph E Lucas
Journal:  J Am Stat Assoc       Date:  2013-06-01       Impact factor: 5.033

4.  FastMotif: spectral sequence motif discovery.

Authors:  Nicoló Colombo; Nikos Vlassis
Journal:  Bioinformatics       Date:  2015-04-16       Impact factor: 6.937

5.  Analysis of E. coli promoter sequences.

Authors:  C B Harley; R P Reynolds
Journal:  Nucleic Acids Res       Date:  1987-03-11       Impact factor: 16.971

6.  Simplex Factor Models for Multivariate Unordered Categorical Data.

Authors:  Anirban Bhattacharya; David B Dunson
Journal:  J Am Stat Assoc       Date:  2012-03-01       Impact factor: 5.033

7.  Nonparametric Bayes Modeling of Multivariate Categorical Data.

Authors:  David B Dunson; Chuanhua Xing
Journal:  J Am Stat Assoc       Date:  2012-01-01       Impact factor: 5.033

8.  Model-implied instrumental variable-generalized method of moments (MIIV-GMM) estimators for latent variable models.

Authors:  Kenneth A Bollen; Stanislav Kolenikov; Shawn Bauldry
Journal:  Psychometrika       Date:  2013-04-11       Impact factor: 2.500

9.  Patterns of cis regulatory variation in diverse human populations.

Authors:  Barbara E Stranger; Stephen B Montgomery; Antigone S Dimas; Leopold Parts; Oliver Stegle; Catherine E Ingle; Magda Sekowska; George Davey Smith; David Evans; Maria Gutierrez-Arcelus; Alkes Price; Towfique Raj; James Nisbett; Alexandra C Nica; Claude Beazley; Richard Durbin; Panos Deloukas; Emmanouil T Dermitzakis
Journal:  PLoS Genet       Date:  2012-04-19       Impact factor: 5.917

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.