Literature DB >> 35757598

MULTIVARIATE MIXED MEMBERSHIP MODELING: INFERRING DOMAIN-SPECIFIC RISK PROFILES.

Massimiliano Russo1, Burton H Singer2, David B Dunson3.   

Abstract

Characterizing the shared memberships of individuals in a classification scheme poses severe interpretability issues, even when using a moderate number of classes (say 4). Mixed membership models quantify this phenomenon, but they typically focus on goodness-of-fit more than on interpretable inference. To achieve a good numerical fit, these models may in fact require many extreme profiles, making the results difficult to interpret. We introduce a new class of multivariate mixed membership models that, when variables can be partitioned into subject-matter based domains, can provide a good fit to the data using fewer profiles than standard formulations. The proposed model explicitly accounts for the blocks of variables corresponding to the distinct domains along with a cross-domain correlation structure, which provides new information about shared membership of individuals in a complex classification scheme. We specify a multivariate logistic normal distribution for the membership vectors, which allows easy introduction of auxiliary information leveraging a latent multivariate logistic regression. A Bayesian approach to inference, relying on Pólya gamma data augmentation, facilitates efficient posterior computation via Markov Chain Monte Carlo. We apply this methodology to a spatially explicit study of malaria risk over time on the Brazilian Amazon frontier.

Entities:  

Keywords:  Admixture model; Contingency table; Latent Dirichlet allocation; Multivariate categorical data; Multivariate logistic normal distribution

Year:  2022        PMID: 35757598      PMCID: PMC9222983          DOI: 10.1214/21-aoas1496

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   1.959


  10 in total

1.  The magical number seven plus or minus two: some limits on our capacity for processing information.

Authors:  G A MILLER
Journal:  Psychol Rev       Date:  1956-03       Impact factor: 8.934

2.  Mixed-membership models of scientific publications.

Authors:  Elena Erosheva; Stephen Fienberg; John Lafferty
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-12       Impact factor: 11.205

3.  Finding scientific topics.

Authors:  Thomas L Griffiths; Mark Steyvers
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-10       Impact factor: 11.205

4.  Spatial patterns of malaria in the Amazon: implications for surveillance and targeted interventions.

Authors:  Marcia Caldas de Castro; Diana Oya Sawyer; Burton H Singer
Journal:  Health Place       Date:  2006-07-11       Impact factor: 4.078

5.  DESCRIBING DISABILITY THROUGH INDIVIDUAL-LEVEL MIXTURE MODELS FOR MULTIVARIATE BINARY DATA.

Authors:  Elena A Erosheva; Stephen E Fienberg; Cyrille Joutard
Journal:  Ann Appl Stat       Date:  2007       Impact factor: 2.083

6.  Mixed Membership Stochastic Blockmodels.

Authors:  Edoardo M Airoldi; David M Blei; Stephen E Fienberg; Eric P Xing
Journal:  J Mach Learn Res       Date:  2008-09       Impact factor: 3.654

7.  Mathematical typology: a grade of membership technique for obtaining disease definition.

Authors:  M A Woodbury; J Clive; A Garson
Journal:  Comput Biomed Res       Date:  1978-06

8.  Simplex Factor Models for Multivariate Unordered Categorical Data.

Authors:  Anirban Bhattacharya; David B Dunson
Journal:  J Am Stat Assoc       Date:  2012-03-01       Impact factor: 5.033

9.  Black/white differences in health status and mortality among the elderly.

Authors:  L Berkman; B Singer; K Manton
Journal:  Demography       Date:  1989-11

10.  Malaria risk on the Amazon frontier.

Authors:  Marcia Caldas de Castro; Roberto L Monte-Mór; Diana O Sawyer; Burton H Singer
Journal:  Proc Natl Acad Sci U S A       Date:  2006-02-06       Impact factor: 11.205

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.