Literature DB >> 25485026

CLUSTERING SOUTH AFRICAN HOUSEHOLDS BASED ON THEIR ASSET STATUS USING LATENT VARIABLE MODELS.

Damien McParland1, Isobel Claire Gormley1, Tyler H McCormick2, Samuel J Clark3, Chodziwadziwa Whiteson Kabudula4, Mark A Collinson5.   

Abstract

The Agincourt Health and Demographic Surveillance System has since 2001 conducted a biannual household asset survey in order to quantify household socio-economic status (SES) in a rural population living in northeast South Africa. The survey contains binary, ordinal and nominal items. In the absence of income or expenditure data, the SES landscape in the study population is explored and described by clustering the households into homogeneous groups based on their asset status. A model-based approach to clustering the Agincourt households, based on latent variable models, is proposed. In the case of modeling binary or ordinal items, item response theory models are employed. For nominal survey items, a factor analysis model, similar in nature to a multinomial probit model, is used. Both model types have an underlying latent variable structure-this similarity is exploited and the models are combined to produce a hybrid model capable of handling mixed data types. Further, a mixture of the hybrid models is considered to provide clustering capabilities within the context of mixed binary, ordinal and nominal response data. The proposed model is termed a mixture of factor analyzers for mixed data (MFA-MD). The MFA-MD model is applied to the survey data to cluster the Agincourt households into homogeneous groups. The model is estimated within the Bayesian paradigm, using a Markov chain Monte Carlo algorithm. Intuitive groupings result, providing insight to the different socio-economic strata within the Agincourt region.

Entities:  

Keywords:  Clustering; Metropolis-within-Gibbs; item response theory; mixed data

Year:  2014        PMID: 25485026      PMCID: PMC4256055          DOI: 10.1214/14-AOAS726

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  10 in total

1.  Finite mixture modeling with mixture outcomes using the EM algorithm.

Authors:  B Muthén; K Shedden
Journal:  Biometrics       Date:  1999-06       Impact factor: 2.571

2.  Bayesian Gaussian Copula Factor Models for Mixed Data.

Authors:  Jared S Murray; David B Dunson; Lawrence Carin; Joseph E Lucas
Journal:  J Am Stat Assoc       Date:  2013-06-01       Impact factor: 5.033

3.  Research into health, population and social transitions in rural South Africa: data and methods of the Agincourt Health and Demographic Surveillance System.

Authors:  Kathleen Kahn; Stephen M Tollman; Mark A Collinson; Samuel J Clark; Rhian Twine; Benjamin D Clark; Mildred Shabangu; Francesc Xavier Gómez-Olivé; Obed Mokoena; Michel L Garenne
Journal:  Scand J Public Health Suppl       Date:  2007-08       Impact factor: 3.021

4.  Constructing socio-economic status indices: how to use principal components analysis.

Authors:  Seema Vyas; Lilani Kumaranayake
Journal:  Health Policy Plan       Date:  2006-10-09       Impact factor: 3.344

5.  Socio-economic differences in health, nutrition, and population within developing countries: an overview.

Authors:  D R Gwatkin; S Rutstein; K Johnson; E Suliman; A Wagstaff; A Amouzou
Journal:  Niger J Clin Pract       Date:  2007-12       Impact factor: 0.968

6.  Bayesian Analysis of Multivariate Nominal Measures Using Multivariate Multinomial Probit Models.

Authors:  Xiao Zhang; W John Boscardin; Thomas R Belin
Journal:  Comput Stat Data Anal       Date:  2008-03-15       Impact factor: 1.681

7.  DESCRIBING DISABILITY THROUGH INDIVIDUAL-LEVEL MIXTURE MODELS FOR MULTIVARIATE BINARY DATA.

Authors:  Elena A Erosheva; Stephen E Fienberg; Cyrille Joutard
Journal:  Ann Appl Stat       Date:  2007       Impact factor: 2.083

8.  Sparse Bayesian infinite factor models.

Authors:  A Bhattacharya; D B Dunson
Journal:  Biometrika       Date:  2011-06       Impact factor: 2.445

9.  Estimating wealth effects without expenditure data--or tears: an application to educational enrollments in states of India.

Authors:  D Filmer; L H Pritchett
Journal:  Demography       Date:  2001-02

10.  CLUSTERING SOUTH AFRICAN HOUSEHOLDS BASED ON THEIR ASSET STATUS USING LATENT VARIABLE MODELS.

Authors:  Damien McParland; Isobel Claire Gormley; Tyler H McCormick; Samuel J Clark; Chodziwadziwa Whiteson Kabudula; Mark A Collinson
Journal:  Ann Appl Stat       Date:  2014-06-01       Impact factor: 2.083

  10 in total
  3 in total

1.  A Model-Based Approach to Simultaneous Clustering and Dimensional Reduction of Ordinal Data.

Authors:  Monia Ranalli; Roberto Rocci
Journal:  Psychometrika       Date:  2017-09-06       Impact factor: 2.500

2.  Generalized infinite factorization models.

Authors:  L Schiavon; A Canale; D B Dunson
Journal:  Biometrika       Date:  2022-01-19       Impact factor: 3.028

3.  CLUSTERING SOUTH AFRICAN HOUSEHOLDS BASED ON THEIR ASSET STATUS USING LATENT VARIABLE MODELS.

Authors:  Damien McParland; Isobel Claire Gormley; Tyler H McCormick; Samuel J Clark; Chodziwadziwa Whiteson Kabudula; Mark A Collinson
Journal:  Ann Appl Stat       Date:  2014-06-01       Impact factor: 2.083

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.