| Literature DB >> 24901659 |
M S Gilthorpe1, D L Dahly2, Y K Tu3, L D Kubzansky4, E Goodman5.
Abstract
Lifecourse trajectories of clinical or anthropological attributes are useful for identifying how our early-life experiences influence later-life morbidity and mortality. Researchers often use growth mixture models (GMMs) to estimate such phenomena. It is common to place constrains on the random part of the GMM to improve parsimony or to aid convergence, but this can lead to an autoregressive structure that distorts the nature of the mixtures and subsequent model interpretation. This is especially true if changes in the outcome within individuals are gradual compared with the magnitude of differences between individuals. This is not widely appreciated, nor is its impact well understood. Using repeat measures of body mass index (BMI) for 1528 US adolescents, we estimated GMMs that required variance-covariance constraints to attain convergence. We contrasted constrained models with and without an autocorrelation structure to assess the impact this had on the ideal number of latent classes, their size and composition. We also contrasted model options using simulations. When the GMM variance-covariance structure was constrained, a within-class autocorrelation structure emerged. When not modelled explicitly, this led to poorer model fit and models that differed substantially in the ideal number of latent classes, as well as class size and composition. Failure to carefully consider the random structure of data within a GMM framework may lead to erroneous model inferences, especially for outcomes with greater within-person than between-person homogeneity, such as BMI. It is crucial to reflect on the underlying data generation processes when building such models.Entities:
Mesh:
Year: 2014 PMID: 24901659 PMCID: PMC4098080 DOI: 10.1017/S2040174414000130
Source DB: PubMed Journal: J Dev Orig Health Dis ISSN: 2040-1744 Impact factor: 2.401
Study data set structure and features
|
| Mean BMI ( | |
|---|---|---|
| Gender | ||
| Male | 745 (48.8) | 23.0 (5.6) |
| Female | 783 (51.2) | 23.7 (6.3) |
| Race/ethnicity | ||
| White | 809 (52.9) | 22.5 (5.0) |
| Black | 719 (47.1) | 24.4 (6.8) |
| Pubertal status | ||
| Puberty | 749 (49.1) | 21.9 (5.5) |
| Post-puberty | 776 (50.9) | 24.8 (6.1) |
| Parent’s education | ||
| <High school | 358 (23.4) | 23.8 (6.4) |
| High school | 447 (29.3) | 24.4 (6.9) |
| Some college | 419 (27.4) | 23.0 (5.3) |
| College or more | 304 (19.9) | 21.9 (4.3) |
BMI, body mass index.
Summary of growth mixture model (GMM) convergence characteristics and model-fit criteria for the illustrative study data: 10 restricted standard GMMs (Std) and 10 restricted AR1 GMMs (AR1)
| Convergence | -2LL | AIC | BIC | ||||
|---|---|---|---|---|---|---|---|
| % success | % agree | Estimate | df | Estimate | Estimate | AR1(ρ) | |
| Std | |||||||
| 2-class | 100.0 | 100.0 | 24,235.8 | 15 | 24,265.8 | 24,345.8 | – |
| 3-class | 100.0 | 100.0 | 23,742.2 | 20 | 23,782.2 | 23,888.9 | – |
| 4-class | 99.9 | 100.0 | 23,396.8 | 25 | 23,446.8 | 23,580.1 | – |
| 5-class | 99.8 | 90.8 | 23,140.8 | 30 | 23,200.8 | 23,360.7 | – |
| 6-class | 99.8 | 60.5 | 22,949.8 | 35 | 23,019.8 | 23,206.4 | – |
| 7-class | 99.6 | 100.0 | 22,841.5 | 40 | 22,921.5 | 23,134.8 | – |
| 8-class | 99.4 | 100.0 | 22,758.2 | 45 | 22,848.2 | 23,088.1 | – |
| 9-class | 99.1 | 28.9 | 22,699.9 | 50 | 22,799.9 | 23,066.5 | – |
| 10-class | 98.9 | 10.7 | 22,656.3 | 55 | 22,766.3 | 23,059.5 | – |
| 11-class | 98.7 | 11.5 | 22,617.1 | 60 | 22,737.1 | 23,057.0 | – |
| AR1 | |||||||
| 2-class | 19.7 | 100.0 | 22,916.2 | 15 | 22,946.2 | 23,026.2 | 0.944 |
| 3-class | 20.1 | 100.0 | 22,692.4 | 21 | 22,734.4 | 22,846.3 | 0.923 |
| 4-class | 20.1 | 26.1 | 22,613.6 | 26 | 22,665.6 | 22,804.3 | 0.873 |
| 5-class | 20.1 | 23.4 | 22,561.9 | 31 | 22,623.9 | 22,789.2 | 0.842 |
| 6-class | 19.4 | 48.1 | 22,522.8 | 36 | 22,594.8 | 22,786.8 | 0.850 |
| 7-class | 19.5 | 46.3 | 22,499.2 | 41 | 22,581.2 | 22,799.8 | 0.805 |
| 8-class | 19.7 | 8.7 | 22,472.1 | 46 | 22,564.1 | 22,809.3 | 0.733 |
| 9-class | 19.7 | 1.4 | 22,450.7 | 51 | 22,552.7 | 22,824.6 | 0.734 |
| 10-class | 19.4 | 1.9 | 22,432.0 | 56 | 22,544.0 | 22,842.6 | 0.742 |
| 11-class | 19.4 | 0.2 | 22,414.4 | 61 | 22,536.4 | 22,861.6 | 0.742 |
LL, log-likelihood; AIC, Akaike’s Information Criterion; BIC, Bayesian Information Criterion; df, degrees of freedom.
Percentage of successes derived as proportion of the 20k random starts that converged to a maximum likelihood.
Percentage of successes as proportion of the 2k (10%) better models that converged that also agree on the same log-likelihood value derived.
For the two-class AR1 model, the intercept variance was constrained to zero to attain convergence with non-negative variances.
Fig. 1Likelihood-based model-fit criteria for growth mixture models (GMMs): 10 restricted standard (Std) and 10 restricted AR1 (AR1).
Fig. 2Variation in class trajectory intercept residual variances and model trajectory intercept residual variance: 10 restricted standardgrowth mixture models (GMMs; Std) and 10 restricted AR1 GMMs (AR1); for the two-class AR1 model, intercept variance was constrained to zero to attain convergence with non-negative variances.
Mean likelihood statistics for growth mixture models (GMMs) of nine simulated data sets
| Mean -2LL | df | Mean AIC | Mean BIC | Mean AR1(ρ) | |
|---|---|---|---|---|---|
| Unrestricted | |||||
| 2-class | 41,877.6 | 21 | 41,919.6 | 42,031.6 | – |
| Restricted standard | |||||
| 1-class | 42,130.9 | 5 | 42,140.9 | 42,167.6 | – |
| 2-class | 41,980.8 | 11 | 42,002.8 | 42,061.5 | – |
| 3-class | 41,919.5 | 17 | 41,953.5 | 42,044.1 | – |
| 4-class | 41,892.4 | 23 | 41,938.4 | 42,061.0 | – |
| 5-class | 41,871.5 | 29 | 41,929.5 | 42,084.2 | – |
| Restricted AR1 | |||||
| 1-class | 42,017.9 | 6 | 42,029.9 | 42,061.9 | 0.255 |
| 2-class | 41,936.9 | 12 | 41,960.9 | 42,024.8 | 0.269 |
| 3-class | 41,905.5 | 18 | 41,941.5 | 42,037.4 | 0.141 |
LL, log-likelihood; AIC, Akaike’s Information Criterion; BIC, Bayesian Information Criterion; df, degrees of freedom.
Class correspondence for two-class growth mixture models (GMMs): unrestricted random effects and restricted random effects either with or without AR1: mean (s.d.) modal assignments of 1528 individuals across the nine simulated data sets
| GMM contrast made | % correspondence | Rand (%) | Adjusted Rand (%) |
|---|---|---|---|
| Against true | |||
| Unrestricted | 86.1 (0.9) | 76.1 (1.3) | 52.2 (2.5) |
| Restricted AR1 | 86.7 (1.2) | 76.9 (1.7) | 53.8 (3.5) |
| Restricted | 52.2 (1.6) | 50.1 (0.1) | 0.2 (0.3) |
| Against each other | |||
| Unrestricted | 92.5 (1.9) | 86.1 (3.2) | 72.2 (6.5) |
| Unrestricted | 52.9 (3.1) | 50.3 (0.5) | 0.6 (1.1) |
| Restricted AR1 | 53.3 (2.7) | 50.3 (0.5) | 0.6 (1.0) |
Adjusted Rand accommodates for chance.
Contrast of class correspondence based on ordered class sizes for 10 growth mixture models (GMMs) with and without AR1 based on modal assignment of 1528 individuals in the illustrative data set
| Modal assignment | ||||
|---|---|---|---|---|
| Classes | % correspondence | Rand (%) | Adjusted Rand (%) | Drift (%) |
| 2-class | 90.7 | 83.1 | 65.9 | 116 |
| 3-class | 88.6 | 50.6 | 0.3 | 56 |
| 4-class | 68.0 | 77.9 | 54.1 | −175 |
| 5-class | 66.9 | 80.3 | 58.2 | −326 |
| 6-class | 25.1 | 75.1 | 34.6 | −327 |
| 7-class | 54.7 | 80.2 | 46.3 | −320 |
| 8-class | 45.9 | 79.6 | 42.7 | −389 |
| 9-class | 20.5 | 64.5 | −0.6 | −214 |
| 10-class | 40.5 | 81.6 | 46.4 | −335 |
| 11-class | 21.3 | 66.5 | 0.4 | −128 |
Adjusted Rand accommodates for chance.
Net difference in the number of individuals within the smaller classes within the AR1 model.
For the 2-class AR1 model, the intercept variance was constrained to zero to attain convergence with non-negative variances.