| Literature DB >> 19136448 |
Li Su1, Brian D M Tom, Vernon T Farewell.
Abstract
Semicontinuous data in the form of a mixture of zeros and continuously distributed positive values frequently arise in biomedical research. Two-part mixed models with correlated random effects are an attractive approach to characterize the complex structure of longitudinal semicontinuous data. In practice, however, an independence assumption about random effects in these models may often be made for convenience and computational feasibility. In this article, we show that bias can be induced for regression coefficients when random effects are truly correlated but misspecified as independent in a 2-part mixed model. Paralleling work on bias under nonignorable missingness within a shared parameter model, we derive and investigate the asymptotic bias in selected settings for misspecified 2-part mixed models. The performance of these models in practice is further evaluated using Monte Carlo simulations. Additionally, the potential bias is investigated when artificial zeros, due to left censoring from some detection or measuring limit, are incorporated. To illustrate, we fit different 2-part mixed models to the data from the University of Toronto Psoriatic Arthritis Clinic, the aim being to examine whether there are differential effects of disease activity and damage on physical functioning as measured by the health assessment questionnaire scores over the course of psoriatic arthritis. Some practical issues on variance component estimation revealed through this data analysis are considered.Entities:
Mesh:
Year: 2009 PMID: 19136448 PMCID: PMC2648907 DOI: 10.1093/biostatistics/kxn044
Source DB: PubMed Journal: Biostatistics ISSN: 1465-4644 Impact factor: 5.899
Fig. 1.Bar plot for the HAQ data in Section 1.1.
Fig. 2.Contour plots of asymptotic bias for the intercept term β0 in misspecified 2-part mixed model in Section 3.1 by occurrence random-intercept variance σ2 and intraclass correlation ψ = σ2/(σ2 + σe2), stratified by correlation between random effects (ρ = (0.2,0.5,0.8)) and overall proportion of zeros (i.e. intercept term in the binary part θ0 = ( − 0.5,0.5,1.5); (θ1,θ2) = ( − 1,log(2)) are fixed). The error variance is fixed at σe2 = 0.08
Parameter estimates in the binary part of the model for the HAQ data
| Parameters | Misspecified model | Full model | Latent process model | |||
| Estimate (SE) | Estimate (SE) | Estimate (SE) | ||||
| Intercept | – 1.0199 (0.4079) | 0.0129 | – 1.0015(0.3746) | 0.0078 | – 0.9909(0.3556) | 0.0056 |
| Age at onset of PsA | 0.6031 (0.1743) | 0.0006 | 0.6266(0.1611) | 0.0001 | 0.6392(0.1538) | < 0.0001 |
| Sex | ||||||
| Male | ||||||
| Female | 1.9944(0.3603) | < 0.0001 | 2.0080 (0.3276) | < 0.0001 | 2.0037(0.3149) | < 0.0001 |
| PsA disease duration | – 0.0027 (0.0259) | 0.9169 | 0.0156(0.0232) | 0.5027 | 0.0166(0.0220) | 0.4501 |
| Actively inflamed joints | 0.1758 (0.0513) | 0.0007 | 0.1566(0.0495) | 0.0017 | 0.1380(0.0465) | 0.0032 |
| Clinically deformed joints | – 0.0161 (0.0321) | 0.6165 | 0.0120(0.0260) | 0.6441 | 0.0179(0.0238) | 0.4531 |
| PASI score | 0.1941 (0.1257) | 0.1233 | 0.1754(0.1086) | 0.1071 | 0.1543(0.1017) | 0.1299 |
| Morning stiffness | ||||||
| No | ||||||
| Yes | 1.5953 (0.2319) | < 0.0001 | 1.5777(0.2112) | < 0.0001 | 1.5691(0.2018) | < 0.0001 |
| ESR | 0.3030 (0.1310) | 0.0213 | 0.2988(0.1164) | 0.0106 | 0.2971(0.1103) | 0.0074 |
| Medications | ||||||
| None | ||||||
| NSAIDs | 0.2998 (0.2743) | 0.2751 | 0.2955(0.2529) | 0.2435 | 0.2960(0.2439) | 0.2257 |
| DMARDs | 0.3074 (0.2508) | 0.2211 | 0.3100(0.2295) | 0.1776 | 0.3138(0.2197) | 0.1541 |
| Steroids | 0.9945 (0.4698) | 0.0350 | 0.9946(0.4458) | 0.0263 | 0.9927(0.4355) | 0.0232 |
| Interaction of actively inflamed | 0.0002 (0.0034) | 0.9502 | – 0.0003(0.0033) | 0.9403 | 0.0003(0.0031) | 0.9300 |
| joints with arthritis duration | ||||||
| Interaction of clinical deformed | 0.0032 (0.0016) | 0.0442 | 0.0022(0.0013) | 0.0844 | 0.0018(0.0011) | 0.1102 |
| joints with arthritis duration | ||||||
| 4.2519 (0.8549) | < 0.0001 | 4.3930(0.8924) | < 0.0001 | 4.2641(0.9001) | < 0.0001 | |
| ( | 0.9423(0.0373) | < 0.0001 | ( | |||
SE, standard error.
Parameter estimates in the continuous part of the model for the HAQ data
| Parameters | Misspecified model | Full model | Latent process model | |||
| Estimate (SE) | Estimate (SE) | Estimate (SE) | ||||
| Intercept | 0.3176(0.0567) | < 0.0001 | 0.2149(0.0556) | 0.0001 | 0.1748(0.0555) | 0.0018 |
| Age at onset of PsA | 0.1011(0.0242) | < 0.0001 | 0.1009(0.0245) | < 0.0001 | 0.0984(0.0250) | 0.0001 |
| Sex | ||||||
| Male | ||||||
| Female | 0.1811(0.0505) | 0.0004 | 0.2225(0.0512) | < 0.0001 | 0.2461(0.0523) | < 0.0001 |
| PsA disease duration | 0.0039(0.0033) | 0.2272 | 0.0035(0.0032) | 0.2726 | 0.0044(0.0032) | 0.1719 |
| Actively inflamed joints | 0.0219(0.0028) | < 0.0001 | 0.0239(0.0027) | < 0.0001 | 0.0243(0.0027) | < 0.0001 |
| Clinically deformed joints | 0.0058(0.0031) | 0.0627 | 0.0052(0.0031) | 0.0957 | 0.0051(0.0031) | 0.1034 |
| PASI score | 0.0128(0.0140) | 0.3636 | 0.0247(0.0134) | 0.0667 | 0.0257(0.0134) | 0.0553 |
| Morning stiffness | ||||||
| No | ||||||
| Yes | 0.1502(0.0274) | < 0.0001 | 0.1573(0.0263) | < 0.0001 | 0.1620(0.0262) | < 0.0001 |
| ESR | 0.0395(0.0132) | 0.0028 | 0.0388(0.0127) | 0.0024 | 0.0374(0.0126) | 0.0033 |
| Medications | ||||||
| None | ||||||
| NSAIDs | – 0.0240 (0.0289) | 0.4065 | – 0.0177(0.0281) | 0.5288 | – 0.0181(0.0280) | 0.5194 |
| DMARDs | 0.0224(0.0280) | 0.4252 | 0.0235(0.0272) | 0.3889 | 0.0226(0.0272) | 0.4064 |
| Steroids | 0.0457(0.0453) | 0.3135 | 0.0493(0.0441) | 0.2641 | 0.0481(0.0441) | 0.2761 |
| Interaction of actively inflamed | – 0.0004(0.0002) | 0.0290 | – 0.0004(0.0002) | 0.0072 | – 0.0005(0.0002) | 0.0042 |
| joints with arthritis duration | ||||||
| Interaction of clinical deformed | 0.0002(0.0001) | 0.1122 | 0.0003(0.0001) | 0.0330 | 0.0003(0.0001) | 0.0351 |
| joints with arthritis duration | ||||||
| 0.1587(0.0154) | < 0.0001 | 0.1732(0.0166) | < 0.0001 | — | — | |
| — | — | — | — | 0.2074 (0.0210) | < 0.0001 | |
| 0.0785(0.0040) | < 0.0001 | 0.0774(0.0039) | < 0.0001 | 0.0779(0.0039) | < 0.0001 | |
| ( | 0.9423(0.0373) | < 0.0001 | ( | |||
| – 2 log- likelihood (both parts) | 2116.0 | 2018.1 | 2022.2 | |||
| AIC | 2178.0 | 2082.1 | 2084.2 | |||
SE, standard error.
Fig. 3.Contour plots of profile likelihood (in terms of the deviance) occurrence random-intercept variance σ2 and correlation ρ from 4 simulated data sets (N = 250) with different combinations of true values for σ2 and θ0; other variance components are fixed at their true values σ2 = 0.20 and σe2 = 0.08; the true value for β0 is set as β0 = 0.5; the black dots are maximum likelihood estimates of σ2 at different values of ρ