| Literature DB >> 29958864 |
Katherine Schaumberg1, Erin E Reilly2, Lisa M Anderson3, Sasha Gorrell4, Shirley B Wang5, Margarita Sala6.
Abstract
OBJECTIVE: Outcome variables gauging the frequency of specific disordered eating behaviors (e.g., binge eating, vomiting) are common in the study of eating and health behaviors. The nature of such data presents several analytical challenges, which may be best addressed through the application of underutilized statistical approaches. While zero-sensitive models are well-supported by methodologists, application of these models has yet to gain traction among a widespread audience of researchers who study eating-related behaviors. The current study examined several approaches to predicting count-based behaviors, including zero-sensitive (i.e., zero-inflated and hurdle) regression models.Entities:
Keywords: Binge eating; Compensatory behaviors; Count data; Eating disorders; Regression; Zero-sensitive
Mesh:
Year: 2018 PMID: 29958864 PMCID: PMC6778476 DOI: 10.1016/j.appet.2018.06.030
Source DB: PubMed Journal: Appetite ISSN: 0195-6663 Impact factor: 3.868
Comparison of Regression-based Models
| Assumptions | Benefits | Drawbacks | |
|---|---|---|---|
| • Outcome variable normally distributed | • Familiar to most researchers | • Normality and homoscedasticity assumptions are rarely met | |
| • Outcome variable normally distributed | • Familiar to most researchers | • Transformation does not restore normality and homoscedasticity in all cases | |
| • Dichotomous outcomes | • Only predicts possible probabilities | • Only appropriate for dichotomous outcomes (or those recoded to be dichotomous) | |
| • Outcome assumed to be distributed as a Poisson random variable | • Can be used in highly skewed distributions | • Selecting a Poisson model when the data are over-dispersed can result in Type I errors | |
| • Allows for independent specification of the mean and variance | • Can be used in highly skewed distributions | • May not be appropriate for a large number of zeros | |
| • Assumes a logistic regression model for the zero vs. non-zero portion of the outcome | • May be most successful in evaluating outcomes when there is a preponderance of zeros | • Requires more power | |
| • All zeros are structural zeros (i.e., true zeros) | • Appropriate when the zero portion of the model and the count portion of the model are considered to arise from discrete processes | • Requires more power |
Descriptive Statistics of Binge Eating and Compensatory Behaviors
| Variable | Mean | Variance | % Zeros | % Subthreshold | % Threshold | Mean of Non-zero Distribution (N) | Variance of Non-zero distribution |
|---|---|---|---|---|---|---|---|
| Binge Eating | 1.50 | 14.07 | 67.6% | 18.1% | 14.3% | 4.26 (170) | 28.85 |
| Vomiting | .51 | 7.76 | 92.2% | 4.0% | 3.8% | 6.48 (41) | 62.05 |
| Laxative Use | .39 | 2.07 | 94.1% | 2.3% | 3.6% | 6.58 (31) | 32.78 |
| All Purging | .89 | 17.81 | 90.3% | 3.1% | 6.7% | 9.21 (51) | 108.77 |
| Exercise | 3.77 | 46.76 | 59.3% | 14.1% | 26.7% | 9.26 (214) | 64.09 |
| All Compensatory | 4.66 | 75.76 | 56.8% | 13.5% | 29.8% | 10.80 (227) | 109.31 |
Note: Reported count variables presented based on responses to the Eating Disorder Examination Questionnaire (EDE-Q). Subthreshold defined as frequency of 1–3 times over the past 28 days, threshold defined as 4 or more times over the past 28 days. ‘All Purging’ = composite mean of Vomiting and/or Laxative Use frequency over the past 28 days. ‘All Compensatory’= composite mean of Vomiting, Laxative Use, and/or Exercise frequency over the past 28 days.
Regression Coefficients and Model parameters
| Model Fit | Stress | Restraint | Body Dissat | Gender | Wt Suppression | ||||
|---|---|---|---|---|---|---|---|---|---|
| Binge Eating – OLS | 3.48 | 0.93 | 1.76 | 0.47 | 0.29 (0.13) | 0.45(0.17) | 0.54 (0.22) | 0.62 (0.33) | 0.01 (0.01) |
| Binge Eating – OLS Inv | 0.30 | 0.87 | 0.24 | 0.71 | 0.05 (0.01) | 0.06 (0.01) | 0.06 (0.02) | −0.02 (0.03) | 0.0003 (0.001) |
| Binge Eating – LR | 1.03 | -- | 0.94 | -- | 0.30 (0.08) | 0.32 (0.11) | 0.36 (0.14) | −0.04 (0.22) | 0.003 (0.009) |
| Binge Eating – PR | 1.94 | -- | 1.51 | -- | 0.17 (0.03) | 0.19 (0.03) | 0.32 (0.05) | 0.41 (0.08) | 0.004 (0.002) |
| Binge Eating – ZIP | 1.39 | -- | 0.77 | -- | |||||
| Binomial | 0.30 (0.09) | 0.32 (0.10) | 0.34 (0.14) | −0.11 (0.23) | 0.003 (0.008) | ||||
| Count | 0.04 (0.03) | 0.06 (0.03) | 0.16 (0.05) | 0.51 (0.09) | 0.004 (0.002) | ||||
| Binge Eating – NB | 0.84 | -- | 0.75 | -- | 0.32 (0.08) | 0.32 (0.10) | 0.32 (0.14) | 0.60 (0.22) | 0.008 (0.008) |
| Binge Eating - ZINB | 1.12 | -- | 0.56 | -- | |||||
| Binomial | 0.70 (0.27) | 1.27 (0.55) | 0.72 (0.55) | −0.47 (0.58) | 0.08 (0.04) | ||||
| Count | 0.09 (0.08) | 0.13 (0.10) | 0.26 (0.14) | 0.76 (0.26) | −0.001 (0.007) | ||||
| Binge Eating – NB hurdle | 1.01 | -- | 0.56 | -- | |||||
| Binomial | 0.31 (0.08) | 0.32 (0.10) | 0.36 (0.14) | −0.04 (0.22) | 0.003 (0.001) | ||||
| Count | 0.07 (0.09) | 0.12 (0.10) | 0.22 (0.15) | 0.87 (0.27) | 0.006 (0.008) | ||||
| Exercise – OLS | 6.02 | 0.87 | 4.07 | 0.59 | −0.15 (0.48) | 2.25 (0.28) | 0.05 (0.38) | 1.67 (0.58) | 0.005 (0.24) |
| Exercise – OLS Inv | 0.36 | 0.87 | 0.38 | 0.75 | 0.02 (0.03) | 0.26 (0.04) | 0.12 (0.06) | 0.24 (0.08) | 0.001(0.003) |
| Exercise – LR | 1.06 | -- | 0.98 | -- | 0.09 (0.08) | 0.58 (0.11) | 0.31 (0.14) | 0.55 (0.22) | 0.003 (0.008) |
| Exercise – PR | 2.70 | -- | 2.31 | -- | −0.03 (0.02) | 0.41(0.02) | 0.05 (0.03) | 0.47 (0.05) | 0.001 (0.002) |
| Exercise – ZIP | 1.38 | -- | 0.96 | -- | |||||
| Binomial | 0.09 (0.08) | 0.57 (0.11) | 0.32 (0.14) | 0.54 (0.22) | 0.003 (0.009) | ||||
| Count | −0.06 (0.02) | 0.22 (0.02) | −0.08 (0.03) | 0.25 (0.05) | 0.003 (0.001) | ||||
| Exercise – NB | 0.90 | -- | 0.81 | -- | 0.006 (0.07) | 0.45 (0.10) | 0.20 (0.13) | 0.71 (0.21) | 0.006 (0.008) |
| Exercise – ZINB | 0.89 | -- | 0.61 | -- | |||||
| Binomial | 0.12 (0.11) | 0.85 (0.25) | 0.62 (0.33) | 0.70 (0.31) | 0.003 (0.01) | ||||
| Count | −0.07 (0.06) | 0.25 (0.08) | −0.10 (0.11) | 0.27 (0.18) | 0.002 (0.006) | ||||
| Exercise – NB hurdle | 0.93 | -- | 0.64 | -- | |||||
| Binomial | 0.08 (0.08) | 0.57 (0.11) | 0.31 (0.14) | 0.54 (0.22) | −0.003 (0.008) | ||||
| Count | −0.08 (0.05) | 0.24 (0.07) | −0.06 (0.09) | 0.29 (0.16) | 0.003 (0.005) | ||||
| Purging - OLS | 4.04 | 0.95 | 1.58 | 0.37 | 0.27 (0.14) | 0.56 (0.19) | 0.06 (0.25) | 0.56 (0.39) | 0.04 (0.02) |
| Purging – OLS Inv | 0.23 | 0.95 | 0.12 | 0.51 | 0.03 (0.01) | 0.03 (0.01) | 0.002 (0.01) | 0.01 (0.02) | 0.002 (0.001) |
| Purging - LR | 0.74 | -- | 0.54 | -- | 0.40 (0.12) | 0.37 (0.15) | 0.03 (0.21) | 0.02 (0.36) | 0.02 (0.01) |
| Purging - PR | 1.97 | -- | 1.35 | -- | 0.23 (0.03) | 0.42 (0.04) | 0.09 (0.06) | 0.58 (0.11) | 0.02 (0.002) |
| Purging - ZIP | 1.60 | -- | 0.51 | -- | |||||
| Binomial | 0.39 (0.12) | 0.37 (0.14) | 0.03 (0.21) | −0.03 (0.36) | 0.02 (0.01) | ||||
| Count | 0.002 (0.03) | 0.13 (0.05) | 0.04 (0.08) | 0.59 (0.10) | 0.009 (0.003) | ||||
| Purging – NB | 0.49 | -- | 0.45 | -- | 0.43 (0.19) | 0.29 (0.24) | 0.22 (0.32) | 0.47 (0.51) | 0.03 (0.02) |
| Purging - ZINB | 1.08 | -- | 0.35 | -- | |||||
| Binomial | 0.44 (0.14) | 0.37 (0.17) | 0.02 (0.24) | −0.12 (0.39) | 0.02 (0.02) | ||||
| Count | −0.04 (0.12) | 0.11 (0.24) | 0.08 (0.38) | 0.63 (0.44) | 0.003 (0.02) | ||||
| Purging – NB hurdle | 1.09 | -- | 0.35 | -- | |||||
| Binomial | 0.39 (0.12) | 0.37 (0.15) | 0.03 (0.21) | −0.02 (0.36) | 0.02 (0.01) | ||||
| Count | −0.04 (0.12) | 0.10 (0.21) | 0.10 (0.33) | 0.63 (0.42) | 0.01 (0.01) | ||||
Note. Purging includes laxative use and vomiting. Exercise refers to endorsement of driven exercise. Outcome variables measured by the Eating Disorder Examination – Questionnaire (EDE-Q). Stress = Stress related to life events measured by the Daily Stress Inventory. Body Dissatisfaction measured by the EDE-Q. Weight Suppression calculated as the difference between an individual’s highest adult weight and self-reported current weight. OLS = ordinary least squares regression. OLS-Inverse = Inverse transformation applied to dependent variable. Inverse coefficients were reversed in sign to aide interpretation. LR = logistic regression. PR = Poisson regression
Significant at the 0.05 level
Significant at the 0.01 level
Significant at the 0.001 level.
Regression coefficients are unstandardized and reflect the influence of a 1-unit change in the predictor on the predicted level of the outcome variable when all other factors are set to their mean values. For example, a 1-unit increase in stress would predict a .29 increase in number of binge eating episodes in the OLS model, and an increase of .05 inverse units in the inverse transformed model. Coefficients are interpreted in the units relevant to outcomes. Thus, while they are interpretable within models as a measure of effect size, they are not comparable across models.
Model Fit Comparisons
| Poisson v. ZIP | Negative Binomial v. ZINB | |||||
|---|---|---|---|---|---|---|
| Vuong Z | AIC Corrected | BIC Corrected | Vuong Z | AIC Corrected | BIC Corrected | |
| Binge Eating | 6.03 | 5.93 | 5.74 | 4.22 | 3.33 | 1.43 (p = .07) |
| Exercise | 10.84 | 10.77 | 10.64 | 5.96 | 5.34 | 4.05 |
| Purging | 4.84 | 4.79 | 4.71 | 3.47 | 2.40 | 0.15 |
Note. Purging includes laxative use and vomiting. Exercise refers to endorsement of driven exercise. Outcome variables measured by the Eating Disorder Examination – Questionnaire (EDE-Q). Stress = Stress related to life events measured by the Daily Stress Inventory. Body Dissatisfaction measured by the EDE-Q. Weight Suppression calculated as the difference between an individual’s highest adult weight and self-reported current weight. NB = negative binomial regression. ZINB Count = count portion of the zero-inflated negative binomial model. ZINB Binomial = Binomial portion of the zero-inflated negative binomial model. Vuong Z = model comparison statistic.
Significant at the 0.05 level
Significant at the 0.01 level
Significant at the 0.001 level.
Clinical Interpretation - Predicted Levels of Behavior at Varying Levels of Risk Across Models
| Mean Levels of Predictors | Stress +1SD | Restraint +1SD | Body Dissat +1SD | Wt Suppress +1SD | Female | All +1SD +Female | |
|---|---|---|---|---|---|---|---|
| Raw Data | 1.50 | 1.59 | |||||
| | 67.6% | 61.8% | |||||
| | 4.62 | 5.54 | |||||
| OLS | 1.72 | 2.10 | 2.06 | 2.48 | 1.88 | 1.43 | 3.06 |
| OLS.Inv | 2.70 | 2.93 | 2.87 | 3.07 | 2.68 | 2.66 | 3.51 |
| LR | 66.57% | 57.50% | 60.99% | 54.31% | 65.67% | 66.13% | 37.40% |
| PR | 1.23 | 1.53 | 1.43 | 1.93 | 1.30 | 1.02 | 2.40 |
| NB | 1.16 | 1.74 | 1.47 | 1.83 | 1.26 | 0.88 | 2.86 |
| ZIP | 1.34 | 1.78 | 1.63 | 2.23 | 1.44 | 1.09 | 2.80 |
| | 65.51% | 56.36% | 59.90% | 53.84% | 64.66% | 64.3% | 36.3% |
| | 3.88 | 4.07 | 4.06 | 4.83 | 4.08 | 3.06 | 4.40 |
| ZINB | 1.77 | 2.12 | 2.08 | 2.76 | 1.86 | 1.38 | 2.45 |
| | 10.35% | 4.49% | 4.20% | 4.02% | 4.41% | 8.51% | 0.21% |
| | 1.97 | 2.12 | 2.17 | 2.87 | 1.94 | 1.27 | 2.46 |
| Hurdle NB | 1.28 | 1.73 | 1.60 | 2.19 | 1.38 | 1.00 | 2.68 |
| | 43.99% | 30.56% | 36.41% | 29.79% | 43.68% | 35.01% | 0.00% |
| | 2.29 | 2.49 | 2.51 | 3.13 | 2.46 | 1.53 | 2.68 |
| Raw Data | 3.78 | 3.65 | |||||
| | 59.3% | 57.9% | |||||
| | 9.26 | 7.83 | |||||
| OLS | 4.88 | 4.69 | 6.57 | 4.95 | 4.95 | 4.11 | 5.73 |
| OLS.Inv | 3.24 | 3.31 | 3.72 | 3.67 | 3.21 | 3.02 | 3.96 |
| LR | 52.22% | 49.42% | 41.46% | 41.34% | 53.01% | 58.39% | 35.11% |
| PR | 3.62 | 3.50 | 4.92 | 3.86 | 3.67 | 2.91 | 4.12 |
| NB | 3.41 | 3.44 | 4.81 | 4.53 | 3.64 | 2.46 | 4.95 |
| ZIP | 4.16 | 4.08 | 6.04 | 4.61 | 4.23 | 3.24, | 5.14, |
| | 52.2% | 49.37% | 42.49% | 41.22% | 53.00% | 58.36%, | 35.02%, |
| | 8.72 | 8.05 | 10.32 | 7.85 | 9.02 | 7.78 | 7.91 |
| ZINB | 4.78 | 4.61 | 6.88 | 5.20 | 4.86 | 3.72 | 5.25 |
| | 34.27% | 30.78% | 21.56% | 17.99% | 34.98% | 41.86% | 12.33% |
| | 7.28 | 6.66 | 8.76 | 6.34 | 7.47 | 6.41 | 6.30 |
| Hurdle NB | 4.11 | 3.99 | 5.93 | 4.67 | 4.20 | 3.18 | 5.12 |
| | 47.02% | 43.3% | 36.22% | 34.32% | 48.13% | 53.17% | 27.26% |
| | 7.77 | 7.03 | 9.30 | 7.11 | 8.10 | 6.79 | 7.04 |
| Raw Data | 0.89, | 0.84, | |||||
| | 90.3%, 9.22 | 88.8%, | |||||
| | 8.60 | ||||||
| OLS | 1.72 | 1.52 | 1.59 | 1.26 | 1.65 | 0.91 | 2.26 |
| OLS.Inv | 2.21 | 2.29 | 2.27 | 2.21 | 2.27 | 2.19 | 2.43 |
| LR | 91.72% | 86.94% | 89.34% | 91.37% | 89.74% | 91.64% | 78.97% |
| PR | 0.66 | 0.88 | 0.91 | 0.75 | 0.79 | 0.50 | 1.29 |
| NB | 0.57 | 0.99 | 0.71 | 0.78 | 0.78 | 0.46 | 1.83 |
| ZIP | 0.62 | 0.98 | 0.88 | 0.67 | 0.86 | 0.47 | 1.55 |
| | 91.70% | 86.90% | 89.32% | 91.36% | 89.71% | 91.59% | 78.91% |
| | 7.47 | 7.48 | 8.20 | 7.87 | 8.31 | 5.69 | 7.36 |
| ZINB | 0.65 | 1.02 | 0.91 | 0.76 | 0.87 | 0.51 | 1.61 |
| | 89.37% | 82.71% | 86.44% | 89.06% | 86.40% | 88.81% | 71.74% |
| | 6.15 | 5.87 | 6.71 | 6.96 | 6.40 | 4.60 | 5.64 |
| Hurdle NB | 0.64 | 0.97 | 0.88 | 0.76 | 0.84 | 0.51 | 1.56 |
| | 89.74% | 83.68% | 86.94% | 89.54% | 87.41% | 89.10 | 73.71% |
| | 6.28 | 5.96 | 6.77 | 7.25 | 6.73 | 4.69 | 1.56 |
Note. Purging includes laxative use and vomiting. Exercise refers to endorsement of driven exercise. Outcome variables measured by the Eating Disorder Examination – Questionnaire (EDE-Q). Stress = Stress related to life events measured by the Daily Stress Inventory. Body dissat = Body Dissatisfaction measured by the EDE-Q. Wt Suppress = Weight Suppression calculated as the difference between an individual’s highest adult weight and self-reported current weight. OLS = ordinary least squares regression. OLS-Inverse = ordinary least squares regression with inverse transformation applied to dependent variable. LR = logistic regression. PR = Poisson regression. NB = negative binomial. Numbers reflect predicted scores, percentage (or percentage likelihood) of zero scores, and predicted means of the non-zero distribution. Zero scores in zero-inflated models represent structural zeros, with the assumption that some zero scores may be accounted for in the count portion of the model as sampling zeroes. Predicted models were used to estimate means, percentage zero, and mean of non-distributions for appropriate values. In models indicating +1SD, a value of a predictor was chosen at one standard deviation above the mean value for that predictor, with all other predictors set to their mean level.