Tao Xu1, Guangjin Zhu2, Shaomei Han1.
Abstract
OBJECTIVES: The number of depression symptoms can be treated as count data in order to obtain complete and accurate analytical findings in studies of depression. This study aims to compare the goodness of fit of four count outcome models using a large survey sample, to identify the optimal model for a risk factor study of the number of depression symptoms.
Keywords: count data; depression; negative binomial regression; overdispersion; Poisson regression; zero-inflated
Year: 2017 PMID: 29187409 PMCID: PMC5719265 DOI: 10.1136/bmjopen-2017-016471
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Characteristics of respondents
| Characteristics | Category | n | % |
| Sex | Male | 6601 | 42.70 |
| | Female | 8861 | 57.31 |
| Occupation | Mental | 10 133 | 65.53 |
| | Physical | 5329 | 34.47 |
| Nationalities | Han | 10 527 | 68.08 |
| | Yi | 3855 | 24.93 |
| | Others | 1080 | 6.98 |
| Marital status | Married | 5089 | 35.92 |
| | Single | 8825 | 62.29 |
| | Widowed or divorced | 254 | 1.79 |
| Hypertension | Yes | 2453 | 15.86 |
| | No | 13 009 | 84.14 |
| Obesity | Yes | 667 | 4.31 |
| | No | 14 795 | 95.69 |
| Tobacco smoking | Yes | 1953 | 12.63 |
| | No | 13 509 | 87.27 |
| Alcohol drinking | Yes | 1706 | 11.03 |
| | No | 13 756 | 88.97 |
| Stress | Yes | 1361 | 8.80 |
| | No | 14 102 | 91.20 |
| Positive events | Yes | 644 | 4.17 |
| | No | 14 818 | 95.83 |
| Negative events | Yes | 3274 | 21.17 |
| | No | 12 188 | 78.83 |
Observed proportions and predicted probabilities of each count (%)
| Count | Observed | Poisson | NB | ZIP | ZINB |
| 0 | 39.28 | 28.10 | 36.89 | 39.22 | 39.04 |
| 1 | 23.74 | 33.19 | 28.02 | 20.63 | 22.67 |
| 2 | 15.23 | 21.61 | 16.40 | 18.67 | 17.64 |
| 3 | 10.38 | 10.42 | 8.83 | 11.75 | 10.58 |
| 4 | 6.33 | 4.24 | 4.62 | 5.84 | 5.46 |
| 5 | 3.21 | 1.58 | 2.41 | 2.47 | 2.58 |
| 6 | 1.40 | 0.56 | 1.27 | 0.94 | 1.15 |
| 7 | 0.43 | 0.20 | 0.68 | 0.33 | 0.50 |
NB, negative binomial; ZINB, zero-inflated negative binomial; ZIP, zero-inflated Poisson.
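As a rough illustration of how the table above can be read, the following Python sketch computes the mean, variance and variance-to-mean ratio of the observed distribution (a ratio well above 1 suggests overdispersion relative to a Poisson model) and the total absolute deviation between observed and predicted percentages for each model. The percentages are copied from the table; the function names and the choice of total absolute deviation as a summary are illustrative assumptions, not part of the original analysis, and the sketch assumes the table lists every observed count.

```python
# Percentages copied from the table above, for counts 0 through 7.
observed = [39.28, 23.74, 15.23, 10.38, 6.33, 3.21, 1.40, 0.43]
predicted = {
    "Poisson": [28.10, 33.19, 21.61, 10.42, 4.24, 1.58, 0.56, 0.20],
    "NB": [36.89, 28.02, 16.40, 8.83, 4.62, 2.41, 1.27, 0.68],
    "ZIP": [39.22, 20.63, 18.67, 11.75, 5.84, 2.47, 0.94, 0.33],
    "ZINB": [39.04, 22.67, 17.64, 10.58, 5.46, 2.58, 1.15, 0.50],
}


def moments(percentages):
    """Return (mean, variance) of a count distribution given as
    percentages for counts 0, 1, 2, ... (hypothetical helper)."""
    total = sum(percentages)
    probs = [p / total for p in percentages]
    mean = sum(k * p for k, p in enumerate(probs))
    second_moment = sum(k * k * p for k, p in enumerate(probs))
    return mean, second_moment - mean ** 2


def total_abs_deviation(obs, pred):
    """Sum of absolute differences, in percentage points (hypothetical helper)."""
    return sum(abs(o - p) for o, p in zip(obs, pred))


mean, variance = moments(observed)
dispersion_ratio = variance / mean  # equals 1 under a Poisson distribution

deviations = {
    name: total_abs_deviation(observed, probs)
    for name, probs in predicted.items()
}
best_model = min(deviations, key=deviations.get)

print(f"mean={mean:.4f}, variance={variance:.4f}, ratio={dispersion_ratio:.2f}")
for name, dev in deviations.items():
    print(f"{name}: total absolute deviation = {dev:.2f} percentage points")
print("closest to observed:", best_model)
```

This is a descriptive check only; it ignores covariates and does not replace the likelihood-based comparison reported in the paper.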
Goodness-of-fit statistics of regression models
| Models | Log likelihood | AIC | BIC |
| Poisson | −25012 | 50 053 | 50 168 |
| NB | −24128 | 48 289 | 48 411 |
| ZIP | −23912 | 47 884 | 48 113 |
| ZINB | −23854 | 47 771 | 48 008 |
AIC, Akaike’s information criterion; BIC, Bayesian information criterion; NB, negative binomial; ZINB, zero-inflated negative binomial; ZIP, zero-inflated Poisson.
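The fit statistics above can be compared with a few lines of arithmetic. The sketch below, using only values copied from the table, computes each model's AIC and BIC difference from the best-fitting model and the likelihood-ratio statistic 2 × (log likelihood of the larger model minus log likelihood of the smaller model) for the two nested pairs (Poisson within NB, ZIP within ZINB). The variable and function names are hypothetical, and the source does not report these likelihood-ratio statistics; because the extra dispersion parameter lies on the boundary of its range under the smaller model, the usual chi-square reference distribution is only approximate here.

```python
# Log likelihood, AIC and BIC copied from the table above.
fit = {
    "Poisson": {"loglik": -25012, "aic": 50053, "bic": 50168},
    "NB": {"loglik": -24128, "aic": 48289, "bic": 48411},
    "ZIP": {"loglik": -23912, "aic": 47884, "bic": 48113},
    "ZINB": {"loglik": -23854, "aic": 47771, "bic": 48008},
}


def deltas(stats, key):
    """Difference of each model's criterion from the smallest value."""
    best = min(model[key] for model in stats.values())
    return {name: model[key] - best for name, model in stats.items()}


def lr_statistic(loglik_restricted, loglik_full):
    """Likelihood-ratio statistic for a nested pair of models."""
    return 2 * (loglik_full - loglik_restricted)


delta_aic = deltas(fit, "aic")
delta_bic = deltas(fit, "bic")

# Nested comparisons: the NB model reduces to Poisson, and ZINB to ZIP,
# when the dispersion parameter goes to zero.
lr_nb_vs_poisson = lr_statistic(fit["Poisson"]["loglik"], fit["NB"]["loglik"])
lr_zinb_vs_zip = lr_statistic(fit["ZIP"]["loglik"], fit["ZINB"]["loglik"])

for name in fit:
    print(f"{name}: delta AIC = {delta_aic[name]}, delta BIC = {delta_bic[name]}")
print("LR statistic, NB vs Poisson:", lr_nb_vs_poisson)
print("LR statistic, ZINB vs ZIP:", lr_zinb_vs_zip)
```

Lower AIC and BIC indicate a better trade-off between fit and complexity, so a difference of zero marks the preferred model on that criterion.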
Figure 1. Predicted probabilities of the four models. ZINB, zero-inflated negative binomial model; ZIP, zero-inflated Poisson model.
ZINB regression coefficients for the number of depression symptoms
| | Logit section | | | | Negative binomial section | | | |
| | β | Z | P | 95% CI for β | β | Z | P | 95% CI for β |
| Age | −0.003 | −0.850 | 0.396 | −0.010 to 0.004 | −0.004 | −4.100 | <0.0001 | −0.007 to −0.002 |
| Sex (female) | −0.256 | −3.150 | 0.002 | −0.415 to −0.097 | 0.131 | 6.220 | <0.0001 | 0.090 to 0.173 |
| Hypertension | −0.085 | −0.810 | 0.419 | −0.292 to 0.121 | −0.002 | −0.070 | 0.944 | −0.068 to 0.063 |
| Mental labourers | −0.804 | −6.530 | <0.001 | −1.045 to −0.563 | 0.094 | 2.830 | 0.005 | 0.029 to 0.159 |
| Smoker | 0.158 | 1.380 | 0.169 | −0.067 to 0.384 | 0.017 | 0.450 | 0.652 | −0.056 to 0.090 |
| Alcohol drinker | −0.377 | −3.060 | 0.002 | −0.619 to −0.136 | 0.087 | 2.510 | 0.012 | 0.019 to 0.155 |
| Yi nationality | 0.698 | 9.010 | <0.001 | 0.547 to 0.850 | 0.016 | 0.680 | 0.497 | −0.031 to 0.063 |
| Other race | 0.007 | 0.040 | 0.969 | −0.329 to 0.342 | 0.012 | 0.330 | 0.744 | −0.059 to 0.082 |
| Widowed or divorced | −0.642 | −1.820 | 0.068 | −1.332 to 0.048 | 0.017 | 0.210 | 0.833 | −0.144 to 0.179 |
| Single | −0.445 | −4.370 | <0.001 | −0.644 to −0.245 | −0.005 | −0.190 | 0.852 | −0.057 to 0.047 |
| Obesity | −0.150 | −0.880 | 0.379 | −0.483 to 0.184 | 0.078 | 1.620 | 0.106 | −0.017 to 0.174 |
| Stress | −0.997 | −5.820 | <0.001 | −1.333 to −0.661 | 0.472 | 18.620 | <0.001 | 0.422 to 0.522 |
| Positive events | −2.179 | −2.010 | 0.045 | −4.306 to −0.053 | −0.040 | −0.910 | 0.364 | −0.127 to 0.047 |
| Negative events | −0.449 | −4.460 | <0.001 | −0.646 to −0.252 | 0.250 | 11.990 | <0.001 | 0.209 to 0.290 |
| Intercept | −0.052 | −0.250 | 0.806 | −0.462 to 0.359 | 0.260 | 4.270 | <0.001 | 0.141 to 0.380 |
ZINB, zero-inflated negative binomial.
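Coefficients on the log and logit scales are often easier to read after exponentiation. The sketch below does this for a few rows copied from the table above. It assumes the usual ZINB parameterization, in which the logit section models the odds of belonging to the excess-zero group and the negative binomial section models the log of the expected count; the source table does not state this explicitly, and the dictionary and function names are hypothetical.

```python
import math

# (beta, CI lower, CI upper) copied from the table above for selected rows.
logit_section = {
    "Stress": (-0.997, -1.333, -0.661),
    "Negative events": (-0.449, -0.646, -0.252),
    "Yi nationality": (0.698, 0.547, 0.850),
}
count_section = {
    "Stress": (0.472, 0.422, 0.522),
    "Negative events": (0.250, 0.209, 0.290),
    "Sex (female)": (0.131, 0.090, 0.173),
}


def exponentiate(row):
    """Map (beta, lower, upper) to the multiplicative scale."""
    return tuple(math.exp(value) for value in row)


# Assumed interpretation: odds ratios for being an excess zero.
odds_ratios = {name: exponentiate(row) for name, row in logit_section.items()}
# Assumed interpretation: ratios of expected symptom counts.
rate_ratios = {name: exponentiate(row) for name, row in count_section.items()}

for name, (est, low, high) in odds_ratios.items():
    print(f"excess-zero OR, {name}: {est:.3f} ({low:.3f} to {high:.3f})")
for name, (est, low, high) in rate_ratios.items():
    print(f"count ratio, {name}: {est:.3f} ({low:.3f} to {high:.3f})")
```

Under that assumption, a negative logit coefficient paired with a positive count coefficient, as for stress, would point in the same direction: lower odds of reporting no symptoms at all and a higher expected number of symptoms.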
Logistic regression coefficients for depression
| | β | Wald | P value | OR | 95% CI for OR |
| Age | −0.005 | 9.501 | 0.002 | 0.995 | 0.992 to 0.998 |
| Sex (female) | 0.286 | 75.085 | <0.001 | 1.331 | 1.248 to 1.420 |
| Hypertension | 0.027 | 0.320 | 0.572 | 1.027 | 0.937 to 1.126 |
| Mental labourers | 0.476 | 87.311 | <0.001 | 1.610 | 1.457 to 1.779 |
| Smoker | −0.066 | 1.381 | 0.240 | 0.936 | 0.839 to 1.045 |
| Alcohol drinker | 0.293 | 27.855 | <0.001 | 1.340 | 1.202 to 1.494 |
| Yi nationality | −0.291 | 64.958 | <0.001 | 0.748 | 0.697 to 0.803 |
| Other race | 0.000 | 0.000 | 0.997 | 1.000 | 0.893 to 1.120 |
| Widowed or divorced | 0.310 | 6.627 | 0.010 | 1.363 | 1.077 to 1.726 |
| Single | 0.184 | 16.295 | <0.001 | 1.202 | 1.099 to 1.314 |
| Obesity | 0.191 | 6.522 | 0.011 | 1.211 | 1.045 to 1.402 |
| Stress | 1.246 | 573.265 | <0.001 | 3.476 | 3.139 to 3.849 |
| Positive events | 0.314 | 18.732 | <0.001 | 1.369 | 1.187 to 1.578 |
| Negative events | 0.575 | 251.969 | <0.001 | 1.777 | 1.656 to 1.908 |
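The columns of the logistic regression table are linked by simple formulas, which the following sketch checks for three rows. It assumes the Wald column is the Wald chi-square statistic, (β / SE)², so that SE = |β| / √Wald, and that the interval is the usual exp(β ± 1.96 × SE); the source does not state either point, and the function name is hypothetical. Because β is rounded to three decimals, the recomputed values can differ from the published ones in the last digit.

```python
import math

# (beta, Wald, published OR, published CI lower, published CI upper),
# copied from the table above.
rows = {
    "Sex (female)": (0.286, 75.085, 1.331, 1.248, 1.420),
    "Stress": (1.246, 573.265, 3.476, 3.139, 3.849),
    "Negative events": (0.575, 251.969, 1.777, 1.656, 1.908),
}

Z_95 = 1.96  # normal quantile for a two-sided 95% interval


def odds_ratio_with_ci(beta, wald):
    """Recover SE from the Wald chi-square (assumed to be (beta/SE)^2)
    and return (OR, CI lower, CI upper)."""
    se = abs(beta) / math.sqrt(wald)
    return (
        math.exp(beta),
        math.exp(beta - Z_95 * se),
        math.exp(beta + Z_95 * se),
    )


recomputed = {
    name: odds_ratio_with_ci(beta, wald)
    for name, (beta, wald, *_published) in rows.items()
}

for name, (odds, low, high) in recomputed.items():
    published = rows[name][2:]
    print(f"{name}: recomputed {odds:.3f} ({low:.3f} to {high:.3f}), "
          f"published {published[0]:.3f} ({published[1]:.3f} to {published[2]:.3f})")
```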