| Literature DB >> 29259768 |
Tina Lohse1, Sabine Rohrmann1, David Faeh1, Torsten Hothorn1.
Abstract
Body mass indices (BMIs) are applied to monitor weight status and associated health risks in populations. Binary or multinomial logistic regression models are commonly applied in this context, but are only applicable to BMI values categorized within a small set of defined ad hoc BMI categories. This approach precludes comparisons with studies and models based on different categories. In addition, ad hoc categorization of BMI values prevents the estimation and analysis of the underlying continuous BMI distribution and leads to information loss. As an alternative to multinomial regression following ad hoc categorization, we propose a continuous outcome logistic regression model for the estimation of a continuous BMI distribution. Parameters of interest, such as odds ratios for specific categories, can be extracted from this model post hoc in a general way. A continuous BMI logistic regression that describes BMI distributions avoids the necessity of ad hoc and post hoc category choice and simplifies between-study comparisons and pooling of studies for joint analyses. The method was evaluated empirically using data from the Swiss Health Survey.Entities:
Keywords: Distribution regression; conditional distribution; odds ratio; smoking; transformation model
Year: 2017 PMID: 29259768 PMCID: PMC5721934 DOI: 10.12688/f1000research.12934.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Conditional distribution of BMI for WHO Categories.
For baseline characteristics x, the probabilities obtained from model (4) for BMI ≤ 18.5, BMI ≤ 25, and BMI ≤ 30 are given for each combination of smoking and sex of the individual. The model was fitted using the likelihood (Lik) defined by BMI measurements categorized according to the WHO and according to a different categorization with intervals of two BMI units (Int 1). Numeric intervals taking rounding error into account (Int 2) and “exact” BMI values were used to estimate the model parameters. The differences between these four ways of evaluating the likelihood with respect to the estimated probabilities were marginal.
| BMI: | ≤ 18.5 | ≤ 25 | ≤ 30 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sex | Smoking | Lik.: | WHO | Int 1 | Int 2 | Exact | WHO | Int 1 | Int 2 | Exact | WHO | Int 1 | Int 2 | Exact |
| Female | Never | 0.056 | 0.039 | 0.043 | 0.044 | 0.764 | 0.735 | 0.728 | 0.728 | 0.943 | 0.929 | 0.932 | 0.932 | |
| Former | 0.053 | 0.038 | 0.043 | 0.043 | 0.748 | 0.717 | 0.712 | 0.712 | 0.941 | 0.932 | 0.931 | 0.931 | ||
| Light | 0.079 | 0.051 | 0.062 | 0.063 | 0.787 | 0.759 | 0.755 | 0.755 | 0.968 | 0.955 | 0.957 | 0.957 | ||
| Medium | 0.047 | 0.042 | 0.048 | 0.048 | 0.768 | 0.732 | 0.723 | 0.723 | 0.948 | 0.944 | 0.942 | 0.942 | ||
| Heavy | 0.084 | 0.086 | 0.071 | 0.071 | 0.740 | 0.705 | 0.713 | 0.712 | 0.946 | 0.937 | 0.938 | 0.939 | ||
| Male | Never | 0.003 | 0.004 | 0.004 | 0.004 | 0.546 | 0.503 | 0.507 | 0.507 | 0.921 | 0.907 | 0.910 | 0.910 | |
| Former | 0.000 | 0.002 | 0.002 | 0.002 | 0.500 | 0.411 | 0.405 | 0.406 | 0.912 | 0.887 | 0.884 | 0.885 | ||
| Light | 0.000 | 0.002 | 0.003 | 0.003 | 0.545 | 0.497 | 0.497 | 0.497 | 0.932 | 0.918 | 0.926 | 0.925 | ||
| Medium | 0.000 | 0.006 | 0.005 | 0.005 | 0.569 | 0.522 | 0.521 | 0.522 | 0.932 | 0.914 | 0.922 | 0.922 | ||
| Heavy | 0.006 | 0.003 | 0.003 | 0.003 | 0.525 | 0.469 | 0.462 | 0.461 | 0.901 | 0.881 | 0.879 | 0.879 | ||
Estimated proportional odds ratios of covariates.
The odds ratios exp( ) along with 95% confidence intervals for the covariates age (centered at 40 years), education, alcohol intake, fruit and vegetable consumption, physical activity, education, nationality, and region are given for the four ways of evaluating the likelihood of model (4), i.e.,, using BMI measurements categorized according to the WHO and according to a different categorization with intervals of two BMI units (Int 2), numeric intervals taking rounding error into account (Int 2), and “exact” BMI values.
| Likelihood | ||||
|---|---|---|---|---|
| Covariate | WHO | Int 1 | Int 2 | Exact |
| Age (centered at 40 in y) | 0.968 (0.966–0.970) | 0.969 (0.967–0.971) | 0.968 (0.967–0.970) | 0.968 (0.967–0.970) |
| Alcohol intake (g/d) | 1.002 (0.999–1.004) | 1.003 (1.001–1.005) | 1.003 (1.001–1.004) | 1.002 (1.001–1.004) |
| Fruit and vegetables | ||||
| High | 1 | 1 | 1 | 1 |
| Low | 0.880 (0.824–0.940) | 0.928 (0.874–0.986) | 0.929 (0.878–0.983) | 0.929 (0.878–0.983) |
| Physical activity | ||||
| High | 1 | 1 | 1 | 1 |
| Moderate | 0.836 (0.774–0.903) | 0.850 (0.792–0.912) | 0.863 (0.808–0.921) | 0.862 (0.808–0.921) |
| Low | 0.695 (0.640–0.756) | 0.743 (0.688–0.802) | 0.769 (0.716–0.827) | 0.769 (0.716–0.826) |
| Education | ||||
| Mandatory | 1 | 1 | 1 | 1 |
| Secondary | 1.095 (0.992–1.209) | 1.252 (1.141–1.373) | 1.256 (1.150–1.371) | 1.254 (1.149–1.369) |
| Tertiary | 1.604 (1.441–1.786) | 1.760 (1.594–1.944) | 1.785 (1.625–1.961) | 1.781 (1.622–1.956) |
| Nationality | ||||
| Swiss | 1 | 1 | 1 | 1 |
| Foreign | 0.785 (0.728–0.848) | 0.832 (0.776–0.893) | 0.810 (0.758–0.864) | 0.809 (0.758–0.864) |
| Region | ||||
| German speaking | 1 | 1 | 1 | 1 |
| French speaking | 1.175 (1.091–1.266) | 1.147 (1.071–1.228) | 1.134 (1.063–1.208) | 1.133 (1.063–1.208) |
| Italian speaking | 1.190 (1.026–1.382) | 1.173 (1.024–1.344) | 1.236 (1.086–1.405) | 1.234 (1.085–1.403) |
Figure 1. Conditional distribution of BMI.
For each combination of smoking and sex, the conditional distribution function of BMI ℙ(BMI ≤ b | smk, sex, x) corresponding to model (4) was evaluated for baseline covariates x at all possible BMI values b. Red, female BMI distributions; blue, male BMI distributions; solid lines, BMI distributions of active smokers; dashed lines, never smoked; gray vertical lines, WHO categories 18.5, 25, 30. The model was fitted using “exact” BMI values.
Figure 2. Conditional distribution of BMI.
For each combination of smoking and sex, the conditional density of BMI corresponding to model (4) was evaluated for baseline covariates x at all possible BMI values b. Red, female BMI distributions; blue, male BMI distributions; solid lines, BMI distributions of active smokers; dashed lines, never smoked; gray vertical lines, WHO categories 18.5, 25, 30. The model was fitted using “exact” BMI values.
Estimated non-proportional odds ratios for smoking.
Odds ratios comparing all levels of smoking to the level never smoked for the events BMI ≤ 18.5, BMI ≤ 25, and BMI ≤ 30 obtained from model (4) were fitted to “exact” BMI measurements; 95% confidence intervals are given.
| BMI | ||||
|---|---|---|---|---|
| Sex | Smoking | ≤ 18.5 | ≤ 25 | ≤ 30 |
| Female | Never | 1 | 1 | 1 |
| Former | 0.993 (0.794–1.241) | 0.922 (0.825–1.031) | 0.987 (0.823–1.183) | |
| Light | 1.462 (1.135–1.884) | 1.152 (0.977–1.358) | 1.638 (1.187–2.259) | |
| Medium | 1.106 (0.823–1.488) | 0.975 (0.830–1.146) | 1.182 (0.894–1.564) | |
| Heavy | 1.674 (1.188–2.358) | 0.925 (0.756–1.131) | 1.116 (0.798–1.562) | |
| Male | Never | 1 | 1 | 1 |
| Former | 0.457 (0.193–1.081) | 0.664 (0.598–0.737) | 0.757 (0.649–0.883) | |
| Light | 0.727 (0.275–1.922) | 0.960 (0.825–1.117) | 1.226 (0.926–1.622) | |
| Medium | 1.352 (0.631–2.900) | 1.059 (0.917–1.223) | 1.170 (0.911–1.503) | |
| Heavy | 0.852 (0.336–2.161) | 0.832 (0.721–0.961) | 0.716 (0.579–0.885) | |