| Literature DB >> 26017695 |
Ning Ding1, Keith Dear2, Shuyu Guo3, Fan Xiang3, Robyn Lucas4.
Abstract
The debate on the causal association between vitamin D status, measured as serum concentration of 25-hydroxyvitamin D (25[OH]D), and various health outcomes warrants investigation in large-scale health surveys. Measuring the 25(OH)D concentration for each participant is not always feasible, because of the logistics of blood collection and the costs of vitamin D testing. To address this problem, past research has used predicted 25(OH)D concentration, based on multivariable linear regression, as a proxy for unmeasured vitamin D status. We restate this approach in a mathematical framework, to deduce its possible pitfalls. Monte Carlo simulation and real data from the National Health and Nutrition Examination Survey 2005-06 are used to confirm the deductions. The results indicate that variables that are used in the prediction model (for 25[OH]D concentration) but not in the model for the health outcome (called instrumental variables), play an essential role in the identification of an effect. Such variables should be unrelated to the health outcome other than through vitamin D; otherwise the estimate of interest will be biased. The approach of predicted 25(OH)D concentration derived from multivariable linear regression may be valid. However, careful verification that the instrumental variables are unrelated to the health outcome is required.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26017695 PMCID: PMC4445919 DOI: 10.1371/journal.pone.0125551
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of papers employing predicted 25(OH)D score to examine associations between vitamin D status and health outcomes.
| Reference | Health outcome | Statistical model in | Instrumental variables(s) |
|---|---|---|---|
| Giovannucci et al, 2006 [ | Cancer incidence and mortality | Cox | Geographical residence, Dietary vitamin D intake, vitamin D supplements, Race |
| Ng et al, 2009 [ | Colorectal cancer | Cox | Geographical region |
| Liu et al, 2010 [ | Type 2 diabetes | Cox | Month of blood sampling, total vitamin D intake, physical activity score, smoking status, total energy intake, BMI |
| Jimenez et al, 2012 [ | Tooth loss and periodontitis | Cox | UVB radiation flux at residence, dietary and supplemental intake of vitamin D |
| Gilbert et al, 2012 [ | Risk factors for prostate cancer (PSA level, BMI, Family history of prostate cancer) | Linear and Logistic | Sun exposure, dietary intake, Anthropometric, clinical and demographic factors |
| Liu et al, 2013[ | Endometrial cancer | Cox | Vitamin D intake from food, vitamin D intake from supplements, UVB flux based on state of residence, physical activity, alcohol intake |
| Ananthakrishnan et al, 2012 [ | Crohn's disease | Cox | Dietary and supplemental vitamin D intake, exposure to sunlight, race, regional ultraviolet-B radiation intensity |
| Harris et al, 2013 [ | Endometriosis | Cox | Race, geographical region, season of blood draw, dietary vitamin D intake |
| Joh et al, 2013 [ | Renal cancer | Cox | UVB radiation flux at residence, dietary and supplement intake of vitamin D, postmenopausal hormone use |
a Adjusted for waist circumference in Stage II;
b Since backwards stepwise regression was employed, the instrumental variables used varied across regressions.
Fig 1The effect of omitting z from the health outcome equation.
In theory, the total effect of z on health outcome, H, is δ + θd. In practice, using z as an instrumental variable causes bias in the estimated effect of the 25(OH)D score, D, on the health outcome, because the direct effect of z on the health outcome is incorrectly captured as being mediated by D.
Stage-II estimates of the coefficient for each of the variables (x, y, D) included in the Cox proportional hazard models for different specifications, Monte Carlo simulations, sample size 5000, d 1 = 0.2 and δ 1 = 0.4.
| Variable | Pre-set value | Estimated value | 95% CI |
|
|---|---|---|---|---|
| Specification I | ||||
|
| 0.10 | -0.68 | (-0.74, -0.61) | <0.001 |
|
| 0.30 | 0.29 | (0.26, 0.32) | <0.001 |
|
| 0.50 | 2.42 | (2.27, 2.57) | <0.001 |
| Specification II | ||||
|
| 0.10 | -0.13 | (-0.18, -0.09) | <0.001 |
|
| 0.30 | 0.27 | (0.25, 0.30) | <0.001 |
|
| 0.50 | 1.02 | (0.93, 1.10) | <0.001 |
| Specification III | ||||
|
| 0.10 | 0.10 | (0.05, 0.15) | 0.003 |
|
| 0.30 | 0.30 | (0.27, 0.32) | <0.001 |
|
| 0.40 | 0.39 | (0.36, 0.43) | <0.001 |
|
| 0.50 | 0.49 | (0.40, 0.59) | <0.001 |
a z 1 is the only instrumental variable, but it is invalid;
b z 1 and z 2 are the invalid and valid instrumental variables respectively;
c z 2 is the only instrumental variable, and it is valid. Specification III is correct.
Fig 2Estimates of the association between the 25(OH)D concentration and the health outcome (ϴ) according to the change in (a) d 1, with δ 1 = 0.4 and (b) δ 1, with d 1 = 0.2; sample size 5000, Specification I and II.
Summary of characteristics according to vitamin D status (based on measured serum 25(OH)D concentration) among 4,002 adults of the National Health and Nutrition Examination Survey (2005–06), American adults who had three readings of systolic blood pressure.
| Severe deficiency<10 ng/mL | Mild deficiency10~20 ng/mL | Adequacy> = 20 ng/mL |
| |
|---|---|---|---|---|
| Overall n(%) | 365 (9.1) | 1427 (35.6) | 2210 (55.3) | |
| Gender | 0.14 | |||
| Male, n (%) | 141(7.3) | 711(36.6) | 1089(56.1) | |
| Female, n(%) | 224(10.9) | 716(34.7) | 1121(54.4) | |
| Age (years), mean(SD) | 41.3 (18.3) | 44.1 (19.0) | 45.4 (18.8) | <0.001 |
| Overweight or obesity status | <0.001 | |||
| BMI>25, n (%) | 277(10.2) | 1050(38.6) | 1390(51.2) | |
| BMI≤25, n(%) | 88(6.9) | 377(29.3) | 820(63.8) | |
| Systolic blood pressure (mmHg) mean(SD) | 124.1 (19.5) | 123.4 (18.0) | 120.7 (17.3) | <0.001 |
a P values were derived from Kolmogorov-Smirnov tests;
b P values for trend (two-sided) were derived from trend tests.
Multivariable association between measured 25(OH)D concentration and systolic blood pressure; determinants of 25(OH)D concentration (Stage I); and multivariable association between predicted 25(OH)D score and systolic blood pressure (Stage II) among 4,002 adults of the National Health and Nutrition Examination Survey (2005–06), US, OLS.
| Variable | Systolic blood pressure |
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|
| Coefficient | 95% CI |
| Coefficient | 95% CI |
| Coefficient | 95% CI |
| |
| Measured 25(OH)D concentration | -0.16 | (-0.21, -0.11) | <0.001 | ||||||
| Predicted 25(OH)D score | -1.15 | (-1.48, -0.82) | <0.001 | ||||||
| Age | 0.42 | (0.40, 0.45) | <0.001 | 0.02 | (0.01, 0.04) | <0.001 | 0.45 | (0.42, 0.47) | <0.001 |
| Male (vs. female) | 3.71 | (2.74, 4.67) | <0.001 | -0.28 | (-0.86, 0.30) | 0.35 | 3.43 | (2.45, 4.41) | <0.001 |
| Overweight or obesity (BMI>25 vs≤25) | 3.13 | (2.08, 4.18) | <0.001 | -3.16 | (-3.79, -2.53) | <0.001 | |||
| Constant | 102.63 | (100.80, 104.45) | <0.001 | 22.89 | (21.05, 23.74) | <0.001 | 130.19 | (124.55, 132.45) | <0.001 |
| Adjusted R2 | 0.24 | 0.02 | 0.24 | ||||||