| Literature DB >> 28278129 |
Kedir N Turi1, David M Buchner2, Diana S Grigsby-Toussaint2.
Abstract
INTRODUCTION: Statistical models for assessing risk of type 2 diabetes are usually additive with linear terms that use non-nationally representative data. The objective of this study was to use nationally representative data on diabetes risk factors and spline regression models to determine the ability of models with nonlinear and interaction terms to assess the risk of type 2 diabetes.Entities:
Mesh:
Year: 2017 PMID: 28278129 PMCID: PMC5345963 DOI: 10.5888/pcd14.160244
Source DB: PubMed Journal: Prev Chronic Dis ISSN: 1545-1151 Impact factor: 2.830
Distribution of Variables for the Study Samplea (N = 5,471), National Health and Nutrition Examination Survey (NHANES), 2005–2012
| Characteristic | Total No. (% | Has Diabetes | |
|---|---|---|---|
| No (% | Yes (% | ||
|
| |||
| Male | 2,510 (45.9) | 2,182 (46.0) | 328 (45.1) |
| Female | 2,961 (54.1) | 2,561(54.0) | 400 (54.9) |
| All | 5,471 (100.0) | 4,743 (100.0) | 728 (100.0) |
|
| |||
| Mexican American | 984 (17.9) | 818 (17.3) | 166 (22.8) |
| Other Hispanic ethnicity | 572 (10.5) | 486 (10.3) | 86 (11.8) |
| Non-Hispanic white | 2,668 (48.8) | 2,360 (49.8) | 308 (42.3) |
| Black | 994 (18.2) | 849 (17.9) | 145 (19.9) |
| Others (Asian, American Indian, multiracial) | 253 (4.6) | 230 (4.9) | 23 (3.2) |
| All | 5,471 (100.0) | 4,743 (100.0) | 728 (100.0) |
|
| |||
| Never married | 1,076 (19.7) | 1,009 (21.3) | 67 (9.2) |
| Married or living with partner | 3,181 (58.1) | 2,778 (58.6) | 403 (55.4) |
| Separated, divorced, or widowed | 1,214 (22.2) | 956 (20.1) | 258 (35.4) |
| All | 5,471 (100.0) | 4,743 (100.0) | 728 (100.0) |
|
| |||
| Full food security | 3,818 (69.8) | 3,297 (69.5) | 521 (71.6) |
| High marginal food security | 592 (10.8) | 521 (11.0) | 71 (9.8) |
| Low food security | 675 (12.3) | 596 (12.6) | 79 (10.8) |
| Very low food security | 386 (7.1) | 329 (6.9) | 57 (7.8) |
| All | 5,471 (100.0) | 4,743 (100.0) | 728 (100.0) |
|
| |||
| Less than high school diploma | 1,523 (27.8) | 1,218 (25.7) | 305 (41.9) |
| High school diploma | 1,326 (24.2) | 1,139 (24.0) | 187 (25.7) |
| Some college or higher | 2,622 (47.9) | 2,386 (50.3) | 236 (32.4) |
| All | 5,471 (100.0) | 4,743 (100.0) | 728 (100.0) |
|
| |||
| Never smoker | 3,804 (69.5) | 3,272 (69.0) | 532 (73.1) |
| Former smoker | 255 (4.7) | 231 (4.9) | 24 (3.3) |
| Current smoker | 1,412 (25.8) | 1,240 (26.1) | 172 (23.6) |
| All | 5,471 (100.0) | 4,743 (100.0) | 728 (100.0) |
|
| |||
| Abstainers | 841 (15.4) | 682 (14.4) | 159 (21.8) |
| Occasional drinker | 2,545 (46.5) | 2,192 (46.2) | 353 (48.5) |
| Moderate drinker | 901 (16.5) | 834 (17.6) | 67 (9.2) |
| Heavy drinker | 1,184 (21.6) | 1,035 (21.8) | 149 (20.5) |
| All | 5,471 (100.0) | 4,743 (100.0) | 728 (100.0) |
|
| |||
| No | 3,321 (60.7) | 3,039 (64.1) | 282 (38.7) |
| Yes | 2,150 (39.3) | 1,704 (35.9) | 446 (61.3) |
| All | 5,471 (100.0) | 4,743 (100.0) | 728 (100.0) |
|
| |||
| Age, y | 46.7 (17.4) | 44.6 (16.9) | 61.1 (13.1) |
| Body mass index, kg/m2 | 27.5 (5.6) | 27.1 (5.4) | 30.0 (5.8) |
| Waist circumference, cm | 95.0 (13.9) | 93.7 (13.6) | 103.4 (13.2) |
| Subscapular skinfold, mm | 21.3 (8.2) | 20.8 (8.2) | 24.5 (7.7) |
| Family income-to-poverty ratio | 2.5 (1.6) | 2.5 (1.6) | 2.2 (1.5) |
| Depression, mean score | 2.6 (3.7) | 2.5 (3.6) | 3.1 (4.3) |
| Moderate-to-vigorous physical activity, no. of min/d | 101.7 (157.1) | 108.0 (160.9) | 60.6 (121.9) |
| Diastolic blood pressure, mm Hg | 69.6 (11.7) | 69.7 (11.5) | 68.9 (13.0) |
| Systolic blood pressure, mm Hg | 121.8 (18.3) | 120.3 (17.4) | 131.6 (20.8) |
| Sleep duration, h | 6.8 (1.4) | 6.8 (1.4) | 6.7 (1.6) |
Abbreviations: SD, standard deviation.
Data from 4 waves (2005–2006 to 2011–2012), excluding data for pregnant participants and those aged younger than 20 years, were combined to obtain the sample. The sample was composed of those who answered the question “Other than during pregnancy, have you ever been told by a doctor or health professional that you have diabetes or sugar diabetes?” or those who had valid laboratory test results for diabetes; the sample consisted of 22,660 observations. We then excluded participants who were missing data on 1 or more of the 17 independent variables examined; the final sample consisted of 5,471 participants.
NHANES assesses household food security through a 10-item questionnaire and classifies adults into 4 categories: 1) full food security (no food security concerns), 2) marginal food security (1 or 2 concerns), 3) low food security (3–5 concerns), and very low food security (6–10 concerns).
Participants who self-reported smoking at least 100 cigarettes in their lifetime and smoking every day or some days at the time of the interview were classified as current smokers. Participants who reported smoking at least 100 cigarettes in their lifetime and not smoking at all at the time of the interview were classified as ex-smokers. Participants who reported smoking fewer than 100 cigarettes in their lifetime were classified as never-smokers.
Categorized as abstainer (<12 alcoholic drinks in lifetime; 15.4%), occasional drinker (≤1 alcoholic drink per week in previous 12 months, 46.5%), moderate drinker (2 or 3 alcoholic drinks per week in previous 12 months; 16.5%), and heavy drinker (>3 alcoholic drinks per week in previous 12 months, 21.6%) (16).
Calculated by dividing family income by the federal poverty threshold specific to household size and year. A ratio of 1.0 or greater indicates income above the poverty level.
Computed from the Patient Health Questionnaire-9 (19); scored from 0 to 36, with higher scores indicating higher levels of depression.
An estimate of average daily time spent in moderate-to-vigorous physical activity (MVPA) was computed by adding up the minutes of reported MVPA during transportation, work, and leisure activities from 7 days of recall and dividing the total number of minutes by 7.
Results of Additive MARS Modela After Backward Selection and 2-Way Interaction MARS Modelb, National Health and Nutrition Examination Survey (N = 5,471), 2005–2012c
| Variable/Basis Function | Coefficient |
|---|---|
|
| |
|
| 4.40 |
|
| |
| Max (age [y] − 69, 0) | 0.07 |
| Max (69 − age [y], 0) | −0.11 |
| Max (age [y] − 45, 0) | −0.06 |
|
| −1.03 |
|
| −0.51 |
|
| 0.34 |
|
| |
| Max (body mass index − 48.2, 0), kg/m2 | 0.32 |
| Max (48.2 − body mass index, 0), kg/m2 | 0.06 |
| Max (141.4 − waist circumference, 0), cm | −0.05 |
| Max (27.7 − subscapular skinfold, 0), cm | −0.06 |
|
| |
| Max (depression score − 13, 0) | −0.82 |
| Max (depression score − 11, 0) | 0.63 |
|
| |
| Max (diastolic blood pressure − 83.3, 0) | 0.19 |
| Max (83.3 − diastolic blood pressure, 0) | −0.05 |
| Max (diastolic blood pressure − 91.3, 0) | −0.13 |
| Max (diastolic blood pressure − 55.3, 0) | −0.09 |
| Max (124.6 − systolic blood pressure, 0) | −0.02 |
| Max (systolic blood pressure − 158, 0) | 0.02 |
|
| |
| Max (28.6 − moderate-to-vigorous physical activity, 0) | 0.01 |
|
| |
| Max (7 − sleep duration, 0) | 0.11 |
|
| |
|
| 1.85 |
| Max (age − 69, 0), y | −0.22 |
| Max (69 − age, 0), y | −0.03 |
| No family history of diabetes | −0.56 |
| Max (141.3 − waist circumference, 0), cm | −0.03 |
| Max (subscapular skinfold − 11.1, 0), mm | 0.03 |
| Max (11.1 − subscapular skinfold, 0), mm | −0.19 |
| Max (body mass index − 21.9, 0), kg/m2 | −0.13 |
| Max (21.9 − body mass index, 0), kg/m2 | −0.51 |
| Max (family income-to-poverty ratio | −0.19 |
| Max (0.77 − family income-to-poverty ratio | −1.29 |
| Max (age − 69, 0), y × max (waist circumference − 123.4, 0), cm | 0.19 |
| Max (age − 69, 0), y × max (123.4 − waist circumference, 0), cm | 0.005 |
| Max (69 − age, 0), y × max (systolic blood pressure − 190, 0), mm Hg | 0.005 |
| Max (69 − age, 0), y × max (190 − systolic blood pressure, 0), mm Hg | −0.0007 |
| Max (age − 69, 0), y × max (body mass index − 18.9, 0), kg/m2 | 0.014 |
| Max (age − 69, 0), y × max (18.9 − body mass index, 0), kg/m2 | 0.20 |
| Max (69 − age, 0), y × max (body mass index − 24.5, 0), kg/m2 | 0.007 |
| Max (69 − age, 0), y × max (24.5 − body mass index, 0), kg/m2 | 0.009 |
| Max (21.9 − body mass index, 0), kg/m2 × max (53 − age, 0), y | −0.008 |
| Max (21.9 − body mass index, 0), kg/m2 × max (family income-to-poverty ratio | −0.19 |
| Max (21.9 − body mass index, 0), kg/m2 × max (0.3 − family income-to-poverty ratio | 3.007 |
| Max (family income-to-poverty ratio | 0.02 |
| Max (family income-to-poverty ratio | 0.007 |
| Max (141.3 − waist circumference, 0), cm × max (12 − depression score | −0.001 |
| Has a family history of diabetes × Mexican American ethnicity | 0.64 |
| Has a family history of diabetes × max (79.3 − diastolic blood pressure, 0), mm Hg | 0.02 |
| Has a high school diploma × max (141.3 − waist circumference, 0), cm | −0.01 |
Abbreviations: MARS, multivariate adaptive regression splines; max, maximum.
Additive MARS does not incorporate interactions between basis functions.
2-way MARS model features interactions between 2 basis functions. A spline is a smooth piecewise polynomial (an expression of >2 algebraic terms); knots are the places where the polynomials join.
Example of an interpretation of basis function: at each knot of a variable (for example, 69 y and 45 y for age) there are 2 basis functions that reflect each other. Both basis functions could be significant, or only one could be significant. Age <69 y and age >69 y are both significant and have linear coefficients of −0.11 and 0.07, respectively. In the same variable at a knot value of 45, only 1 basis function is significant (age >45 y) and is included in the model; it has a linear coefficient of −0.06. If age does not fall into these ranges (for example, if age <69 for basis function max [age, y − 69, 0]), it would be assigned a value of 0; thus max (age, y − 69, 0) indicates this mathematical relationship.
Computed from the Patient Health Questionnaire-9 (17); scored from 0 to 36, with higher scores indicating higher levels of depression.
Calculated by dividing family income by the federal poverty threshold specific to household size and year. A ratio of 1.0 or greater indicates income above the poverty level.
Figure 1A) Contribution of interaction between systolic blood pressure and age to the risk of diabetes, B) Contribution of interaction between diastolic blood pressure and age to the risk of diabetes, C) Contribution of interaction between age and sleep duration to the risk of diabetes, and D) Contribution of interaction between age and depression score to the risk of diabetes.
Figure 2Area under the receiver operating characteristics curve (AUC) comparing 3 diabetes risk-prediction models: a logistic regression model, an additive MARS model, and 2-way interaction MARS model. Abbreviation: MARS, multivariate adaptive regression splines.
Comparison of Logistic Regression Model, Additive MARS Model, and 2-Way Interaction MARS Model Using Net Reclassification Indexa and Integrated Discrimination Improvementb
| Criteria | Logistic vs Additive | Logistic vs 2-Way MARS | Additive MARS |
|---|---|---|---|
| NRI (95% CI) [ | 0.5 (0.2 to 0.8) [.004] | 0.03 (−0.3 to 0.3) [.80] | −0.2 (−0.5 to 0.08) [.15] |
| IDI (95% CI) [ | 0.05 (0.006 to 0.08) [.02] | −0.04 (−0.06 to −0.008) [.01] | 0.04 (−0.08 to 0.004) [.08] |
Abbreviations: CI, confidence interval; IDI, integrated discrimination improvement; MARS, multivariate adaptive regression splines; NRI, net reclassification index.
NRI quantifies how well a new model, compared with a previous model, reclassifies participants.
IDI quantifies how well a new model (with added variables), compared with a previous model (with fewer variables), predicts a binary outcome of interest.
Additive MARS does not incorporate interactions between basis functions.
2-Way MARS model features interactions between 2 basis functions.