| Literature DB >> 33918760 |
Xiaolu Cheng1, Shuo-Yu Lin1, Jin Liu2, Shiyong Liu3, Jun Zhang4, Peng Nie5, Bernard F Fuemmeler6, Youfa Wang7, Hong Xue1.
Abstract
BACKGROUND: Obesity prevalence has become one of the most prominent issues in global public health. Physical activity has been recognized as a key player in the obesity epidemic.Entities:
Keywords: disparity; machine learning; obesity; physical activity
Year: 2021 PMID: 33918760 PMCID: PMC8069304 DOI: 10.3390/ijerph18083966
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Study population characteristics.
| Variables | Normal (18.5 ≤ BMI < 25 kg/m2) | Overweight (25 ≤ BMI < 30 kg/m2) | Obese (BMI ≥ 30 kg/m2) | |||
|---|---|---|---|---|---|---|
| Observations | Mean (%) | Observations | Mean (%) | Observations | Mean (%) | |
| Gender | ||||||
| Male | 1046 | 47.72 | 1505 | 59.65 | 1131 | 46.22 |
| Female | 1146 | 52.28 | 1018 | 40.35 | 1316 | 53.78 |
| Age | 2192 | 48.60 | 2523 | 52.13 | 2447 | 50.02 |
| Race | ||||||
| Non-Hispanic White | 806 | 36.77 | 857 | 33.97 | 708 | 28.93 |
| Non-Hispanic Black | 334 | 15.24 | 368 | 14.59 | 508 | 20.76 |
| Mexican American | 420 | 19.16 | 593 | 23.50 | 606 | 24.77 |
| Other Race, Including Multi-Racial | 342 | 15.60 | 396 | 15.70 | 410 | 16.76 |
| Other Hispanic | 290 | 13.23 | 309 | 12.25 | 215 | 8.79 |
| Education Level | ||||||
| Less than 9th Grade | 702 | 32.03 | 917 | 36.35 | 862 | 35.23 |
| 9–11th Grade (Includes 12th Grade with No Diploma) | 251 | 11.45 | 280 | 11.1 | 292 | 11.93 |
| High School Grad/GED or Equivalent | 347 | 15.83 | 438 | 17.36 | 465 | 19 |
| Some College or AA Degree | 344 | 15.69 | 369 | 14.63 | 400 | 16.35 |
| College Graduate or Above | 451 | 20.57 | 421 | 16.69 | 341 | 13.94 |
| Refused | 95 | 4.33 | 95 | 3.77 | 87 | 3.56 |
| Do Not Know | 1 | 0.05 | 1 | 0.04 | 0 | 0 |
| Marital Status | ||||||
| Married | 807 | 36.82 | 1016 | 40.27 | 903 | 36.90 |
| Widowed | 445 | 20.30 | 514 | 20.37 | 485 | 19.82 |
| Divorced | 305 | 13.91 | 331 | 13.12 | 356 | 14.55 |
| Separated | 148 | 6.75 | 203 | 8.05 | 219 | 8.95 |
| Never Married | 327 | 14.92 | 276 | 10.94 | 319 | 13.04 |
| Living with Partner | 126 | 5.75 | 143 | 5.67 | 122 | 4.99 |
| Refused | 34 | 1.55 | 40 | 1.59 | 43 | 1.76 |
| Family PIR | 2192 | 2.62 | 2523 | 2.74 | 2447 | 2.60 |
| Sum Intensity Value | 2192 | 1,584,527.52 | 2523 | 1,562,816.99 | 2447 | 1,298,389.77 |
| Duration of Different Activity Intensity Levels (in Minutes) | ||||||
| Sedentary | 2192 | 7988.89 | 2523 | 7945.10 | 2447 | 8146.58 |
| Light | 2192 | 1468.27 | 2523 | 1498.67 | 2447 | 1407.09 |
| Lifestyle | 2192 | 500.84 | 2523 | 522.37 | 2447 | 450.55 |
| Moderate | 2192 | 112.70 | 2523 | 106.91 | 2447 | 73.21 |
| Vigorous | 2192 | 7.45 | 2523 | 4.63 | 2447 | 1.93 |
Evaluations of accuracy, sensitivity, and specificity of 11 prediction models of overweightness and obesity.
| Method | Logistic | Naïve | RBF † | Local | CVR † | Random Subspace | Decision Table | Multiobject | Random Tree | J48 ‡ | Multilayer | Mean Value of 11 Models |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy | 69.4% | 69.11% | 69.5% | 69.63% | 69.9% | 70.01% | 69.59% | 69.37% | 60.71% | 68.70% | 68.65% | 62.37% |
| Sensitivity | 70.0% | 70.8% | 70.2% | 70.7% | 70.9% | 70.6% | 70.1% | 70.0% | 71.6% | 72.9% | 72.0% | 70.89% |
| Specificity | 51.6% | 48.1% | 51.6% | 52.1% | 54.4% | 57.3% | 51.4% | 49.5% | 35.6% | 47.9% | 47.2% | 49.70% |
Notes: † RBF = radial basis function; KNN = local k-nearest neighbors; CVR = classification via regression; random subspace; ‡ J48 = J48 is an algorithm to generate decision trees.
Figure 1Receiver operating characteristic (ROC) curves for local KNN, random subspace, decision table, and multiobject algorithms.
Evaluations of accuracy, sensitivity, and specificity of 11 prediction models of obesity.
| Method | Logistic | Naïve | RBF † | Local | CVR | Random Subspace | Decision Table | Multiobject | Random Tree | J48 ‡ | Multilayer | Mean Value of 11 Models |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy | 65.78% | 49.64% | 66.20% | 65.92% | 65.89% | 67.03% | 66.32% | 65.78% | 58.42% | 63.66% | 64.48% | 57.69% |
| Sensitivity | 49.4% | 38.6% | 52.7% | 50.3% | 50.3% | 56.8% | 52.4% | 49.1% | 39.5% | 45.8% | 49.2% | 48.55% |
| Specificity | 66.5% | 76.7% | 67.1% | 68.1% | 67.2% | 68.0% | 67.9% | 66.3% | 68.7% | 69.9% | 69.9% | 68.75% |
Notes: † RBF = radial basis function; KNN = local k-nearest neighbors; CVR = classification via regression; random subspace; ‡ J48 = J48 is an algorithm to generate decision trees.
Figure 2ROC curves for CVR, random subspace, random tree, and multiobject algorithms.
Ranked feature importance in predicting weight status based on information gain.
| Rank | Feature Meaning | Contribution |
|---|---|---|
| 1 | Duration of moderate-intensity activity in one week | 0.0211 |
| 2 | Duration of vigorous-intensity activity in one week | 0.0140 |
| 3 | Age | 0.0137 |
| 4 | The sum of the intensity value recorded by the physical activity monitor in one week | 0.0133 |
| 5 | Race/Ethnicity | 0.0110 |
| 6 | Duration of sedentary-intensity activity in one week | 0.0053 |
| 7 | Duration of lifestyle-intensity activity in one week | 0.0045 |
| 8 | Duration of light-intensity activity in one week | 0.0042 |
| 9 | Gender | 0.0040 |
| 10 | Education level | 0.0027 |
| 11 | Poverty income ratio (PIR) | 0 |
| 12 | Marital status | 0 |