| Literature DB >> 33805257 |
Giovanni Delnevo1, Giacomo Mancini2, Marco Roccetti1, Paola Salomoni1, Elena Trombini3, Federica Andrei3.
Abstract
This study investigates on the relationship between affect-related psychological variables and Body Mass Index (BMI). We have utilized a novel method based on machine learning (ML) algorithms that forecast unobserved BMI values based on psychological variables, like depression, as predictors. We have employed various machine learning algorithms, including gradient boosting and random forest, with psychological variables relative to 221 subjects to predict both the BMI values and the BMI status (normal, overweight, and obese) of those subjects. We have found that the psychological variables in use allow one to predict both the BMI values (with a mean absolute error of 5.27-5.50) and the BMI status with an accuracy of over 80% (metric: F1-score). Further, our study has also confirmed the particular efficacy of psychological variables of negative type, such as depression for example, compared to positive ones, to achieve excellent predictive BMI values.Entities:
Keywords: artificial intelligence; depression; machine learning; obesity; reproducibility
Year: 2021 PMID: 33805257 PMCID: PMC8037317 DOI: 10.3390/s21072361
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Number of subjects for each BMI class.
| BMI Class | Number of Subjects |
|---|---|
| Normal Weight | 60 |
| Overweight | 25 |
| Obesity | 136 |
Figure 1Classification: F1-scores of the cross-validation on the training set.
Hyper-parameters and values tested during tuning for the classification.
| Algorithm | Parameter | Values |
|---|---|---|
| MLP | Activation Function | identity, logistic, tanh, |
| Solver | lbfgs, | |
| Max Iterations | 200, | |
| Alpha | ||
| Hidden layer size | ||
| RF | Min Samples Leaf | 1, 3, |
| Min Samples Split | ||
| Max Depth | 3, | |
| Max Features | ||
| Criterion | gini, | |
| Bootstrap | ||
| Number of Estimators | 50, 100, | |
| GB | Learning Rate | 0.01, 0.05. 0.1, |
| Min Samples Leaf | 1, 3, | |
| Min Samples Split | ||
| Max Depth | 3, 5, | |
| Max Features | ||
| Criterion | ||
| Subsample | 0.5, | |
| Number of Estimators | 50, 100, | |
| ET | Min Samples Leaf | 1, |
| Min Samples Split | 2, | |
| Max Depth | 3, 5, | |
| Max Features | ||
| Criterion | gini, | |
| Number of Estimators | 50, 100, 200, |
Classification: F1-score, specificity/sensitivity on the test set, after tuning.
| Algorithm | Class | Sensitivity | Specificity | F1-Score |
|---|---|---|---|---|
| MLP | Normal Weight | 0.67 | 0.91 | 0.81 |
| Overweight | 0.60 | 0.90 | ||
| Obesity | 0.89 | 0.88 | ||
| GB | Normal Weight | 0.83 | 0.91 | 0.85 |
| Overweight | 0.60 | 0.93 | ||
| Obesity | 0.89 | 0.94 | ||
| RF | Normal Weight | 0.92 | 0.91 | 0.89 |
| Overweight | 0.60 | 0.97 | ||
| Obesity | 0.93 | 0.94 | ||
| ET | Normal Weight | 0.75 | 0.88 | 0.82 |
| Overweight | 0.60 | 0.97 | ||
| Obesity | 0.89 | 0.82 |
Regression: Mean Absolute Error and Pearson Correlation Coefficient of the cross-validation on the training set.
| Algorithm | MAE | PCC |
|---|---|---|
| LASSO | 4.35 | 0.81 |
| EN | 4.35 | 0.8 |
| CART | 5.93 | 0.63 |
| KNN | 4.37 | 0.79 |
| SVR | 5.33 | 0.75 |
| MLP | 9.44 | 0.5 |
| AB | 4.62 | 0.76 |
| GB | 4.58 | 0.76 |
| RF | 4.65 | 0.77 |
Hyper-parameters and values tested during tuning for the regression.
| Algorithm | Parameter | Values |
|---|---|---|
| LASSO | Alpha | |
| EN | Alpha | |
| KNN | N Neighbors | 3, 7, 11, 15, |
| Leaf Size | ||
| Weights | uniform, | |
| Algorithm | auto, ball tree, kd tree, | |
| RF | Min Samples Leaf | 1, |
| Min Samples Split | ||
| Max Depth | 3, | |
| Max Features | ||
| Criterion | mse, | |
| Bootstrap | ||
| Number of Estimators | 50, | |
| Gb | Learning Rate | 0.01, |
| Min Samples Leaf | 1, 3, | |
| Min Samples Split | ||
| Max Depth | 3, 5, | |
| Max Features | ||
| Criterion | friedman mse, | |
| Subsample | 0.5, | |
| Number of Estimators | 50, | |
| ET | Min Samples Leaf | |
| Min Samples Split | ||
| Max Depth | 3, 5, | |
| Max Features | ||
| Criterion | mse, | |
| Number of Estimators | 50, |
Regression: Mean Absolute Error and Pearson Correlation Coefficient on the training and test set, after the tuning phase.
| Algorithm | 4-Fold CV | Test | ||
|---|---|---|---|---|
| MAE | PCC | MAE | PCC | |
| LASSO | 4.35 | 0.81 | 6.00 | 0.72 |
| EN | 4.35 | 0.80 | 6.52 | 0.70 |
| KNN | 4.31 | 0.76 | 5.50 | 0.76 |
| GB | 4.14 | 0.79 | 5.27 | 0.75 |
| RF | 4.26 | 0.79 | 5.31 | 0.78 |
| ET | 4.41 | 0.78 | 5.57 | 0.76 |
Classification: positive vs. negative variables, F1-scores of the cross-validation on the training set.
| Algorithm | Class | Positive Variables | Negative Variables | ||||
|---|---|---|---|---|---|---|---|
| Sen | Spec | F1 | Sen | Spec | F1 | ||
| KNN | Normal Weight | 0.38 | 0.66 | 0.43 |
|
|
|
| Overweight | 0.35 | 0.69 |
|
| |||
| Obesity | 0.41 | 0.78 |
|
| |||
| CART | Normal Weight | 0.33 | 0.67 | 0.44 |
|
|
|
| Overweight | 0.15 | 0.80 |
|
| |||
| Obesity | 0.50 | 0.74 |
|
| |||
| SVC | Normal Weight | 0.48 | 0.73 | 0.51 |
|
|
|
| Overweight | 0.30 | 0.74 |
|
| |||
| Obesity | 0.51 | 0.74 |
|
| |||
| MLP | Normal Weight | 0.50 | 0.69 | 0.52 |
|
|
|
| Overweight | 0.25 | 0.78 |
|
| |||
| Obesity | 0.53 | 0.77 |
|
| |||
| AB | Normal Weight | 0.42 | 0.68 | 0.49 |
|
|
|
| Overweight | 0.15 | 0.83 |
|
| |||
| Obesity | 0.56 | 0.63 |
|
| |||
| GB | Normal Weight | 0.40 | 0.75 | 0.51 |
|
|
|
| Overweight | 0.25 | 0.81 |
|
| |||
| Obesity | 0.57 | 0.59 |
|
| |||
| RF | Normal Weight | 0.38 | 0.72 | 0.47 |
|
|
|
| Overweight | 0.20 | 0.80 |
|
| |||
| Obesity | 0.52 | 0.56 |
|
| |||
| ET | Normal Weight | 0.38 | 0.73 | 0.49 |
|
|
|
| Overweight | 0.20 | 0.78 |
|
| |||
| Obesity | 0.55 | 0.62 |
|
| |||
Regression: positive vs. negative variables, Mean Absolute Error and Pearson Correlation Coefficient of the cross-validation on the train set.
| Algorithm | Positive Variables | Negative Variables | ||
|---|---|---|---|---|
| MAE | PCC | MAE | PCC | |
| LASSO | 8.05 | 0.16 |
|
|
| EN | 8.03 | 0.16 |
|
|
| CART | 9.96 | 0.23 |
|
|
| KNN | 8.47 | 0.1 |
|
|
| SVR | 8.19 | 0.15 |
|
|
| MLP | 9.88 | 0.08 |
|
|
| AB | 8.01 | 0.17 |
|
|
| GB | 8.04 | 0.28 |
|
|
| RF | 8.13 | 0.3 |
|
|
| ET | 8.14 | 0.33 |
|
|
Classification: F1-scores of the cross-validation on the training set, removing in turn one negative psychological variable.
| Algorithm | Class | No DE | No TA | No BE | No ES | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sen | Spec | F1 | Sen | Spec | F1 | Sen | Spec | F1 | Sen | Spec | F1 | ||
| KNN | N.W. | 0.60 | 0.77 |
| 0.63 | 0.91 | 0.80 | 0.63 | 0.85 | 0.74 | 0.56 | 0.92 | 0.78 |
| Over. | 0.30 | 0.76 | 0.55 | 0.84 | 0.20 | 0.83 | 0.45 | 0.82 | |||||
| Obes. | 0.58 | 0.82 | 0.89 | 0.96 | 0.86 | 0.94 | 0.92 | 0.96 | |||||
| CART | N.W. | 0.46 | 0.80 |
| 0.65 | 0.88 | 0.75 | 0.56 | 0.91 | 0.75 | 0.75 | 0.88 | 0.78 |
| Over. | 0.25 | 0.83 | 0.30 | 0.86 | 0.35 | 0.85 | 0.30 | 0.90 | |||||
| Obes. | 0.66 | 0.62 | 0.86 | 0.87 | 0.90 | 0.85 | 0.88 | 0.88 | |||||
| SVC | N.W. | 0.65 | 0.81 |
| 0.75 | 0.92 | 0.85 | 0.54 | 0.88 | 0.77 | 0.73 | 0.91 | 0.84 |
| Over. | 0.35 | 0.80 | 0.50 | 0.91 | 0.35 | 0.85 | 0.45 | 0.91 | |||||
| Obes. | 0.63 | 0.79 | 0.95 | 0.96 | 0.94 | 0.97 | 0.95 | 0.94 | |||||
| MLP | N.W. | 0.65 | 0.78 |
| 0.69 | 0.91 | 0.82 | 0.75 | 0.88 | 0.81 | 0.69 | 0.91 | 0.82 |
| Over. | 0.30 | 0.87 | 0.50 | 0.89 | 0.30 | 0.90 | 0.45 | 0.89 | |||||
| Obes. | 0.68 | 0.75 | 0.93 | 0.94 | 0.93 | 0.96 | 0.94 | 0.94 | |||||
| AB | N.W. | 0.52 | 0.81 |
| 0.77 | 0.87 | 0.75 | 0.73 | 0.88 | 0.72 | 0.75 | 0.85 | 0.76 |
| Over. | 0.30 | 0.80 | 0.30 | 0.83 | 0.30 | 0.78 | 0.35 | 0.85 | |||||
| Obes. | 0.61 | 0.66 | 0.79 | 0.93 | 0.75 | 0.96 | 0.80 | 0.94 | |||||
| GB | N.W. | 0.54 | 0.83 |
| 0.67 | 0.89 | 0.79 | 0.67 | 0.88 | 0.79 | 0.75 | 0.91 | 0.82 |
| Over. | 0.25 | 0.86 | 0.30 | 0.90 | 0.30 | 0.90 | 0.40 | 0.90 | |||||
| Obes. | 0.70 | 0.63 | 0.94 | 0.90 | 0.94 | 0.90 | 0.93 | 0.94 | |||||
| RF | N.W. | 0.54 | 0.80 |
| 0.69 | 0.88 | 0.79 | 0.65 | 0.89 | 0.79 | 0.79 | 0.89 | 0.82 |
| Over. | 0.25 | 0.85 | 0.30 | 0.90 | 0.35 | 0.90 | 0.35 | 0.92 | |||||
| Obes. | 0.69 | 0.68 | 0.92 | 0.90 | 0.94 | 0.90 | 0.93 | 0.93 | |||||
| ET | N.W. | 0.58 | 0.81 |
| 0.71 | 0.90 | 0.81 | 0.60 | 0.88 | 0.78 | 0.81 | 0.91 | 0.83 |
| Over. | 0.20 | 0.85 | 0.35 | 0.91 | 0.35 | 0.90 | 0.35 | 0.92 | |||||
| Obes. | 0.69 | 0.69 | 0.94 | 0.90 | 0.94 | 0.90 | 0.94 | 0.93 | |||||
Regression: Mean Absolute Error and Pearson Correlation Coefficient of the cross-validation on the training set, removing in turn one negative psychological variable.
| Algorithm | No DE | No TA | No BE | No ES | ||||
|---|---|---|---|---|---|---|---|---|
| MAE | PCC | MAE | PCC | MAE | PCC | MAE | PCC | |
| LASSO | 6.83 | 0.58 | 5.01 | 0.78 | 4.59 | 0.82 | 4.47 | 0.82 |
| EN | 6.83 | 0.58 | 5 | 0.78 | 4.58 | 0.82 | 4.46 | 0.82 |
| CART | 9.07 | 0.3 | 5.92 | 0.69 | 5.8 | 0.72 | 6.39 | 0.67 |
| KNN | 6.68 | 0.57 | 4.86 | 0.78 | 4.41 | 0.83 | 4.6 | 0.8 |
| SVR | 6.92 | 0.56 | 4.86 | 0.79 | 4.82 | 0.81 | 4.46 | 0.81 |
| MLP | 8.18 | 0.44 | 7.26 | 0.61 | 7.09 | 0.66 | 8.02 | 0.62 |
| AB | 7.25 | 0.54 | 4.58 | 0.8 | 4.56 | 0.82 | 4.63 | 0.8 |
| GB | 6.69 | 0.55 | 4.6 | 0.79 | 4.41 | 0.82 | 4.66 | 0.79 |
| RF | 6.77 | 0.53 | 4.54 | 0.81 | 4.54 | 0.82 | 4.75 | 0.79 |
| ET | 7.09 | 0.51 | 4.45 | 0.82 | 4.55 | 0.83 | 4.67 | 0.8 |