| Literature DB >> 34838133 |
Rumana Rois1, Manik Ray2, Atikur Rahman2, Swapan K Roy3.
Abstract
BACKGROUND: Stress-related mental health problems are one of the most common causes of the burden in university students worldwide. Many studies have been conducted to predict the prevalence of stress among university students, however most of these analyses were predominantly performed using the basic logistic regression (LR) model. As an alternative, we used the advanced machine learning (ML) approaches for detecting significant risk factors and to predict the prevalence of stress among Bangladeshi university students.Entities:
Keywords: Confusion matrix; Decision tree; Feature selection; Mental health; ROC; Random forest; Support vector machine; k-fold cross-validation
Mesh:
Year: 2021 PMID: 34838133 PMCID: PMC8627029 DOI: 10.1186/s41043-021-00276-5
Source DB: PubMed Journal: J Health Popul Nutr ISSN: 1606-0997 Impact factor: 2.000
Frequency distribution and relationship with stress among university students
| Variables | Total 355 | Stress ( | ||
|---|---|---|---|---|
| Yes (%) | ||||
| Gender | ||||
| Female | 204 (57.5) | 56 (27.5) | 2.386 | 0.131 |
| Male | 151 (42.5) | 53 (35.1) | ||
| University | ||||
| 1. Jahangirnagar University | 199 (56.1) | 59 (29.6) | 38.811 | 0.066 |
| 2. University of Dhaka | 21 (5.9) | 5 (23.8) | ||
| … | … | … | ||
| 27. National University | 2 (0.6) | 1 (50.0) | ||
| 28. University of Rajshahi | 20 (5.6) | 7 (35.0) | ||
| Background | ||||
| Arts | 65 (18.3) | 20 (30.8) | 2.891 | 0.576 |
| Science | 183 (51.5) | 50 (27.3) | ||
| Commerce | 40 (11.3) | 14 (35.0) | ||
| Medical | 30 (8.5) | 12 (40.0) | ||
| Engineering | 37 (10.4) | 13 (35.1) | ||
| Academic year | ||||
| 1st year | 46 (13.0) | 19 (41.3) | 3.506 | 0.477 |
| 2nd year | 33 (9.3) | 8 (24.2) | ||
| 3rd year | 31 (8.7) | 10 (32.3) | ||
| 4th year | 24 (6.8) | 8 (33.3) | ||
| Masters | 221 (62.3) | 64 (29.0) | ||
| Pulse rate | ||||
| Low | 57 (16.1) | 55 (96.5) | 200.75 | < 0.001* |
| Normal | 273 (76.9) | 32 (11.7) | ||
| High | 25 (7.0) | 22 (88.0) | ||
| Alcoholic | ||||
| Yes | 30 (8.5) | 9 (30.0) | 0.008 | 0.930 |
| No | 325 (91.5) | 100 (30.8) | ||
| Smoking status | ||||
| Yes | 56 (15.8) | 22 (29.1) | 2.301 | 0.129 |
| No | 299 (84.2) | 22 (39.3) | ||
| Sleep time | ||||
| Less than normal | 29 (8.2) | 9 (31) | 5.441 | 0.066 |
| Normal | 225 (63.4) | 78 (34.7) | ||
| More than normal | 101 (28.5) | 22 (21.8) | ||
| SBP | ||||
| Hypotension | 19 (5.4) | 19 (100) | 84.320 | < 0.001* |
| Normotensive | 275 (77.5) | 59 (21.5) | ||
| Prehypertensive | 48 (13.5) | 18 (37.5) | ||
| Hypertensive | 13 (3.7) | 13 (100) | ||
| DBP | ||||
| Hypotension | 13 (3.7) | 13 (100) | 79.554 | < 0.001* |
| Normotensive | 273 (76.9) | 63 (23.1) | ||
| Prehypertensive | 45 (12.7) | 11 (24.4) | ||
| Hypertensive | 24 (6.8) | 22 (91.7) | ||
| BMI | ||||
| Underweight | 77 (21.7) | 24 (31.2) | 1.710 | 0.425 |
| Normal weight | 198 (55.8) | 56 (28.3) | ||
| Overweight/obese | 80 (22.5) | 29 (36.3) | ||
*Statistically significant at the 0.05 level
Fig. 1Features selection using the Boruta algorithm
Accuracy, sensitivity, specificity and precision of different ML models
| Models | Accuracy | Sensitivity | Specificity | Precision | ||||
|---|---|---|---|---|---|---|---|---|
| Mean | SE | Mean | SE | Mean | SE | Mean | SE | |
| DT | 0.8845 | 0.017 | 0.8908 | 0.8639 | 0.076 | 0.9581 | 0.024 | |
| RF | 0.075 | |||||||
| SVM (polynomial kernel) | 0.7782 | 0.035 | 0.8504 | 0.047 | 0.6047 | 0.8406 | 0.039 | |
| SVM (linear kernel) | 0.8054 | 0.039 | 0.8460 | 0.045 | 0.7188 | 0.172 | 0.8969 | 0.068 |
| LR | 0.7723 | 0.037 | 0.8160 | 0.045 | 0.6175 | 0.094 | 0.8848 | 0.031 |
Mean Mean of different scores of the repeated runs, SE Standard Error of different scores of the repeated runs
Fig. 2The ROC curves to predict mental stress using DT, RF, SVM, and LR models
Result of K-Fold cross-validation of ML Models
| Models | Threefold | Fivefold | 10-Fold | |||
|---|---|---|---|---|---|---|
| MAcc | SE | MAcc | SE | MAcc | SE | |
| DT | 0.8759 | 0.0419 | 0.8901 | 0.0138 | 0.8870 | 0.0361 |
| RF | 0.0291 | |||||
| SVM (polynomial kernel) | 0.7718 | 0.0215 | 0.7915 | 0.0559 | 0.7855 | 0.0661 |
| SVM (linear kernel) | 0.8085 | 0.8338 | 0.0187 | 0.8309 | 0.0383 | |
| LR | 0.7830 | 0.0396 | 0.7718 | 0.0566 | 0.7713 | 0.0669 |
MAcc Mean of Accuracy scores from each fold, SE Standard Error of Accuracy scores
Fig. 3Top one tree from the fitted RF model to predict university student’s mental stress
Prediction of university student’s stress using the fitted RF model
| Pulse rate | SBP | DBP | Smoking | Dept | Sleep status | Predicted stress |
|---|---|---|---|---|---|---|
| High | Hypertensive | Hypertensive | No | Arts | Normal | Stressed |
| Normal | Hypotension | Hypotension | No | Science | More than normal | Non-stressed |
| High | Normotensive | Hypotension | No | Medical | Normal | Non-stressed |
| Normal | Prehypertensive | Prehypertensive | Yes | Engineering | Less than normal | Stressed |
| Low | Normotensive | Hypotension | Yes | Medical | Less than normal | Stressed |
Odds ratios (OR) with 95% CIs, and p-values obtained from the LR model
| Variables | OR | (95% CI) | |
|---|---|---|---|
| Pulse rate | |||
| Low (ref.) | 1.000 | – | – |
| Normal | 0.002 | (0.000–0.013) | < 0.001* |
| High | 0.037 | (0.004–0.389) | 0.006* |
| Smoking | |||
| No (ref.) | 1.000 | – | – |
| Yes | 4.112 | (1.591–10.628) | 0.004* |
| Sleep time | |||
| Less than normal (ref.) | 1.000 | – | – |
| Normal | 5.244 | (0.811–33.911) | 0.082 |
| More than normal | 5.660 | (0.808–39.650) | 0.081 |
| SBP | |||
| Hypotension (ref.) | 1.000 | – | – |
| Normotensive | 0.000 | (0.000–.) | 0.998 |
| Prehypertensive | 0.000 | (0.000–.) | 0.998 |
| Hypertensive | 0.315 | (0.000–.) | 1.000 |
| DBP | |||
| Hypotension (ref.) | 1.000 | – | – |
| Normotensive | 0.000 | (0.000–.) | 0.998 |
| Prehypertensive | 0.000 | (0.000–.) | 0.999 |
| Hypertensive | 0.000 | (0.000–.) | 0.999 |
| Background | |||
| Arts (ref.) | 1.000 | – | – |
| Science | 1.428 | (0.456–4.474) | 0.541 |
| Commerce | 0.655 | (0.117–3.682) | 0.631 |
| Medical | 2.925 | (0.545–15.702) | 0.211 |
| Engineering | 3.210 | (0.814–12.665) | 0.096 |
OR = 1 for the reference category
*Significant at 5% level