| Literature DB >> 33917206 |
Leandro Donisi1,2, Giuseppe Cesarelli2,3, Armando Coccia2,4, Monica Panigazzi2, Edda Maria Capodaglio2, Giovanni D'Addio2.
Abstract
Many activities may elicit a biomechanical overload. Among these, lifting loads can cause work-related musculoskeletal disorders. Aspiring to improve risk prevention, the National Institute for Occupational Safety and Health (NIOSH) established a methodology for assessing lifting actions by means of a quantitative method based on intensity, duration, frequency and other geometrical characteristics of lifting. In this paper, we explored the machine learning (ML) feasibility to classify biomechanical risk according to the revised NIOSH lifting equation. Acceleration and angular velocity signals were collected using a wearable sensor during lifting tasks performed by seven subjects and further segmented to extract time-domain features: root mean square, minimum, maximum and standard deviation. The features were fed to several ML algorithms. Interesting results were obtained in terms of evaluation metrics for a binary risk/no-risk classification; specifically, the tree-based algorithms reached accuracies greater than 90% and Area under the Receiver operating curve characteristics curves greater than 0.9. In conclusion, this study indicates the proposed combination of features and algorithms represents a valuable approach to automatically classify work activities in two NIOSH risk groups. These data confirm the potential of this methodology to assess the biomechanical risk to which subjects are exposed during their work activity.Entities:
Keywords: IMUs; NIOSH; biomechanical risk assessment; ergonomics; feature extraction; health monitoring; lifting; machine learning; wearable device; work-related musculoskeletal disorders
Mesh:
Year: 2021 PMID: 33917206 PMCID: PMC8068056 DOI: 10.3390/s21082593
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1(a) Opal System: Access Point, Docking Station, Opal Sensor and Mobility Lab software. (b) Placement of the Opal Sensor in lumbosacral region through an elastic belt.
Figure 2Opal Sensor with the illustration of the x-axis, y-axis and z-axis.
Anthropometric characteristics of the study population presented as mean ± standard deviation.
| Age (years) | 27.71 | ± | 1.60 |
| Height (cm) | 167.40 | ± | 4.86 |
| Weight (kg) | 69.00 | ± | 69.00 |
| Body Mass Index (kg/m2) | 24.51 | ± | 24.51 |
Combinations of the height, frequency and weight variables for lifting activities corresponding to LI 0.5 and 1.3.
| First Trial | Second Trial | |||||||
|---|---|---|---|---|---|---|---|---|
| Displacement | Frequency | Weight Lifted (kg) | Displacement | Frequency | Weight Lifted (kg) | |||
| m & f | m | f | m | f | m | f | ||
| [50–125] | 2.5 | 7 | 5 | [50–125] | 6 | 4 | 15 | 10 |
| [30–125] | 2.5 | 5 | 4 | [30–125] | 5 | 3 | 13 | 8 |
| [50–150] | 2.5 | 5 | 4 | [50–150] | 5 | 3 | 13 | 8 |
Figure 3Lifting phases of the lifting task: picking point (a) with squatting technique, intermediate points (b,c) with trunk extension, and destination point (d) up to the final height, and then restart the cycle.
Figure 4(a) Acceleration and angular velocity signals along the 3 axes associated to the NO RISK class, lifting task performed with a LI < 1. (b) Acceleration and angular velocity signals along the 3 axes associated to the RISK class, lifting task performed with a LI > 1.
Scores from stratified tenfold cross-validation (CV) evaluation metrics averaged over the seven subjects (mean ± standard deviation) using the features extracted from the acceleration signals along the three axes.
| Algorithms | Accuracy | Sensitivity | Specificity | AucRoc |
|---|---|---|---|---|
| DT | 0.97 ± 0.05 | 0.97 ± 0.04 | 0.97 ± 0.05 | 0.97 ± 0.03 |
| RF | 0.98 ± 0.02 | 0.98 ± 0.03 | 0.99 ± 0.02 | 0.99 ± 0.01 |
| GB | 0.97 ± 0.04 | 0.98 ± 0.03 | 0.96 ± 0.06 | 0.98 ± 0.02 |
| AB | 0.98 ± 0.02 | 0.98 ± 0.02 | 0.98 ± 0.04 | 0.97 ± 0.03 |
| kNN | 0.90 ± 0.09 | 0.88 ± 0.10 | 0.91 ± 0.11 | 0.94 ± 0.06 |
| NB | 0.96 ± 0.03 | 0.96 ± 0.04 | 0.96 ± 0.04 | 0.99 ± 0.01 |
| MLP | 0.95 ± 0.06 | 0.93 ± 0.07 | 0.96 ± 0.06 | 0.97 ± 0.03 |
| SVM | 0.83 ± 0.20 | 0.83 ± 0.29 | 0.82 ± 0.37 | 0.85 ± 0.22 |
| LR | 0.79 ± 0.15 | 0.79 ± 0.14 | 0.79 ± 0.19 | 0.84 ± 0.13 |
Abbreviations. AB: AdaBoost; AucRoc: Area under the curve Receiver operator characteristic; DT: Decision Tree; GB: Gradient Boost; kNN: k-Nearest Neighbor; LR: Logistic Regression; NB: Naïve Bayes; MLP: Multilayer Perceptron; RF: Random Forest; SVM: Support Vector Machine.
Scores from stratified tenfold CV evaluation metrics averaged over the seven subjects (mean ± standard deviation) using the features extracted from the angular velocity signals along the three axes.
| Algorithms | Accuracy | Sensitivity | Specificity | AucRoc |
|---|---|---|---|---|
| DT | 0.88 ± 0.11 | 0.87 ± 0.14 | 0.88 ± 0.08 | 0.88 ± 0.12 |
| RF | 0.90 ± 0.11 | 0.90 ± 0.13 | 0.89 ± 0.09 | 0.94 ± 0.10 |
| GB | 0.89 ± 0.13 | 0.90 ± 0.14 | 0.88 ± 0.13 | 0.92 ± 0.09 |
| AB | 0.89 ± 0.14 | 0.89 ± 0.16 | 0.89 ± 0.12 | 0.92 ± 0.12 |
| kNN | 0.82 ± 0.10 | 0.81 ± 0.14 | 0.82 ± 0.09 | 0.88 ± 0.09 |
| NB | 0.86 ± 0.12 | 0.83 ± 0.17 | 0.89 ± 0.08 | 0.92 ± 0.08 |
| MLP | 0.90 ± 0.12 | 0.88 ± 0.16 | 0.92 ± 0.09 | 0.94 ± 0.08 |
| SVM | 0.68 ± 0.19 | 0.91 ± 0.13 | 0.44 ± 0.42 | 0.82 ± 0.16 |
| LR | 0.84 ± 0.08 | 0.84 ± 0.13 | 0.84 ± 0.05 | 0.90 ± 0.07 |
Abbreviations. AB: AdaBoost; AucRoc: Area under the curve Receiver operator characteristic; DT: Decision Tree; GB: Gradient Boost; kNN: k-Nearest Neighbor; LR: Logistic Regression; NB: Naïve Bayes; MLP: Multilayer Perceptron; RF: Random Forest; SVM: Support Vector Machine.
Figure 5Feature importance based on the Information Gain value. Abbreviations. aRMSx: x-axis acceleration Root Mean Square; aRMSy: y-axis acceleration Root Mean Square; aRMSz: z-axis acceleration Root Mean Square; aSTDx: x-axis acceleration Standard Deviation; aSTDy: y-axis acceleration Standard Deviation; aSTDz: z-axis acceleration Standard Deviation; aMINx: x-axis acceleration Minimum; aMINy: y-axis acceleration Minimum; aMINz: z-axis acceleration Minimum; aMAXx: x-axis acceleration Maximum; aMAXy: y-axis acceleration Maximum; aMAXz: z-axis acceleration Maximum; vRMSx: x-axis angular velocity Root Mean Square; vRMSy: y-axis angular velocity Root Mean Square; vRMSz: z-axis angular velocity Root Mean Square; vSTDx: x-axis angular velocity Standard Deviation; vSTDy: y-axis angular velocity Standard Deviation; vSTDz: z-axis angular velocity Standard Deviation; vMINx: x-axis angular velocity Minimum; vMINy: y-axis angular velocity Minimum; vMINz: z-axis angular velocity Minimum; vMAXx: x-axis angular velocity Maximum; vMAXy: y-axis angular velocity Maximum; vMAXz: z-axis angular velocity Maximum.
Scores from stratified tenfold CV evaluation metrics considering the entire study sample and using the features extracted from both acceleration and angular velocity signals along the three axes with a non-zero IG.
| Algorithms | Accuracy | Sensitivity | Specificity | AucRoc |
|---|---|---|---|---|
| DT | 0.91 | 0.89 | 0.92 | 0.93 |
| RF | 0.95 | 0.94 | 0.95 | 0.99 |
| GB | 0.95 | 0.94 | 0.96 | 0.99 |
| AB | 0.80 | 0.72 | 0.87 | 0.90 |
| kNN | 0.84 | 0.83 | 0.85 | 0.91 |
| NB | 0.67 | 0.63 | 0.71 | 0.75 |
| MLP | 0.91 | 0.91 | 0.91 | 0.97 |
| SVM | 0.71 | 0.70 | 0.71 | 0.79 |
| LR | 0.66 | 0.67 | 0.66 | 0.68 |
Abbreviations. AB: AdaBoost; AucRoc: Area under the curve Receiver operator characteristic; DT: Decision Tree; GB: Gradient Boost; kNN: k-Nearest Neighbor; LR: Logistic Regression; NB: Naïve Bayes; MLP: Multilayer Perceptron; RF: Random Forest; SVM: Support Vector Machine.
Confusion matrix of the best algorithm in terms of evaluation metrics scores: the GB.
| NO RISK | YES RISK | |
|---|---|---|
| NO RISK | 197 | 13 |
| YES RISK | 8 | 202 |
Scores from leave-one-subject-out evaluation metrics considering the entire study sample and using the features extracted from both acceleration and angular velocity signals along the three axes.
| Algorithms | Accuracy | Sensitivity | Specificity | AucRoc |
|---|---|---|---|---|
| DT | 0.88 | 0.93 | 0.83 | 0.88 |
| RF | 0.88 | 0.97 | 0.80 | 0.97 |
| GB | 0.75 | 0.97 | 0.53 | 0.92 |
| AB | 0.88 | 0.97 | 0.80 | 0.96 |
| kNN | 0.50 | 0.00 | 1.00 | 0.48 |
| NB | 0.82 | 0.97 | 0.67 | 0.97 |
| MLP | 0.73 | 0.97 | 0.50 | 0.87 |
| SVM | 0.77 | 0.97 | 0.57 | 0.82 |
| LR | 0.70 | 0.97 | 0.43 | 0.72 |
Abbreviations. AB: AdaBoost; AucRoc: Area under the curve Receiver operator characteristic; DT: Decision Tree; GB: Gradient Boost; kNN: k-Nearest Neighbor; LR: Logistic Regression; NB: Naïve Bayes; MLP: Multilayer Perceptron; RF: Random Forest; SVM: Support Vector Machine.