| Literature DB >> 30995789 |
Christine F Martindale1, Sebastijan Sprager2, Bjoern M Eskofier3.
Abstract
Activity monitoring using wearables is becoming ubiquitous, although accurate cycle level analysis, such as step-counting and gait analysis, are limited by a lack of realistic and labeled datasets. The effort required to obtain and annotate such datasets is massive, therefore we propose a smart annotation pipeline which reduces the number of events needing manual adjustment to 14%. For scenarios dominated by walking, this annotation effort is as low as 8%. The pipeline consists of three smart annotation approaches, namely edge detection of the pressure data, local cyclicity estimation, and iteratively trained hierarchical hidden Markov models. Using this pipeline, we have collected and labeled a dataset with over 150,000 labeled cycles, each with 2 phases, from 80 subjects, which we have made publicly available. The dataset consists of 12 different task-driven activities, 10 of which are cyclic. These activities include not only straight and steady-state motions, but also transitions, different ranges of bouts, and changing directions. Each participant wore 5 synchronized inertial measurement units (IMUs) on the wrists, shoes, and in a pocket, as well as pressure insoles and video. We believe that this dataset and smart annotation pipeline are a good basis for creating a benchmark dataset for validation of other semi- and unsupervised algorithms.Entities:
Keywords: activity recognition; benchmark database; cyclic activities; gait analysis; gait phases; home monitoring; inertial measurement unit; semi-supervised learning; smart annotation
Mesh:
Year: 2019 PMID: 30995789 PMCID: PMC6515106 DOI: 10.3390/s19081820
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Participant characteristics of publicly Available Wearable-based Cyclic Activity Recognition Databases (Healthy subjects).
| Dataset Name | No. of Subjects | Duration | Cyclic Labels |
|---|---|---|---|
| MAREA [ | 20 | 34 min per subject | Heel strike, toe off (using FSR) |
| Bradjic [ | 27 | Unknown | Step count (from video) |
| CMU-MMAC [ | 43 | 5 recipes | N/A |
| MHAD [ | 12 | 82 min (total) | N/A |
| OU-ISIR2012 [ | 744 | Under 20 steps per subject | N/A |
| ZJU-GaitAcc [ | 175 | 6 times 20 m | Cycle borders (Signal analysis) |
| Real World (HAR) [ | 15 | 70 min per subject | N/A |
| Digital Biobank [ | 70 | 40 m plus 2 times 2 min | Cycle borders (Signal analysis) |
| Kluge [ | 15 | 1166 strides | Heel strike, toe off, heel off |
| (Mocap) | |||
| Dailiac [ | 23 | 20 min | N/A |
| Martindale [ | 18 | 3500 strides | Cycle borders |
| BASA [ | 15 | 20 min per subject | Step count |
| FAU-Gait (current paper) | 80 | 20 min per subject | Heel strike, toe off |
| (extending [ | (smart annotation) |
Sensor and activity details of publicly Available Wearable-based Cyclic Activity Recognition Databases (Healthy subjects).
| Dataset Name | Sensors and Sensor Location |
|
|---|---|---|
| MAREA [ | FSR, Acc (128 Hz) | Treadmill & outdoors, |
| Left wrist, ankles, waist | Walk & run, slope & flat | |
| Bradjic [ | Acc, Gyr, Mag (100 Hz) | Slow, normal & fast |
| Phone in pockets/hand/bag | straight walk | |
| CMU-MMAC [ | Acc, Gyr, (60 Hz) Mocap, Audio | Cooking |
| Back, legs & arms | ||
| MHAD [ | Acc (30 Hz), mocap | 11 actions: Jump, clap |
| Wrists, ankles, hips | Throw, wave, punch, ... | |
| OU-ISIR2012 [ | Acc, Gyr, (100 Hz) | Level walking |
| Waist | Slope & stair | |
| ZJU-GaitAcc [ | Acc (100 Hz) | Walk |
| Arm, wrist, waist, ankle, thigh | ||
| Real World (HAR) [ | Acc, Gyr, GPS, Mag, Light, Sound | Walk, run, sit, stand, |
| Chest, forearm, head, shin, | Lie, stairs, jump | |
| Thigh, upper arm, and waist | ||
| Digital Biobank [ | Acc, Gyr (100 Hz) | Straight walk |
| Shoes | ||
| Kluge [ | Acc, Gyr (100 Hz) | Level walk |
| Shoes | ||
| Dailiac [ | Acc, Gyr (100 Hz) | 13 activities: Sit, lie, |
| Shoes, chest, hip, wrist | Walk, stair, treadmill, skip ... | |
| Indoor & outdoors | ||
| Martindale [ | Acc, Gyr (200 Hz) | Walk, run, stand |
| Ankles | Outdoors | |
| BASA [ | Acc, Gyr (200 Hz) | Stairs, walk jog, |
| Shoe, wrist | Sit, lie, stand | |
| FAU-Gait (current paper) | Acc, Gyr (200 Hz), Pressure insole | Walk, jog, run, stand, |
| (extending [ | Shoes, wrists, pocket | Jump, hop, skip, cycle, ... |
Abbreviations: Accelerometer (Acc), Gyroscope (Gyr), Magnetometer (Mag), Force sensitive resistor (FSR), Motion capture (mocap).
Subject characteristics.
| Characteristic | Unit | Mean and Variance |
|---|---|---|
| Age | [years] | 27 ± 6 |
| Gender | [F/M] | 28/52 |
| Height | [cm] | 174 ± 7 |
| Weight | [kg] | 66 ± 18 |
| Shoe size | (EU) | 41 ± 2 |
| Handedness | [R/L] | 74/4 |
| Location | [Erlangen/Ljubljana] | 56/24 |
Figure 1Sensor type and location. (a) Photograph of IMU sensor system in 3D printed case. (b) Diagram showing sensor attachment to shoe, using industrial Velcro and Moticon insole, which was used instead of the original sports shoe insole pressure sensor location and approximate size and shape within the insole. Also showing axes location for the insole accelerometer. (c) IMU sensor locations on the body with corresponding axes.
Figure 2Layout of the activities (Germany). The starting point of each activity is shown by the letter whose key is in the table on the right. Double lines in the table separate the 4 sections of activities.
Tasks with corresponding activity labels; indicating if activity is cyclic and if it includes phases.
| Task | Cyclic Activities with Phase Labels | Non-Cyclic Activities |
|---|---|---|
| Walk 2 × 20 m | Walk | Rest |
| Walk slalom | Walk | Rest |
| Walk circuit | Walk | Rest |
| Walk between activities | Walk | Rest |
| Sign name on posters | Walk | Rest |
| Jog 2 × 20 m | Jog | Rest |
| Jog circuit | Jog | Rest |
| Run 2 × 20 m | Run | Rest |
| Run slalom | Run | Rest |
| Run circuit | Run | Rest |
| Sitting at tables | Walk | Sit, Rest |
| Stepping | Stairs | Rest |
| Stairs | Stairs | Rest |
| Double jump | Jump | Rest |
| Jumping | Jump | Rest |
| Jumping on spot | Jump | Rest |
| Side-stepping | Side-step | Rest |
| Hopping | Hop | Rest |
| Skipping | Skip | Rest |
| Cycling | Cycle (no phase labels) | Rest |
| Running on spot | RunOnSpot | Rest |
Figure 3Logarithmic histogram of walking bout lengths, for all subjects.
Duration and quantity of activities.
| Activity | Mean Duration | Mean Cycle Count | Total Cycles |
|---|---|---|---|
| Walking | 523 s ± 63 s | 492 ± 56 | 78,684 |
| Jogging | 60 s ± 12 s | 78 ± 17 | 12,475 |
| Running | 55 s ± 14 s | 85 ± 18 | 13,592 |
| Stairs | 65 s ± 11 s | 67 ± 6 | 10,709 |
| Jumping | 40 s ± 6 s | 77 ± 8 | 12,333 |
| Hopping | 11 s ± 2 s | 12 ± 2 | 1932 |
| Skipping | 14 s ± 4 s | 27 ± 10 | 4389 |
| Side-step | 9 s ± 2 s | 16 ± 3 | 2653 |
| Run on spot | 16 s ± 2 s | 21 ± 1 | 985 |
| Cycling | 62 s ± 10 s | 85 ± 18 | 9482 |
| Sit | 120 s ± 34 s | - | - |
| Rest | 397 s ± 106 s | - | - |
| Total | 1343 s ± 137 s | 983 ± 67 | 157,340 |
calculated per person, on the left foot data; total for all subject.
Figure 4Boxplot of the stride times per activity, over all subjects.
Figure 5Boxplot of the swing duration per activity, over all subjects.
Figure 6Boxplots of the effort required for manual correction of each batch of subjects using the smart annotation approaches. (a) Shows the actual effort per batch where Batch 1 was using the edge-detection method and Batches 2 to 7 were using the hHMM method, where the data from the previous batches is always added to the training set of the subsequent model. (b) Shows the labeling effort for each batch’s hHMM model when predicting the subject data from Batch 7.
Mean of the labeling effort per activity, over all subjects in a given batch. Effort being the percentage of labels added or removed during manual labeling.
| Activity | Overall Effort [%] | Effort Using EdgeDet [%] | Effort Using hHMM [%] |
|---|---|---|---|
| All 7 Batches | Batch 1 | Batch 7 | |
| All | 16.3 ± 4.3 | 16.0 ± 5.3 | 14.1 ± 1.8 |
| Walking (total) | 12.5 ± 5.7 | 16.8 ± 6.1 | 8.5 ± 1.8 |
| - circuit | 6.2 ± 8.9 | 12.6 ± 11.4 | 0.8 ± 0.6 |
| - 2 times 20 m | 7.1 ± 8.9 | 14.6 ± 11.4 | 2.2 ± 1.4 |
| - slalom | 7.9 ± 10.4 | 15.9 ± 15.4 | 2.2 ± 2.0 |
| - between activities | 11.0 ± 6.2 | 16.4 ± 6.0 | 7.6 ± 3.1 |
| - posters | 38.7 ± 17.6 | 51.5 ± 14.1 | 23.5 ± 9.1 |
| - between sitting | 81.2 ± 40.7 | 45.4 ± 12.6 | 83.5 ± 47.9 |
| Jogging (total) | 9.1 ± 8.4 | 14.5 ± 9.4 | 4.9 ± 2.1 |
| - circuit | 7.6 ± 8.2 | 13.0 ± 9.3 | 1.2 ± 0.6 |
| - 2 times 20 m | 12.9 ± 11.7 | 18.1 ± 13.5 | 15.7 ± 5.3 |
| Running (total) | 15.5 ± 8.1 | 18.0 ± 9.4 | 14.9 ± 11.3 |
| - circuit | 7.9 ± 9.7 | 17.5 ± 12.4 | 1.2 ± 0.7 |
| - 2 times 20 m | 22.7 ± 14.4 | 26.2 ± 17.0 | 20.4 ± 12.8 |
| - cones | 25.4 ± 19.2 | 10.6 ± 7.7 | 43.1 ± 28.8 |
| Stairs (total) | 38.7 ± 18.9 | 18.4 ± 10.0 | 49.4 ± 14.9 |
| - stepping | 35.3 ± 22.7 | 17.0 ± 13.8 | 36.3 ± 18.3 |
| - stairs | 44.0 ± 24.2 | 20.3 ± 10.5 | 70.8 ± 23.3 |
| Jumping (total) | 12.4 ± 10.7 | 9.5 ± 7.6 | 8.1 ± 2.5 |
| - jumping jacks | 7.7 ± 14.2 | 6.6 ± 6.8 | 2.6 ± 1.9 |
| - jumping on spot | 14.7 ± 23.8 | 13.3 ± 17.6 | 3.7 ± 2.9 |
| - double jump | 20.7 ± 15.1 | 12.3 ± 9.5 | 24.1 ± 10.4 |
| Hopping | 46.3 ± 29.6 | 27.0 ± 21.5 | 56.7 ± 28.1 |
| Skipping | 26.2 ± 32.6 | 9.9 ± 9.5 | 23.8 ± 15.2 |
| Side-step | 47.1 ± 37.5 | 12.6 ± 8.8 | 40.9 ± 14.3 |
| Run on spot | 12.3 ± 12.4 | 12.3 ± 12.4 | - |
| Cycle | 2.9 ± 3.7 | - | - |
Using the peak-detection method.
Mean of the F1 score per activity, over all subjects in a given batch.
| Activity | Overall F1 [%] | F1 Using EdgeDet [%] | F1 Using hHMM [%] |
|---|---|---|---|
| All 7 Batches | Batch 1 | Batch 7 | |
| All | 89.5 ± 4.8 | 83.8 ± 4.7 | 93.0 ± 0.8 |
| Walking (total) | 90.5 ± 7.5 | 80.5 ± 5.7 | 95.8 ± 0.9 |
| - circuit | 94.1 ± 8.7 | 84.4 ± 9.9 | 99.6 ± 0.3 |
| - 2 times 20 m | 96.4 ± 4.7 | 92.5 ± 6.1 | 98.9 ± 0.7 |
| - slalom | 96.0 ± 5.3 | 91.9 ± 7.8 | 98.9 ± 1.0 |
| - between activities | 90.0 ± 9.5 | 76.7 ± 6.0 | 96.2 ± 1.5 |
| - posters | 79.3 ± 11.1 | 68.2 ± 10.9 | 88.0 ± 4.6 |
| - between sitting | 65.3 ± 10.7 | 71.9 ± 9.1 | 67.2 ± 13.2 |
| Jogging (total) | 93.7 ± 6.8 | 86.9 ± 7.8 | 97.5 ± 1.1 |
| - circuit | 93.2 ± 8.7 | 83.4 ± 9.7 | 99.4 ± 0.3 |
| - 2 times 20 m | 93.5 ± 6.1 | 90.6 ± 7.2 | 92.1 ± 2.7 |
| Running (total) | 90.2 ± 6.2 | 84.3 ± 6.8 | 92.4 ± 5.8 |
| - circuit | 92.7 ± 9.8 | 80.2 ± 9.4 | 99.4 ± 0.4 |
| - 2 times 20 m | 88.3 ± 7.7 | 86.4 ± 9.0 | 89.5 ± 6.8 |
| - cones | 86.0 ± 10.1 | 91.5 ± 5.3 | 78.4 ± 14.6 |
| Stairs (total) | 79.0 ± 7.7 | 84.3 ± 6.6 | 75.6 ± 7.6 |
| - stepping | 81.5 ± 10.6 | 88.1 ± 8.1 | 81.9 ± 9.2 |
| - stairs | 75.2 ± 10.1 | 78.0 ± 8.2 | 65.4 ± 11.9 |
| Jumping (total) | 92.9 ± 5.9 | 92.5 ± 5.2 | 95.9 ± 1.2 |
| - jumping jacks | 95.1 ± 9.6 | 94.8 ± 4.7 | 98.7 ± 0.9 |
| - jumping on spot | 92.4 ± 11.1 | 91.5 ± 9.4 | 98.2 ± 1.4 |
| - double jump | 89.0 ± 7.0 | 90.6 ± 6.2 | 87.9 ± 5.2 |
| Hopping | 77.4 ± 11.8 | 82.1 ± 9.6 | 72.9 ± 13.3 |
| Skipping | 87.2 ± 13.4 | 92.5 ± 6.2 | 88.2 ± 7.5 |
| Side-step | 76.8 ± 14.7 | 90.3 ± 5.2 | 79.7 ± 7.5 |
| Run on spot | 90.9 ± 7.4 | 90.9 ± 7.4 | - |
Mean of the miss rate (MR) and false discovery rates (FDR) per activity, over all subjects in a given batch.
| Activity | Overall | Overall | MR [%] | FDR [%] | MR [%] | FDR [%] |
|---|---|---|---|---|---|---|
| Activity | MR [%] | FDR [%] | EdgeDet | EdgeDet | hHMM | hHMM |
| All 7 batches | All 7 batches | Batch 1 | Batch 1 | Batch 7 | Batch 7 | |
| All | 10.4 ± 6.5 | 10.5 ± 3.6 | 19.1 ± 5.1 | 13.1 ± 4.6 | 0.6 ± 6.4 | 7.6 ± 1.2 |
| Walking (total) | 9.3 ± 9.5 | 9.6 ± 5.7 | 22.6 ± 6.1 | 16.1 ± 5.8 | 0.8 ± 3.3 | 5.1 ± 1.4 |
| - circuit | 6.4 ± 9.1 | 5.4 ± 8.3 | 17.1 ± 9.9 | 14.1 ± 10.0 | 0.5 ± 0.5 | 0.3 ± 0.2 |
| - 2 times 20 m | 3.9 ± 5.7 | 3.3 ± 3.9 | 9.0 ± 7.7 | 5.9 ± 4.7 | 0.5 ± 1.2 | 1.0 ± 1.2 |
| - slalom | 4.1 ± 5.7 | 3.9 ± 5.0 | 9.1 ± 8.2 | 7.0 ± 7.5 | 1.0 ± 1.1 | 1.1 ± 2.2 |
| - between activities | 9.6 ± 11.1 | 10.2 ± 8.2 | 25.4 ± 6.6 | 20.9 ± 6.3 | 1.4 ± 3.2 | 4.3 ± 1.7 |
| - posters | 22.5 ± 16.1 | 16.2 ± 9.2 | 43.1 ± 12.0 | 12.9 ± 8.7 | 5.3 ± 13.5 | 10.2 ± 5.7 |
| - between sitting | 28.9 ± 11.9 | 34.1 ± 21.6 | 40.8 ± 10.3 | 6.9 ± 8.2 | 8.6 ± 20.7 | 40.0 ± 17.6 |
| Jogging (total) | 6.9 ± 7.6 | 5.7 ± 6.0 | 14.9 ± 8.8 | 11.0 ± 7.4 | 1.0 ± 3.0 | 2.0 ± 1.1 |
| - circuit | 7.5 ± 9.5 | 6.0 ± 8.2 | 18.7 ± 9.8 | 14.3 ± 10.0 | 0.4 ± 0.9 | 0.3 ± 0.3 |
| - 2 times 20 m | 6.9 ± 7.1 | 6.0 ± 5.4 | 11.0 ± 9.1 | 7.4 ± 6.1 | 2.6 ± 8.9 | 7.0 ± 2.9 |
| Running (total) | 11.1 ± 7.3 | 8.4 ± 5.3 | 18.3 ± 7.4 | 12.8 ± 6.7 | 6.4 ± 8.3 | 6.8 ± 5.2 |
| - circuit | 8.5 ± 11.1 | 5.9 ± 8.5 | 23.1 ± 10.0 | 16.0 ± 9.3 | 0.6 ± 1.1 | 0.1 ± 0.3 |
| 2 times 20 m | 13.1 ± 9.1 | 10.1 ± 6.7 | 15.9 ± 10.7 | 10.9 ± 8.4 | 8.7 ± 12.2 | 8.6 ± 5.0 |
| - cones | 14.7 ± 11.8 | 13.1 ± 8.7 | 9.3 ± 6.4 | 7.5 ± 5.6 | 15.1 ± 21.6 | 21.7 ± 14.1 |
| Stairs (total) | 21.2 ± 7.6 | 20.3 ± 9.5 | 20.2 ± 8.5 | 10.2 ± 6.0 | 8.7 ± 23.3 | 25.5 ± 6.6 |
| - stepping | 19.0 ± 10.4 | 17.8 ± 11.5 | 14.9 ± 9.8 | 8.4 ± 7.2 | 9.7 ± 17.6 | 18.5 ± 8.8 |
| - stairs | 24.5 ± 11.3 | 24.2 ± 12.1 | 28.1 ± 10.7 | 13.7 ± 8.5 | 13.4 ± 32.5 | 36.5 ± 10.7 |
| Jumping (total) | 7.2 ± 7.4 | 6.7 ± 5.2 | 9.2 ± 6.1 | 5.6 ± 4.8 | 1.1 ± 3.9 | 4.2 ± 1.5 |
| - jumping jacks | 5.7 ± 11.9 | 3.4 ± 4.9 | 7.3 ± 6.7 | 2.9 ± 3.1 | 1.1 ± 1.2 | 1.4 ± 0.9 |
| - jumping on spot | 6.9 ± 11.0 | 7.9 ± 11.7 | 10.0 ± 10.7 | 6.5 ± 8.9 | 1.4 ± 1.0 | 2.6 ± 2.3 |
| - double jump | 9.7 ± 6.7 | 12.1 ± 8.3 | 9.8 ± 6.4 | 8.9 ± 7.0 | 5.1 ± 12.2 | 11.9 ± 5.5 |
| Hopping | 18.5 ± 12.2 | 25.2 ± 14.1 | 19.4 ± 12.9 | 15.1 ± 9.7 | 13.6 ± 24.1 | 29.5 ± 14.2 |
| Skipping | 11.0 ± 11.8 | 14.1 ± 15.2 | 7.6 ± 7.7 | 7.0 ± 6.8 | 7.0 ± 11.1 | 12.5 ± 8.0 |
| Side-step | 20.7 ± 14.4 | 25.1 ± 15.5 | 9.1 ± 6.9 | 10.0 ± 5.3 | 8.5 ± 19.3 | 21.0 ± 7.2 |
| Run on spot | 9.4 ± 7.5 | 8.6 ± 7.7 | 9.4 ± 7.5 | 8.6 ± 7.7 | - | - |