| Literature DB >> 33171977 |
Jack H Geissinger1, Alan T Asbeck2.
Abstract
In recent years, wearable sensors have become common, with possible applications in biomechanical monitoring, sports and fitness training, rehabilitation, assistive devices, or human-computer interaction. Our goal was to achieve accurate kinematics estimates using a small number of sensors. To accomplish this, we introduced a new dataset (the Virginia Tech Natural Motion Dataset) of full-body human motion capture using XSens MVN Link that contains more than 40 h of unscripted daily life motion in the open world. Using this dataset, we conducted self-supervised machine learning to do kinematics inference: we predicted the complete kinematics of the upper body or full body using a reduced set of sensors (3 or 4 for the upper body, 5 or 6 for the full body). We used several sequence-to-sequence (Seq2Seq) and Transformer models for motion inference. We compared the results using four different machine learning models and four different configurations of sensor placements. Our models produced mean angular errors of 10-15 degrees for both the upper body and full body, as well as worst-case errors of less than 30 degrees. The dataset and our machine learning code are freely available.Entities:
Keywords: inertial sensors; kinematics; motion dataset; self-supervised learning; sparse sensors
Year: 2020 PMID: 33171977 PMCID: PMC7664234 DOI: 10.3390/s20216330
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1An overview of the approach to full-body human motion inference in this paper. We use a set of sparse segments, pass five frames (0.125 ms) of segment orientation and linear acceleration data into a neural network, and then predict full-body segment orientations for those five frames. The dots on the second mannequin (“Sparse Segments”) indicate the segment locations used to predict the remaining kinematics; the blue dot is the location of the sensor to which the others are normalized. XSens data is used for the input and ground truth. The symbols and refer to orientation and linear acceleration at time t, respectively, for each segment. Note that we also study upper-body motion inference (with 15 segments of full body information, and 3–4 segments used as the sparse inputs), and we study Transformers as another deep learning algorithm.
Figure 2(a) An XSens MVN Link inertial sensor (MTX2-4A7G6) placed on a footpad that is inserted into the participant’s shoe under the laces during data collection. (b) A bodypack connected to a battery. The bodypack also has two ports for connecting the sensors. The button on the lower right of the bodypack is for turning on/off the bodypack, and for starting a calibration or data collection session. (c) A pair of XSens inertial sensors attached to the hand and wrist. The glove contains a pouch with Velcro for attaching an inertial sensor and webbing that constrains the sensor from rotating. The wrist’s inertial sensor was attached to a portion of the strap and then wrapped with the remainder of the strap to constrain it further and avoid any unwanted rotation.
Figure 3Location and placement of the inertial sensors in this study using the guidelines set by XSens. The sensors were also wrapped with excess webbing, as shown in Figure 2c (not shown in this photo for ease of seeing the sensor locations). Some participants used custom shirts with Velcro patches when the XSens shirt was too small. The wires along the back and limbs were secured and bundled together to make sure they did not catch on objects. Note that there is a sensor on the sternum (indicated by the orange box), but it is inside a pocket on the front of the shirt so is not visible in the photo.
Figure 4Diagrams of a mannequin showing the inertial sensor locations for the different configurations examined. The white dots denote the segments whose inertial sensor data (segment orientation, acceleration) is passed into the model. These segments are normalized relative to the blue dot, which also has its segment data passed into the model. Note that the only differences between the upper-body and full-body for each configuration are the sensor data about the lower legs. Using this additional information the full-body models are responsible for predicting the orientation of the 8 segments of the lower-body in addition to the upper-body.
An overview of the natural motion dataset presented in this paper, and comparison to the largest comparable datasets. Further information about the dataset is in the Appendix A in Table A1. Our dataset has a comparable number of hours to two other large motion capture datasets, Archive of Motion Capture as Surface Shapes (AMASS) and KIT, although they have many more subjects than our dataset. Our dataset included more hours per subject since subjects did everyday activities in real world environments outside of laboratories. A benefit of KIT is that it includes object interaction, hand motions, and other data on top of full-body kinematics. In the table, MoCap stands for Motion Capture, and MoSh++ is the improved version of Motion and Shape capture [10].
| Ours | AMASS [ | KIT [ | |
|---|---|---|---|
| Hours of Data | 40.6 | 45.2 | 37.2 |
| Number of Subjects | 17 | 460 | 224 |
| M/F | 13/4 | N/A | 106/37 |
| Avg. Hours/Subject | 2.39 | 0.10 | 0.17 |
| Lab Environment | No | Yes | Yes |
| Body Kinematics Only | Yes | Yes | No |
| Methodology | Inertial MoCap | SMPL [ | Optical MoCap |
Metadata for our dataset. Subjects (“Subj.”) with a “P” prefix were participants on or near Virginia Tech’s campus, while a “W” prefix indicates participants at a home improvement store. The number of frames in the log (“# Frames”) and the duration (“Dur.”) in hours are specified. The Notes column describes the general activities the participants were doing during data collection.
| Subj. | Day | # Frames | Dur. | Notes |
|---|---|---|---|---|
|
|
|
|
| |
| P1 | Day 1 | 1557894 | 1.8 | The participant was in a lab environment operating an experiment involving a virtual reality setup and other hardware. They stand and sit repeatedly to make adjustments. They manipulated many things with their hands. |
| P2 | Day 1 | 1103415 | 1.27 | The participant was sitting in a chair in an office environment, performing work on a computer that included CAD design. The participant was also talking with others and walking around. The participant was having a technical discussion with another person throughout the duration of the data collection. |
| Day 2 | 1293893 | 1.5 | The participant was working in a lab environment, manipulating scrap metal and other materials with their hands. They also operated a drill press. The participant was talking with other people, working with his hands, and walking around an office. | |
| Day 3 | 1604016 | 1.85 | The participant was working in a lab environment, manipulating scrap metal and other materials with their hands. They also operated a drill press. The participant was talking with other people, working with his hands, and walking around an office. | |
| Day 4 | 1349583 | 1.56 | The participant was cleaning up a lab environment and machine shop. They stand and walk around for most of the trial while manipulating objects with their hands. | |
| P3 | Day 1 | 1377289 | 1.6 | The participant was in an open office/lab environment at their desk. The participant was reading and writing on a tablet while sitting and reclining in their chair. The participant also walked around some, spoke with others, and manipulated things with their hands. |
| P4 | Day 1 | 1709725 | 1.97 | The participant was sitting in an office environment working at a computer. The participant was sitting and manipulating things at their desk for the majority of the trial. |
| Day 2 | 529900 | 0.61 | The participant was sitting at a desk in a lab environment doing office work. | |
| P5 | Day 1 | 272731 | 0.32 | The participant attended a sketching session. They walked to the sketching session and drew for the remainder of the log. They did other interesting things, like hold the door open for people, reach into their backpack while sitting down, and operate their laptop in their lap. |
| P6 | Day 1 | 1709741 | 1.98 | The participant was sitting at a desk, working on a computer, and talking with others. |
| Day 2 | 1242864 | 1.44 | The participant was standing for the duration of the log and operating an experiment. They were primarily interacting and talking with an experiment collaborator and the participant who was testing an exoskeleton and assisting with data collection. They were leading a person through the experiment, handing them weights, standing, and performing other actions. | |
| P7 | Day 1 | 1256115 | 1.45 | The participant walked around Virginia Tech’s campus, went to class, and also drove around in their car to a local department store. |
| P8 | Day 1 | 1894520 | 2.19 | The participant walked around campus, attended office hours, and worked on homework with their computer while sitting down. |
| P9 | Day 1 | 3299680 | 3.82 | The participant walked across campus to meet a friend for lunch, sat down, ate food, and did stretching motions to show their friend how the XSens worked. They also went to class, took notes, and spoke with people about the XSens suit. |
| P10 | Day 1 | 1358481 | 1.57 | The participant walked around campus to a teacher’s office hours and worked in the office. They worked on a computer and on their homework while standing up. |
| P11 | Day 1 | 2574733 | 2.98 | The participant walked across Virginia Tech’s campus to attend class. The participant then walked back across campus, got take-out food from a restaurant, and returned to the starting location. |
| P12 | Day 1 | 1709704 | 1.98 | The participant walked across campus to a lab meeting and their office where they sat and discussed things with other people. |
| P13 | Day 1 | 1587648 | 1.84 | The participant was at their home near Virginia Tech’s campus. They were doing Spring cleaning activities, such as sweeping and vacuuming. They also emptied a dishwasher, played the piano, played video games, and took a nap. |
| Day 2 | 209547 | 0.24 | The participant did exercises at their home near Virginia Tech’s campus, including Frankensteins, jumping jacks, pushups, mountain climbers, burpees, and some jogging. The participant was working at a standing desk and using their smartphone at their home. The participant also walked to their car and started to drive to complete some errands. The later data in the car had errors so it was trimmed to only include the participant reversing out of their parking spot. | |
| W1 | Day 1 | 1707956 | 1.98 | The participant pushed and pulled carts, looked at and wrote on a clipboard or a touchpad, placed items on shelves, and bent down to objects low to the ground. |
| W2 | Day 1 | 506650 | 0.59 | The participant pushed/pulled carts, bent over to objects low to the floor and knelt on the ground to pull something once. |
| Day 2 | 1159655 | 1.35 | The participant operated a small touchscreen, spent some time interacting with something overhead, leaned far forward for something and bent over in other postures, as well. They also poured some material, pulled and pushed a cart, sat down for a small time, and operated machinery. The participant also operated a dolly, including putting an object onto it. | |
| Day 3 | 715716 | 0.83 | The participant walked around while pushing an object similar to a wheelbarrow. They also pushed and pulled carts. They did some overhead lifting of materials and low-height object manipulation with various bending postures. The participant carried objects on their shoulders and spent some time looking upwards at something. | |
| W3 | Day 1 | 1709720 | 1.97 | The participant sorted materials and placed objects in containers. They carried objects around. The participant also did low-height object manipulation with various bending postures. They also pushed and pulled carts. |
| W4 | Day 1 | 1646669 | 1.9 | The participant did low-height object manipulation, including squatting. They operated machinery while standing up (possibly a stand-up forklift), pushed and pulled carts, and spent a lot of time sorting bins and other containers. |
The performance of the various models with differing configurations for upper-body motion inference with the regular test set. The values reported are the mean angle difference in degrees for the test set. Each configuration uses the entire upper-body as output (15 segment orientations with the pelvis included). Figure 4 shows diagrams of the configurations. Bolded values are the minimum in each column, and bolded gray values are within of the minimum.
| Model | Config. 1 | Config. 2 | Config. 3 | Config. 4 |
|---|---|---|---|---|
| Seq2Seq |
|
|
| 15.64 |
| Seq2Seq (BiRNN, Attn) |
|
|
| 15.69 |
| Transformer Enc. |
| 12.56 |
|
|
| Transformer |
|
|
| 15.77 |
The performance of the various models with differing configurations for full-body motion inference with the regular test set. The values reported are the mean angle difference in degrees for the test set. Each configuration uses the full-body as output. Figure 4 shows diagrams of the configurations. Bolded values are the minimum in each column, and bolded gray values are within of the minimum.
| Model | Config. 1 | Config. 2 | Config. 3 | Config. 4 |
|---|---|---|---|---|
| Seq2Seq | 11.54 |
|
|
|
| Seq2Seq (BiRNN, Attn) | 11.48 |
|
|
|
| Transformer Enc. | 11.53 |
| 12.97 |
|
| Transformer |
|
|
| 18.16 |
The performance of the various models doing upper-body motion inference with the special test set. The values reported are the mean angle difference in degrees. Each configuration uses the entire upper-body as output (15 segment orientations with the pelvis included). Figure 4 shows diagrams of the configurations. Bolded values are the minimum in each column, and bolded gray values are within of the minimum.
| Model | Config. 1 | Config. 2 | Config. 3 | Config. 4 |
|---|---|---|---|---|
| Seq2Seq | 19.38 | 18.52 | 22.18 | 22.85 |
| Seq2Seq (BiRNN, Attn) |
|
|
| 22.83 |
| Transformer Enc. |
| 19.95 | 22.92 | 22.52 |
| Transformer | 18.71 |
|
|
|
The performance of the various models doing full-body motion inference with the special test set. The values reported are the mean angle difference in degrees. Figure 4 shows diagrams of the different configurations. Bolded values are the minimum in each column; there are no values within of the minimum in this table.
| Model | Config. 1 | Config. 2 | Config. 3 | Config. 4 |
|---|---|---|---|---|
| Seq2Seq | 21.35 | 22.03 |
| 23.17 |
| Seq2Seq (BiRNN, Attn) |
|
| 21.48 | 24.51 |
| Transformer Enc. | 22.54 | 20.78 | 23.36 |
|
| Transformer | 21.64 | 22.30 | 22.08 | 28.12 |
Figure 5Histograms showing the performance of the different models on the regular test set under different upper-body configurations.
Figure 6Histograms showing the performance of the different models on the regular test set under different full-body configurations.
Figure 7A set of sitting postures from the validation set where the participant went to a drawing/sketching club meeting. The rows are of different postures, such as (a) sitting at a desk and typing on a computer, (b) sitting at a desk with elbows on the table, (c) sitting at a desk reaching down into their bag, and (d) sitting at a desk and sketching.
Figure 8A set of postures from the validation set where the participant was working at a home improvement store. The rows are of different postures, such as (a) standing with legs crossed talking with someone, (b) reaching for something in a one-legged (“golfer’s”) lift, (c) reaching for something with legs split fore-aft, and (d) reaching up for something overhead.
Figure 9A set of postures from the validation set where the participant was working at a home improvement store. The rows are of different postures, such as (a) walking with their hand on a cart, (b) kneeling on the ground while reaching for something, (c) putting on a vest, and (d) lifting something with both hands.
Figure 10A set of postures from the validation set where the participant was working at a home improvement store. The Transformer had low accuracy (>20) for each of these postures. The rows are of different postures, such as (a) operating machinery (possible a stand-up forklift), (b) reaching for an object or box across their chest, (c) reaching for an object with one hand across their chest, and (d) performing some action with high elbow flexion.
Figure 11A set of postures from the special test set where the participant was at their apartment. The rows are of different postures, such as (a) lying in bed napping ( mean angle difference with Transformer), (b) doing push ups ( mean angle difference with Transformer), (c) doing Frankensteins ( mean angle difference with Transformer), and (d) doing mountain climbers ( mean angle difference with Transformer).
Figure 12A set of postures from the special test set where the participant was at their apartment. The rows are of different postures during a burpee exercise where the participant does a push up, brings their feet to their hands, and then springs up. The errors for these postures are 61° and 55°, respectively. In (a), the participant is bent over fully after bringing their feet to their hands. In (b), the participant is moving their feet back to drop into a push up position.