| Literature DB >> 35890959 |
Yukihiko Aoyagi1, Shigeki Yamada2,3,4,5, Shigeo Ueda6, Chifumi Iseki7, Toshiyuki Kondo7, Keisuke Mori8, Yoshiyuki Kobayashi9, Tadanori Fukami10, Minoru Hoshimaru6, Masatsune Ishikawa4,11, Yasuyuki Ohta7.
Abstract
To quantitatively assess pathological gait, we developed a novel smartphone application for full-body human motion tracking in real time from markerless video-based images using a smartphone monocular camera and deep learning. As training data for deep learning, the original three-dimensional (3D) dataset comprising more than 1 million captured images from the 3D motion of 90 humanoid characters and the two-dimensional dataset of COCO 2017 were prepared. The 3D heatmap offset data consisting of 28 × 28 × 28 blocks with three red-green-blue colors at the 24 key points of the entire body motion were learned using the convolutional neural network, modified ResNet34. At each key point, the hottest spot deviating from the center of the cell was learned using the tanh function. Our new iOS application could detect the relative tri-axial coordinates of the 24 whole-body key points centered on the navel in real time without any markers for motion capture. By using the relative coordinates, the 3D angles of the neck, lumbar, bilateral hip, knee, and ankle joints were estimated. Any human motion could be quantitatively and easily assessed using a new smartphone application named Three-Dimensional Pose Tracker for Gait Test (TDPT-GT) without any body markers or multipoint cameras.Entities:
Keywords: deep learning; markerless motion capture; motion tracking; quantitative gait assessment; smartphone device
Mesh:
Year: 2022 PMID: 35890959 PMCID: PMC9322512 DOI: 10.3390/s22145282
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Sample of the input training data for deep learning. The three consecutive images on the left (black 1–3) are dancing movements, and those on the right (white 1–3) are walking movements.
Figure 2Overview of the training process and backbone network for the pose estimation framework. The colored cells are the basic cells designed using ResNet34, and the white cells are the added cells.
Figure 3Output 3D relative coordinates of 24 key points on 3D heatmaps.
Figure 4Snapshots of the TDPT for Gait Test (TDPT-GT) application and calculated 3D joint angles and artificial intelligence (AI) scores, also called the confidence scores in a healthy young volunteer. The iPhone was fixed as horizontal as possible to the floor. The entire body of the subject from the head to the toes must always be in the videoframe. The blue-colored number at the left of the videoframe shows the reliability of the position information of the entire body as an AI score, and the small numbers below show the 3D angles (AI scores) of the left and right knee joints, and (AI scores) of the left and right ankle joints.
Figure 5Three-dimensional angles (solid lines) calculated by the relative 3D coordinates estimated by the TDPT for Gait Test (TDTP-GT) application and artificial intelligence (AI) scores (dotted lines). When each AI score is 0.7 (black dotted line) or higher, the 3D angles at the right (blue) and left (red) hip joints (a), knee joints (b), and ankle joints (c) are considered to be relatively reliable.
Figure 6Simultaneous measurement by two methods of TDPT for Gait Test (TDPT-GT) application and Vicon Motion System. Blue dots indicate 3D coordinates; blue lines indicate both upper extremities, yellow indicates the trunk, gray indicates the left lower extremity, and orange indicates the right lower extremity.
Relationship between 3D coordinates with TDPT for Gait Test (TDTP-GT) application and those with VICON Motion System.
| TDPT-GT | VICON (×10) |
| |
|---|---|---|---|
| Right Shoulder | (0.3, 35.3, 1.6) | (4.2, 14.0, 0.7) | (0.87, 0.58, −0.84) |
| Left Shoulder | (−7.3, 35.0, −6.4) | (3.8, 14.0, 2.2) | (0.90, −0.49, −0.34) |
| Right Elbow | (4.3, 8.1, 4.4) | (4.6, 11.5, 0.2) | (0.92, −0.12, −0.94) |
| Left Elbow | (−11.0, 7.3, −9.9) | (3.8, 11.4, 2.8) | (0.89, −0.36, −0.67) |
| Right Wrist | (4.6, −15.3, 8.7) | (4.4, 10.1, −0.7) | (0.95, −0.23, −0.89) |
| Left Wrist | (−17.5, −14.4, −12.0) | (3.4, 10.1, 3.4) | (0.87, −0.40, −0.82) |
| Right Hip joint | (−1.1, −27.3, 2.9) | (4.6, 8.8, 1.2) | (0.85, −0.02, −0.85) |
| Left Hip joint | (−5.9, −27.5, −2.0) | (4.3, 8.8, 2.0) | (0.79, 0.52, −0.29) |
| Right Knee | (−2.7, −79.1, 1.9) | (4.7, 4.8, 1.4) | (0.93, −0.77, −0.60) |
| Left Knee | (−8.3, −79.0, −4.0) | (4.5, 4.9, 2.1) | (0.77, 0.49, −0.19) |
| Right Ankle | (−3.3, −132.2, 4.4) | (5.2, 1.2, 1.6) | (0.84, −0.75, −0.88) |
| Left Ankle | (−7.2, −131.6, −2.4) | (4.9, 1.2, 2.2) | (0.81, 0.14, −0.61) |
| Right Toe | (−6.0, −140.3, 7.4) | (4.8, 0.7, 1.3) | (0.92, −0.66, −0.77) |
| Left Toe | (−11.1, −140.4, −0.6) | (4.4, 0.7, 2.2) | (0.74, 0.57, −0.72) |
The mean values of 3D coordinates (X, Y, Z) for TDPT-GT and VICON (×10). r: Pearson’s correlation coefficient for each coordinate (X, Y, Z).