| Literature DB >> 29869300 |
Steffi L Colyer1,2, Murray Evans1,3, Darren P Cosker1,3, Aki I T Salo4,5.
Abstract
BACKGROUND: The study of human movement within sports biomechanics and rehabilitation settings has made considerable progress over recent decades. However, developing a motion analysis system that collects accurate kinematic data in a timely, unobtrusive and externally valid manner remains an open challenge. MAIN BODY: This narrative review considers the evolution of methods for extracting kinematic information from images, observing how technology has progressed from laborious manual approaches to optoelectronic marker-based systems. The motion analysis systems which are currently most widely used in sports biomechanics and rehabilitation do not allow kinematic data to be collected automatically without the attachment of markers, controlled conditions and/or extensive processing times. These limitations can obstruct the routine use of motion capture in normal training or rehabilitation environments, and there is a clear desire for the development of automatic markerless systems. Such technology is emerging, often driven by the needs of the entertainment industry, and utilising many of the latest trends in computer vision and machine learning. However, the accuracy and practicality of these systems has yet to be fully scrutinised, meaning such markerless systems are not currently in widespread use within biomechanics.Entities:
Keywords: Automatic analysis; Body model; Cameras; Discriminative approaches; Gait; Generative algorithms; Motion capture; Rehabilitation; Sports biomechanics; Technique
Year: 2018 PMID: 29869300 PMCID: PMC5986692 DOI: 10.1186/s40798-018-0139-y
Source DB: PubMed Journal: Sports Med Open ISSN: 2198-9761
A selection of commercially available full-body markerless systems
| Company | Cameras | Capture environments | Integration with other biomechanical tools | Real-time capacity |
|---|---|---|---|---|
| Captury Studio Ultimate | Unlimited number with combination of resolutions | No specific background necessary. | None. Applications primarily within entertainment | Yes |
| BioStage | 8–18 (120 fps in real-time) | Laboratory-based | Force plates and electromyography | Yes |
| Shape 3D | Up to 8 high-speed colour cameras | Can operate outdoors but stable background with good contrast is required | Force plates, electromyography and pressure sensors | No |
Information obtained from company web-pages (accessed July 2017)
Fig. 1General structure of a markerless motion capture whether using generative (green) or discriminative (orange) algorithms
Fig. 2Example of a depth map. Brighter pixels are further away from the camera. Black pixels are either too far away or on objects that do not reflect near infrared light
Fig. 3Example of a poseable skeleton model. “Bones” of a pre-specified length are connected at joints, and rotation of the bones around these joints allows the skeleton to be posed. The skeleton model is commonly fit to both marker-based motion capture data and computer vision-based markerless systems
Fig. 4Sum of Gaussian body model from Stoll [75]. A skeleton (left) forms the foundation of the model, providing limb-lengths and body pose. The body is given volume and appearance information through the use of 3D Spatial Gaussians arranged along the skeleton (represented by spheres). The resulting information allows the model to be fit to image data
Fig. 5Skinned Multi-Person Linear Model (SMPL) [79] body model. This model does not have an explicit skeleton. Instead, the surface of a person is represented by a mesh of triangles. A set of parameters (learnt through regression) allows the shape of the model to be changed from a neutral mean (left) to a fatter (middle) or thinner, taller, or other body shape. Once shaped, the centres of joints are inferred from the neutrally posed mesh, and then the mesh can be rotated around these joints to produce a posed body (right)
Fig. 6Silhouette on the right from chroma keying the image on the left. When seen as only a silhouette, it is not possible to infer if the mannequin is facing towards or away from the camera
Fig. 7The generation of a visual hull, which is a type of 3D reconstruction of an object viewed from multiple cameras. Top row: images of an object are captured as 2D images from multiple directions. Middle row: these images are processed to produce silhouette images for each camera. Bottom left: the silhouettes are back-projected from each camera, resulting in cone-like regions of space. Bottom right: the intersection of these cones results in the visual hull
Overview of studies comparing markerless with conventional motion analysis systems
| Publication | Movement(s) analysed | Markerless system description | Procedure/system for comparison | Number of cameras | Outcome |
|---|---|---|---|---|---|
| Trewartha et al. [ | Starjump, somersaults | Gen-locked video cameras (50 Hz), subject-specific model | Manual digitising (TARGET system) | 3 | RMS differences for three movements ranged from 10 mm and 30 mm for pelvis location and between 2° and 8° for body configuration angles. |
| Corazza et al. [ | Walking | Visual hull construction and a priori subject-specific model | Virtual environment (Poser software) | 16 | RMS errors of hip, knee and ankle angles ranged from 2.0° (hip abduction/adduction) to 9.0 (ankle dorsi/plantar flexion) |
| Mündermann et al. [ | Walking | Video cameras (75 Hz), visual hull construction and a priori subject-specific model | Qualisys (120 Hz) | 8 | Average knee joint angle deviation: 2.3° (sagittal plane) and 1.6° (frontal plane). |
| Corazza et al. [ | Walking | Video cameras (120 Hz), visual hull construction and a priori subject-specific model | Qualisys (120 Hz) | 8 | Average deviations between joint (hip, knee, ankle, shoulder, elbow and wrist) centres: 15 mm mean absolute error (ranged from 9 to 19 mm) |
| Choppin and Wheat [ | Reaching, throwing, jumping | Microsoft Kinect (30 Hz) | Motion Analysis Corporation (60 Hz) | 1 Kinect, 12 optoelectronic | Flexion/extension and abduction/adduction of hip, knee, elbow and shoulder; shoulder plane and elevation studied. Maximum abduction error: 44.1° and 13.9°. Maximum flexion error: 36.2° and 19.5° (NITE and IPIsoft tracking algorithms, respectively) |
| Ceseracciu et al. [ | Walking | BTS | BTS | 8 | Maximum RMS differences range: 11.0° (ankle dorsi/ plantar flexion) to 34.7° (hip internal/external rotation) |
| Sandau et al. [ | Walking | Monochrome cameras (75 Hz), unconstrained articulated model fit to 3D point clouds (aided by full body patterned suit) | Ariel Performance Analysis System | 8 | RMS differences in lower limb 3D angles ranged between 1.8° (hip abduction/adduction) and 4.9° (hip internal/external rotation) |
| Ong et al. [ | Walking and jogging | Point Grey cameras (25 Hz) | Motion Analysis Corporation (100 Hz) | 2 markerless, 8 marker-based | RMS differences ranged from 0.2° (knee abduction/adduction of jogging) to 1.0° (ankle dorsi/plantar flexion of walking). Significant differences between markerless and marker-based for the ankle joint angles. |
RMS root mean square
Selection of published validation results against the HumanEva datasets
| Publication | 3D joint position error (mm) | Standard deviation of error (mm) |
|---|---|---|
| Corazza et al. [ | 79.0 | 11.5 |
| Amin et al. [ | 54.5 | |
| Belagiannis et al. [ | 68.3 | |
| Saini et al. [ | 45.7 | 5.3 |
| Guo et al. [ | 46.8 | |
| Elhayek et al. [ | 66.5 | |
| Rhodin et al. [ | 54.6 | 24.2 |
| Bogo et al. [ | 79.9 |
Fig. 8An example image from the HumanEva dataset used to validate markerless systems within computer vision. White dots indicate the location of tracked reflective markers and the cyan lines represent the defined skeleton model fit to the marker data. Although useful as an early benchmark for markerless tracking systems, the dataset has clear limitations for assessing the quality of any markerless tracking results, especially in the context of biomechanics. Notice that the markers are attached to clothing, marker clusters are not utilised, and the joint centres inferred from the fitted skeleton are not closely aligned with how the person appears in the image (e.g. right elbow and hip joints). See further information in the text