| Literature DB >> 34475412 |
Fabian Horst1, Djordje Slijepcevic2, Marvin Simak3, Wolfgang I Schöllhorn3.
Abstract
The Gutenberg Gait Database comprises data of 350 healthy individuals recorded in our laboratory over the past seven years. The database contains ground reaction force (GRF) and center of pressure (COP) data of two consecutive steps measured - by two force plates embedded in the ground - during level overground walking at self-selected walking speed. The database includes participants of varying ages, from 11 to 64 years. For each participant, up to eight gait analysis sessions were recorded, with each session comprising at least eight gait trials. The database provides unprocessed (raw) and processed (ready-to-use) data, including three-dimensional GRF and two-dimensional COP signals during the stance phase. These data records offer new possibilities for future studies on human gait, e.g., the application as a reference set for the analysis of pathological gait patterns, or for automatic classification using machine learning. In the future, the database will be expanded continuously to obtain an even larger and well-balanced database with respect to age, sex, and other gait-specific factors.Entities:
Mesh:
Year: 2021 PMID: 34475412 PMCID: PMC8413275 DOI: 10.1038/s41597-021-01014-6
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Demographic details of individual datasets and the total database.
| Dataset | ID | N | Sex (male/female) | Age (years) Mean (SD) | Body Mass (kg) Mean (SD) | Body Height (m) Mean (SD) |
|---|---|---|---|---|---|---|
| Horst | 1 | 8 | 2/6 | 23.3 (2.4) | 65.9 (8.0) | 1.73 (0.07) |
| Horst | 2 | 9 | 6/3 | 27.4 (3.0) | 73.2 (13.3) | 1.74 (0.11) |
| Horst | 3 | 128 | 76/52 | 23.8 (9.0) | 71.3 (13.0) | 1.77 (0.08) |
| Horst | 4 | 57 | 28/29 | 23.1 (2.7) | 67.9 (11.3) | 1.74 (0.10) |
| Burdack | 5 | 33 | 14/19 | 25.1 (6.7) | 65.1 (9.6) | 1.71 (0.09) |
| Unpublished Study 1 | 6 | 38 | 38/0 | 28.0 (10.8) | 78.2 (9.7) | 1.81 (0.04) |
| Unpublished Study 2 | 7 | 26 | 26/0 | 24.7 (2.9) | 79.8 (8.8) | 1.82 (0.07) |
| Unpublished Study 3 | 8 | 25 | 0/25 | 23.3 (4.2) | 62.6 (7.6) | 1.67 (0.05) |
| Unpublished Study 4 | 9 | 23 | 15/8 | 24.0 (2.5) | 69.1 (10.5) | 1.77 (0.10) |
| Unpublished Study 5 | 10 | 3 | — | — | 72.4 (7.8) | — |
*For dataset 2 and dataset 5 the experimental protocol was identical. In the analysis conducted by
Burdack et al. (2020)[34], the data from both datasets were analysed together.
Fig. 1Frequency distribution of age, body mass, body height, and walking speed for all (upper panel), female (middle panel), and male (lower panel) participants. The distributions are based on the values of the initial session of each participant. For the waking speed, the mean values of the gait trials of the initial session are shown.
Data recording and experimental protocol details of the individual datasets.
| Dataset | ID | Force Plate Configuration | Walking Speed Estimation Method | Gait Analysis Sessions | Familiarization Trials | Gait Trials per Session | Total Number of Gait Trials |
|---|---|---|---|---|---|---|---|
| Horst | 1 | inline | infrared cameras | 8 | 20(4)** | 15 | 949 |
| Horst | 2 | inline | infrared cameras | 6 | 20(5)** | 15 | 806 |
| Horst | 3 | staggered | light barriers | 1(2)* | 5 | 10 | 1,737 |
| Horst | 4 | inline | infrared cameras | 1 | 20 | 20 | 1,130 |
| Burdack | 5 | inline | infrared cameras | 6 | 20(5)** | 15 | 2,959 |
| Unpublished Study 1 | 6 | inline | — | 1 | 10 | 10 | 377 |
| Unpublished Study 2 | 7 | staggered | light barriers | 1 | 5 | 8 | 233 |
| Unpublished Study 3 | 8 | inline | — | 1 | 10 | 15 | 374 |
| Unpublished Study 4 | 9 | inline | infrared cameras | 1 | 5 | 10 | 231 |
| Unpublished Study 5 | 10 | inline | — | 1 | 5 | 8 | 23 |
*Forty-seven out of one hundred and twenty-eight participants attended a second gait analysis session.
**Numbers in parentheses () represent the number of familiarization trials performed by participants before follow-up sessions in experimental protocols with repeated gait analysis sessions.
Description of the data stored in the “GRF_*.csv” files. “*” for the associated file name is a placeholder for “right” and “left” (adapted from Horsak et al.[35]).
| Variables | Associated file | Format | Dimension | Unit | Description |
|---|---|---|---|---|---|
| Vertical GRF | GRF_F_V-RAW_*.csv | double | 1 × n | Newton | Unprocessed vertical ground reaction force |
| Anterior-posterior GRF | GRF_F_AP-RAW_*.csv | double | 1 × n | Newton | Unprocessed breaking and propulsive shear force |
| Medio-lateral GRF | GRF_F_ML_RAW_*.csv | double | 1 × n | Newton | Unprocessed medio-lateral shear force |
| COP anterior-posterior | GRF_COP_AP_RAW_*.csv | double | 1 × n | Meter | Unprocessed COP coordinate in walking direction |
| COP medio-lateral | GRF_COP_ML_RAW_*.csv | double | 1 × n | Meter | Unprocessed COP coordinate in medio-lateral direction |
| Vertical GRF | GRF-F_V_PRO_*.csv | double | 1 × n | Multiple of body weight | Processed vertical ground reaction force |
| Anterior-posterior GRF | GRF_F_AP_PRO_*.csv | double | 1 × n | Multiple of body weight | Processed breaking and propulsive shear force |
| Medio-lateral GRF | GRF-F_ML_PRO_*.csv | double | 1 × n | Multiple of body weight | Processed medio-lateral shear force |
| COP anterior-posterior | GRF_COP_AP_PRO_*.csv | double | 1 × n | Meter | Processed COP coordinate in walking direction |
| COP medio-lateral | GRF_COP_ML_PRO_*.csv | double | 1 × n | Meter | Processed COP coordinate in medio-lateral direction |
| Walking Speed | GRF_walking_speed.csv | double | 1 × n | Measured walking speed |
n is either the number of frames during one step across the force plate for the unprocessed data (“RAW”) or a time-normalized vector of 101 points for the
processed (“PRO”) data. Note that the first four columns of each file hold the DATASET_ID, SUBJECT_ID, SESSION_ID, and TRIAL_ID.
Description of the information stored in the metadata file (adapted from Horsak et al.[35]).
| Categories/Variables | Format | Unit | Description |
|---|---|---|---|
| DATASET_ID | integer | — | Unique identifier of a dataset |
| SUBJECT_ID | integer | — | Unique identifier of a participant |
| SESSION_ID | integer | — | Unique identifier of a gait analysis session |
| CLASS_LABEL* | string | — | Annotated class labels |
| CLASS_LABEL_DETAILED* | string | — | Annotated class labels for subclasses |
| SEX | binary | — | female = 0, male = 1 |
| AGE | integer | years | Age at recording date |
| HEIGHT | integer | centimeter | Body height in centimeters |
| BODY_WEIGHT | double | Body weight in Newton | |
| BODY_MASS | double | kg | Body mass |
| SHOE_SIZE | double | EU | Shoe size in the Continental European System |
| AFFECTED_SIDE* | integer | — | left = 0, right = 1, both = 2, none = NaN |
| SHOD_CONDITION* | integer | — | barefoot & socks = 0, normal shoe = 1, orthopedic shoe = 2 |
| ORTHOPEDIC_INSOLE* | binary | — | without insole = 0, with insole = 1 |
| SPEED* | integer | — | slow = 1, self-selected = 2, fast = 3 walking speed class |
| READMISSION* | integer | — | indicates the number of readmission = 0 L n |
| SESSION_TYPE* | integer | — | initial = 1, control = 2, initial after readmission = 3 |
| SESSION_DATE | string | — | date of gait analysis session in the format “DD-MM-YYYY hh:mm” |
| TRAIN* | binary | — | is part ( = 1) or is not part ( = 0) of TRAIN |
| TRAIN_BALANCED* | binary | — | is part ( = 1) or is not part ( = 0) of TRAIN_BALANCED* |
| TEST* | binary | — | is part ( = 1) or is not part ( = 0) of TEST |
*The metadata items highlighted by an asterisk were included primarily to ensure a consistent data structure between
the Gutenberg Gait Database and the GaitRec dataset[35].
Fig. 2Visualization of vertical (left panel), anterior-posterior (central panel), and medio-lateral (right panel) force components of the body weight (BW)-normalized GRF measurements per dataset. Mean and standard deviation signals (calculated per dataset) are highlighted as solid and dashed colored lines.
Fig. 3Visualization of zero-centered anterior-posterior (left panel) and mean-centered medio-lateral (right panel) components of the COP measurements per dataset. Mean and standard deviation signals (calculated per dataset) are highlighted as solid and dashed colored lines. We carefully inspected the gait trials where the signals differed considerably and made sure that these differences were not the result of measurement or calculation errors. Using the kinematic data, we were able to verify that the deviating signals were from gait trials of forefoot or midfoot walking participants.
| Measurement(s) | ground reaction force • centre of pressure • walking behavior • gait measurement • Normal Gait |
| Technology Type(s) | force plate • Sensor Device |
| Factor Type(s) | age • sex • walking speed |
| Sample Characteristic - Organism | Homo sapiens |
| Sample Characteristic - Environment | laboratory environment |