Literature DB >> 29018355

Criterion-Validity of Commercially Available Physical Activity Tracker to Estimate Step Count, Covered Distance and Energy Expenditure during Sports Conditions.

Yvonne Wahl1,2, Peter Düking3, Anna Droszez2, Patrick Wahl2,4, Joachim Mester2.   

Abstract

Background: In the past years, there was an increasing development of physical activity tracker (Wearables). For recreational people, testing of these devices under walking or light jogging conditions might be sufficient. For (elite) athletes, however, scientific trustworthiness needs to be given for a broad spectrum of velocities or even fast changes in velocities reflecting the demands of the sport. Therefore, the aim was to evaluate the validity of eleven Wearables for monitoring step count, covered distance and energy expenditure (EE) under laboratory conditions with different constant and varying velocities.
Methods: Twenty healthy sport students (10 men, 10 women) performed a running protocol consisting of four 5 min stages of different constant velocities (4.3; 7.2; 10.1; 13.0 km·h-1), a 5 min period of intermittent velocity, and a 2.4 km outdoor run (10.1 km·h-1) while wearing eleven different Wearables (Bodymedia Sensewear, Beurer AS 80, Polar Loop, Garmin Vivofit, Garmin Vivosmart, Garmin Vivoactive, Garmin Forerunner 920XT, Fitbit Charge, Fitbit Charge HR, Xaomi MiBand, Withings Pulse Ox). Step count, covered distance, and EE were evaluated by comparing each Wearable with a criterion method (Optogait system and manual counting for step count, treadmill for covered distance and indirect calorimetry for EE).
Results: All Wearables, except Bodymedia Sensewear, Polar Loop, and Beurer AS80, revealed good validity (small MAPE, good ICC) for all constant and varying velocities for monitoring step count. For covered distance, all Wearables showed a very low ICC (<0.1) and high MAPE (up to 50%), revealing no good validity. The measurement of EE was acceptable for the Garmin, Fitbit and Withings Wearables (small to moderate MAPE), while Bodymedia Sensewear, Polar Loop, and Beurer AS80 showed a high MAPE up to 56% for all test conditions.
Conclusion: In our study, most Wearables provide an acceptable level of validity for step counts at different constant and intermittent running velocities reflecting sports conditions. However, the covered distance, as well as the EE could not be assessed validly with the investigated Wearables. Consequently, covered distance and EE should not be monitored with the presented Wearables, in sport specific conditions.

Entities:  

Keywords:  athletes; biofeedback; monitoring; validity; wearables

Year:  2017        PMID: 29018355      PMCID: PMC5615304          DOI: 10.3389/fphys.2017.00725

Source DB:  PubMed          Journal:  Front Physiol        ISSN: 1664-042X            Impact factor:   4.566


Introduction

In the past years, there was an increasing development of physical activity trackers (Wearables) which earned them the first place in the ACSM Worldwide Survey of Fitness Trends in 2016 and 2017, leaving popular topics like “High-intensity interval training” and “strength training” behind (Thompson, 2015, 2016). Besides having applications for physical fitness and health in the general population by monitoring a plethora of different variables like step count, covered distance and energy expenditure (EE), Wearables may be useful for (elite) athletes as well. In these populations, Wearables might be used to monitor aspects of training load (Düking et al., 2016) as well as physical activity during leisure time and provide biofeedback to optimize exercises (Düking et al., 2017). However, before Wearables can be used beneficially, the parameters they provide need to be scientifically trustworthy which implies that Wearables have sufficient validity which unfortunately is often an issue especially with commercially available Wearables (Sperlich and Holmberg, 2016). Several studies, recently summarized by Evenson et al. (2015) and Düking et al. (2016), tackled this issue and investigated the scientific trustworthiness of different Wearables under a variety of different conditions like walking, jogging, cycling, or resistance exercise under laboratory as well as under free-living conditions. Yet, scientific evaluations are strictly speaking only meaningful for the specific conditions the device was tested in and transfer of the results of these studies should be done carefully (Bassett et al., 2012). For recreational people, testing under walking or light jogging conditions might be sufficient. For (elite) athletes, however, scientific trustworthiness needs to be given for a broad spectrum of velocities or even fast changes in velocities reflecting the demands of the sport. There is scarce literature stating the validity of consumer level Wearables under sport specific conditions, even though some of the herein analyzed wearables are validated in the general population (El-Amrawy and Nounou, 2015; Alsubheen et al., 2016; An et al., 2017; Price et al., 2017). Therefore the aim of the present study was to investigate the (concurrent) criterion-validity of eleven consumer Wearables concerning the amount of step count, covered distance and EE during running at four different velocities, an intermittent profile reflecting conditions in a soccer match and a 15-min outdoor trial at a constant velocity.

Materials and methods

For the determination of the validity of step count, covered distance and EE, the criterion measures are described below. In order to test the validity of the eleven Wearables in a standardized situation under laboratory conditions, participants performed a running protocol of a total duration of 25 min, which consisted of four stages of different constant velocities lasting 5 min each, as well as a 5 min period of intermittent velocity. Validity for outdoor conditions was subsequently tested during a 15-min run at a constant velocity. The validity of the Wearables for step count, covered distance and EE was assessed during a single session of treadmill walking and running, using methods similar to previous validation studies (Takacs et al., 2014).

Subjects and ethics statement

A total of 20 healthy and active sport students (10 male and 10 female) volunteered to participate in this study. All subjects gave written informed consent to the participation in the study. The study was performed in accordance with the declaration of Helsinki and approved by the Ethic Committee of the German Sport University Cologne.

Instruments

Criterion measures

The Optogait system (OPTOGait, Microgate Srl, Bolzano, Italy) was used as the criterion measure for monitoring step count on the treadmill. The system is integrated within the sidebars of the treadmill (Pulsar, h/p/ cosmos sports and medical GmbH, Traunstein, Germany) and uses a photoelectric cell system to precisely measure the number of step count, which is a reliable (ICC = 0.962) and valid (ICC = 0.997) method for measuring step counts during treadmill trials (Lee et al., 2014). Step count was additionally assessed by a manual counter, which was also used in the outdoor condition. The covered distance measured by the treadmill was used as a criterion measure and was determined based on the calibrated treadmill output (displayed on the electronic output of the treadmill in meters, based on the speed of the treadmill belt and time for each revolution of the belt) according to Takacs et al. (2014). The slope of the treadmill was automatically set at 1%. The Metamax 3B (Metamax 3B, CORTEX Biophysik GmbH, Leipzig, Germany) is a portable gas analyzer allowing measurements of oxygen uptake under laboratory and free-living conditions, which was used in this study to calculate EE via indirect calorimetry as the criterion measure for EE. For the calculation of EE, oxygen uptake (VO2) was measured continuously breath by breath during the whole exercise and calculated according to previous reports (Scott et al., 2006). Before each session, the Metamax 3B flowmeter and gas analyzers were calibrated using a 3-liter syringe and a known gas mixture (15% O2 and 5% CO2). During calibration of the gas analyzer (O2 and CO2 sensors), the Metamax3B alternates sampling of the known gas mixture and ambient air. The Metamax 3B is a valid and reliable system for measuring oxygen uptake (Vogler et al., 2010). Methods of indirect calorimetry are the most commonly used to quantify human EE in both laboratory and field settings, typically by measuring oxygen uptake (Hills et al., 2014).

Wearables

Eleven Wearables were tested, including: Bodymedia Sensewear MF (300€, BodyMedia Inc, Pittsburgh, PA), Polar Loop (50€; Polar Electro, Kempele, Finnland), Beurer AS80 (30€; Beurer GmbH, Ulm, Germany), Fitbit Charge and Fitbit Charge HR (80€, 100€; Fitbit Inc, San Francisco, CA), Garmin Vivofit (90€), Garmin Vivosmart (100€), Garmin Vivoactive (250€), Garmin Forerunner 920XT (470€) (Garmin, Olathe, Kansas), Withings Pulse Ox (100€) (Withings SA, Issy-les-Moulineaux, France), Xiaomi MiBand (15€; Xiaomi Inc, Beijing, China). All devices use a triaxial accelerometer; Garmin Vivoactive and Garmin Forerunner 920XT also include a GPS sensor. The Fitbit Charge HR and all Garmin devices also use heart rate to calculate EE using photoplethysmography or chest belt sensors, respectively.

Exercise study protocol

After arriving in the laboratory, anthropometric (weight, height, body fat) and personal data (date of birth, sex, handedness) of the participants were collected and transferred to all devices. Afterward, eleven Wearables were fixed at the wrist in a randomized order. The Bodymedia Sensewear armband and one Withings Pulse Ox device were placed on the backside of the upper arm and the hip, respectively. For the measurement of heart rate of the Garmin Wearables, the participants were fitted with a heart rate chestbelt. First, the participants were asked to lay down for 20 min. After the first 10 min, the measurement of resting EE was started using indirect calorimetry technique. Second, the running protocol was started, consisting of four 5 min stages of different constant velocities (walking: 4.3; 7.0; running: 10.1; 13.0 km·h−1) each separated by 5 min of passive rest. After these constant velocities stages, a 5 min period of intermittent velocity followed. This protocol was extracted from a smoothed running trial during a real soccer match (Amisco Data from a soccer match of the 1. German soccer league). The mean running velocity was 9.1 km·h−1, including twelve sprints with a maximal velocity of 22.4 km·h−1. Maximal acceleration and deceleration were 5.47 km·h−2 (1.52 m·s−2) and −4.88 km·h−2 (−1.36 m·s−2), respectively. Remaining time was covered with walking, defined by velocities smaller than 7.33 km·h−1, which is considered as preferred transition speed between walking and running (Rotstein et al., 2005). Besides the tests under laboratory conditions, ten participants (5 men, 5 women) performed a run of 2.4 km at a constant velocity of 10.1 km·h−1 under free-living conditions (Figure 1).
Figure 1

Exercise study protocol.

Exercise study protocol.

Statistical analysis

Descriptive statistics (mean ± SD) summarize the characteristics of the participants, including age, weight, height and percent of body fat. All data were tested for normality with no further transformation needed. The validity of the Wearables was determined, as previously performed by other validation studies (Kooiman et al., 2015; Bai et al., 2016; An et al., 2017), by several statistical tests: Systematic differences between the Wearables and the criterion measurement: mean absolute percentage error (MAPE) compared to the criterion measurement (mean difference Wearables–criterion measurement ·100· mean criterion measurement−1). Correlation between the Wearables and the criterion measurement: Intraclass Correlation Coefficient (ICC) (two-way random, absolute agreement, single measure, 95% confidence interval) (Shrout and Fleiss, 1979), common cut-off points for validity assessment: >0.90 (excellent), 0.75–0.90 (good), 0.60–0.75 (moderate), and <0.60 (low). Measure of precision: typical error (TE): TE = SD ·√1-ICC. Level of agreement between the Wearables and the criterion measurement: upper and lower limits of agreement (LoA) as described by Bland-Altman. All statistical analyses of the data were performed by using a statistics software package SPSS (version 23.0, IBM SPSS Statistics).

Results

For the laboratory study, 20 participants were included (10 males, mean ± SD age: 26.1 ± 2.8 years; height: 182.3 ± 7.4 cm; weight: 81.1 ± 11.2 kg; body fat 11.5 ± 2.6%, and 10 females mean ± SD age: 24.2 ± 1.9 years; height: 168.2 ± 6.7 cm; weight: 60.2 ± 5.5 kg; body fat 17.9 ± 4.9%). The outdoor condition and the Withings Pulse Ox (Hip) were tested with a fewer number of participants (5 males and 5 females). Due to the high amount of lacking data, we excluded the Xaomi Miband from any data analysis. The mean differences (criterion–wearable), 95% CI for step count, distance, and EE for all velocities are shown in Figures 2–4. MAPE, ICC, TE, and LoA are shown in Table 1 (step count), Table 2 (distance), Table 3 (EE).
Figure 2

Difference in step count (n) between criterion measure and the eleven activity trackers at different running velocities (A–F), data are shown as mean ± 95% CI. Mean number of steps (± SD) measured by the criterion measure: 4.3 km·h−1 = 538 ± 29; 7.2 km·h−1 = 785 ± 38; 10.1 km·h−1 = 822 ± 51; 13.0 km·h−1 = 863 ± 56; intermittent = 1,231 ± 127; outdoor = 2,456 ±145 steps. SW, Bodymedia Sensewear; PL, Polar Loop; B80, Beurer AS80; GVF, Garmin Vivofit; GVS, Garmin Vivosmart; GVA, Garmin Vivoactive; GFR, Garmin Forerunner 920XT; FC, Fitbit Charge; FHR, Fitbit Charge HR; WPO H, Withings Pulse Ox Hip; WPO W, Withings Pulse Ox Wrist.

Figure 4

Differences in EE (kcal) between the criterion measure and the eleven activity trackers at different running verlocities (A–F), data are shown as mean ± 95% CI. Mean EE (± SD) by the criterion method were: 4.3 km·h−1 = 24 ± 6; 7.2 km·h−1 = 47 ± 10; 10.1 km·h−1 = 61 ± 13; 13.0 km·h−1 = 74 ± 17; intermittent = 96 ± 18; outdoor = 210 ± 49 kcal. SW, Bodymedia Sensewear, PL, Polar Loop; B80, Beurer AS80; GVF, Garmin Vivofit; GVS, Garmin Vivosmart; GVA, Garmin Vivoactive; GFR, Garmin Forerunner 920XT; FC, Fitbit Charge; FHR, Fitbit Charge HR; WPO H, Withings Pulse Ox Hip; WPO W, Withings Pulse Ox Wrist.

Table 1

Mean absolute percentage error (MAPE), Intraclass Correlation Coefficient (ICC; 95%CI), typical error (TE), and upper & lower limits of agreement (LoA) for all Wearables for step count.

Bodymedia SensewearPolar LoopBeurer AS80Garmin VivofitGarmin VivosmartGarmin VivoactiveGarmin Forerunner 920XTFitbit ChargeFitbit Charge HRWithings Pulse Ox HipWithings Pulse Ox Wrist
4.3 km·h−1MAPE−15.5−8.7−6.1−0.7−0.30.11.5−3.2−0.6−0.2−0.9
ICC (95% CI)0.18 (−0.08–0.53)0.06 (−0.19–0.40)0.20 (−0.17–0.55)0.89 (0.74–0.95)0.97 (0.92–0.99)0.94 (0.85–0.97)0.72 (0.19–0.93)0.57 (0.14–0.81)0.96 (0.89–0.98)0.98 (0.94–1.0)0.82 (0.60–0.92)
TE37.158.254.65.31.42.416.917.01.60.78.1
LoA−3−16369−16587−15328−3614−1821−2071−5434−6912−199−1232−42
7.2 km·h−1MAPE−12.8−9.6−0.10.20.30.10.3−1.10.5−0.003−0.4
ICC (95% CI)0.18 (−0.09−0.53)−0.27 (−0.55−0.16)0.99 (0.98−1.0)0.99 (0.97−1.0)0.98 (0.96−1.0)0.94 (0.85−0.98)0.98 (0.94−1.0)0.83 (0.62−0.93)0.97 (0.91−0.99)0.59 (0.03−0.88)0.99 (0.99−1.0)
TE48.5110.20.40.60.93.61.010.61.61.60.6
LoA4−210174−3327−913−916−1130−2817−1142−5822−145−58−15
10.1 km·h−1MAPE−12.5−5.4−2.50.7−0.20.60.04−1.30.01−0.2−1.9
ICC (95% CI)0.27 (−0.08−0.65)0.39 (−0.08−0.72)0.72 (0.34−0.89)0.91 (0.78−0.96)0.97 (0.93−0.99)0.97 (0.93−0.99)0.99 (0.96−1.0)0.85 (0.66−0.94)0.99 (0.98−1.0)0.99 (0.98−1.0)0.83 (0.51−0.94)
TE41.234.517.96.82.22.00.910.80.70.79.6
LoA−8−19741−13345−8850−3923−2727−1717−1743−6614−1311−1530−62
13.0 km·h−1MAPE−13.5−3.3−3.7−2.0−0.4−0.20.1−1.9−0.3−0.6−4.8
ICC (95% CI)0.23 (−0.08−0.60)0.69 (0.26−0.88)0.49 (0.07−0.76)0.73 (0.43−0.88)0.97 (0.93−0.99)0.99 (0.98−1.00)0.99 (0.97−1.00)0.78 (0.49−0.91)0.99 (0.98−1.00)0.96 (0.86−0.99)0.25 (−0.11−0.59)
TE47.223.037.324.42.30.80.815.20.73.253.0
LoA−12−22352−11069−13575−10923−3015−1717−1546−8111−1625−3776−164
Inter−mittentMAPE−9.9−13.3−16.2−1.4−1.1−1.5−2.7−3.3−1.1−4.7−16.0
ICC (95% CI)0.62 (−0.06−0.90)0.31 (0.09−0.70)0.37 (−0.11−0.72)0.88 (0.71−0.95)0.95 (0.87−0.98)0.89 (0.74−0.96)0.93 (0.72−0.98)0.88 (0.43−0.96)0.93 (0.84−0.97)0.91 (−0.02−0.99)0.34 (−0.08−0.73)
TE29.773.9119.517.99.016.813.816.011.57.074.0
LoA−28−1277−34299−49180−12364−9378−12169−13648−13371−100−15−106−20−377
OutdoorMAPE−3.0−1.9−4.7−0.02−0.04−0.10.03−0.10.20.1−0.6
ICC (95% CI)0.82 (0.07−0.96)0.83 (0.32−0.96)0.50 (−0.06−0.84)0.98 (0.93−1.0)0.98 (0.92−1.0)0.98 (0.92−1.0)0.98 (0.93−1.0)0.98 (0.91−0.99)0.98 (0.92−0.99)0.98 (0.92−1.0)0.98 (0.91−1.0)
TE27.124.5127.24.34.54.64.34.84.64.33.8
LoA51−19972−161237−46860−6063−6063−6561−5865−6770−5864−5638−68
Table 2

Mean absolute percentage error (MAPE), Intraclass Correlation Coefficient (ICC; 95%CI), typical error (TE), and upper & lower limits of agreement (LoA) for all Wearables for covered distance.

Beurer AS80Garmin VivofitGarmin VivosmartGarmin VivoactiveGarmin Forerunner 920XTFitbit ChargeFitbit Charge HRWithings Pulse Ox HipWithings Pulse Ox Wrist
4.3 km·h−1MAPE−17.69.817.717.41.35.38.019.017.8
ICC (95% CI)0.01 (−0.22−0.32)0.02 (−0.30−0.27)0.01 (−0.03−0.08)0.004 (−0.08−0.16)0.003 (−0.46−0.45)−0.003 (−0.27−0.35)0.02 (−0.08−0.22)0.01 (−0.05−0.20)−0.01 (−0.19−0.55)
TE73.330.319.634.569.227.117.932.157.8
LoA82−20795−2510225130−6141−13172−3464−714519169−57
7.2 km·h−1MAPE−18.123.3.53.551.426.016.013.157.658.3
ICC (95% CI)−0.02 (−0.15−0.22)−0.01 (−0.06−0.11)−0.002 (−0.01−0.02)−0.002 (−0.01−0.02)−0.003 (−0.11−0.19)−0.01 (−0.09−0.14)−0.02 (−0.19−0.25)−0.003 (−0.05−0.14)−0.003 (−0.03−0.6)
TE90.264.459.059.3102.952.576.5126.5111.6
LoA66−28426614436206441208357−45198−6227−70608126560124
10.1 km·h−1MAPE−40.3−7.814.213.97.0−13.9−13.424.322.3
ICC (95% CI)−0.003 (−0.03−0.05)0.05 (−0.21−0.35)−0.01 (−0.08−0.14)−0.03 (−0.10−0.13)−0.05 (−0.38−0.35)−0.01 (−0.05−0.09)−0.01 (−0.12−0.19)−0.01 (−0.10−0.25)−0.02 (−0.13−0.18)
TE101.667.164.665.1125.948.478.7123.8122.8
LoA−142−54069−201244−6243−9300−182−23−21240−267467−16415−61
13.0 km·h−1MAPE−51.9−25.0−8.1−6.1−3.3−29.9−29.51.00.7
ICC (95% CI)−0.005 (−0.02−0.03)0.007 (−0.02−0.07)−0.10 (−0.28−0.20)−0.04 (−0.23−0.26)−0.15 (−0.54−0.30)−0.002 (−0.01−0.02)−0.001 (−0.02−0.03)0.04 (−0.57−0.62)−0.11 (−0.55−0.36)
TE119.774.181.872.6149.961.771.3137.3127.0
LoA−331−799−127−41870−24973−206237−311−205−447−181−461321−229225−247
IntermittentMAPE−42.3−3.714.215.512.4−13.3−13.2−0.910.2
ICC (95% CI)0.05 (−0.03−0.22)0.28 (−0.11−0.62)0.08 (−0.08−0.34)−0.07 (−0.21−0.19)0.17 (−0.13−0.50)0.11 (−0.10−0.41)0.07 (−0.10−0.33)0.09 (−0.10−0.46)0.10 (−0.37−0.52)
TE139.267.189.5142.4149.599.3104.6139.6169.1
LoA−198−758110−199344−22443−97465−17855−35861−364532−42339−360
OutdoorMAPE−33.8−3.217.715.59.7−10.1−9.716.727.3
ICC (95% CI)0.000 (−0.01−0.05)0.000 (−0.34−0.50)0.000 (−0.02−0.07)0.000 (−0.02−0.06)0.000 (−0.62−0.61)0.000 (−0.07−0.20)0.000 (−0.22−0.42)0.000 (−0.08−0.27)0.000 (−0.13−0.32)
TE156.293.9104.699.8231.579.2200.2375.9436.0
LoA−504−1116112−256669259661270520−388−26−336171−6131515421515−194
Table 3

Mean absolute percentage error (MAPE), Intraclass Correlation Coefficient (ICC; 95%CI), typical error (TE), and upper & lower limits of agreement (LoA) for all Wearables for energy expenditure.

Bodymedia SensewearPolar LoopBeurer AS80Garmin VivofitGarmin VivosmartGarmin VivoactiveGarmin Forerunner 920XTFitbit ChargeFitbit Charge HRWithings Pulse Ox HipWithings Pulse Ox Wrist
4.3 km·h−1MAPE−4.356.417.01.32.94.0−26.675.083.3−9.9−11.0
ICC (95% CI)0.61 (0.25−0.82)−0.11 (−0.26−0.31)−0.17 (−0.54−0.27)0.73 (0.42−0.88)0.91 (0.78−0.96)0.85 (0.66−0.94)0.35 (−0.08−0.68)0.15 (−0.04−0.50)0.17 (−0.06−0.52)0.44 (0.11−0.81)0.28 (−0.10−0.62)
TE2.69.68.52.20.91.16.04.67.24.34.4
LoA6.6−9.729.5−6.418.1−12.98.1−8.26.5−5.06.4−5.08.0−21.126.87.435.04.27.8−14.66.6−13.9
7.2 km·h−1MAPE−1.453.8−18.918.735.836.8−16.733.618.218.616.9
ICC (95% CI)0.77 (0.50−0.90)0.02 (−0.25−0.48)−0.09 (−0.29−0.22)0.35 (−0.05−0.67)0.21 (−0.12−0.55)0.28 (−0.12−0.55)0.25 (−0.11−0.58)0.29 (−0.11−0.64)0.58 (−0.02−0.84)−0.15 (−0.68−0.52)−0.15 (−0.56−0.31)
TE2.922.912.111.516.313.810.89.85.021.822.2
LoA10.5−13.467.6−23.112.0−33.535.5−20.251.6−20.448.3−15.315.5−33.437.2−8.222.9−7.348.1−31.544.5−36.8
10.1 km·h−1MAPE−17.251.2−33.06.524.020.2−9.313.520.49.45.4
ICC (95% CI)0.52 (−0.10−0.83)−0.09 (−0.26−0.33)−0.04 (−0.15−0.16)0.76 (0.49−0.90)0.46 (−0.01−0.76)0.69 (−0.07−0.92)0.25 (−0.15−0.60)0.68 (0.13−0.88)0.43 (−0.01−0.73)0.27 (−0.51−0.78)0.20 (−0.28−0.59)
TE5.122.814.94.312.63.114.34.310.615.715.2
LoA3.1−25.768.3−17.36.5−50.720.0−14.148.0−19.122.80.825.1−39.821.7−8.238.4−16.439.1−33.034.3−32.1
13.0 km·h−1MAPE−25.341.2−42.7−11.19.06.2−11.1−0.122.2−8.4−5.3
ICC (95% CI)0.39 (−0.1−0.76)−0.25 (−0.45−0.33)−0.02 (−0.09−0.13)0.56 (0.12−0.81)0.59 (0.23−0.81)0.82 (0.59−0.92)0.28 (−0.11−0.62)0.73 (0.43−0.88)0.43 (−0.01−0.73)0.54 (−0.12−0.87)0.46 (0.05−0.74)
TE8.230.418.09.011.84.217.65.514.213.111.4
LoA1.1−39.876.3−30.31.4−68.617.3−35.942.2−30.023.0−15.630.5−50.818.8−22.351.5−22.132.0−43.523.4−37.3
IntermittentMAPE−12.45.6−45.9−21.32.0−1.3−9.22.425.5−48.8−38.9
ICC (95% CI)0.49 (−0.02−0.78)−0.30 (−0.89−0.43)0.000 (−0.05−0.10)0.54 (−0.10−0.84)0.43 (−0.02−0.73)0.74 (0.44−0.89)0.22 (−0.19−0.59)0.58 (0.19−0.81)0.43 (−0.05−0.74)0.01 (−0.10−0.29)0.11 (−0.05−0.40)
TE10.025.918.911.824.87.925.89.720.419.214.8
LoA14.6−40.546.5−42.5−8.8−82.811.4−53.567.0−61.629.4−31.546.9−67.830.0−28.578.1−27.73.7−72.1−14.0−75.5
OutdoorMAPE−20.822.1−48.4−20.2−1.5−4.5−21.2−4.5−12.0−5.5−4.5
ICC (95% CI)0.43 (−0.11−0.82)−0.18 (−0.56−0.48)−0.04 (−0.12−0.22)0.56 (−0.09−0.89)0.82 (0.43−0.95)0.91 (0.64−0.98)0.34 (−0.14−0.77)0.64 (0.11−0.89)0.53 (−0.05−0.85)0.21 (−0.44−0.72)0.22 (−0.41−0.72)
TE21.871.456.814.313.65.431.918.624.452.050.0
LoA9.7−103.6163.0−94.81.3−216.9−1.9−86.659.0−66.824.3−46.229.3−124.546.2−75.640.2−99.597.3−132.291.7−130.4
Difference in step count (n) between criterion measure and the eleven activity trackers at different running velocities (A–F), data are shown as mean ± 95% CI. Mean number of steps (± SD) measured by the criterion measure: 4.3 km·h−1 = 538 ± 29; 7.2 km·h−1 = 785 ± 38; 10.1 km·h−1 = 822 ± 51; 13.0 km·h−1 = 863 ± 56; intermittent = 1,231 ± 127; outdoor = 2,456 ±145 steps. SW, Bodymedia Sensewear; PL, Polar Loop; B80, Beurer AS80; GVF, Garmin Vivofit; GVS, Garmin Vivosmart; GVA, Garmin Vivoactive; GFR, Garmin Forerunner 920XT; FC, Fitbit Charge; FHR, Fitbit Charge HR; WPO H, Withings Pulse Ox Hip; WPO W, Withings Pulse Ox Wrist. Difference in covered distance (m) between the criterion measure and the nine activity trackers at different running velocities (A–F), data are shown as mean ± 95% CI. Mean covered distance (± SD) by the criterion measure were: 4.3 km·h−1 = 358 ± 4; 7.2 km·h−1 = 601 ± 6; 10.1 km·h−1 = 845 ± 12; 13.0 km·h−1 = 1,088 ± 21; intermittent = 1,139 ± 45; outdoor = 2,400 ± 0 meter. B80, Beurer AS80; GVF, Garmin Vivofit; GVS, Garmin Vivosmart; GVA, Garmin Vivoactive; GFR, Garmin Forerunner 920XT; FC, Fitbit Charge; FHR, Fitbit Charge HR; WPO H, Withings Pulse Ox Hip; WPO W, Withings Pulse Ox Wrist. Differences in EE (kcal) between the criterion measure and the eleven activity trackers at different running verlocities (A–F), data are shown as mean ± 95% CI. Mean EE (± SD) by the criterion method were: 4.3 km·h−1 = 24 ± 6; 7.2 km·h−1 = 47 ± 10; 10.1 km·h−1 = 61 ± 13; 13.0 km·h−1 = 74 ± 17; intermittent = 96 ± 18; outdoor = 210 ± 49 kcal. SW, Bodymedia Sensewear, PL, Polar Loop; B80, Beurer AS80; GVF, Garmin Vivofit; GVS, Garmin Vivosmart; GVA, Garmin Vivoactive; GFR, Garmin Forerunner 920XT; FC, Fitbit Charge; FHR, Fitbit Charge HR; WPO H, Withings Pulse Ox Hip; WPO W, Withings Pulse Ox Wrist. Mean absolute percentage error (MAPE), Intraclass Correlation Coefficient (ICC; 95%CI), typical error (TE), and upper & lower limits of agreement (LoA) for all Wearables for step count. Mean absolute percentage error (MAPE), Intraclass Correlation Coefficient (ICC; 95%CI), typical error (TE), and upper & lower limits of agreement (LoA) for all Wearables for covered distance. Mean absolute percentage error (MAPE), Intraclass Correlation Coefficient (ICC; 95%CI), typical error (TE), and upper & lower limits of agreement (LoA) for all Wearables for energy expenditure.

Step count

The mean step count (± SD) measured by the criterion measure was: 538 ± 29 (4.3 km·h−1); 785 ± 38 (7.2 km·h−1); 822 ± 51 (10.1 km·h−1); 863 ± 56 (13.0 km·h−1); 1,231 ± 127 (intermittent); 2,456 ± 145 (outdoor) steps. Bodymedia Sensewear, Polar Loop, and Beurer AS80 showed a substantial MAPE up to 16%, a low to moderate ICC, a large TE (up to 100 steps), and the broadest LoA. The other Wearables showed a small MAPE (<2%) for all test conditions as well as a good to excellent ICC. Garmin Vivosmart, Garmin Vivoactive, Fitbit Charge HR, Withings Pulse Ox Hip showed a small TE, and the narrowest LoA.

Covered distance

The mean covered distance (± SD) by the criterion measure was: 358 ± 4 (4.3 km·h−1); 601 ± 6 (7.2 km·h−1); 845 ± 12 (10.1 km·h−1); 1,088 ± 21 (13.0 km·h−1); 1,139 ± 45 (intermittent); 2,400 ± 0 (outdoor) m. Beurer AS80 showed a high MAPE (17.6 up to 51.9%) for all test conditions. Garmin Vivofit, Vivosmart, Vivoactive, Forerunner, Fibit Charge, Charge HR and Withings showed a moderate MAPE (1.3–29.9%) for all test conditions expect 7.2 km·h−1. The ICC for all Wearables was very low (<0.1). Garmin Vivosmart, Garmin Vivoactive, Fitbit Charge, and Fitbit Charge HR showed a small TE, and the narrowest LoA.

Energy expenditure

The mean EE (± SD) by the criterion measure were: 24 ± 6 (4.3 km·h−1); 47 ± 10 (7.2 km·h−1); 61 ± 13 (10.1 km·h−1); 74 ± 17 (13.0 km·h−1); 96 ± 18 (intermittent); 210 ± 49 (outdoor) kcal. Bodymedia Sensewear, Polar Loop, Beurer AS80 showed a high MAPE up to 56% for all test conditions. The Garmin, Fitbit and Withings Wearables showed a small to moderate MAPE (1.3–21.2 %) for 10.1 km·h−1, 13.0 km·h−1, and the Outdoor condition. Garmin Vivofit, Vivosmart, Vivoactive, Fitbit Charge and Charge HR showed a moderate to good ICC, whereas Bodymedia Sensewear, Polar Loop, Beurer AS80, Garmin Forerunner 920XT and Withings Pulse Ox showed a low ICC. Bodymedia Sensewear, Garmin Vivofit, Garmin Vivoactive, Fitbit Charge showed a small TE, and the narrowest LoA.

Discussion

The aim of the present study was to investigate the criterion-validity of eleven Wearables for step count, covered distance and EE over a large spectrum of constant and intermittent velocities reflecting sports conditions. The results indicate that most Wearables, except Beurer AS80, Polar Loop, Bodymedia Sensewear provide an acceptable level of validity concerning step count for all constant velocities, the intermittent protocol as well as for the outdoor condition. The parameters covered distance and EE, however, exhibited a low validity for any of the conditions for most of the Wearables. The Xaomi Miband did lack a high amount of data and we, therefore, want to discourage using this Wearable to monitor step count, distance, and EE in sports conditions. In line with the present study, other laboratory-based studies also showed generally high correlations for step count between the criterion measure and Wearables (Takacs et al., 2014; Diaz et al., 2015; Evenson et al., 2015). Tudor-Locke et al. (2006) stated that Wearables generally should not exceed a MAPE of 1% compared to the criterion measure during walking on a treadmill at a speed of 4.8 km·h−1 in order to be considered accurate. Garmin Vivosmart, Garmin Vivoactive, Garmin Forerunner 920 XT, Fitbit Charge HR, and Withings Pulse Ox (Hip) had a MAPE <1% over all test conditions. Fitbit Charge and Garmin Vivofit had a slightly higher MAPE of <3%, still representing good results. Bodymedia Sensewear, Polar Loop, and Beurer AS80 had MAPE between 3.7 and 15.5%, whereby all devices underestimated the number of steps taken. When errors were higher, the direction tended to be an under-estimation of step count by the tracker compared to the criterion. This may be particularly problematic at slow walking speeds (Evenson et al., 2015). Garmin Vivosmart, Garmin Vivoactive, Fitbit Charge HR, and Withings Pulse Ox indicated the narrowest LoA (less than 50 steps for the constant velocities). This can be considered as a relatively small range. The range between the upper and lower LoA of Bodymedia Sensewear, Polar Loop, and Beurer AS80 (up to 200 steps) are considered to be too large to be used interchangeably with the criterion measure. In a sport specific condition like a marathon run with an average velocity of 10.1 km·h−1 an average step count of 60.000 steps represents an error of +60 steps for Fitbit Charge HR or −7.500 steps for Bodymedia Sensewear. For the intermittent velocities, which are typical for most sport disciplines, the discrepancy was high, revealing an underestimation for all Wearables between −14 ± 40 steps (Garmin Vivosmart) up to −198 ± 91 (Withings Pulse Ox Wrist). For intermittent sports, like a 90 min competitive soccer game, players will cover on average about 13.000 steps, which represents a small error of −143 steps for Fitbit Charge HR/Garmin Vivosmart up to a high underestimation of 2.106 steps for Beurer AS80. The outdoor condition, which resembled the same velocity as the third speed on the treadmill (10.1 km·h−1), showed similar results as the laboratory testing using constant velocities. In summary, the step count for most of the Wearables, except Bodymedia Sensewear, Polar Loop, and Beurer AS80 showed to be valid. However, generally, there is a tendency to underestimate the number of steps. One might speculate, that a reduced arm movement while walking/running leads to an underestimation of the step count. Furthermore, it might be a problem of the adjustment of the sensitivity of the accelerometers and different algorithms. The manufacturers have the problem, that wearables should not count every single arm movement during daily life as a step. Therefore, the acceleration needs to exceed a certain threshold to be processed by the algorithm and to be counted as a step. The measurement of covered distance showed no consistent discrepancy over the different velocities between the Wearables and the criterion measure. The Wearables mainly showed an overestimation of distance for constant slower velocities (4.3 and 7.2 km·h−1) and an underestimation of distance for higher velocities (13.0 km·h−1). This is in line with the study of Takacs et al. (2014), showing an overestimation for slower speeds (3.2–4.7 km·h−1) and an underestimation for faster speeds (6.4 km·h−1). In elite sport fast running velocities often occur, and consequently, the covered distance will be underestimated in these instances with the presented Wearables. The highest MAPE (−18.1 to 58.3%) of all Wearables was reached at the velocity of 7.2 km·h−1, whereas the lower velocity of walking (4.3 km·h−1) showed a better MAPE (1.3 to 19%). The ICC ranged from 0.0 to 0.2 for all tested conditions, indicating poor agreement with the criterion measure. This is line with the study of Takacs et al. (2014), showing small ICC between 0.0 and 0.05. Although Garmin Vivosmart, Garmin Vivoactive, Fitbit Charge, and Fitbit Charge HR showed the narrowest LoA, the range is still insufficiently high. In sport specific situations, like a marathon run at 10.1 km·h−1, covered distance will be overestimated by ~2.94 km with Garmin Forerunner 920XT, or underestimated by ~16.9 km with Beurer AS80. In the intermittent protocol, the covered distance derived from Wearables show a high discrepancy compared to the criterion measure, with some Wearables overestimating (Withings Pulse Ox Hip, Garmin Forerunner 920XT, Garmin Vivoactive, Garmin Vivosmart), others underestimating this parameter (Fitbit Charge HR, Fitbit Charge, Garmin Vivofit, Beurer AS80). For intermittent sports, like a 90 min soccer game (mean distance 12 km), the covered distance will be underestimated by ~1.080 m using Withings Pulse Ox hip up to ~5.076 m using Beurer AS80 based on our findings. The outdoor condition (10.1 km·h−1) showed similar high MAPE compared to the laboratory condition with the same Wearables overestimating (Withings Pulse Ox Wrist and Hip, Garmin Forerunner 920XT, Garmin Vivoactive, Garmin Vivosmart) or underestimating (Fitbit Charge HR, Fitbit Charge, Garmin Vivofit, Beurer AS80) the covered distance. In summary, for monitoring the covered distance, no Wearable could achieve good validity for all laboratory-based constant and intermittent velocities as well as in the outdoor condition. We acknowledge that the covered distance can be assessed by other Wearables employing for example receivers for Global Navigation Satellite Systems such as Global Positioning Systems (Cummins et al., 2013) and it seems that this technology is superior to accelerometry to derive the covered distance in sports conditions. The measurement of EE showed no consistent discrepancy over the different velocities between the Wearables and the criterion measure. The Wearables mainly showed an overestimation of EE for constant slower velocities (4.3; 7.2; 10.1 km·h−1) and an underestimation of EE for higher velocities (13.0 km·h−1). Overall, Bodymedia Sensewear, Polar Loop, Beurer AS80 showed a low validity for all test conditions. The Garmin, Fitbit and Withings Wearables showed a better validity with small to moderate MAPE (1.3–21.2%) for the faster velocities (10.1 km·h−1, 13.0 km·h−1). The results are in line with a review of Evenson et al. (2015) showing a low validity for EE in 10 adult studies. Although Bodymedia Sensewear, Garmin Vivofit, Garmin Vivoactive, and Fitbit Charge showed the narrowest LoA, the range is still insufficiently high. The ICC ranged from moderate to substantial agreement, while larger bias show the tendency to underestimate EE. Extrapolated to a marathon run (~3,000 kcal), this equates to an error of ~86 kcal overestimation for Withings Pulse Ox Wrist up to ~820 kcal for Polar Loop for a runner of 70 kg with a finishing time of 4:13 h (McArdle et al., 2000). Fitbit Charge, Garmin Vivoactive, Garmin Vivosmart, and Polar Loop showed relative small MAPE (<5.6%) for the intermittent protocol, whereas the other devices mainly underestimate the EE (Withings Pulse Ox (Wrist or Hip), Garmin Forerunner 920XT, Garmin Vivofit, Beurer AS80, Bodymedia Sensewear). For intermittent sports, like a 90 min soccer game (mean EE ~1300 kcal), EE will be underestimated by ~17 kcal using Garmin Vivoactive up to ~630 kcal using Withings Pulse Ox hip. The outdoor condition showed a completely contrary pattern compared to the laboratory condition (10.1 km·h−1). While all devices underestimate the EE in the outdoor condition, most of the devices overestimate EE in the comparable laboratory condition. This is surprising, but may be an issue of reliability, an aspect we intentionally did not target in our study. To clarify this, we want to encourage researchers in conducting reliability studies on the presented Wearables. In summary, the presented Wearables should be used very cautiously to assess EE.

Limitations

Generally, we have to acknowledge some limitations of the present study. First, there might be some limitations arising from calculating EE via indirect calorimetry using the device Metamax 3B (Lighton, 2008). Even though the experiments were conducted within 2 weeks of time, which might limit the degradation of the oxygen sensor, previous studies showed, that the Metamax 3B produces acceptably stable and reliable results, but is not adequately valid during moderate and vigorous exercise without some further correction of VO2 and VCO2 (Macfarlane and Wong, 2012). As in every validation study, we cannot be entirely sure if some error arises from the criterion-measure and encourage to see the results of this study in light of these limitations. Second, the velocities on the treadmill were not randomized, as we expected that higher velocities would influence slower velocities more than the other way round. Therefore, we decided not to randomize the velocities, but to gradually increase the velocity. Additionally, during the 5 min rest periods, spirometric and heart rate values decreased to resting levels. Anyhow, we cannot completely discard a cardiovascular drift. Third, in comparison to several previous validation studies (Kooiman et al., 2015; Bai et al., 2016; An et al., 2017), we investigated a similar number of subjects. However, the relatively small sample size might limit the statistical power of the present results. There are several statistical approaches for validation studies. However, possibly no statistical approach will remain uncriticised and every approach has its advantages and drawbacks. According to previously published validation studies (Kooiman et al., 2015; Bai et al., 2016; An et al., 2017), we used the statistical approach from this studies.

Conclusion

In our study, most Wearables provide an acceptable level of validity for step counts at different constant and intermittent running velocities reflecting sports conditions. The most valid Wearables, represented by the smallest MAPE, to monitor step count were Garmin Vivosmart, Garmin Vivoactive, Garmin Forerunner 920XT, Fitbit Charge, Fitbit Charge HR and Withings Pulse Ox (Hip). Yet, the covered distance, as well as the EE, could not be assessed validly with the investigated Wearables. Especially in sport specific conditions, like a marathon run or a 90 min soccer game, covered distance and EE showed high errors for nearly all Wearables. Consequently, covered distance and EE should not be monitored with the presented Wearables.

Author contributions

All authors listed have made a substantial, direct and intellectual contribution to the work, and approved it for publication.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  22 in total

1.  Validity, reliability and stability of the portable Cortex Metamax 3B gas analysis system.

Authors:  D J Macfarlane; P Wong
Journal:  Eur J Appl Physiol       Date:  2011-11-11       Impact factor: 3.078

2.  Evaluation of quality of commercial pedometers.

Authors:  Catrine Tudor-Locke; Susan B Sisson; Sarah M Lee; Cora L Craig; Ronald C Plotnikoff; Adrian Bauman
Journal:  Can J Public Health       Date:  2006 Mar-Apr

3.  Validation of the Fitbit One activity monitor device during treadmill walking.

Authors:  Judit Takacs; Courtney L Pollock; Jerrad R Guenther; Mohammadreza Bahar; Christopher Napier; Michael A Hunt
Journal:  J Sci Med Sport       Date:  2013-10-31       Impact factor: 4.319

4.  Validity and reliability of the Cortex MetaMax3B portable metabolic system.

Authors:  Andrew J Vogler; Anthony J Rice; Christopher J Gore
Journal:  J Sports Sci       Date:  2010-05       Impact factor: 3.337

5.  Are Currently Available Wearable Devices for Activity Tracking and Heart Rate Monitoring Accurate, Precise, and Medically Beneficial?

Authors:  Fatema El-Amrawy; Mohamed Ismail Nounou
Journal:  Healthc Inform Res       Date:  2015-10-31

6.  Accuracy of the vivofit activity tracker.

Authors:  Sana'a A Alsubheen; Amanda M George; Alicia Baker; Linda E Rohr; Fabien A Basset
Journal:  J Med Eng Technol       Date:  2016-06-07

7.  Differences in oxygen uptake but equivalent energy expenditure between a brief bout of cycling and running.

Authors:  Christopher B Scott; Nathanael D Littlefield; Jeffrey D Chason; Michael P Bunker; Elizabeth M Asselin
Journal:  Nutr Metab (Lond)       Date:  2006-01-03       Impact factor: 4.169

Review 8.  Assessment of physical activity and energy expenditure: an overview of objective measures.

Authors:  Andrew P Hills; Najat Mokhtar; Nuala M Byrne
Journal:  Front Nutr       Date:  2014-06-16

Review 9.  Systematic review of the validity and reliability of consumer-wearable activity trackers.

Authors:  Kelly R Evenson; Michelle M Goto; Robert D Furberg
Journal:  Int J Behav Nutr Phys Act       Date:  2015-12-18       Impact factor: 6.457

Review 10.  Comparison of Non-Invasive Individual Monitoring of the Training and Health of Athletes with Commercially Available Wearable Technologies.

Authors:  Peter Düking; Andreas Hotho; Hans-Christer Holmberg; Franz Konstantin Fuss; Billy Sperlich
Journal:  Front Physiol       Date:  2016-03-09       Impact factor: 4.566

View more
  31 in total

Review 1.  Perspective: Food-Based Dietary Guidelines in Europe-Scientific Concepts, Current Status, and Perspectives.

Authors:  Angela Bechthold; Heiner Boeing; Inge Tetens; Lukas Schwingshackl; Ute Nöthlings
Journal:  Adv Nutr       Date:  2018-09-01       Impact factor: 8.701

2.  Estimation of Heart Rate and Energy Expenditure Using a Smart Bracelet during Different Exercise Intensities: A Reliability and Validity Study.

Authors:  Yihui Cai; Zi Wang; Wanxia Zhang; Weiya Kong; Jiayao Jiang; Ruobing Zhao; Dongxue Wang; Leyi Feng; Guoxin Ni
Journal:  Sensors (Basel)       Date:  2022-06-21       Impact factor: 3.847

3.  Review of Validity and Reliability of Garmin Activity Trackers.

Authors:  Kelly R Evenson; Camden L Spade
Journal:  J Meas Phys Behav       Date:  2020-06

Review 4.  Smart Technology and Orthopaedic Surgery: Current Concepts Regarding the Impact of Smartphones and Wearable Technology on Our Patients and Practice.

Authors:  Neil V Shah; Richard Gold; Qurratul-Ain Dar; Bassel G Diebo; Carl B Paulino; Qais Naziri
Journal:  Curr Rev Musculoskelet Med       Date:  2021-11-03

Review 5.  Wearable activity trackers-advanced technology or advanced marketing?

Authors:  Ren-Jay Shei; Ian G Holder; Alicia S Oumsang; Brittni A Paris; Hunter L Paris
Journal:  Eur J Appl Physiol       Date:  2022-04-21       Impact factor: 3.346

Review 6.  Accuracy and Precision of Energy Expenditure, Heart Rate, and Steps Measured by Combined-Sensing Fitbits Against Reference Measures: Systematic Review and Meta-analysis.

Authors:  Guillaume Chevance; Natalie M Golaszewski; Elizabeth Tipton; Eric B Hekler; Matthew Buman; Gregory J Welk; Kevin Patrick; Job G Godino
Journal:  JMIR Mhealth Uhealth       Date:  2022-04-13       Impact factor: 4.947

7.  Integrated Framework of Load Monitoring by a Combination of Smartphone Applications, Wearables and Point-of-Care Testing Provides Feedback that Allows Individual Responsive Adjustments to Activities of Daily Living.

Authors:  Peter Düking; Silvia Achtzehn; Hans-Christer Holmberg; Billy Sperlich
Journal:  Sensors (Basel)       Date:  2018-05-19       Impact factor: 3.576

8.  Evaluation of a Low-Cost Commercial Actigraph and Its Potential Use in Detecting Cultural Variations in Physical Activity and Sleep.

Authors:  Pavlos Topalidis; Cristina Florea; Esther-Sevil Eigl; Anton Kurapov; Carlos Alberto Beltran Leon; Manuel Schabus
Journal:  Sensors (Basel)       Date:  2021-05-29       Impact factor: 3.847

9.  Acute Physiological Effects of Continuous Versus Intermittent Walking During Golf in Individuals With Knee Osteoarthritis: A Pilot Study.

Authors:  Prakash Jayabalan; Rachel Bergman; Emilio Jauregui; Chad Hanaoka; Aaron M Stoker
Journal:  Am J Phys Med Rehabil       Date:  2021-07-26       Impact factor: 3.412

10.  Energy expenditure estimation from respiration variables.

Authors:  Rahel Gilgen-Ammann; Marcel Koller; Céline Huber; Riikka Ahola; Topi Korhonen; Thomas Wyss
Journal:  Sci Rep       Date:  2017-11-22       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.