| Literature DB >> 35072942 |
Pablo Molina-Garcia1,2, Hannah L Notbohm3, Moritz Schumann3,4, Rob Argent5,6,7, Megan Hetherington-Rauth8, Julie Stang9, Wilhelm Bloch3, Sulin Cheng3,4, Ulf Ekelund9, Luis B Sardinha8, Brian Caulfield5,6, Jan Christian Brønd10, Anders Grøntved10, Francisco B Ortega11,12,13.
Abstract
BACKGROUND: Technological advances have recently made possible the estimation of maximal oxygen consumption (VO2max) by consumer wearables. However, the validity of such estimations has not been systematically summarized using meta-analytic methods and there are no standards guiding the validation protocols.Entities:
Mesh:
Year: 2022 PMID: 35072942 PMCID: PMC9213394 DOI: 10.1007/s40279-021-01639-y
Source DB: PubMed Journal: Sports Med ISSN: 0112-1642 Impact factor: 11.928
Fig. 1Flowchart of the systematic review process
Characteristics of included studies (N = 14)
| References | Participants | Age (years) | Wearable device. HR assessment | Setup information | Reference standard | Statistical analysis | ||
|---|---|---|---|---|---|---|---|---|
| Anderson et al. 2019 [ | 25 recreational runners, men (17) and women (8) | 39.4 ± 10.8 | Garmin Fenix 5X. Wrist-measured HR (PPG) | Age, sex, height, and weight | Exercise test: walking or jogging warm-up + 10-min run at their highest perceived pace + 5-min cool down walking | Indirect calorimetry: ParvoMedics TrueOne 2400 | Treadmill: Bruce running protocol (speed and inclination increase each 3 min) | |
| Carrier et al. 2020 [ | 17 recreational runners, men (8) and women (9) | 24.8 ± 4.3 | Garmin Fenix 3 + chest HR strap | HRmax and unspecified info | Exercise test: 15-min outdoor run above 70% HRmax | Indirect calorimetry: ParvoMedics | Treadmill: modified Costill-Fox running protocol (speed increase first and 2% inclination increase second each 2 min) | |
| Cooper and Shafer 2019 [ | 19 healthy, men (9) and women (10) | 21.9 ± 4.2 | Polar A300 + chest HR strap | Age, sex, height, and weight | Resting HR: 5 min supine position | Indirect calorimetry: Cosmed Fitmate Pro | Treadmill: Bruce running protocol (speed and inclination increase each 3 min) | Pearson’s |
| Crouter et al. 2004 [ | 20 active men (10) and women (10) | Men: 26.0 ± 3.1 Women: 23.0 ± 2.4 | Polar S410 + chest HR strap | Age, sex, height, weight, and physical activity level | Resting HR: supine position | Indirect calorimetry: ParvoMedics TrueMax 2400 | Treadmill: individual ramp running protocol (individual start, increase 1% incline per min) | |
| Esco et al. 2011 [ | 50 active men | 24.0 ± 5.1 | Polar F11 + chest HR strap | Age, sex, height, weight, and physical activity level | Resting HR: supine position | Indirect calorimetry: ParvoMedics TrueOne 2400 | Treadmill: Bruce running protocol (speed and inclination increase each 3 min) | |
| Esco et al. 2014 [ | 20 female soccer players | 21.5 ± 1.7 | Polar FT40 + chest HR strap | Age, sex, height, weight, and physical activity level | Resting HR: 5 min supine position | Indirect calorimetry: ParvoMedics TrueOne 2400 | Treadmill: Bruce running protocol (speed and inclination increase each 3 min) | Bland–Altman and MAPE |
| Freeberg et al. 2019 [ | 30 healthy, men (17) and women (13) | 21.7 ± 3.1 | Fitbit Charge 2. Wrist-measured HR (PPG) | Not specified | Exercise test: 2 × 10 min at highest intensity possible | Indirect calorimetry: ParvoMedics TrueOne 2400 | Treadmill: individual ramp running protocol (4–7 mph, increase 1% incline per min) + verification test | ANOVA, Pearson’s |
| Klepin et al. 2019 [ | 65 healthy men (27) and women (33) | 31.0 ± 7.3 | Fitbit Charge 2. Wrist-measured HR (PPG) | Age, sex, handedness, height, and weight | Exercise test: 3 × 15 min at comfortable pace | Indirect calorimetry: Cosmed | Treadmill: ramp running protocol (5 mph, increase by 0.75 MET per min) | Bland–Altman and MAPE |
| Kraft and Dow 2017 [ | 16 healthy, men (10) and women (6) | 22.4 ± 5.2 | Garmin Forerunner 920XT + chest HR strap | Height and weight | Exercise test: 10 min self-paced run | Indirect calorimetry: ParvoMedics TrueOne 2400 | Treadmill: Bruce running protocol (speed and inclination increase each 3 min) | |
| Kraft and Dow 2018 [ | 18 healthy, men (12) and women (6) | 21.3 ± 2.2 | Polar RS300X + chest HR strap | Age, height, weight, sex, and activity level | Resting HR: 5 min supine position | Indirect calorimetry: ParvoMedics TrueOne 2400 | Treadmill: Bruce running protocol (speed and inclination increase each 3 min) | |
| Lowe et al. 2010 [ | 32 active women | 20.3 ± 1.9 | Polar F6 + chest HR strap | Age, sex, height, and weight | Resting HR: 5 min sitting position | Indirect calorimetry: ParvoMedics | Treadmill: Bruce running protocol (speed and inclination increase each 3 min) | |
| Passler et al. 2019 [ | 24 healthy, men (13) and women (11) | 23.4 ± 2.1 | Polar V800. Wrist-measured HR (PPG) | Not specified | Resting test: 10 min supine position (pretest), 3 min supine position, 3 min standing position | Indirect calorimetry: Metalyzer 3B-R3, Cortex | Treadmill: ramp protocol (7 km·h−1, increase by 0.5 km·h−1 per min) | |
| Garmin Forerunner 920 XT. Wrist-measured HR (PPG) | Not specified | Exercise test: > 10 min self-paced run | ||||||
| Snyder et al. 2019 [ | 44 healthy, men (22) and women (22) | Men: 24.7 ± 5.4 Women: 25.0 ± 4.3 | Polar V800 + chest HR strap | Age, sex, height, weight, and physical activity level | Resting HR: 5 min supine position | Indirect calorimetry: ParvoMedics TrueOne 2400 | Treadmill: Bruce running protocol (speed and inclination increase each 3 min) | ANOVA, Bland–Altman and Pearson’s |
| Garmin Forerunner 230 + chest HR strap | Age, sex, height, weight, and HRmax | Exercise test: 10 min self- paced run | ||||||
| Wagner et al. 2020 [ | 23 healthy men | 23.1 ± 2.5 | Garmin GF5 | Exercise test: 10 min and 30 s all out run | Indirect calorimetry: Metalyzer 3B, Cortex | Treadmill: ramp running protocol (10 km·h−1, incline 5%, increase by 2.5% per min) | Bland–Altman and ICC |
ANOVA analysis of variance, HR heart rate, HR maximum heart rate, ICC intraclass correlation coefficient, MAPE mean absolute percentage error, MET metabolic equivalent, PPG photoplethysmography, VO maximal oxygen consumption
Fig. 2Risk of bias assessment divided by domains
Fig. 3Pooled bias and SE for wearables VO2max using resting conditions (A) and exercise tests (B) relative to the reference standard. A negative bias represents an underestimation and a positive bias an overestimation of the VO2max estimated from wearables in comparison to the reference standard. CI confidence interval, SE standard error, VO maximal oxygen consumption. *Heart rate was measured with chest strap. In the remaining articles not flagged with an asterisk, heart rate was measured using photoplethysmography technology on the wrist
Fig. 4Bland–Altman meta-analysis for the comparison of wearable-derived VO2max using resting conditions and exercise tests with the reference VO2max. The y-axis is the bias between the wearable and reference VO2max (wearable − reference), with positive values indicating an overestimation and negative values an underestimation by the wearable. The x-axis is the mean VO2max between the wearable and reference. CI confidence interval, VO maximal oxygen consumption. *Heart rate was measured with chest strap. In the remaining articles not flagged with an asterisk, heart rate was measured using photoplethysmography technology on the wrist
Fig. 5Six domains and corresponding variables of interest identified as being of importance in the validation of consumer wearable estimation of VO2max. VO maximal oxygen consumption
The proposed best-practice protocols for the validation of wearable-derived VO2max
| Domain | Variable | Protocol consideration | Reporting consideration |
|---|---|---|---|
| Target population | Population | If purpose is to validate wearable-derived If purpose is to use wearables in specific clinical applications, validation should be performed in homogenous samples | Report the inclusion/exclusion criteria defining the target population and recruitment methodology and provide basic demographic information (e.g., age, height, weight, or BMI) |
| Age | Validation protocols targeting a general healthy population should include the main age ranges: children (< 12 years), adolescents and adults (13–64 years), and older adults | Average and range of sample age should be reported | |
| Sex | Include an equal sample of males and females within the study | The number of female and male participants should be reported | |
| Sample size | For those studies aimed at testing the accuracy of a given device, a sample size calculation should be performed based on the previously published data according to Lu et al.[ | Describe the sample size calculation if included If sample size calculation is not feasible, cite previous literature supporting the inclusion of a recommended sample size Describe the flow of sample size recruited and analyzed | |
| Reference standard | Indirect calorimetry | The gold standard for the assessment of Any brand of metabolic cart is accepted when reporting validity and reliability, as well as measuring both The metabolic cart should be properly calibrated before the | Indicate if indirect calorimetry was used Report the metabolic cart used, the type of recording technology (e.g., breath-by-breath), and whether the metabolic cart used is valid and reliable Describe the calibration process of the metabolic cart |
| Index measure | Wearable devices | Consumer wearables should be worn in ecological body locations in accordance with the manufacturer’s instructions. If wrist worn, a maximum of 2 devices per wrist should be used at the same time, with placement being randomly counterbalanced between participants Wearable devices can measure HR with PPG and/or chest-strap technology, and this may have an impact on the | Report the placement of the device and information on order of placement if more than one wrist worn device is used Specify whether HR was recorded with PPG on wrist/arm (or others) or chest-strap technology |
| Testing protocols and conditions for both reference and index measure | Maximal graded exercise testing with indirect calorimetry | The accepted protocol to assess Maximal test requires participants to perform to the point of volitional fatigue, and at least two accepted criteria are recommended to ensure that participants are reaching the maximum effort during the tests. The ACSM proposes several maximum-effort criteria that can be used [ A verification phase after the maximal test is recommended to compare both Any type of exercise testing is accepted (e.g., walking, running, or biking) as long as it adapts to the type of activity in which the consumer wearable is intended to be validated In populations unable to perform maximal test, submaximal exercise-based equations might be an alternative to predict | Report whether maximal or submaximal exercise test is being used. In the case of submaximal test, provide a rationale of its implementation and specify the exercise-based equations used In maximal exercise test, report the need for reaching volitional fatigue and indicate the maximum-effort criteria included (at least two criteria) Report the type of exercise testing used as well as its characteristics (e.g., increase in the ramp inclination in treadmill tests or power increase in cycle-ergometer tests) |
| Standardized conditions before the reference and index measure | Participants should not consume a significant caloric uptake at least 2 h before the exercise test No caffeine, similar stimulants, or alcohol should be consumed 24 h before the exercise test No intensive sports activities should be performed 48 h before the exercise test Participants should not take any medication that may alter the normal HR response to a maximal exercise The exercise test should begin with at least 2–3 min warm-up | Report the standardized conditions followed by participants Describe the warm-up characteristics | |
| Wearable device set up | Follow the manufacturer’s instructions for the Provide all the information required by the device, since in some cases this is used to improve the If the device has the option to select a specific exercise mode (i.e., indoor running, cycling, walking, etc.), choose the mode that best reflects the activity that is going to be performed In those wearable devices using GPS data, it is recommended to perform the test outdoor to ensure a proper GPS connection | Report the device model and version Report what demographic details are input into the device per participant for initiation Report what mode (if any) is used during each activity (i.e., indoor running, cycling, walking, etc.) If GPS is used, indicate that the satellite connection was checked before the exercise test | |
| Data processing | Indirect calorimetry processing | If a time average is used to reduce variability in the indirect calorimetry data, typically this should be between 15 and 30 s [ If a breath average is used, a 15-breath running average is recommended [ Confirm that the maximum-effort criteria were met when interpreting the | Report the time-averaged or breath-averaged sampling used Report whether maximal or peak Detail the data processing conducted in the |
| Time interval between evaluations | If resting conditions are used for wearable If the wearable test involves exercising, between 24 and 48 h is recommended to ensure an effective muscle recovery. If the maximal test is evaluated first, a time interval between 48 and 72 h is recommended [ | Report the time interval between both assessments | |
| Statistical analysis | Statistical tests | To assess device accuracy, the following statistical tests should be performed: 1. Bland–Altman with limits of agreement 2. Least product regression of the difference against the means 3. MAPE Subgroup analysis is encouraged if sample size allows. (e.g., sex, age category, ethnicity, BMI) | Include Bland–Altman plots for a visual inspection of the validity results Binary conclusions about the validity of the device should not be made if a formal sample size analysis has not been conducted |
ACSM American College of Sports Medicine, BMI body mass index, HR heart rate, MAPE mean absolute percentage error, PPG photoplethysmography, VO maximal oxygen consumption
The INTERLIVE checklist to be considered for the validation protocol of wearable to estimate maximal oxygen consumption (VO2max)
| Target population assessment |
| Age |
| Children (< 12 years) |
| Adolescents (12–18 years) |
| Adults (18–65 years) |
| Older adults (> 65 years) |
| Sex (equal sample of males and females) |
Sample size Calculated based on previously published or pilot study data OR If previous data is not available, sample of convenience ( |
| Reference standard |
| The gold standard is a maximal exercise test in laboratory conditions with indirect calorimetry |
| Any brand of metabolic cart is accepted and should be calibrated following manufacturer’s instructions |
| Index device assessment |
Consumer wearables placed according to manufacturer’s instructions to be tested in ecological locations Hear rate can be measured with both chest strap or PPG, and it should be reported which of them was used |
| Testing protocols and conditions |
| To consider at least 2 maximal-effort criteria during the incremental test |
| A verification phase after the maximal test is recommended to corroborate the |
| Any type of exercise testing is accepted (e.g., walking, running, or biking) as long as it adapts to the type of activity in which the consumer wearable is intended to be validated |
| Control the standardized conditions before the maximal exercise test |
| Follow the manufacturer’s instructions for the |
| Provide all the setup information required by the devices |
| If exercise mode is available, choose the one that best reflects the activity to be performed |
| Ensure an optimal GPS connection when this data is used |
| Processing |
| If |
| If a breath-by-breath average is used, a 15-breath running average is recommended |
| Confirm that the maximum-effort criteria were met when interpreting the |
| In those wearables using resting conditions, no time interval is needed |
| In exercise conditions, an interval between 24 and 48 h is recommended |
| Statistical analysis |
| Bland–Altman with limits of agreement |
| Least products regression of the differences against the means |
| MAPE |
See the Table 2 for more detailed information about each item
INTERLIVE Towards Intelligent Health and Well-Being Network of Physical Activity Assessment, MAPE mean absolute percentage error, PPG photoplethysmography
| Wearables using exercise-based algorithms provide higher accuracy in the estimation of maximal oxygen consumption ( |
| Wearables using exercise-based estimation seem to be optimal for measuring |
| In this article, the Towards Intelligent Health and Well-Being Network of Physical Activity Assessment (INTERLIVE) network provides best-practice recommendations to be used in future protocols to move towards a more accurate, transparent and comparable validation of |