| Literature DB >> 35260991 |
Rob Argent1,2,3, Megan Hetherington-Rauth4, Julie Stang5, Jakob Tarp5, Francisco B Ortega6,7, Pablo Molina-Garcia6, Moritz Schumann8,9, Wilhelm Bloch8, Sulin Cheng8,9,10, Anders Grøntved11, Jan Christian Brønd11, Ulf Ekelund5, Luis B Sardinha4, Brian Caulfield12,13.
Abstract
BACKGROUND: Consumer wearables and smartphone devices commonly offer an estimate of energy expenditure (EE) to assist in the objective monitoring of physical activity to the general population. Alongside consumers, healthcare professionals and researchers are seeking to utilise these devices for the monitoring of training and improving human health. However, the methods of validation and reporting of EE estimation in these devices lacks rigour, negatively impacting on the ability to make comparisons between devices and provide transparent accuracy.Entities:
Mesh:
Year: 2022 PMID: 35260991 PMCID: PMC9325806 DOI: 10.1007/s40279-022-01665-4
Source DB: PubMed Journal: Sports Med ISSN: 0112-1642 Impact factor: 11.928
Fig. 1PRISMA flowchart of the systematic review process. PRISMA Preferred Reporting Items for Systematic Reviews and Meta-Analyses
Fig. 2Six domains and corresponding variables of interest identified as being of importance in the validation of consumer wearable and smartphone estimation of EE. EE energy expenditure
Proposed best-practice protocols for the validation of wearable and smartphone-derived energy expenditure
| Domain | Variable | Protocol consideration | Reporting consideration |
|---|---|---|---|
| Target population | Population | If the purpose is to validate wearable-derived EE for the general healthy population, a broad heterogeneous sample should be used If the purpose is to use the wearable in a specific clinical application, validation should be performed in an homogenous sample | Report the target population used and method of recruitment |
| Age | When assessing validity for the general healthy population, participants should be representative of a specific group from children/adolescents, adults, or older adults | Average and range of sample age should be reported | |
| Sex | Include an equal sample of males and females within the study | The number of female and male participants should be reported Report on measures of body size and composition (i.e. weight, height, BMI) | |
| Ethnicity | If HR is contributing to EE estimation, ethnicity and skin tone should be reported * If technology used by the device to estimate EE is unknown, then ethnicity and skin tone should be reported | Report ethnicity and race of the study population Report skin tone using the Fitzpatrick scale | |
| Sample size | A sample size calculation should be completed based on the mean and standard error of the differences between device and criterion from either previously published or pilot study data If previous data are not available, we recommend at least 45 participants | A sample size justification should be provided The sample size recruited and the sample size analysed should be clearly reported | |
| Criterion measure | Reference test selection | Report the criterion measure used, calibration criteria, and laboratory-specific %CV For DLW, specifics on isotope dosing, collections and analysis method should be reported | |
| Index device | Placement | Report the placement of the device and information on the order of placement if more than one wrist-worn device is used Report on distance from the wrist | |
| Set-up | All demographic details required by the device for participant initiation should be inputted If the device has the option to select a specific exercise mode (i.e. indoor running, cycling, walking, etc.), choose the mode that best reflects the activity being performed Device should be reset back to baseline/factory settings between each participant to ensure there is no influence on algorithms based on historical use | Report the device model and version and whether firmware updates took place during data acquisition Report what demographic details are imputed into the device per participant for initiation Report what mode (if any) is used during each activity (i.e. indoor running, cycling, walking, etc.) If individual calibration is required, ensure this is reported in detail | |
| Testing conditions | Exercise/activity-specific EE | Activities should be chosen based on the predefined activity modes available on the index device Each work-bout should last at least 6 min. Due consideration should be given to the potential need for a recovery period between activity bouts when higher-intensity exercises are examined to allow sufficient recovery to approximately RMR Consideration should be given to establishing steady state within each activity The protocol should include a wide-range of intensity zones: Walking—at least three intensities plus one at incline > 5% Running—at least two intensities plus one at incline > 5% * If this takes place on a treadmill, then all activities should include a 1–2% incline to account for the metabolic cost of treadmill propulsion Cycling—at least three intensities The intensities and activities used should take into consideration the characteristics and capabilities of the sample, with calculated intensities based on the participant’s maximal HR or VO2max being the preferable option over selecting absolute values or self-selected intensities. When not feasible to calculate personalised fitness levels, reporting absolute exercise intensities (i.e. watts, speed, rpm, etc.) are recommended over participant self-selected intensities | Report the type of activity performed and equipment used (i.e. treadmill running, cycle ergometer, etc.). For each activity, include information on the duration, inclination and intensities used, including how intensity was determined (HR/VO2max, absolute values, or self-selected pace) Report on if and how steady state was established Report on if and how anaerobic EE was measured when exercising at higher intensities |
| Free-living EE | The free-living protocol should involve a duration of 7–14 days whereby the participant completes activities of daily living in an unconstrained environment (home, work, travelling, etc.) | Describe the duration of the free-living protocol and the instructions given to the participants during this time Specify whether participants were asked to use specific modes during the period Report on the adherence of participants to wearing the index device as instructed Report methods of interpolation of missing data in free-living evaluations | |
| Processing | Criterion | At higher/maximal intensities, appropriate methods should be utilised to adjust the criterion measurement to incorporate anaerobic EE | The equations used to determine EE from gaseous consumption and related assumptions should be provided For calculating AEE from TEE derived from DLW, clarify how RMR was determined (i.e. estimated from direct/indirect calorimetry or equation) in addition to any other assumptions or equations used Results should be reported as kilocalories per minute Where possible, raw data should be made available in an appropriate repository in line with open science principles |
| Index | If available, it is preferable to use minute-by-minute data for calculating AEE from the device If minute-by-minute data are not available, calculate EE at steady state for each activity; note EE once steady state is reached (pre-EE) and then again post activity. Subtract the post-EE from the pre-EE and divide by the number of minutes between pre- to post-EE to get kilocalories per minute | Report on how the device displays EE (i.e. AEE, TEE) Results should be reported as kilocalories per minute If available, describe the algorithm and inputs used by the device (i.e. HR, accelerometry, GPS, demographics, etc.) to derive EE. If no information is available, it should be reported that EE was derived using a proprietary algorithm | |
| Synchronisation and epochs for analysis | When possible, use demarcation events for marking timestamps of achieving steady state and end of activity for both criterion and index measure to facilitate synchronisation | Provide as much detail as possible on the synchronisation process in order to allow for study replication | |
| Statistical analysis | Statistical tests | To assess device accuracy, the following statistical tests should be performed per activity/category: 1. Bland–Altman with limits of agreement 2. Least products regression of the differences against the means 3. MAPE Subgroup analysis is encouraged if sample size allows (e.g. sex, age category, ethnicity, BMI, etc.) | Include an illustration of the Bland–Altman plots (a supplementary file may be needed if multiple index devices are tested) Binary conclusions about the validity of the device should not be made if a formal sample size analysis has not been conducted |
EE energy expenditure, DLW doubly labelled water, TEE total energy expenditure, AEE activity energy expenditure, HR heart rate, RMR resting metabolic rate, VO maximal oxygen consumption, BMI body mass index, %CV percentage coefficient of variation, MAPE mean absolute percentage error
Checklist of items to be considered during the validation protocol of wearable and smartphones to estimate energy expenditure
| Age |
| Children (< 12 years) |
| Adolescents (12–18 years) |
| Adults (18–65 years) |
| Older adults (> 65 years) |
| Sex (equal sample of males and females) |
Sample size Calculated based on previously published or pilot study data OR If previous data are not available, at least 45 participants |
Direct or indirect calorimetry DLW |
| Placement of criterion according to the manufacturer’s instructions (applies only to indirect calorimetry) |
Wearable activity monitors placed according to the manufacturer’s instructions Smartphones either handheld or placed in places typically used in everyday living (i.e. pocket, handbags, belt phone holder) |
All demographic details required by the device are inputted Specific exercise mode chosen if applicable |
| Walking, running, and/or cycling with three different intensities and one inclination > 5% |
| Intensity based on either participant maximal HR or VO2max, absolute value, or participant self-selected pace |
| At least 6-min duration for each activity bout with ample recovery time allotted in-between activity bouts performed at higher intensities |
| Participant wears index device for 7–14 days while simultaneously undergoing a DLW protocol |
| Criterion measure processing |
| |
| |
| Index measure processing |
Minute-by-minute data for calculating AEE from the device used if available If minute-by-minute data are not available, pre-EE (EE at the start of steady state for each activity) is subtracted from post-EE (EE post activity) and divided by the number of minutes between pre- to post-EE to get kilocalories per minute |
| Index and criterion synchronisation |
| Bland–Altman with limits of agreement per activity/category |
| Least products regression of the differences against the means |
| MAPE |
DLW doubly labelled water, HR heart rate, EE energy expenditure, AEE activity energy expenditure, RMR resting metabolic rate, VO maximal oxygen consumption, MAPE mean absolute percentage error
| This systematic literature review of validation studies of consumer wearables and smartphone applications has highlighted a heterogeneity between validation methodologies in key domains, particularly in the target populations, data processing and statistical approaches, giving rise to validation bias. |
| The lack of free-living validation leads to limited abilities of users to understand the accuracy of wearable devices and smartphones in estimating energy expenditure (EE) in the intended use case. |
| In this article, the INTERLIVE network provides best-practice recommendations to be used in future protocols to move towards a more accurate, transparent and comparable validation of EE estimation derived from consumer wearables and smartphone applications. |