| Literature DB >> 33361276 |
William Johnston1,2, Pedro B Judice3,4, Pablo Molina García5, Jan M Mühlen6, Esben Lykke Skovgaard7, Julie Stang8, Moritz Schumann6,9, Shulin Cheng6,9, Wilhelm Bloch6, Jan Christian Brønd7, Ulf Ekelund8, Anders Grøntved7, Brian Caulfield1,2, Francisco B Ortega5, Luis B Sardinha10.
Abstract
Consumer wearable and smartphone devices provide an accessible means to objectively measure physical activity (PA) through step counts. With the increasing proliferation of this technology, consumers, practitioners and researchers are interested in leveraging these devices as a means to track and facilitate PA behavioural change. However, while the acceptance of these devices is increasing, the validity of many consumer devices have not been rigorously and transparently evaluated. The Towards Intelligent Health and Well-Being Network of Physical Activity Assessment (INTERLIVE) is a joint European initiative of six universities and one industrial partner. The consortium was founded in 2019 and strives to develop best-practice recommendations for evaluating the validity of consumer wearables and smartphones. This expert statement presents a best-practice consumer wearable and smartphone step counter validation protocol. A two-step process was used to aggregate data and form a scientific foundation for the development of an optimal and feasible validation protocol: (1) a systematic literature review and (2) additional searches of the wider literature pertaining to factors that may introduce bias during the validation of these devices. The systematic literature review process identified 2897 potential articles, with 85 articles deemed eligible for the final dataset. From the synthesised data, we identified a set of six key domains to be considered during design and reporting of validation studies: target population, criterion measure, index measure, validation conditions, data processing and statistical analysis. Based on these six domains, a set of key variables of interest were identified and a 'basic' and 'advanced' multistage protocol for the validation of consumer wearable and smartphone step counters was developed. The INTERLIVE consortium recommends that the proposed protocol is used when considering the validation of any consumer wearable or smartphone step counter. Checklists have been provided to guide validation protocol development and reporting. The network also provide guidance for future research activities, highlighting the imminent need for the development of feasible alternative 'gold-standard' criterion measures for free-living validation. Adherence to these validation and reporting standards will help ensure methodological and reporting consistency, facilitating comparison between consumer devices. Ultimately, this will ensure that as these devices are integrated into standard medical care, consumers, practitioners, industry and researchers can use this technology safely and to its full potential. © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY. Published by BMJ.Entities:
Keywords: accelerometer; consensus statement; physical activity; validity; walking
Year: 2020 PMID: 33361276 PMCID: PMC8273687 DOI: 10.1136/bjsports-2020-103147
Source DB: PubMed Journal: Br J Sports Med ISSN: 0306-3674 Impact factor: 13.800
Figure 1PRISMA flow chart of the systematic review process. PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-Analyses.
Figure 2Six domains and corresponding set of key variables of interest which were identified as being of importance when considering validation of consumer wearable and smartphone step counters.
Figure 3The multistage protocols for the best-practice validation of consumer wearable and smartphone step counters. We recommend a minimum validation in those with typical gait (basic protocol), across laboratory (step 1) and semifree-living (step 2).
The proposed best-practice protocols for the validation of wearable and smartphone step counting devices
| Methodological domains | Methodological variables | Protocol considerations | Reporting considerations |
| 1. Target population |
| Cross section of participants across the spectrum of ages*, | Provide detailed demographics for each group (ie, age, height, weight, BMI, health condition). |
|
| Equal Sample of males and females in each group. | Report the sex distribution in each group. | |
|
| If the focus of the study is to conduct hypothesis testing about a predefined minimal level of accuracy, a sample size calculation should be completed based on the previously published or pilot study mean and SE of the differences between the devices, using the methodologies outline by Lu | Detail sample recruited and sample analysed for all groups and in all levels of the study. | |
| 2. Criterion measure |
| Video camera with multiple observer (≥2) or equivalent*. | Report camera setup and the level of agreement between observers. |
|
|
| Describe in detail the setup including special situations (eg, going to the toilet). | |
| 3. Index measure |
| Should be placed in an ecological body location which the consumer device was designed for. | Report the exact model and version for the index measure including hardware and software and report in detail the placement protocol. |
| 4. Testing conditions | 4.1. Laboratory assessment protocol |
3 min walk test (30 m walkway) 3 min zig-zag walk test (30 m walkway - three cones) 3 x stair test ascent/descent (minimum 12 step staircase) 3 min stationary cycling 2×1 min steady state treadmill test (consistent, self-selected walking speed). 3 min run test (30 m walkway) 3 min zigzag run test (30 m walkway - three cones) | Report cadence/ gait velocity for each trial (eg, 3 min walk test) and for each group (eg, children) |
|
| Participants are encouraged to complete the tasks as they would do during activities of daily living, at a self-selected pace. Each activity should be completed for ≥3 min unless otherwise stated. simulated sleeping (Get into bed, after 1 min roll over, after 2 min check time on smartphone/wearable, after 3 min get up) writing by hand (sitting) eating/ drinking (sitting) computer use (both typing and mouse use) smartphone use (sitting) simulated video-game (game console controller) standing and talking sweeping (5 m squared) vacuuming (5 m squared) folding laundry (no steps while standing) simulated washing/drying dishes (washing for 1.5 min and drying for 1.5 min) simulated shopping (pick up two shopping bags, walk 10 m, empty bag and put items in cupboard—perform task once) Squat/sit-to-stand (≥20 s) Lunge/split-squat (≥20 s) Low rowing exercise (≥20 s) | Describe in detail the setup for each trial (eg, computer use) to ensure transparency. | |
|
| ≥24 hours free-living period whereby the participant completes activities of daily living in an unconstrained environment (home, work, travelling, etc). A weeklong recording period is optimal to capture interday and weekday-weekend day variation. | Describe in detail the setup to ensure transparency. | |
| 5. Processing |
| The recorded video should be reviewed by >2 independent reviewers in real time and a counting device should be used to record steps. * |
|
|
| Ensure alignment of epochs from criterion measure and index device. | Detail the index device processing as much as possible. | |
|
|
|
| |
|
|
| Describe in detail the synchronisation process allow replication. | |
|
| The act of raising one foot and putting it down in another spot, resulting in the displacement of the centre of mass. | Clearly report the adopted definition. | |
| 6. Statistical analysis |
|
Bland-Altman with LoA Least products regression of the differences against the means MAPE Intraclass correlation with 95% CIs (calculated for the 2×1 min treadmill test only) | Unless a formal sample size analysis has been conducted, binary conclusions about the validity should not be made. Rather, the level of measurement error and bias should be openly reported, taking into consideration all the proposed validation conditions (lab; semi; free-living), to facilitate contextual interpretation. |
If the evaluator decides not to perform the whole validation protocol, there is a minimal protocol to be followed. At a minimum, the validity should be investigated in those with typical gait (basic protocol), across laboratory (step 1) and semifree-living (step 2). Observers should determine which participants are deemed suitable for the different components of the assessment protocols.
*Any device or method which has been demonstrated to possess less than 5% measurement error using the laboratory, semifree-living, and free-living validation protocols detailed below. Additionally, this should be specific to the population of interest. For the case of free-living validation, the INTERLIVE network is aware that video recording and ‘manual’ step counting by two independent evaluators over ≥24 hours recording period (a 1-week recording period (minimum 3–4 complete days) is optimal to capture interday and weekday-weekend variation) is extremely costly in time and resources and therefore likely not feasible for many. Therefore, we feel that this field needs to move forward, developing and validating alternative methods such as insole sensors that after validation and cross-validation could be used as a new and more feasible gold-standard method for free-living validation protocols.
BMI, body mass index; INTERLIVE®, Intelligent Health and Well-Being Network of Physical Activity Assessment; LoA, limits of agreement; MAPE, mean absolute percentage error.
Check-list of items to be considered during the validation protocol development for consumer wearable and smartphone step counters
| Target population assessment | |
| Age | |
| (Children (<12 years) | ◯ |
| Adolescents and healthy adults (13–64 years) | ◯ |
| Older adults (>65 years) | ◯ |
| Sex (equal sample of males and females) | ◯ |
|
Sample size calculation via pilot study OR Sample of convenience (n≥45) | ◯ |
|
| |
| Video camera with multiple observers (≥2) or equivalent* | ◯ |
| Placement to ensure steps are within the field of view† | ◯ |
|
| |
| Placement according to manufacturer’s instructions | ◯ |
|
| |
| Walking: | |
| 3 min walk test | ◯ |
| 3 min zigzag walk test | ◯ |
| 3 x stair test ascent/descent | ◯ |
| 3 min stationary cycling | ◯ |
| 2×1 min steady state treadmill test (reliability) | ◯ |
| Running/fast-walking: | |
| 3 min run test | ◯ |
| 3 min zig-zag run test | ◯ |
| Optional | |
| 1 x incline/ decline walking test | ◯ |
|
| |
| Sedentary activities: | |
| Simulated sleeping | ◯ |
| Writing by hand | ◯ |
| Eating/ drinking | ◯ |
| Household activities: | |
| Standing and talking | ◯ |
| Sweeping | ◯ |
| Vacuuming | ◯ |
| Folding laundry | ◯ |
| Simulated washing/drying dishes | ◯ |
| Simulated shopping | ◯ |
| Exercise-related activities: | |
| Squat/sit-to-stand | ◯ |
| Lunge/split-squat | ◯ |
| Low rowing exercise | ◯ |
|
| |
| Subject’s wear index and criterion device for a minimum of 24 hours, and if a more feasible gold-standard or pseudo-gold-standard method is developed/validated a week assessment would be ideal. | ◯ |
|
| |
| Criterion measure processing | |
| 3. | ◯ |
| Index measure processing | |
| No post processing of the end-user data is allowed | ◯ |
| Epochs for analysis | |
| Note step count at start and end of each trial | ◯ |
| Discard steps which occur between trials | ◯ |
| Record cadence and gait velocity for laboratory and semifree-living | ◯ |
| Index and criterion synchronisation | |
| Participants stand still at start and end of each activity trial while step count and video time (or equivalent criterion count) are noted | ◯ |
| The act of raising one foot and putting it down in another spot, resulting in the displacement of the centre of mass | ◯ |
|
| |
| Mean difference or mean relative difference Bland-Altman LoA | ◯ |
| Least products regression of the differences against the means | ◯ |
| Mean absolute percentage error | ◯ |
| Intraclass correlation coefficient (calculated for the 2×1 min treadmill test only) | ◯ |
*Any device or method which has been demonstrated to possess less than 5% measurement error using the laboratory, semifree-living, and free-living validation protocols detailed below. Additionally, this should be specific to the population of interest. For the case of free-living validation, the INTERLIVE network is aware that video recording and ‘manual’ step counting by two independent evaluators over ≥24 hours recording period (a 1-week recording period (minimum 3–4 complete days) is optimal to capture interday and weekday-weekend variation) is extremely costly in time and resources and therefore likely not feasible for many. Therefore, we feel that this field needs to move forward, developing and validating alternative methods such as insole sensors that after validation and cross-validation could be used as a new and more feasible gold-standard method for free-living validation protocols.
†If an ‘equivalent’ criterion is used, the device should be used as per the methods described within the validation study.
INTERLIVE®, Intelligent Health and Well-Being Network of Physical Activity Assessment; LoA, limits of agreement.
Minimum required reporting sheet for standardised and transparent data sharing
| Target population | Description | Reporting |
| Sampling method | Random, convenient, etc. | |
| Distribution of sex | ♂=n/♀=n | |
| Age | Mean±SD and range (years) | |
| BMI | Mean±SD and range (kg/m²) | |
| Sample size | Provide the no and explain how the sample size was chosen | |
| Health condition (where relevant) | Provide detailed description of cohort with atypical gait characteristics | |
|
| ||
| Video camera with multiple observer (≥2) or equivalent* |
| |
| Placement | Actual placement of camera or equivalent | |
|
| ||
| Placement | Manufacturer’s instructions and actual placement | |
|
| ||
| Type of protocol | Laboratory, semifree-living and/or free-living | |
| Type of trial | List the components of the recommended protocol evaluated (eg, laboratory 3 min walk test, 3 min zig-zag walk test etc) | |
| Duration | Duration of each trial | |
| Contextual factors | Where the testing took place, weather conditions, time of year | |
|
| ||
| Criterion measure processing |
| |
| Index measure processing | Detail methods if end-user data is processed (eg, excluded data etc.) | |
| Epochs for analysis | In seconds (where relevant) | |
| Index and criterion synchronisation | Method used for synchronising step data from index and criterion measures | |
|
| ||
| Mean difference or mean relative difference Bland-Altman LoA | Detail the LoA for each trial | |
| Least products regression of the differences against the means | Detail the presence of any systematic or proportional bias for each trial | |
| MAPE | Detail the MAPE for each trial | |
| Intraclass correlation coefficient | Detail the intra-class correlation coefficient and 95% CI for the 2×1 min treadmill test |
*Any device or method which has been demonstrated to possess less than 5% measurement error using the laboratory, semi free-living, and free-living validation protocols detailed below. Additionally, this should be specific to the population of interest. For the case of free-living validation, the INTERLIVE network is aware that video recording and ‘manual’ step counting by two independent evaluators over ≥24 hours recording period (a 1-week recording period (minimum 3–4 complete days) is optimal to capture interday and weekday-weekend variation) is extremely costly in time and resources and therefore likely not feasible for many. Therefore, we feel that this field needs to move forward, developing and validating alternative methods such as insole sensors that after validation and cross-validation could be used as a new and more feasible gold-standard method for free-living validation protocols.
BMI, body mass index; INTERLIVE®, Intelligent Health and Well-Being Network of Physical Activity Assessment; LoA, limits of agreement; MAPE, mean absolute percentage error.;