Literature DB >> 29942628

Validity of wrist-worn consumer products to measure heart rate and energy expenditure.

Robert S Thiebaud1, Merrill D Funk2, Jacelyn C Patton1, Brook L Massey1, Terri E Shay1, Martin G Schmidt1, Nicolas Giovannitti1.   

Abstract

INTRODUCTION: The ability to monitor physical activity throughout the day and during various activities continues to improve with the development of wrist-worn monitors. However, the accuracy of wrist-worn monitors to measure both heart rate and energy expenditure during physical activity is still unclear. The purpose of this study was to determine the accuracy of several popular wrist-worn monitors at measuring heart rate and energy expenditure.
METHODS: Participants wore the TomTom Cardio, Microsoft Band and Fitbit Surge on randomly assigned locations on each wrist. The maximum number of monitors per wrist was two. The criteria used for heart rate and energy expenditure were a three-lead electrocardiogram and indirect calorimetry using a metabolic cart. Participants exercised on a treadmill at 3.2, 4.8, 6.4, 8 and 9.7 km/h for 3 minutes at each speed, with no rest between speeds. Heart rate and energy expenditure were manually recorded every minute throughout the protocol.
RESULTS: Mean absolute percentage error for heart rate varied from 2.17 to 8.06% for the Fitbit Surge, from 1.01 to 7.49% for the TomTom Cardio and from 1.31 to 7.37% for the Microsoft Band. The mean absolute percentage error for energy expenditure varied from 25.4 to 61.8% for the Fitbit Surge, from 0.4 to 26.6% for the TomTom Cardio and from 1.8 to 9.4% for the Microsoft Band.
CONCLUSION: Data from these devices may be useful in obtaining an estimate of heart rate for everyday activities and general exercise, but energy expenditure from these devices may be significantly over- or underestimated.

Entities:  

Keywords:  Fitbit; Photoplethysmography; activity monitors; fitness trackers; physical activity

Year:  2018        PMID: 29942628      PMCID: PMC6001222          DOI: 10.1177/2055207618770322

Source DB:  PubMed          Journal:  Digit Health        ISSN: 2055-2076


Introduction

The development of wearable technology to track both heart rate and energy expenditure has improved over the past few years due to the use of photoplethysmography. Data from these devices may facilitate healthy behaviors such as increased physical activity.[1,2] However, if the information collected from wearable technology is not accurate, the usefulness of these devices is limited. Photoplethysmography monitors heart rate by using light emitting diodes and a photo diode.[3] Shorter wavelengths (green light) are emitted into the skin to help minimize motion artifacts, but it does not penetrate skin depth as well as longer wavelengths.[3] Despite using shorter wavelength light, motion artifacts are still present with this technology, so different algorithms are used to further decrease motion artifacts.[3] If heart rates are monitored accurately, they could be used to track exercise intensity and improve the estimation of energy expenditure. For proprietary reasons, many technology companies do not reveal how they validate their technology or which variables they use to estimate energy expenditure, so it is difficult for consumers to know how valid different devices are and how they compare with other devices. Few studies have investigated the validity of wrist-worn devices that use photoplethysmography to measure both heart rate and energy expenditure.[4-7] Some studies have exclusively examined heart rate and found sufficient accuracy during treadmill walking and running,[8,9] while others found less accurate readings.[10,11] Many different devices and protocols have been used in previous studies, therefore it is important to build sufficient evidence to provide consumers with valuable information on the validity of popular devices. The purpose of this study was to determine the validity of three common wrist-worn consumer monitors at measuring heart rate and energy expenditure during walking, jogging and running. We hypothesized that the devices would be more accurate in estimating heart rate compared to energy expenditure.

Methods

Participants

Twenty recreationally active males and two females participated in this study (mean (SD): age = 22 (3) years, height = 1.73 (0.09) m and weight = 75.9 (10.2) kg). Participants were told about the nature, purpose, details and any risks associated with the experiment, and each participant gave written informed consent. The University’s Institutional Review Board approved the protocol of the research study.

Exercise protocol

Wrist-worn monitors were randomly placed on subjects’ wrists with a maximum of two monitors on one wrist. Devices were randomly placed on the wrists to avoid any bias that may be produced by placing devices in the same place each time. A possibility of less accurate readings may occur with more than one monitor on a wrist, but monitors were placed on the wrist based on the manufacturer’s instructions. Other studies have also used similar procedures to test the validity of these devices.[6-8] Participants exercised on a treadmill at 3.2, 4.8, 6.4, 8 and 9.7 km/h for 3 minutes at each speed with no rest between speeds. These speeds were chosen to reflect various intensities that the general healthy population may experience, and these speeds have been used in other studies.[8,11] The duration of 3 minutes was chosen to allow heart rate to reach steady state at each intensity. Other studies have also used 3–5-minute stages.[7,8,11] Heart rate was measured using electrocardiography (three-lead electrocardiogram (ECG), Quinton® Q-Stress, version 4.5, Cardiac Science, Bothell, WA, USA). Energy expenditure was measured using a metabolic cart system (Trueone 2400® metabolic measurement system, Parvomedics, Sandy, UT, USA).

Consumer wrist-worn monitors

Fitbit Surges (Fitbit Inc., San Francisco, CA, USA), Microsoft Bands (Microsoft Corp., Redmond, WA, USA), and TomTom Cardios (TomTom Inc., Burlington, MA, USA) were placed on the wrist and set to “treadmill” mode if available and according to the manufacturers’ recommendations.

Statistical analysis

The average heart rate and energy expenditure recorded during the 3 minutes at each speed were used for analysis. Pearson correlations measured associations between criterion variables and wrist-worn monitors. Spearman’s rank correlation coefficients were used for any variables that were not normally distributed. Statistical significance was set at an α level of < 0.01. The criterion measure for heart rate was the ECG and for energy expenditure was the metabolic cart. Mean bias was calculated by subtracting the wrist-worn device from the criterion and 95% limits of agreement were also calculated. For equivalence testing, 95% precision was assumed if the wrist-worn monitors’ 90% confidence intervals were within an equivalence zone that was between ±10% of the criterion mean for energy expenditure and ±5% of the criterion mean for heart rate.12 Mean absolute percentage error (MAPE) ((monitor − criterion)/criterion ×100%) provided a general measurement error for monitors.

Results

Heart rate

Wrist-worn monitors overestimated heart rates compared to the criterion for all speeds except for the Fitbit Surge, which underestimated heart rate at 8 and 9.7 km/h (Table 1). The MAPE varied from 2.17 to 8.06% for the Fitbit Surge, from 1.01 to 7.49% for the TomTom Cardio and from 1.31 to 7.37% for the Microsoft Band (Table 1). The equivalence zones for heart rate are found in figure 1.
Table 1.

Heart rate.

Heart rate (bpm)
Fitbit SurgeTomTom CardioMicrosoft BandECG
3.2 km/h
 Mean (SD)89 (11)85 (10)89 (10)84 (10)
 Mean bias (SD)5 (19)1 (9)6 (8)
 LoA70–10876–9473–106
 Correlation0.57[a]0.89[a]0.62[a]
 MAPE (%)6.5 (13.13)1.01 (1.38)7.37 (10.75)
 Equivalence zones85–9181–87[b]85–9176–92[c], 80–88[d]
4.8 km/h
 Mean (SD)100 (13)98 (14)97 (8)91 (9)
 Mean bias (SD)7 (10)7 (12)5 (10)
 LoA80–12075–12077–116
 Correlation0.530.55[a]0.35
 MAPE (%)8.06 (12.04)7.49 (13.41)6.59 (11.86)
 Equivalence zones95–10392–10094–9982–100[c], 87–96[d]
6.4 km/h
 Mean (SD)114 (11)116 (13)114 (10)112 (9)
 Mean bias (SD)2 (8)5 (9)3 (8)
 LoA98–130100–13398–130
 Correlation0.69[a]0.76[a]0.63[a]
 MAPE (%)2.17 (7.68)4.5 (7.45)2.46 (7.07)
 Equivalence zones110–116[b]111–119109–115[b]100–122[c], 106–117[d]
8 km/h
 Mean (SD)132 (13)141 (12)141 (11)135 (10)
 Mean bias (SD)−4 (8)6 (11)5 (12)
 LoA117–147120–162118–164
 Correlation0.82[a]0.520.37
 MAPE (%)−2.77 (6.04)4.38 (8.42)4.19 (9.15)
 Equivalence zones127–135137–144137–143[b]122–150[c], 129–143[d]
9.7 km/h
 Mean (SD)150 (15)157 (13)156 (13)155 (13)
 Mean bias (SD)−5 (9)2 (6)2 (13)
 LoA133–167146–167132–181
 Correlation0.84[a]0.91[a]0.53
 MAPE (%)−3.35 (5.51)1.11 (5.52)1.31 (8.05)
 Equivalence zones145–153153–160[b]153–160[b]140–171[c], 147–163[d]

Values are mean (SD).

ECG: electrocardiogram; LoA: 95% limits of agreement; MAPE: mean absolute percentage error.

p-value < 0.01.

Indicates that values are within the 5% equivalence zone of the electrocardiogram.

Indicates 10% equivalence area.

Indicates 5% equivalence area.

Figure 1.

Heart rate equivalence testing to evaluate agreement between devices and electrocardiogram (ECG) at 3.2 and 9.7 km/h. Dashed lines represent the 5% equivalence zones for ECG and solid lines represent 90% confidence intervals for different devices.

Heart rate. Values are mean (SD). ECG: electrocardiogram; LoA: 95% limits of agreement; MAPE: mean absolute percentage error. p-value < 0.01. Indicates that values are within the 5% equivalence zone of the electrocardiogram. Indicates 10% equivalence area. Indicates 5% equivalence area.

Energy expenditure

The Fitbit Surge overestimated energy expenditure at each speed, while the TomTom Cardio overestimated energy expenditure at 3.2, 4.8 and 6.4 km/h and underestimated energy expenditure at 8 km/h and 9.7 km/h. The Microsoft Band underestimated energy expenditure at 3.2, 4.8 and 6.4 km/h and overestimated energy expenditure at 8 and 9.7 km/h (Table 2). The MAPE varied from 25.4 to 61.8% for the Fitbit Surge, from 0.4 to 26.6% for the TomTom Cardio and from 1.8 to 9.4% for the Microsoft Band (Table 2). The equivalence zones for energy expenditure are found in figure 2.
Table 2.

Energy expenditure.

Energy expenditure (kcal)
Fitbit SurgeTomTom CardioMicrosoft BandMetCart
3.2 km/h
 Mean (SD)8.7 (2.3)6.2 (2.2)5.4 (1.6)6.1 (1.0)
 Mean bias (SD)2.7 (2.0)0.1 (1.8)−0.7 (1.7)
 LoA4.9–12.52.7–9.62–8.8
 Correlation0.520.57[a]0.19
 MAPE (%)44.5 (33.0)0.4 (31.6)−9.4 (27.9)
 Equivalence zone7.9–9.25.4–6.74.8–5.75.5–6.7
4.8 km/h
 Mean (SD)27.8 (5.4)19.3 (5.2)16.2 (4.0)17.1 (2.9)
 Mean bias (SD)10.3 (4.4)2.2 (3.7)−0.9 (4.3)
 LoA19.2–36.411.9–26.67.8–24.7
 Correlation0.530.70[a]0.24
 MAPE (%)61.8 (27.5)12.0 (22.2)−3.6 (24.1)
 Equivalence zones25.9–29.017.5–20.414.8–17.115.4–18.8
6.4 km/h
 Mean (SD)51.0 (8.1)34.4 (9.1)32.1 (6.2)33.4 (5.7)
 Mean bias (SD)17 (6.3)1.0 (6.6)−1.3 (7.7)
 LoA38.6–63.417.8–47.447.2
 Correlation0.55[a]0.66[a]0.15
 MAPE (%)52.7 (21.8)2.5 (20.3)–1.8 (22.3)
 Equivalence zones48.2–52.731.1–36.4[b]29.9–33.430.1–36.7
8 km/h
 Mean (SD)80 (11.6)49.5 (12.7)59.7 (10.5)57.9 (10.0)
 Mean bias (SD)20.7 (9.8)–9.5 (9.8)1.1 (12.8)
 LoA60.8–99.229.7–68.434.6–84.7
 Correlation0.530.60[a]0.20
 MAPE (%)37.0 (19.1)–16.4 (16.8)4.0 (21.6)
 Equivalence zones75.8–82.545.0–52.256.3–62.3[b]52.1–63.7
9.7 km/h
 Mean (SD)112.7 (16.1)66.7 (17.0)96.1 (16.4)90.7 (15.1)
 Mean bias (SD)21.7 (13.3)–24.0 (13.4)5.4 (17.4)
 LoA86.6–138.740.5–92.961.9–130.3
 Correlation0.60[a]0.56[a]0.37
 MAPE (%)25.4 (15.7)–26.6 (15.0)7.5 (18.6)
 Equivalence zones106.9–116.260.7–70.390.3–99.6[b]81.6–99.8

Values are mean (SD).

MetCart: metabolic cart; LoA: 95% limits of agreement; MAPE: mean absolute percentage error.

p-value < 0.01.

Indicates values are within the equivalence zone of the metabolic cart.

Figure 2.

Equivalence testing for total energy expenditure at 9.7 km/h to evaluate agreement between devices and metabolic cart. Dashed lines represent the 10% equivalence zones for the metabolic cart and solid lines represent 90% confidence intervals for different devices.

Energy expenditure. Values are mean (SD). MetCart: metabolic cart; LoA: 95% limits of agreement; MAPE: mean absolute percentage error. p-value < 0.01. Indicates values are within the equivalence zone of the metabolic cart.

Discussion

The main findings from this study were that wrist-worn monitors produce more accurate readings for heart rates compared to energy expenditure. However, the accuracy of the devices may be influenced by the intensity. Heart rate equivalence testing to evaluate agreement between devices and electrocardiogram (ECG) at 3.2 and 9.7 km/h. Dashed lines represent the 5% equivalence zones for ECG and solid lines represent 90% confidence intervals for different devices. Equivalence testing for total energy expenditure at 9.7 km/h to evaluate agreement between devices and metabolic cart. Dashed lines represent the 10% equivalence zones for the metabolic cart and solid lines represent 90% confidence intervals for different devices. When comparing the accuracy of heart rates from the wrist-worn monitors to the ECG readings, the highest correlations were at the fastest speeds for the Fitbit Surge (r = 0.84) and TomTom Cardio (r = 0.91), while the highest correlation for the Microsoft Band was at 6.4 km/h (r = 0.63). Stahl et al.[8] performed a similar study to ours and found higher correlations than we did for the TomTom Cardio (r = 0.959) and Microsoft Band (r = 0.956), although they used the average heart rates throughout the entire exercise protocol to determine their correlations. In another study, Gillinov et al.[10] found concordance correlations of 0.88 between the TomTom Surge and ECG leads. Part of the reason for the lower correlations between heart rates in the current study may be due to a smaller sample size, a different criterion measure used (ECG vs. polar heart rate monitor) and that we correlated heart rates at each speed instead of an overall heart rate. When examining the MAPE, other studies have found similar results to the current study. For example, Stahl et al.[8] found that MAPE varied from 0.97 to 5.71% for the TomTom Cardio and from 3.06 to 8.39% for the Microsoft Band, while Gillinov et al.[10] found MAPE of 6.2% for the TomTom Cardio. Similarly, we found that MAPE varied from 1.01 to 7.49% for the TomTom Cardio and from 1.31 to 7.37% for the Microsoft Band, with the lower MAPE found at the faster speeds. Shcherbina et al.[6] also found a larger percent error during walking compared to running for the Fitbit Surge and Microsoft Band. Overall, it appears that the faster the heart rate due to increasing speed, the more accurate the devices become. Although heart rate was fairly accurate using these monitors, energy expenditure varied much more and did not necessarily correlate with heart rate. For example, when examining the total energy expenditure, MAPE was greater than 20% for the Fitbit Surge and TomTom Cardio, while that of the Microsoft Band was only 7.5%. This confirms other studies that used similar wrist-worn monitors in that they do not accurately measure energy expenditure.[5-7] Though it is unclear how each device calculates the energy expenditure, it does not appear that monitoring heart rate concurrently creates an accurate measure of energy expenditure. One factor that could have impacted the results was the use of a treadmill mode. Two of the devices had a treadmill mode while one device did not. This may have limited some of the ability to accurately measure heart rate and energy expenditure in the device without a treadmill mode. The impact of this on the results is unclear and future studies should determine how using different modes influences the measurement of heart rate and energy expenditure. In addition, because participants only exercised through the multiple speeds once, the reliability of the measurements is unclear. One limitation of the study was the small sample size. A smaller sample size can lead to a lack of uniformity and can decrease statistical power. Despite the small sample size and large variation in the current study, it appears that the results follow a pattern similar to those of other studies investigating these devices.[8,10]

Conclusions

Wrist-worn monitors report more accurate heart rates than energy expenditure during treadmill exercise. However, the accuracy of the devices for measuring heart rate may not yet be high enough for use in a research setting or for athletes who use heart rate measurement to reach precise heart rates for training purposes. Data from these devices may be useful in obtaining an estimate of heart rate for everyday activities and general exercise, but caution should be taken when using energy expenditure from these devices as the calories may be significantly over- or underestimated.
  12 in total

1.  Study of Artifact-Resistive Technology Based on a Novel Dual Photoplethysmography Method for Wearable Pulse Rate Monitors.

Authors:  Congcong Zhou; Jingjie Feng; Jun Hu; Xuesong Ye
Journal:  J Med Syst       Date:  2015-12-08       Impact factor: 4.460

2.  Wearable devices as facilitators, not drivers, of health behavior change.

Authors:  Mitesh S Patel; David A Asch; Kevin G Volpp
Journal:  JAMA       Date:  2015-02-03       Impact factor: 56.272

3.  Accuracy of Wrist-Worn Heart Rate Monitors.

Authors:  Robert Wang; Gordon Blackburn; Milind Desai; Dermot Phelan; Lauren Gillinov; Penny Houghtaling; Marc Gillinov
Journal:  JAMA Cardiol       Date:  2017-01-01       Impact factor: 14.676

4.  Variable Accuracy of Wearable Heart Rate Monitors during Aerobic Exercise.

Authors:  Stephen Gillinov; Muhammad Etiwy; Robert Wang; Gordon Blackburn; Dermot Phelan; A Marc Gillinov; Penny Houghtaling; Hoda Javadikasgari; Milind Y Desai
Journal:  Med Sci Sports Exerc       Date:  2017-08       Impact factor: 5.411

Review 5.  Using an electronic activity monitor system as an intervention modality: A systematic review.

Authors:  Zakkoyya H Lewis; Elizabeth J Lyons; Jessica M Jarvis; Jacques Baillargeon
Journal:  BMC Public Health       Date:  2015-06-24       Impact factor: 3.295

6.  Accuracy of Heart Rate Watches: Implications for Weight Management.

Authors:  Matthew P Wallen; Sjaan R Gomersall; Shelley E Keating; Ulrik Wisløff; Jeff S Coombes
Journal:  PLoS One       Date:  2016-05-27       Impact factor: 3.240

7.  Estimating Heart Rate, Energy Expenditure, and Physical Performance With a Wrist Photoplethysmographic Device During Running.

Authors:  Jakub Parak; Maria Uuskoski; Jan Machek; Ilkka Korhonen
Journal:  JMIR Mhealth Uhealth       Date:  2017-07-25       Impact factor: 4.773

8.  Accuracy in Wrist-Worn, Sensor-Based Measurements of Heart Rate and Energy Expenditure in a Diverse Cohort.

Authors:  Anna Shcherbina; C Mikael Mattsson; Daryl Waggott; Heidi Salisbury; Jeffrey W Christle; Trevor Hastie; Matthew T Wheeler; Euan A Ashley
Journal:  J Pers Med       Date:  2017-05-24

9.  Estimating Accuracy at Exercise Intensities: A Comparative Study of Self-Monitoring Heart Rate and Physical Activity Wearable Devices.

Authors:  Erin E Dooley; Natalie M Golaszewski; John B Bartholomew
Journal:  JMIR Mhealth Uhealth       Date:  2017-03-16       Impact factor: 4.773

10.  Assessment of laboratory and daily energy expenditure estimates from consumer multi-sensor physical activity monitors.

Authors:  Enhad A Chowdhury; Max J Western; Thomas E Nightingale; Oliver J Peacock; Dylan Thompson
Journal:  PLoS One       Date:  2017-02-24       Impact factor: 3.240

View more
  19 in total

1.  Patient generated health data use in clinical practice: A systematic review.

Authors:  George Demiris; Sarah J Iribarren; Katherine Sward; Solim Lee; Rumei Yang
Journal:  Nurs Outlook       Date:  2019-04-26       Impact factor: 3.250

Review 2.  Moving beyond self-report data collection in the natural environment: A review of the past and future directions for ambulatory assessment in eating disorders.

Authors:  Kathryn E Smith; Tyler B Mason; Adrienne Juarascio; Lauren M Schaefer; Ross D Crosby; Scott G Engel; Stephen A Wonderlich
Journal:  Int J Eat Disord       Date:  2019-07-16       Impact factor: 4.861

3.  Estimation of Heart Rate and Energy Expenditure Using a Smart Bracelet during Different Exercise Intensities: A Reliability and Validity Study.

Authors:  Yihui Cai; Zi Wang; Wanxia Zhang; Weiya Kong; Jiayao Jiang; Ruobing Zhao; Dongxue Wang; Leyi Feng; Guoxin Ni
Journal:  Sensors (Basel)       Date:  2022-06-21       Impact factor: 3.847

Review 4.  Wearable activity trackers-advanced technology or advanced marketing?

Authors:  Ren-Jay Shei; Ian G Holder; Alicia S Oumsang; Brittni A Paris; Hunter L Paris
Journal:  Eur J Appl Physiol       Date:  2022-04-21       Impact factor: 3.346

Review 5.  Accuracy and Precision of Energy Expenditure, Heart Rate, and Steps Measured by Combined-Sensing Fitbits Against Reference Measures: Systematic Review and Meta-analysis.

Authors:  Guillaume Chevance; Natalie M Golaszewski; Elizabeth Tipton; Eric B Hekler; Matthew Buman; Gregory J Welk; Kevin Patrick; Job G Godino
Journal:  JMIR Mhealth Uhealth       Date:  2022-04-13       Impact factor: 4.947

Review 6.  Smart Shirts for Monitoring Physiological Parameters: Scoping Review.

Authors:  Hamzeh Khundaqji; Wayne Hing; James Furness; Mike Climstein
Journal:  JMIR Mhealth Uhealth       Date:  2020-05-27       Impact factor: 4.773

7.  Quantitative Contact-Less Estimation of Energy Expenditure from Video and 3D Imagery.

Authors:  Gregor Koporec; Goran Vučković; Radoje Milić; Janez Perš
Journal:  Sensors (Basel)       Date:  2018-07-26       Impact factor: 3.576

8.  Accuracy of consumer-level and research-grade activity trackers in ambulatory settings in older adults.

Authors:  Salvatore Tedesco; Marco Sica; Andrea Ancillao; Suzanne Timmons; John Barton; Brendan O'Flynn
Journal:  PLoS One       Date:  2019-05-21       Impact factor: 3.240

9.  Heart Rate Measures From Wrist-Worn Activity Trackers in a Laboratory and Free-Living Setting: Validation Study.

Authors:  Andre Matthias Müller; Nan Xin Wang; Jiali Yao; Chuen Seng Tan; Ivan Cherh Chiet Low; Nicole Lim; Jeremy Tan; Agnes Tan; Falk Müller-Riemenschneider
Journal:  JMIR Mhealth Uhealth       Date:  2019-10-02       Impact factor: 4.773

10.  Validity of simplified, calibration-less exercise intensity measurement using resting heart rate during sleep: a method-comparison study with respiratory gas analysis.

Authors:  Hirotaka Matsuura; Masahiko Mukaino; Yohei Otaka; Hitoshi Kagaya; Yasushi Aoshima; Takuya Suzuki; Ayaka Inukai; Emi Hattori; Takayuki Ogasawara; Eiichi Saitoh
Journal:  BMC Sports Sci Med Rehabil       Date:  2019-11-04
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.