PURPOSE: Statistical equivalence testing is more appropriate than conventional tests of difference to assess the validity of physical activity (PA) measures. This article presents the underlying principles of equivalence testing and gives three examples from PA and fitness assessment research. METHODS: The three examples illustrate different uses of equivalence tests. Example 1 uses PA data to evaluate an activity monitor's equivalence to a known criterion. Example 2 illustrates the equivalence of two field-based measures of physical fitness with no known reference method. Example 3 uses regression to evaluate an activity monitor's equivalence across a suite of 23 activities. RESULTS: The examples illustrate the appropriate reporting and interpretation of results from equivalence tests. In the first example, the mean criterion measure is significantly within ±15% of the mean PA monitor. The mean difference is 0.18 METs and the 90% confidence interval of -0.15 to 0.52 is inside the equivalence region of -0.65 to 0.65. In the second example, we chose to define equivalence for these two measures as a ratio of mean values between 0.98 and 1.02. The estimated ratio of mean V˙O2 values is 0.99, which is significantly (P = 0.007) inside the equivalence region. In the third example, the PA monitor is not equivalent to the criterion across the suite of activities. The estimated regression intercept and slope are -1.23 and 1.06. Neither confidence interval is within the suggested regression equivalence regions. CONCLUSIONS: When the study goal is to show similarity between methods, equivalence testing is more appropriate than traditional statistical tests of differences (e.g., ANOVA and t-tests).
PURPOSE: Statistical equivalence testing is more appropriate than conventional tests of difference to assess the validity of physical activity (PA) measures. This article presents the underlying principles of equivalence testing and gives three examples from PA and fitness assessment research. METHODS: The three examples illustrate different uses of equivalence tests. Example 1 uses PA data to evaluate an activity monitor's equivalence to a known criterion. Example 2 illustrates the equivalence of two field-based measures of physical fitness with no known reference method. Example 3 uses regression to evaluate an activity monitor's equivalence across a suite of 23 activities. RESULTS: The examples illustrate the appropriate reporting and interpretation of results from equivalence tests. In the first example, the mean criterion measure is significantly within ±15% of the mean PA monitor. The mean difference is 0.18 METs and the 90% confidence interval of -0.15 to 0.52 is inside the equivalence region of -0.65 to 0.65. In the second example, we chose to define equivalence for these two measures as a ratio of mean values between 0.98 and 1.02. The estimated ratio of mean V˙O2 values is 0.99, which is significantly (P = 0.007) inside the equivalence region. In the third example, the PA monitor is not equivalent to the criterion across the suite of activities. The estimated regression intercept and slope are -1.23 and 1.06. Neither confidence interval is within the suggested regression equivalence regions. CONCLUSIONS: When the study goal is to show similarity between methods, equivalence testing is more appropriate than traditional statistical tests of differences (e.g., ANOVA and t-tests).
Authors: Keith Brazendale; Michael W Beets; Daniel B Bornstein; Justin B Moore; Russell R Pate; Robert G Weaver; Ryan S Falck; Jessica L Chandler; Lars B Andersen; Sigmund A Anderssen; Greet Cardon; Ashley Cooper; Rachel Davey; Karsten Froberg; Pedro C Hallal; Kathleen F Janz; Katarzyna Kordas; Susi Kriemler; Jardena J Puder; John J Reilly; Jo Salmon; Luis B Sardinha; Anna Timperio; Esther M F van Sluijs Journal: J Sci Med Sport Date: 2015-02-23 Impact factor: 4.319
Authors: Scott J Strath; Leonard A Kaminsky; Barbara E Ainsworth; Ulf Ekelund; Patty S Freedson; Rebecca A Gary; Caroline R Richardson; Derek T Smith; Ann M Swartz Journal: Circulation Date: 2013-10-14 Impact factor: 29.690
Authors: Youngwon Kim; Scott E Crouter; Jung-Min Lee; Phillip M Dixon; Glenn A Gaesser; Gregory J Welk Journal: J Sci Med Sport Date: 2014-10-18 Impact factor: 4.319
Authors: Krista S Leonard; Abigail M Pauley; Emily E Hohman; Penghong Guo; Daniel E Rivera; Jennifer S Savage; Matthew P Buman; Danielle Symons Downs Journal: J Sci Med Sport Date: 2020-08-11 Impact factor: 4.319
Authors: Kirti Magudia; Christopher P Bridge; Camden P Bay; Ana Babic; Florian J Fintelmann; Fabian M Troschel; Nityanand Miskin; William C Wrobel; Lauren K Brais; Katherine P Andriole; Brian M Wolpin; Michael H Rosenthal Journal: Radiology Date: 2020-11-24 Impact factor: 11.105
Authors: J B Courtney; K Nuss; K Lyden; K K Harrall; D H Glueck; A Villalobos; R F Hamman; J R Hebert; T G Hurley; J Leiferman; K Li; K Alaimo; J S Litt Journal: Meas Phys Educ Exerc Sci Date: 2020-12-28
Authors: Samantha F Ehrlich; Jill M Maples; Cristina S Barroso; Kathleen C Brown; David R Bassett; Nikki B Zite; Kimberly B Fortner Journal: BMC Pregnancy Childbirth Date: 2021-06-08 Impact factor: 3.007
Authors: Sunku Kwon; Neng Wan; Ryan D Burns; Timothy A Brusseau; Youngwon Kim; Santosh Kumar; Emre Ertin; David W Wetter; Cho Y Lam; Ming Wen; Wonwoo Byun Journal: Sensors (Basel) Date: 2021-02-18 Impact factor: 3.576