| Literature DB >> 29194358 |
Jordan A Carlson1, Bo Liu2, James F Sallis3, Jacqueline Kerr4, J Aaron Hipp5, Vincent S Staggs6, Amy Papa7, Kelsey Dean8, Nuno M Vasconcelos9.
Abstract
Technological advances provide opportunities for automating direct observations of physical activity, which allow for continuous monitoring and feedback. This pilot study evaluated the initial validity of computer vision algorithms for ecological assessment of physical activity. The sample comprised 6630 seconds per camera (three cameras in total) of video capturing up to nine participants engaged in sitting, standing, walking, and jogging in an open outdoor space while wearing accelerometers. Computer vision algorithms were developed to assess the number and proportion of people in sedentary, light, moderate, and vigorous activity, and group-based metabolic equivalents of tasks (MET)-minutes. Means and standard deviations (SD) of bias/difference values, and intraclass correlation coefficients (ICC) assessed the criterion validity compared to accelerometry separately for each camera. The number and proportion of participants sedentary and in moderate-to-vigorous physical activity (MVPA) had small biases (within 20% of the criterion mean) and the ICCs were excellent (0.82-0.98). Total MET-minutes were slightly underestimated by 9.3-17.1% and the ICCs were good (0.68-0.79). The standard deviations of the bias estimates were moderate-to-large relative to the means. The computer vision algorithms appeared to have acceptable sample-level validity (i.e., across a sample of time intervals) and are promising for automated ecological assessment of activity in open outdoor settings, but further development and testing is needed before such tools can be used in a diverse range of settings.Entities:
Keywords: accelerometry; exercise; measurement; parks; public health
Mesh:
Year: 2017 PMID: 29194358 PMCID: PMC5750905 DOI: 10.3390/ijerph14121487
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Activity location as viewed from two video recording angles.
Sample activity session protocol.
| Minute | Combination Number a | Total Number of People | Blocks | ||
|---|---|---|---|---|---|
| Group A | Group B | Group C | |||
| 0 | 1 | 3 | Three people sitting individually | ||
| 1 | 1 | 3 | |||
| 2 | 1 | 3 | |||
| 3 | 1 | 3 | |||
| 4 | 1 | 3 | |||
| 5 | 1 | 3 | |||
| 6 | 1 | 3 | |||
| 7 | 2 | 2 | Two people standing together | ||
| 8 | 2 | 2 | |||
| 9 | 2 | 2 | |||
| 10 | 3 | 1 | One person walking | ||
| 11 | 3 | 1 | |||
| 12 | 3 | 1 | |||
| 13 | 10 | 5 | Two people jogging individually | Two people walking together | |
| 14 | 10 | 5 | |||
| 15 | 10 | 5 | |||
| 16 | 14 | 5 | Two people standing individually | ||
| 17 | 9 | 6 | Two people standing individually | ||
| 18 | 9 | 6 | |||
| 19 | 9 | 6 | |||
| 20 | 5 | 8 | Four people sitting individually | ||
| 21 | 5 | 8 | |||
| 22 | 5 | 8 | |||
| 23 | 5 | 8 | Two people sitting individually | ||
| 24 | 7 | 8 | Two people jogging together | ||
| 25 | 7 | 8 | |||
| 26 | 9 | 3 | One person standing | ||
| 27 | 9 | 3 | |||
| 28 | 9 | 3 | |||
| 29 | 8 | 3 | Two people walking together | ||
| 30 | 8 | 3 | |||
| 31 | 8 | 3 | |||
| 32 | 8 | 3 | |||
| 33 | 8 | 3 | |||
| 34 | 8 | 7 | Three people sitting individually | Two people standing together | |
| 35 | 5 | 5 | |||
| 36 | 7 | 4 | One person jogging | ||
| 37 | 7 | 4 | |||
| 38 | 6 | 4 | One person walking | ||
| 39 | 6 | 4 | |||
a The activities were sitting, standing, walking, and jogging; 1–3 activities were engaged in simultaneously (by different groups of participants), resulting in 14 possible combinations of activities across participants.
Distribution of activity block combinations in the full dataset.
| Activity Block Combinations | Number of Simultaneous Activities | Number of Seconds of Data |
|---|---|---|
| Sitting only | 1 | 420 |
| Standing only | 1 | 540 |
| Walking only | 1 | 840 |
| Jogging only | 1 | 720 |
| Sitting and standing | 2 | 720 |
| Sitting and walking | 2 | 720 |
| Sitting and jogging | 2 | 780 |
| Standing and walking | 2 | 660 |
| Standing and jogging | 2 | 480 |
| Walking and jogging | 2 | 420 |
| Sitting, standing, and walking | 3 | 240 |
| Sitting, standing, and jogging | 3 | 240 |
| Sitting, walking, and jogging | 3 | 240 |
| Standing, walking, and jogging | 3 | 180 |
| Total | - | 7200 a |
a The final dataset included 6630 s of video recording due to missing video data.
Characteristics of testing dataset (N = 2800 s).
| Characteristics | Number (%) |
|---|---|
| Seconds with 1–4 people in scene | 1322 (47.2%) |
| Seconds with 5–9 people in scene | 1478 (52.8%) |
| Seconds with ≥1 person sedentary | 2082 (74.4%) |
| Seconds with ≥1 person in light activity | 488 (17.4%) |
| Seconds with ≥1 person in moderate activity | 1594 (56.9%) |
| Seconds with ≥1 person in vigorous activity | 1509 (53.9%) |
| Seconds with ≥1 person in MVPA | 2454 (87.6%) |
| Seconds with <3 MET-seconds per person | 1358 (48.5%) |
| Seconds with ≥3 MET-seconds per person | 1442 (51.5%) |
MET = metabolic equivalents of tasks; MVPA = moderate to vigorous physical activity.
Agreement and associations between the video-based algorithm (test measure) and accelerometry (criterion measure) in the aggregated minute-level dataset (N = 48 min per camera).
| Variables | Criterion | Camera 1 | Camera 2 | Camera 3 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mean (SD) | Bias (SD) | %Bias (SD) | ICC | Bias (SD) | %Bias (SD) | ICC | Bias (SD) | %Bias (SD) | ICC | |
| Activity volume variables | ||||||||||
| Number people in scene | 4.34 (2.09) | 0.15 (0.74) | 3.5% (17.1) | 0.94 | 0.08 (0.79) | 1.8% (18.2) | 0.93 | 0.18 (0.77) | 4.1% (17.7) | 0.94 |
| Number of people sedentary | 1.90 (1.89) | 0 (0.37) | 0% (19.5) | 0.98 | 0.01 (0.65) | 0.5% (34.2) | 0.94 | 0.11 (0.51) | 5.8% (26.8) | 0.96 |
| Number of people light | 0.20 (0.16) | −0.06 (0.27) | −30.0% (135.0) | 0.24 | −0.08 (0.29) | −40.0% (145.0) | 0.14 | −0.07 (0.27) | −35.0% (135.0) | 0.21 |
| Number of people moderate | 1.08 (1.02) | −0.12 (0.58) | −11.1% (53.7) | 0.83 | −0.12 (0.52) | −11.1% (48.1) | 0.85 | −0.09 (0.59) | −8.3% (54.6) | 0.83 |
| Number of people vigorous | 1.16 (1.70) | 0.32 (0.77) | 27.6% (66.4) | 0.90 | 0.28 (0.59) | 24.1% (50.9) | 0.93 | 0.22 (0.52) | 19.0% (44.8) | 0.95 |
| Number of people in MVPA | 2.24 (1.73) | 0.20 (0.74) | 8.9% (33.0) | 0.91 | 0.15 (0.57) | 6.7% (25.4) | 0.95 | 0.13 (0.61) | 5.8% (27.2) | 0.94 |
| Total MET-minutes in scene | 18.0 (14.1) | −1.8 (7.7) | −10.0% (42.8) | 0.79 | −3.1 (8.9) | −17.1% (49.4) | 0.68 | −1.7 (8.5) | −9.3% (47.2) | 0.72 |
| Activity distribution variables | ||||||||||
| Proportion of people sedentary | 0.40 (0.28) | −0.01 (0.08) | −2.5% (20.0) | 0.96 | 0 (0.12) | 0% (30.0) | 0.91 | 0.01 (0.08) | 2.5% (20.0) | 0.96 |
| Proportion of people light | 0.06 (0.05) | −0.02 (0.08) | −33.3% (133.3) | 0.33 | −0.02 (0.07) | −33.3% (116.7) | 0.32 | −0.02 (0.08) | −33.3% (133.3) | 0.36 |
| Proportion of people moderate | 0.31 (0.28) | −0.03 (0.15) | −9.7% (48.4) | 0.85 | −0.04 (0.16) | −12.9% (51.6) | 0.82 | −0.03 (0.15) | −9.7% (48.4) | 0.84 |
| Proportion of people vigorous | 0.23 (0.28) | 0.06 (0.14) | 26.1% (60.9) | 0.86 | 0.06 (0.14) | 26.1% (60.9) | 0.85 | 0.04 (0.13) | 17.4% (56.5) | 0.88 |
| Proportion of people MVPA | 0.54 (0.28) | 0.02 (0.11) | 3.7% (20.4) | 0.93 | 0.03 (0.13) | 5.6% (24.1) | 0.90 | 0.01 (0.11) | 1.9% (20.4) | 0.93 |
| Average MET-minutes per person | 4.08 (1.92) | −0.19 (1.58) | −4.7% (38.7) | 0.59 | −0.34 (1.85) | −8.3% (45.3) | 0.43 | 0.04 (1.69) | 1.0% (41.4) | 0.56 |
Bias = predicted (camera) − criterion (accelerometer). %Bias = bias ÷ criterion (accelerometer). ICC = intraclass correlation coefficient. MET = metabolic equivalents of tasks. MVPA = moderate to vigorous physical activity. SD = standard deviation.
Figure 2Agreement plots for the total number of participants in the scene (N = 144 predictions from 48 min).
Figure 3Agreement plots for total MET-minutes and number of participants in each activity category (N = 144 predictions from 48 min).
Figure 4Agreement plots for average MET-minutes per person and proportion of participants in each activity category (N = 144 predictions from 48 min).
Agreement and associations between the video-based algorithm (test measure) and accelerometry (criterion measure) in the unaggregated second-level dataset (N = 2800 seconds per camera).
| Variables | Criterion | Camera 1 | Camera 2 | Camera 3 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mean (SD) | Bias (SD) | %Bias (SD) | ICC | Bias (SD) | %Bias (SD) | ICC | Bias (SD) | %Bias (SD) | ICC | |
| Activity volume variables | ||||||||||
| Number people in scene | 4.47 (2.16) | 0.18 (0.94) | 4.0% (21.0) | 0.91 | 0.13 (1.16) | 2.9% (26.0) | 0.87 | 0.21 (1.00) | 4.7% (22.4) | 0.9 |
| Number of people sedentary | 1.95 (1.97) | 0.01 (0.65) | 0.5% (33.3) | 0.95 | 0.04 (1.06) | 2.1% (54.4) | 0.86 | 0.13 (0.86) | 6.7% (44.1) | 0.91 |
| Number of people light | 0.20 (0.45) | −0.05 (0.55) | −25.0% (275.0) | 0.1 | −0.07 (0.54) | −35.0% (270.0) | 0.07 | −0.06 (0.54) | −30.0% (270.0) | 0.12 |
| Number of people moderate | 1.01 (1.11) | −0.11 (0.93) | −10.9% (92.1) | 0.64 | −0.12 (0.95) | −11.9% (94.1) | 0.63 | −0.07 (0.95) | −6.9% (94.1) | 0.63 |
| Number of people vigorous | 1.30 (1.84) | 0.33 (1.08) | 25.4% (83.1) | 0.85 | 0.28 (1.09) | 21.5% (83.8) | 0.84 | 0.22 (1.06) | 16.9% (81.5) | 0.86 |
| Number of people in MVPA | 2.32 (1.87) | 0.22 (1.03) | 9.5% (44.4) | 0.86 | 0.16 (1.04) | 6.9% (44.8) | 0.86 | 0.15 (0.98) | 6.5% (42.2) | 0.88 |
| Total MET-minutes in scene | 19.1 (15.1) | −3.1 (10.2) | −16.0% (53.4) | 0.67 | −4.2 (11.1) | −21.7% (58.1) | 0.56 | −2.88 (11.0) | −15.1% (57.7) | 0.6 |
| Activity distribution variables | ||||||||||
| Proportion of people sedentary | 0.40 (0.30) | −0.01 (0.14) | −2.5% (35.0) | 0.89 | −0.01 (0.20) | −2.5% (50.0) | 0.79 | 0 (0.16) | 0% (40.0) | 0.87 |
| Proportion of people light | 0.05 (0.13) | −0.01 (0.17) | −20.0% (340.0) | 0.11 | −0.02 (0.15) | −40.0% (300.0) | 0.13 | −0.01 (0.16) | −20.0% (320.0) | 0.13 |
| Proportion of people moderate | 0.29 (0.31) | −0.03 (0.25) | −10.3% (86.2) | 0.66 | −0.03 (0.27) | −10.3% (93.1) | 0.64 | −0.03 (0.26) | −10.3% (89.7) | 0.64 |
| Proportion of people vigorous | 0.26 (0.29) | 0.05 (0.21) | 19.2% (80.8) | 0.76 | 0.06 (0.22) | 23.1% (84.6) | 0.72 | 0.04 (0.22) | 15.4% (84.6) | 0.75 |
| Proportion of people MVPA | 0.54 (0.31) | 0.03 (0.19) | 5.6% (35.2) | 0.81 | 0.03 (0.22) | 5.6% (40.7) | 0.76 | 0.01 (0.20) | 1.9% (37.0) | 0.8 |
| Average MET-minutes per person | 4.20 (2.05) | −0.30 (2.12) | −7.1% (50.5) | 0.46 | −0.35 (2.52) | −8.3% (60.0) | 0.31 | −0.09 (2.31) | −2.1% (55.0) | 0.4 |
Bias = predicted (camera) − criterion (accelerometer). %Bias = bias ÷ criterion (accelerometer). ICC = intraclass correlation coefficient. METs = metabolic equivalents of tasks. MVPA = moderate to vigorous physical activity. SD = standard deviation.