| Literature DB >> 30241483 |
Duncan S Procter1,2, Angie S Page3,4, Ashley R Cooper3,4, Claire M Nightingale5, Bina Ram5, Alicja R Rudnicka5, Peter H Whincup5, Christelle Clary6, Daniel Lewis6, Steven Cummins6, Anne Ellaway7, Billie Giles-Corti8, Derek G Cook5, Christopher G Owen5.
Abstract
BACKGROUND: Increases in physical activity through active travel have the potential to have large beneficial effects on populations, through both better health outcomes and reduced motorized traffic. However accurately identifying travel mode in large datasets is problematic. Here we provide an open source tool to quantify time spent stationary and in four travel modes(walking, cycling, train, motorised vehicle) from accelerometer measured physical activity data, combined with GPS and GIS data.Entities:
Keywords: Accelerometer; Active travel; GPS; Gradient boosting; Machine learning; Physical activity; Travel mode; Xgboost
Mesh:
Year: 2018 PMID: 30241483 PMCID: PMC6150970 DOI: 10.1186/s12966-018-0724-y
Source DB: PubMed Journal: Int J Behav Nutr Phys Act ISSN: 1479-5868 Impact factor: 6.457
The characteristics of the training data and how we classify travel modes
| Commute Mode | Number of participants | % of total | Training category |
|---|---|---|---|
| Walk | 66 | 20.2 | Walk |
| Cycle | 34 | 10.4 | Cycle |
| Car/Van driver | 48 | 14.7 | Vehicle |
| Car/Van passenger | 6 | 1.8 | Vehicle |
| Motorcycle/ moped/ scooter | 1 | 0.3 | Vehicle |
| Taxi | 2 | 0.6 | Vehicle |
| Bus/minibus/coach | 37 | 11.3 | Vehicle |
| Train (over ground) | 46 | 14.1 | Train |
| Underground | 86 | 26.4 | Train |
| Total | 326 | 100 | – |
Fig. 1a Workflow for creation of training data-set, b) decisions made to manually identify commutes
The confusion-matrix and accuracy scores per mode, expressed as percentages, for the cross-validated model
| Observed mode | Mode | Positive predictive valuea | Sensitivityb | F1 scorec | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cycle | Walk | Train | Vehicle | Stationary | ||||||
| Predicted mode | Cycle | 8062 | 33 | 5 | 328 | 20 | Cycle | 95.4 | 96.2 | 95.8 |
| Walk | 10 | 9171 | 21 | 32 | 357 | Walk | 95.6 | 92.3 | 93.9 | |
| Train | 3 | 18 | 12,296 | 95 | 18 | Train | 98.9 | 97.5 | 98.2 | |
| Vehicle | 260 | 6 | 139 | 20,911 | 159 | Vehicle | 97.4 | 96.9 | 97.1 | |
| Stationary | 47 | 710 | 155 | 214 | 45,317 | Stationary | 97.6 | 98.8 | 98.2 | |
aPositive predictive value (PPV) or Precision, the ratio of true positives to the sum of true and false positives
bSensitivity or Recall/ True positive rate/Detection rate, the ratio of true positive to true positives and false negatives
cF1 score, the harmonic mean of PPV and sensitivity
The confusion-matrix and accuracy scores per mode, expressed as percentages, compared with manually-identified data
| Observed mode | Mode | Positive predictive valuea | Sensitivityb | F1 scorec | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cycle | Walk | Train | Vehicle | Stationary | ||||||
| Predicted mode | Cycle | 3651 | 1426 | 116 | 215 | 385 | Cycle | 63.0 | 94.3 | 75.5 |
| Walk | 15 | 23,346 | 12 | 128 | 1858 | Walk | 92.1 | 75.6 | 83.0 | |
| Train | 43 | 749 | 6275 | 89 | 1318 | Train | 74.1 | 96.6 | 83.8 | |
| Vehicle | 156 | 585 | 38 | 23,684 | 3726 | Vehicle | 84.0 | 97.1 | 90.1 | |
| Stationary | 8 | 4776 | 58 | 285 | 329,807 | Stationary | 98.5 | 97.8 | 98.2 | |
aPositive predictive value (PPV) or Precision, the ratio of true positives to the sum of true and false positives
bSensitivity or Recall/ True positive rate/Detection rate, the ratio of true positive to true positives and false negatives
cF1 score, the harmonic mean of PPV and sensitivity
The confusion-matrix and accuracy scores per mode, expressed as percentages, compared with the STAMP-2 study
| Observed mode | Mode | Positive predictive valuea | Sensitivityb | F1 scorec | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cycle | Walk | Train | Vehicle | Stationary | ||||||
| Predicted mode | Cycle | 1381 | 290 | 0 | 109 | 182 | Cycle | 70.4 | 67.9 | 69.1 |
| Walk | 17 | 7729 | 0 | 1 | 202 | Walk | 97.2 | 73.6 | 83.8 | |
| Train | 1 | 39 | 716 | 0 | 320 | Train | 66.5 | 89.8 | 76.5 | |
| Vehicle | 602 | 329 | 0 | 14,242 | 2840 | Vehicle | 79.1 | 98.1 | 87.5 | |
| Stationary | 33 | 2112 | 81 | 171 | 179,539 | Stationary | 98.7 | 98.1 | 98.4 | |
aPositive predictive value (PPV) or Precision, the ratio of true positives to the sum of true and false positives
bSensitivity or Recall/ True positive rate/Detection rate, the ratio of true positive to true positives and false negatives
cF1 score, the harmonic mean of PPV and sensitivity
Time observed and predicted per participant in each travel mode for the different data-sets
| Travel mode | Mean minutes in travel mode per participant | |||||
|---|---|---|---|---|---|---|
| Cross validation | ENABLE full days | STAMP | ||||
| Reported | Predicted | Manually identified | Predicted | Manually identified | Predicted | |
| Cycle | 5.4 | 5.4 | 30.7 | 46.0 | 33.9 | 32.7 |
| Walk | 6.4 | 6.1 | 245.1 | 201.3 | 175.0 | 132.5 |
| Train | 8.1 | 8.1 | 51.6 | 67.3 | 13.3 | 17.9 |
| Vehicle | 13.9 | 13.8 | 193.7 | 223.7 | 242.1 | 300.2 |
| Stationary | 29.5 | 29.9 | 2675.4 | 2658.2 | 3051.4 | 3032.3 |