| Literature DB >> 34075301 |
Claudio Badii1, Angelo Difino1, Paolo Nesi1, Irene Paoli1, Michela Paolucci1.
Abstract
The modern mobile phones and the complete digitalization of the public and private transport networks have allowed to access useful information to understand the user's mean of transportation. This enables a plethora of old and new applications in the fields of sustainable mobility, smart transportation, assistance, and e-health. The precise understanding of the travel means is at the basis of the development of a large range of applications. In this paper, a number of metrics has been identified to understand whether an individual on the move is stationary, walking, on a motorized private or public transport, with the aim of delivering to city users personalized assistance messages for: sustainable mobility, health, and/or for a better and enjoyable life, etc. Differently from the state-of-the-art solutions, the proposed approach has been designed to provide results, and thus collect metrics, in real operating conditions (imposed on the mobile phones as: a range of different mobile phone kinds, operating system constraints managing Applications, active battery consumption manager, etc.). The paper reports the whole experimentations and results. The solution has been developed in the context of Sii-Mobility Km4City Research Project infrastructure and tools, performed with the collaboration of public transport operators, and GDPR compliant. The same solution has been used in Snap4City mobile Apps with experiments performed in Antwerp and Helsinki.Entities:
Keywords: Classification model; Machine learning; Mobile phones; Smart city; Transportation modes; User behaviour analysis
Year: 2021 PMID: 34075301 PMCID: PMC8153851 DOI: 10.1007/s11042-021-10993-y
Source DB: PubMed Journal: Multimed Tools Appl ISSN: 1380-7501 Impact factor: 2.757
Related Work implementation overview. (*) the value reported refers to Precision %
| Authors | Classes | Data exploited | Sampling | #users | #features | # of mobile phone types | Accuracy % |
|---|---|---|---|---|---|---|---|
| Wang et al. [ | Stationary, Walk, Bike, Bus, Car, Metro | Accel | 0.03 s (accel) | 7 | 23 | 1 | 70 |
| Manzoni et al. [ | Stationary, Walk, Bike, Motorcycle, Car, Bus, Metro, Train | Gps Accel | 1 s (gps) 0.04 s (accel) | 4 | 1 | 1 | 82.1 |
| Reddy et al. [ | Stationary, Walk, Run, Bike, Vehicle | Gps Accel | 1 s (gps) 0.03 s (accel) | 16 | 4 | 1 | 93.6 |
| Stenneth et [ | Stationary, Walk, Bike, Car, Bus, Train | Gps Gis | 14 s (gps) | 6 | 7 | 3 | 93.4 |
| Hemminki [ | Stationary, Walk, Car, Bus, Train, Metro, Tram | Accel | 0.01 s (accel) | 16 | 27 + 5 | 3 | 84.9 (*) |
| Prelipcean [ | Walk, Bike, Car, Bus, Metro, Train, Ferry | Gps Accel | 50 m (gps) 0.2 s (accel) | 9 | 11 | – | 90.8 (*) |
| Yu et al. [ | Stationary, Walk, Run, Bike, Vehicle (Motorcycle, Car, Bus, Metro, Rail, Train) | Accel | 0.03 s (accel) | 224 | 22 + 8 | 1 | 90.6 |
| Yanyun et al. [ | Train, Metro, Bus, Car | Accel | 0.01 s (accel) | 30 | 169 | 1 | 98,6 |
| Ashqar et al. [ | Car, Bus, Bike, Run, Walk | Gps Accel | 0.04 s (gps)0.01 s (accel | 10 | 80 | 2 | 97,0 |
Fig. 1System architecture
Fig. 2Data recovery service timing
Fig. 3User Engagement scenario
Fig. 4Comparison of One Step models: a classic and b Super Learner
Overview of Sensor Data Package feature measured at a given time from the mobile or computed on server-side
| Categories | Metrics | Description of metric variable | Where |
|---|---|---|---|
| Day/Time Baseline and GPS | Day and Time | Day and Time of the sample package | M |
| Non-Working day | 1 if weekend or vacation, 0 if it is a working day | M/S | |
| Time Slot | Slot of the day (morning, afternoon, evening, night) | S | |
| GPS latitude and longitude | Position of the mobile phone in GPS coordinates | M | |
| Accuracy | GPS Sensor’s Accuracy from the mobile phone | M | |
| Location Measure kind | Types of Location measure: GPS, Network, Mixed/Fused | M | |
| Speed | Speed as provided by the GPS driver of the mobile (as m/s) | M/S | |
| Average Speed | Average speed of the measures collected in the last two minutes | M/S | |
| Phone Year | Year/age of the terminal | M | |
| BDS | Availability of a BDS compliant GPS Sensor | M | |
| User Type | User Type: commuter, citizen, students, tourist, etc. | M/S | |
| Accelerometer | Average linear magnitude of acceleration | Average of the acceleration magnitude calculate on five measurements | M |
| Linear acceleration of X-axis | Acceleration of the mobile phone along the X-axis, purged by Earth gravity | M | |
| Linear acceleration of Y-axis | Acceleration of the terminal along the Y-axis, purged by Earth gravity | M | |
| Linear acceleration of Z-axis | Acceleration of the terminal along the Z-axis, purged by Earth gravity | M | |
| Proximity | Rail Line | Bool indicating if the mobile phone is in proximity of a rail line | S |
| Sport Facilities | Bool indicating if the mobile phone is in proximity of a sport facilities | S | |
| Tourist Trail | Bool indicating if the mobile phone is in proximity of a tourist trail | S | |
| Green Areas | Bool indicating if the mobile phone is in proximity of a green areas | S | |
| Bus/Light-rail Line | Bool indicating if the mobile phone is in proximity of a bus line or a light-trail line | S | |
| Cycle Paths | Bool indicating if the mobile phone is in proximity of a cycle path | S | |
| Temporal window | Previous speed | Speed of the mobile phone of the previous 12 min | S |
| Previous average speed | Average speed on the measures collected in a 12 min time slot | S | |
| Previous median speed | Median speed on the measures collected in a 12 min time slot | S | |
| Speed distance | Speed (m/s) calculated on the distance between two consecutive coordinates and the time passed between the observations | S |
Fig. 5Speed and average speed
Fig. 6Distribution of Sampling Period/rate in s
Classification Models Comparison on four classes of transport mode: stationary, non-motorized, private transport, public transport. Best values are market in bold
| Classifier Models | Accuracy % | Precision % | Recall % | F1 score |
|---|---|---|---|---|
| Extreme Gradient Boosting | 94.7 | 77.3 | 82.8 | 0.800 |
| Random Forest | 94.2 | 77.4 | 86.9 | 0.819 |
| Extra-Trees |
Best values are market in bold
Extra-Trees Prediction Model: Statistic by class
| Extra Trees Model | Stay | Walk | Private Transport | Public Transport |
|---|---|---|---|---|
| Recall (Sensitivity) | 97.8 | 73.1 | 86.9 | 91.7 |
| Specificity | 90.1 | 98.8 | 98.7 | 99.6 |
| Precision | 97.7 | 77.0 | 82.7 | 93.6 |
| Neg. Pred. Value | 90.4 | 98.5 | 99.0 | 99.4 |
| Balanced Accuracy | 94.0 | 85.9 | 92.8 | 95.6 |
Super Learner results on each binary Classification Model: Couples A to D
| Method | RCV risk | |
|---|---|---|
| (A) Stationary vs Walking, Private Transport, Public Transport. | ||
| Extra-Trees | 0.0282 | 0.5391 |
| RF | 0.0287 | 0.0562 |
| XGBoost | 0.0300 | 0.4047 |
| (B) Walking vs Stationary, Private Transport, Public Transport. | ||
| Extra-Trees | 0.0234 | 0.6277 |
| RF | 0.0259 | 0.0091 |
| XGBoost | 0.0252 | 0.3632 |
| (C) Public Transport vs Stationary, Walking, Private Transport. | ||
| Extra-Trees | 0.0213 | 0.6857 |
| RF | 0.0235 | 0.0000 |
| XGBoost | 0.0239 | 0.3143 |
| (D) Private Transport vs Stationary, Walking, Public Transport. | ||
| Extra-Trees | 0.0087 | 0.6296 |
| RF | 0.0108 | 0.0000 |
| XGBoost | 0.0096 | 0.3704 |
Binary Classification Models combination based on the highest probability estimate: Statistic by class
| Super Learner Model | Stay | Walk | Private Transport | Public Transport |
|---|---|---|---|---|
| Recall (Sensitivity) | 99.0 | 66.2 | 85.7 | 92.7 |
| Specificity | 89.2 | 99.3 | 99.0 | 99.6 |
| Precision | 97.5 | 83.1 | 86.5 | 95.3 |
| Neg. Pred. Value | 95.5 | 98.2 | 98.9 | 99.4 |
| Balanced Accuracy | 94.1 | 82.8 | 92.4 | 96.1 |
Extra Tree Model results on four classes of transport modality (stationary, non-motorized, private transport, public transport) considering four combinations of the different features
| Model features categories | Extra Tree Model results | |||
|---|---|---|---|---|
| Accuracy % | Precision % | Recall % | F1Score | |
| Baseline and GPS | 91.0 | 68.2 | 75.1 | 0.714 |
| Baseline and GPS + proximity | 92.4 | 73.9 | 69.1 | 0.715 |
| Baseline and GPS + proximity + Accelerometer | 92.6 | 81.4 | 74.4 | 0.777 |
| Baseline and GPS + proximity + Temporal window | 94.9 | 80.5 | 78.7 | 0.787 |
| Baseline and GPS + proximity + Accelerometer + Temporal window | ||||
Better ranked in the comparison are shown in bold
Fig. 7Variables Importance across the classes of the Extra-Trees full model
Fig. 8Scheme of the hierarchical approach in training and execution
Two-steps hierarchical approach confusion matrix (Extra-Tree and Super Learner considering Baseline, GPS, proximity, Accelerometer and Temporal window features)
| Two-Steps Hierarchical Approach | Predicted | ||||
|---|---|---|---|---|---|
| Stay | Walk | Private Transport | Public Transport | ||
| Actual | Stay | 0.98 | 0.30 | 0.09 | 0.03 |
| Walk | 0.01 | 0.60 | 0.02 | 0.01 | |
| Private Transport | 0.01 | 0.07 | 0.87 | 0.07 | |
| Public Transport | 0.00 | 0.03 | 0.01 | 0.89 | |
Fig. 9Trend of percentage of engagement messages viewed and accepted/executed by the users in the period, with respect to the total amount of personalized messages delivered