| Literature DB >> 32575822 |
Alessandro Crivellari1, Euro Beinat1.
Abstract
Neural machine translation is a prominent field in the computational linguistics domain. By leveraging the recent developments of deep learning, it gave birth to powerful algorithms for translating text from one language to another. This study aims to assess the feasibility of transferring the neural machine translation approach into a completely different context, namely human mobility and trajectory analysis. Building a conceptual parallelism between sentences (sequences of words) and motion traces (sequences of locations), we aspire to translate individual trajectories generated by a certain category of users into the corresponding mobility traces potentially generated by a different category of users. The experiment is inserted in the background of tourist mobility analysis, with the goal of translating the motion behavior of tourists belonging to a specific nationality into the motion behavior of tourists belonging to a different nationality. The model adopted is based on the seq2seq approach and consists of an encoder-decoder architecture based on long short-term memory (LSTM) neural networks and neural embeddings. The encoder turns an input location sequence into a corresponding hidden vector; the decoder reverses the process, turning the vector into an output location sequence. The proposed framework, tested on a real-world large-scale dataset, explores an effective attempt of motion transformation between different entities, arising as a potentially powerful source of mobility information disclosure, especially in the context of crowd management and smart city services.Entities:
Keywords: LSTM; encoder–decoder; motion behavior; neural networks; seq2seq; smart tourism; trajectories
Mesh:
Year: 2020 PMID: 32575822 PMCID: PMC7348925 DOI: 10.3390/s20123503
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Visual representation of the trajectory “translation” task. Given a starting location and an input trace, a new output trajectory is generated from the same starting point.
Figure 2Visual exemplification of the encoder–decoder neural network model for trajectory translation.
Figure 3Visual exemplification of the decoder at inference time.
Overall performance comparison on the Dutch-to-Korean dataset between our methodology (ENC-DEC) and the baseline approaches, namely copying model (CP), highest similarity model (HS), most traveled model (MT), and hidden Markov model (HMM).
| Model | Accuracy | Accuracy_±1h | Accuracy_Time-Ind |
|---|---|---|---|
| CP | 0.2758 | 0.3668 | 0.4538 |
| HS | 0.3470 | 0.4291 | 0.5125 |
| MT | 0.3525 | 0.4173 | 0.5125 |
| HMM | 0.3449 | 0.3893 | 0.4451 |
| ENC-DEC | 0.6710 | 0.8000 | 0.8232 |
Overall performance comparison on the Dutch-to-German dataset between our methodology (ENC-DEC) and the baseline approaches, namely copying model (CP), highest similarity model (HS), most traveled model (MT), and hidden Markov model (HMM).
| Model | Accuracy | Accuracy_±1h | Accuracy_Time-Ind |
|---|---|---|---|
| CP | 0.2050 | 0.2928 | 0.3807 |
| HS | 0.2450 | 0.3198 | 0.3993 |
| MT | 0.2499 | 0.2921 | 0.3463 |
| HMM | 0.2726 | 0.3101 | 0.3725 |
| ENC-DEC | 0.5774 | 0.7756 | 0.8138 |
Comparison of accuracy, accuracy_±1h (in round brackets), and accuracy_time-ind (in square brackets) for different numbers of location changes, on the Dutch-to-Korean dataset.
| Model | 1 Change | 2 Changes | 3 Changes | 4 Changes | 5 Changes | 6 Changes |
|---|---|---|---|---|---|---|
| CP | 0.4823 | 0.3531 | 0.2145 | 0.1455 | 0.0807 | 0.0366 |
| (0.6025) | (0.4749) | (0.2980) | (0.2047) | (0.1149) | (0.0638) | |
| [0.7151] | [0.5923] | [0.3805] | [0.2679] | [0.1550] | [0.0883] | |
| HS | 0.5913 | 0.4107 | 0.2947 | 0.2018 | 0.1270 | 0.0674 |
| (0.6798) | (0.5246) | (0.3733) | (0.2662) | (0.1715) | (0.0990) | |
| [0.7735] | [0.6395] | [0.4506] | [0.3329] | [0.2148] | [0.1317] | |
| MT | 0.6117 | 0.3942 | 0.3208 | 0.2091 | 0.1161 | 0.0567 |
| (0.6929) | (0.4836) | (0.3757) | (0.2586) | (0.1474) | (0.0763) | |
| [0.7796] | [0.6508] | [0.4531] | [0.3275] | [0.1856] | [0.1074] | |
| HMM | 0.5652 | 0.4545 | 0.2827 | 0.1927 | 0.1064 | 0.0300 |
| (0.6248) | (0.5009) | (0.3303) | (0.2289) | (0.1279) | (0.0564) | |
| [0.6893] | [0.5787] | [0.3830] | [0.2708] | [0.1544] | [0.0704] | |
| ENC-DEC | 0.8337 | 0.7037 | 0.6192 | 0.5798 | 0.5570 | 0.5358 |
| (0.9457) | (0.8489) | (0.7661) | (0.7109) | (0.6581) | (0.6145) | |
| [0.9610] | [0.8742] | [0.7924] | [0.7368] | [0.6842] | [0.6313] |
Comparison of accuracy, accuracy_±1h (in round brackets), and accuracy_time-ind (in square brackets) for different numbers of location changes, on the Dutch-to-German dataset.
| Model | 1 Change | 2 Changes | 3 Changes | 4 Changes | 5 Changes | 6 Changes |
|---|---|---|---|---|---|---|
| CP | 0.3551 | 0.2898 | 0.1780 | 0.1289 | 0.0772 | 0.0344 |
| (0.4717) | (0.4079) | (0.2661) | (0.1987) | (0.1236) | (0.0622) | |
| [0.5817] | [0.5209] | [0.3549] | [0.2757] | [0.1789] | [0.0901] | |
| HS | 0.4376 | 0.3218 | 0.2183 | 0.1576 | 0.1010 | 0.0529 |
| (0.5241) | (0.4263) | (0.2933) | (0.2203) | (0.1454) | (0.0776) | |
| [0.6102] | [0.5372] | [0.3729] | [0.2915] | [0.1959] | [0.1034] | |
| MT | 0.4643 | 0.3149 | 0.2351 | 0.1574 | 0.0942 | 0.0417 |
| (0.5189) | (0.3848) | (0.2683) | (0.1877) | (0.1140) | (0.0528) | |
| [0.5739] | [0.4773] | [0.3175] | [0.2286] | [0.1420] | [0.0672] | |
| HMM | 0.4441 | 0.3958 | 0.2456 | 0.1797 | 0.1025 | 0.0325 |
| (0.4985) | (0.4407) | (0.2855) | (0.2090) | (0.1211) | (0.0479) | |
| [0.5641] | [0.5347] | [0.3464] | [0.2620] | [0.1577] | [0.0659] | |
| ENC-DEC | 0.8014 | 0.6746 | 0.5492 | 0.4696 | 0.4022 | 0.3466 |
| (0.9307) | (0.8579) | (0.7815) | (0.7150) | (0.6320) | (0.4932) | |
| [0.9508] | [0.8903] | [0.8256] | [0.7655] | [0.6830] | [0.5261] |
Comparison of accuracy, accuracy_±1h (in round brackets), and accuracy_time-ind (in square brackets) for different values of radius of gyration, on the Dutch-to-Korean dataset.
| Model | ≤3 km | 3–10 km | 10–32 km | ≥32 km |
|---|---|---|---|---|
| CP | 0.4549 | 0.2151 | 0.1712 | 0.1160 |
| (0.6021) | (0.2972) | (0.2274) | (0.1517) | |
| [0.7420] | [0.3871] | [0.2837] | [0.1775] | |
| HS | 0.5167 | 0.2953 | 0.2589 | 0.1845 |
| (0.6355) | (0.3789) | (0.3237) | (0.2220) | |
| [0.7685] | [0.4642] | [0.3803] | [0.2477] | |
| MT | 0.5074 | 0.3145 | 0.2802 | 0.1930 |
| (0.6102) | (0.3769) | (0.3296) | (0.2125) | |
| [0.7846] | [0.4519] | [0.3728] | [0.2364] | |
| HMM | 0.5339 | 0.2940 | 0.2419 | 0.1627 |
| (0.5858) | (0.3359) | (0.2846) | (0.1989) | |
| [0.6682] | [0.3932] | [0.3269] | [0.2224] | |
| ENC-DEC | 0.7245 | 0.6310 | 0.6591 | 0.6250 |
| (0.8850) | (0.7946) | (0.7749) | (0.6927) | |
| [0.9180] | [0.8274] | [0.7928] | [0.6982] |
Comparison of accuracy, accuracy_±1h (in round brackets), and accuracy_time-ind (in square brackets) for different values of radius of gyration, on the Dutch-to-German dataset.
| Model | ≤3 km | 3–10 km | 10–32 km | ≥32 km |
|---|---|---|---|---|
| CP | 0.3783 | 0.1673 | 0.1200 | 0.0614 |
| (0.5263) | (0.2575) | (0.1660) | (0.0837) | |
| [0.6675] | [0.3575] | [0.2133] | [0.0983] | |
| HS | 0.4222 | 0.2010 | 0.1666 | 0.0980 |
| (0.5413) | (0.2798) | (0.2110) | (0.1195) | |
| [0.6742] | [0.3660] | [0.2525] | [0.1330] | |
| MT | 0.4209 | 0.2123 | 0.1799 | 0.0904 |
| (0.4970) | (0.2500) | (0.2038) | (0.1009) | |
| [0.6035] | [0.2952] | [0.2293] | [0.1112] | |
| HMM | 0.4609 | 0.2463 | 0.1781 | 0.0897 |
| (0.5096) | (0.2823) | (0.2118) | (0.1136) | |
| [0.6039] | [0.3543] | [0.2502] | [0.1270] | |
| ENC-DEC | 0.6714 | 0.5464 | 0.5588 | 0.4846 |
| (0.8819) | (0.7863) | (0.7483) | (0.5863) | |
| [0.9231] | [0.8384] | [0.7792] | [0.5999] |
Figure 4Accuracy comparison with respect to the hour of the day, on the Dutch-to-Korean dataset (a) and the Dutch-to-German dataset (b).