| Literature DB >> 33265825 |
Alicia Rodriguez-Carrion1, Carlos Garcia-Rubio1, Celeste Campo1.
Abstract
Correctly estimating the features characterizing human mobility from mobile phone traces is a key factor to improve the performance of mobile networks, as well as for mobility model design and urban planning. Most related works found their conclusions on location data based on the cells where each user sends or receives calls or messages, data known as Call Detail Records (CDRs). In this work, we test if such data sets provide enough detail on users' movements so as to accurately estimate some of the most studied mobility features. We perform the analysis using two different data sets, comparing CDRs with respect to an alternative data collection approach. Furthermore, we propose three filtering techniques to reduce the biases detected in the fraction of visits per cell, entropy and entropy rate distributions, and predictability. The analysis highlights the need for contextualizing mobility results with respect to the data used, since the conclusions are biased by the mobile phone traces collection approach.Entities:
Keywords: cell-based location; human mobility; mobility data sets entropy; mobility data sets predictability; ping-pong effect
Year: 2018 PMID: 33265825 PMCID: PMC7512299 DOI: 10.3390/e20100736
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Data collection approaches. Mobility data collection approaches in the cellular telephony network.
Data sets. General characteristics of the data sets used for the experiments.
| Data Set | MIT | UC3M |
|---|---|---|
|
| 2004/09 | 2013/01 |
|
| 95 | 25 |
|
| 14,487 | 5716 |
Figure 2Comparison of the distributions. Comparison of (a,b) the number of cell changes/day for the MIT and UC3M data sets respectively; (c,d) the number of different cells visited/day for the MIT and UC3M data sets respectively; and (e,f) the fraction of visits concentrated in each percentage of cells, averaged through the population for the MIT and UC3M data sets respectively, all of them for each data collection approach and filtering technique.
Figure 3Comparison of the distributions. Comparison of (a,b) the fraction of visits concentrated in each percentage of cells, averaged through the population for the MIT and UC3M data sets respectively; (c,d) the entropy and entropy rate distributions for the MIT and UC3M data sets respectively; and (e,f) the predictability distribution for the MIT and UC3M data sets respectively, all of them for each data collection approach and filtering technique.