| Literature DB >> 31008012 |
Krittika D'Silva1, Anastasios Noulas2, Mirco Musolesi3,4, Cecilia Mascolo1,4, Max Sklar5.
Abstract
Estimating revenue and business demand of a newly opened venue is paramount as these early stages often involve critical decisions such as first rounds of staffing and resource allocation. Traditionally, this estimation has been performed through coarse-grained measures such as observing numbers in local venues or venues at similar places (e.g., coffee shops around another station in the same city). The advent of crowdsourced data from devices and services carried by individuals on a daily basis has opened up the possibility of performing better predictions of temporal visitation patterns for locations and venues. In this paper, using mobility data from Foursquare, a location-centric platform, we treat venue categories as proxies for urban activities and analyze how they become popular over time. The main contribution of this work is a prediction framework able to use characteristic temporal signatures of places together with k-nearest neighbor metrics capturing similarities among urban regions, to forecast weekly popularity dynamics of a new venue establishment in a city neighborhood. We further show how we are able to forecast the popularity of the new venue after one month following its opening by using locality and temporal similarity as features. For the evaluation of our approach we focus on London. We show that temporally similar areas of the city can be successfully used as inputs of predictions of the visit patterns of new venues, with an improvement of 41% compared to a random selection of wards as a training set for the prediction task. We apply these concepts of temporally similar areas and locality to the real-time predictions related to new venues and show that these features can effectively be used to predict the future trends of a venue. Our findings have the potential to impact the design of location-based technologies and decisions made by new business owners.Entities:
Keywords: Human mobility prediction; Spatio-temporal patterns; Urban computing; Urban traffic
Year: 2018 PMID: 31008012 PMCID: PMC6448359 DOI: 10.1140/epjds/s13688-018-0142-z
Source DB: PubMed Journal: EPJ Data Sci ISSN: 2193-1127 Impact factor: 3.184
Figure 1Venue category popularity. The most popular category in each ward at 17:00. Nightlife Spots are represented in red, Travel & Transport in green, Food in blue, and Outdoors & Recreation in grey
Figure 2Temporal profiles. Normalized temporal profile of different categories of venues
Figure 3Ward temporal profile. Daily temporal profiles and category breakdown of St. Pancras & Somers Town and Camden Town with Primrose Hill, two contrasting wards in London
Figure 4Temporal similarity of wards. J-S divergence of the characteristic weekly temporal profile of the 15 most popular wards. Smaller values signify a smaller divergence and thus more similarity
Figure 5Set of new venues. Coordinates of the set of 305 new venues in London considered in the study
Figure 6Stable temporal profiles. Top panel: the normalized stable temporal profile of the new venue with the profile of similar wards. Bottom panel: the output of the GP trained on the similar ward profiles; this serves as a prediction of the profile of the new venue
Figure 7Profile similarity of different wards. The NRMSE between the stable temporal profile of the new venue and the temporal profile of five similar wards. “Output of GP” is the NRMSE between the output of the trained GP model and the temporal profile of the new venue
Comparative analysis of different similarity criteria
| Criteria | Description of Criteria | NRMSE |
|---|---|---|
| TempGen | Temporally similar wards, same general category | 1.614 |
| TempSpec |
|
|
| Random | Random wards | 2.692 |
| SameAll | Same ward, all categories | 2.1941 |
| SameGen | Same ward, same general category | 1.884 |
| SameSpec | Same ward, same specific category | 1.760 |
| AllAll | All wards, all categories | 1.937 |
| AllGen | All wards, same general category | 2.190 |
| AllSpec | All wards, same specific category | 2.028 |
AUC values of the real-time prediction with a varying number of months of training data
| 2 | 3 | 4 | 5 | 6 | |
|---|---|---|---|---|---|
| History | 0.6748 | 0.6697 | 0.6853 | 0.7286 | 0.7278 |
| TempGen | 0.7507 | 0.7820 | 0.7691 | 0.7824 | 0.7903 |
| TempSpec | 0.7729 | 0.7804 | 0.7829 | 0.7991 | 0.8104 |
| Random | 0.5102 | 0.5185 | 0.5248 | 0.5682 | 0.5993 |
| SameAll | 0.7149 | 0.7310 | 0.7382 | 0.7349 | 0.7352 |
| SameGen | 0.7403 | 0.7481 | 0.7592 | 0.7480 | 0.7791 |
| SameSpec |
|
|
|
|
|
| AllAll | 0.6812 | 0.6892 | 0.6489 | 0.6893 | 0.6832 |
| AllGen | 0.6824 | 0.6853 | 0.6935 | 0.7088 | 0.7129 |
| AllSpec | 0.7201 | 0.7209 | 0.7403 | 0.7459 | 0.7402 |