| Literature DB >> 26147500 |
Daniele Barchiesi1, Helen Susannah Moat2, Christian Alis1, Steven Bishop1, Tobias Preis2.
Abstract
Online social media platforms are opening up new opportunities to analyse human behaviour on an unprecedented scale. In some cases, the fast, cheap measurements of human behaviour gained from these platforms may offer an alternative to gathering such measurements using traditional, time consuming and expensive surveys. Here, we use geotagged photographs uploaded to the photo-sharing website Flickr to quantify international travel flows, by extracting the location of users and inferring trajectories to track their movement across time. We find that Flickr based estimates of the number of visitors to the United Kingdom significantly correlate with the official estimates released by the UK Office for National Statistics, for 28 countries for which official estimates are calculated. Our findings underline the potential for indicators of key aspects of human behaviour, such as mobility, to be generated from data attached to the vast volumes of photographs posted online.Entities:
Mesh:
Year: 2015 PMID: 26147500 PMCID: PMC4493158 DOI: 10.1371/journal.pone.0128470
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Relationship between official and Flickr based estimates of visitors to the UK.
We analyse geotagged photos taken and uploaded to Flickr between 2008 and 2013. We identify users based in 28 countries outside the UK, and determine the average number of Flickr users who visited the UK from each of these countries each year during this period. We find a significant correlation between the detected number of Flickr users visiting the UK and the official estimate of visitors to the UK calculated by the Office for National Statistics (r = 0.86, N = 28, p < 0.001, Pearson’s correlation test). The solid line depicts a least-squares fit, and the shaded area represents a 95% confidence interval.
Fig 2Comparison of estimates of the number of visitors to the UK using standard socio-economic indicators, and estimates using Flickr data.
We analyse data for all 28 countries of origin depicted in Fig 1, from 2008 to 2013. (A) Estimates of the average number of visitors per year to the UK, generated by a regression model using the detected number of Flickr users visiting the UK only. (B) Estimates generated using five socio-economic indicators, namely whether the country of origin has English as an official language, the population of the country, the GDP per capita of the country, the distance between the largest city in the country and London, and the stringency of UK visa requirements for citizens of the country. (C) Estimates generated by a combined model, using both Flickr data and the socio-economic indicators. We find that the combined Flickr and socio-economic model significantly outperforms both the Flickr model (F(5,21) = 4.75, p < 0.005) and the socio-economic model (F(1,21) = 14.57, p < 0.005). (D) In the combined model, regression coefficients for Flickr, language and distance are significantly different from 0. Error bars indicate 95% confidence intervals.