Literature DB >> 24793431

Inferring the Origin Locations of Tweets with Quantitative Confidence.

Reid Priedhorsky1, Aron Culotta2, Sara Y Del Valle.   

Abstract

Social Internet content plays an increasingly critical role in many domains, including public health, disaster management, and politics. However, its utility is limited by missing geographic information; for example, fewer than 1.6% of Twitter messages (tweets) contain a geotag. We propose a scalable, content-based approach to estimate the location of tweets using a novel yet simple variant of gaussian mixture models. Further, because real-world applications depend on quantified uncertainty for such estimates, we propose novel metrics of accuracy, precision, and calibration, and we evaluate our approach accordingly. Experiments on 13 million global, comprehensively multi-lingual tweets show that our approach yields reliable, well-calibrated results competitive with previous computationally intensive methods. We also show that a relatively small number of training data are required for good estimates (roughly 30,000 tweets) and models are quite time-invariant (effective on tweets many weeks newer than the training set). Finally, we show that toponyms and languages with small geographic footprint provide the most useful location signals.

Entities:  

Year:  2014        PMID: 24793431      PMCID: PMC4008124          DOI: 10.1145/2531602.2531607

Source DB:  PubMed          Journal:  CSCW Conf Comput Support Coop Work


  1 in total

1.  Understanding individual human mobility patterns.

Authors:  Marta C González; César A Hidalgo; Albert-László Barabási
Journal:  Nature       Date:  2008-06-05       Impact factor: 49.962

  1 in total
  4 in total

1.  Forecasting the 2013-2014 influenza season using Wikipedia.

Authors:  Kyle S Hickmann; Geoffrey Fairchild; Reid Priedhorsky; Nicholas Generous; James M Hyman; Alina Deshpande; Sara Y Del Valle
Journal:  PLoS Comput Biol       Date:  2015-05-14       Impact factor: 4.475

2.  Network structure and community evolution on Twitter: human behavior change in response to the 2011 Japanese earthquake and tsunami.

Authors:  Xin Lu; Christa Brelsford
Journal:  Sci Rep       Date:  2014-10-27       Impact factor: 4.379

3.  Measuring spatio-textual affinities in twitter between two urban metropolises.

Authors:  Minda Hu; Mayank Kejriwal
Journal:  J Comput Soc Sci       Date:  2021-06-02

4.  Word embeddings and deep learning for location prediction: tracking Coronavirus from British and American tweets.

Authors:  Sarra Hasni; Sami Faiz
Journal:  Soc Netw Anal Min       Date:  2021-07-27
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.