Literature DB >> 33534801

Applying machine learning and geolocation techniques to social media data (Twitter) to develop a resource for urban planning.

Sveta Milusheva1, Robert Marty1, Guadalupe Bedoya1, Sarah Williams2, Elizabeth Resor3, Arianna Legovini1.   

Abstract

With all the recent attention focused on big data, it is easy to overlook that basic vital statistics remain difficult to obtain in most of the world. What makes this frustrating is that private companies hold potentially useful data, but it is not accessible by the people who can use it to track poverty, reduce disease, or build urban infrastructure. This project set out to test whether we can transform an openly available dataset (Twitter) into a resource for urban planning and development. We test our hypothesis by creating road traffic crash location data, which is scarce in most resource-poor environments but essential for addressing the number one cause of mortality for children over five and young adults. The research project scraped 874,588 traffic related tweets in Nairobi, Kenya, applied a machine learning model to capture the occurrence of a crash, and developed an improved geoparsing algorithm to identify its location. We geolocate 32,991 crash reports in Twitter for 2012-2020 and cluster them into 22,872 unique crashes during this period. For a subset of crashes reported on Twitter, a motorcycle delivery service was dispatched in real-time to verify the crash and its location; the results show 92% accuracy. To our knowledge this is the first geolocated dataset of crashes for the city and allowed us to produce the first crash map for Nairobi. Using a spatial clustering algorithm, we are able to locate portions of the road network (<1%) where 50% of the crashes identified occurred. Even with limitations in the representativeness of the data, the results can provide urban planners with useful information that can be used to target road safety improvements where resources are limited. The work shows how twitter data might be used to create other types of essential data for urban planning in resource poor environments.

Entities:  

Year:  2021        PMID: 33534801      PMCID: PMC7857609          DOI: 10.1371/journal.pone.0244317

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  12 in total

1.  Sociology. Social scientists wade into the tweet stream.

Authors:  Greg Miller
Journal:  Science       Date:  2011-09-30       Impact factor: 47.728

2.  Predicting poverty and wealth from mobile phone metadata.

Authors:  Joshua Blumenstock; Gabriel Cadamuro; Robert On
Journal:  Science       Date:  2015-11-27       Impact factor: 47.728

Review 3.  Economics in the age of big data.

Authors:  Liran Einav; Jonathan Levin
Journal:  Science       Date:  2014-11-07       Impact factor: 47.728

4.  Rule extraction from an optimized neural network for traffic crash frequency modeling.

Authors:  Qiang Zeng; Helai Huang; Xin Pei; S C Wong; Mingyun Gao
Journal:  Accid Anal Prev       Date:  2016-08-31

5.  Private traits and attributes are predictable from digital records of human behavior.

Authors:  Michal Kosinski; David Stillwell; Thore Graepel
Journal:  Proc Natl Acad Sci U S A       Date:  2013-03-11       Impact factor: 11.205

6.  Urban mobility and neighborhood isolation in America's 50 largest cities.

Authors:  Qi Wang; Nolan Edward Phillips; Mario L Small; Robert J Sampson
Journal:  Proc Natl Acad Sci U S A       Date:  2018-07-09       Impact factor: 11.205

7.  A comparative study on machine learning based algorithms for prediction of motorcycle crash severity.

Authors:  Lukuman Wahab; Haobin Jiang
Journal:  PLoS One       Date:  2019-04-04       Impact factor: 3.240

8.  Estimating geographic subjective well-being from Twitter: A comparison of dictionary and data-driven language methods.

Authors:  Kokil Jaidka; Salvatore Giorgi; H Andrew Schwartz; Margaret L Kern; Lyle H Ungar; Johannes C Eichstaedt
Journal:  Proc Natl Acad Sci U S A       Date:  2020-04-27       Impact factor: 11.205

9.  A global database of historic and real-time flood events based on social media.

Authors:  Jens A de Bruijn; Hans de Moel; Brenden Jongman; Marleen C de Ruiter; Jurjen Wagemaker; Jeroen C J H Aerts
Journal:  Sci Data       Date:  2019-12-09       Impact factor: 6.444

10.  Spatial crime distribution and prediction for sporting events using social media.

Authors:  Alina Ristea; Mohammad Al Boni; Bernd Resch; Matthew S Gerber; Michael Leitner
Journal:  Int J Geogr Inf Sci       Date:  2020-02-06       Impact factor: 4.186

View more
  7 in total

1.  Using social media in Kenya to quantify road safety: an analysis of novel data.

Authors:  J Austin Lee; Lyndsey Armes; Benjamin W Wachira
Journal:  Int J Emerg Med       Date:  2022-06-28

2.  The Plebeian Algorithm: A Democratic Approach to Censorship and Moderation.

Authors:  Benjamin Fedoruk; Harrison Nelson; Russell Frost; Kai Fucile Ladouceur
Journal:  JMIR Form Res       Date:  2021-12-21

3.  A data fusion approach to the estimation of temporary populations: An application to Australia.

Authors:  Elin Charles-Edwards; Jonathan Corcoran; Julia Loginova; Radoslaw Panczak; Gentry White; Alexander Whitehead
Journal:  PLoS One       Date:  2021-11-11       Impact factor: 3.240

4.  What Does Twitter Say About Self-Regulated Learning? Mapping Tweets From 2011 to 2021.

Authors:  Mohammad Khalil; Gleb Belokrys
Journal:  Front Psychol       Date:  2022-02-24

5.  Cities as complex systems-Collection overview.

Authors:  Diego Rybski; Marta C González
Journal:  PLoS One       Date:  2022-02-25       Impact factor: 3.240

6.  The Moderating Effects of Entertainers on Public Engagement through Government Activities in Social Media During the COVID-19.

Authors:  Xuefan Dong; Ying Lian
Journal:  Telemat Inform       Date:  2021-11-19

7.  Geolocation of multiple sociolinguistic markers in Buenos Aires.

Authors:  Olga Kellert; Nicholas H Matlis
Journal:  PLoS One       Date:  2022-09-09       Impact factor: 3.752

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.